WorldWideScience

Sample records for c-11dasb test-retest reproducibility

  1. Static and Dynamic Handgrip Strength Endurance: Test-Retest Reproducibility.

    Science.gov (United States)

    Gerodimos, Vassilis; Karatrantou, Konstantina; Psychou, Dimitra; Vasilopoulou, Theodora; Zafeiridis, Andreas

    2017-03-01

    This study investigated the reliability of static and dynamic handgrip strength endurance using different protocols and indicators for the assessment of strength endurance. Forty young, healthy men and women (age, 18-22 years) performed 2 handgrip strength endurance protocols: a static protocol (sustained submaximal contraction at 50% of maximal voluntary contraction) and a dynamic one (8, 10, and 12 maximal repetitions). The participants executed each protocol twice to assess the test-retest reproducibility. Total work and total time were used as indicators of strength endurance in the static protocol; the strength recorded at each maximal repetition, the percentage change, and fatigue index were used as indicators of strength endurance in the dynamic protocol. The static protocol showed high reliability irrespective of sex and hand for total time and work. The 12-repetition dynamic protocol exhibited moderate-high reliability for repeated maximal repetitions and percentage change; the 8- and 10-repetition protocols demonstrated lower reliability irrespective of sex and hand. The fatigue index was not a reliable indicator for the assessment of dynamic handgrip endurance. Static handgrip endurance can be measured reliably using the total time and total work as indicators of strength endurance. For the evaluation of dynamic handgrip endurance, the 12-repetition protocol is recommended, using the repeated maximal repetitions and percentage change as indicators of strength endurance. Practitioners should consider the static (50% maximal voluntary contraction) and dynamic (12 repeated maximal repetitions) protocols as reliable for the assessment of handgrip strength endurance. The evaluation of static endurance in conjunction with dynamic endurance would provide more complete information about hand function. Copyright © 2017 American Society for Surgery of the Hand. Published by Elsevier Inc. All rights reserved.

  2. Test-retest reproducibility of accommodative facility measures in primary school children.

    Science.gov (United States)

    Adler, Paul; Scally, Andrew J; Barrett, Brendan T

    2018-05-08

    To determine the test-retest reproducibility of accommodative facility (AF) measures in an unselected sample of UK primary school children. Using ±2.00 DS flippers and a viewing distance of 40 cm, AF was measured in 136 children (range 4-12 years, average 8.1 ± 2.1) by five testers on three occasions (average interval between successive tests: eight days, range 1-21 days). On each occasion, AF was measured monocularly and binocularly, for two minutes. Full datasets were obtained in 111 children (81.6 per cent). Intra-individual variation in AF was large (standard deviation [SD] = 3.8 cycles per minute [cpm]) and there was variation due to the identity of the tester (SD = 1.6 cpm). On average, AF was greater: (i) in monocular compared to binocular testing (by 1.4 cpm, p cpm, p cpm lower than in children ≥ 10 years old, p = 0.009); and (iv) on subsequent testing occasions (for example, visit-2 AF was 2.0 cpm higher than visit-1 AF, p cpm monocularly and ≥ 8 cpm binocularly), but this rose to 83.8 per cent after the third test. Using less stringent pass criteria (≥ 6 cpm monocularly and ≥ 3 cpm binocularly), the equivalent figures were 82.9 and 96.4 per cent, respectively. Reduced AF did not co-exist with abnormal near point of accommodation or reduced visual acuity. The results reveal considerable intra-individual variability in raw AF measures in children. When the results are considered as pass/fail, children who initially exhibit normal AF continued to do so on repeat testing. Conversely, the vast majority of children with initially reduced AF exhibit normal performance on repeat testing. Using established pass/fail criteria, the prevalence of persistently reduced AF in this sample is 3.6 per cent. © 2018 Optometry Australia.

  3. The QUASAR reproducibility study, Part II: Results from a multi-center Arterial Spin Labeling test-retest study

    DEFF Research Database (Denmark)

    Petersen, Esben Thade; Mouridsen, Kim; Golay, Xavier

    2010-01-01

    Quantitative STAR labeling of Arterial Regions or QUASAR), a method providing user independent quantification of CBF in a large test-retest study across sites from around the world, dubbed "The QUASAR reproducibility study". Altogether, 28 sites located in Asia, Europe and North America participated...... and a total of 284 healthy volunteers were scanned. Minimal operator dependence was assured by using an automatic planning tool and its accuracy and potential usefulness in multi-center trials was evaluated as well. Accurate repositioning between sessions was achieved with the automatic planning tool showing...

  4. The QUASAR reproducibility study, Part II: Results from a multi-center Arterial Spin Labeling test-retest study

    DEFF Research Database (Denmark)

    Petersen, Esben; Mouridsen, Kim; Golay, Xavier

    2009-01-01

    Quantitative STAR labeling of Arterial Regions or QUASAR), a method providing user independent quantification of CBF in a large test-retest study across sites from around the world, dubbed "The QUASAR reproducibility study". Altogether, 28 sites located in Asia, Europe and North America participated...... and a total of 284 healthy volunteers were scanned. Minimal operator dependence was assured by using an automatic planning tool and its accuracy and potential usefulness in multi-center trials was evaluated as well. Accurate repositioning between sessions was achieved with the automatic planning tool showing...

  5. The QUASAR reproducibility study, Part II: Results from a multi center Arterial Spin Labeling test-retest Study

    Science.gov (United States)

    Petersen, Esben Thade; Mouridsen, Kim; Golay, Xavier

    2009-01-01

    Arterial Spin Labeling (ASL) is a method to measure perfusion using magnetically labeled blood water as an endogenous tracer. Being fully non-invasive, this technique is attractive for longitudinal studies of cerebral blood flow in healthy and diseased individuals, or as a surrogate marker of metabolism. So far, ASL has been restricted mostly to specialist centers due to a generally low SNR of the method and potential issues with user-dependent analysis needed to obtain quantitative measurement of cerebral blood flow (CBF). Here, we evaluated a particular implementation of ASL (called Quantitative STAR labeling of Arterial Regions or QUASAR), a method providing user independent quantification of CBF in a large test-retest study across sites from around the world, dubbed “The QUASAR reproducibility study”. Altogether, 28 sites located in Asia, Europe and North America participated and a total of 284 healthy volunteers were scanned. Minimal operator dependence was assured by using an automatic planning tool and its accuracy and potential usefulness in multi-center trials was evaluated as well. Accurate repositioning between sessions was achieved with the automatic planning tool showing mean displacements of 1.87±0.95mm and rotations of 1.56±0.66°. Mean gray matter CBF was 47.4±7.5 [ml/100g/min] with a between subject standard variation SDb = 5.5 [ml/100g/min] and a within subject standard deviation SDw = 4.7 [ml/100g/min]. The corresponding repeatability was 13.0 [ml/100g/min] and was found to be within the range of previous studies. PMID:19660557

  6. The QUASAR reproducibility study, Part II: Results from a multi-center Arterial Spin Labeling test-retest study.

    Science.gov (United States)

    Petersen, Esben Thade; Mouridsen, Kim; Golay, Xavier

    2010-01-01

    Arterial Spin Labeling (ASL) is a method to measure perfusion using magnetically labeled blood water as an endogenous tracer. Being fully non-invasive, this technique is attractive for longitudinal studies of cerebral blood flow in healthy and diseased individuals, or as a surrogate marker of metabolism. So far, ASL has been restricted mostly to specialist centers due to a generally low SNR of the method and potential issues with user-dependent analysis needed to obtain quantitative measurement of cerebral blood flow (CBF). Here, we evaluated a particular implementation of ASL (called Quantitative STAR labeling of Arterial Regions or QUASAR), a method providing user independent quantification of CBF in a large test-retest study across sites from around the world, dubbed "The QUASAR reproducibility study". Altogether, 28 sites located in Asia, Europe and North America participated and a total of 284 healthy volunteers were scanned. Minimal operator dependence was assured by using an automatic planning tool and its accuracy and potential usefulness in multi-center trials was evaluated as well. Accurate repositioning between sessions was achieved with the automatic planning tool showing mean displacements of 1.87+/-0.95 mm and rotations of 1.56+/-0.66 degrees . Mean gray matter CBF was 47.4+/-7.5 [ml/100 g/min] with a between-subject standard variation SD(b)=5.5 [ml/100 g/min] and a within-subject standard deviation SD(w)=4.7 [ml/100 g/min]. The corresponding repeatability was 13.0 [ml/100 g/min] and was found to be within the range of previous studies.

  7. Interobserver and test-retest reproducibility of T1ρ and T2 mesurements of lumber intervertebral discs by 3t magnetic resonance imaging

    Energy Technology Data Exchange (ETDEWEB)

    Yoo, Yeon Hwa; Yoon, Choon Sik; Eun, Na Lae; Kim, Sung Jin; Chung, Tae Sub [Dept. of Radiology, Gangnam Severance Hospital, Yonsei University College of Medicine, Seoul (Korea, Republic of); Hwang, Moon Jung [GE Health Care, Seoul (Korea, Republic of); Yoo, Hanna [Biostatistics Collaboration Lab, Yonsei University College of Medicine, Seoul (Korea, Republic of); Peter, Robert D. [GE Health Care, Milwaukee (United States); Lee, Young Han; Suh, Jin Suck [Dept. of Radiology, Severance Hospital, Yonsei University College of Medicine, Seoul (Korea, Republic of)

    2016-11-15

    To investigate the interobserver and test-retest reproducibility of T1ρ and T2 measurements of lumbar intervertebral discs using 3T magnetic resonance imaging (MRI). This study included a total of 51 volunteers (female, 26; male, 25; mean age, 54 ± 16.3 years) who underwent lumbar spine MRI with a 3.0 T scanner. Amongst these subjects, 40 underwent repeat T1ρ and T2 measurement acquisitions with identical image protocol. Two observers independently performed the region of interest measurements in the nuclei pulposi of the discs from L1-2 through L5-S1 levels. Statistical analysis was performed using intraclass correlation coefficient (ICC) with a two-way random model of absolute agreement. Comparison of the ICC values was done after acquisition of ICC values using Z test. Statistical significance was defined as p value < 0.05. The ICCs of interobserver reproducibility were 0.951 and 0.672 for T1ρ and T2 mapping, respectively. The ICCs of test-retest reproducibility (40 subjects) for T1ρ and T2 measurements were 0.922 and 0.617 for observer A and 0.914 and 0.628 for observer B, respectively. In the comparison of the aforementioned ICCs, ICCs of interobserver and test-retest reproducibility for T1ρ mapping were significantly higher than T2 mapping (p < 0.001). The interobserver and test-retest reproducibility of T1ρ mapping were significantly higher than those of T2 mapping for the quantitative assessment of nuclei pulposi of lumbar intervertebral discs.

  8. Test-Retest Reproducibility of the Microperimeter MP3 With Fundus Image Tracking in Healthy Subjects and Patients With Macular Disease.

    Science.gov (United States)

    Palkovits, Stefan; Hirnschall, Nino; Georgiev, Stefan; Leisser, Christoph; Findl, Oliver

    2018-02-01

    To evaluate the test-retest reproducibility of a novel microperimeter with fundus image tracking (MP3, Nidek Co, Japan) in healthy subjects and patients with macular disease. Ten healthy subjects and 20 patients suffering from range of macular diseases were included. After training measurements, two additional microperimetry measurements were scheduled. Test-retest reproducibility was assessed for mean retinal sensitivity, pointwise sensitivity, and deep scotoma size using the coefficient of repeatability and Bland-Altman diagrams. In addition, in a subgroup of patients microperimetry was compared with conventional perimetry. Average differences in mean retinal sensitivity between the two study measurements were 0.26 ± 1.7 dB (median 0 dB; interquartile range [IQR] -1 to 1) for the healthy and 0.36 ± 2.5 dB (median 0 dB; IQR -1 to 2) for the macular patient group. Coefficients of repeatability for mean retinal sensitivity and pointwise retinal sensitivity were 1.2 and 3.3 dB for the healthy subjects and 1.6 and 5.0 dB for the macular disease patients, respectively. Absolute agreement in deep scotoma size between both study days was found in 79.9% of the test loci. The microperimeter MP3 shows an adequate test-retest reproducibility for mean retinal sensitivity, pointwise retinal sensitivity, and deep scotoma size in healthy subjects and patients suffering from macular disease. Furthermore, reproducibility of microperimetry is higher than conventional perimetry. Reproducibility is an important measure for each diagnostic device. Especially in a clinical setting high reproducibility set the basis to achieve reliable results using the specific device. Therefore, assessment of the reproducibility is of eminent importance to interpret the findings of future studies.

  9. Minimum joint space width (mJSW) of patellofemoral joint on standing ''skyline'' radiographs: test-retest reproducibility and comparison with quantitative magnetic resonance imaging (qMRI)

    International Nuclear Information System (INIS)

    Simoni, Paolo; Jamali, Sanaa; Alvarez Miezentseva, Victoria; Albert, Adelin; Totterman, Saara; Schreyer, Edward; Tamez-Pena, Jose G.; Zobel, Bruno Beomonte; Gillet, Philippe

    2013-01-01

    To assess the intraobserver, interobserver, and test-retest reproducibility of minimum joint space width (mJSW) measurement of medial and lateral patellofemoral joints on standing ''skyline'' radiographs and to compare the mJSW of the patellofemoral joint to the mean cartilage thickness calculated by quantitative magnetic resonance imaging (qMRI). A couple of standing ''skyline'' radiographs of the patellofemoral joints and MRI of 55 knees of 28 volunteers (18 females, ten males, mean age, 48.5 ± 16.2 years) were obtained on the same day. The mJSW of the patellofemoral joint was manually measured and Kellgren and Lawrence grade (KLG) was independently assessed by two observers. The mJSW was compared to the mean cartilage thickness of patellofemoral joint calculated by qMRI. mJSW of the medial and lateral patellofemoral joint showed an excellent intraobserver agreement (interclass correlation (ICC) = 0.94 and 0.96), interobserver agreement (ICC = 0.90 and 0.95) and test-retest agreement (ICC = 0.92 and 0.96). The mJSW measured on radiographs was correlated to mean cartilage thickness calculated by qMRI (r = 0.71, p < 0.0001 for the medial PFJ and r = 0.81, p < 0.0001 for the lateral PFJ). However, there was a lack of concordance between radiographs and qMRI for extreme values of joint width and KLG. Radiographs yielded higher joint space measures than qMRI in knees with a normal joint space, while qMRI yielded higher joint space measures than radiographs in knees with joint space narrowing and higher KLG. Standing ''skyline'' radiographs are a reproducible tool for measuring the mJSW of the patellofemoral joint. The mJSW of the patellofemoral joint on radiographs are correlated with, but not concordant with, qMRI measurements. (orig.)

  10. {sup 11}C-PBR28 imaging in multiple sclerosis patients and healthy controls: test-retest reproducibility and focal visualization of active white matter areas

    Energy Technology Data Exchange (ETDEWEB)

    Park, Eunkyung; Gallezot, Jean-Dominique; Planeta, Beata; Lin, Shu-Fei; Lim, Keunpoong; Chen, Ming-Kai; Huang, Yiyun; Carson, Richard E. [Yale School of Medicine, PET Center, Department of Diagnostic Radiology, 801 Howard Avenue, PO Box 208048, New Haven, CT (United States); Delgadillo, Aracely; Liu, Shuang; O' Connor, Kevin C.; Lee, Jae-Yun; Chastre, Anne; Pelletier, Daniel [Yale School of Medicine, Department of Neurology, New Haven, CT (United States); Seneca, Nicholas; Leppert, David [Hoffmann-La Roche Ltd, Pharmaceuticals Division, Basel (Switzerland)

    2015-04-02

    Activated microglia play a key role in inflammatory demyelinating injury in multiple sclerosis (MS). Microglial activation can be measured in vivo using a positron emission tomography (PET) ligand {sup 11}C-PBR28. We evaluated the test-retest variability (TRV) and lesion detectability of {sup 11}C-PBR28 binding in MS subjects and healthy controls (HCs) with high-resolution PET. Four clinically and radiologically stable relapsing-remitting MS subjects (age 41 ± 7 years, two men/two women) and four HCs (age 42 ± 8 years, 2 two men/two women), matched for translocator protein genotype [two high- and two medium-affinity binders according to DNA polymorphism (rs6971) in each group], were studied for TRV. Another MS subject (age 41 years, male) with clinical and radiological activity was studied for lesion detectability. Dynamic data were acquired over 120 min after injection of 634 ± 101 MBq {sup 11}C-PBR28. For the TRV study, subjects were scanned twice, on average 1.4 weeks apart. Volume of distribution (V{sub T}) derived from multilinear analysis (MA1) modeling (t* = 30 min, using arterial input data) was the main outcome measure. Mean test V{sub T} values (ml cm{sup -3}) were 3.9 ± 1.4 in the whole brain gray matter (GM), 3.6 ± 1.2 in the whole brain white matter (WM) or normal-appearing white matter (NAWM), and 3.3 ± 0.6 in MS WM lesions; mean retest V{sub T} values were 3.7 ± 1.0 in GM, 3.3 ± 0.9 in WM/NAWM, and 3.3 ± 0.7 in MS lesions. Test-retest results showed a mean absolute TRV ranging from 7 to 9 % across GM, WM/NAWM, and MS lesions. High-affinity binders demonstrated 30 % higher V{sub T} than medium-affinity binders in GM. Focal {sup 11}C-PBR28 uptake was detected in two enhancing lesions of the active MS patient. High-resolution {sup 11}C-PBR28 PET can visualize focal areas where microglial activation is known to be present and has good test-retest reproducibility in the human brain. {sup 11}C-PBR28 PET is likely to be valuable for monitoring both

  11. Test-retest reproducibility of [{sup 11}C]PBR28 binding to TSPO in healthy control subjects

    Energy Technology Data Exchange (ETDEWEB)

    Collste, K.; Forsberg, A.; Varrone, A.; Amini, N.; Halldin, C.; Farde, L.; Cervenka, S. [Karolinska Institutet, Department of Clinical Neuroscience, Centre for Psychiatry Research, Stockholm (Sweden); Aeinehband, S. [Karolinska Institutet, Department of Clinical Neuroscience, Neuroimmunology Unit, Stockholm (Sweden); Yakushev, I. [Karolinska Institutet, Department of Clinical Neuroscience, Centre for Psychiatry Research, Stockholm (Sweden); Technische Universitaet Muenchen, Department of Nuclear Medicine and TUM Neuroimaging Center (TUM-NIC), Munich (Germany)

    2016-01-15

    The PET radioligand [{sup 11}C]PBR28 binds to the translocator protein (TSPO), a marker of brain immune activation. We examined the reproducibility of [{sup 11}C]PBR28 binding in healthy subjects with quantification on a regional and voxel-by-voxel basis. In addition, we performed a preliminary analysis of diurnal changes in TSPO availability. Twelve subjects were examined using a high-resolution research tomograph and [{sup 11}C]PBR28, six in the morning and afternoon of the same day, and six in the morning on two separate days. Regional volumes of distribution (V{sub T}) were derived using a region-of-interest based two-tissue compartmental analysis (2TCM), as well as a parametric approach. Metabolite-corrected arterial plasma was used as input function. For the whole sample, the mean absolute variability in V{sub T} in the grey matter (GM) was 18.3 ± 12.7 %. Intraclass correlation coefficients in GM regions ranged from 0.90 to 0.94. Reducing the time of analysis from 91 to 63 min yielded a variability of 16.9 ± 14.9 %. There was a strong correlation between the parametric and 2TCM-derived GM values (r = 0.99). A significant increase in GM V{sub T} was observed between the morning and afternoon examinations when using secondary methods of quantification (p = 0.028). In the subjects examined at the same time of the day, the absolute variability was 15.9 ± 12.2 % for the 91-min 2TCM data. V{sub T} of [{sup 11}C]PBR28 binding showed medium reproducibility and high reliability in GM regions. Our findings support the use of parametric approaches for determining [{sup 11}C]PBR28 V{sub T} values, and indicate that the acquisition time could be shortened. Diurnal changes in TSPO binding in the brain may be a potential confounder in clinical studies and should be investigated further. (orig.)

  12. Test-retest reproducibility of the metabotropic glutamate receptor 5 ligand [{sup 18}F]FPEB with bolus plus constant infusion in humans

    Energy Technology Data Exchange (ETDEWEB)

    Park, Eunkyung; Sullivan, Jenna M.; Planeta, Beata; Gallezot, Jean-Dominique; Lim, Keunpoong; Lin, Shu-Fei; Ropchan, Jim; Huang, Yiyun; Carson, Richard E. [Yale School of Medicine, PET Center, Department of Diagnostic Radiology, 801 Howard Avenue, PO Box 208048, New Haven, CT (United States); McCarthy, Timothy J. [Pfizer Worldwide Research and Development, Cambridge, MA (United States); Ding, Yu-Shin [New York University School of Medicine, Department of Radiology, New York, NY (United States); Morris, Evan D.; Williams, Wendol A. [Yale School of Medicine, PET Center, Department of Diagnostic Radiology, 801 Howard Avenue, PO Box 208048, New Haven, CT (United States); Yale School of Medicine, Department of Psychiatry, New Haven, CT (United States)

    2015-09-15

    [{sup 18}F]FPEB is a promising PET radioligand for the metabotropic glutamate receptor 5 (mGluR5), a potential target for the treatment of neuropsychiatric diseases. The purpose of this study was to evaluate the test-retest reproducibility of [{sup 18}F]FPEB in the human brain. Seven healthy male subjects were scanned twice, 3 - 11 weeks apart. Dynamic data were acquired using bolus plus infusion of 162 ± 32 MBq [{sup 18}F]FPEB. Four methods were used to estimate volume of distribution (V{sub T}): equilibrium analysis (EQ) using arterial (EQ{sub A}) or venous input data (EQ{sub V}), MA1, and a two-tissue compartment model (2 T). Binding potential (BP{sub ND}) was also estimated using cerebellar white matter (CWM) or gray matter (CGM) as the reference region using EQ, 2 T and MA1. Absolute test-retest variability (aTRV) of V{sub T} and BP{sub ND} were calculated for each method. Venous blood measurements (C{sub V}) were compared with arterial input (C{sub A}) to examine their usability in EQ analysis. Regional V{sub T} estimated by the four methods displayed a high degree of agreement (r{sup 2} ranging from 0.83 to 0.99 among the methods), although EQ{sub A} and EQ{sub V} overestimated V{sub T} by a mean of 9 % and 7 %, respectively, compared to 2 T. Mean values of aTRV of V{sub T} were 11 % by EQ{sub A}, 12 % by EQ{sub V}, 14 % by MA1 and 14 % by 2 T. Regional BP{sub ND} also agreed well among the methods and mean aTRV of BP{sub ND} was 8 - 12 % (CWM) and 7 - 9 % (CGM). Venous and arterial blood concentrations of [{sup 18}F]FPEB were well matched during equilibrium (C{sub V} = 1.01 . C{sub A}, r{sup 2} = 0.95). [{sup 18}F]FPEB binding shows good TRV with minor differences among analysis methods. Venous blood can be used as an alternative for input function measurement instead of arterial blood in EQ analysis. Thus, [{sup 18}F]FPEB is an excellent PET imaging tracer for mGluR5 in humans. (orig.)

  13. Test-retest reproducibility of the metabotropic glutamate receptor 5 ligand [18F]FPEB with bolus plus constant infusion in humans

    International Nuclear Information System (INIS)

    Park, Eunkyung; Sullivan, Jenna M.; Planeta, Beata; Gallezot, Jean-Dominique; Lim, Keunpoong; Lin, Shu-Fei; Ropchan, Jim; Huang, Yiyun; Carson, Richard E.; McCarthy, Timothy J.; Ding, Yu-Shin; Morris, Evan D.; Williams, Wendol A.

    2015-01-01

    [ 18 F]FPEB is a promising PET radioligand for the metabotropic glutamate receptor 5 (mGluR5), a potential target for the treatment of neuropsychiatric diseases. The purpose of this study was to evaluate the test-retest reproducibility of [ 18 F]FPEB in the human brain. Seven healthy male subjects were scanned twice, 3 - 11 weeks apart. Dynamic data were acquired using bolus plus infusion of 162 ± 32 MBq [ 18 F]FPEB. Four methods were used to estimate volume of distribution (V T ): equilibrium analysis (EQ) using arterial (EQ A ) or venous input data (EQ V ), MA1, and a two-tissue compartment model (2 T). Binding potential (BP ND ) was also estimated using cerebellar white matter (CWM) or gray matter (CGM) as the reference region using EQ, 2 T and MA1. Absolute test-retest variability (aTRV) of V T and BP ND were calculated for each method. Venous blood measurements (C V ) were compared with arterial input (C A ) to examine their usability in EQ analysis. Regional V T estimated by the four methods displayed a high degree of agreement (r 2 ranging from 0.83 to 0.99 among the methods), although EQ A and EQ V overestimated V T by a mean of 9 % and 7 %, respectively, compared to 2 T. Mean values of aTRV of V T were 11 % by EQ A , 12 % by EQ V , 14 % by MA1 and 14 % by 2 T. Regional BP ND also agreed well among the methods and mean aTRV of BP ND was 8 - 12 % (CWM) and 7 - 9 % (CGM). Venous and arterial blood concentrations of [ 18 F]FPEB were well matched during equilibrium (C V = 1.01 . C A , r 2 = 0.95). [ 18 F]FPEB binding shows good TRV with minor differences among analysis methods. Venous blood can be used as an alternative for input function measurement instead of arterial blood in EQ analysis. Thus, [ 18 F]FPEB is an excellent PET imaging tracer for mGluR5 in humans. (orig.)

  14. Normal ranges and test-retest reproducibility of flow and velocity parameters in intracranial arteries measured with phase-contrast magnetic resonance imaging

    International Nuclear Information System (INIS)

    Correia de Verdier, Maria; Wikstroem, Johan

    2016-01-01

    The purpose of the present study was to investigate normal ranges and test-retest reproducibility of phase-contrast MRI (PC-MRI)-measured flow and velocity parameters in intracranial arteries. Highest flow (HF), lowest flow (LF), peak systolic velocity (PSV), and end diastolic velocity (EDV) were measured at two dates in the anterior (ACA), middle (MCA), and posterior (PCA) cerebral arteries of 30 healthy volunteers using two-dimensional PC-MRI at 3 T. Least detectable difference (LDD) was calculated. In the left ACA, HF was (mean (range, LDD)) 126 ml/min (36-312, 59 %), LF 61 ml/min (0-156, 101 %), PSV 64 cm/s (32-141, 67 %), and EDV 35 cm/s (18-55, 42 %); in the right ACA, HF was 154 ml/min (42-246, 49 %), LF 77 ml/min (0-156, 131 %), PSV 75 cm/s (26-161, 82 %), and EDV 39 cm/s (7-59, 67 %). In the left MCA, HF was 235 ml/min (126-372, 35 %), LF 116 ml/min (42-186, 48 %), PSV 90 cm/s (55-183, 39 %), and EDV 46 cm/s (20-66, 28 %); in the right MCA, HF was 238 ml/min (162-342, 44 %), LF 120 ml/min (72-216, 48 %), PSV 88 cm/s (55-141, 35 %), and EDV 45 cm/s (26-67, 23 %). In the left PCA, HF was 108 ml/min (42-168, 54 %), LF 53 ml/min (18-108, 64 %), PSV 50 cm/s (24-77, 63 %), and EDV 28 cm/s (14-40, 45 %); in the right PCA, HF was 98 ml/min (30-162, 49 %), LF 49 ml/min (12-84, 55 %), PSV 47 cm/s (27-88, 59 %), and EDV 27 cm/s (16-41, 45 %). PC-MRI-measured flow and velocity parameters in the main intracranial arteries have large normal ranges. Reproducibility is highest in MCA. (orig.)

  15. Normal ranges and test-retest reproducibility of flow and velocity parameters in intracranial arteries measured with phase-contrast magnetic resonance imaging

    Energy Technology Data Exchange (ETDEWEB)

    Correia de Verdier, Maria; Wikstroem, Johan [Uppsala University Hospital, Department of Radiology, Uppsala University, Uppsala (Sweden)

    2016-05-15

    The purpose of the present study was to investigate normal ranges and test-retest reproducibility of phase-contrast MRI (PC-MRI)-measured flow and velocity parameters in intracranial arteries. Highest flow (HF), lowest flow (LF), peak systolic velocity (PSV), and end diastolic velocity (EDV) were measured at two dates in the anterior (ACA), middle (MCA), and posterior (PCA) cerebral arteries of 30 healthy volunteers using two-dimensional PC-MRI at 3 T. Least detectable difference (LDD) was calculated. In the left ACA, HF was (mean (range, LDD)) 126 ml/min (36-312, 59 %), LF 61 ml/min (0-156, 101 %), PSV 64 cm/s (32-141, 67 %), and EDV 35 cm/s (18-55, 42 %); in the right ACA, HF was 154 ml/min (42-246, 49 %), LF 77 ml/min (0-156, 131 %), PSV 75 cm/s (26-161, 82 %), and EDV 39 cm/s (7-59, 67 %). In the left MCA, HF was 235 ml/min (126-372, 35 %), LF 116 ml/min (42-186, 48 %), PSV 90 cm/s (55-183, 39 %), and EDV 46 cm/s (20-66, 28 %); in the right MCA, HF was 238 ml/min (162-342, 44 %), LF 120 ml/min (72-216, 48 %), PSV 88 cm/s (55-141, 35 %), and EDV 45 cm/s (26-67, 23 %). In the left PCA, HF was 108 ml/min (42-168, 54 %), LF 53 ml/min (18-108, 64 %), PSV 50 cm/s (24-77, 63 %), and EDV 28 cm/s (14-40, 45 %); in the right PCA, HF was 98 ml/min (30-162, 49 %), LF 49 ml/min (12-84, 55 %), PSV 47 cm/s (27-88, 59 %), and EDV 27 cm/s (16-41, 45 %). PC-MRI-measured flow and velocity parameters in the main intracranial arteries have large normal ranges. Reproducibility is highest in MCA. (orig.)

  16. Test-retest reproducibility of dopamine D{sub 2/3} receptor binding in human brain measured by PET with [{sup 11}C]MNPA and [{sup 11}C]raclopride

    Energy Technology Data Exchange (ETDEWEB)

    Kodaka, Fumitoshi [National Institute of Radiological Sciences, Molecular Neuroimaging Program, Molecular Imaging Center, Chiba (Japan); Jikei University School of Medicine, Department of Psychiatry, Tokyo (Japan); Ito, Hiroshi [National Institute of Radiological Sciences, Molecular Neuroimaging Program, Molecular Imaging Center, Chiba (Japan); National Institute of Radiological Sciences, Biophysics Program, Molecular Imaging Center, Chiba (Japan); Kimura, Yasuyuki; Fujie, Saori; Takano, Harumasa; Fujiwara, Hironobu; Sasaki, Takeshi; Suhara, Tetsuya [National Institute of Radiological Sciences, Molecular Neuroimaging Program, Molecular Imaging Center, Chiba (Japan); Nakayama, Kazuhiko [Jikei University School of Medicine, Department of Psychiatry, Tokyo (Japan); Halldin, Christer; Farde, Lars [Karolinska Institutet, Department of Clinical Neuroscience, Stockholm (Sweden)

    2013-04-15

    Dopamine D{sub 2/3} receptors (D{sub 2/3}Rs) have two affinity states for endogenous dopamine, referred to as high-affinity state (D{sub 2/3} {sup HIGH}), which has a high affinity for endogenous dopamine, and low-affinity state (D{sub 2/3} {sup LOW}). The density of D{sub 2/3} {sup HIGH} can be measured with (R)-2-{sup 11}CH{sub 3}O-N-n-propylnorapomorphine ([{sup 11}C]MNPA), while total density of D{sub 2/3} {sup HIGH} and D{sub 2/3} {sup LOW} (D{sub 2/3}Rs) can be measured with [{sup 11}C]raclopride using positron emission tomography (PET). Thus, the ratio of the binding potential (BP) of [{sup 11}C]MNPA to that of [{sup 11}C]raclopride ([{sup 11}C]MNPA/[{sup 11}C]raclopride) may reflect the proportion of the density of D{sub 2/3} {sup HIGH} to that of D{sub 2/3}Rs. In the caudate and putamen, [{sup 11}C]MNPA/[{sup 11}C]raclopride reflects the proportion of the density of D{sub 2} {sup HIGH} to that of D{sub 2}Rs. To evaluate the reliability of the PET paradigm with [{sup 11}C]MNPA and [{sup 11}C]raclopride, we investigated the test-retest reproducibility of non-displaceable BP (BP{sub ND}) measured with [{sup 11}C]MNPA and of [{sup 11}C]MNPA/[{sup 11}C]raclopride in healthy humans. Eleven healthy male volunteers underwent two sets of PET studies on separate days that each included [{sup 11}C]MNPA and [{sup 11}C]raclopride scans. BP{sub ND} values in the caudate and putamen were calculated. Test-retest reproducibility of BP{sub ND} of [{sup 11}C]MNPA and [{sup 11}C]MNPA/[{sup 11}C]raclopride was assessed by intra-subject variability (absolute variability) and test-retest reliability (intraclass correlation coefficient: ICC). The absolute variability of [{sup 11}C]MNPA BP{sub ND} was 5.30 {+-} 3.96 % and 12.3 {+-} 7.95 % and the ICC values of [{sup 11}C]MNPA BP{sub ND} were 0.72 and 0.82 in the caudate and putamen, respectively. The absolute variability of [{sup 11}C]MNPA/[{sup 11}C]raclopride was 6.11 {+-} 3.68 % and 11.60 {+-} 5.70 % and the ICC values of [{sup

  17. Minimum joint space width (mJSW) of patellofemoral joint on standing ''skyline'' radiographs: test-retest reproducibility and comparison with quantitative magnetic resonance imaging (qMRI)

    Energy Technology Data Exchange (ETDEWEB)

    Simoni, Paolo; Jamali, Sanaa; Alvarez Miezentseva, Victoria [CHU de Liege, Diagnostic Imaging Departement, Domanine du Sart Tilman, Liege (Belgium); Albert, Adelin [CHU de Liege, Biostatistics Departement, Domanine du Sart Tilman, Liege (Belgium); Totterman, Saara; Schreyer, Edward; Tamez-Pena, Jose G. [Qmetrics Technologies, Rochester, NY (United States); Zobel, Bruno Beomonte [Campus Bio-Medico University, Diagnostic Imaging Departement, Rome (Italy); Gillet, Philippe [CHU de Liege, Orthopaedic surgery Department, Domanine du Sart Tilman, Liege (Belgium)

    2013-11-15

    To assess the intraobserver, interobserver, and test-retest reproducibility of minimum joint space width (mJSW) measurement of medial and lateral patellofemoral joints on standing ''skyline'' radiographs and to compare the mJSW of the patellofemoral joint to the mean cartilage thickness calculated by quantitative magnetic resonance imaging (qMRI). A couple of standing ''skyline'' radiographs of the patellofemoral joints and MRI of 55 knees of 28 volunteers (18 females, ten males, mean age, 48.5 {+-} 16.2 years) were obtained on the same day. The mJSW of the patellofemoral joint was manually measured and Kellgren and Lawrence grade (KLG) was independently assessed by two observers. The mJSW was compared to the mean cartilage thickness of patellofemoral joint calculated by qMRI. mJSW of the medial and lateral patellofemoral joint showed an excellent intraobserver agreement (interclass correlation (ICC) = 0.94 and 0.96), interobserver agreement (ICC = 0.90 and 0.95) and test-retest agreement (ICC = 0.92 and 0.96). The mJSW measured on radiographs was correlated to mean cartilage thickness calculated by qMRI (r = 0.71, p < 0.0001 for the medial PFJ and r = 0.81, p < 0.0001 for the lateral PFJ). However, there was a lack of concordance between radiographs and qMRI for extreme values of joint width and KLG. Radiographs yielded higher joint space measures than qMRI in knees with a normal joint space, while qMRI yielded higher joint space measures than radiographs in knees with joint space narrowing and higher KLG. Standing ''skyline'' radiographs are a reproducible tool for measuring the mJSW of the patellofemoral joint. The mJSW of the patellofemoral joint on radiographs are correlated with, but not concordant with, qMRI measurements. (orig.)

  18. Test-retest reliability of cognitive EEG

    Science.gov (United States)

    McEvoy, L. K.; Smith, M. E.; Gevins, A.

    2000-01-01

    OBJECTIVE: Task-related EEG is sensitive to changes in cognitive state produced by increased task difficulty and by transient impairment. If task-related EEG has high test-retest reliability, it could be used as part of a clinical test to assess changes in cognitive function. The aim of this study was to determine the reliability of the EEG recorded during the performance of a working memory (WM) task and a psychomotor vigilance task (PVT). METHODS: EEG was recorded while subjects rested quietly and while they performed the tasks. Within session (test-retest interval of approximately 1 h) and between session (test-retest interval of approximately 7 days) reliability was calculated for four EEG components: frontal midline theta at Fz, posterior theta at Pz, and slow and fast alpha at Pz. RESULTS: Task-related EEG was highly reliable within and between sessions (r0.9 for all components in WM task, and r0.8 for all components in the PVT). Resting EEG also showed high reliability, although the magnitude of the correlation was somewhat smaller than that of the task-related EEG (r0.7 for all 4 components). CONCLUSIONS: These results suggest that under appropriate conditions, task-related EEG has sufficient retest reliability for use in assessing clinical changes in cognitive status.

  19. Test-retest reliability of the Work Ability Index questionnaire

    NARCIS (Netherlands)

    de Zwart, B. C. H.; Frings-Dresen, M. H. W.; Van Duivenbooden, J. C.

    2002-01-01

    The goal of the study was to assess the test-retest reliability of the Work Ability Index (WAI) questionnaire. Reliability was tested using a test-retest design with a 4 week interval between measurements. Valid data were collected among 97 elderly construction workers aged 40 years and older. We

  20. Test-retest reliability of trunk accelerometric gait analysis

    DEFF Research Database (Denmark)

    Henriksen, Marius; Lund, Hans; Moe-Nilssen, R

    2004-01-01

    The purpose of this study was to determine the test-retest reliability of a trunk accelerometric gait analysis in healthy subjects. Accelerations were measured during walking using a triaxial accelerometer mounted on the lumbar spine of the subjects. Six men and 14 women (mean age 35.2; range 18...... a definite potential in clinical gait analysis....

  1. Test-retest reliability of the multifocal photopic negative response.

    Science.gov (United States)

    Van Alstine, Anthony W; Viswanathan, Suresh

    2017-02-01

    To assess the test-retest reliability of the multifocal photopic negative response (mfPhNR) of normal human subjects. Multifocal electroretinograms were recorded from one eye of 61 healthy adult subjects on two separate days using a Visual Evoked Response Imaging System software version 4.3 (EDI, San Mateo, California). The visual stimulus delivered on a 75-Hz monitor consisted of seven equal-sized hexagons each subtending 12° of visual angle. The m-step exponent was 9, and the m-sequence was slowed to include at least 30 blank frames after each flash. Only the first slice of the first-order kernel was analyzed. The mfPhNR amplitude was measured at a fixed time in the trough from baseline (BT) as well as at the same fixed time in the trough from the preceding b-wave peak (PT). Additionally, we also analyzed BT normalized either to PT (BT/PT) or to the b-wave amplitude (BT/b-wave). The relative reliability of test-retest differences for each test location was estimated by the Wilcoxon matched-pair signed-rank test and intraclass correlation coefficients (ICC). Absolute test-retest reliability was estimated by Bland-Altman analysis. The test-retest amplitude differences for neither of the two measurement techniques were statistically significant as determined by Wilcoxon matched-pair signed-rank test. PT measurements showed greater ICC values than BT amplitude measurements for all test locations. For each measurement technique, the ICC value of the macular response was greater than that of the surrounding locations. The mean test-retest difference was close to zero for both techniques at each of the test locations, and while the coefficient of reliability (COR-1.96 times the standard deviation of the test-retest difference) was comparable for the two techniques at each test location when expressed in nanovolts, the %COR (COR normalized to the mean test and retest amplitudes) was superior for PT than BT measurements. The ICC and COR were comparable for the BT/PT and

  2. Rorschach e pedofilia: a fidedignidade no teste-reteste = Rorschach and pedophilia: a reliability at test-retest

    Directory of Open Access Journals (Sweden)

    Scortegagna, Silvana Alba

    2013-01-01

    Full Text Available Esse estudo buscou investigar as características de personalidade de um indivíduo pedófilo, e evidenciar a fidedignidade do Rorschach no teste-reteste. O participante, com 38 anos de idade, masculino, respondeu a entrevista e ao método de Rorschach, em duas etapas. Os principais achados revelam: a uma tendência à fragmentação na percepção de si e dos outros; b autoimagem negativa e desfavorável em relação ao corpo e suas funções; c problemas nas relações interpessoais, falhas na capacidade de empatia; d déficit no ajustamento perceptivo da realidade; e vulnerabilidade a pressões subjetivas e impulsividade. Esses resultados mantiveram-se estáveis comparando-se as duas aplicações, permitindo ampliar a compreensão dos elementos psicológicos envolvidos na pedofilia, que se mantem, e apoiam a fidedignidade do Rorschach no teste-reteste

  3. Test-retest reliability for aerodynamic measures of voice.

    Science.gov (United States)

    Awan, Shaheen N; Novaleski, Carolyn K; Yingling, Julie R

    2013-11-01

    The purpose of this study was to investigate the intrasubject reliability of aerodynamic characteristics of the voice within typical/normal speakers across testing sessions using the Phonatory Aerodynamic System (PAS 6600; KayPENTAX, Montvale, NJ). Participants were 60 healthy young adults (30 males and 30 females) between the ages 18 and 31 years with perceptually typical voice. Participants were tested using the PAS 6600 (Phonatory Aerodynamic System) on two separate days with approximately 1 week between each session at approximately the same time of day. Four PAS protocols were conducted (vital capacity, maximum sustained phonation, comfortable sustained phonation, and voicing efficiency) and measures of expiratory volume, maximum phonation time, mean expiratory airflow (during vowel production) and target airflow (obtained via syllable repetition), peak air pressure, aerodynamic power, aerodynamic resistance, and aerodynamic efficiency were obtained during each testing session. Associated acoustic measures of vocal intensity and frequency were also collected. All phonations were elicited at comfortable pitch and loudness. All aerodynamic and associated variables evaluated in this study showed useable test-retest reliability (ie, intraclass correlation coefficients [ICCs] ≥ 0.60). A high degree of mean test-retest reliability was found across all subjects for aerodynamic and associated acoustic measurements of vital capacity, maximum sustained phonation, glottal resistance, and vocal intensity (all with ICCs > 0.75). Although strong ICCs were observed for measures of glottal power and mean expiratory airflow in males, weaker overall results for these measures (ICC range: 0.60-0.67) were observed in females subjects and sizable coefficients of variation were observed for measures of power, resistance, and efficiency in both men and women. Differences in degree of reliability from measure to measure were revealed in greater detail using methods such as ICCs and

  4. Estimation of macular pigment optical density in the elderly: test-retest variability and effect of optical blur in pseudophakic subjects

    NARCIS (Netherlands)

    Gallaher, Kevin T.; Mura, Marco; Todd, Wm Andrew; Harris, Tarsha L.; Kenyon, Emily; Harris, Tamara; Johnson, Karen C.; Satterfield, Suzanne; Kritchevsky, Stephen B.; Iannaccone, Alessandro

    2007-01-01

    The reproducibility of macular pigment optical density (MPOD) estimates in the elderly was assessed in 40 subjects (age: 79.1+/-3.5). Test-retest variability was good (Pearson's r coefficient: 0.734), with an average coefficient of variation (CV) of 18.4% and an intraclass correlation coefficient

  5. Resting-state test-retest reliability of a priori defined canonical networks over different preprocessing steps.

    Science.gov (United States)

    Varikuti, Deepthi P; Hoffstaedter, Felix; Genon, Sarah; Schwender, Holger; Reid, Andrew T; Eickhoff, Simon B

    2017-04-01

    Resting-state functional connectivity analysis has become a widely used method for the investigation of human brain connectivity and pathology. The measurement of neuronal activity by functional MRI, however, is impeded by various nuisance signals that reduce the stability of functional connectivity. Several methods exist to address this predicament, but little consensus has yet been reached on the most appropriate approach. Given the crucial importance of reliability for the development of clinical applications, we here investigated the effect of various confound removal approaches on the test-retest reliability of functional-connectivity estimates in two previously defined functional brain networks. Our results showed that gray matter masking improved the reliability of connectivity estimates, whereas denoising based on principal components analysis reduced it. We additionally observed that refraining from using any correction for global signals provided the best test-retest reliability, but failed to reproduce anti-correlations between what have been previously described as antagonistic networks. This suggests that improved reliability can come at the expense of potentially poorer biological validity. Consistent with this, we observed that reliability was proportional to the retained variance, which presumably included structured noise, such as reliable nuisance signals (for instance, noise induced by cardiac processes). We conclude that compromises are necessary between maximizing test-retest reliability and removing variance that may be attributable to non-neuronal sources.

  6. Resting-state test-retest reliability of a priori defined canonical networks over different preprocessing steps

    Science.gov (United States)

    Varikuti, Deepthi P.; Hoffstaedter, Felix; Genon, Sarah; Schwender, Holger; Reid, Andrew T.; Eickhoff, Simon B.

    2016-01-01

    Resting-state functional connectivity analysis has become a widely used method for the investigation of human brain connectivity and pathology. The measurement of neuronal activity by functional MRI, however, is impeded by various nuisance signals that reduce the stability of functional connectivity. Several methods exist to address this predicament, but little consensus has yet been reached on the most appropriate approach. Given the crucial importance of reliability for the development of clinical applications, we here investigated the effect of various confound removal approaches on the test-retest reliability of functional-connectivity estimates in two previously defined functional brain networks. Our results showed that grey matter masking improved the reliability of connectivity estimates, whereas de-noising based on principal components analysis reduced it. We additionally observed that refraining from using any correction for global signals provided the best test-retest reliability, but failed to reproduce anti-correlations between what have been previously described as antagonistic networks. This suggests that improved reliability can come at the expense of potentially poorer biological validity. Consistent with this, we observed that reliability was proportional to the retained variance, which presumably included structured noise, such as reliable nuisance signals (for instance, noise induced by cardiac processes). We conclude that compromises are necessary between maximizing test-retest reliability and removing variance that may be attributable to non-neuronal sources. PMID:27550015

  7. Learning effect and test-retest variability of pulsar perimetry.

    Science.gov (United States)

    Salvetat, Maria Letizia; Zeppieri, Marco; Parisi, Lucia; Johnson, Chris A; Sampaolesi, Roberto; Brusini, Paolo

    2013-03-01

    To assess Pulsar Perimetry learning effect and test-retest variability (TRV) in normal (NORM), ocular hypertension (OHT), glaucomatous optic neuropathy (GON), and primary open-angle glaucoma (POAG) eyes. This multicenter prospective study included 43 NORM, 38 OHT, 33 GON, and 36 POAG patients. All patients underwent standard automated perimetry and Pulsar Contrast Perimetry using white stimuli modulated in phase and counterphase at 30 Hz (CP-T30W test). The learning effect and TRV for Pulsar Perimetry were assessed for 3 consecutive visual fields (VFs). The learning effect were evaluated by comparing results from the first session with the other 2. TRV was assessed by calculating the mean of the differences (in absolute value) between retests for each combination of single tests. TRV was calculated for Mean Sensitivity, Mean Defect, and single Mean Sensitivity for each 66 test locations. Influence of age, VF eccentricity, and loss severity on TRV were assessed using linear regression analysis and analysis of variance. The learning effect was not significant in any group (analysis of variance, P>0.05). TRV for Mean Sensitivity and Mean Defect was significantly lower in NORM and OHT (0.6 ± 0.5 spatial resolution contrast units) than in GON and POAG (0.9 ± 0.5 and 1.0 ± 0.8 spatial resolution contrast units, respectively) (Kruskal-Wallis test, P=0.04); however, the differences in NORM among age groups was not significant (Kruskal-Wallis test, P>0.05). Slight significant differences were found for the single Mean Sensitivity TRV among single locations (Duncan test, PPulsar Perimetry CP-T30W test did not show significant learning effect in patients with standard automated perimetry experience. TRV for global indices was generally low, and was not related to patient age; it was only slightly affected by VF defect eccentricity, and significantly influenced by VF loss severity.

  8. Test-Retest Reliability of a Survey to Measure Transport-Related Physical Activity in Adults

    Science.gov (United States)

    Badland, Hannah; Schofield, Grant

    2006-01-01

    The present research details test-retest reliability of a newly developed, telephone-administered TPA survey for adults. This instrument examines barriers, perceptions, and current travel behaviors to place of work/study and local convenience shops. Demonstrated test-retest reliability of the Active Friendly Environments-Transport-Related Physical…

  9. Test-retest reliability of infant event related potentials evoked by faces.

    Science.gov (United States)

    Munsters, N M; van Ravenswaaij, H; van den Boomen, C; Kemner, C

    2017-04-05

    Reliable measures are required to draw meaningful conclusions regarding developmental changes in longitudinal studies. Little is known, however, about the test-retest reliability of face-sensitive event related potentials (ERPs), a frequently used neural measure in infants. The aim of the current study is to investigate the test-retest reliability of ERPs typically evoked by faces in 9-10 month-old infants. The infants (N=31) were presented with neutral, fearful and happy faces that contained only the lower or higher spatial frequency information. They were tested twice within two weeks. The present results show that the test-retest reliability of the face-sensitive ERP components is moderate (P400 and Nc) to substantial (N290). However, there is low test-retest reliability for the effects of the specific experimental manipulations (i.e. emotion and spatial frequency) on the face-sensitive ERPs. To conclude, in infants the face-sensitive ERP components (i.e. N290, P400 and Nc) show adequate test-retest reliability, but not the effects of emotion and spatial frequency on these ERP components. We propose that further research focuses on investigating elements that might increase the test-retest reliability, as adequate test-retest reliability is necessary to draw meaningful conclusions on individual developmental trajectories of the face-sensitive ERPs in infants. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.

  10. Test-retest studies of cerebral glucose metabolism using fluorine-18 deoxyglucose: validation of method

    International Nuclear Information System (INIS)

    Brooks, R.A.; Di Chiro, G.; Zukerberg, B.W.; Bairamian, D.; Larson, S.M.

    1987-01-01

    In studies using [ 18 F]deoxyglucose (FDG), one often wants to compare metabolic rates following stimulation (drug or motor-sensory) with the baseline values. However, because of reproducibility problems with baseline variations of 25% in the same individual not uncommon, the global effect of the stimulation may be difficult to see. One approach to this problem is to perform the two studies sequentially. This means that, with the 110-min half-life of 18 F, one must take into account the residual activity from the first study when calculating metabolic rates for the second. We performed TEST-RETEST baseline studies on four subjects, with a 1-hr interval between injections. These studies were done without stimulation, in order to validate the repeatability of the method. To reduce the amount of residual activity from the first study, the first injection was only 2 mCi in three cases, and only 1 mCi in one case, out of a total injected dose of 5 mCi. A correction for residual activity was included in the RETEST calculation of metabolic rate. The results showed a global metabolic shift between the two studies of 2% to 9%. An error analysis shows that the shift could be further reduced if anatomically comparable scans are done at comparable postinjection times

  11. Balance Assessment in Sports-Related Concussion: Evaluating Test-Retest Reliability of the Equilibrate System.

    Science.gov (United States)

    Odom, Mitchell J; Lee, Young M; Zuckerman, Scott L; Apple, Rachel P; Germanos, Theodore; Solomon, Gary S; Sills, Allen K

    2016-01-01

    This study evaluated the test-retest reliability of a novel computer-based, portable balance assessment tool, the Equilibrate System (ES), used to diagnose sports-related concussion. Twenty-seven students participated in ES testing consisting of three sessions over 4 weeks. The modified Balance Error Scoring System was performed. For each participant, test-retest reliability was established using the intraclass correlation coefficient (ICC). The ES test-retest reliability from baseline to week 2 produced an ICC value of 0.495 (95% CI, 0.123-0.745). Week 2 testing produced ICC values of 0.602 (95% CI, 0.279-0.803) and 0.610 (95% CI, 0.299-0.804), respectively. All other single measures test-retest reliability values produced poor ICC values. Same-day ES testing showed fair to good test-retest reliability while interweek measures displayed poor to fair test-retest reliability. Testing conditions should be controlled when using computerized balance assessment methods. ES testing should only be used as a part of a comprehensive assessment.

  12. Diffusion-weighted (DW) MRI in lung cancers. ADC test-retest repeatability

    Energy Technology Data Exchange (ETDEWEB)

    Weller, Alex; Papoutsaki, Marianthi Vasiliki; Blackledge, Matthew; DeSouza, Nandita M. [Institute of Cancer Research and Royal Marsden NHS Foundation Trust, CRUK Cancer Imaging Centre, Surrey (United Kingdom); Waterton, John C. [University of Manchester, Manchester (United Kingdom); Chiti, Arturo [Humanitas University, Milan (Italy); Stroobants, Sigrid [Universiteit Antwerpen, Antwerpen (Belgium); Kuijer, Joost [Vrije Universiteit Medisch Centrum, Amsterdam (Netherlands); Morgan, Veronica [Royal Marsden NHS Foundation Trust, Department of Medicine, London (United Kingdom)

    2017-11-15

    To determine the test-retest repeatability of Apparent Diffusion Coefficient (ADC) measurements across institutions and MRI vendors, plus investigate the effect of post-processing methodology on measurement precision. Thirty malignant lung lesions >2 cm in size (23 patients) were scanned on two occasions, using echo-planar-Diffusion-Weighted (DW)-MRI to derive whole-tumour ADC (b = 100, 500 and 800 s/mm{sup -2}). Scanning was performed at 4 institutions (3 MRI vendors). Whole-tumour volumes-of-interest were copied from first visit onto second visit images and from one post-processing platform to an open-source platform, to assess ADC repeatability and cross-platform reproducibility. Whole-tumour ADC values ranged from 0.66-1.94x10{sup -3} mm{sup 2}s{sup -1} (mean = 1.14). Within-patient coefficient-of-variation (wCV) was 7.1% (95% CI 5.7-9.6%), limits-of-agreement (LoA) -18.0 to 21.9%. Lesions >3 cm had improved repeatability: wCV 3.9% (95% CI 2.9-5.9%); and LoA -10.2 to 11.4%. Variability for lesions <3 cm was 2.46 times higher. ADC reproducibility across different post-processing platforms was excellent: Pearson's R{sup 2} = 0.99; CoV 2.8% (95% CI 2.3-3.4%); and LoA -7.4 to 8.0%. A free-breathing DW-MRI protocol for imaging malignant lung tumours achieved satisfactory within-patient repeatability and was robust to changes in post-processing software, justifying its use in multi-centre trials. For response evaluation in individual patients, a change in ADC >21.9% will reflect treatment-related change. (orig.)

  13. Test-Retest Reliability of the Short-Form Survivor Unmet Needs Survey.

    Science.gov (United States)

    Taylor, Karen; Bulsara, Max; Monterosso, Leanne

    2018-01-01

    Reliable and valid needs assessment measures are important assessment tools in cancer survivorship care. A new 30-item short-form version of the Survivor Unmet Needs Survey (SF-SUNS) was developed and validated with cancer survivors, including hematology cancer survivors; however, test-retest reliability has not been established. The objective of this study was to assess the test-retest reliability of the SF-SUNS with a cohort of lymphoma survivors ( n = 40). Test-retest reliability of the SF-SUNS was conducted at two time points: baseline (time 1) and 5 days later (time 2). Test-retest data were collected from lymphoma cancer survivors ( n = 40) in a large tertiary cancer center in Western Australia. Intraclass correlation analyses compared data at time 1 (baseline) and time 2 (5 days later). Cronbach's alpha analyses were performed to assess the internal consistency at both time points. The majority (23/30, 77%) of items achieved test-retest reliability scores 0.45-0.74 (fair to good). A high degree of overall internal consistency was demonstrated (time 1 = 0.92, time 2 = 0.95), with scores 0.65-0.94 across subscales for both time points. Mixed test-retest reliability of the SF-SUNS was established. Our results indicate the SF-SUNS is responsive to the changing needs of lymphoma cancer survivors. Routine use of cancer survivorship specific needs-based assessments is required in oncology care today. Nurses are well placed to administer these assessments and provide tailored information and resources. Further assessment of test-retest reliability in hematology and other cancer cohorts is warranted.

  14. Test-retest reliability and predictive validity of the Implicit Association Test in children.

    Science.gov (United States)

    Rae, James R; Olson, Kristina R

    2018-02-01

    The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many factors simultaneously (lag-time between testing administrations, domain, etc.), it is difficult to discern what factors may explain variability in existing test-retest reliability and predictive validity estimates. Across five studies (total N = 519; ages 6- to 11-years-old), we manipulated two factors that have varied in previous developmental research-lag-time and domain. An internal meta-analysis of these studies revealed that, across three different methods of analyzing the data, mean test-retest (rs of .48, .38, and .34) and predictive validity (rs of .46, .20, and .10) effect sizes were significantly greater than zero. While lag-time did not moderate the magnitude of test-retest coefficients, whether we observed domain differences in test-retest reliability and predictive validity estimates was contingent on other factors, such as how we scored the IAT or whether we included estimates from a unique sample (i.e., a sample containing gender typical and gender diverse children). Recommendations are made for developmental researchers that utilize the IAT in their research. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  15. The role of test-retest reliability in measuring individual and group differences in executive functioning.

    Science.gov (United States)

    Paap, Kenneth R; Sawi, Oliver

    2016-12-01

    Studies testing for individual or group differences in executive functioning can be compromised by unknown test-retest reliability. Test-retest reliabilities across an interval of about one week were obtained from performance in the antisaccade, flanker, Simon, and color-shape switching tasks. There is a general trade-off between the greater reliability of single mean RT measures, and the greater process purity of measures based on contrasts between mean RTs in two conditions. The individual differences in RT model recently developed by Miller and Ulrich was used to evaluate the trade-off. Test-retest reliability was statistically significant for 11 of the 12 measures, but was of moderate size, at best, for the difference scores. The test-retest reliabilities for the Simon and flanker interference scores were lower than those for switching costs. Standard practice evaluates the reliability of executive-functioning measures using split-half methods based on data obtained in a single day. Our test-retest measures of reliability are lower, especially for difference scores. These reliability measures must also take into account possible day effects that classical test theory assumes do not occur. Measures based on single mean RTs tend to have acceptable levels of reliability and convergent validity, but are "impure" measures of specific executive functions. The individual differences in RT model shows that the impurity problem is worse than typically assumed. However, the "purer" measures based on difference scores have low convergent validity that is partly caused by deficiencies in test-retest reliability. Copyright © 2016 Elsevier B.V. All rights reserved.

  16. Test-retest reliability of tibiofemoral joint space width measurements made using a low-dose standing CT scanner

    Energy Technology Data Exchange (ETDEWEB)

    Segal, Neil A. [University of Kansas Medical Center, Department of Rehabilitation Medicine, 3901 Rainbow Boulevard, Mailstop 1046, Kansas City, KS (United States); The University of Iowa, Iowa City, IA (United States); Bergin, John; Kern, Andrew; Findlay, Christian [The University of Iowa, Iowa City, IA (United States); Anderson, Donald D. [The University of Iowa, Department of Orthopaedics and Rehabilitation, Iowa City, IA (United States)

    2017-02-15

    To determine the test-retest reliability of knee joint space width (JSW) measurements made using standing CT (SCT) imaging. This prospective two-visit study included 50 knees from 30 subjects (66% female; mean ± SD age 58.2 ± 11.3 years; BMI 29.1 ± 5.6 kg/m{sup 2}; 38% KL grade 0-1). Tibiofemoral geometry was obtained from bilateral, approximately 20 fixed-flexed SCT images acquired at visits 2 weeks apart. For each compartment, the total joint area was defined as the area with a JSW <10 mm. The summary measurements of interest were the percentage of the total joint area with a JSW less than 0.5-mm thresholds between 2.0 and 5.0 mm in each tibiofemoral compartment. Test-retest reliability of the summary JSW measurements was assessed by intraclass correlation coefficients (ICC 2,1) for the percentage area engaged at each threshold of JSW and root-mean-square errors (RMSE) were calculated to assess reproducibility. The ICCs were excellent for each threshold assessed, ranging from 0.95 to 0.97 for the lateral and 0.90 to 0.97 for the medial compartment. RMSE ranged from 1.1 to 7.2% for the lateral and from 3.1 to 9.1% for the medial compartment, with better reproducibility at smaller JSW thresholds. The knee joint positioning protocol used demonstrated high day-to-day reliability for SCT 3D tibiofemoral JSW summary measurements repeated 2 weeks apart. Low-dose SCT provides a great deal of information about the joint while maintaining high reliability, making it a suitable alternative to plain radiographs for evaluating JSW in people with knee OA. (orig.)

  17. Test-retest reliability of tibiofemoral joint space width measurements made using a low-dose standing CT scanner

    International Nuclear Information System (INIS)

    Segal, Neil A.; Bergin, John; Kern, Andrew; Findlay, Christian; Anderson, Donald D.

    2017-01-01

    To determine the test-retest reliability of knee joint space width (JSW) measurements made using standing CT (SCT) imaging. This prospective two-visit study included 50 knees from 30 subjects (66% female; mean ± SD age 58.2 ± 11.3 years; BMI 29.1 ± 5.6 kg/m 2 ; 38% KL grade 0-1). Tibiofemoral geometry was obtained from bilateral, approximately 20 fixed-flexed SCT images acquired at visits 2 weeks apart. For each compartment, the total joint area was defined as the area with a JSW <10 mm. The summary measurements of interest were the percentage of the total joint area with a JSW less than 0.5-mm thresholds between 2.0 and 5.0 mm in each tibiofemoral compartment. Test-retest reliability of the summary JSW measurements was assessed by intraclass correlation coefficients (ICC 2,1) for the percentage area engaged at each threshold of JSW and root-mean-square errors (RMSE) were calculated to assess reproducibility. The ICCs were excellent for each threshold assessed, ranging from 0.95 to 0.97 for the lateral and 0.90 to 0.97 for the medial compartment. RMSE ranged from 1.1 to 7.2% for the lateral and from 3.1 to 9.1% for the medial compartment, with better reproducibility at smaller JSW thresholds. The knee joint positioning protocol used demonstrated high day-to-day reliability for SCT 3D tibiofemoral JSW summary measurements repeated 2 weeks apart. Low-dose SCT provides a great deal of information about the joint while maintaining high reliability, making it a suitable alternative to plain radiographs for evaluating JSW in people with knee OA. (orig.)

  18. A Test-Retest Analysis of the Vanderbilt Assessment for Leadership in Education in the USA

    Science.gov (United States)

    Minor, Elizabeth Covay; Porter, Andrew C.; Murphy, Joseph; Goldring, Ellen; Elliott, Stephen N.

    2017-01-01

    The Vanderbilt Assessment for Leadership in Education (VAL-ED) is a 360-degree learning-centered behaviors principal evaluation tool that includes ratings from the principal, supervisors, and teachers. The current study assesses the test-retest reliability of the VAL-ED for a sample of seven school districts as part of multiple validity and…

  19. Test-retest reliability of Eurofit Physical Fitness items for children with visual impairments

    NARCIS (Netherlands)

    Houwen, Suzanne; Visscher, Chris; Hartman, Esther; Lemmink, Koen A. P. M.

    The purpose of this study was to examine the test-retest reliability of physical fitness items from the European Test of Physical Fitness (Eurofit) for children with visual impairments. A sample of 21 children, ages 6-12 years, that were recruited from a special school for children with visual

  20. Temporal Stability of Strength-Based Assessments: Test-Retest Reliability of Student and Teacher Reports

    Science.gov (United States)

    Romer, Natalie; Merrell, Kenneth W.

    2013-01-01

    This study focused on evaluating the temporal stability of self-reported and teacher-reported perceptions of students' social and emotional skills and assets. We used a test-retest reliability procedure over repeated administrations of the child, adolescent, and teacher versions of the "Social-Emotional Assets and Resilience Scales".…

  1. Test-Retest Reliability of the Salutogenic Wellness Promotion Scale (SWPS)

    Science.gov (United States)

    Anderson, L. M.; Moore, J. B.; Hayden, B. M.; Becker, C. M.

    2014-01-01

    Objective: This study examined the temporal stability (i.e. test-retest reliability) of the Salutogenic Wellness Promotion Scale (SWPS) using intraclass correlation coefficients (ICC). Current intraclass results were also compared to previously published interclass correlations to support the use of the intraclass method for test-retest…

  2. Test-Retest Reliability of Self-Reported Sexual Health Measures among US Hispanic Adolescents

    Science.gov (United States)

    Jerman, Petra; Berglas, Nancy F.; Rohrbach, Louise A.; Constantine, Norman A.

    2016-01-01

    Objective: Although Hispanic adolescents in the USA are often the focus of sexual health interventions, their response to survey measures has rarely been assessed within evaluation studies. This study documents the test-retest reliability of a wide range of self-reported sexual health values, attitudes, knowledge and behaviours among Hispanic…

  3. Test-Retest Reliability and Predictive Validity of the Implicit Association Test in Children

    Science.gov (United States)

    Rae, James R.; Olson, Kristina R.

    2018-01-01

    The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many…

  4. Test-retest reliability of the Progressive Isoinertial Lifting Evaluation (PILE).

    Science.gov (United States)

    Lygren, Hildegunn; Dragesund, Tove; Joensen, Jón; Ask, Tove; Moe-Nilssen, Rolf

    2005-05-01

    A repeated measures single group design. To investigate test-retest reliability of Progressive Isoinertial Lifting Evaluation on patients with long lasting musculoskeletal problems related to the lumbar spine. Test-retest reliability has been satisfactory in healthy men. Test-retest reliability for clinical populations has not been reported. A total of 31 patients (17 women and 14 men) with long lasting low back pain participated in the study. The patients were tested twice at an interval of 2 days and at the same time of the day. The heaviest load that the patient could lift 4 times was used as outcome measure. The error of measurement indicates that the true result in 95% of cases will be within +/-4.5 kg from the measured value, while the difference between 2 measurements in 95% of cases will be less than 6.4 kg. Intra-class correlation (1,1) was 0.91. Relative test-retest reliability was high assessed by intra-class correlation, but absolute measurement variability reported as the smallest detectable difference has relevance for the interpretation of clinical test results and should also be considered.

  5. Test-Retest Reliability of the Preschool Age Psychiatric Assessment (PAPA)

    Science.gov (United States)

    Egger, Helen Link; Erkanli, Alaattin; Keeler, Gordon; Potts, Edward; Walter, Barbara Keith; Angold, Adrian

    2006-01-01

    Objective: To examine the test-retest reliability of a new interviewer-based psychiatric diagnostic measure (the Preschool Age Psychiatric Assessment) for use with parents of preschoolers 2 to 5 years old. Method: A total of 1,073 parents of children attending a large pediatric clinic completed the Child Behavior Checklist 1 1/2-5. For 18 months,…

  6. Test-retest reliabilty of exercise-induced hypoalgesia after aerobic exercise

    DEFF Research Database (Denmark)

    Vaegter, Henrik Bjarke; Dørge, Daniel Bandholtz; Schmidt, Kristian Sonne

    2018-01-01

    Objective: Exercise increases pressure pain thresholds (PPTs) in exercising and nonexercising muscles, known as exercise-induced hypoalgesia (EIH). No studies have investigated the test-retest reliability of change in PPTs after aerobic exercise. Primary objectives were to compare the effect...

  7. Test-retest, inter-assessor and intra-assessor reliability of the modified Touwen examination

    NARCIS (Netherlands)

    Peters, Lieke H. J.; Maathuis, Karel G. B.; Kouw, Eva; Hamming, Marjolein; Hadders-Algra, Mijna

    Interest in the Touwen examination (1979) for the assessment of minor neurological dysfunction (MND) is growing. However, information on psychometric properties of this assessment is scarce. Therefore the present study aimed at assessing the test's test-retest, inter- and intra-assessor reliability.

  8. Test - retest reliability of two instruments for measuring public attitudes towards persons with mental illness

    Directory of Open Access Journals (Sweden)

    Leufstadius Christel

    2011-01-01

    Full Text Available Abstract Background Research has identified stigmatization as a major threat to successful treatment of individuals with mental illness. As a consequence several anti-stigma campaigns have been carried out. The results have been discouraging and the field suffers from lack of evidence about interventions that work. There are few reports on psychometric data for instruments used to assess stigma, which thus complicates research efforts. The aim of the present study was to investigate test-retest reliability of the Swedish versions of the questionnaires: FABI and "Changing Minds" and to examine the internal consistency of the two instruments. Method Two instruments, fear and behavioural intentions (FABI and "Changing Minds", used in earlier studies on public attitudes towards persons with mental illness were translated into Swedish and completed by 51 nursing students on two occasions, with an interval of three weeks. Test-retest reliability was calculated by using weighted kappa coefficient and internal consistency using the Cronbach's alpha coefficient. Results Both instruments attain at best moderate test-retest reliability. For the Changing Minds questionnaire almost one fifth (17.9% of the items present poor test-retest reliability and the alpha coefficient for the subscales ranges between 0.19 - 0.46. All of the items in the FABI reach a fair or a moderate agreement between the test and retest, and the questionnaire displays a high internal consistency, alpha 0.80. Conclusions There is a need for development of psychometrically tested instruments within this field of research.

  9. Test-Retest Reliability of Dual-Task Outcome Measures in People With Parkinson Disease

    NARCIS (Netherlands)

    Strouwen, C.; Molenaar, E.A.; Keus, S.H.; Munks, L.; Bloem, B.R.; Nieuwboer, A.

    2016-01-01

    BACKGROUND: Dual-task (DT) training is gaining ground as a physical therapy intervention in people with Parkinson disease (PD). Future studies evaluating the effect of such interventions need reliable outcome measures. To date, the test-retest reliability of DT measures in patients with PD remains

  10. Test-Retest Reliability of Computerized, Everyday Memory Measures and Traditional Memory Tests.

    Science.gov (United States)

    Youngjohn, James R.; And Others

    Test-retest reliabilities and practice effect magnitudes were considered for nine computer-simulated tasks of everyday cognition and five traditional neuropsychological tests. The nine simulated everyday memory tests were from the Memory Assessment Clinic battery as follows: (1) simple reaction time while driving; (2) divided attention (driving…

  11. Test-retest reliability of the isernhagen work systems functional capacity evaluation in healthy adults

    NARCIS (Netherlands)

    Reneman, MF; Brouwer, S; Meinema, A; Dijkstra, PU; Geertzen, JHB; Groothoff, JW

    2004-01-01

    Aim of this study was to investigate test-retest reliability of the Isernhagen Work System Functional Capacity Evaluation (IWS FCE) in healthy subjects. The IWS FCE consists of 28 tests that reflect work-related activities such as lifting, carrying, bending, etc. A convenience sample of 26 healthy

  12. Test-retest reliability of jump execution variables using mechanography: a comparison of jump protocols.

    Science.gov (United States)

    Fitzgerald, John S; Johnson, LuAnn; Tomkinson, Grant; Stein, Jesse; Roemmich, James N

    2018-05-01

    Mechanography during the vertical jump may enhance screening and determining mechanistic causes underlying physical performance changes. Utility of jump mechanography for evaluation is limited by scant test-retest reliability data on force-time variables. This study examined the test-retest reliability of eight jump execution variables assessed from mechanography. Thirty-two women (mean±SD: age 20.8 ± 1.3 yr) and 16 men (age 22.1 ± 1.9 yr) attended a familiarization session and two testing sessions, all one week apart. Participants performed two variations of the squat jump with squat depth self-selected and controlled using a goniometer to 80º knee flexion. Test-retest reliability was quantified as the systematic error (using effect size between jumps), random error (using coefficients of variation), and test-retest correlations (using intra-class correlation coefficients). Overall, jump execution variables demonstrated acceptable reliability, evidenced by small systematic errors (mean±95%CI: 0.2 ± 0.07), moderate random errors (mean±95%CI: 17.8 ± 3.7%), and very strong test-retest correlations (range: 0.73-0.97). Differences in random errors between controlled and self-selected protocols were negligible (mean±95%CI: 1.3 ± 2.3%). Jump execution variables demonstrated acceptable reliability, with no meaningful differences between the controlled and self-selected jump protocols. To simplify testing, a self-selected jump protocol can be used to assess force-time variables with negligible impact on measurement error.

  13. Evaluating the reliability of an injury prevention screening tool: Test-retest study.

    Science.gov (United States)

    Gittelman, Michael A; Kincaid, Madeline; Denny, Sarah; Wervey Arnold, Melissa; FitzGerald, Michael; Carle, Adam C; Mara, Constance A

    2016-10-01

    A standardized injury prevention (IP) screening tool can identify family risks and allow pediatricians to address behaviors. To assess behavior changes on later screens, the tool must be reliable for an individual and ideally between household members. Little research has examined the reliability of safety screening tool questions. This study utilized test-retest reliability of parent responses on an existing IP questionnaire and also compared responses between household parents. Investigators recruited parents of children 0 to 1 year of age during admission to a tertiary care children's hospital. When both parents were present, one was chosen as the "primary" respondent. Primary respondents completed the 30-question IP screening tool after consent, and they were re-screened approximately 4 hours later to test individual reliability. The "second" parent, when present, only completed the tool once. All participants received a 10-dollar gift card. Cohen's Kappa was used to estimate test-retest reliability and inter-rater agreement. Standard test-retest criteria consider Kappa values: 0.0 to 0.40 poor to fair, 0.41 to 0.60 moderate, 0.61 to 0.80 substantial, and 0.81 to 1.00 as almost perfect reliability. One hundred five families participated, with five lost to follow-up. Thirty-two (30.5%) parent dyads completed the tool. Primary respondents were generally mothers (88%) and Caucasian (72%). Test-retest of the primary respondents showed their responses to be almost perfect; average 0.82 (SD = 0.13, range 0.49-1.00). Seventeen questions had almost perfect test-retest reliability and 11 had substantial reliability. However, inter-rater agreement between household members for 12 objective questions showed little agreement between responses; inter-rater agreement averaged 0.35 (SD = 0.34, range -0.19-1.00). One question had almost perfect inter-rater agreement and two had substantial inter-rater agreement. The IP screening tool used by a single individual had excellent

  14. Construct Validity and Test-Retest Reliability of the Climbing Stairs Questionnaire in Lower-Limb Amputees

    NARCIS (Netherlands)

    de Laat, Fred A.; Rommers, Gerardus M.; Geertzen, Jan H.; Roorda, Leo D.

    de Laat FA, Rommers GM, Geertzen JH, Roorda LD. Construct validity and test-retest reliability of the Climbing Stairs Questionnaire in lower-limb amputees. Arch Phys Med Rehabil 2010;91:1396-401. Objective: To investigate the construct validity and test-retest reliability of the Climbing Stairs

  15. A Test-Retest Reliability Study of the Whiplash Disability Questionnaire in Patients With Acute Whiplash-Associated Disorders

    DEFF Research Database (Denmark)

    Stupar, Maja; Côté, Pierre; Beaton, Dorcas E

    2015-01-01

    OBJECTIVE: The purpose of this study was to determine the test-retest reliability and the Minimal Detectable Change (MDC) of the Whiplash Disability Questionnaire (WDQ) in individuals with acute whiplash-associated disorders (WADs). METHODS: We performed a test-retest reliability study. We includ...

  16. Establishing survey validity and reliability for American Indians through "think aloud" and test-retest methods.

    Science.gov (United States)

    Hauge, Cindy Horst; Jacobs-Knight, Jacque; Jensen, Jamie L; Burgess, Katherine M; Puumala, Susan E; Wilton, Georgiana; Hanson, Jessica D

    2015-06-01

    The purpose of this study was to use a mixed-methods approach to determine the validity and reliability of measurements used within an alcohol-exposed pregnancy prevention program for American Indian women. To develop validity, content experts provided input into the survey measures, and a "think aloud" methodology was conducted with 23 American Indian women. After revising the measurements based on this input, a test-retest was conducted with 79 American Indian women who were randomized to complete either the original measurements or the new, modified measurements. The test-retest revealed that some of the questions performed better for the modified version, whereas others appeared to be more reliable for the original version. The mixed-methods approach was a useful methodology for gathering feedback on survey measurements from American Indian participants and in indicating specific survey questions that needed to be modified for this population. © The Author(s) 2015.

  17. Test-retest reliability of a balance testing protocol with external perturbations in young healthy adults.

    Science.gov (United States)

    Robbins, Shawn M; Caplan, Ryan M; Aponte, Daniel I; St-Onge, Nancy

    2017-10-01

    External perturbations are utilized to challenge balance and mimic realistic balance threats in patient populations. The reliability of such protocols has not been established. The purpose was to examine test-retest reliability of balance testing with external perturbations. Healthy adults (n=34; mean age 23 years) underwent balance testing over two visits. Participants completed ten balance conditions in which the following parameters were combined: perturbation or non-perturbation, single or double leg, and eyes open or closed. Three trials were collected for each condition. Data were collected on a force plate and external perturbations were applied by translating the plate. Force plate center of pressure (CoP) data were summarized using 13 different CoP measures. Test-retest reliability was examined using intraclass correlation coefficients (ICC) and Bland-Altman plots. CoP measures of total speed and excursion in both anterior-posterior and medial-lateral directions generally had acceptable ICC values for perturbation conditions (ICC=0.46 to 0.87); however, many other CoP measures (e.g. range, area of ellipse) had unacceptable test-retest reliability (ICCbalance testing protocols that include external perturbations should be made to improve test-retest reliability and diminish learning including more extensive participant training and increasing the number of trials. CoP measures that consider all data points (e.g. total speed) are more reliable than those that only consider a few data points. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Long term test-retest reliability of Oswestry Disability Index in male office workers.

    Science.gov (United States)

    Irmak, Rafet; Baltaci, Gul; Ergun, Nevin

    2015-01-01

    The Oswestry Disability Index (ODI) is one of the most common condition specific outcome measures used in the management of spinal disorders. But there is insufficient study on healthy populations and long term test-retest reliability. This is important because healthy populations are often used for control groups in low back pain interventions, and knowing the reliability of the controls affects the interpretation of the findings of these studies. The purpose of this study is to determine the long term test-retest reliability of ODI in office workers. Participants who have no chronic low back pain history were included in study. Subjects were assessed by the Turkish-ODI 2.0 (e-forms) on 1st, 2nd, 4th, 8th, 15th, 30th days to determine the stability of ODI scores over time. The study began with 58 (12 female, 46 male) participants. 36 (3 female, 33 male) participated for the full 30 days. Kolmogorov-Smirnov and Friedman tests were used. Test-retest reliability was evaluated by using nonparametric statistics. All tests were done by using SPSS-11. There was no statistically significant difference among the median scores of each day. (χ= 6.482, p >  0.05). The difference between median score of the days with 1st day was neither statistically nor clinically significant. ODI has long term test re-test reliability in healthy subjects over a 1 month time interval.

  19. Test-retest reliability and cross validation of the functioning everyday with a wheelchair instrument.

    Science.gov (United States)

    Mills, Tamara L; Holm, Margo B; Schmeler, Mark

    2007-01-01

    The purpose of this study was to establish the test-retest reliability and content validity of an outcomes tool designed to measure the effectiveness of seating-mobility interventions on the functional performance of individuals who use wheelchairs or scooters as their primary seating-mobility device. The instrument, Functioning Everyday With a Wheelchair (FEW), is a questionnaire designed to measure perceived user function related to wheelchair/scooter use. Using consumer-generated items, FEW Beta Version 1.0 was developed and test-retest reliability was established. Cross-validation of FEW Beta Version 1.0 was then carried out with five samples of seating-mobility users to establish content validity. Based on the content validity study, FEW Version 2.0 was developed and administered to seating-mobility consumers to examine its test-retest reliability. FEW Beta Version 1.0 yielded an intraclass correlation coefficient (ICC) Model (3,k) of .92, p content validity results revealed that FEW Beta Version 1.0 captured 55% of seating-mobility goals reported by consumers across five samples. FEW Version 2.0 yielded ICC(3,k) = .86, p content validity of FEW Version 2.0 was confirmed. FEW Beta Version 1.0 and FEW Version 2.0 were highly stable in their measurement of participants' seating-mobility goals over a 1-week interval.

  20. The Comprehensive Snack Parenting Questionnaire (CSPQ: Development and Test-Retest Reliability

    Directory of Open Access Journals (Sweden)

    Dorus W. M. Gevers

    2018-04-01

    Full Text Available The narrow focus of existing food parenting instruments led us to develop a food parenting practices instrument measuring the full range of food practices constructs with a focus on snacking behavior. We present the development of the questionnaire and our research on the test-retest reliability. The developed Comprehensive Snack Parenting Questionnaire (CSPQ covers 21 constructs. Test-retest reliability was assessed by calculating intra class correlation coefficients and percentage agreement after two administrations of the CSPQ among a sample of 66 Dutch parents. Test-retest reliability analysis revealed acceptable intra class correlation coefficients (≥0.41 or agreement scores (≥0.60 for all items. These results, together with earlier work, suggest sufficient psychometric characteristics. The comprehensive, but brief CSPQ opens up chances for highly essential but unstudied research questions to understand and predict children’s snack intake. Example applications include studying the interactional nature of food parenting practices or interactions of food parenting with general parenting or child characteristics.

  1. Morpho-Functional 1H-MRI of the Lung in COPD: Short-Term Test-Retest Reliability.

    Directory of Open Access Journals (Sweden)

    Bertram J Jobst

    Full Text Available Non-invasive end-points for interventional trials and tailored treatment regimes in chronic obstructive pulmonary disease (COPD for monitoring regionally different manifestations of lung disease instead of global assessment of lung function with spirometry would be valuable. Proton nuclear magnetic resonance imaging (1H-MRI allows for a radiation-free assessment of regional structure and function. The aim of this study was to evaluate the short-term reproducibility of a comprehensive morpho-functional lung MRI protocol in COPD.20 prospectively enrolled COPD patients (GOLD I-IV underwent 1H-MRI of the lung at 1.5T on two consecutive days, including sequences for morphology, 4D contrast-enhanced perfusion, and respiratory mechanics. Image quality and COPD-related morphological and functional changes were evaluated in consensus by three chest radiologists using a dedicated MRI-based visual scoring system. Test-retest reliability was calculated per each individual lung lobe for the extent of large airway (bronchiectasis, wall thickening, mucus plugging and small airway abnormalities (tree in bud, peripheral bronchiectasis, mucus plugging, consolidations, nodules, parenchymal defects and perfusion defects. The presence of tracheal narrowing, dystelectasis, pleural effusion, pulmonary trunk ectasia, right ventricular enlargement and, finally, motion patterns of diaphragma and chest wall were addressed.Median global scores [10(Q1:8.00;Q3:16.00 vs.11(Q1:6.00;Q3:15.00] as well as category subscores were similar between both timepoints, and kappa statistics indicated "almost perfect" global agreement (ĸ = 0.86, 95%CI = 0.81-0.91. Most subscores showed at least "substantial" agreement of MRI1 and MRI2 (ĸ = 0.64-1.00, whereas the agreement for the diagnosis of dystelectasis/effusion (ĸ = 0.42, 95%CI = 0.00-0.93 was "moderate" and of tracheal abnormalities (ĸ = 0.21, 95%CI = 0.00-0.75 "fair". Most MRI acquisitions showed at least diagnostic quality at

  2. Test-retest reliability of automated whole body and compartmental muscle volume measurements on a wide bore 3T MR system.

    Science.gov (United States)

    Thomas, Marianna S; Newman, David; Leinhard, Olof Dahlqvist; Kasmai, Bahman; Greenwood, Richard; Malcolm, Paul N; Karlsson, Anette; Rosander, Johannes; Borga, Magnus; Toms, Andoni P

    2014-09-01

    To measure the test-retest reproducibility of an automated system for quantifying whole body and compartmental muscle volumes using wide bore 3 T MRI. Thirty volunteers stratified by body mass index underwent whole body 3 T MRI, two-point Dixon sequences, on two separate occasions. Water-fat separation was performed, with automated segmentation of whole body, torso, upper and lower leg volumes, and manually segmented lower leg muscle volumes. Mean automated total body muscle volume was 19·32 L (SD9·1) and 19·28 L (SD9·12) for first and second acquisitions (Intraclass correlation coefficient (ICC) = 1·0, 95% level of agreement -0·32-0·2 L). ICC for all automated test-retest muscle volumes were almost perfect (0·99-1·0) with 95% levels of agreement 1.8-6.6% of mean volume. Automated muscle volume measurements correlate closely with manual quantification (right lower leg: manual 1·68 L (2SD0·6) compared to automated 1·64 L (2SD 0·6), left lower leg: manual 1·69 L (2SD 0·64) compared to automated 1·63 L (SD0·61), correlation coefficients for automated and manual segmentation were 0·94-0·96). Fully automated whole body and compartmental muscle volume quantification can be achieved rapidly on a 3 T wide bore system with very low margins of error, excellent test-retest reliability and excellent correlation to manual segmentation in the lower leg. Sarcopaenia is an important reversible complication of a number of diseases. Manual quantification of muscle volume is time-consuming and expensive. Muscles can be imaged using in and out of phase MRI. Automated atlas-based segmentation can identify muscle groups. Automated muscle volume segmentation is reproducible and can replace manual measurements.

  3. Acoustic stapedial reflexes in healthy neonates: normative data and test-retest reliability.

    Science.gov (United States)

    Kei, Joseph

    2012-01-01

    The acoustic stapedial reflex (ASR) test provides useful information about the function of the auditory system. While it is frequently used with adults and children in a clinical setting, its use with young infants is limited. Presently, there are few data for neonates and inadequate research into the test-retest reliability of the ASR test. This study aimed to establish normative data and evaluate the test-retest reliability of the ASR test in healthy neonates. A cross-sectional experimental design was used to establish ASR normative data and assess the test-retest reliability of ASR thresholds obtained from healthy neonates. Sixty-eight full-term neonates with mean chronological age of 2.5 days (SD = 1.8 day), who passed the automated auditory brainstem response, transient evoked otoacoustic emission, and high frequency (1 kHz) tympanometry (HFT) tests. One randomly selected ear from each neonate was tested using TEOAE (transient evoked otoacoustic emission), HFT, and ASR tests using a 1 kHz probe tone. ASR thresholds were elicited by presenting pure tones of 0.5, 2, and 4 kHz and broadband noise (BBN) separately to the test ear in an ipsilateral stimulation mode. The ASR procedure was repeated to acquire retest data within the same testing session. Descriptive statistics, χ2, and analysis of variance with repeated measures tests were used to analyze ASR data. All neonates exhibited ASR when stimulated by tonal stimuli or BBN. The mean ASRTs (acoustic stapedial reflex thresholds) for the 0.5, 2, and 4 kHz tones were 81.6 ± 7.9, 71.3 ± 7.9, and 65.4 ± 8.7 dB HL, respectively. The mean ASRT for the BBN was estimated to be smaller than 57.2 dB HL, given the limitation of the equipment. The 95th percentiles of the ASRT were 95, 85, 80, and 75 dB HL for the 0.5, 2, and 4 kHz and BBN, respectively. The test-retest reliability of the ASR test for all stimuli was high, with no significant difference in mean ASRTs across the test and retest conditions. Test-retest

  4. Test-retest assessment of cortical activation induced by repetitive transcranial magnetic stimulation with brain atlas-guided optical topography

    Science.gov (United States)

    Tian, Fenghua; Kozel, F. Andrew; Yennu, Amarnath; Croarkin, Paul E.; McClintock, Shawn M.; Mapes, Kimberly S.; Husain, Mustafa M.; Liu, Hanli

    2012-11-01

    Repetitive transcranial magnetic stimulation (rTMS) is a technology that stimulates neurons with rapidly changing magnetic pulses with demonstrated therapeutic applications for various neuropsychiatric disorders. Functional near-infrared spectroscopy (fNIRS) is a suitable tool to assess rTMS-evoked brain responses without interference from the magnetic or electric fields generated by the TMS coil. We have previously reported a channel-wise study of combined rTMS/fNIRS on the motor and prefrontal cortices, showing a robust decrease of oxygenated hemoglobin concentration (Δ[HbO2]) at the sites of 1-Hz rTMS and the contralateral brain regions. However, the reliability of this putative clinical tool is unknown. In this study, we develop a rapid optical topography approach to spatially characterize the rTMS-evoked hemodynamic responses on a standard brain atlas. A hemispherical approximation of the brain is employed to convert the three-dimensional topography on the complex brain surface to a two-dimensional topography in the spherical coordinate system. The test-retest reliability of the combined rTMS/fNIRS is assessed using repeated measurements performed two to three days apart. The results demonstrate that the Δ[HbO2] amplitudes have moderate-to-high reliability at the group level; and the spatial patterns of the topographic images have high reproducibility in size and a moderate degree of overlap at the individual level.

  5. Test-retest and interrater reliability of the functional lower extremity evaluation.

    Science.gov (United States)

    Haitz, Karyn; Shultz, Rebecca; Hodgins, Melissa; Matheson, Gordon O

    2014-12-01

    Repeated-measures clinical measurement reliability study. To establish the reliability and face validity of the Functional Lower Extremity Evaluation (FLEE). The FLEE is a 45-minute battery of 8 standardized functional performance tests that measures 3 components of lower extremity function: control, power, and endurance. The reliability and normative values for the FLEE in healthy athletes are unknown. A face validity survey for the FLEE was sent to sports medicine personnel to evaluate the level of importance and frequency of clinical usage of each test included in the FLEE. The FLEE was then administered and rated for 40 uninjured athletes. To assess test-retest reliability, each athlete was tested twice, 1 week apart, by the same rater. To assess interrater reliability, 3 raters scored each athlete during 1 of the testing sessions. Intraclass correlation coefficients were used to assess the test-retest and interrater reliability of each of the FLEE tests. In the face validity survey, the FLEE tests were rated as highly important by 58% to 71% of respondents but frequently used by only 26% to 45% of respondents. Interrater reliability intraclass correlation coefficients ranged from 0.83 to 1.00, and test-retest reliability ranged from 0.71 to 0.95. The FLEE tests are considered clinically important for assessing lower extremity function by sports medicine personnel but are underused. The FLEE also is a reliable assessment tool. Future studies are required to determine if use of the FLEE to make return-to-play decisions may reduce reinjury rates.

  6. Test-retest reliability of barbell velocity during the free-weight bench-press exercise.

    Science.gov (United States)

    Stock, Matt S; Beck, Travis W; DeFreitas, Jason M; Dillon, Michael A

    2011-01-01

    The purpose of this study was to calculate test-retest reliability statistics for peak barbell velocity during the free-weight bench-press exercise for loads corresponding to 10-90% of the 1-repetition maximum (1RM). Twenty-one healthy, resistance-trained men (mean ± SD age = 23.5 ± 2.7 years; body mass = 90.5 ± 14.6 kg; 1RM bench press = 125.4 ± 18.4 kg) volunteered for this study. A minimum of 48 hours after a maximal strength testing and familiarization session, the subjects performed single repetitions of the free-weight bench-press exercise at each tenth percentile (10-90%) of the 1RM on 2 separate occasions. For each repetition, the subjects were instructed to press the barbell as rapidly as possible, and peak barbell velocity was measured with a Tendo Weightlifting Analyzer. The test-retest intraclass correlation coefficients (model 2,1) and corresponding standard errors of measurement (expressed as percentages of the mean barbell velocity values) were 0.717 (4.2%), 0.572 (5.0%), 0.805 (3.1%), 0.669 (4.7%), 0.790 (4.6%), 0.785 (4.8%), 0.811 (5.8%), 0.714 (10.3%), and 0.594 (12.6%) for the weights corresponding to 10-90% 1RM. There were no mean differences between the barbell velocity values from trials 1 and 2. These results indicated moderate to high test-retest reliability for barbell velocity from 10 to 70% 1RM but decreased consistency at 80 and 90% 1RM. When examining barbell velocity during the free-weight bench-press exercise, greater measurement error must be overcome at 80 and 90% 1RM to be confident that an observed change is meaningful.

  7. Development, test-retest reliability, and construct validity of the resistance training skills battery.

    Science.gov (United States)

    Lubans, David R; Smith, Jordan J; Harries, Simon K; Barnett, Lisa M; Faigenbaum, Avery D

    2014-05-01

    The aim of this study was to describe the development and assess test-retest reliability and construct validity of the Resistance Training Skills Battery (RTSB) for adolescents. The RTSB provides an assessment of resistance training skill competency and includes 6 exercises (i.e., body weight squat, push-up, lunge, suspended row, standing overhead press, and front support with chest touches). Scoring for each skill is based on the number of performance criteria successfully demonstrated. An overall resistance training skill quotient (RTSQ) is created by adding participants' scores for the 6 skills. Participants (44 boys and 19 girls, mean age = 14.5 ± 1.2 years) completed the RTSB on 2 occasions separated by 7 days. Participants also completed the following fitness tests, which were used to create a muscular fitness score (MFS): handgrip strength, timed push-up, and standing long jump tests. Intraclass correlation (ICC), paired samples t-tests, and typical error were used to assess test-retest reliability. To assess construct validity, gender and RTSQ were entered into a regression model predicting MFS. The rank order repeatability of the RTSQ was high (ICC = 0.88). The model explained 39% of the variance in MFS (p ≤ 0.001) and RTSQ (r = 0.40, p ≤ 0.001) was a significant predictor. This study has demonstrated the construct validity and test-retest reliability of the RTSB in a sample of adolescents. The RTSB can reliably rank participants in regards to their resistance training competency and has the necessary sensitivity to detect small changes in resistance training skill proficiency.

  8. Test-Retest Reliability of a Serious Game for Delirium Screening in the Emergency Department.

    Science.gov (United States)

    Tong, Tiffany; Chignell, Mark; Tierney, Mary C; Lee, Jacques S

    2016-01-01

    Introduction: Cognitive screening in settings such as emergency departments (ED) is frequently carried out using paper-and-pencil tests that require administration by trained staff. These assessments often compete with other clinical duties and thus may not be routinely administered in these busy settings. Literature has shown that the presence of cognitive impairments such as dementia and delirium are often missed in older ED patients. Failure to recognize delirium can have devastating consequences including increased mortality (Kakuma et al., 2003). Given the demands on emergency staff, an automated cognitive test to screen for delirium onset could be a valuable tool to support delirium prevention and management. In earlier research we examined the concurrent validity of a serious game, and carried out an initial assessment of its potential as a delirium screening tool (Tong et al., 2016). In this paper, we examine the test-retest reliability of the game, as it is an important criterion in a cognitive test for detecting risk of delirium onset. Objective: To demonstrate the test-retest reliability of the screening tool over time in a clinical sample of older emergency patients. A secondary objective is to assess whether there are practice effects that might make game performance unstable over repeated presentations. Materials and Methods: Adults over the age of 70 were recruited from a hospital ED. Each patient played our serious game in an initial session soon after they arrived in the ED, and in follow up sessions conducted at 8-h intervals (for each participant there were up to five follow up sessions, depending on how long the person stayed in the ED). Results: A total of 114 adults (61 females, 53 males) between the ages of 70 and 104 years ( M = 81 years, SD = 7) participated in our study after screening out delirious patients. We observed a test-retest reliability of the serious game (as assessed by correlation r -values) between 0.5 and 0.8 across adjacent

  9. Test-retest reliability and factor structures of organizational citizenship behavior for Hong Kong workers.

    Science.gov (United States)

    Lam, S S

    2001-02-01

    In 1990 Podsakoff, MacKenzie, Moorman, and Fetter developed a scale to measure the five dimensions of organizational citizenship behavior. Test-retest data over 15 weeks are reported for this scale for a sample of 82 female and 32 male Chinese tellers (ages 18 to 54 years) from a large international bank in Hong Kong. Stability was .83, and there was no significant change between Times 1 and 2. Analysis indicated the five-factor structure and showed it to be a reliable measure when used with a nonwestern sample.

  10. Test-retest reliability and construct validity of the Helplessness, Hopelessness, and Haplessness Scale in patients with anxiety disorders.

    Science.gov (United States)

    Vatan, Sevginar; Ertaş, Sedar; Lester, David

    2011-04-01

    In a sample of 100 Turkish psychiatric patients with diagnoses of anxiety disorders, Lester's Helplessness, Hopelessness, and Haplessness inventory had moderate estimates of internal consistency, test-retest reliability, and construct validity.

  11. Test-retest reliability of automated whole body and compartmental muscle volume measurements on a wide bore 3T MR system

    International Nuclear Information System (INIS)

    Thomas, Marianna S.; Newman, David; Kasmai, Bahman; Greenwood, Richard; Malcolm, Paul N.; Leinhard, Olof Dahlqvist; Karlsson, Anette; Borga, Magnus; Rosander, Johannes; Toms, Andoni P.

    2014-01-01

    To measure the test-retest reproducibility of an automated system for quantifying whole body and compartmental muscle volumes using wide bore 3 T MRI. Thirty volunteers stratified by body mass index underwent whole body 3 T MRI, two-point Dixon sequences, on two separate occasions. Water-fat separation was performed, with automated segmentation of whole body, torso, upper and lower leg volumes, and manually segmented lower leg muscle volumes. Mean automated total body muscle volume was 19.32 L (SD9.1) and 19.28 L (SD9.12) for first and second acquisitions (Intraclass correlation coefficient (ICC) = 1.0, 95 % level of agreement -0.32-0.2 L). ICC for all automated test-retest muscle volumes were almost perfect (0.99-1.0) with 95 % levels of agreement 1.8-6.6 % of mean volume. Automated muscle volume measurements correlate closely with manual quantification (right lower leg: manual 1.68 L (2SD0.6) compared to automated 1.64 L (2SD 0.6), left lower leg: manual 1.69 L (2SD 0.64) compared to automated 1.63 L (SD0.61), correlation coefficients for automated and manual segmentation were 0.94-0.96). Fully automated whole body and compartmental muscle volume quantification can be achieved rapidly on a 3 T wide bore system with very low margins of error, excellent test-retest reliability and excellent correlation to manual segmentation in the lower leg. (orig.)

  12. Test-retest reliability of automated whole body and compartmental muscle volume measurements on a wide bore 3T MR system

    Energy Technology Data Exchange (ETDEWEB)

    Thomas, Marianna S.; Newman, David; Kasmai, Bahman; Greenwood, Richard; Malcolm, Paul N. [Norfolk and Norwich University Hospital, Department of Radiology, Norwich (United Kingdom); Leinhard, Olof Dahlqvist [Linkoeping University, Center for Medical Image Science and Visualization, Linkoeping (Sweden); Linkoeping University, Department of Medical and Health Sciences, Linkoeping (Sweden); Karlsson, Anette; Borga, Magnus [Linkoeping University, Center for Medical Image Science and Visualization, Linkoeping (Sweden); Linkoeping University, Department of Biomedical Engineering, Linkoeping (Sweden); Rosander, Johannes [Advanced MR Analytics AB, Linkoeping (Sweden); Toms, Andoni P. [Norfolk and Norwich University Hospital, Department of Radiology, Norwich (United Kingdom); Radiology Academy, Cotman Centre, Norwich, Norfolk (United Kingdom)

    2014-09-15

    To measure the test-retest reproducibility of an automated system for quantifying whole body and compartmental muscle volumes using wide bore 3 T MRI. Thirty volunteers stratified by body mass index underwent whole body 3 T MRI, two-point Dixon sequences, on two separate occasions. Water-fat separation was performed, with automated segmentation of whole body, torso, upper and lower leg volumes, and manually segmented lower leg muscle volumes. Mean automated total body muscle volume was 19.32 L (SD9.1) and 19.28 L (SD9.12) for first and second acquisitions (Intraclass correlation coefficient (ICC) = 1.0, 95 % level of agreement -0.32-0.2 L). ICC for all automated test-retest muscle volumes were almost perfect (0.99-1.0) with 95 % levels of agreement 1.8-6.6 % of mean volume. Automated muscle volume measurements correlate closely with manual quantification (right lower leg: manual 1.68 L (2SD0.6) compared to automated 1.64 L (2SD 0.6), left lower leg: manual 1.69 L (2SD 0.64) compared to automated 1.63 L (SD0.61), correlation coefficients for automated and manual segmentation were 0.94-0.96). Fully automated whole body and compartmental muscle volume quantification can be achieved rapidly on a 3 T wide bore system with very low margins of error, excellent test-retest reliability and excellent correlation to manual segmentation in the lower leg. (orig.)

  13. Test-Retest Reliability of Diffusion Tensor Imaging in Huntington's Disease.

    Science.gov (United States)

    Cole, James H; Farmer, Ruth E; Rees, Elin M; Johnson, Hans J; Frost, Chris; Scahill, Rachael I; Hobbs, Nicola Z

    2014-03-21

    Diffusion tensor imaging (DTI) has shown microstructural abnormalities in patients with Huntington's Disease (HD) and work is underway to characterise how these abnormalities change with disease progression. Using methods that will be applied in longitudinal research, we sought to establish the reliability of DTI in early HD patients and controls. Test-retest reliability, quantified using the intraclass correlation coefficient (ICC), was assessed using region-of-interest (ROI)-based white matter atlas and voxelwise approaches on repeat scan data from 22 participants (10 early HD, 12 controls). T1 data was used to generate further ROIs for analysis in a reduced sample of 18 participants. The results suggest that fractional anisotropy (FA) and other diffusivity metrics are generally highly reliable, with ICCs indicating considerably lower within-subject compared to between-subject variability in both HD patients and controls. Where ICC was low, particularly for the diffusivity measures in the caudate and putamen, this was partly influenced by outliers. The analysis suggests that the specific DTI methods used here are appropriate for cross-sectional research in HD, and give confidence that they can also be applied longitudinally, although this requires further investigation. An important caveat for DTI studies is that test-retest reliability may not be evenly distributed throughout the brain whereby highly anisotropic white matter regions tended to show lower relative within-subject variability than other white or grey matter regions.

  14. Test-retest reliability of the 40 Hz EEG auditory steady-state response.

    Directory of Open Access Journals (Sweden)

    Kristina L McFadden

    Full Text Available Auditory evoked steady-state responses are increasingly being used as a marker of brain function and dysfunction in various neuropsychiatric disorders, but research investigating the test-retest reliability of this response is lacking. The purpose of this study was to assess the consistency of the auditory steady-state response (ASSR across sessions. Furthermore, the current study aimed to investigate how the reliability of the ASSR is impacted by stimulus parameters and analysis method employed. The consistency of this response across two sessions spaced approximately 1 week apart was measured in nineteen healthy adults using electroencephalography (EEG. The ASSR was entrained by both 40 Hz amplitude-modulated white noise and click train stimuli. Correlations between sessions were assessed with two separate analytical techniques: a channel-level analysis across the whole-head array and b signal-space projection from auditory dipoles. Overall, the ASSR was significantly correlated between sessions 1 and 2 (p<0.05, multiple comparison corrected, suggesting adequate test-retest reliability of this response. The current study also suggests that measures of inter-trial phase coherence may be more reliable between sessions than measures of evoked power. Results were similar between the two analysis methods, but reliability varied depending on the presented stimulus, with click train stimuli producing more consistent responses than white noise stimuli.

  15. Test-retest and between-site reliability in a multicenter fMRI study.

    Science.gov (United States)

    Friedman, Lee; Stern, Hal; Brown, Gregory G; Mathalon, Daniel H; Turner, Jessica; Glover, Gary H; Gollub, Randy L; Lauriello, John; Lim, Kelvin O; Cannon, Tyrone; Greve, Douglas N; Bockholt, Henry Jeremy; Belger, Aysenil; Mueller, Bryon; Doty, Michael J; He, Jianchun; Wells, William; Smyth, Padhraic; Pieper, Steve; Kim, Seyoung; Kubicki, Marek; Vangel, Mark; Potkin, Steven G

    2008-08-01

    In the present report, estimates of test-retest and between-site reliability of fMRI assessments were produced in the context of a multicenter fMRI reliability study (FBIRN Phase 1, www.nbirn.net). Five subjects were scanned on 10 MRI scanners on two occasions. The fMRI task was a simple block design sensorimotor task. The impulse response functions to the stimulation block were derived using an FIR-deconvolution analysis with FMRISTAT. Six functionally-derived ROIs covering the visual, auditory and motor cortices, created from a prior analysis, were used. Two dependent variables were compared: percent signal change and contrast-to-noise-ratio. Reliability was assessed with intraclass correlation coefficients derived from a variance components analysis. Test-retest reliability was high, but initially, between-site reliability was low, indicating a strong contribution from site and site-by-subject variance. However, a number of factors that can markedly improve between-site reliability were uncovered, including increasing the size of the ROIs, adjusting for smoothness differences, and inclusion of additional runs. By employing multiple steps, between-site reliability for 3T scanners was increased by 123%. Dropping one site at a time and assessing reliability can be a useful method of assessing the sensitivity of the results to particular sites. These findings should provide guidance toothers on the best practices for future multicenter studies.

  16. Test-retest reliability of the proposed DSM-5 eating disorder diagnostic criteria

    Science.gov (United States)

    Sysko, Robyn; Roberto, Christina A.; Barnes, Rachel D.; Grilo, Carlos M.; Attia, Evelyn; Walsh, B. Timothy

    2012-01-01

    The proposed DSM-5 classification scheme for eating disorders includes both major and minor changes to the existing DSM-IV diagnostic criteria. It is not known what effect these modifications will have on the ability to make reliable diagnoses. Two studies were conducted to evaluate the short-term test-retest reliability of the proposed DSM-5 eating disorder diagnoses: anorexia nervosa, bulimia nervosa, binge eating disorder, and feeding and eating conditions not elsewhere classified. Participants completed two independent telephone interviews with research assessors (n=70 Study 1; n=55 Study 2). Fair to substantial agreements (κ= 0.80 and 0.54) were observed across eating disorder diagnoses in Study 1 and Study 2, respectively. Acceptable rates of agreement were identified for the individual eating disorder diagnoses, including DSM-5 anorexia nervosa (κ’s of 0.81 to 0.97), bulimia nervosa (κ=0.84), binge eating disorder (κ’s of 0.75 and 0.61), and feeding and eating disorders not elsewhere classified (κ’s of 0.70 and 0.46). Further, improved short-term test-retest reliability was noted when using the DSM-5, in comparison to DSM-IV, criteria for binge eating disorder. Thus, these studies found that trained interviewers can reliably diagnose eating disorders using the proposed DSM-5 criteria; however, additional data from general practice settings and community samples are needed. PMID:22401974

  17. Test-retest reliability of the driving habits questionnaire in older self-driving adults.

    Science.gov (United States)

    Song, Chiang-Soon; Chun, Byung-Yoon; Chung, Hyun-Sook

    2015-11-01

    [Purpose] The purpose of this study was to investigate the test-retest reliability of the Driving Habits Questionnaire in community-dwelling older self-drivers. [Subjects and Methods] Seventy-four participants were recruited by convenience sampling from local rehabilitation centers. This was a cross-sectional study design that used two clinical measures: the Driving Habits Questionnaire and Mini-mental State Examination. To examine the test-retest reliability of the Driving Habits Questionnaire, the clinical tool was measured twice, five days apart. [Results] The Driving Habits Questionnaire showed good reliability for older community-dwelling self-drivers. The Cronbach's alpha coefficients for the four domains of dependence (0.572), difficulty (0.871), crashes and citations (0.689), and driving space (0.961) of the Driving Habits Questionnaire indicated good or high internal consistency. Driving difficulty correlated significantly with self-reported crashes and citations and driving space. [Conclusion] The results of this study suggest that the Driving Habits Questionnaire is a reliable measure of self-reported interview-based driving behavior in the community-dwelling elderly.

  18. Test-retest reliability of trunk motor variability measured by large-array surface electromyography.

    Science.gov (United States)

    Abboud, Jacques; Nougarou, François; Loranger, Michel; Descarreaux, Martin

    2015-01-01

    The objective of this study was to evaluate the test-retest reliability of the trunk muscle activity distribution in asymptomatic participants during muscle fatigue using large-array surface electromyography (EMG). Trunk muscle activity distribution was evaluated twice, with 3 to 4 days between them, in 27 asymptomatic volunteers using large-array surface EMG. Motor variability, assessed with 2 different variables (the centroid coordinates of the root mean square map and the dispersion variable), was evaluated during a low back muscle fatigue task. Test-retest reliability of muscle activity distribution was obtained using Pearson correlation coefficients. A shift in the distribution of EMG amplitude toward the lateral-caudal region of the lumbar erector spinae induced by muscle fatigue was observed. Moderate to very strong correlations were found between both sessions in the last 3 phases of the fatigue task for both motor variability variables, whereas weak to moderate correlations were found in the first phases of the fatigue task only for the dispersion variable. These findings show that, in asymptomatic participants, patterns of EMG activity are less reliable in initial stages of muscle fatigue, whereas later stages are characterized by highly reliable patterns of EMG activity. Copyright © 2015 National University of Health Sciences. Published by Elsevier Inc. All rights reserved.

  19. Test-retest reliability of the Clinical Learning Environment, Supervision and Nurse Teacher (CLES + T) scale.

    Science.gov (United States)

    Gustafsson, Margareta; Blomberg, Karin; Holmefur, Marie

    2015-07-01

    The Clinical Learning Environment, Supervision and Nurse Teacher (CLES + T) scale evaluates the student nurses' perception of the learning environment and supervision within the clinical placement. It has never been tested in a replication study. The aim of the present study was to evaluate the test-retest reliability of the CLES + T scale. The CLES + T scale was administered twice to a group of 42 student nurses, with a one-week interval. Test-retest reliability was determined by calculations of Intraclass Correlation Coefficients (ICCs) and weighted Kappa coefficients. Standard Error of Measurements (SEM) and Smallest Detectable Difference (SDD) determined the precision of individual scores. Bland-Altman plots were created for analyses of systematic differences between the test occasions. The results of the study showed that the stability over time was good to excellent (ICC 0.88-0.96) in the sub-dimensions "Supervisory relationship", "Pedagogical atmosphere on the ward" and "Role of the nurse teacher". Measurements of "Premises of nursing on the ward" and "Leadership style of the manager" had lower but still acceptable stability (ICC 0.70-0.75). No systematic differences occurred between the test occasions. This study supports the usefulness of the CLES + T scale as a reliable measure of the student nurses' perception of the learning environment within the clinical placement at a hospital. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. Test-retest reliability of sensor-based sit-to-stand measures in young and older adults.

    Science.gov (United States)

    Regterschot, G Ruben H; Zhang, Wei; Baldus, Heribert; Stevens, Martin; Zijlstra, Wiebren

    2014-01-01

    This study investigated test-retest reliability of sensor-based sit-to-stand (STS) peak power and other STS measures in young and older adults. In addition, test-retest reliability of the sensor method was compared to test-retest reliability of the Timed Up and Go Test (TUGT) and Five-Times-Sit-to-Stand Test (FTSST) in older adults. Ten healthy young female adults (20-23 years) and 31 older adults (21 females; 73-94 years) participated in two assessment sessions separated by 3-8 days. Vertical peak power was assessed during three (young adults) and five (older adults) normal and fast STS trials with a hybrid motion sensor worn on the hip. Older adults also performed the FTSST and TUGT. The average sensor-based STS peak power of the normal STS trials and the average sensor-based STS peak power of the fast STS trials showed excellent test-retest reliability in young adults (intra-class correlation (ICC)≥0.90; zero in 95% confidence interval of mean difference between test and retest (95%CI of D); standard error of measurement (SEM)≤6.7% of mean peak power) and older adults (ICC≥0.91; zero in 95%CI of D; SEM≤9.9%). Test-retest reliability of sensor-based STS peak power and TUGT (ICC=0.98; zero in 95%CI of D; SEM=8.5%) was comparable in older adults, test-retest reliability of the FTSST was lower (ICC=0.73; zero outside 95%CI of D; SEM=14.4%). Sensor-based STS peak power demonstrated excellent test-retest reliability and may therefore be useful for clinical assessment of functional status and fall risk. Copyright © 2014 Elsevier B.V. All rights reserved.

  1. Forward lunge as a functional performance test in ACL deficient subjects: test-retest reliability

    DEFF Research Database (Denmark)

    Alkjaer, Tine; Henriksen, Marius; Dyhre-Poulsen, Poul

    2009-01-01

    The forward lunge movement may be used as a functional performance test of anterior cruciate ligament (ACL) deficient and reconstructed subjects. The purposes were 1) to determine the test-retest reliability of a forward lunge in healthy subjects and 2) to determine the required numbers...... of repetitions necessary to yield satisfactory reliability. Nineteen healthy subjects performed four trials of a forward lunge on two different days. The movement time, impulses of the ground reaction forces (IFz, IFy), knee joint kinematics and dynamics during the forward lunge were calculated. The relative...... reliability was determined by calculation of Intraclass Correlation Coefficients (ICC). The IFz, IFy and the positive work of the knee extensors showed excellent reliability (ICC >0.75). All other variables demonstrated acceptable reliability (0.4>ICCreliability increased when more than...

  2. Test-retest reliability of an fMRI paradigm for studies of cardiovascular reactivity.

    Science.gov (United States)

    Sheu, Lei K; Jennings, J Richard; Gianaros, Peter J

    2012-07-01

    We examined the reliability of measures of fMRI, subjective, and cardiovascular reactions to standardized versions of a Stroop color-word task and a multisource interference task. A sample of 14 men and 12 women (30-49 years old) completed the tasks on two occasions, separated by a median of 88 days. The reliability of fMRI BOLD signal changes in brain areas engaged by the tasks was moderate, and aggregating fMRI BOLD signal changes across the tasks improved test-retest reliability metrics. These metrics included voxel-wise intraclass correlation coefficients (ICCs) and overlap ratio statistics. Task-aggregated ratings of subjective arousal, valence, and control, as well as cardiovascular reactions evoked by the tasks showed ICCs of 0.57 to 0.87 (ps reliability. These findings support using these tasks as a battery for fMRI studies of cardiovascular reactivity. Copyright © 2012 Society for Psychophysiological Research.

  3. Test-retest Agreement and Reliability of Quantitative Sensory Testing 1 Year After Breast Cancer Surgery

    DEFF Research Database (Denmark)

    Andersen, Kenneth Geving; Kehlet, Henrik; Aasvang, Eske Kvanner

    2015-01-01

    .5 SD) than within-patient variation (0.23 to 3.55 SD). There were no significant differences between pain and pain-free patients. The individual test-retest variability was higher on the operated side compared with the nonoperated side. DISCUSSION: The QST protocol reliability allows for group......OBJECTIVES: Quantitative sensory testing (QST) is used to assess sensory dysfunction and nerve damage by examining psychophysical responses to controlled, graded stimuli such as mechanical and thermal detection and pain thresholds. In the breast cancer population, 4 studies have used QST to examine...... persistent pain after breast cancer treatment, suggesting neuropathic pain being a prominent pain mechanism. However, the agreement and reliability of QST has not been described in the postsurgical breast cancer population, hindering exact interpretation of QST studies in this population. The aim...

  4. Isokinetic Strength and Endurance Tests used Pre- and Post-Spaceflight: Test-Retest Reliability

    Science.gov (United States)

    Laughlin, Mitzi S.; Lee, Stuart M. C.; Loehr, James A.; Amonette, William E.

    2009-01-01

    To assess changes in muscular strength and endurance after microgravity exposure, NASA measures isokinetic strength and endurance across multiple sessions before and after long-duration space flight. Accurate interpretation of pre- and post-flight measures depends upon the reliability of each measure. The purpose of this study was to evaluate the test-retest reliability of the NASA International Space Station (ISS) isokinetic protocol. Twenty-four healthy subjects (12 M/12 F, 32.0 +/- 5.6 years) volunteered to participate. Isokinetic knee, ankle, and trunk flexion and extension strength as well as endurance of the knee flexors and extensors were measured using a Cybex NORM isokinetic dynamometer. The first weekly session was considered a familiarization session. Data were collected and analyzed for weeks 2-4. Repeated measures analysis of variance (alpha=0.05) was used to identify weekly differences in isokinetic measures. Test-retest reliability was evaluated by intraclass correlation coefficients (ICC) (3,1). No significant differences were found between weeks in any of the strength measures and the reliability of the strength measures were all considered excellent (ICC greater than 0.9), except for concentric ankle dorsi-flexion (ICC=0.67). Although a significant difference was noted in weekly endurance measures of knee extension (p less than 0.01), the reliability of endurance measure by week were considered excellent for knee flexion (ICC=0.97) and knee extension (ICC=0.96). Except for concentric ankle dorsi-flexion, the isokinetic strength and endurance measures are highly reliable when following the NASA ISS protocol. This protocol should allow accurate interpretation isokinetic data even with a small number of crew members.

  5. Work-related measures of physical and behavioral health function: Test-retest reliability.

    Science.gov (United States)

    Marino, Molly Elizabeth; Meterko, Mark; Marfeo, Elizabeth E; McDonough, Christine M; Jette, Alan M; Ni, Pengsheng; Bogusz, Kara; Rasch, Elizabeth K; Brandt, Diane E; Chan, Leighton

    2015-10-01

    The Work Disability Functional Assessment Battery (WD-FAB), developed for potential use by the US Social Security Administration to assess work-related function, currently consists of five multi-item scales assessing physical function and four multi-item scales assessing behavioral health function; the WD-FAB scales are administered as Computerized Adaptive Tests (CATs). The goal of this study was to evaluate the test-retest reliability of the WD-FAB Physical Function and Behavioral Health CATs. We administered the WD-FAB scales twice, 7-10 days apart, to a sample of 376 working age adults and 316 adults with work-disability. Intraclass correlation coefficients were calculated to measure the consistency of the scores between the two administrations. Standard error of measurement (SEM) and minimal detectable change (MDC90) were also calculated to measure the scales precision and sensitivity. For the Physical Function CAT scales, the ICCs ranged from 0.76 to 0.89 in the working age adult sample, and 0.77-0.86 in the sample of adults with work-disability. ICCs for the Behavioral Health CAT scales ranged from 0.66 to 0.70 in the working age adult sample, and 0.77-0.80 in the adults with work-disability. The SEM ranged from 3.25 to 4.55 for the Physical Function scales and 5.27-6.97 for the Behavioral Health function scales. For all scales in both samples, the MDC90 ranged from 7.58 to 16.27. Both the Physical Function and Behavioral Health CATs of the WD-FAB demonstrated good test-retest reliability in adults with work-disability and general adult samples, a critical requirement for assessing work related functioning in disability applicants and in other contexts. Copyright © 2015 Elsevier Inc. All rights reserved.

  6. Test-retest reliability of a handheld dynamometer for measurement of isometric cervical muscle strength.

    Science.gov (United States)

    Vannebo, Katrine Tranaas; Iversen, Vegard Moe; Fimland, Marius Steiro; Mork, Paul Jarle

    2018-03-02

    There is a lack of test-retest reliability studies of measurements of cervical muscle strength, taking into account gender and possible learning effects. To investigate test-retest reliability of measurement of maximal isometric cervical muscle strength by handheld dynamometry. Thirty women (age 20-58 years) and 28 men (age 20-60 years) participated in the study. Maximal isometric strength (neck flexion, neck extension, and right/left lateral flexion) was measured on three separate days at least five days apart by one evaluator. Intra-rater consistency tended to improve from day 1-2 measurements to day 2-3 measurements in both women and men. In women, the intra-class correlation coefficients (ICC) for day 2 to day 3 measurements were 0.91 (95% confidence interval [CI], 0.82-0.95) for neck flexion, 0.88 (95% CI, 0.76-0.94) for neck extension, 0.84 (95% CI, 0.68-0.92) for right lateral flexion, and 0.89 (95% CI, 0.78-0.95) for left lateral flexion. The corresponding ICCs among men were 0.86 (95% CI, 0.72-0.93) for neck flexion, 0.93 (95% CI, 0.85-0.97) for neck extension, 0.82 (95% CI, 0.65-0.91) for right lateral flexion and 0.73 (95% CI, 0.50-0.87) for left lateral flexion. This study describes a reliable and easy-to-administer test for assessing maximal isometric cervical muscle strength.

  7. Test-retest reliability of the Military Pre-training Questionnaire.

    Science.gov (United States)

    Robinson, M; Stokes, K; Bilzon, J; Standage, M; Brown, P; Thompson, D

    2010-09-01

    Musculoskeletal injuries are a significant cause of morbidity during military training. A brief, inexpensive and user-friendly tool that demonstrates reliability and validity is warranted to effectively monitor the relationship between multiple predictor variables and injury incidence in military populations. To examine the test-retest reliability of the Military Pre-training Questionnaire (MPQ), designed specifically to assess risk factors for injury among military trainees across five domains (physical activity, injury history, diet, alcohol and smoking). Analyses were based on a convenience sample of 58 male British Army trainees. Kappa (kappa), weighted kappa (kappa(w)) and intraclass correlation coefficients (ICC) were used to evaluate the 2-week test-retest reliability of the MPQ. For index measures constituting the assessment of a given construct, internal consistency was assessed by Cronbach's alpha (alpha) coefficients. Reliability of individual items ranged from poor to almost perfect (kappa range = 0.45-0.86; kappa(w) range = 0.11-0.91; ICC range = 0.34-0.86) with most items demonstrating moderate reliability. Overall scores related to physical activity, diet, alcohol and smoking constructs were reliable between both administrations (ICC = 0.63-0.85). Support for the internal consistency of the incorporated alcohol (alpha = 0.78) and cigarette (alpha = 0.75) scales was also provided. The MPQ is a reliable self-report instrument for assessing multiple injury-related risk factors during initial military training. Further assessment of the psychometric properties of the MPQ (e.g. different types of validity) with military populations/samples will support its interpretation and use in future surveillance and epidemiological studies.

  8. Improving the Test-Retest Reliability of Resting State fMRI by Removing the Impact of Sleep.

    Science.gov (United States)

    Wang, Jiahui; Han, Junwei; Nguyen, Vinh T; Guo, Lei; Guo, Christine C

    2017-01-01

    Resting state functional magnetic resonance imaging (rs-fMRI) provides a powerful tool to examine large-scale neural networks in the human brain and their disturbances in neuropsychiatric disorders. Thanks to its low demand and high tolerance, resting state paradigms can be easily acquired from clinical population. However, due to the unconstrained nature, resting state paradigm is associated with excessive head movement and proneness to sleep. Consequently, the test-retest reliability of rs-fMRI measures is moderate at best, falling short of widespread use in the clinic. Here, we characterized the effect of sleep on the test-retest reliability of rs-fMRI. Using measures of heart rate variability (HRV) derived from simultaneous electrocardiogram (ECG) recording, we identified portions of fMRI data when subjects were more alert or sleepy, and examined their effects on the test-retest reliability of functional connectivity measures. When volumes of sleep were excluded, the reliability of rs-fMRI is significantly improved, and the improvement appears to be general across brain networks. The amount of improvement is robust with the removal of as much as 60% volumes of sleepiness. Therefore, test-retest reliability of rs-fMRI is affected by sleep and could be improved by excluding volumes of sleepiness as indexed by HRV. Our results suggest a novel and practical method to improve test-retest reliability of rs-fMRI measures.

  9. Improving the Test-Retest Reliability of Resting State fMRI by Removing the Impact of Sleep

    Directory of Open Access Journals (Sweden)

    Jiahui Wang

    2017-05-01

    Full Text Available Resting state functional magnetic resonance imaging (rs-fMRI provides a powerful tool to examine large-scale neural networks in the human brain and their disturbances in neuropsychiatric disorders. Thanks to its low demand and high tolerance, resting state paradigms can be easily acquired from clinical population. However, due to the unconstrained nature, resting state paradigm is associated with excessive head movement and proneness to sleep. Consequently, the test-retest reliability of rs-fMRI measures is moderate at best, falling short of widespread use in the clinic. Here, we characterized the effect of sleep on the test-retest reliability of rs-fMRI. Using measures of heart rate variability (HRV derived from simultaneous electrocardiogram (ECG recording, we identified portions of fMRI data when subjects were more alert or sleepy, and examined their effects on the test-retest reliability of functional connectivity measures. When volumes of sleep were excluded, the reliability of rs-fMRI is significantly improved, and the improvement appears to be general across brain networks. The amount of improvement is robust with the removal of as much as 60% volumes of sleepiness. Therefore, test-retest reliability of rs-fMRI is affected by sleep and could be improved by excluding volumes of sleepiness as indexed by HRV. Our results suggest a novel and practical method to improve test-retest reliability of rs-fMRI measures.

  10. Test-retest reliability of the Danish Adult Reading Test in patients with comorbid psychosis and cannabis-use disorder

    DEFF Research Database (Denmark)

    Hjorthøj, Carsten Rygaard; Vesterager, Lone; Nordentoft, Merete

    2013-01-01

    Background: The New Adult Reading Test is a common instrument for assessing pre-morbid IQ for patients with, for instance, schizophrenia. However, test-retest reliability has not been established for patients dually diagnosed with psychosis and substance use disorder. Furthermore, test......-retest reliability of the Danish adaptation has never been established in any population. Aims: To determine the test-retest reliability of the Danish Adult Reading Test (DART) (adapted from the National Adult Reading Test, NART) for patients dually diagnosed with psychosis and cannabis-use disorder. Methods......: This was a secondary analysis of the CapOpus randomized trial. As part of the trial, 103 patients were randomized, and completed the DART up to three times. Pearson's r and pairwise t-tests were calculated. Results: DART score was independent of randomization, cannabis-use frequency and psychopathology. Scores...

  11. The Physical Activity Scale for Individuals with Physical Disabilities: test-retest reliability and comparison with an accelerometer.

    Science.gov (United States)

    van der Ploeg, Hidde P; Streppel, Kitty R M; van der Beek, Allard J; van der Woude, Luc H V; Vollenbroek-Hutten, Miriam; van Mechelen, Willem

    2007-01-01

    The objective was to determine the test-retest reliability and criterion validity of the Physical Activity Scale for Individuals with Physical Disabilities (PASIPD). Forty-five non-wheelchair dependent subjects were recruited from three Dutch rehabilitation centers. Subjects' diagnoses were: stroke, spinal cord injury, whiplash, and neurological-, orthopedic- or back disorders. The PASIPD is a 7-d recall physical activity questionnaire that was completed twice, 1 wk apart. During this week, physical activity was also measured with an Actigraph accelerometer. The test-retest reliability Spearman correlation of the PASIPD was 0.77. The criterion validity Spearman correlation was 0.30 when compared to the accelerometer. The PASIPD had test-retest reliability and criterion validity that is comparable to well established self-report physical activity questionnaires from the general population.

  12. Test-retest reliability of the eating disorder examination-questionnaire (EDE-Q) in a college sample

    OpenAIRE

    Rose, Jennifer S; Vaewsorn, Adin; Rosselli-Navarra, Francine; Wilson, G Terence; Weissman, Ruth Striegel

    2013-01-01

    Background The Eating Disorder Examination-Questionnaire (EDE-Q), a widely used self-report instrument, is often used for measuring change in eating disorder symptoms over the course of treatment. However, limited data exist about test-retest reliability, particularly for men. The current study evaluated EDE-Q 7-day test-retest reliability in male (n = 47) and female (n = 44) undergraduate students together and separately by gender. Results Internal consistency was consistently higher for wom...

  13. Temporal stability of the Francis Scale of Attitude toward Christianity short-form: test-retest data over one week.

    Science.gov (United States)

    Lewis, Christopher Alan; Cruise, Sharon Mary; McGuckin, Conor

    2005-04-01

    This study evaluated the test-retest reliability of the Francis Scale of Attitude toward Christianity short-form. 39 Northern Irish undergraduate students completed the measure on two occasions separated by one week. Stability across the two administrations was high, r = .92, and there was no significant change between Time 1(M = 25.2, SD = 5.4) and Time 2 (M = 25.7, SD = 6.2). These data support the short-term test-retest reliability of the Francis Scale of Attitude toward Christianity short-form.

  14. Investigating univariate temporal patterns for intrinsic connectivity networks based on complexity and low-frequency oscillation: a test-retest reliability study.

    Science.gov (United States)

    Wang, X; Jiao, Y; Tang, T; Wang, H; Lu, Z

    2013-12-19

    Intrinsic connectivity networks (ICNs) are composed of spatial components and time courses. The spatial components of ICNs were discovered with moderate-to-high reliability. So far as we know, few studies focused on the reliability of the temporal patterns for ICNs based their individual time courses. The goals of this study were twofold: to investigate the test-retest reliability of temporal patterns for ICNs, and to analyze these informative univariate metrics. Additionally, a correlation analysis was performed to enhance interpretability. Our study included three datasets: (a) short- and long-term scans, (b) multi-band echo-planar imaging (mEPI), and (c) eyes open or closed. Using dual regression, we obtained the time courses of ICNs for each subject. To produce temporal patterns for ICNs, we applied two categories of univariate metrics: network-wise complexity and network-wise low-frequency oscillation. Furthermore, we validated the test-retest reliability for each metric. The network-wise temporal patterns for most ICNs (especially for default mode network, DMN) exhibited moderate-to-high reliability and reproducibility under different scan conditions. Network-wise complexity for DMN exhibited fair reliability (ICC<0.5) based on eyes-closed sessions. Specially, our results supported that mEPI could be a useful method with high reliability and reproducibility. In addition, these temporal patterns were with physiological meanings, and certain temporal patterns were correlated to the node strength of the corresponding ICN. Overall, network-wise temporal patterns of ICNs were reliable and informative and could be complementary to spatial patterns of ICNs for further study. Copyright © 2013 IBRO. Published by Elsevier Ltd. All rights reserved.

  15. Test-Retest Reliability and Practice Effects of the Stability Evaluation Test.

    Science.gov (United States)

    Williams, Richelle M; Corvo, Matthew A; Lam, Kenneth C; Williams, Travis A; Gilmer, Lesley K; McLeod, Tamara C Valovich

    2017-01-17

    Postural control plays an essential role in concussion evaluation. The Stability Evaluation Test (SET) aims to objectively analyze postural control by measuring sway velocity on the NeuroCom's VSR portable force platform (Natus, San Carlos, CA). To assess the test-retest reliability and practice effects of the SET protocol. Cohort. Research Laboratory. Fifty healthy adults (males=20, females=30, age=25.30±3.60 years, height=166.60±12.80 cm, mass=68.80±13.90 kg). All participants completed four trials of the SET. Each trial consisted of six 20-second balance tests with eyes closed, under the following conditions: double-leg firm (DFi), single-leg firm (SFi), tandem firm (TFi), double-leg foam (DFo), single-leg foam (SFo), and tandem foam (TFo). Each trial was separated by a 5-minute seated rest period. The dependent variable was sway velocity (deg/sec), with lower values indicating better balance. Sway velocity was recorded for each of the six conditions as well as a composite score for each trial. Test-retest reliability was analyzed across four trials with Intraclass Correlation Coefficients. Practice effects analyzed with repeated measures analysis of variance, followed by Tukey post-hoc comparisons for any significant main effects (preliability values were good to excellent: DFi (ICC=0.88;95%CI:0.81,0.92), SFi (ICC=0.75;95%CI:0.61,0.85), TFi (ICC=0.84;95%CI:0.75,0.90), DFo (ICC=0.83;95%CI:0.74,0.90), SFo (ICC=0.82;95%CI:0.72,0.89), TFo (ICC=0.81;95%CI:0.69,0.88), and composite score (ICC=0.93;95%CI:0.88,0.95). Significant practice effects (preliability for the assessment of postural control in healthy adults. Due to the practice effects noted, a familiarization session is recommended (i.e., all 6 conditions) prior to recording the data. Future studies should evaluate injured patients to determine meaningful change scores during various injuries.

  16. Test-Retest Reliability of Dual-Task Outcome Measures in People With Parkinson Disease.

    Science.gov (United States)

    Strouwen, Carolien; Molenaar, Esther A L M; Keus, Samyra H J; Münks, Liesbeth; Bloem, Bastiaan R; Nieuwboer, Alice

    2016-08-01

    Dual-task (DT) training is gaining ground as a physical therapy intervention in people with Parkinson disease (PD). Future studies evaluating the effect of such interventions need reliable outcome measures. To date, the test-retest reliability of DT measures in patients with PD remains largely unknown. The purpose of this study was to assess the reliability of DT outcome measures in patients with PD. A repeated-measures design was used. Patients with PD ("on" medication, Mini-Mental State Examination score ≥24) performed 2 cognitive tasks (ie, backward digit span task and auditory Stroop task) and 1 functional task (ie, mobile phone task) in combination with walking. Tasks were assessed at 2 time points (same hour) with an interval of 6 weeks. Test-retest reliability was assessed for gait while performing each secondary task (DT gait) for both cognitive tasks while walking (DT cognitive) and for the functional task while walking (DT functional). Sixty-two patients with PD (age=39-89 years, Hoehn and Yahr stages II-III) were included in the study. Intraclass correlation coefficients (ICCs) showed excellent reliability for DT gait measures, ranging between .86 and .95 when combined with the digit span task, between .86 and .95 when combined with the auditory Stroop task, and between .72 and .90 when combined with the mobile phone task. The standard error of measurements for DT gait speed varied between 0.06 and 0.08 m/s, leading to minimal detectable changes between 0.16 and 0.22 m/s. With regard to DT cognitive measures, reaction times showed good-to-excellent reliability (digit span task: ICC=.75; auditory Stroop task: ICC=.82). The results cannot be generalized to patients with advanced disease or to other DT measures. In people with PD, DT measures proved to be reliable for use in clinical studies and look promising for use in clinical practice to assess improvements after DT training. Large effects, however, are needed to obtain meaningful effect sizes.

  17. Test-retest reliability and validity of the Sniffin' TOM odor memory test.

    Science.gov (United States)

    Croy, Ilona; Zehner, Cora; Larsson, Maria; Zucco, Gesualdo M; Hummel, Thomas

    2015-03-01

    Few attempts have been made to develop an olfactory test that captures episodic retention of olfactory information. Assessment of episodic odor memory is of particular interest in aging and in the cognitively impaired as both episodic memory deficits and olfactory loss have been targeted as reliable hallmarks of cognitive decline and impending dementia. Here, 96 healthy participants (18-92 years) and an additional 19 older people with mild cognitive impairment were tested (73-82 years). Participants were presented with 8 common odors with intentional encoding instructions that were followed by a yes-no recognition test. After recognition completion, participants were asked to identify all odors by means of free or cued identification. A retest of the odor memory test (Sniffin' TOM = test of odor memory) took place 17 days later. The results revealed satisfactory test-retest reliability (0.70) of odor recognition memory. Both recognition and identification performance were negatively affected by age and more pronounced among the cognitively impaired. In conclusion, the present work presents a reliable, valid, and simple test of episodic odor recognition memory that may be used in clinical groups where both episodic memory deficits and olfactory loss are prevalent preclinically such as Alzheimer's disease. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  18. Test-retest reliability of behavioral measures of impulsive choice, impulsive action, and inattention.

    Science.gov (United States)

    Weafer, Jessica; Baggott, Matthew J; de Wit, Harriet

    2013-12-01

    Behavioral measures of impulsivity are widely used in substance abuse research, yet relatively little attention has been devoted to establishing their psychometric properties, especially their reliability over repeated administration. The current study examined the test-retest reliability of a battery of standardized behavioral impulsivity tasks, including measures of impulsive choice (i.e., delay discounting, probability discounting, and the Balloon Analogue Risk Task), impulsive action (i.e., the stop signal task, the go/no-go task, and commission errors on the continuous performance task), and inattention (i.e., attention lapses on a simple reaction time task and omission errors on the continuous performance task). Healthy adults (n = 128) performed the battery on two separate occasions. Reliability estimates for the individual tasks ranged from moderate to high, with Pearson correlations within the specific impulsivity domains as follows: impulsive choice (r range: .76-.89, ps reliable measures and thus can be confidently used to assess various facets of impulsivity as intermediate phenotypes for drug abuse.

  19. We need more replication research - A case for test-retest reliability.

    Science.gov (United States)

    Leppink, Jimmie; Pérez-Fuster, Patricia

    2017-06-01

    Following debates in psychology on the importance of replication research, we have also started to see pleas for a more prominent role for replication research in medical education. To enable replication research, it is of paramount importance to carefully study the reliability of the instruments we use. Cronbach's alpha has been the most widely used estimator of reliability in the field of medical education, notably as some kind of quality label of test or questionnaire scores based on multiple items or of the reliability of assessment across exam stations. However, as this narrative review outlines, Cronbach's alpha or alternative reliability statistics may complement but not replace psychometric methods such as factor analysis. Moreover, multiple-item measurements should be preferred above single-item measurements, and when using single-item measurements, coefficients as Cronbach's alpha should not be interpreted as indicators of the reliability of a single item when that item is administered after fundamentally different activities, such as learning tasks that differ in content. Finally, if we want to follow up on recent pleas for more replication research, we have to start studying the test-retest reliability of the instruments we use.

  20. Test-retest reliability and practice effects of the Wechsler Memory Scale-III.

    Science.gov (United States)

    Lo, Ada H Y; Humphreys, Michael; Byrne, Gerard J; Pachana, Nancy A

    2012-09-01

    Although serial administration of cognitive tests is increasingly common, there is a paucity of research on test-retest reliabilities and practice effects, both of which are important for evaluating changes in functioning. Reliability is generally conceptualized as involving short-lasting changes in performance. However, when repeated testing occurs over a period of years, there will be some longer lasting effects. The implications of these longer lasting effects and practice effects on reliability were examined in the context of repeated administrations of the Wechsler Memory Scale-III in 339 community-dwelling women aged 40-79 years over 2 to 7 years. The results showed that Logical Memory and Verbal Paired Associates subtests were consistently the most reliable subtests across the age cohorts. The magnitude of practice effects varied as a function of subtests and age. The largest practice effects were found in the youngest age cohort, especially on the Faces, Logical Memory, and Verbal Paired Associates subtests. ©2012 The British Psychological Society.

  1. Blink frequency and duration during perimetry and their relationship to test-retest threshold variability.

    Science.gov (United States)

    Wang, Yanfang; Toor, Sonia S; Gautam, Ramesh; Henson, David B

    2011-06-28

    To describe different patterns of blinking in patients undergoing a visual field test and to establish whether the blink parameters are related to threshold variability. Thirty-nine patients with diagnosed or suspected glaucoma were recruited to undertake a perimetric task twice. Blinks were detected with a video eye-tracker system that records at a sampling rate of 60 Hz. Blink frequency, duration, and episodes of microsleep (eye closures >500 ms) were analyzed, and correlated with test-retest threshold variability. The timing of blinks with respect to stimulus presentation was analyzed and the percentage of seen stimuli for all presentations (POS(overall)) and those overlapped with blinks (POS(overlapped)) were compared. Blink frequency ranged from 0 to 58 per minute. A significant increase in blink frequency was observed in the second test (P POS(overall) and POS(overlapped) was significant (P POS(overlapped) was observed with the increase of overlap duration. A wide range of blink frequencies was observed during perimetric testing. Although no blink parameters showed significant influence on threshold variability, when the blinks overlapped with a stimulus presentation, the probability of seeing was reduced. For suprathreshold stimuli, blinks often occurred after the presentation, whereas for subthreshold presentations, there was no relationship to presentation time.

  2. Test-retest reliability of a questionnaire to assess physical environmental factors pertaining to physical activity

    Directory of Open Access Journals (Sweden)

    McGinn Aileen P

    2005-06-01

    Full Text Available Abstract Background Despite the documented benefits of physical activity, many adults do not obtain the recommended amounts. Barriers to physical activity occur at multiple levels, including at the individual, interpersonal, and environmental levels. Only until more recently has there been a concerted focus on how the physical environment might affect physical activity behavior. With this new area of study, self-report measures should be psychometrically tested before use in research studies. Therefore the objective of this study was to document the test-retest reliability of a questionnaire designed to assess physical environmental factors that might be associated with physical activity in a diverse adult population. Methods Test and retest surveys were conducted over the telephone with 106 African American and White women and men living in either Forsyth County, North Carolina or Jackson, Mississippi. Reliability of self-reported environmental factors across four domains (e.g., access to facilities and destinations, functionality and safety, aesthetics, natural environment was determined using intraclass correlation coefficients (ICC overall and separately by gender and race. Results Generally items displayed moderate and sometimes substantial reliability (ICC between 0.4 to 0.8, with a few differences by gender or race, across each of the domains. Conclusion This study provides some psychometric evidence for the use of many of these questions in studies examining the effect of self-reported physical environmental measures on physical activity behaviors, among African American and White women and men.

  3. The Physical Activity Scale for Individuals with Physical Disabilities : test-retest reliability and comparison with an accelerometer

    NARCIS (Netherlands)

    van der Ploeg, Hidde P; Streppel, Kitty R M; van der Beek, Allard J; van der Woude, Luc H V; Vollenbroek-Hutten, Miriam; van Mechelen, Willem; van der Woude, Lucas

    BACKGROUND: The objective was to determine the test-retest reliability and criterion validity of the Physical Activity Scale for Individuals with Physical Disabilities (PASIPD). METHODS: Forty-five non-wheelchair dependent subjects were recruited from three Dutch rehabilitation centers. Subjects'

  4. Test-retest reliability of an interactive voice response (IVR) version of the EORTC QLQ-C30

    NARCIS (Netherlands)

    Lundy, J.J.; Coons, S.J.; Aaronson, N.K.

    2015-01-01

    Objective: The objective of this study was to assess the test-retest reliability of an interactive voice response (IVR) version of the European Organisation for Research and Treatment of Cancer (EORTC) QLQ-C30. Methods: A convenience sample of outpatient cancer clinic patients (n = 127) was asked to

  5. Adaptation, test-retest reliability, and construct validity of the Physical Activity Neighborhood Environment Scale in Nigeria (PANES-N).

    Science.gov (United States)

    Oyeyemi, Adewale L; Sallis, James F; Oyeyemi, Adetoyeje Y; Amin, Mariam M; De Bourdeaudhuij, Ilse; Deforche, Benedicte

    2013-11-01

    This study adapted the Physical Activity Neighborhood Environment Scale (PANES) to the Nigerian context and assessed the test-retest reliability and construct validity of the Nigerian version (PANESN). A multidisciplinary panel of experts adapted the original PANES to reflect the built and social environment of Nigeria. The adapted PANES was subjected to cognitive testing and test retest reliability in a diverse sample of Nigerian adults (N = 132) from different neighborhood types. Intraclass Correlation Coefficients (ICC) was used to assess test-retest reliability, and construct validity was investigated with Analysis of Covariance for differences in environmental attributes between neighborhoods. Four of the 17 items on the original PANES were significantly modified, 3 were removed and 2 new items were incorporated into the final version of adapted PANES-N. Test-retest reliability was substantial to almost perfect (ICC = 0.62-1.00) for all items on the PANES-N, and residents of neighborhoods in the inner city reported higher residential density, land use mix and safety, but lower pedestrian facilities and aesthetics than did residents of government reserved area/new layout neighborhoods. The PANES-N appears promising for assessing environmental perceptions related to physical activity in Nigeria, but further testing is required to assess its applicability across Africa.

  6. The eye-complaint questionnaire in a visual display unit work environment: Internal consistency and test-retest reliability

    NARCIS (Netherlands)

    Steenstra, Ivan A.; Sluiter, Judith K.; Frings-Dresen, Monique H. W.

    2009-01-01

    The internal consistency and test-retest reliability of a 10-item eye-complaint questionnaire (ECQ) were examined within a sample of office workers. Repeated within-subjects measures were performed within a single day and over intervals of 1 and 7 d. Questionnaires were completed by 96 workers (70%

  7. Test-retest reliability of Antonovsky's 13-item sense of coherence scale in patients with hand-related disorders

    DEFF Research Database (Denmark)

    Hansen, Alice Ørts; Kristensen, Hanne Kaae; Cederlund, Ragnhild

    2017-01-01

    to be a powerful tool to measure the ICF component personal factors, which could have an impact on patients' rehabilitation outcomes. Implications for rehabilitation Antonovsky's SOC-13 scale showed test-retest reliability for patients with hand-related disorders. The SOC-13 scale could be a suitable tool to help...... measure personal factors....

  8. Temporal stability of preferences and willingness to pay for natural areas in choice experiments: A test-retest

    NARCIS (Netherlands)

    Schaafsma, M.; Brouwer, R.; Liekens, I.; de Nocker, L.

    2014-01-01

    The main objective of this paper is to test the temporal stability of stated preferences and willingness to pay (WTP) values from a Choice Experiment (CE) in a test-retest. The same group of participants was asked the same choice tasks in an internet-based CE, conducted twice with a time interval of

  9. Construct Validity and Test-Retest Reliability of the Walking Questionnaire in People With a Lower Limb Amputation

    NARCIS (Netherlands)

    de Laat, Fred A.; Rommers, Gerardus M.; Geertzen, Jan H.; Roorda, Leo D.

    Objective: To investigate the construct validity and test-retest reliability of the Walking Questionnaire, a patient-reported measure of activity limitations in walking in people with a lower limb amputation. Design: Cross-sectional study. Setting: Outpatient department of a rehabilitation center.

  10. Test-retest reliability of the Middlesex Assessment of Mental State (MEAMS): a preliminary investigation in people with probable dementia.

    Science.gov (United States)

    Powell, T; Brooker, D J; Papadopolous, A

    1993-05-01

    Relative and absolute test-retest reliability of the MEAMS was examined in 12 subjects with probable dementia and 12 matched controls. Relative reliability was good. Measures of absolute reliability showed scores changing by up to 3 points over an interval of a week. A version effect was found to be in evidence.

  11. Test-Retest Reliability of the Parent Behavior Importance Questionnaire-Revised and the Parent Behavior Frequency Questionnaire-Revised

    Science.gov (United States)

    Mowder, Barbara A.; Shamah, Renee

    2011-01-01

    This study evaluated the test-retest reliability of two parenting measures: the Parent Behavior Importance Questionnaire-Revised (PBIQ-R) and Parent Behavior Frequency Questionnaire-Revised (PBFQ-R). These self-report parenting behavior assessment measures may be utilized as pre- and post-parent education program measures, with parents as well as…

  12. Test-retest reliability of the Food Allergy Quality of Life Questionnaires (FAQLQ) for children, adolescents and adults

    NARCIS (Netherlands)

    van der Velde, Jantina L.; Flokstra-de Blok, Bertine M. J.; Vlieg - Boerstra, Berber J.; Oude Elberink, Joanne N. G.; Schouten, Jan P.; DunnGalvin, Audrey; Hourihane, Jonathan O'B; Duiverman, Eric J.; Dubois, Anthony E. J.

    The self-administered Food Allergy Quality of Life Questionnaire-Child Form (FAQLQ-CF), -Teenager Form (FAQLQ-TF) and -Adult Form (FAQLQ-AF) were recently developed within EuroPrevall, a multi-centred study of food allergy in Europe. The primary aim of this study was to evaluate the test-retest

  13. Test-retest reliability of the Food Allergy Quality of Life Questionnaires (FAQLQ) for children, adolescents and adults

    NARCIS (Netherlands)

    van der Velde, Jantina L.; Flokstra-de Blok, Bertine M. J.; Vlieg-Boerstra, Berber J.; Oude Elberink, Joanne N. G.; Schouten, Jan P.; DunnGalvin, Audrey; Hourihane, Jonathan O.'B.; Duiverman, Eric J.; Dubois, Anthony E. J.

    2009-01-01

    The self-administered Food Allergy Quality of Life Questionnaire-Child Form (FAQLQ-CF), -Teenager Form (FAQLQ-TF) and -Adult Form (FAQLQ-AF) were recently developed within EuroPrevall, a multi-centred study of food allergy in Europe. The primary aim of this study was to evaluate the test-retest

  14. Test-retest reliability of the 20-sec Wingate test to assess anaerobic power in children with cerebral palsy

    NARCIS (Netherlands)

    Dallmeijer, A.J.; Scholtes, V.A.B.; Brehm, M.A.; Becher, J.G.

    2013-01-01

    OBJECTIVE: The aim of this study was to determine the test-retest reliability of the 20-sec Wingate anaerobic test in children with cerebral palsy. DESIGN: Participants were 22 ambulant children with cerebral palsy, with Gross Motor Function Classification System levels I (limitations in advanced

  15. Test-Retest Reliability of the 20-sec Wingate Test to Assess Anaerobic Power in Children with Cerebral Palsy

    NARCIS (Netherlands)

    Dallmeijer, Annet J.; Scholtes, Vanessa A. B.; Brehm, Merel-Anne; Becher, Jules G.

    2013-01-01

    Objective: The aim of this study was to determine the test-retest reliability of the 20-sec Wingate anaerobic test in children with cerebral palsy. Design: Participants were 22 ambulant children with cerebral palsy, with Gross Motor Function Classification System levels I (limitations in advanced

  16. Assessment of lower urinary tract symptoms in women by a self-administered questionnaire: test-retest reliability

    DEFF Research Database (Denmark)

    Bernstein, Inge Thomsen; Sejr, T; Able, I

    1996-01-01

    A self-administered questionnaire assessing female lower urinary tract symptoms and their impact on quality of life is described and validated, on 56 females in six participating departments. The patients answered two identical questionnaires on separate occasions before treatment. Test-retest re...

  17. Preclinical evaluation and test-retest studies of [{sup 18}F]PSS232, a novel radioligand for targeting metabotropic glutamate receptor 5 (mGlu{sub 5})

    Energy Technology Data Exchange (ETDEWEB)

    Milicevic Sephton, Selena; Mueller Herde, Adrienne; Keller, Claudia; Ruedisuehli, Sonja; Schibli, Roger; Kraemer, Stefanie D.; Ametamey, Simon M. [Center for Radiopharmaceutical Sciences of ETH, PSI and USZ, Zurich (Switzerland); Mu, Linjing [University Hospital Zuerich, Department of Nuclear Medicine, Zuerich (Switzerland); Auberson, Yves [Novartis Institutes for Biomedical Research, Novartis Pharma AG, Basel (Switzerland)

    2015-01-15

    A novel, {sup 18}F-labelled metabotropic glutamate receptor subtype 5 (mGlu{sub 5}) derivative of [{sup 11}C]ABP688 ([{sup 11}C]1), [{sup 18}F]PSS232 ([{sup 18}F]5), was evaluated in vitro and in vivo for its potential as a PET agent and was used in test-retest reliability studies The radiosynthesis of [{sup 18}F]5 was accomplished via a one-step reaction using a mesylate precursor. In vitro stability was determined in PBS and plasma, and with liver microsomal enzymes. Metabolite studies were performed using rat brain extracts, blood and urine. In vitro autoradiography was performed on horizontal slices of rat brain using 1 and 8, antagonists for mGlu{sub 5} and mGlu{sub 1}, respectively. Small-animal PET, biodistribution, and test-retest studies were performed in Wistar rats. In vivo, dose-dependent displacement studies were performed using 6 and blocking studies with 7. [{sup 18}F]5 was obtained in decay-corrected maximal radiochemical yield of 37 % with a specific activity of 80 - 400 GBq/μmol. Treatment with rat and human microsomal enzymes in vitro for 60 min resulted in 20 % and 4 % of hydrophilic radiometabolites, respectively. No hydrophilic decomposition products or radiometabolites were found in PBS or plasma. In vitro autoradiography on rat brain slices showed a heterogeneous distribution consistent with the known distribution of mGlu{sub 5} with high binding to hippocampal and cortical regions, and negligible radioactivity in the cerebellum. Similar distribution of radioactivity was found in PET images. Under displacement conditions with 6, reduced [{sup 18}F]5 binding was found in all brain regions except the cerebellum. 7 reduced binding in the striatum by 84 % on average. Test-retest studies were reproducible with a variability ranging from 6.8 % to 8.2 %. An extended single-dose toxicity study in Wistar rats showed no compound-related adverse effects. The new mGlu{sub 5} radiotracer, [{sup 18}F]5, showed specific and selective in vitro and in vivo

  18. Test-Retest Reliability of Rating of Perceived Exertion and Agreement With 1-Repetition Maximum in Adults.

    Science.gov (United States)

    Bove, Allyn M; Lynch, Andrew D; DePaul, Samantha M; Terhorst, Lauren; Irrgang, James J; Fitzgerald, G Kelley

    2016-09-01

    Study Design Clinical measurement. Background It has been suggested that rating of perceived exertion (RPE) may be a useful alternative to 1-repetition maximum (1RM) to determine proper resistance exercise dosage. However, the test-retest reliability of RPE for resistance exercise has not been determined. Additionally, prior research regarding the relationship between 1RM and RPE is conflicting. Objectives The purpose of this study was to (1) determine test-retest reliability of RPE related to resistance exercise and (2) assess agreement between percentages of 1RM and RPE during quadriceps resistance exercise. Methods A sample of participants with and without knee pathology completed a series of knee extension exercises and rated the perceived difficulty of each exercise on a 0-to-10 RPE scale, then repeated the procedure 1 to 2 weeks later for test-retest reliability. To determine agreement between RPE and 1RM, participants completed knee extension exercises at various percentages of their 1RM (10% to 130% of predicted 1RM) and rated the perceived difficulty of each exercise on a 0-to-10 RPE scale. Percent agreement was calculated between the 1RM and RPE at each resistance interval. Results The intraclass correlation coefficient indicated excellent test-retest reliability of RPE for quadriceps resistance exercises (intraclass correlation coefficient = 0.895; 95% confidence interval: 0.866, 0.918). Overall percent agreement between RPE and 1RM was 60%, but agreement was poor within the ranges that would typically be used for training (50% 1RM for muscle endurance, 70% 1RM and greater for strength). Conclusion Test-retest reliability of perceived exertion during quadriceps resistance exercise was excellent. However, agreement between the RPE and 1RM was poor, especially in common training zones for knee extensor strengthening. J Orthop Sports Phys Ther 2016;46(9):768-774. Epub 5 Aug 2016. doi:10.2519/jospt.2016.6498.

  19. Test-retest reliability and responsiveness of the Barthel Index-based Supplementary Scales in patients with stroke.

    Science.gov (United States)

    Lee, Ya-Chen; Yu, Wan-Hui; Hsueh, I-Ping; Chen, Sheng-Shiung; Hsieh, Ching-Lin

    2017-10-01

    A lack of evidence on the test-retest reliability and responsiveness limits the utility of the BI-based Supplementary Scales (BI-SS) in both clinical and research settings. To examine the test-retest reliability and responsiveness of the BI-based Supplementary Scales (BI-SS) in patients with stroke. A repeated-assessments design (1 week apart) was used to examine the test-retest reliability of the BI-SS. For the responsiveness study, the participants were assessed with the BI-SS and BI (treated as an external criterion) at admission to and discharge from rehabilitation wards. Seven outpatient rehabilitation units and one inpatient rehabilitation unit. Outpatients with chronic stroke. Eighty-four outpatients with chronic stroke participated in the test-retest reliability study. Fifty-seven inpatients completed baseline and follow-up assessments in the responsiveness study. For the test-retest reliability study, the values of the intra-class correlation coefficient and the overall percentage of minimal detectable change for the Ability Scale and Self-perceived Difficulty Scale were 0.97, 12.8%, and 0.78, 35.8%, respectively. For the responsiveness study, the standardized effect size and standardized response mean (representing internal responsiveness) of the Ability Scale and Self-perceived Difficulty Scale were 1.17 and 1.56, and 0.78 and 0.89, respectively. Regarding external responsiveness, the change in score of the Ability Scale had significant and moderate association with that of the BI (r=0.61, Ptest-retest reliability and sufficient responsiveness for patients with stroke. However, the Self-perceived Difficulty Scale of the BI-SS has substantial random measurement error and insufficient external responsiveness, which may affect its utility in clinical settings. The findings of this study provide empirical evidence of psychometric properties of the BI-SS for assessing ability and self-perceived difficulty of ADL in patients with stroke.

  20. Test-retest reliability and smallest detectable change of the Bristol Impact of Hypermobility (BIoH) questionnaire.

    Science.gov (United States)

    Palmer, S; Manns, S; Cramp, F; Lewis, R; Clark, E M

    2017-12-01

    The Bristol Impact of Hypermobility (BIoH) questionnaire is a patient-reported outcome measure developed in conjunction with adults with Joint Hypermobility Syndrome (JHS). It has demonstrated strong concurrent validity with the Short Form-36 (SF-36) physical component score but other psychometric properties have yet to be established. This study aimed to determine its test-retest reliability and smallest detectable change (SDC). A test-retest reliability study. Participants were recruited from the Hypermobility Syndromes Association, a patient organisation in the United Kingdom. Recruitment packs were sent to 1080 adults who had given permission to be contacted about research. BIoH and SF-36 questionnaires were administered at baseline and repeated two weeks later. An 11-point global rating of change scale (-5 to +5) was also administered at two weeks. Test-retest analysis and calculation of the SDC was conducted on 'stable' patients (defined as global rating of change -1 to +1). 462 responses were received. 233 patients reported a 'stable' condition and were included in analysis (95% women; mean (SD) age 44.5 (13.9) years; BIoH score 223.6 (54.0)). The BIoH questionnaire demonstrated excellent test-retest reliability (ICC 0.923, 95% CI 0.900-0.940). The SDC was 42 points (equivalent to 19% of the mean baseline score). The SF-36 physical and mental component scores demonstrated poorer test-retest reliability and larger SDCs (as a proportion of the mean baseline scores). The results provide further evidence of the potential of the BIoH questionnaire to underpin research and clinical practice for people with JHS. Copyright © 2017 Elsevier Ltd. All rights reserved.

  1. Test-Retest Reliability of Measures Commonly Used to Measure Striatal Dysfunction across Multiple Testing Sessions: A Longitudinal Study.

    Science.gov (United States)

    Palmer, Clare E; Langbehn, Douglas; Tabrizi, Sarah J; Papoutsi, Marina

    2017-01-01

    Cognitive impairment is common amongst many neurodegenerative movement disorders such as Huntington's disease (HD) and Parkinson's disease (PD) across multiple domains. There are many tasks available to assess different aspects of this dysfunction, however, it is imperative that these show high test-retest reliability if they are to be used to track disease progression or response to treatment in patient populations. Moreover, in order to ensure effects of practice across testing sessions are not misconstrued as clinical improvement in clinical trials, tasks which are particularly vulnerable to practice effects need to be highlighted. In this study we evaluated test-retest reliability in mean performance across three testing sessions of four tasks that are commonly used to measure cognitive dysfunction associated with striatal impairment: a combined Simon Stop-Signal Task; a modified emotion recognition task; a circle tracing task; and the trail making task. Practice effects were seen between sessions 1 and 2 across all tasks for the majority of dependent variables, particularly reaction time variables; some, but not all, diminished in the third session. Good test-retest reliability across all sessions was seen for the emotion recognition, circle tracing, and trail making test. The Simon interference effect and stop-signal reaction time (SSRT) from the combined-Simon-Stop-Signal task showed moderate test-retest reliability, however, the combined SSRT interference effect showed poor test-retest reliability. Our results emphasize the need to use control groups when tracking clinical progression or use pre-baseline training on tasks susceptible to practice effects.

  2. Dual conception of risk in the Iowa Gambling Task: Effects of sleep deprivation and test-retest gap

    Directory of Open Access Journals (Sweden)

    Varsha eSingh

    2013-09-01

    Full Text Available Risk in the Iowa Gambling Task (IGT is often understood in terms of intertemporal choices, i.e., preference for immediate outcomes in favor of delayed outcomes is considered risky. According to behavioral economics, decision makers refrain from choosing the short-sighted immediate gain because, over time (10 trials, the immediate gains result in a net loss. Instead decision makers are expected to maximize their gains by choosing options that, over time (10 trials, result in net gain. However, task choices are sometimes made on the basis of the frequency of reward and punishment such that infrequent punishments are favored over frequent punishments. The presence of these two attributes (intertemporality and frequency may correspond to the emotion-cognition dichotomy and reflect a dual conception of risk. Decision making on the basis of the two attributes was tested under two conditions: test-retest gap and sleep deprivation. An interaction between these two was expected to attenuate the difference between the two attributes (n=40 male. Analysis of the effects of IGT attribute type (intertemporal vs. frequency, sleep deprivation (sleep deprivation vs. no sleep deprivation, and test-retest gap (short vs. long showed a significant effect of IGT attribute type thus confirming the difference between the two attributes. Sleep deprivation had no effect on the attributes, but test-retest gap and the three-way interaction between attribute type, test-retest gap, and sleep deprivation were significant. Post-hoc tests showed sleep deprivation and short test-retest gap to attenuate the difference between the two attributes. As expected intertemporal decision making benefited from repeated task exposure. The findings add to understanding of the emotion-cognition dichotomy and show a time-dependent effect of a universally experienced constraint (sleep deprivation.

  3. Validity and test-retest reliability of a novel simple back extensor muscle strength test.

    Science.gov (United States)

    Harding, Amy T; Weeks, Benjamin Kurt; Horan, Sean A; Little, Andrew; Watson, Steven L; Beck, Belinda Ruth

    2017-01-01

    To develop and determine convergent validity and reliability of a simple and inexpensive clinical test to quantify back extensor muscle strength. Two testing sessions were conducted, 7 days apart. Each session involved three trials of standing maximal isometric back extensor muscle strength using both the novel test and isokinetic dynamometry. Lumbar spine bone mineral density was examined by dual-energy X-ray absorptiometry. Validation was examined with Pearson correlations ( r ). Test-retest reliability was examined with intraclass correlation coefficients and limits of agreement. Pearson correlations and intraclass correlation coefficients are presented with corresponding 95% confidence intervals. Linear regression was used to examine the ability of peak back extensor muscle strength to predict indices of lumbar spine bone mineral density and strength. A total of 52 healthy adults (26 men, 26 women) aged 46.4 ± 20.4 years were recruited from the community. A strong positive relationship was observed between peak back extensor strength from hand-held and isokinetic dynamometry ( r  = 0.824, p  strength test, short- and long-term reliability was excellent (intraclass correlation coefficient = 0.983 (95% confidence interval, 0.971-0.990), p  strength measures with the novel back extensor strength protocol were -6.63 to 7.70 kg, with a mean bias of +0.71 kg. Back extensor strength predicted 11% of variance in lumbar spine bone mineral density ( p  strength ( p  strength is quick, relatively inexpensive, and reliable; demonstrates initial convergent validity in a healthy population; and is associated with bone mass at a clinically important site.

  4. Test-Retest Reliability of Isokinetic Knee Strength Measurements in Children Aged 8 to 10 Years.

    Science.gov (United States)

    Fagher, Kristina; Fritzson, Annelie; Drake, Anna Maria

    Isokinetic dynamometry is a useful tool to objectively assess muscle strength of children and adults in athletic and rehabilitative settings. This study examined test-retest reliability of isokinetic knee strength measurements in children aged 8 to 10 years and defined limits for the minimum difference (MD) in strength that indicates a clinically important change. Isokinetic knee strength measurements (using the Biodex System 4) in children will provide reliable results. Descriptive laboratory study. In 22 healthy children, 5 maximal concentric (CON) knee extensor (KE) and knee flexor (KF) contractions at 2 angular velocities (60 deg/s and 180 deg/s) and 5 maximal eccentric (ECC) KE/KF contractions at 60 deg/s were assessed 7 days apart. The intraclass correlation coefficient (ICC 2.1 ) was used to examine relative reliability, and the MD was calculated on the basis of standard error of measurement. ICCs for CON KE/KF peak torque measurements were fair to excellent (range, 0.49-0.81). The MD% values for CON KE and KF ranged from 31% to 37% at 60 deg/s and from 34% to 39% at 180 deg/s. ICCs in the ECC mode were good (range, 0.60-0.70), but associated MD% values were high (>50%). There was no systematic error for CON KE/KF and ECC KE strength measurements at 60 deg/s, but systematic error was found for all other measurements. The dynamometer provides a reliable analysis of isokinetic CON knee strength measurements at 60 deg/s in children aged 8 to 10 years. Measurements at 180 deg/s and in the ECC mode were not reliable, indicating a need for more familiarization prior to testing. The MD values may help clinicians to determine whether a change in knee strength is due to error or intervention.

  5. Evaluating test-retest reliability in patient-reported outcome measures for older people: A systematic review.

    Science.gov (United States)

    Park, Myung Sook; Kang, Kyung Ja; Jang, Sun Joo; Lee, Joo Yun; Chang, Sun Ju

    2018-03-01

    This study aimed to evaluate the components of test-retest reliability including time interval, sample size, and statistical methods used in patient-reported outcome measures in older people and to provide suggestions on the methodology for calculating test-retest reliability for patient-reported outcomes in older people. This was a systematic literature review. MEDLINE, Embase, CINAHL, and PsycINFO were searched from January 1, 2000 to August 10, 2017 by an information specialist. This systematic review was guided by both the Preferred Reporting Items for Systematic Reviews and Meta-Analyses checklist and the guideline for systematic review published by the National Evidence-based Healthcare Collaborating Agency in Korea. The methodological quality was assessed by the Consensus-based Standards for the selection of health Measurement Instruments checklist box B. Ninety-five out of 12,641 studies were selected for the analysis. The median time interval for test-retest reliability was 14days, and the ratio of sample size for test-retest reliability to the number of items in each measure ranged from 1:1 to 1:4. The most frequently used statistical methods for continuous scores was intraclass correlation coefficients (ICCs). Among the 63 studies that used ICCs, 21 studies presented models for ICC calculations and 30 studies reported 95% confidence intervals of the ICCs. Additional analyses using 17 studies that reported a strong ICC (>0.09) showed that the mean time interval was 12.88days and the mean ratio of the number of items to sample size was 1:5.37. When researchers plan to assess the test-retest reliability of patient-reported outcome measures for older people, they need to consider an adequate time interval of approximately 13days and the sample size of about 5 times the number of items. Particularly, statistical methods should not only be selected based on the types of scores of the patient-reported outcome measures, but should also be described clearly in

  6. Test-retest reliability of Brazilian version of Memorial Symptom Assessment Scale for assessing symptoms in cancer patients.

    Science.gov (United States)

    Menezes, Josiane Roberta de; Luvisaro, Bianca Maria Oliveira; Rodrigues, Claudia Fernandes; Muzi, Camila Drumond; Guimarães, Raphael Mendonça

    2017-01-01

    To assess the test-retest reliability of the Memorial Symptom Assessment Scale translated and culturally adapted into Brazilian Portuguese. The scale was applied in an interview format for 190 patients with various cancers type hospitalized in clinical and surgical sectors of the Instituto Nacional de Câncer José de Alencar Gomes da Silva and reapplied in 58 patients. Data from the test-retest were double typed into a Microsoft Excel spreadsheet and analyzed by the weighted Kappa. The reliability of the scale was satisfactory in test-retest. The weighted Kappa values obtained for each scale item had to be adequate, the largest item was 0.96 and the lowest was 0.69. The Kappa subscale was also evaluated and values were 0.84 for high frequency physic symptoms, 0.81 for low frequency physical symptoms, 0.81 for psychological symptoms, and 0.78 for Global Distress Index. High level of reliability estimated suggests that the process of measurement of Memorial Symptom Assessment Scale aspects was adequate. Avaliar a confiabilidade teste-reteste da versão traduzida e adaptada culturalmente para o português do Brasil do Memorial Symptom Assessment Scale. A escala foi aplicada em forma de entrevista em 190 pacientes com diversos tipos de câncer internados nos setores clínicos e cirúrgicos do Instituto Nacional de Câncer José de Alencar Gomes da Silva e reaplicada em 58 pacientes. Os dados dos testes-retestes foram inseridos num banco de dados por dupla digitação independente em Excel e analisados pelo Kappa ponderado. A confiabilidade da escala mostrou-se satisfatória nos testes-retestes. Os valores do Kappa ponderado obtidos para cada item da escala apresentaram-se adequados, sendo o maior item de 0,96 e o menor de 0,69. Também se avaliou o Kappa das subescalas, sendo de 0,84 para sintomas físicos de alta frequência, de 0,81 para sintomas físicos de baixa frequência, de 0,81 também para sintomas psicológicos, e de 0,78 para Índice Geral de Sofrimento

  7. Relative and absolute test-retest reliabilities of pressure pain threshold in patients with knee osteoarthritis.

    Science.gov (United States)

    Srimurugan Pratheep, Neeraja; Madeleine, Pascal; Arendt-Nielsen, Lars

    2018-04-25

    Pressure pain threshold (PPT) and PPT maps are commonly used to quantify and visualize mechanical pain sensitivity. Although PPT's have frequently been reported from patients with knee osteoarthritis (KOA), the absolute and relative reliability of PPT assessments remain to be determined. Thus, the purpose of this study was to evaluate the test-retest relative and absolute reliability of PPT in KOA. For that purpose, intra- and interclass correlation coefficient (ICC) as well as the standard error of measurement (SEM) and the minimal detectable change (MDC) values within eight anatomical locations covering the most painful knee of KOA patients was measured. Twenty KOA patients participated in two sessions with a period of 2 weeks±3 days apart. PPT's were assessed over eight anatomical locations covering the knee and two remote locations over tibialis anterior and brachioradialis. The patients rated their maximum pain intensity during the past 24 h and prior to the recordings on a visual analog scale (VAS), and completed The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) and PainDetect surveys. The ICC, SEM and MDC between the sessions were assessed. The ICC for the individual variability was expressed with coefficient of variance (CV). Bland-Altman plots were used to assess potential bias in the dataset. The ICC ranged from 0.85 to 0.96 for all the anatomical locations which is considered "almost perfect". CV was lowest in session 1 and ranged from 44.2 to 57.6%. SEM for comparison ranged between 34 and 71 kPa and MDC ranged between 93 and 197 kPa with a mean PPT ranged from 273.5 to 367.7 kPa in session 1 and 268.1-331.3 kPa in session 2. The analysis of Bland-Altman plot showed no systematic bias. PPT maps showed that the patients had lower thresholds in session 2, but no significant difference was observed for the comparison between the sessions for PPT or VAS. No correlations were seen between PainDetect and PPT and PainDetect and WOMAC

  8. CPM Test-Retest Reliability: "Standard" vs "Single Test-Stimulus" Protocols.

    Science.gov (United States)

    Granovsky, Yelena; Miller-Barmak, Adi; Goldstein, Oren; Sprecher, Elliot; Yarnitsky, David

    2016-03-01

    Assessment of pain inhibitory mechanisms using conditioned pain modulation (CPM) is relevant clinically in prediction of pain and analgesic efficacy. Our objective is to provide necessary estimates of intersession CPM reliability, to enable transformation of the CPM paradigm into a clinical tool. Two cohorts of young healthy subjects (N = 65) participated in two dual-session studies. In Study I, a Bath-Thermode CPM protocol was used, with hot water immersion and contact heat as conditioning- and test-stimuli, respectively, in a classical parallel CPM design introducing test-stimulus first, and then the conditioning- and repeated test-stimuli in parallel. Study II consisted of two CPM protocols: 1) Two-Thermodes, one for each of the stimuli, in the same parallel design as above, and 2) single test-stimulus (STS) protocol with a single administration of a contact heat test-stimulus, partially overlapped in time by a remote shorter contact heat as conditioning stimulus. Test-retest reliability was assessed within 3-7 days. The STS-CPM had superior reliability intraclass correlation (ICC 2 ,: 1  = 0.59) over Bath-Thermode (ICC 2 ,: 1  = 0.34) or Two-Thermodes (ICC 2 ,: 1  = 0.21) protocols. The hand immersion conditioning pain had higher reliability than thermode pain (ICC 2 ,: 1  = 0.76 vs ICC 2 ,: 1  = 0.16). Conditioned test-stimulus pain scores were of good (ICC 2 ,: 1  = 0.62) or fair (ICC 2 ,: 1  = 0.43) reliability for the Bath-Thermode and the STS, respectively, but not for the Two-Thermodes protocol (ICC 2 ,: 1  = 0.20). The newly developed STS-CPM paradigm was more reliable than other CPM protocols tested here, and should be further investigated for its clinical relevance. It appears that large contact size of the conditioning-stimulus and use of single rather than dual test-stimulus pain contribute to augmentation of CPM reliability. © 2015 American Academy of Pain Medicine. All rights reserved. For permissions, please e

  9. Test-Retest Reliability of fMRI During Nonverbal Semantic Decisions in Moderate-Severe Nonfluent Aphasia Patients

    Directory of Open Access Journals (Sweden)

    Jacquie Kurland

    2004-01-01

    Full Text Available Cortical reorganization in poststroke aphasia is not well understood. Few studies have investigated neural mechanisms underlying language recovery in severe aphasia patients, who are typically viewed as having a poor prognosis for language recovery. Although test-retest reliability is routinely demonstrated during collection of language data in single-subject aphasia research, this is rarely examined in fMRI studies investigating the underlying neural mechanisms in aphasia recovery.

  10. Test-retest reliability of handgrip strength measurement using a hydraulic hand dynamometer in patients with cervical radiculopathy.

    Science.gov (United States)

    Savva, Christos; Giakas, Giannis; Efstathiou, Michalis; Karagiannis, Christos

    2014-01-01

    The purpose of this study was to evaluate the test-retest reliability of handgrip strength measurement using a hydraulic hand dynamometer in patients with cervical radiculopathy (CR). A convenience sample of 19 participants (14 men and 5 women; mean ± SD age, 50.5 ± 12 years) with CR was measured using a Jamar hydraulic hand dynamometer by the same rater on 2 different testing sessions with an interval of 7 days between sessions. Data collection procedures followed standardized grip strength testing guidelines established by the American Society of Hand Therapists. During the repeated measures, patients were advised to rest their upper limb in the standardized arm position and encouraged to exert 3 maximum gripping efforts. The mean value of the 3 efforts (measured in kilogram force [Kgf]) was used for data analysis. The intraclass correlation coefficient, SEM, and the Bland-Altman plot were used to estimate test-retest reliability and measurement precision. Grip strength measurement in CR demonstrated an intraclass correlation coefficient of 0.976, suggesting excellent test-retest reliability. The small SEM in both testing sessions (SEM1, 2.41 Kgf; SEM2, 2.51 Kgf) as well as the narrow width of the 95% limits of agreements (95% limits of agreement, -4.9 to 4.4 Kgf) in the Bland-Altman plot reflected precise measurements of grip strength in both occasions. Excellent test-retest reliability for grip strength measurement was measured in patients with CR, demonstrating that a hydraulic hand dynamometer could be used as an outcome measure for these patients. Copyright © 2014 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.

  11. Test-Retest Reliability of Handgrip Strength as an Outcome Measure in Patients With Symptoms of Shoulder Impingement Syndrome.

    Science.gov (United States)

    Savva, Christos; Mougiaris, Paraskevas; Xadjimichael, Christoforos; Karagiannis, Christos; Efstathiou, Michalis

    The purpose of this study was to investigate the degree of test-retest reliability of grip strength measurement using a hand dynamometer in patients with shoulder impingement syndrome. A total of 19 patients (10 women and 9 men; mean ± standard deviation age, 33.2 ± 12.9 years; range 18-59 years) with shoulder impingement syndrome were measured using a hand dynamometer by the same data collector in 2 different testing sessions with a 7-day interval. During each session, patients were encouraged to exert 3 maximal isometric contractions on the affected hand and the mean value of the 3 efforts (measured in kilogram-force [Kgf]) was used for data analysis. The intraclass correlation coefficient (ICC 2,1 ) as well as the standard error of measurement (SEM) and Bland-Altman plot were used to estimate the degree of test-retest reliability and the measurement error, respectively. Grip strength data analysis revealed an ICC 2,1 score of 0.94, which, based on the Shrout classification, is considered as excellent test-retest reliability of grip strength measurement. The small values of SEMs reported in both sessions (SEM 1 , 2.55 Kgf; SEM 2 , 2.39 Kgf) and the small width of the 95% limits of agreement in the Bland-Altman plot (ranging from -7.39 Kgf to 7.03 Kgf) reflected the measurement precision and the narrow variation of the differences during the 2 testing sessions. Results from this study identified excellent test-retest reliability of grip strength measurement in shoulder impingement syndrome, indicating its potential use as an outcome measure in clinical practice. Copyright © 2018. Published by Elsevier Inc.

  12. Test-retest reliability of computer-based video analysis of general movements in healthy term-born infants.

    Science.gov (United States)

    Valle, Susanne Collier; Støen, Ragnhild; Sæther, Rannei; Jensenius, Alexander Refsum; Adde, Lars

    2015-10-01

    A computer-based video analysis has recently been presented for quantitative assessment of general movements (GMs). This method's test-retest reliability, however, has not yet been evaluated. The aim of the current study was to evaluate the test-retest reliability of computer-based video analysis of GMs, and to explore the association between computer-based video analysis and the temporal organization of fidgety movements (FMs). Test-retest reliability study. 75 healthy, term-born infants were recorded twice the same day during the FMs period using a standardized video set-up. The computer-based movement variables "quantity of motion mean" (Qmean), "quantity of motion standard deviation" (QSD) and "centroid of motion standard deviation" (CSD) were analyzed, reflecting the amount of motion and the variability of the spatial center of motion of the infant, respectively. In addition, the association between the variable CSD and the temporal organization of FMs was explored. Intraclass correlation coefficients (ICC 1.1 and ICC 3.1) were calculated to assess test-retest reliability. The ICC values for the variables CSD, Qmean and QSD were 0.80, 0.80 and 0.86 for ICC (1.1), respectively; and 0.80, 0.86 and 0.90 for ICC (3.1), respectively. There were significantly lower CSD values in the recordings with continual FMs compared to the recordings with intermittent FMs (ptest-retest reliability of computer-based video analysis of GMs, and a significant association between our computer-based video analysis and the temporal organization of FMs. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  13. Test-Retest Reliability and Minimal Detectable Change of the D2 Test of Attention in Patients with Schizophrenia.

    Science.gov (United States)

    Lee, Posen; Lu, Wen-Shian; Liu, Chin-Hsuan; Lin, Hung-Yu; Hsieh, Ching-Lin

    2017-12-08

    The d2 Test of Attention (D2) is a commonly used measure of selective attention for patients with schizophrenia. However, its test-retest reliability and minimal detectable change (MDC) are unknown in patients with schizophrenia, limiting its utility in both clinical and research settings. The aim of the present study was to examine the test-retest reliability and MDC of the D2 in patients with schizophrenia. A rater administered the D2 on 108 patients with schizophrenia twice at a 1-month interval. Test-retest reliability was determined through the calculation of the intra-class correlation coefficient (ICC). We also carried out Bland-Altman analysis, which included a scatter plot of the differences between test and retest against their mean. Systematic biases were evaluated by use of a paired t-test. The ICCs for the D2 ranged from 0.78 to 0.94. The MDCs (MDC%) of the seven subscores were 102.3 (29.7), 19.4 (85.0), 7.2 (94.6), 21.0 (69.0), 104.0 (33.1), 105.0 (35.8), and 7.8 (47.8), which represented limited-to-acceptable random measurement error. Trends in the Bland-Altman plots of the omissions (E1), commissions (E2), and errors (E) were noted, presenting that the data had heteroscedasticity. According to the results, the D2 had good test-retest reliability, especially in the scores of TN, TN-E, and CP. For the further research, finding a way to improve the administration procedure to reduce random measurement error would be important for the E1, E2, E, and FR subscores. © The Author(s) 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  14. Questionnaire for measuring organisational attributes in dental-care practices: psychometric properties and test-retest reliability.

    Science.gov (United States)

    Goetz, Katja; Hasse, Philipp; Szecsenyi, Joachim; Campbell, Stephen M

    2016-04-01

    The consideration of organisational aspects, such as shared goals and clear communication, within the health care team is important to ensure good quality care. In primary health care, the instrument Survey of Organizational Attributes for Primary Care (SOAPC) is available to measure organisational attributes of care. However, there is no instrument available for dental care. The aim of the present study was to investigate psychometric properties and test-retest reliability of the version of SOAPC adapted for dental care, namely the Survey of Organizational Attributes in Dental Care (SOADC). The SOADC consists of 21 items in the following four subscales: communication; decision making; stress/chaos; and history of change. Convergent construct validity was measured using the job satisfaction scale. A total of 287 dental-care practices were asked to participate in the validation study. Psychometric properties and test-retest reliability were observed. A total of 43 dental-care practices responded to the survey. At baseline, 178 dental-care staff completed the questionnaire, and 4 weeks later 138 did so. Internal consistency, measured by Cronbach's alpha, was 0.718 or higher in the subscales. The test-retest reliability for each subscale and the overall SOADC score demonstrated good correlations over the 4-week test-retest interval, except for 'history of change'. A strong correlation with the aggregated job-satisfaction scale showed high convergent construct validity of SOADC. The consideration of organisational aspects from the perspective of dental-care teams is important for providing good quality of care. The SOADC is a reliable instrument with good psychometric properties and is suitable for the evaluation of organisational attributes in dental-care practices. © 2015 FDI World Dental Federation.

  15. Influences on the Test-Retest Reliability of Functional Connectivity MRI and its Relationship with Behavioral Utility.

    Science.gov (United States)

    Noble, Stephanie; Spann, Marisa N; Tokoglu, Fuyuze; Shen, Xilin; Constable, R Todd; Scheinost, Dustin

    2017-11-01

    Best practices are currently being developed for the acquisition and processing of resting-state magnetic resonance imaging data used to estimate brain functional organization-or "functional connectivity." Standards have been proposed based on test-retest reliability, but open questions remain. These include how amount of data per subject influences whole-brain reliability, the influence of increasing runs versus sessions, the spatial distribution of reliability, the reliability of multivariate methods, and, crucially, how reliability maps onto prediction of behavior. We collected a dataset of 12 extensively sampled individuals (144 min data each across 2 identically configured scanners) to assess test-retest reliability of whole-brain connectivity within the generalizability theory framework. We used Human Connectome Project data to replicate these analyses and relate reliability to behavioral prediction. Overall, the historical 5-min scan produced poor reliability averaged across connections. Increasing the number of sessions was more beneficial than increasing runs. Reliability was lowest for subcortical connections and highest for within-network cortical connections. Multivariate reliability was greater than univariate. Finally, reliability could not be used to improve prediction; these findings are among the first to underscore this distinction for functional connectivity. A comprehensive understanding of test-retest reliability, including its limitations, supports the development of best practices in the field. © The Author 2017. Published by Oxford University Press.

  16. Test-retest reliability and stability of N400 effects in a word-pair semantic priming paradigm.

    Science.gov (United States)

    Kiang, Michael; Patriciu, Iulia; Roy, Carolyn; Christensen, Bruce K; Zipursky, Robert B

    2013-04-01

    Elicited by any meaningful stimulus, the N400 event-related potential (ERP) component is reduced when the stimulus is related to a preceding one. This N400 semantic priming effect has been used to probe abnormal semantic relationship processing in clinical disorders, and suggested as a possible biomarker for treatment studies. Validating N400 semantic priming effects as a clinical biomarker requires characterizing their test-retest reliability. We assessed test-retest reliability of N400 semantic priming in 16 healthy adults who viewed the same related and unrelated prime-target word pairs in two sessions one week apart. As expected, N400 amplitudes were smaller for related versus unrelated targets across sessions. N400 priming effects (amplitude differences between unrelated and related targets) were highly correlated across sessions (r=0.85, Pmotivational changes. Use of N400 priming effects in treatment studies should account for possible magnitude decreases with repeat testing. Further research is needed to delineate N400 priming effects' test-retest reliability and stability in different age and clinical groups, and with different stimulus types. Copyright © 2012 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.

  17. Test-retest reliability and validity of a web-based food-frequency questionnaire for adolescents aged 13-14 to be used in the Norwegian Mother and Child Cohort Study (MoBa).

    Science.gov (United States)

    Overby, Nina Cecilie; Johannesen, Elisabeth; Jensen, Grete; Skjaevesland, Anne-Kirsti; Haugen, Margaretha

    2014-01-01

    The assessment of food intake is challenging and prone to errors; it is therefore important to consider the reliability and validity of the assessment methods. The aim of this study was to analyze the reproducibility and validity of a developed food-frequency questionnaire (FFQ) for use among adolescents. In total, 58 students (aged 13-14) from four different schools in the southern part of Norway participated in the reproducibility study of filling out the FFQ 4 weeks apart. In addition, 93 students participated in the relative validity study where the FFQ was compared to 2×24-hour dietary recalls, while 92 students participated in the absolute validity study where the intakes of fatty acids and vitamin D from the FFQ were compared to fatty acids and 25-hydroxy-vitamin D3 in whole blood. The median Spearman correlation coefficient for all nutrients in the test-retest reliability study was 0.57. The median Spearman correlation for all nutrients in the relative validity study was 0.26, while the correlations coefficients were low in the absolute validity study with n-3 fatty acid coefficients ranging from 0.05 to 0.25, and absent for vitamin D (r=0.000). The test-retest reproducibility was considered good, the relative validity was considered poor to good, and the absolute validity was considered poor. However, the results are comparable to other studies among adolescents.

  18. Stability of FDG-PET Radiomics features - An integrated analysis of test-retest and inter-observer variability

    Energy Technology Data Exchange (ETDEWEB)

    Leijenaar, Ralph T. H.; Carvalho, Sara; Rios Velazquez, Emmanuel [Dept. of Radiation Oncology (MAASTRO), GROW-School for Oncology and Developmental Biology, Maastricht Univ. Medical Center, Maastricht (Netherlands)] [and others

    2013-10-15

    Purpose: Besides basic measurements as maximum standardized uptake value (SUV){sub max} or SUV{sub mean} derived from 18F-FDG positron emission tomography (PET) scans, more advanced quantitative imaging features (i.e. 'Radiomics' features) are increasingly investigated for treatment monitoring, outcome prediction, or as potential biomarkers. With these prospected applications of Radiomics features, it is a requisite that they provide robust and reliable measurements. The aim of our study was therefore to perform an integrated stability analysis of a large number of PET-derived features in non-small cell lung carcinoma (NSCLC), based on both a test-retest and an inter-observer setup. Methods: Eleven NSCLC patients were included in the test-retest cohort. Patients underwent repeated PET imaging within a one day interval, before any treatment was delivered. Lesions were delineated by applying a threshold of 50 % of the maximum uptake value within the tumor. Twenty-three NSCLC patients were included in the inter-observer cohort. Patients underwent a diagnostic whole body PET-computed tomography (CT). Lesions were manually delineated based on fused PET-CT, using a standardized clinical delineation protocol. Delineation was performed independently by five observers, blinded to each other. Fifteen first order statistics, 39 descriptors of intensity volume histograms, eight geometric features and 44 textural features were extracted. For every feature, test-retest and inter-observer stability was assessed with the intra-class correlation coefficient (ICC) and the coefficient of variability, normalized to mean and range. Similarity between test-retest and inter-observer stability rankings of features was assessed with Spear man's rank correlation coefficient. Results: Results showed that the majority of assessed features had both a high test-retest (71%) and inter-observer (91%) stability in terms of their ICC. Overall, features more stable in repeated PET

  19. [11C]Harmine Binding to Brain Monoamine Oxidase A: Test-Retest Properties and Noninvasive Quantification.

    Science.gov (United States)

    Zanderigo, Francesca; D'Agostino, Alexandra E; Joshi, Nandita; Schain, Martin; Kumar, Dileep; Parsey, Ramin V; DeLorenzo, Christine; Mann, J John

    2018-02-08

    Inhibition of the isoform A of monoamine oxidase (MAO-A), a mitochondrial enzyme catalyzing deamination of monoamine neurotransmitters, is useful in treatment of depression and anxiety disorders. [ 11 C]harmine, a MAO-A PET radioligand, has been used to study mood disorders and antidepressant treatment. However, [ 11 C]harmine binding test-retest characteristics have to date only been partially investigated. Furthermore, since MAO-A is ubiquitously expressed, no reference region is available, thus requiring arterial blood sampling during PET scanning. Here, we investigate [ 11 C]harmine binding measurements test-retest properties; assess effects of using a minimally invasive input function estimation on binding quantification and repeatability; and explore binding potentials estimation using a reference region-free approach. Quantification of [ 11 C]harmine distribution volume (V T ) via kinetic models and graphical analyses was compared based on absolute test-retest percent difference (TRPD), intraclass correlation coefficient (ICC), and identifiability. The optimal procedure was also used with a simultaneously estimated input function in place of the measured curve. Lastly, an approach for binding potentials quantification in absence of a reference region was evaluated. [ 11 C]harmine V T estimates quantified using arterial blood and kinetic modeling showed average absolute TRPD values of 7.7 to 15.6 %, and ICC values between 0.56 and 0.86, across brain regions. Using simultaneous estimation (SIME) of input function resulted in V T estimates close to those obtained using arterial input function (r = 0.951, slope = 1.073, intercept = - 1.037), with numerically but not statistically higher test-retest difference (range 16.6 to 22.0 %), but with overall poor ICC values, between 0.30 and 0.57. Prospective studies using [ 11 C]harmine are possible given its test-retest repeatability when binding is quantified using arterial blood. Results with SIME of

  20. Test-retest measurements of dopamine D_1-type receptors using simultaneous PET/MRI imaging

    International Nuclear Information System (INIS)

    Kaller, Simon; Patt, Marianne; Becker, Georg-Alexander; Luthardt, Julia; Meyer, Philipp M.; Werner, Peter; Barthel, Henryk; Bresch, Anke; Sabri, Osama; Rullmann, Michael; Girbardt, Johanna; Fritz, Thomas H.; Hesse, Swen

    2017-01-01

    The role of dopamine D_1-type receptor (D_1R)-expressing neurons in the regulation of motivated behavior and reward prediction has not yet been fully established. As a prerequisite for future research assessing D_1-mediated neuronal network regulation using simultaneous PET/MRI and D_1R-selective ["1"1C]SCH23390, this study investigated the stability of central D_1R measurements between two independent PET/MRI sessions under baseline conditions. Thirteen healthy volunteers (7 female, age 33 ± 13 yrs) underwent 90-min emission scans, each after 90-s bolus injection of 486 ± 16 MBq ["1"1C]SCH23390, on two separate days within 2-4 weeks using a PET/MRI system. Parametric images of D_1R distribution volume ratio (DVR) and binding potential (BP_N_D) were generated by a multi-linear reference tissue model with two parameters and the cerebellar cortex as receptor-free reference region. Volume-of-interest (VOI) analysis was performed with manual VOIs drawn on consecutive transverse MRI slices for brain regions with high and low D_1R density. The DVR varied from 2.5 ± 0.3 to 2.9 ± 0.5 in regions with high D_1R density (e.g. the head of the caudate) and from 1.2 ± 0.1 to 1.6 ± 0.2 in regions with low D_1R density (e.g. the prefrontal cortex). The absolute variability of the DVR ranged from 2.4% ± 1.3% to 5.1% ± 5.3%, while Bland-Altman analyses revealed very low differences in mean DVR (e.g. 0.013 ± 0.17 for the nucleus accumbens). Intraclass correlation (one-way, random) indicated very high agreement (0.93 in average) for both DVR and BP_N_D values. Accordingly, the absolute variability of BP_N_D ranged from 7.0% ± 4.7% to 12.5% ± 10.6%; however, there were regions with very low D_1R content, such as the occipital cortex, with higher mean variability. The test-retest reliability of D_1R measurements in this study was very high. This was the case not only for D_1R-rich brain areas, but also for regions with low D_1R density. These results will provide a solid base

  1. A reliability generalization meta-analysis of coefficient alpha and test-retest coefficient for the aging males' symptoms (AMS) scale.

    Science.gov (United States)

    Lee, Chin-Pang; Chiu, Yu-Wen; Chu, Chun-Lin; Chen, Yu; Jiang, Kun-Hao; Chen, Jiun-Liang; Chen, Ching-Yen

    2016-12-01

    The aging males' symptoms (AMS) scale is an instrument used to determine the health-related quality of life in adult and elderly men. The purpose of this study was to synthesize internal consistency (Cronbach's alpha) and test-retest reliability for the AMS scale and its three subscales. Of the 123 studies reviewed, 12 provided alpha coefficients which were then used in the meta-analyses of internal consistency. Seven of the 12 included studies provided test-retest coefficients, and these were used in the meta-analyses of test-retest reliability. The AMS scale had excellent internal consistency [α = 0.89 (95% CI 0.88-0.90)]; the mean alpha estimates across the AMS subscales ranged from 0.79 to 0.82. The AMS scale also had good test-retest reliability [r = 0.85 (95% CI 0.82-0.88]; the test-retest reliability coefficients of the AMS subscales ranged from 0.76 to 0.83. There was significant heterogeneity among the included studies. The AMS scale and the three subscales had fairly good internal consistency and test-retest reliability. Future psychometric studies of the AMS scale should report important characteristics of the participants, details of item scores, and test-retest reliability.

  2. Test-Retest Reliability of an Experienced Global Trigger Tool Review Team

    DEFF Research Database (Denmark)

    Bjørn, Brian; Anhøj, Jacob; Østergaard, Mette

    2018-01-01

    and review 2 and between period 1 and period 2. The increase was solely in category E, minor temporary harm. CONCLUSIONS: The very experienced GTT team could not reproduce harm rates found in earlier reviews. We conclude that GTT in its present form is not a reliable measure of harm rate over time....

  3. Comparative test-retest reliability of metabolite values assessed with magnetic resonance spectroscopy of the brain. The LCModel versus the manufacturer software.

    Science.gov (United States)

    Fayed, Nicolas; Modrego, Pedro J; Medrano, Jaime

    2009-06-01

    Reproducibility is an essential strength of any diagnostic technique for cross-sectional and longitudinal works. To determine in vivo short-term comparatively, the test-retest reliability of magnetic resonance spectroscopy (MRS) of the brain was compared using the manufacturer's software package and the widely used linear combination of model (LCModel) technique. Single-voxel H-MRS was performed in a series of patients with different pathologies on a 1.5 T clinical scanner. Four areas of the brain were explored with the point resolved spectroscopy technique acquisition mode; the echo time was 35 milliseconds and the repetition time was 2000 milliseconds. We enrolled 15 patients for every area, and the intra-individual variations of metabolites were studied in two consecutive scans without removing the patient from the scanner. Curve fitting and analysis of metabolites were made with the software of GE and the LCModel. Spectra non-fulfilling the minimum criteria of quality in relation to linewidths and signal/noise ratio were rejected. The intraclass correlation coefficients for the N-acetylaspartate/creatine (NAA/Cr) ratios were 0.93, 0.89, 0.9 and 0.8 for the posterior cingulate gyrus, occipital, prefrontal and temporal regions, respectively, with the GE software. For the LCModel, the coefficients were 0.9, 0.89, 0.87 and 0.84, respectively. For the absolute value of NAA, the GE software was also slightly more reproducible than LCModel. However, for the choline/Cr and myo-inositol/Cr ratios, the LCModel was more reliable than the GE software. The variability we have seen hovers around the percentages observed in previous reports (around 10% for the NAA/Cr ratios). We did not find that the LCModel software is superior to the software of the manufacturer. Reproducibility of metabolite values relies more on the observance of the quality parameters than on the software used.

  4. Test-retest reliability and minimal detectable change of two simplified 3-point balance measures in patients with stroke.

    Science.gov (United States)

    Chen, Yi-Miau; Huang, Yi-Jing; Huang, Chien-Yu; Lin, Gong-Hong; Liaw, Lih-Jiun; Lee, Shih-Chieh; Hsieh, Ching-Lin

    2017-10-01

    The 3-point Berg Balance Scale (BBS-3P) and 3-point Postural Assessment Scale for Stroke Patients (PASS-3P) were simplified from the BBS and PASS to overcome the complex scoring systems. The BBS-3P and PASS-3P were more feasible in busy clinical practice and showed similarly sound validity and responsiveness to the original measures. However, the reliability of the BBS-3P and PASS-3P is unknown limiting their utility and the interpretability of scores. We aimed to examine the test-retest reliability and minimal detectable change (MDC) of the BBS-3P and PASS-3P in patients with stroke. Cross-sectional study. The rehabilitation departments of a medical center and a community hospital. A total of 51 chronic stroke patients (64.7% male). Both balance measures were administered twice 7 days apart. The test-retest reliability of both the BBS-3P and PASS-3P were examined by intraclass correlation coefficients (ICC). The MDC and its percentage over the total score (MDC%) of each measure was calculated for examining the random measurement errors. The ICC values of the BBS-3P and PASS-3P were 0.99 and 0.97, respectively. The MDC% (MDC) of the BBS-3P and PASS-3P were 9.1% (5.1 points) and 8.4% (3.0 points), respectively, indicating that both measures had small and acceptable random measurement errors. Our results showed that both the BBS-3P and the PASS-3P had good test-retest reliability, with small and acceptable random measurement error. These two simplified 3-level balance measures can provide reliable results over time. Our findings support the repeated administration of the BBS-3P and PASS-3P to monitor the balance of patients with stroke. The MDC values can help clinicians and researchers interpret the change scores more precisely.

  5. Laterality judgments in people with low back pain--A cross-sectional observational and test-retest reliability study.

    Science.gov (United States)

    Linder, Martin; Michaelson, Peter; Röijezon, Ulrik

    2016-02-01

    Disruption of cortical representation, or body schema, has been indicated as a factor in the persistence and recurrence of low back pain (LBP). This has been observed through impaired laterality judgment ability and it has been suggested that this ability is affected in a spatial rather than anatomical manner. We compared laterality judgment performance of foot and trunk movements between people with LBP with or without leg pain and healthy controls, and investigated associations between test performance and pain. We also assessed the test-retest reliability of the Recognise Online™ software when used in a clinical and a home setting. Cross-sectional observational and test-retest study. Thirty individuals with LBP and 30 healthy controls performed judgment tests of foot and trunk laterality once supervised in a clinic and twice at home. No statistically significant group differences were found. LBP intensity was negatively related to trunk laterality accuracy (p = 0.019). Intraclass correlation values ranged from 0.51 to 0.91. Reaction time improved significantly between test occasions while accuracy did not. Laterality judgments were not impaired in subjects with LBP compared to controls. Further research may clarify the relationship between pain mechanisms in LBP and laterality judgment ability. Reliability values were mostly acceptable, with wide and low confidence intervals, suggesting test-retest reliability for Recognise Online™ could be questioned in this trial. A significant learning effect was observed which should be considered in clinical and research application of the test. Copyright © 2015 Elsevier Ltd. All rights reserved.

  6. Maximal cardiorespiratory fitness testing in individuals with chronic stroke with cognitive impairment: practice test effects and test-retest reliability.

    Science.gov (United States)

    Olivier, Charles; Doré, Jean; Blanchet, Sophie; Brooks, Dina; Richards, Carol L; Martel, Guy; Robitaille, Nancy-Michelle; Maltais, Désirée B

    2013-11-01

    To evaluate, for individuals with chronic stroke with cognitive impairment, (1) the effects of a practice test on peak cardiorespiratory fitness test results; (2) cardiorespiratory fitness test-retest reliability; and (3) the relationship between individual practice test effects and cognitive impairment. Cross-sectional. Rehabilitation center. A convenience sample of 21 persons (men [n=12] and women [n=9]; age range, 48-81y; 44.9±36.2mo poststroke) with cognitive impairments who had sufficient lower limb function to perform the test. Not applicable. Peak oxygen consumption (Vo(2)peak, ml·kg(-1)·min(-1)). Test-retest reliability of Vo(2)peak was excellent (intraclass correlation coefficient model 2,1 [ICC2,1]=.94; 95% confidence interval [CI], .86-.98). A paired t test showed that there was no significant difference for the group for Vo(2)peak obtained from 2 symptom-limited cardiorespiratory fitness tests performed 1 week apart on a semirecumbent cycle ergometer (test 2-test 1 difference, -.32ml·kg(-1)·min(-1); 95% CI, -.69 to 1.33ml·kg(-1)·min(-1); P=.512). Individual test-retest differences in Vo(2)peak were, however, positively related to general cognitive function as measured by the Mini-Mental State Examination (ρ=.485; Preliably measured in this group without a practice test. General cognitive function, however, may influence the effect of a practice test in that those with lower general cognitive function appear to respond differently to a practice test than those with higher cognitive function. Copyright © 2013 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

  7. Test-retest repeatability of child's respiratory symptoms and perceived indoor air quality - comparing self- and parent-administered questionnaires.

    Science.gov (United States)

    Lampi, Jussi; Ung-Lanki, Sari; Santalahti, Päivi; Pekkanen, Juha

    2018-02-09

    Questionnaires can be used to assess perceived indoor air quality and symptoms in schools. Questionnaires for primary school aged children have traditionally been parent-administered, but self-administered questionnaires would be easier to administer and may yield as good, if not better, information. Our aim was to compare the repeatability of self- and parent-administered indoor air questionnaires designed for primary school aged pupils. Indoor air questionnaire with questions on child's symptoms and perceived indoor air quality in schools was sent to parents of pupils aged 7-12 years in two schools and again after two weeks. Slightly modified version of the questionnaire was administered to pupils aged 9-12 years in another two schools and repeated after a week. 351 (52%) parents and 319 pupils (86%) answered both the first and the second questionnaire. Test-retest repeatability was assessed with intra-class correlation (ICC) and Cohen's kappa coefficients (k). Test-retest repeatability was generally between 0.4-0.7 (ICC; k) in both self- and parent-administered questionnaire. In majority of the questions on symptoms and perceived indoor air quality test-retest repeatability was at the same level or slightly better in self-administered compared to parent-administered questionnaire. Agreement of self- and parent administered questionnaires was generally indoor air quality. Children aged 9-12 years can give as, or even more, repeatable information about their respiratory symptoms and perceived indoor air quality than their parents. Therefore, it may be possible to use self-administered questionnaires in future studies also with children.

  8. Dual conception of risk in the Iowa Gambling Task: effects of sleep deprivation and test-retest gap.

    Science.gov (United States)

    Singh, Varsha

    2013-01-01

    Risk in the Iowa Gambling Task (IGT) is often understood in terms of intertemporal choices, i.e., preference for immediate outcomes in favor of delayed outcomes is considered risky decision making. According to behavioral economics, healthy decision makers are expected to refrain from choosing the short-sighted immediate gain because, over time (10 trials of the IGT), the immediate gains result in a long term loss (net loss). Instead decision makers are expected to maximize their gains by choosing options that, over time (10 trials), result in delayed or long term gains (net gain). However, task choices are sometimes made on the basis of the frequency of reward and punishment such that frequent rewards/infrequent punishments are favored over infrequent rewards/frequent punishments. The presence of these two attributes (intertemporality and frequency of reward) in IGT decision making may correspond to the emotion-cognition dichotomy and reflect a dual conception of risk. Decision making on the basis of the two attributes was tested under two conditions: delay in retest and sleep deprivation. An interaction between sleep deprivation and time delay was expected to attenuate the difference between the two attributes. Participants were 40 male university students. Analysis of the effects of IGT attribute type (intertemporal vs. frequency of reinforcement), sleep deprivation (sleep deprivation vs. no sleep deprivation), and test-retest gap (short vs. long delay) showed a significant within-subjects effect of IGT attribute type thus confirming the difference between the two attributes. Sleep deprivation had no effect on the attributes, but test-retest gap and the three-way interaction between attribute type, test-retest gap, and sleep deprivation were significantly different. Post-hoc tests revealed that sleep deprivation and short test-retest gap attenuated the difference between the two attributes. Furthermore, the results showed an expected trend of increase in

  9. Assessment of test-retest reliability and internal consistency of the Wisconsin Gait Scale in hemiparetic post-stroke patients

    Directory of Open Access Journals (Sweden)

    Guzik Agnieszka

    2016-09-01

    Full Text Available Introduction: A proper assessment of gait pattern is a significant aspect in planning the process of teaching gait in hemiparetic post-stroke patients. The Wisconsin Gait Scale (WGS is an observational tool for assessing post-stroke patients’ gait. The aim of the study was to assess test-retest reliability and internal consistency of the WGS and examine correlations between gait assessment made with the WGS and gait speed, Brunnström scale, Ashworth’s scale and the Barthel Index.

  10. Test-retest reliability of stride time variability while dual tasking in healthy and demented adults with frontotemporal degeneration

    Directory of Open Access Journals (Sweden)

    Herrmann Francois R

    2011-07-01

    Full Text Available Abstract Background Although test-retest reliability of mean values of spatio-temporal gait parameters has been assessed for reliability while walking alone (i.e., single tasking, little is known about the test-retest reliability of stride time variability (STV while performing an attention demanding-task (i.e., dual tasking. The objective of this study was to examine immediate test-retest reliability of STV while single and dual tasking in cognitively healthy older individuals (CHI and in demented patients with frontotemporal degeneration (FTD. Methods Based on a cross-sectional design, 69 community-dwelling CHI (mean age 75.5 ± 4.3; 43.5% women and 14 demented patients with FTD (mean age 65.7 ± 9.8 years; 6.7% women walked alone (without performing an additional task; i.e., single tasking and while counting backward (CB aloud starting from 50 (i.e., dual tasking. Each subject completed two trials for all the testing conditions. The mean value and the coefficient of variation (CoV of stride time while walking alone and while CB at self-selected walking speed were measured using GAITRite® and SMTEC® footswitch systems. Results ICC of mean value in CHI under both walking conditions were higher than ICC of demented patients with FTD and indicated perfect reliability (ICC > 0.80. Reliability of mean value was better while single tasking than dual tasking in CHI (ICC = 0.96 under single-task and ICC = 0.86 under dual-task, whereas it was the opposite in demented patients (ICC = 0.65 under single-task and ICC = 0.81 under dual-task. ICC of CoV was slight to poor whatever the group of participants and the walking condition (ICC Conclusions The immediate test-retest reliability of the mean value of stride time in single and dual tasking was good in older CHI as well as in demented patients with FTD. In contrast, the variability of stride time was low in both groups of participants.

  11. Test-retest reliability and agreement of the Satisfaction with the Assistive Technology Services (SATS) instrument in two Nordic countries.

    Science.gov (United States)

    Sund, Terje; Iwarsson, Susanne; Anttila, Heidi; Helle, Tina; Brandt, Ase

    2014-07-01

    The purpose of this study was to investigate test-retest reliability, agreement, internal consistency, and floor- and ceiling effects of the Danish and Finnish versions of the Satisfaction with the Assistive Technology Services (SATS) instrument among adult users of powered wheelchairs (PWCs) or powered scooters (scooters). Test-retest design, two telephone interviews 7-18 days apart of 40 informants, with mean age of 67.5 (SD 13.09) years in the Danish; and 54 informants with mean age of 55.6 (SD 12.09) years in the Finnish sample. The intra-class correlation coefficient varied between 0.57 and 0.93 for items in the Danish and between 0.41 and 0.93 in the Finnish sample. The percentage agreement varied between 54.2 and 79.5 for items in the Danish and between 69.2 and 81.1 in the Finnish sample, while the Cronbach's alpha values varied between 0.87 and 0.96 in the two samples. A ceiling effect was found in all items of both samples. This study indicates that the SATS may be reliably administered for telephone interviews among adult PWC and scooter users, and give information about aspects of the service delivery process for quality development improvement purposes. Further psychometric testing of the SATS is required.

  12. Test-retest reliability and predictors of unreliable reporting for a sexual behavior questionnaire for U.S. men.

    Science.gov (United States)

    Nyitray, Alan G; Harris, Robin B; Abalos, Andrew T; Nielson, Carrie M; Papenfuss, Mary; Giuliano, Anna R

    2010-12-01

    Accurate knowledge about human sexual behaviors is important for increasing our understanding of human sexuality; however, there have been few studies assessing the reliability of sexual behavior questionnaires designed for community samples of adult men. A test-retest reliability study was conducted on a questionnaire completed by 334 men who had been recruited in Tucson, Arizona. Reliability coefficients and refusal rates were calculated for 39 non-sexual and sexual behavior questionnaire items. Predictors of unreliable reporting for lifetime number of female sexual partners were also assessed. Refusal rates were generally low, with slightly higher refusal rates for questions related to immigration, income, the frequency of sexual intercourse with women, lifetime number of female sexual partners, and the lifetime number of male anal sex partners. Kappa and intraclass correlation coefficients were substantial or almost perfect for all non-sexual and sexual behavior items. Reliability dropped somewhat, but was still substantial, for items that asked about household income and the men's knowledge of their sexual partners' health, including abnormal Pap tests and prior sexually transmitted diseases (STD). Age and lifetime number of female sexual partners were independent predictors of unreliable reporting while years of education was inversely associated with unreliable reporting. These findings among a community sample of adult men are consistent with other test-retest reliability studies with populations of women and adolescents.

  13. Test-retest reliability of Physical Activity Neighborhood Environment Scale among urban men and women in Nanjing, China.

    Science.gov (United States)

    Zhao, L; Wang, Z; Qin, Z; Leslie, E; He, J; Xiong, Y; Xu, F

    2018-03-01

    The identification of physical-activity-friendly built environment (BE) constructs is highly useful for physical activity promotion and maintenance. The Physical Activity Neighborhood Environment Scale (PANES) was developed for assessing BE correlates. However, PANES reliability has not been investigated among adults in China. A cross-sectional study. With multistage sampling approaches, 1568 urban adults (aged 35-74 years) were recruited for the initial survey on all 17 items of PANES Chinese version (PANES-CHN), with the survey repeated 7 days later for each participant. Intraclass correlation coefficient (ICC) was used to assess the test-retest reliability of PANES-CHN for each item. Totally, 1551 participants completed both surveys (follow-up rate = 98.9%). Among participants (mean age: 54.7 ± 11.1 years), 47.8% were men, 22.1% were elders, and 22.7% had ≥13 years of education. Overall, the PANES-CHN demonstrated at least substantial reliability with ICCs ranging from 0.66 to 0.95 (core items), from 0.75 to 0.95 (recommended items), and from 0.78 to 0.87 (optional items). Similar outcomes were observed when data were analyzed by gender or age groups. The PANES-CHN has excellent test-retest reliability and thus has valuable utility for assessing urban BE attributes among Chinese adults. Copyright © 2017 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.

  14. Test-retest reliability of speech-evoked auditory brainstem response in healthy children at a low sensation level.

    Science.gov (United States)

    Zakaria, Mohd Normani; Jalaei, Bahram

    2017-11-01

    Auditory brainstem responses evoked by complex stimuli such as speech syllables have been studied in normal subjects and subjects with compromised auditory functions. The stability of speech-evoked auditory brainstem response (speech-ABR) when tested over time has been reported but the literature is limited. The present study was carried out to determine the test-retest reliability of speech-ABR in healthy children at a low sensation level. Seventeen healthy children (6 boys, 11 girls) aged from 5 to 9 years (mean = 6.8 ± 3.3 years) were tested in two sessions separated by a 3-month period. The stimulus used was a 40-ms syllable /da/ presented at 30 dB sensation level. As revealed by pair t-test and intra-class correlation (ICC) analyses, peak latencies, peak amplitudes and composite onset measures of speech-ABR were found to be highly replicable. Compared to other parameters, higher ICC values were noted for peak latencies of speech-ABR. The present study was the first to report the test-retest reliability of speech-ABR recorded at low stimulation levels in healthy children. Due to its good stability, it can be used as an objective indicator for assessing the effectiveness of auditory rehabilitation in hearing-impaired children in future studies. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. Test-retest reliability of the Battery for the Assessment of Auditory Sensorimotor and Timing Abilities (BAASTA).

    Science.gov (United States)

    Bégel, Valentin; Verga, Laura; Benoit, Charles-Etienne; Kotz, Sonja A; Bella, Simone Dalla

    2018-04-27

    Perceptual and sensorimotor timing skills can be comprehensively assessed with the Battery for the Assessment of Auditory Sensorimotor and Timing Abilities (BAASTA). The battery has been used for testing rhythmic skills in healthy adults and patient populations (e.g., with Parkinson disease), showing sensitivity to timing and rhythm deficits. Here we assessed the test-retest reliability of the BAASTA in 20 healthy adults. Participants were tested twice with the BAASTA, implemented on a tablet interface, with a 2-week interval. They completed 4 perceptual tasks, namely, duration discrimination, anisochrony detection with tones and music, and the Beat Alignment Test (BAT). Moreover, they completed motor tasks via finger tapping, including unpaced and paced tapping with tones and music, synchronization-continuation, and adaptive tapping to a sequence with a tempo change. Despite high variability among individuals, the results showed stable test-retest reliability in most tasks. A slight but significant improvement from test to retest was found in tapping with music, which may reflect a learning effect. In general, the BAASTA was found a reliable tool for evaluating timing and rhythm skills. Copyright © 2018 Elsevier Masson SAS. All rights reserved.

  16. Development of an Agility Test for Badminton Players and Assessment of Its Validity and Test-Retest Reliability.

    Science.gov (United States)

    Loureiro, Luiz de França Bahia; de Freitas, Paulo Barbosa

    2016-04-01

    Badminton requires open and fast actions toward the shuttlecock, but there is no specific agility test for badminton players with specific movements. To develop an agility test that simultaneously assesses perception and motor capacity and examine the test's concurrent and construct validity and its test-retest reliability. The Badcamp agility test consists of running as fast as possible to 6 targets placed on the corners and middle points of a rectangular area (5.6 × 4.2 m) from the start position located in the center of it, following visual stimuli presented in a luminous panel. The authors recruited 43 badminton players (17-32 y old) to evaluate concurrent (with shuttle-run agility test--SRAT) and construct validity and test-retest reliability. Results revealed that Badcamp presents concurrent and construct validity, as its performance is strongly related to SRAT (ρ = 0.83, P < .001), with performance of experts being better than nonexpert players (P < .01). In addition, Badcamp is reliable, as no difference (P = .07) and a high intraclass correlation (ICC = .93) were found in the performance of the players on 2 different occasions. The findings indicate that Badcamp is an effective, valid, and reliable tool to measure agility, allowing coaches and athletic trainers to evaluate players' athletic condition and training effectiveness and possibly detect talented individuals in this sport.

  17. Confiabilidade teste-reteste de aspectos da rede social no Estudo Pró-Saúde Test-retest reliability of measures of social network in the "Pró-Saúde" Study

    Directory of Open Access Journals (Sweden)

    Rosane Harter Griep

    2003-06-01

    Full Text Available OBJETIVO: Avaliar os níveis de confiabilidade teste-reteste de informações relativas à rede social no Estudo Pró-saúde. MÉTODOS: Foi estimada a confiabilidade pelo estudo teste-reteste por meio de questionário multidimensional aplicado a uma coorte de trabalhadores de uma universidade. O mesmo questionário foi preenchido duas vezes por 192 funcionários não efetivos da universidade, com duas semanas de intervalo entre as aplicações. A concordância foi estimada pela estatística Kappa (variáveis categóricas, estatística Kappa ponderado e modelos log-lineares (variáveis ordinais, e coeficiente de correlação intraclasse (variáveis discretas. RESULTADOS: As medidas de concordância situaram-se acima de 0,70 para a maioria das variáveis. Estratificando-se as informações segundo gênero, idade e escolaridade, observou-se que a confiabilidade não apresentou padrão consistente de variabilidade. A aplicação de modelos log-lineares indicou que, para as variáveis ordinais do estudo, o modelo de melhor ajuste foi o de "concordância diagonal mais associação linear por linear". CONCLUSÕES: Os altos níveis de confiabilidade estimados permitem concluir que o processo de aferição dos itens sobre rede social foi adequado para as características investigadas. Estudos de validação em andamento complementarão a avaliação da qualidade dessas informações.OBJECTIVE: To evaluate test-retest reliability of social network-related information of the" Pró-Saúde" study. METHODS: A test-retest reliability study was conducted using a multidimensional questionnaire applied to a cohort of university employees. The same questionnaire was filled out twice by 192 non-permanent employees with two weeks apart. Agreement was estimated using kappa statistics (categorical variables, weighted kappa statistics, log-linear models (ordinal variables, and intraclass correlation coefficient (discrete variables. RESULTS: Estimates of reliability

  18. The test-retest reliability of the latent construct of executive function depends on whether tasks are represented as formative or reflective indicators.

    Science.gov (United States)

    Willoughby, Michael T; Kuhn, Laura J; Blair, Clancy B; Samek, Anya; List, John A

    2017-10-01

    This study investigates the test-retest reliability of a battery of executive function (EF) tasks with a specific interest in testing whether the method that is used to create a battery-wide score would result in differences in the apparent test-retest reliability of children's performance. A total of 188 4-year-olds completed a battery of computerized EF tasks twice across a period of approximately two weeks. Two different approaches were used to create a score that indexed children's overall performance on the battery-i.e., (1) the mean score of all completed tasks and (2) a factor score estimate which used confirmatory factor analysis (CFA). Pearson and intra-class correlations were used to investigate the test-retest reliability of individual EF tasks, as well as an overall battery score. Consistent with previous studies, the test-retest reliability of individual tasks was modest (rs ≈ .60). The test-retest reliability of the overall battery scores differed depending on the scoring approach (r mean  = .72; r factor_ score  = .99). It is concluded that the children's performance on individual EF tasks exhibit modest levels of test-retest reliability. This underscores the importance of administering multiple tasks and aggregating performance across these tasks in order to improve precision of measurement. However, the specific strategy that is used has a large impact on the apparent test-retest reliability of the overall score. These results replicate our earlier findings and provide additional cautionary evidence against the routine use of factor analytic approaches for representing individual performance across a battery of EF tasks.

  19. Reproducibility of Tactile Assessments for Children with Unilateral Cerebral Palsy

    Science.gov (United States)

    Auld, Megan Louise; Ware, Robert S.; Boyd, Roslyn Nancy; Moseley, G. Lorimer; Johnston, Leanne Marie

    2012-01-01

    A systematic review identified tactile assessments used in children with cerebral palsy (CP), but their reproducibility is unknown. Sixteen children with unilateral CP and 31 typically developing children (TDC) were assessed 2-4 weeks apart. Test-retest percent agreements within one point for children with unilateral CP (and TDC) were…

  20. A Test-Retest Reliability Study of the Whiplash Disability Questionnaire in Patients With Acute Whiplash-Associated Disorders.

    Science.gov (United States)

    Stupar, Maja; Côté, Pierre; Beaton, Dorcas E; Boyle, Eleanor; Cassidy, J David

    2015-01-01

    The purpose of this study was to determine the test-retest reliability and the Minimal Detectable Change (MDC) of the Whiplash Disability Questionnaire (WDQ) in individuals with acute whiplash-associated disorders (WADs). We performed a test-retest reliability study. We included insurance claimants from Ontario who were at least 18 years of age, within 21 days of their motor vehicle collision and diagnosed as having acute WAD grades I to III. The WDQ, a 13-item questionnaire scored from 0 (no disability) to 130 (complete disability), was administered to all participants at baseline and by telephone 3 days later. We computed the intraclass correlation coefficient (model 2,1) and the MDC with 95% confidence intervals (CIs; MDC95). The mean (SD) age of the 66 participants was 41.6 (12.7) years and 71.2% were female. Twenty-nine percent had WAD I and 71.2% had WAD II. Time since injury ranged from 0 to 19 days. The mean (SD) baseline WDQ score was 49.3 (28.8) and 46.5 (29.8) 3 days later. The intraclass correlation coefficient for the WDQ total score was 0.89 (95% CI, 0.85-0.92) in the entire sample and 0.83 (95% CI, 0.69-0.93) for the 15 participants reporting no change in neck pain. The MDC95 of the WDQ was 21.4 (SD = 14.9) for participants reporting no change. The WDQ was reliable in individuals with acute WAD. There is 95% confidence that a change of approximately one-sixth of the total score is beyond the daily variation of a stable condition. This level of measurement error must be taken into consideration when interpreting change in WDQ scores. Copyright © 2015 National University of Health Sciences. Published by Elsevier Inc. All rights reserved.

  1. Test-retest variability of multifocal electroretinography in normal volunteers and short-term variability in hydroxychloroquine users

    Directory of Open Access Journals (Sweden)

    Browning DJ

    2014-08-01

    Full Text Available David J Browning,1 Chong Lee2 1Charlotte Eye, Ear, Nose and Throat Associates, 2University of North Carolina – Charlotte, Charlotte, NC, USA Purpose: To determine measurement variability of N1P1 amplitudes and the R1/R2 ratio in normal subjects and hydroxychloroquine users without retinopathy. Design: Retrospective, observational study. Subjects: Normal subjects (n=21 and 44 patients taking hydroxychloroquine (n=44 without retinopathy. Methods: Multifocal electroretinography (mfERG was performed twice in one session in the 21 normal subjects and twice within 1 year in the hydroxychloroquine users, during which time no clinical change in macular status occurred. Main outcome measures: N1P1 amplitudes of rings R1–R5, the R1/R2 ratio, and coefficients of repeatability (COR for these measurements. Results: Values for N1P1 amplitudes in hydroxychloroquine users were reduced compared with normal subjects by the known effect of age, but R1/R2 was not affected by age. The COR for R1–R5 ranged from 43% to 52% for normal subjects and from 43% to 59% for hydroxychloroquine users; for R1/R2 the COR was 29% in normal subjects and 45% in hydroxychloroquine users. Conclusion: mfERG measurements show high test-retest variability, limiting the ability of a single mfERG test to influence a decision to stop hydroxychloroquine; corroborative evidence with a different ancillary test is recommended in a suspicious case. Keywords: multifocal electroretinography, hydroxychloroquine, test-retest variability 

  2. Response process and test-retest reliability of the Context Assessment for Community Health tool in Vietnam.

    Science.gov (United States)

    Duc, Duong M; Bergström, Anna; Eriksson, Leif; Selling, Katarina; Thi Thu Ha, Bui; Wallin, Lars

    2016-01-01

    The recently developed Context Assessment for Community Health (COACH) tool aims to measure aspects of the local healthcare context perceived to influence knowledge translation in low- and middle-income countries. The tool measures eight dimensions (organizational resources, community engagement, monitoring services for action, sources of knowledge, commitment to work, work culture, leadership, and informal payment) through 49 items. The study aimed to explore the understanding and stability of the COACH tool among health providers in Vietnam. To investigate the response process, think-aloud interviews were undertaken with five community health workers, six nurses and midwives, and five physicians. Identified problems were classified according to Conrad and Blair's taxonomy and grouped according to an estimation of the magnitude of the problem's effect on the response data. Further, the stability of the tool was examined using a test-retest survey among 77 respondents. The reliability was analyzed for items (intraclass correlation coefficient (ICC) and percent agreement) and dimensions (ICC and Bland-Altman plots). In general, the think-aloud interviews revealed that the COACH tool was perceived as clear, well organized, and easy to answer. Most items were understood as intended. However, seven prominent problems in the items were identified and the content of three dimensions was perceived to be of a sensitive nature. In the test-retest survey, two-thirds of the items and seven of eight dimensions were found to have an ICC agreement ranging from moderate to substantial (0.5-0.7), demonstrating that the instrument has an acceptable level of stability. This study provides evidence that the Vietnamese translation of the COACH tool is generally perceived to be clear and easy to understand and has acceptable stability. There is, however, a need to rephrase and add generic examples to clarify some items and to further review items with low ICC.

  3. Test-retest reliability of [{sup 11}C]AZ10419369 binding to 5-HT{sub 1B} receptors in human brain

    Energy Technology Data Exchange (ETDEWEB)

    Nord, Magdalena; Finnema, Sjoerd J.; Schain, Martin; Halldin, Christer; Farde, Lars [Karolinska Institutet, Center for Psychiatric Research, R5:00, Karolinska University Hospital, Department of Clinical Neuroscience, Stockholm (Sweden)

    2014-02-15

    [{sup 11}C]AZ10419369 is a recently developed 5-HT{sub 1B} receptor radioligand that is sensitive to changes in endogenous serotonin concentrations in the primate brain. Thus, [{sup 11}C] AZ10419369 may serve as a useful tool in clinical studies of the pathophysiology and pharmacological treatment of diseases related to the serotonin system, such as depression and anxiety disorders. The aim of this study was to evaluate the test-retest reliability of [{sup 11}C]AZ10419369. Eight men were examined with PET and [{sup 11}C] AZ10419369 twice on the same day. The binding potentials (BP{sub ND}) of [{sup 11}C]AZ10419369 in selected serotonergic projection areas and in the raphe nuclei (RN) were determined using the simplified reference tissue model, and for comparison also using a wavelet-aided parametric imaging approach. The BP{sub ND} values obtained from the first and second PET scans were compared by means of descriptive statistics, difference, absolute variability and intraclass correlation coefficient. Similar BP{sub ND} values were obtained with the two methods. The absolute mean differences in BP{sub ND} between PET 1 and PET 2 were less than 3 % in all serotonergic projection regions. Absolute variabilities were low in cortical regions (5 - 7 %), low to moderate (7 - 14 %) in subcortical regions, but higher (20 %) in the RN. The BP{sub ND} of [{sup 11}C]AZ10419369 is highly reproducible in cortical regions and satisfactory in subcortical projection areas. The variability in the RN is higher. Thus larger sample sizes or larger divergences are required to assess a potential difference between subjects or between experimental conditions in this region. (orig.)

  4. Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project

    Science.gov (United States)

    2011-01-01

    Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items. PMID:22152048

  5. Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project

    Directory of Open Access Journals (Sweden)

    Singh Amika S

    2011-12-01

    Full Text Available Abstract Background Insight in children's energy balance-related behaviours (EBRBs and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77% showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23% and poor for one item. Construct validity appeared to be good to excellent for 70 (47% of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26% and poor for 41 items (27%. Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items.

  6. Intensity response function of the photopic negative response (PhNR): effect of age and test-retest reliability.

    Science.gov (United States)

    Joshi, Nabin R; Ly, Emma; Viswanathan, Suresh

    2017-08-01

    To assess the effect of age and test-retest reliability of the intensity response function of the full-field photopic negative response (PhNR) in normal healthy human subjects. Full-field electroretinograms (ERGs) were recorded from one eye of 45 subjects, and 39 of these subjects were tested on two separate days with a Diagnosys Espion System (Lowell, MA, USA). The visual stimuli consisted of brief (test-retest reliability was assessed with the Wilcoxon signed-rank test and Bland-Altman analysis. Holm's correction was applied to account for multiple comparisons. V max of BT was significantly smaller than that of PT and b-wave, and the V max of PT and b-wave was not significantly different from each other. The slope parameter n was smallest for BT and the largest for b-wave and the difference between the slopes of all three measures were statistically significant. Small differences observed in the mean values of K for the different measures did not reach statistical significance. The Wilcoxon signed-rank test indicated no significant differences between the two test visits for any of the Naka-Rushton parameters for the three ERG measures, and the Bland-Altman plots indicated that the mean difference between test and retest measurements of the different fit parameters was close to zero and within 6% of the average of the test and retest values of the respective parameters for all three ERG measurements, indicating minimal bias. While the coefficient of reliability (COR, defined as 1.96 times the standard deviation of the test and retest difference) of each fit parameter was more or less comparable across the three ERG measurements, the %COR (COR normalized to the mean test and retest measures) was generally larger for BT compared to both PT and b-wave for each fit parameter. The Naka-Rushton fit parameters did not show statistically significant changes with age for any of the ERG measures when corrections were applied for multiple comparisons. However, the V max of

  7. Test-retest reliability of the novel 5-HT1B receptor PET radioligand [11C]P943

    International Nuclear Information System (INIS)

    Saricicek, Aybala; Chen, Jason; Ruf, Barbara; Planeta, Beata; Labaree, David; Gallezot, Jean-Dominique; Huang, Yiyun; Subramanyam, Kalyani; Maloney, Kathleen; Matuskey, David; Deserno, Lorenz; Neumeister, Alexander; Krystal, John H.; Carson, Richard E.; Bhagwagar, Zubin

    2015-01-01

    [ 11 C]P943 is a novel, highly selective 5-HT 1B PET radioligand. The aim of this study was to determine the test-retest reliability of [ 11 C]P943 using two different modeling methods and to perform a power analysis with each quantification technique. Seven healthy volunteers underwent two PET scans on the same day. Regions of interest (ROIs) were the amygdala, hippocampus, pallidum, putamen, insula, frontal, anterior cingulate, parietal, temporal and occipital cortices, and cerebellum. Two multilinear radioligand quantification techniques were used to estimate binding potential: MA1, using arterial input function data, and the second version of the multilinear reference tissue model analysis (MRTM2), using the cerebellum as the reference region. Between-scan percent variability and intraclass correlation coefficients (ICC) were used to assess test-retest reliability. We also performed power analyses to determine the method that would allow the least number of subjects using within-subject or between-subject study designs. A voxel-wise ICC analysis for MRTM2 BP ND was performed for the whole brain and all the ROIs studied. Mean percent variability between two scans across regions ranged between 0.4 % and 12.4 % for MA1 BP ND , 0.5 % and 11.5 % for MA1 BP P , 16.7 % and 28.3 % for MA1 BP F , and between 0.2 % and 5.4 % for MRTM2 BP ND . The power analyses showed a greater number of subjects were required using MA1 BP F compared with other outcome measures for both within-subject and between-subject study designs. ICC values were the highest using MRTM2 BP ND and the lowest with MA1 BP F in ten ROIs. Small regions and regions with low binding had lower ICC values than large regions and regions with high binding. Reliable measures of 5-HT 1B receptor binding can be obtained using the novel PET radioligand [ 11 C]P943. Quantification of 5-HT 1B receptor binding with MRTM2 BP ND and with MA1 BP P provided the least variability and optimal power for within-subject and

  8. Test--retest variability of Randot stereoacuity measures gathered in an unselected sample of UK primary school children.

    Science.gov (United States)

    Adler, Paul; Scally, Andrew J; Barrett, Brendan T

    2012-05-01

    To determine the test-retest reliability of the Randot stereoacuity test when used as part of vision screening in schools. Randot stereoacuity (graded-circles) and logMAR visual acuity measures were gathered in an unselected sample of 139 children (aged 4-12, mean 8.1±2.1 years) in two schools. Randot testing was repeated on two occasions (average interval between successive tests 8 days, range: 1-21 days). Three Randot scores were obtained in 97.8% of children. Randot stereoacuity improved by an average of one plate (ie, one test level) on repeat testing but was little changed when tested on the third occasion. Within-subject variability was up to three test levels on repeat testing. When stereoacuity was categorised as 'fine', 'intermediate' or 'coarse', the greatest variability was found among younger children who exhibited 'intermediate' or 'coarse'/nil stereopsis on initial testing. Whereas 90.8% of children with 'fine' stereopsis (≤50 arc-seconds) on the first test exhibited 'fine' stereopsis on both subsequent tests, only ∼16% of children with 'intermediate' (>50 but ≤140 arc-seconds) or 'coarse'/nil (≥200 arc-seconds) stereoacuity on initial testing exhibited stable test results on repeat testing. Children exhibiting abnormal stereoacuity on initial testing are very likely to exhibit a normal result when retested. The value of a single, abnormal Randot graded-circles stereoacuity measure from school screening is therefore questionable.

  9. Validity and test-retest reliability of manual goniometers for measuring passive hip range of motion in femoroacetabular impingement patients.

    Directory of Open Access Journals (Sweden)

    Nussbaumer Silvio

    2010-08-01

    Full Text Available Abstract Background The aims of this study were to evaluate the construct validity (known group, concurrent validity (criterion based and test-retest (intra-rater reliability of manual goniometers to measure passive hip range of motion (ROM in femoroacetabular impingement patients and healthy controls. Methods Passive hip flexion, abduction, adduction, internal and external rotation ROMs were simultaneously measured with a conventional goniometer and an electromagnetic tracking system (ETS on two different testing sessions. A total of 15 patients and 15 sex- and age-matched healthy controls participated in the study. Results The goniometer provided greater hip ROM values compared to the ETS (range 2.0-18.9 degrees; P P Conclusions The present study suggests that goniometer-based assessments considerably overestimate hip joint ROM by measuring intersegmental angles (e.g., thigh flexion on trunk for hip flexion rather than true hip ROM. It is likely that uncontrolled pelvic rotation and tilt due to difficulties in placing the goniometer properly and in performing the anatomically correct ROM contribute to the overrating of the arc of these motions. Nevertheless, conventional manual goniometers can be used with confidence for longitudinal assessments in the clinic.

  10. Development, content validity and test-retest reliability of the Lifelong Physical Activity Skills Battery in adolescents.

    Science.gov (United States)

    Hulteen, Ryan M; Barnett, Lisa M; Morgan, Philip J; Robinson, Leah E; Barton, Christian J; Wrotniak, Brian H; Lubans, David R

    2018-03-28

    Numerous skill batteries assess fundamental motor skill (e.g., kick, hop) competence. Few skill batteries examine lifelong physical activity skill competence (e.g., resistance training). This study aimed to develop and assess the content validity, test-retest and inter-rater reliability of the "Lifelong Physical Activity Skills Battery". Development of the skill battery occurred in three stages: i) systematic reviews of lifelong physical activity participation rates and existing motor skill assessment tools, ii) practitioner consultation and iii) research expert consultation. The final battery included eight skills: grapevine, golf swing, jog, push-up, squat, tennis forehand, upward dog and warrior I. Adolescents (28 boys, 29 girls; M = 15.8 years, SD = 0.4 years) completed the Lifelong Physical Activity Skills Battery on two occasions two weeks apart. The skill battery was highly reliable (ICC = 0.84, 95% CI = 0.72-0.90) with individual skill reliability scores ranging from moderate (warrior I; ICC = 0.56) to high (tennis forehand; ICC = 0.82). Typical error (4.0; 95% CI 3.4-5.0) and proportional bias (r = -0.21, p = .323) were low. This study has provided preliminary evidence for the content validity and reliability of the Lifelong Physical Activity Skills Battery in an adolescent population.

  11. An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Reading Assessments: Grade 1. Technical Report #1216

    Science.gov (United States)

    Anderson, Daniel; Park, Jasmine, Bitnara; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald

    2012-01-01

    This technical report is one in a series of five describing the reliability (test/retest/and alternate form) and G-Theory/D-Study research on the easy CBM reading measures, grades 1-5. Data were gathered in the spring 2011 from a convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest. Due…

  12. Short-interval test-retest interrater reliability of the Dutch version of the structured clinical interview for DSM-IV personality disorders (SCID-II)

    NARCIS (Netherlands)

    Weertman, A; ArntZ, A; Dreessen, L; van Velzen, C; Vertommen, S

    2003-01-01

    This study examined the short-interval test-retest reliability of the Structured Clinical Interview (SCID-II: First, Spitzer, Gibbon, & Williams, 1995) for DSM-IV personality disorders (PDs). The SCID-II was administered to 69 in- and outpatients on two occasions separated by 1 to 6 weeks. The

  13. Inter-Rater and Test-Retest (Between-Sessions) Reliability of the 4-Skills Scan for Dutch Elementary School Children

    Science.gov (United States)

    van Kernebeek, Willem G.; de Schipper, Antoine W.; Savelsbergh, Geert J. P.; Toussaint, Huub M.

    2018-01-01

    In The Netherlands, the 4-Skills Scan is an instrument for physical education teachers to assess gross motor skills of elementary school children. Little is known about its reliability. Therefore, in this study the test-retest and inter-rater reliability was determined. Respectively, 624 and 557 Dutch 6- to 12-year-old children were analyzed for…

  14. Establishing the Test-Retest Reliability & Concurrent Validity for the Repeat Ice Skating Test (RIST) in Adolescent Male Ice Hockey Players

    Science.gov (United States)

    Power, Allan; Faught, Brent E.; Przysucha, Eryk; McPherson, Moira; Montelpare, William

    2012-01-01

    In this study the authors examine the test-retest reliability and concurrent validity of the Repeat Ice Skating Test (RIST). This was an on-ice field anaerobic test that measured average peak power and was validated with 3 anaerobic lab tests: (a) vertical jump, (b) the Margaria-Kalamen stair test, and (c) the Wingate Anaerobic Test. The…

  15. Multilevel Factor Structure, Concurrent Validity, and Test-Retest Reliability of the High School Teacher Version of the Authoritative School Climate Survey

    Science.gov (United States)

    Huang, Francis L.; Cornell, Dewey G.

    2016-01-01

    Although school climate has long been recognized as an important factor in the school improvement process, there are few psychometrically supported measures based on teacher perspectives. The current study replicated and extended the factor structure, concurrent validity, and test-retest reliability of the teacher version of the Authoritative…

  16. Test-retest reliability at the item level and total score level of the Norwegian version of the Spinal Cord Injury Falls Concern Scale (SCI-FCS).

    Science.gov (United States)

    Roaldsen, Kirsti Skavberg; Måøy, Åsa Blad; Jørgensen, Vivien; Stanghelle, Johan Kvalvik

    2016-05-01

    Translation of the Spinal Cord Injury Falls Concern Scale (SCI-FCS), and investigation of test-retest reliability on item-level and total-score-level. Translation, adaptation and test-retest study. A specialized rehabilitation setting in Norway. Fifty-four wheelchair users with a spinal cord injury. The median age of the cohort was 49 years, and the median number of years after injury was 13. Interventions/measurements: The SCI-FCS was translated and back-translated according to guidelines. Individuals answered the SCI-FCS twice over the course of one week. We investigated item-level test-retest reliability using Svensson's rank-based statistical method for disagreement analysis of paired ordinal data. For relative reliability, we analyzed the total-score-level test-retest reliability with intraclass correlation coefficients (ICC2.1), the standard error of measurement (SEM), and the smallest detectable change (SDC) for absolute reliability/measurement-error assessment and Cronbach's alpha for internal consistency. All items showed satisfactory percentage agreement (≥69%) between test and retest. There were small but non-negligible systematic disagreements among three items; we recovered an 11-13% higher chance for a lower second score. There was no disagreement due to random variance. The test-retest agreement (ICC2.1) was excellent (0.83). The SEM was 2.6 (12%), and the SDC was 7.1 (32%). The Cronbach's alpha was high (0.88). The Norwegian SCI-FCS is highly reliable for wheelchair users with chronic spinal cord injuries.

  17. The Dichotic Digits difference Test (DDdT): Development, Normative Data, and Test-Retest Reliability Studies Part 1.

    Science.gov (United States)

    Cameron, Sharon; Glyde, Helen; Dillon, Harvey; Whitfield, Jessica; Seymour, John

    2016-06-01

    The dichotic digits test is one of the most widely used assessment tools for central auditory processing disorder. However, questions remain concerning the impact of cognitive factors on test results. To develop the Dichotic Digits difference Test (DDdT), an assessment tool that could differentiate children with cognitive deficits from children with genuine dichotic deficits based on differential test results. The DDdT consists of four subtests: dichotic free recall (FR), dichotic directed left ear (DLE), dichotic directed right ear (DRE), and diotic. Scores for six conditions are calculated (FR left ear [LE], FR right ear [RE], and FR total, as well as DLE, DRE, and diotic). Scores for four difference measures are also calculated: dichotic advantage, right-ear advantage (REA) FR, REA directed, and attention advantage. Experiment 1 involved development of the DDdT, including error rate analysis. Experiment 2 involved collection of normative and test-retest reliability data. Twenty adults (aged 25 yr 10 mo to 50 yr 7 mo, mean 36 yr 4 mo) took part in the development study; 62 normal-hearing, typically developing, primary-school children (aged 7 yr 1 mo to 11 yr 11 mo, mean 9 yr 4 mo) and 10 adults (aged 25 yr 0 mo to 51 yr 6 mo, mean 34 yr 10 mo) took part in the normative and test-retest reliability study. In Experiment 1, error rate analysis was conducted on the 36 digit-pair combinations of the DDdT. Normative data collected in Experiment 2 were arcsine transformed to achieve a distribution that was closer to a normal distribution and z-scores calculated. Pearson product-moment correlations were used to determine the strength of relationships between DDdT conditions. The development study revealed no significant differences in the adult population between test and retest on any DDdT condition. Error rates on 36 digit pairs ranged from 1.5% to 16.7%. The most and the least error-prone digits were removed before commencement of the normative data study, leaving 25

  18. Feasibility and test-retest reliability of measuring lower‑limb strength in young children with cerebral palsy.

    Science.gov (United States)

    Van Vulpen, L F; De Groot, S; Becher, J G; De Wolf, G S; Dallmeijer, A J

    2013-12-01

    Quantifying leg muscle strength in young children with cerebral palsy (CP) is essential for identifying muscle groups for treatment and for monitoring progress. To study the feasibility, intratester reliability and the optimal test design (number of test occasions and repetitions) of measuring lower-limb strength with handheld dynamometry (HHD) and dynamic ankle plantar flexor strength with the standing heel-rise (SH) test in 3-10 year aged children with CP. Test-retest design. Rehabilitation centre, special needs school for children with disabilities, and university medical centre. Knee extensor, hip abductor and calf muscle strength was assessed in 20 ambulatory children with spastic CP (3-5 years [N.=10] and 6-10 years [N.=10]) on two test occasions. Intraclass correlation coefficients (ICC) and Smallest Detectable Differences (SDD) were calculated to determine the optimal test design for detecting changes in strength. All isometric strength tests had acceptable SDDs (9-30%), when taking the mean values of 2-3 test occasions (separate days) and 2-3 repetitions. The one-leg SH test had large SDDs (40-128% for younger group, 23-48% for older group). Isometric strength (improvements) can only be measured reliably with HHD in young children with CP when the average values over at least 2 test occasions are taken. Reliability of the SH test is not sufficient for measuring individual changes in dynamic muscle strength in the younger children. Results of this study can be used to determine the optimal number of test occasions and repetitions for reliable HHD measurements depending on expected changes, muscle group and age in 3-10 year old children with CP.

  19. Test-retest reliability of the assessment of postural stability in typically developing children and in hearing impaired children.

    Science.gov (United States)

    De Kegel, A; Dhooge, I; Cambier, D; Baetens, T; Palmans, T; Van Waelvelde, H

    2011-04-01

    The purpose of this study was to establish test-retest reliability of centre of pressure (COP) measurements obtained by an AccuGait portable forceplate (ACG), mean COG sway velocity measured by a Basic Balance Master (BBM) and clinical balance tests in children with and without balance difficulties. 49 typically developing children and 23 hearing impaired children, with a higher risk for stability problems, between 6 and 12 years of age participated. Each child performed the modified Clinical Test of Sensory Interaction on Balance (mCTSIB), Unilateral Stance (US) and Tandem Stance on ACG, mCTSIB and US on BBM and clinical balance tests: one-leg standing, balance beam walking and one-leg hopping. All subjects completed 2 test sessions on 2 different days in the same week assessed by the same examiner. Among COP measurements obtained by the ACG, mean sway velocity was the most reliable parameter with all ICCs higher than 0.72. The standard deviation (SD) of sway velocity, sway area, SD of anterior-posterior and SD of medio-lateral COP data showed moderate to excellent reliability with ICCs between 0.55 and 0.96 but some caution must be taken into account in some conditions. BBM is less reliable but clinical balance tests are as reliable as ACG. Hearing impaired children exhibited better relative reliability (ICC) and comparable absolute reliability (SEM) for most balance parameters compared to typically developing children. Reliable information regarding postural stability of typically developing children and hearing impaired children may be obtained utilizing COP measurements generated by an AccuGait system and clinical balance tests. Copyright © 2011 Elsevier B.V. All rights reserved.

  20. Which is the most useful patient-reported outcome in femoroacetabular impingement? Test-retest reliability of six questionnaires.

    Science.gov (United States)

    Hinman, Rana S; Dobson, Fiona; Takla, Amir; O'Donnell, John; Bennell, Kim L

    2014-03-01

    The most reliable patient-reported outcomes (PROs) for people with femoroacetabular impingement (FAI) is unknown because there have been no direct comparisons of questionnaires. Thus, the aim was to evaluate the test-retest reliability of six existing PROs in a single cohort of young active people with hip/groin pain consistent with a clinical diagnosis of FAI. Young adults with clinical FAI completed six PRO questionnaires on two occasions, 1-2 weeks apart. The PROs were modified Harris Hip Score, Hip dysfunction and Osteoarthritis Score, Hip Outcome Score, Non-Arthritic Hip Score, International Hip Outcome Tool, Copenhagen Hip and Groin Outcome Score. 30 young adults (mean age 24 years, SD 4 years, range 18-30 years; 15 men) with stable symptoms participated. Intraclass correlation coefficient(3,1) values ranged from 0.73 to 0.93 (95% CI 0.38 to 0.98) indicating that most questionnaires reached minimal reliability benchmarks. Measurement error at the individual level was quite large for most questionnaires (minimal detectable change (MDC95) 12.4-35.6, 95% CI 8.7 to 54.0). In contrast, measurement error at the group level was quite small for most questionnaires (MDC95 2.2-7.3, 95% CI 1.6 to 11). The majority of the questionnaires were reliable and precise enough for use at the group level. Samples of only 23-30 individuals were required to achieve acceptable measurement variation at the group level. Further direct comparisons of these questionnaires are required to assess other measurement properties such as validity, responsiveness and meaningful change in young people with FAI.

  1. Test-retest reliability of fMRI-based graph theoretical properties during working memory, emotion processing, and resting state.

    Science.gov (United States)

    Cao, Hengyi; Plichta, Michael M; Schäfer, Axel; Haddad, Leila; Grimm, Oliver; Schneider, Michael; Esslinger, Christine; Kirsch, Peter; Meyer-Lindenberg, Andreas; Tost, Heike

    2014-01-01

    The investigation of the brain connectome with functional magnetic resonance imaging (fMRI) and graph theory analyses has recently gained much popularity, but little is known about the robustness of these properties, in particular those derived from active fMRI tasks. Here, we studied the test-retest reliability of brain graphs calculated from 26 healthy participants with three established fMRI experiments (n-back working memory, emotional face-matching, resting state) and two parcellation schemes for node definition (AAL atlas, functional atlas proposed by Power et al.). We compared the intra-class correlation coefficients (ICCs) of five different data processing strategies and demonstrated a superior reliability of task-regression methods with condition-specific regressors. The between-task comparison revealed significantly higher ICCs for resting state relative to the active tasks, and a superiority of the n-back task relative to the face-matching task for global and local network properties. While the mean ICCs were typically lower for the active tasks, overall fair to good reliabilities were detected for global and local connectivity properties, and for the n-back task with both atlases, smallworldness. For all three tasks and atlases, low mean ICCs were seen for the local network properties. However, node-specific good reliabilities were detected for node degree in regions known to be critical for the challenged functions (resting-state: default-mode network nodes, n-back: fronto-parietal nodes, face-matching: limbic nodes). Between-atlas comparison demonstrated significantly higher reliabilities for the functional parcellations for global and local network properties. Our findings can inform the choice of processing strategies, brain atlases and outcome properties for fMRI studies using active tasks, graph theory methods, and within-subject designs, in particular future pharmaco-fMRI studies. © 2013 Elsevier Inc. All rights reserved.

  2. Test-retest reliability of diffusion tensor imaging of the liver at 3.0 T.

    Science.gov (United States)

    Girometti, Rossano; Maieron, Marta; Lissandrello, Giovanni; Bazzocchi, Massimo; Zuiani, Chiara

    2015-06-01

    This study was done to evaluate test-retest reliability of liver diffusion tensor imaging (LDTI). Ten healthy volunteers (median age 23 years) underwent two LDTI scans on a 3.0 T magnet during two imaging sessions separated by 2 weeks (session-1/-2, respectively). Fifteen gradient directions and b values of 0-1,000 s/mm(2) were used. Two radiologists in consensus assessed liver apparent diffusion coefficient (ADC) and fraction of anisotropy (FA) values on ADC and FA maps at four reference levels, namely: right upper level (RUL), right lower level (RLL), left upper level (LUL) and left lower level (LLL). We then assessed (a) whether ADC and FA values overlapped when measured on different levels within the same imaging session or between different imaging sessions; (b) the degree of variability on an intra-session and inter-session basis, respectively, using the coefficient of variation (CV). In sessions 1 and 2, the ADC/FA values were significantly larger in the left liver lobe (LUL/LLL) compared to right liver lobe (RUL/RLL) (p < 0.05/6). Intra-session CVs were 9.51 % (session 1) and 9.73 % (session 2) for ADC, and 12.93 % (session 1) and 11.82 % (session 2) for FA, respectively. When comparing RUL, RLL, LUL and LLL on an inter-session basis, CVs were 6.52, 8.20, 6.52 and 11.06 % for ADC, and 15.42, 15.80, 15.42 and 6.80 % for FA, respectively. LDTI provides consistent and repeatable measurements. However, since larger left lobe ADC/FA values can be attributed to artefacts, right lobe values should be considered the most reliable measurements of water diffusivity within the liver.

  3. Test-retest reliability and longitudinal analysis of automated hippocampal subregion volumes in healthy ageing and Alzheimer's disease populations.

    Science.gov (United States)

    Worker, Amanda; Dima, Danai; Combes, Anna; Crum, William R; Streffer, Johannes; Einstein, Steven; Mehta, Mitul A; Barker, Gareth J; C R Williams, Steve; O'daly, Owen

    2018-04-01

    The hippocampal formation is a complex brain structure that is important in cognitive processes such as memory, mood, reward processing and other executive functions. Histological and neuroimaging studies have implicated the hippocampal region in neuropsychiatric disorders as well as in neurodegenerative diseases. This highly plastic limbic region is made up of several subregions that are believed to have different functional roles. Therefore, there is a growing interest in imaging the subregions of the hippocampal formation rather than modelling the hippocampus as a homogenous structure, driving the development of new automated analysis tools. Consequently, there is a pressing need to understand the stability of the measures derived from these new techniques. In this study, an automated hippocampal subregion segmentation pipeline, released as a developmental version of Freesurfer (v6.0), was applied to T1-weighted magnetic resonance imaging (MRI) scans of 22 healthy older participants, scanned on 3 separate occasions and a separate longitudinal dataset of 40 Alzheimer's disease (AD) patients. Test-retest reliability of hippocampal subregion volumes was assessed using the intra-class correlation coefficient (ICC), percentage volume difference and percentage volume overlap (Dice). Sensitivity of the regional estimates to longitudinal change was estimated using linear mixed effects (LME) modelling. The results show that out of the 24 hippocampal subregions, 20 had ICC scores of 0.9 or higher in both samples; these regions include the molecular layer, granule cell layer of the dentate gyrus, CA1, CA3 and the subiculum (ICC > 0.9), whilst the hippocampal fissure and fimbria had lower ICC scores (0.73-0.88). Furthermore, LME analysis of the independent AD dataset demonstrated sensitivity to group and individual differences in the rate of volume change over time in several hippocampal subregions (CA1, molecular layer, CA3, hippocampal tail, fissure and presubiculum

  4. Quantitative and Qualitative Responses to Topical Cold in Healthy Caucasians Show Variance between Individuals but High Test-Retest Reliability.

    Directory of Open Access Journals (Sweden)

    Penny Moss

    Full Text Available Increased sensitivity to cold may be a predictor of persistent pain, but cold pain threshold is often viewed as unreliable. This study aimed to determine the within-subject reliability and between-subject variance of cold response, measured comprehensively as cold pain threshold plus pain intensity and sensation quality at threshold. A test-retest design was used over three sessions, one day apart. Response to cold was assessed at four sites (thenar eminence, volar forearm, tibialis anterior, plantar foot. Cold pain threshold was measured using a Medoc thermode and standard method of limits. Intensity of pain at threshold was rated using a 10cm visual analogue scale. Quality of sensation at threshold was quantified with indices calculated from subjects' selection of descriptors from a standard McGill Pain Questionnaire. Within-subject reliability for each measure was calculated with intra-class correlation coefficients and between-subject variance was evaluated as group coefficient of variation percentage (CV%. Gender and site comparisons were also made. Forty-five healthy adults participated: 20 male, 25 female; mean age 29 (range 18-56 years. All measures at all four test sites showed high within-subject reliability: cold pain thresholds r = 0.92-0.95; pain rating r = 0.93-0.97; McGill pain quality indices r = 0.87-0.85. In contrast, all measures showed wide between-subject variance (CV% between 51.4% and 92.5%. Upper limb sites were consistently more sensitive than lower limb sites, but equally reliable. Females showed elevated cold pain thresholds, although similar pain intensity and quality to males. Females were also more reliable and showed lower variance for all measures. Thus, although there was clear population variation, response to cold for healthy individuals was found to be highly reliable, whether measured as pain threshold, pain intensity or sensation quality. A comprehensive approach to cold response testing therefore may add

  5. Quantitative and Qualitative Responses to Topical Cold in Healthy Caucasians Show Variance between Individuals but High Test-Retest Reliability.

    Science.gov (United States)

    Moss, Penny; Whitnell, Jasmine; Wright, Anthony

    2016-01-01

    Increased sensitivity to cold may be a predictor of persistent pain, but cold pain threshold is often viewed as unreliable. This study aimed to determine the within-subject reliability and between-subject variance of cold response, measured comprehensively as cold pain threshold plus pain intensity and sensation quality at threshold. A test-retest design was used over three sessions, one day apart. Response to cold was assessed at four sites (thenar eminence, volar forearm, tibialis anterior, plantar foot). Cold pain threshold was measured using a Medoc thermode and standard method of limits. Intensity of pain at threshold was rated using a 10cm visual analogue scale. Quality of sensation at threshold was quantified with indices calculated from subjects' selection of descriptors from a standard McGill Pain Questionnaire. Within-subject reliability for each measure was calculated with intra-class correlation coefficients and between-subject variance was evaluated as group coefficient of variation percentage (CV%). Gender and site comparisons were also made. Forty-five healthy adults participated: 20 male, 25 female; mean age 29 (range 18-56) years. All measures at all four test sites showed high within-subject reliability: cold pain thresholds r = 0.92-0.95; pain rating r = 0.93-0.97; McGill pain quality indices r = 0.87-0.85. In contrast, all measures showed wide between-subject variance (CV% between 51.4% and 92.5%). Upper limb sites were consistently more sensitive than lower limb sites, but equally reliable. Females showed elevated cold pain thresholds, although similar pain intensity and quality to males. Females were also more reliable and showed lower variance for all measures. Thus, although there was clear population variation, response to cold for healthy individuals was found to be highly reliable, whether measured as pain threshold, pain intensity or sensation quality. A comprehensive approach to cold response testing therefore may add validity and

  6. Test-Retest Reliability of Measurements of Hand-Grip Strength Obtained by Dynamometry from Older Adults: A Systematic Review of Research in the PubMed Database.

    Science.gov (United States)

    Bohannon, R W

    2017-01-01

    A systematic review was performed to summarize literature describing the test-retest reliability of grip strength measures obtained from older adults. Relevant literature was identified via a PubMed search. Seventeen articles were deemed appropriate based on inclusion and exclusion criteria. The relative test-retest reliability of grip strength measures obtained by dynamometry was good to excellent (intra-class correlation coefficients > 0.80) in all but 3 studies, which involved older adults with severe dementia. Absolute reliability, as indicated by summary statistics such as the minimum detectable change (95%), was more variable. As a percentage, that change ranged from 14.5% to 98.5%. Consequently, clinicians can be confident in the relative reliability of grip strength measures obtained from at risk older adults. However, relatively large percentage changes in grip strength may be necessary to conclude with confidence that a real change has occurred over time in some populations.

  7. Test-Retest Reliability, Convergent Validity, and Internal Consistency of the Persian Version of Fullerton Advanced Balance Scale in Iranian Community-Dwelling Older Adults

    OpenAIRE

    Azar Sabet; Akram Azad; Ghorban Taghizadeh

    2016-01-01

    Objectives: This study was performed to evaluate convergent validity, test-retest reliability and internal consistency of the Persian translation of the Fullerton advanced balance (FAB) for use in Iranian community- dwelling older adults and improve the quality of their functional balance assessment. Methods & Materials: The original scale was translated with forward-backward protocol. In the next step, using convenience sampling and inclusion criteria, 88 functionally indep...

  8. Test-retest repeatability of strength capacity, aerobic power and pericranial tenderness of neck and shoulder muscles in children - relevant for tension-type headache

    DEFF Research Database (Denmark)

    Tornøe, Birte; Andersen, Lars L; Skotte, J H

    2013-01-01

    Frequent or chronic tension-type headache in children is a prevalent and debilitating condition for the child, often leading to medication overuse. To explore the relationship between physical factors and tension-type headache in children, the quality of repeated measures was examined. The aim of...... of the present study was to determine the test-retest repeatability of parameters determining isometric neck and shoulder strength and stability, aerobic power, and pericranial tenderness in children....

  9. Internal consistency, reliability, and temporal stability of the Oxford Happiness Questionnaire short-form: Test-retest data over two weeks

    OpenAIRE

    MCGUCKIN, CONOR

    2006-01-01

    PUBLISHED The Oxford Happiness Questionnaire short-form is a recently developed eight-item measure of happiness. This study evaluated the internal consistency reliability and test-retest reliability of the Oxford Happiness Questionnaire short-form among 55 Northern Irish undergraduate university students who completed the measure on two occasions separated by two weeks. Internal consistency of the measure on both occasions was satisfactory at both Time 1 (alpha = .62) and Time 2 (alpha = ....

  10. An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Reading Assessments: Grade 5. Technical Report #1220

    Science.gov (United States)

    Lai, Cheng-Fei; Park, Bitnara Jasmine; Anderson, Daniel; Alonzo, Julie; Tindal, Gerald

    2012-01-01

    This technical report is one in a series of five describing the reliability (test/retest and alternate form) and G-Theory/D-Study research on the easyCBM reading measures, grades 1-5. Data were gathered in the spring of 2011 from a convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest.…

  11. The interrater and test-retest reliability of the Home Falls and Accidents Screening Tool (HOME FAST) in Malaysia: Using raters with a range of professional backgrounds.

    Science.gov (United States)

    Romli, Muhammad Hibatullah; Mackenzie, Lynette; Lovarini, Meryl; Tan, Maw Pin; Clemson, Lindy

    2017-06-01

    Falls can be a devastating issue for older people living in the community, including those living in Malaysia. Health professionals and community members have a responsibility to ensure that older people have a safe home environment to reduce the risk of falls. Using a standardised screening tool is beneficial to intervene early with this group. The Home Falls and Accidents Screening Tool (HOME FAST) should be considered for this purpose; however, its use in Malaysia has not been studied. Therefore, the aim of this study was to evaluate the interrater and test-retest reliability of the HOME FAST with multiple professionals in the Malaysian context. A cross-sectional design was used to evaluate interrater reliability where the HOME FAST was used simultaneously in the homes of older people by 2 raters and a prospective design was used to evaluate test-retest reliability with a separate group of older people at different times in their homes. Both studies took place in an urban area of Kuala Lumpur. Professionals from 9 professional backgrounds participated as raters in this study, and a group of 51 community older people were recruited for the interrater reliability study and another group of 30 for the test-retest reliability study. The overall agreement was moderate for interrater reliability and good for test-retest reliability. The HOME FAST was consistently rated by different professionals, and no bias was found among the multiple raters. The HOME FAST can be used with confidence by a variety of professionals across different settings. The HOME FAST can become a universal tool to screen for home hazards related to falls. © 2017 John Wiley & Sons, Ltd.

  12. An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Reading Assessments: Grade 2. Technical Report #1217

    Science.gov (United States)

    Anderson, Daniel; Lai, Cheg-Fei; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald

    2012-01-01

    This technical report is one in a series of five describing the reliability (test/retest an alternate form) and G-Theory/D-Study on the easyCBM reading measures, grades 1-5. Data were gathered in the spring of 2011 from the convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest. Due to…

  13. Test-retest reliability and construct validity of the DOiT (Dutch Obesity Intervention in Teenagers) questionnaire: measuring energy balance-related behaviours in Dutch adolescents.

    Science.gov (United States)

    Janssen, Evelien H C; Singh, Amika S; van Nassau, Femke; Brug, Johannes; van Mechelen, Willem; Chinapaw, Mai J M

    2014-02-01

    Adequate assessment of energy balance-related behaviours in adolescents is essential to develop and evaluate effective obesity prevention programmes. The present study examined the test-retest reliability and construct validity of a questionnaire assessing energy balance-related behaviours in adolescents during the evaluation of the DOiT (Dutch Obesity Intervention in Teenagers) intervention. To assess test-retest reliability, adolescents filled in the questionnaire twice (n 111). To assess construct validity, the results from the first test were compared with data collected in a personal cognitive interview (n 20, independent from the reliability study). For both reliability and validity, intraclass correlation coefficients for continuous data or Cohen's kappa coefficients for categorical data were calculated as well as percentage agreement. Data were collected during school time from February to May 2010. Study participants were Dutch adolescents aged 12-14 years attending pre-vocational secondary schools. In more than three-quarters of the ninety-five questionnaire items the test-retest reliability appeared to be good to excellent. Moderate reliability was found for all other twenty-one items. Fifty-one items (of ninety-five items) showed good to excellent construct validity. Construct validity appeared moderate in twenty-three items and poor in twenty-one items. Most items with poor construct validity concerned consumption of sugar-containing beverages and high-energy snacks/sweets. Our study showed good test-retest reliability and largely moderate to good construct validity for the majority of items of the DOiT questionnaire. Items with poor construct validity (most of them found for items concerning energy intake-related behaviours) should be revised and tested again to improve the questionnaire for future use.

  14. An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Passage Reading Fluency Assessments: Grade 4. Technical Report #1219

    Science.gov (United States)

    Park, Bitnara Jasmine; Anderson, Daniel; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald

    2012-01-01

    This technical report is one in a series of five describing the reliability (test/retest and alternate form) and G-Theory/D-Study research on the easyCBM reading measures, grades 1-5. Data were gathered in the spring of 2011 from a convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest.…

  15. Test-retest reliability and comparability of paper and computer questionnaires for the Finnish version of the Tampa Scale of Kinesiophobia.

    Science.gov (United States)

    Koho, P; Aho, S; Kautiainen, H; Pohjolainen, T; Hurri, H

    2014-12-01

    To estimate the internal consistency, test-retest reliability and comparability of paper and computer versions of the Finnish version of the Tampa Scale of Kinesiophobia (TSK-FIN) among patients with chronic pain. In addition, patients' personal experiences of completing both versions of the TSK-FIN and preferences between these two methods of data collection were studied. Test-retest reliability study. Paper and computer versions of the TSK-FIN were completed twice on two consecutive days. The sample comprised 94 consecutive patients with chronic musculoskeletal pain participating in a pain management or individual rehabilitation programme. The group rehabilitation design consisted of physical and functional exercises, evaluation of the social situation, psychological assessment of pain-related stress factors, and personal pain management training in order to regain overall function and mitigate the inconvenience of pain and fear-avoidance behaviour. The mean TSK-FIN score was 37.1 [standard deviation (SD) 8.1] for the computer version and 35.3 (SD 7.9) for the paper version. The mean difference between the two versions was 1.9 (95% confidence interval 0.8 to 2.9). Test-retest reliability was 0.89 for the paper version and 0.88 for the computer version. Internal consistency was considered to be good for both versions. The intraclass correlation coefficient for comparability was 0.77 (95% confidence interval 0.66 to 0.85), indicating substantial reliability between the two methods. Both versions of the TSK-FIN demonstrated substantial intertest reliability, good test-retest reliability, good internal consistency and acceptable limits of agreement, suggesting their suitability for clinical use. However, subjects tended to score higher when using the computer version. As such, in an ideal situation, data should be collected in a similar manner throughout the course of rehabilitation or clinical research. Copyright © 2014 Chartered Society of Physiotherapy. Published

  16. Reliability of the Swedish version of the Exercise Self-Efficacy Scale (S-ESES): a test-retest study in adults with neurological disease.

    Science.gov (United States)

    Ahlström, Isabell; Hellström, Karin; Emtner, Margareta; Anens, Elisabeth

    2015-03-01

    To examine the test-retest reliability of the Swedish translated version of the Exercise Self-Efficacy Scale (S-ESES) in people with neurological disease and to examine internal consistency. Test-retest study. A total of 30 adults with neurological diseases including: Parkinson's disease; Multiple Sclerosis; Cervical Dystonia; and Charcot-Marie-Tooth disease. The S-ESES was sent twice by surface mail. Completion interval mean was 16 days apart. Weighted kappa, intraclass correlation coefficient 2,1 [ICC (2,1)], standard error of measurement (SEM), also expressed as a percentage value (SEM%), and Cronbach's alpha were calculated. The relative reliability of the test-retest results showed substantial agreement measured using weighted kappa (MD = 0.62) and a very high-reliability ICC (2,1) (0.92). Absolute reliability measured using SEM was 5.3 and SEM% was 20.7. Excellent internal consistency was shown, with an alpha coefficient of 0.91 (test 1) and 0.93 (test 2). The S-ESES is recommended for use in research and in clinical work for people with neurological diseases. The low-absolute reliability, however, indicates a limited ability to measure changes on an individual level.

  17. Test-retest reliability of selected items of Health Behaviour in School-aged Children (HBSC survey questionnaire in Beijing, China

    Directory of Open Access Journals (Sweden)

    Liu Yang

    2010-08-01

    Full Text Available Abstract Background Children's health and health behaviour are essential for their development and it is important to obtain abundant and accurate information to understand young people's health and health behaviour. The Health Behaviour in School-aged Children (HBSC study is among the first large-scale international surveys on adolescent health through self-report questionnaires. So far, more than 40 countries in Europe and North America have been involved in the HBSC study. The purpose of this study is to assess the test-retest reliability of selected items in the Chinese version of the HBSC survey questionnaire in a sample of adolescents in Beijing, China. Methods A sample of 95 male and female students aged 11 or 15 years old participated in a test and retest with a three weeks interval. Student Identity numbers of respondents were utilized to permit matching of test-retest questionnaires. 23 items concerning physical activity, sedentary behaviour, sleep and substance use were evaluated by using the percentage of response shifts and the single measure Intraclass Correlation Coefficients (ICC with 95% confidence interval (CI for all respondents and stratified by gender and age. Items on substance use were only evaluated for school children aged 15 years old. Results The percentage of no response shift between test and retest varied from 32% for the item on computer use at weekends to 92% for the three items on smoking. Of all the 23 items evaluated, 6 items (26% showed a moderate reliability, 12 items (52% displayed a substantial reliability and 4 items (17% indicated almost perfect reliability. No gender and age group difference of the test-retest reliability was found except for a few items on sedentary behaviour. Conclusions The overall findings of this study suggest that most selected indicators in the HBSC survey questionnaire have satisfactory test-retest reliability for the students in Beijing. Further test-retest studies in a large

  18. Evaluating the test-retest reliability of symptom indices associated with the ImPACT post-concussion symptom scale (PCSS).

    Science.gov (United States)

    Merritt, Victoria C; Bradson, Megan L; Meyer, Jessica E; Arnett, Peter A

    2018-05-01

    The Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) is a commonly used tool in sports concussion assessment. While test-retest reliabilities have been established for the ImPACT cognitive composites, few studies have evaluated the psychometric properties of the ImPACT's Post-Concussion Symptom Scale (PCSS). The purpose of this study was to establish the test-retest reliability of symptom indices associated with the PCSS. Participants included 38 undergraduate students (50.0% male) who underwent neuropsychological testing as part of their participation in their psychology department's research subject pool. The majority of the participants were Caucasian (94.7%) and had no history of concussion (73.7%). All participants completed the ImPACT at two time points, approximately 6 weeks apart. The PCSS was the main outcome measure, and eight symptom indices were calculated (a total symptom score, three symptom summary indices, and four symptom clusters). Pearson correlations (r) and intraclass correlation coefficients (ICCs) were computed as measures of test-retest reliability. Overall, reliabilities ranged from low to high (r = .44 to .80; ICC = .44 to .77). The cognitive symptom cluster exhibited the highest test-retest reliability (r = .80, ICC = .77), followed by the positive symptom total (PST) index, an indicator of the total number of symptoms endorsed (r = .71, ICC = .69). In contrast, the commonly used total symptom score showed lower test-retest reliability (r = .67, ICC = .62). Paired-samples t tests revealed no significant differences between test and retest for any of the symptom variables (all p > .01). Finally, reliable change indices (RCI) were computed to determine whether differences observed between test and retest represented clinically significant change. RCI values were provided for each symptom index at the 80%, 90%, and 95% confidence intervals. These results suggest that evaluating additional symptom

  19. Test-retest measurements of dopamine D{sub 1}-type receptors using simultaneous PET/MRI imaging

    Energy Technology Data Exchange (ETDEWEB)

    Kaller, Simon; Patt, Marianne; Becker, Georg-Alexander; Luthardt, Julia; Meyer, Philipp M.; Werner, Peter; Barthel, Henryk; Bresch, Anke; Sabri, Osama [University of Leipzig, Department of Nuclear Medicine, Leipzig (Germany); Rullmann, Michael [University of Leipzig, Department of Nuclear Medicine, Leipzig (Germany); Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig (Germany); Girbardt, Johanna [Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig (Germany); Fritz, Thomas H. [Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig (Germany); University of Gent, Institute for Psychoacoustics and Electronic Music (IPEM), Ghent (Belgium); Hesse, Swen [University of Leipzig, Department of Nuclear Medicine, Leipzig (Germany); Leipzig University Medical Centre, Integrated Research and Treatment Centre (IFB) Adiposity Diseases, Leipzig (Germany)

    2017-06-15

    The role of dopamine D{sub 1}-type receptor (D{sub 1}R)-expressing neurons in the regulation of motivated behavior and reward prediction has not yet been fully established. As a prerequisite for future research assessing D{sub 1}-mediated neuronal network regulation using simultaneous PET/MRI and D{sub 1}R-selective [{sup 11}C]SCH23390, this study investigated the stability of central D{sub 1}R measurements between two independent PET/MRI sessions under baseline conditions. Thirteen healthy volunteers (7 female, age 33 ± 13 yrs) underwent 90-min emission scans, each after 90-s bolus injection of 486 ± 16 MBq [{sup 11}C]SCH23390, on two separate days within 2-4 weeks using a PET/MRI system. Parametric images of D{sub 1}R distribution volume ratio (DVR) and binding potential (BP{sub ND}) were generated by a multi-linear reference tissue model with two parameters and the cerebellar cortex as receptor-free reference region. Volume-of-interest (VOI) analysis was performed with manual VOIs drawn on consecutive transverse MRI slices for brain regions with high and low D{sub 1}R density. The DVR varied from 2.5 ± 0.3 to 2.9 ± 0.5 in regions with high D{sub 1}R density (e.g. the head of the caudate) and from 1.2 ± 0.1 to 1.6 ± 0.2 in regions with low D{sub 1}R density (e.g. the prefrontal cortex). The absolute variability of the DVR ranged from 2.4% ± 1.3% to 5.1% ± 5.3%, while Bland-Altman analyses revealed very low differences in mean DVR (e.g. 0.013 ± 0.17 for the nucleus accumbens). Intraclass correlation (one-way, random) indicated very high agreement (0.93 in average) for both DVR and BP{sub ND} values. Accordingly, the absolute variability of BP{sub ND} ranged from 7.0% ± 4.7% to 12.5% ± 10.6%; however, there were regions with very low D{sub 1}R content, such as the occipital cortex, with higher mean variability. The test-retest reliability of D{sub 1}R measurements in this study was very high. This was the case not only for D{sub 1}R-rich brain areas, but

  20. Test-retest reliability of evoked BOLD signals from a cognitive-emotive fMRI test battery.

    Science.gov (United States)

    Plichta, Michael M; Schwarz, Adam J; Grimm, Oliver; Morgen, Katrin; Mier, Daniela; Haddad, Leila; Gerdes, Antje B M; Sauer, Carina; Tost, Heike; Esslinger, Christine; Colman, Peter; Wilson, Frederick; Kirsch, Peter; Meyer-Lindenberg, Andreas

    2012-04-15

    Even more than in cognitive research applications, moving fMRI to the clinic and the drug development process requires the generation of stable and reliable signal changes. The performance characteristics of the fMRI paradigm constrain experimental power and may require different study designs (e.g., crossover vs. parallel groups), yet fMRI reliability characteristics can be strongly dependent on the nature of the fMRI task. The present study investigated both within-subject and group-level reliability of a combined three-task fMRI battery targeting three systems of wide applicability in clinical and cognitive neuroscience: an emotional (face matching), a motivational (monetary reward anticipation) and a cognitive (n-back working memory) task. A group of 25 young, healthy volunteers were scanned twice on a 3T MRI scanner with a mean test-retest interval of 14.6 days. FMRI reliability was quantified using the intraclass correlation coefficient (ICC) applied at three different levels ranging from a global to a localized and fine spatial scale: (1) reliability of group-level activation maps over the whole brain and within targeted regions of interest (ROIs); (2) within-subject reliability of ROI-mean amplitudes and (3) within-subject reliability of individual voxels in the target ROIs. Results showed robust evoked activation of all three tasks in their respective target regions (emotional task=amygdala; motivational task=ventral striatum; cognitive task=right dorsolateral prefrontal cortex and parietal cortices) with high effect sizes (ES) of ROI-mean summary values (ES=1.11-1.44 for the faces task, 0.96-1.43 for the reward task, 0.83-2.58 for the n-back task). Reliability of group level activation was excellent for all three tasks with ICCs of 0.89-0.98 at the whole brain level and 0.66-0.97 within target ROIs. Within-subject reliability of ROI-mean amplitudes across sessions was fair to good for the reward task (ICCs=0.56-0.62) and, dependent on the particular ROI

  1. Intrasubject reproducibility of presurgical language lateralization and mapping using fMRI.

    NARCIS (Netherlands)

    Fernandez, G.S.E.; Specht, K.; Weis, S.; Tendolkar, I.; Reuber, M.; Fell, J.; Klaver, P.; Ruhlmann, J.; Reul, J.; Elger, C.E.

    2003-01-01

    BACKGROUND: fMRI is becoming a standard tool for the presurgical lateralization and mapping of brain areas involved in language processing. However, its within-subject reproducibility has yet to be fully explored. OBJECTIVE: To evaluate within-test and test-retest reliability of language fMRI in

  2. Test-retest reliability of the different dynamometric variables used to evaluate pelvic floor musculature during the menstrual cycle.

    Science.gov (United States)

    Dos Reis Nagano, Reny C; Biasotto-Gonzalez, Daniela A; da Costa, Gilmar L; Amorim, Karina M; Fumagalli, Marco A; Amorim, César F; Politti, Fabiano

    2018-04-17

    The aim of this study was to evaluate the reliability of different dynamometric variables of the pelvic floor muscles (PFM) in healthy women during different periods of menstrual cycle. Vaginal dynamometric equipment was developed by the authors and its reproducibility was tested. The PFM contractions of 20 healthy women were collected by two independent examiners over three consecutive weeks, always on the same day, with a seven-day interval between readings, starting from the first day after the end of the menstrual period. For the measurements, the branch of the dynamometer was positioned first on the sagittal plane and then on the frontal plane. Baseline, peak time, maximum PFM strength, impulse contraction, and average contraction force were calculated. Reproducibility was tested using the intra-class correlation coefficient (ICC) and standard error of measurement. Repeated-measures ANOVA was used to compare the data from different days. For intra-day and inter-day reliability between examiners, all the parameters collected on the sagittal plane presented good and excellent reproducibility (ICC 2,1  = 0.60 to 0.98), whereas reproducibility on the frontal plane was respectively poor and excellent (ICC 2,1  = 0.23 to 0.97). The ANOVA revealed significant differences between sessions only for the impulse of contraction for the sagittal (P = 0.005) and frontal (P = 0.03) planes. Time and contraction force parameters of the PFM are not influenced by hormonal alterations that occur during the menstrual cycle. The impulse of contraction was the only variable to demonstrate a significant difference between the first and second week of the data collection protocol. The baseline, maximum strength value, impulse of contraction, and average contraction force variables presented good to excellent reproducibility and can be safely used as a method of PFM evaluation. © 2018 Wiley Periodicals, Inc.

  3. Interrater and Test-Retest Reliability and Minimal Detectable Change of the Balance Evaluation Systems Test (BESTest) and Subsystems With Community-Dwelling Older Adults.

    Science.gov (United States)

    Wang-Hsu, Elizabeth; Smith, Susan S

    2017-01-10

    Falls are a common cause of injuries and hospital admissions in older adults. Balance limitation is a potentially modifiable factor contributing to falls. The Balance Evaluation Systems Test (BESTest), a clinical balance measure, categorizes balance into 6 underlying subsystems. Each of the subsystems is scored individually and summed to obtain a total score. The reliability of the BESTest and its individual subsystems has been reported in patients with various neurological disorders and cancer survivors. However, the reliability and minimal detectable change (MDC) of the BESTest with community-dwelling older adults have not been reported. The purposes of our study were to (1) determine the interrater and test-retest reliability of the BESTest total and subsystem scores; and (2) estimate the MDC of the BESTest and its individual subsystem scores with community-dwelling older adults. We used a prospective cohort methodological design. Community-dwelling older adults (N = 70; aged 70-94 years; mean = 85.0 [5.5] years) were recruited from a senior independent living community. Trained testers (N = 3) administered the BESTest. All participants were tested with the BESTest by the same tester initially and then retested 7 to 14 days later. With 32 of the participants, a second tester concurrently scored the retest for interrater reliability. Testers were blinded to each other's scores. Intraclass correlation coefficients [ICC(2,1)] were used to determine the interrater and test-retest reliability. Test-retest reliability was also analyzed using method error and the associated coefficients of variation (CVME). MDC was calculated using standard error of measurement. Interrater reliability (N = 32) of the BESTest total score was ICC(2, 1) = 0.97 (95% confidence interval [CI], 0.94-0.99). The ICCs for the individual subsystem scores ranged from 0.85 to 0.94. Test-retest reliability (N = 70) of the BESTest total score was ICC(2,1) = 0.93 (95% CI, 0.89-0.96). ICCs for the

  4. Test-retest reliability and four-week changes in cardiopulmonary fitness in stroke patients: evaluation using a robotics-assisted tilt table.

    Science.gov (United States)

    Saengsuwan, Jittima; Berger, Lucia; Schuster-Amft, Corina; Nef, Tobias; Hunt, Kenneth J

    2016-09-06

    Exercise testing devices for evaluating cardiopulmonary fitness in patients with severe disability after stroke are lacking, but we have adapted a robotics-assisted tilt table (RATT) for cardiopulmonary exercise testing (CPET). Using the RATT in a sample of patients after stroke, this study aimed to investigate test-retest reliability and repeatability of CPET and to prospectively investigate changes in cardiopulmonary outcomes over a period of four weeks. Stroke patients with all degrees of disability underwent 3 separate CPET sessions: 2 tests at baseline (TB1 and TB2) and 1 test at follow up (TF). TB1 and TB2 were at least 24 h apart. TB2 and TF were 4 weeks apart. A RATT equipped with force sensors in the thigh cuffs, a work rate estimation algorithm and a real-time visual feedback system was used to guide the patients' exercise work rate during CPET. Test-retest reliability and repeatability of CPET variables were analysed using paired t-tests, the intraclass correlation coefficient (ICC), the coefficient of variation (CoV), and Bland and Altman limits of agreement. Changes in cardiopulmonary fitness during four weeks were analysed using paired t-tests. Seventeen sub-acute and chronic stroke patients (age 62.7 ± 10.4 years [mean ± SD]; 8 females) completed the test sessions. The median time post stroke was 350 days. There were 4 severely disabled, 1 moderately disabled and 12 mildly disabled patients. For test-retest, there were no statistically significant differences between TB1 and TB2 for most CPET variables. Peak oxygen uptake, peak heart rate, peak work rate and oxygen uptake at the ventilatory anaerobic threshold (VAT) and respiratory compensation point (RCP) showed good to excellent test-retest reliability (ICC 0.65-0.94). For all CPET variables, CoV was 4.1-14.5 %. The mean difference was close to zero in most of the CPET variables. There were no significant changes in most cardiopulmonary performance parameters during the 4-week period

  5. TEST-RETEST RELIABILITY OF HAND GRIP STRENGTH MEASUREMENT USING A JAMAR HAND DYNAMOMETER IN PATIENTS WITH ACUTE AND CHRONIC CERVICAL RADICULOPATHY

    Directory of Open Access Journals (Sweden)

    Ejazi G

    2017-12-01

    Full Text Available Background: To evaluate the test-retest reliability of Jamar hand held dynamometer for measuring handgrip strength (HGS in patients with acute and chronic cervical radiculopathy and to find out the difference in measurement of the handgrip strength between acute and chronic cervical radiculopathy. Methods: A prospective, observational and non-experimental, the comparative study design was used. A sample of 72 subjects (37 women and 35 men suffering from cervical radiculopathy were divided into two groups i.e., Group A(acute and Group B(chronic, handgrip strength was measured using Jamar hand held dynamometer on two occasions by the same rater with an interval of 7-days. Data collection was based on standard guidelines of American Society of Hand Therapists. Three gripping trials (measured in Kg with patient’s arm in standardized arm position were recorded. The data was analyzed from the mean score obtained from the sample. Result: One-way Analysis of Variance(ANOVA was used to evaluate test-retest reliability and Tukey-Kramer Multiple Comparison Test used to find the difference between handgrip strength among acute and chronic Cervical radiculopathy cases. Greater P-value (>0.05 in both testing session, as well as 95% of the confidence interval, shows the reliability of the instrument and lesser p-value (0.05 in female subjects shows no significant difference in handgrip strength between the two groups. Conclusion: Excellent test-retest reliability for hand grip strength measurement was measured in patients with acute and chronic cervical radiculopathy shows that the equipment could be used as an assessment tool for this patient and significant difference exists among male handgrip strength between acute and chronic cervical radiculopathy cases whereas no difference exists among female handgrip strength between acute and chronic cervical radiculopathy cases.

  6. Fiabilidad del test 6 minutos caminando en personas con secuelas de poliomielitis paralítica mediante test-retest de 12 semanas

    Directory of Open Access Journals (Sweden)

    Francisco Javier Domínguez-Muñoz

    2013-01-01

    Full Text Available El análisis de la fiabilidad del test de 6 minutos ca- minando en una población de personas con secuelas de poliomielitis paralítica mediante test-retest de 12 semanas no ha sido estudiado. Participaron personas con secuelas de poliomielitis paralítica (n = 18; 48,72 ± 7,69 años; 65,8 ± 11,6 kg. Se les realizó un test-retest de 12 semanas de la prueba de 6 minutos caminando que consistía en que los sujetos anduvieran la mayor distan- cia, sin llegar a la carrera, en un periodo de 6 minutos. La fiabilidad relativa de la prueba fue excelente (CCI = 0,99. En lo que se refiere a la fiabilidad absoluta se obtuvo un error estándar de medida (SEM del 1,7% y un mínimo cambio real (SRD de 4,7%. La fiabilidad del test de 6 minutos caminando usando el método Bland- Altman mostró que el error sistemático (diferencia de medias entre el test-retest fue 2,72 (bias. En conclu- sión, los resultados obtenidos en el test de 6 minutos ca- minando han sido muy fiables y afirmamos que la prue- ba de 6 minutos caminando podrá ser utilizada como prueba de evaluación en una población con secuelas de poliomielitis paralítica, con un intervalo de 12 semanas entre las dos mediciones, para comprobar los cambios que se han producido tras la aplicación de un programa de actividad física.

  7. Test-retest reliability of the diagnosis of schizoaffective disorder in childhood and adolescence - A systematic review and meta-analysis.

    Science.gov (United States)

    Salamon, Sarah; Santelmann, Hanno; Franklin, Jeremy; Baethge, Christopher

    2018-04-01

    Reliability of schizoaffective disorder (SAD) diagnoses is low in adults but unclear in children and adolescents (CAD). We estimate the test-retest reliability of SAD and its key differential diagnoses (schizophrenia, bipolar disorder, and unipolar depression). Systematic literature search of Medline, Embase, and PsycInfo for studies on test-retest reliability of SAD, in CAD. Cohen's kappa was extracted from studies. We performed meta-analysis for kappa, including subgroup and sensitivity analysis (PROSPERO protocol: CRD42013006713). Out of > 4000 records screened, seven studies were included. We estimated kappa values of 0.27 [95%-CI: 0.07 0.47] for SAD, 0.56 [0.29; 0.83] for schizophrenia, 0.64 [0.55; 0.74] for bipolar disorder, and 0.66 [0.52; 0.81] for unipolar depression. In 5/7 studies kappa of SAD was lower than that of schizophrenia; similar trends emerged for bipolar disorder (4/5) and unipolar depression (2/3). Estimates of positive agreement of SAD diagnoses supported these results. The number of studies and patients included is low. The point-estimate of the test-retest reliability of schizoaffective disorder is only fair, and lower than that of its main differential diagnoses. All kappa values under study were lower in children and adolescents samples than those reported for adults. Clinically, schizoaffective disorder should be diagnosed in strict adherence to the operationalized criteria and ought to be re-evaluated regularly. Should larger studies confirm the insufficient reliability of schizoaffective disorder in children and adolescents, the clinical value of the diagnosis is highly doubtful. Copyright © 2017. Published by Elsevier B.V.

  8. Hip abduction-adduction strength and one-leg hop tests: test-retest reliability and relationship to function in elite ice hockey players.

    Science.gov (United States)

    Kea, J; Kramer, J; Forwell, L; Birmingham, T

    2001-08-01

    Single group, test-retest. To determine: (1) hip abduction and adduction torques during concentric and eccentric muscle actions, (2) medial and lateral one-leg hop distances, (3) the test-retest reliability of these measurements, and (4) the relationship between isokinetic measures of hip muscle strength and hop distances in elite ice hockey players. The skating motion used in ice hockey requires strong contractions of the hip and knee musculature. However, baseline scores for hip strength and hop distances, their test-retest reliability, and measures of the extent to which these tests are related for this population are not available. The dominant leg of 27 men (mean age 20 +/- 3 yrs) was tested on 2 occasions. Hip abduction and adduction movements were completed at 60 degrees.s(-1) angular velocity, with the subject lying on the non-test side and the test leg moving vertically in the subject's coronal plane. One-leg hops requiring jumping from and landing on the same leg without losing balance were completed in the medial and lateral directions. Hip adduction torques were significantly greater than abduction torques during both concentric and eccentric muscle actions, while no significant difference was observed between medial and lateral hop distances. Although hop test scores produced excellent ICCs (> 0.75) when determined using scores on 1 occasion, torques needed to be averaged over 2 test occasions to reach this level. Correlations between the strength and hop tests ranged from slight to low (r = -0.26 to 0.27) and were characterized by wide 95% confidence intervals (-0.54 to 0.61). Isokinetic tests of hip abduction and adduction did not provide a strong indication of performance during sideways hop tests. Although isokinetic tests can provide a measure of muscular strength under specific test conditions, they should not be relied upon as a primary indicator of functional abilities or readiness to return to activity.

  9. Impact of Alzheimer's Disease on Caregiver Questionnaire: internal consistency, convergent validity, and test-retest reliability of a new measure for assessing caregiver burden.

    Science.gov (United States)

    Cole, Jason C; Ito, Diane; Chen, Yaozhu J; Cheng, Rebecca; Bolognese, Jennifer; Li-McLeod, Josephine

    2014-09-04

    There is a lack of validated instruments to measure the level of burden of Alzheimer's disease (AD) on caregivers. The Impact of Alzheimer's Disease on Caregiver Questionnaire (IADCQ) is a 12-item instrument with a seven-day recall period that measures AD caregiver's burden across emotional, physical, social, financial, sleep, and time aspects. Primary objectives of this study were to evaluate psychometric properties of IADCQ administered on the Web and to determine most appropriate scoring algorithm. A national sample of 200 unpaid AD caregivers participated in this study by completing the Web-based version of IADCQ and Short Form-12 Health Survey Version 2 (SF-12v2™). The SF-12v2 was used to measure convergent validity of IADCQ scores and to provide an understanding of the overall health-related quality of life of sampled AD caregivers. The IADCQ survey was also completed four weeks later by a randomly selected subgroup of 50 participants to assess test-retest reliability. Confirmatory factor analysis (CFA) was implemented to test the dimensionality of the IADCQ items. Classical item-level and scale-level psychometric analyses were conducted to estimate psychometric characteristics of the instrument. Test-retest reliability was performed to evaluate the instrument's stability and consistency over time. Virtually none (2%) of the respondents had either floor or ceiling effects, indicating the IADCQ covers an ideal range of burden. A single-factor model obtained appropriate goodness of fit and provided evidence that a simple sum score of the 12 items of IADCQ can be used to measure AD caregiver's burden. Scales-level reliability was supported with a coefficient alpha of 0.93 and an intra-class correlation coefficient (for test-retest reliability) of 0.68 (95% CI: 0.50-0.80). Low-moderate negative correlations were observed between the IADCQ and scales of the SF-12v2. The study findings suggest the IADCQ has appropriate psychometric characteristics as a

  10. Inter-rater and test-retest reliability of quality assessments by novice student raters using the Jadad and Newcastle-Ottawa Scales.

    Science.gov (United States)

    Oremus, Mark; Oremus, Carolina; Hall, Geoffrey B C; McKinnon, Margaret C

    2012-01-01

    Quality assessment of included studies is an important component of systematic reviews. The authors investigated inter-rater and test-retest reliability for quality assessments conducted by inexperienced student raters. Student raters received a training session on quality assessment using the Jadad Scale for randomised controlled trials and the Newcastle-Ottawa Scale (NOS) for observational studies. Raters were randomly assigned into five pairs and they each independently rated the quality of 13-20 articles. These articles were drawn from a pool of 78 papers examining cognitive impairment following electroconvulsive therapy to treat major depressive disorder. The articles were randomly distributed to the raters. Two months later, each rater re-assessed the quality of half of their assigned articles. McMaster Integrative Neuroscience Discovery and Study Program. 10 students taking McMaster Integrative Neuroscience Discovery and Study Program courses. The authors measured inter-rater reliability using κ and the intraclass correlation coefficient type 2,1 or ICC(2,1). The authors measured test-retest reliability using ICC(2,1). Inter-rater reliability varied by scale question. For the six-item Jadad Scale, question-specific κs ranged from 0.13 (95% CI -0.11 to 0.37) to 0.56 (95% CI 0.29 to 0.83). The ranges were -0.14 (95% CI -0.28 to 0.00) to 0.39 (95% CI -0.02 to 0.81) for the NOS cohort and -0.20 (95% CI -0.49 to 0.09) to 1.00 (95% CI 1.00 to 1.00) for the NOS case-control. For overall scores on the six-item Jadad Scale, ICC(2,1)s for inter-rater and test-retest reliability (accounting for systematic differences between raters) were 0.32 (95% CI 0.08 to 0.52) and 0.55 (95% CI 0.41 to 0.67), respectively. Corresponding ICC(2,1)s for the NOS cohort were -0.19 (95% CI -0.67 to 0.35) and 0.62 (95% CI 0.25 to 0.83), and for the NOS case-control, the ICC(2,1)s were 0.46 (95% CI -0.13 to 0.92) and 0.83 (95% CI 0.48 to 0.95). Inter-rater reliability was generally poor

  11. Interrater and test-retest reliability and validity of the Norwegian version of the BESTest and mini-BESTest in people with increased risk of falling.

    Science.gov (United States)

    Hamre, Charlotta; Botolfsen, Pernille; Tangen, Gro Gujord; Helbostad, Jorunn L

    2017-04-20

    The Balance Evaluation Systems Test (BESTest) was developed to assess underlying systems for balance control in order to be able to individually tailor rehabilitation interventions to people with balance disorders. A short form, the Mini-BESTest, was developed as a screening test. The study aimed to assess interrater and test-retest reliability of the Norwegian version of the BESTest and the Mini-BESTest in community-dwelling people with increased risk of falling and to assess concurrent validity with the Fall Efficacy Scale-International (FES-I), and it was an observational study with a cross-sectional design. Forty-two persons with increased risk of falling (elderly over 65 years of age, persons with a history of stroke or Multiple Sclerosis) were assessed twice by two raters. Relative reliability was analysed with Intraclass Correlation Coefficient (ICC), and absolute reliability with standard error of measurement (SEM) and smallest detectable change (SDC). Concurrent validity was assessed against the FES-I using Spearman's rho. The BESTest showed very good interrater reliability (ICC = 0.98, SEM = 1.79, SDC 95  = 5.0) and test-retest reliability (rater A/rater B = ICC = 0.89/0.89, SEM = 3.9/4.3, SDC 95  = 10.8/11.8). The Mini-BESTest also showed very good interrater reliability (ICC = 0.95, SEM = 1.19, SDC 95  = 3.3) and test-retest reliability (rater A/rater B = ICC = 0.85/0.84, SEM = 1.8/1.9, SDC 95  = 4.9/5.2). The correlations were moderate between the FES-I and both the BESTest and the Mini-BESTest (Spearman's rho -0.51 and-0.50, p test-retest reliability when assessed in a heterogeneous sample of people with increased risk of falling. The concurrent validity measured against the FES-I showed moderate correlation. The results are comparable with earlier studies and indicate that the Norwegian versions can be used in daily clinic and in research.

  12. Test-retest assessment of functional near-infrared spectroscopy to measure risk decision making in young adults

    Science.gov (United States)

    Li, Lin; Lin, Zijing; Cazzell, Mary; Liu, Hanli

    2013-03-01

    Investigation of the reliability and reproducibility of the hemodynamic response is important for interpretation and understanding of the results of functional near-infrared spectroscopy (fNIRS). It measures optical signals absorbed by the brain tissue and reflects the neuronal activities indirectly. Here we described an fNIRS study measured in the prefrontal region (Brodman area 9, 10, part of 46)to examine the risk decision-making behavior in nine young adults. The Balloon Analog Risk Task (BART) is widely used to test the level of risk taking ability in the field of psychology. BART was a protocol utilized in this study to evoke a risk-taking environment with a gambling-like balloon game in each subject. Specifically, we recorded the brain oxygenated-hemoglobin (HbO) and deoxygenated-hemoglobin (HHb) changes during the two repeated measurements within a time interval of 3 weeks. The results demonstrate that the changes in HbO2 amplitudes have high reliability at the group level, and that the spatial patterns of the tomographic images have high reproducibility in size and a moderate degree of overlap. Overall, this study confirms that the hemodynamic response to risk decision-making (i.e., BART) seen by fNIRS is highly reliable and reproducible.

  13. Imaging of striatal dopamine transporters in rat brain with single pinhole SPECT and co-aligned MRI is highly reproducible

    International Nuclear Information System (INIS)

    Booij, Jan; Bruin, Kora de; Win, Maartje M.L. de; Lavini, Cristina Mphil; Heeten, Gerard J. den; Habraken, Jan

    2003-01-01

    A recently developed pinhole high-resolution SPECT system was used to measure striatal to non-specific binding ratios in rats (n = 9), after injection of the dopamine transporter ligand 123 I-FP-CIT, and to assess its test/retest reproducibility. For co-alignment purposes, the rat brain was imaged on a 1.5 Tesla clinical MRI scanner using a specially developed surface coil. The SPECT images showed clear striatal uptake. On the MR images, cerebral and extra-cerebral structures could be easily delineated. The mean striatal to non-specific [ 123 I]FP-CIT binding ratios of the test/retest studies were 1.7 ± 0.2 and 1.6 ± 0.2, respectively. The test/retest variability was approximately 9%. We conclude that the assessment of striatal [ 123 I]FP-CIT binding ratios in rats is highly reproducible

  14. Test-retest repeatability of myocardial blood flow and infarct size using 11C-acetate micro-PET imaging in mice

    International Nuclear Information System (INIS)

    Croteau, Etienne; Renaud, Jennifer M.; McDonald, Matthew; Klein, Ran; DaSilva, Jean N.; Beanlands, Rob S.B.; DeKemp, Robert A.

    2015-01-01

    Global and regional responses of absolute myocardial blood flow index (iMBF) are used as surrogate markers to assess response to therapies in coronary artery disease. In this study, we assessed the test-retest repeatability of iMBF imaging, and the accuracy of infarct sizing in mice using 11 C-acetate PET. 11 C-Acetate cardiac PET images were acquired in healthy controls, endothelial nitric oxide synthase (eNOS) knockout transgenic mice, and mice after myocardial infarction (MI) to estimate global and regional iMBF, and myocardial infarct size compared to 18 F-FDG PET and ex-vivo histology results. Global test-retest iMBF values had good coefficients of repeatability (CR) in healthy mice, eNOS knockout mice and normally perfused regions in MI mice (CR = 1.6, 2.0 and 1.5 mL/min/g, respectively). Infarct size measured on 11 C-acetate iMBF images was also repeatable (CR = 17 %) and showed a good correlation with the infarct sizes found on 18 F-FDG PET and histopathology (r 2 > 0.77; p < 0.05). 11 C-Acetate micro-PET assessment of iMBF and infarct size is repeatable and suitable for serial investigation of coronary artery disease progression and therapy. (orig.)

  15. Test-retest repeatability of myocardial blood flow and infarct size using {sup 11}C-acetate micro-PET imaging in mice

    Energy Technology Data Exchange (ETDEWEB)

    Croteau, Etienne; Renaud, Jennifer M.; McDonald, Matthew; Klein, Ran; DaSilva, Jean N.; Beanlands, Rob S.B.; DeKemp, Robert A. [University of Ottawa Heart Institute, National Cardiac PET Centre, Ottawa, Ontario (Canada)

    2015-09-15

    Global and regional responses of absolute myocardial blood flow index (iMBF) are used as surrogate markers to assess response to therapies in coronary artery disease. In this study, we assessed the test-retest repeatability of iMBF imaging, and the accuracy of infarct sizing in mice using {sup 11}C-acetate PET. {sup 11}C-Acetate cardiac PET images were acquired in healthy controls, endothelial nitric oxide synthase (eNOS) knockout transgenic mice, and mice after myocardial infarction (MI) to estimate global and regional iMBF, and myocardial infarct size compared to {sup 18}F-FDG PET and ex-vivo histology results. Global test-retest iMBF values had good coefficients of repeatability (CR) in healthy mice, eNOS knockout mice and normally perfused regions in MI mice (CR = 1.6, 2.0 and 1.5 mL/min/g, respectively). Infarct size measured on {sup 11}C-acetate iMBF images was also repeatable (CR = 17 %) and showed a good correlation with the infarct sizes found on {sup 18}F-FDG PET and histopathology (r{sup 2} > 0.77; p < 0.05). {sup 11}C-Acetate micro-PET assessment of iMBF and infarct size is repeatable and suitable for serial investigation of coronary artery disease progression and therapy. (orig.)

  16. The test-retest reliability and criterion validity of a high-intensity, netball-specific circuit test: The Net-Test.

    Science.gov (United States)

    Mungovan, Sean F; Peralta, Paula J; Gass, Gregory C; Scanlan, Aaron T

    2018-04-12

    To examine the test-retest reliability and criterion validity of a high-intensity, netball-specific fitness test. Repeated measures, within-subject design. Eighteen female netball players competing in an international competition completed a trial of the Net-Test, which consists of 14 timed netball-specific movements. Players also completed a series of netball-relevant criterion fitness tests. Ten players completed an additional Net-Test trial one week later to assess test-retest reliability using intraclass correlation coefficient (ICC), typical error of measurement (TEM), and coefficient of variation (CV). The typical error of estimate expressed as CV and Pearson correlations were calculated between each criterion test and Net-Test performance to assess criterion validity. Five movements during the Net-Test displayed moderate ICC (0.84-0.90) and two movements displayed high ICC (0.91-0.93). Seven movements and heart rate taken during the Net-Test held low CV (Test possessed low CV and significant (pTest possesses acceptable reliability for the assessment of netball fitness. Further, the high criterion validity for the Net-Test suggests a range of important netball-specific fitness elements are assessed in combination. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.

  17. Reliability of Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory in a test-retest design.

    Science.gov (United States)

    Larson, Tomas; Kerekes, Nóra; Selinus, Eva Norén; Lichtenstein, Paul; Gumpert, Clara Hellner; Anckarsäter, Henrik; Nilsson, Thomas; Lundström, Sebastian

    2014-02-01

    The Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory is used in epidemiological research to assess neurodevelopmental problems and coexisting conditions. Although the A-TAC has been applied in various populations, data on retest reliability are limited. The objective of the present study was to present additional reliability data. The A-TAC was administered by lay assessors and was completed on two occasions by parents of 400 individual twins, with an average interval of 70 days between test sessions. Intra- and inter-rater reliability were analysed with intraclass correlations and Cohen's kappa. A-TAC showed excellent test-retest intraclass correlations for both autism spectrum disorder and attention deficit hyperactivity disorder (each at .84). Most modules in the A-TAC had intra- and inter-rater reliability intraclass correlation coefficients of > or = .60. Cohen's kappa indi- cated acceptable reliability. The current study provides statistical evidence that the A-TAC yields good test-retest reliability in a population-based cohort of children.

  18. Test-retest paradigm of the forced swimming test in female mice is not valid for predicting antidepressant-like activity: participation of acetylcholine and sigma-1 receptors.

    Science.gov (United States)

    Su, Jing; Hato-Yamada, Noriko; Araki, Hiroaki; Yoshimura, Hiroyuki

    2013-01-01

    The forced swimming test (FST) in mice is widely used to predict the antidepressant activity of a drug, but information describing the immobility of female mice is limited. We investigated whether a prior swimming experience affects the immobility duration in a second FST in female mice and whether the test-retest paradigm is a valid screening tool for antidepressants. Female ICR mice were exposed to the FST using two experimental paradigms: a single FST and a double FST in which mice had experienced FST once 24 h prior to the second trail. The initial FST experience reliably prolonged immobility duration in the second FST. The antidepressants imipramine and paroxetine significantly reduced immobility duration in the single FST, but not in the double FST. Scopolamine and the sigma-1 (σ1) antagonist NE-100 administered before the second trial significantly prevented the prolongation of immobility. Neither a 5-HT1A nor a 5-HT2A receptor agonist affected immobility duration. We suggest that the test-retest paradigm in female mice is not adequate for predicting antidepressant-like activity of a drug; the prolongation of immobility in the double FST is modulated through acetylcholine and σ1 receptors.

  19. Test-retest variability of high resolution positron emission tomography (PET) imaging of cortical serotonin (5HT2A) receptors in older, healthy adults

    International Nuclear Information System (INIS)

    Chow, Tiffany W; Mamo, David C; Uchida, Hiroyuki; Graff-Guerrero, Ariel; Houle, Sylvain; Smith, Gwenn S; Pollock, Bruce G; Mulsant, Benoit H

    2009-01-01

    Position emission tomography (PET) imaging using [ 18 F]-setoperone to quantify cortical 5-HT 2A receptors has the potential to inform pharmacological treatments for geriatric depression and dementia. Prior reports indicate a significant normal aging effect on serotonin 5HT 2A receptor (5HT 2A R) binding potential. The purpose of this study was to assess the test-retest variability of [ 18 F]-setoperone PET with a high resolution scanner (HRRT) for measuring 5HT 2A R availability in subjects greater than 60 years old. Methods: Six healthy subjects (age range = 65–78 years) completed two [ 18 F]-setoperone PET scans on two separate occasions 5–16 weeks apart. The average difference in the binding potential (BP ND ) as measured on the two occasions in the frontal and temporal cortical regions ranged between 2 and 12%, with the lowest intraclass correlation coefficient in anterior cingulate regions. We conclude that the test-retest variability of [ 18 F]-setoperone PET in elderly subjects is comparable to that of [ 18 F]-setoperone and other 5HT 2A R radiotracers in younger subject samples

  20. The Perceived Efficacy and Goal Setting System (PEGS), part II: evaluation of test-retest reliability and differences between child and parental reports in the Swedish version.

    Science.gov (United States)

    Vroland-Nordstrand, Kristina; Krumlinde-Sundholm, Lena

    2012-11-01

    to evaluate the test-retest reliability of children's perceptions of their own competence in performing daily tasks and of their choice of goals for intervention using the Swedish version of the perceived efficacy and goal setting system (PEGS). A second aim was to evaluate agreement between children's and parents' perceptions of the child's competence and choices of intervention goals. Forty-four children with disabilities and their parents completed the Swedish version of the PEGS. Thirty-six of the children completed a retest session allocated into one of two groups: (A) for evaluation of perceived competence and (B) for evaluation of choice of goals. Cohen's kappa, weighted kappa and absolute agreement were calculated. Test-retest reliability for children's perceived competence showed good agreement for the dichotomized scale of competent/non-competent performance; however, using the four-point scale the agreement varied. The children's own goals were relatively stable over time; 78% had an absolute agreement ranging from 50% to 100%. There was poor agreement between the children's and their parents' ratings. Goals identified by the children differed from those identified by their parents, with 48% of the children having no goals identical to those chosen by their parents. These results indicate that the Swedish version of the PEGS produces reliable outcomes comparable to the original version.

  1. Assessing the test-retest repeatability of the Vietnamese version of the National Eye Institute 25-item Visual Function Questionnaire among bilateral cataract patients for a Vietnamese population.

    Science.gov (United States)

    To, Kien Gia; Meuleners, Lynn; Chen, Huei-Yang; Lee, Andy; Do, Dung Van; Duong, Dat Van; Phi, Tien Duy; Tran, Hoang Huy; Nguyen, Nguyen Do

    2014-06-01

    To determine the test-retest repeatability of the National Eye Institute 25-item Visual Function Questionnaire (NEI VFQ-25) for use with older Vietnamese adults with bilateral cataract. The questionnaire was translated into Vietnamese and back-translated into English by two independent translators. Patients with bilateral cataract aged 50 and older completed the questionnaire on two separate occasions, one to two weeks after first administration of the questionnaire. Test-retest repeatability was assessed using the Cronbach's α and intraclass correlation coefficients. The average age of participants was 67 ± 8 years and most participants were female (73%). Internal consistency was acceptable with the α coefficient above 0.7 for all subscales and intraclass correlation coefficients were 0.6 or greater in all subscales. The Vietnamese NEI VFQ-25 is reliable for use in studies assessing vision-related quality of life in older adults with bilateral cataract in Vietnam. We propose some modifications to the NEI-VFQ questions to reflect activities of older people in Vietnam. © 2013 ACOTA.

  2. Test-retest reliability of schizoaffective disorder compared with schizophrenia, bipolar disorder, and unipolar depression--a systematic review and meta-analysis.

    Science.gov (United States)

    Santelmann, Hanno; Franklin, Jeremy; Bußhoff, Jana; Baethge, Christopher

    2015-11-01

    Schizoaffective disorder is a frequent diagnosis, and its reliability is subject to ongoing discussion. We compared the diagnostic reliability of schizoaffective disorder with its main differential diagnoses. We systematically searched Medline, Embase, and PsycInfo for all studies on the test-retest reliability of the diagnosis of schizoaffective disorder as compared with schizophrenia, bipolar disorder, and unipolar depression. We used meta-analytic methods to describe and compare Cohen's kappa as well as positive and negative agreement. In addition, multiple pre-specified and post hoc subgroup and sensitivity analyses were carried out. Out of 4,415 studies screened, 49 studies were included. Test-retest reliability of schizoaffective disorder was consistently lower than that of schizophrenia (in 39 out of 42 studies), bipolar disorder (27/33), and unipolar depression (29/35). The mean difference in kappa between schizoaffective disorder and the other diagnoses was approximately 0.2, and mean Cohen's kappa for schizoaffective disorder was 0.50 (95% confidence interval: 0.40-0.59). While findings were unequivocal and homogeneous for schizoaffective disorder's diagnostic reliability relative to its three main differential diagnoses (dichotomous: smaller versus larger), heterogeneity was substantial for continuous measures, even after subgroup and sensitivity analyses. In clinical practice and research, schizoaffective disorder's comparatively low diagnostic reliability should lead to increased efforts to correctly diagnose the disorder. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  3. Reproducibility of Automated Voice Range Profiles, a Systematic Literature Review

    DEFF Research Database (Denmark)

    Printz, Trine; Rosenberg, Tine; Godballe, Christian

    2018-01-01

    literature on test-retest accuracy of the automated voice range profile assessment. Study design: Systematic review. Data sources: PubMed, Scopus, Cochrane Library, ComDisDome, Embase, and CINAHL (EBSCO). Methods: We conducted a systematic literature search of six databases from 1983 to 2016. The following......Objective: Reliable voice range profiles are of great importance when measuring effects and side effects from surgery affecting voice capacity. Automated recording systems are increasingly used, but the reproducibility of results is uncertain. Our objective was to identify and review the existing...... keywords were used: phonetogram, voice range profile, and acoustic voice analysis. Inclusion criteria were automated recording procedure, healthy voices, and no intervention between test and retest. Test-retest values concerning fundamental frequency and voice intensity were reviewed. Results: Of 483...

  4. Test-retest repeatability of strength capacity, aerobic power and pericranial tenderness of neck and shoulder muscles in children - relevant for tension-type headache

    Directory of Open Access Journals (Sweden)

    Tornøe B

    2013-08-01

    Full Text Available Birte Tornøe,1,2,5,6 Lars L Andersen,3 Jørgen H Skotte,3 Rigmor Jensen,4 Gunvor Gard,1 Liselotte Skov,2 Inger Hallström1 1Department of Health Sciences, Lund University, Scania, Sweden; 2Children's Headache Clinic, Department of Pediatrics, University of Copenhagen, Herlev Hospital, Herlev, Denmark; 3National Research Centre for the Working Environment, Copenhagen, Denmark; 4Danish Headache Center, Department of Neurology, University of Copenhagen, Glostrup Hospital, Glostrup, Denmark; 5Department of Physiotherapy and Occupational Therapy, University of Copenhagen, Glostrup Hospital, Glostrup, Denmark; 6Department of Physiotherapy, Medical Department, University of Copenhagen, Herlev Hospital, Herlev, Denmark Background: Frequent or chronic tension-type headache in children is a prevalent and debilitating condition for the child, often leading to medication overuse. To explore the relationship between physical factors and tension-type headache in children, the quality of repeated measures was examined. The aim of the present study was to determine the test-retest repeatability of parameters determining isometric neck and shoulder strength and stability, aerobic power, and pericranial tenderness in children. Methods: Twenty-five healthy children, 9 to 18 years of age, participated in test-retest procedures within a 1-week interval. A computerized padded force transducer was used for testing. The tests included the isometric maximal voluntary contraction and force steadiness of neck flexion and extension, and the isometric maximal voluntary contraction and rate of force of the dominant shoulder. Pericranial tenderness was recorded by means of standardized manual palpation, and a submaximal cycle ergometer test predicted maximal oxygen uptake (VO2 max. The measurements were evaluated in steps, using the intraclass correlation coefficient (ICC; changes in the mean between the two test occasions; the levels of agreement, visualized in Bland

  5. Test-retest and interobserver reliability of quantitative sensory testing according to the protocol of the German Research Network on Neuropathic Pain (DFNS): a multi-centre study.

    Science.gov (United States)

    Geber, Christian; Klein, Thomas; Azad, Shahnaz; Birklein, Frank; Gierthmühlen, Janne; Huge, Volker; Lauchart, Meike; Nitzsche, Dorothee; Stengel, Maike; Valet, Michael; Baron, Ralf; Maier, Christoph; Tölle, Thomas; Treede, Rolf-Detlef

    2011-03-01

    Quantitative sensory testing (QST) is an instrument to assess positive and negative sensory signs, helping to identify mechanisms underlying pathologic pain conditions. In this study, we evaluated the test-retest reliability (TR-R) and the interobserver reliability (IO-R) of QST in patients with sensory disturbances of different etiologies. In 4 centres, 60 patients (37 male and 23 female, 56.4±1.9years) with lesions or diseases of the somatosensory system were included. QST comprised 13 parameters including detection and pain thresholds for thermal and mechanical stimuli. QST was performed in the clinically most affected test area and a less or unaffected control area in a morning and an afternoon session on 2 consecutive days by examiner pairs (4 QSTs/patient). For both, TR-R and IO-R, there were high correlations (r=0.80-0.93) at the affected test area, except for wind-up ratio (TR-R: r=0.67; IO-R: r=0.56) and paradoxical heat sensations (TR-R: r=0.35; IO-R: r=0.44). Mean IO-R (r=0.83, 31% unexplained variance) was slightly lower than TR-R (r=0.86, 26% unexplained variance, Ptest area (TR-R: r=0.86; IO-R: r=0.83) than in the control area (TR-R: r=0.79; IO-R: r=0.71, each Preliability of QST. We conclude that standardized QST performed by trained examiners is a valuable diagnostic instrument with good test-retest and interobserver reliability within 2days. With standardized training, observer bias is much lower than random variance. Quantitative sensory testing performed by trained examiners is a valuable diagnostic instrument with good interobserver and test-retest reliability for use in patients with sensory disturbances of different etiologies to help identify mechanisms of neuropathic and non-neuropathic pain. Copyright © 2010 International Association for the Study of Pain. Published by Elsevier B.V. All rights reserved.

  6. Intra-Rater, Inter-Rater and Test-Retest Reliability of an Instrumented Timed Up and Go (iTUG Test in Patients with Parkinson's Disease.

    Directory of Open Access Journals (Sweden)

    Rob C van Lummel

    Full Text Available The "Timed Up and Go" (TUG is a widely used measure of physical functioning in older people and in neurological populations, including Parkinson's Disease. When using an inertial sensor measurement system (instrumented TUG [iTUG], the individual components of the iTUG and the trunk kinematics can be measured separately, which may provide relevant additional information.The aim of this study was to determine intra-rater, inter-rater and test-retest reliability of the iTUG in patients with Parkinson's Disease.Twenty eight PD patients, aged 50 years or older, were included. For the iTUG the DynaPort Hybrid (McRoberts, The Hague, The Netherlands was worn at the lower back. The device measured acceleration and angular velocity in three directions at a rate of 100 samples/s. Patients performed the iTUG five times on two consecutive days. Repeated measurements by the same rater on the same day were used to calculate intra-rater reliability. Repeated measurements by different raters on the same day were used to calculate intra-rater and inter-rater reliability. Repeated measurements by the same rater on different days were used to calculate test-retest reliability.Nineteen ICC values (15% were ≥ 0.9 which is considered as excellent reliability. Sixty four ICC values (49% were ≥ 0.70 and < 0.90 which is considered as good reliability. Thirty one ICC values (24% were ≥ 0.50 and < 0.70, indicating moderate reliability. Sixteen ICC values (12% were ≥ 0.30 and < 0.50 indicating poor reliability. Two ICT values (2% were < 0.30 indicating very poor reliability.In conclusion, in patients with Parkinson's disease the intra-rater, inter-rater, and test-retest reliability of the individual components of the instrumented TUG (iTUG was excellent to good for total duration and for turning durations, and good to low for the sub durations and for the kinematics of the SiSt and StSi. The results of this fully automated analysis of instrumented TUG movements

  7. Brain GABA Detection in vivo with the J-editing 1H MRS Technique: A Comprehensive Methodological Evaluation of Sensitivity Enhancement, Macromolecule Contamination and Test-Retest Reliability

    Science.gov (United States)

    Shungu, Dikoma C.; Mao, Xiangling; Gonzales, Robyn; Soones, Tacara N.; Dyke, Jonathan P.; van der Veen, Jan Willem; Kegeles, Lawrence S.

    2016-01-01

    Abnormalities in brain γ-aminobutyric acid (GABA) have been implicated in various neuropsychiatric and neurological disorders. However, in vivo GABA detection by proton magnetic resonance spectroscopy (1H MRS) presents significant challenges arising from low brain concentration, overlap by much stronger resonances, and contamination by mobile macromolecule (MM) signals. This study addresses these impediments to reliable brain GABA detection with the J-editing difference technique on a 3T MR system in healthy human subjects by (a) assessing the sensitivity gains attainable with an 8-channel phased-array head coil, (b) determining the magnitude and anatomic variation of the contamination of GABA by MM, and (c) estimating the test-retest reliability of measuring GABA with this method. Sensitivity gains and test-retest reliability were examined in the dorsolateral prefrontal cortex (DLPFC), while MM levels were compared across three cortical regions: the DLPFC, the medial prefrontal cortex (MPFC) and the occipital cortex (OCC). A 3-fold higher GABA detection sensitivity was attained with the 8-channel head coil compared to the standard single-channel head coil in DLPFC. Despite significant anatomic variation in GABA+MM and MM across the three brain regions (p GABA+MM was relatively stable across the three voxels, ranging from 41% to 49%, a non-significant regional variation (p = 0.58). The test-retest reliability of GABA measurement, expressed either as ratios to voxel tissue water (W) or total creatine, was found to be very high for both the single-channel coil and the 8-channel phased-array coil. For the 8-channel coil, for example, Pearson’s correlation coefficient of test vs. retest for GABA/W was 0.98 (R2 = 0.96, p = 0.0007), the percent coefficient of variation (CV) was 1.25%, and the intraclass correlation coefficient (ICC) was 0.98. Similar reliability was also found for the co-edited resonance of combined glutamate and glutamine (Glx) for both coils. PMID

  8. Test-retest reliability of pure-tone thresholds from 0.5 to 16 kHz using Sennheiser HDA 200 and Etymotic Research ER-2 earphones.

    Science.gov (United States)

    Schmuziger, Nicolas; Probst, Rudolf; Smurzynski, Jacek

    2004-04-01

    The purposes of the study were: (1) To evaluate the intrasession test-retest reliability of pure-tone thresholds measured in the 0.5-16 kHz frequency range for a group of otologically healthy subjects using Sennheiser HDA 200 circumaural and Etymotic Research ER-2 insert earphones and (2) to compare the data with existing criteria of significant threshold shifts related to ototoxicity and noise-induced hearing loss. Auditory thresholds in the frequency range from 0.5 to 6 kHz and in the extended high-frequency range from 8 to 16 kHz were measured in one ear of 138 otologically healthy subjects (77 women, 61 men; mean age, 24.4 yr; range, 12-51 yr) using HDA 200 and ER-2 earphones. For each subject, measurements of thresholds were obtained twice for both transducers during the same test session. For analysis, the extended high-frequency range from 8 to 16 kHz was subdivided into 8 to 12.5 and 14 to 16 kHz ranges. Data for each frequency and frequency range were analyzed separately. There were no significant differences in repeatability for the two transducer types for all frequency ranges. The intrasession variability increased slightly, but significantly, as frequency increased with the greatest amount of variability in the 14 to 16 kHz range. Analyzing each individual frequency, variability was increased particularly at 16 kHz. At each individual frequency and for both transducer types, intrasession test-retest repeatability from 0.5 to 6 kHz and 8 to 16 kHz was within 10 dB for >99% and >94% of measurements, respectively. The results indicated a false-positive rate of HDA 200. Repeatability was similar for both transducer types. Intrasession test-retest repeatability from 0.5 to 12.5 kHz at each individual frequency including the frequency range susceptible to noise-induced hearing loss was excellent for both transducers. Repeatability was slightly, but significantly poorer in the frequency range from 14 to 16 kHz compared with the frequency ranges from 0.5 to 6

  9. A review of culturally adapted versions of the Oswestry Disability Index: the adaptation process, construct validity, test-retest reliability and internal consistency.

    Science.gov (United States)

    Sheahan, Peter J; Nelson-Wong, Erika J; Fischer, Steven L

    2015-01-01

    The Oswestry Disability Index (ODI) is a self-report-based outcome measure used to quantify the extent of disability related to low back pain (LBP), a substantial contributor to workplace absenteeism. The ODI tool has been adapted for use by patients in several non-English speaking nations. It is unclear, however, if these adapted versions of the ODI are as credible as the original ODI developed for English-speaking nations. The objective of this study was to conduct a review of the literature to identify culturally adapted versions of the ODI and to report on the adaptation process, construct validity, test-retest reliability and internal consistency of these ODIs. Following a pragmatic review process, data were extracted from each study with regard to these four outcomes. While most studies applied adaptation processes in accordance with best-practice guidelines, there were some deviations. However, all studies reported high-quality psychometric properties: group mean construct validity was 0.734 ± 0.094 (indicated via a correlation coefficient), test-retest reliability was 0.937 ± 0.032 (indicated via an intraclass correlation coefficient) and internal consistency was 0.876 ± 0.047 (indicated via Cronbach's alpha). Researchers can be confident when using any of these culturally adapted ODIs, or when comparing and contrasting results between cultures where these versions were employed. Implications for Rehabilitation Low back pain is the second leading cause of disability in the world, behind only cancer. The Oswestry Disability Index (ODI) has been developed as a self-report outcome measure of low back pain for administration to patients. An understanding of the various cross-cultural adaptations of the ODI is important for more concerted multi-national research efforts. This review examines 16 cross-cultural adaptations of the ODI and should inform the work of health care and rehabilitation professionals.

  10. The Unsupported Upper Limb Exercise Test in People Without Disabilities: Assessing the Within-Day Test-Retest Reliability and the Effects of Age and Gender.

    Science.gov (United States)

    Oliveira, Ana; Cruz, Joana; Jácome, Cristina; Marques, Alda

    2018-01-01

    Purpose: To estimate the within-day test-retest reliability and standard error of measurement (SEM) of the unsupported upper limb exercise test (UULEX) in adults without disabilities and to determine the effects of age and gender on performance of the UULEX. Method: A cross-sectional study was conducted with 100 adults without disabilities (44 men, mean age 44.2 [SD 26] y; 56 women, mean age 38.1 [SD 24.1] y). Participants performed three UULEX tests to establish within-day reliability, measured using an intra-class correlation coefficient (ICC) model 2 (two-way random effects) with a single rater (ICC[2,1]) and SEM. The effects of age and gender were examined using two-factor mixed-design analysis of variance (ANOVA) and one-way repeated-measures ANOVA. For analysis purposes, four sub-groups were created: younger adults, older adults, men, and women. Results: Excellent within-day reliability and a small SEM were found in the four sub-groups (younger adults: ICC[2,1]=0.88; 95% CI: 0.82, 0.92; SEM∼40 s; older adults: ICC[2,1]=0.82; 95% CI: 0.72, 0.90; SEM∼50 s; men: ICC[2,1]=0.93; 95% CI: 0.88, 0.96; SEM∼30 s; women: ICC[2,1]=0.85; 95% CI: 0.78, 0.91; SEM∼45 s). Younger adults took, on average, 308.24 seconds longer than older adults to perform the test; older adults performed significantly better on the third test ( p 0.05). Conclusion: The within-day test-retest reliability and SEM values of the UULEX may be used to define the magnitude of the error obtained with repeated measures. One UULEX test seems to be adequate for younger adults to achieve reliable results, whereas three tests seem to be needed for older adults.

  11. Test-retest reliability of {sup 11}C-ORM-13070 in PET imaging of α{sub 2C}-adrenoceptors in vivo in the human brain

    Energy Technology Data Exchange (ETDEWEB)

    Lehto, Jussi; Peltonen, Juha M.; Volanen, Iina; Scheinin, Mika [University of Turku, Clinical Research Services Turku CRST, Turku (Finland); TYKSLAB, Unit of Clinical Pharmacology, Turku (Finland); Virta, Jere R. [University of Turku and Turku University Hospital, Turku PET Centre, Turku (Finland); Turku University Hospital, Division of Clinical Neurosciences, Turku (Finland); Oikonen, Vesa; Roivainen, Anne; Luoto, Pauliina; Arponen, Eveliina; Helin, Semi; Virtanen, Kirsi [University of Turku and Turku University Hospital, Turku PET Centre, Turku (Finland); Hietamaeki, Johanna; Holopainen, Aila; Rouru, Juha; Sallinen, Jukka [Orion Pharma, Turku (Finland); Kailajaervi, Marita [Turku Imanet, GE Healthcare, Turku (Finland); Rinne, Juha O. [University of Turku and Turku University Hospital, Turku PET Centre, Turku (Finland); Turku University Hospital, Division of Clinical Neurosciences, Turku (Finland); University of Turku, Clinical Research Services Turku CRST, Turku (Finland)

    2015-01-15

    α{sub 2C}-Adrenoceptors share inhibitory presynaptic functions with the more abundant α{sub 2A}-adrenoceptor subtype, but they also have widespread postsynaptic modulatory functions in the brain. Research on the noradrenergic system of the human brain has been hampered by the lack of suitable PET tracers targeted to the α{sub 2}-adrenoceptor subtypes. PET imaging with the specific α{sub 2C}-adrenoceptor antagonist tracer [{sup 11}C]ORM-13070 was performed twice in six healthy male subjects to investigate the test-retest reliability of tracer binding. The bound/free ratio of tracer uptake relative to nonspecific uptake into the cerebellum during the time interval of 5 - 30 min was most prominent in the dorsal striatum: 0.77 in the putamen and 0.58 in the caudate nucleus. Absolute test-retest variability in bound/free ratios of tracer ranged from 4.3 % in the putamen to 29 % in the hippocampus. Variability was also <10 % in the caudate nucleus and thalamus. Intraclass correlation coefficients (ICC) ranged from 0.50 in the hippocampus to 0.89 in the thalamus (ICC >0.70 was also reached in the caudate nucleus, putamen, lateral frontal cortex and parietal cortex). The pattern of [{sup 11}C]ORM-13070 binding, as determined by PET, was in good agreement with receptor density results previously derived from post-mortem autoradiography. PET data analysis results obtained with a compartmental model fit, the simplified reference tissue model and a graphical reference tissue analysis method were convergent with the tissue ratio method. The results of this study support the use of [{sup 11}C]ORM-13070 PET in the quantitative assessment of α{sub 2C}-adrenoceptors in the human brain in vivo. Reliable assessment of specific tracer binding in the dorsal striatum is possible with the help of reference tissue ratios. (orig.)

  12. MicroPET imaging of 5-HT{sub 1A} receptors in rat brain: a test-retest [{sup 18}F]MPPF study

    Energy Technology Data Exchange (ETDEWEB)

    Aznavour, Nicolas [McGill University, Department of Psychiatry, Montreal, QC (Canada)]|[Laboratory of Neuroenergetics and Cellular Dynamics, EPFL, SV, BMI, Lausanne (Switzerland); Benkelfat, Chawki; Gravel, Paul [McGill University, Department of Psychiatry, Montreal, QC (Canada)]|[McGill University, Department of Neurology and Neurosurgery, Montreal, QC (Canada); Aliaga, Antonio [McGill University, Department of Small Animal Imaging Laboratory, Montreal, QC (Canada); Rosa-Neto, Pedro [Douglas Hospital, Molecular NeuroImaging Laboratory, Montreal, QC (Canada); Bedell, Barry [McGill University, Department of Neurology and Neurosurgery, Montreal, QC (Canada)]|[McGill University, Department of Small Animal Imaging Laboratory, Montreal, QC (Canada); Zimmer, Luc [CERMEP, ANIMAGE Department, Lyon (France)]|[Universite Lyon 1 and CNRS, Lyon (France); Descarries, Laurent [Universite de Montreal, Department of Pathology and Cell Biology, Montreal, QC (Canada)]|[Universite de Montreal, Department of Physiology, Montreal, QC (Canada)]|[Universite de Montreal, GRSNC, Montreal, QC (Canada)

    2009-01-15

    Earlier studies have shown that positron emission tomography (PET) imaging with the radioligand [{sup 18}F]MPPF allows for measuring the binding potential of serotonin 5-hydroxytryptamine{sub 1A} (5-HT{sub 1A}) receptors in different regions of animal and human brain, including that of 5-HT{sub 1A} autoreceptors in the raphe nuclei. In the present study, we sought to determine if such data could be obtained in rat, with a microPET (R4, Concorde Microsystems). Scans from isoflurane-anaesthetised rats (n = 18, including six test-retest) were co-registered with magnetic resonance imaging data, and binding potential, blood to plasma ratio and radiotracer efflux were estimated according to a simplified reference tissue model. Values of binding potential for hippocampus (1.2), entorhinal cortex (1.1), septum (1.1), medial prefrontal cortex (1.0), amygdala (0.8), raphe nuclei (0.6), paraventricular hypothalamic nucleus (0.5) and raphe obscurus (0.5) were comparable to those previously measured with PET in cats, non-human primates or humans. Test-retest variability was in the order of 10% in the larger brain regions (hippocampus, medial prefrontal and entorhinal cortex) and less than 20% in small nuclei such as the septum and the paraventricular hypothalamic, basolateral amygdaloid and raphe nuclei. MicroPET brain imaging of 5-HT{sub 1A} receptors with [{sup 18}F]MPPF thus represents a promising avenue for investigating 5-HT{sub 1A} receptor function in rat. (orig.)

  13. Test-retest reliability of knee extensor rate of velocity and power development in older adults using the isotonic mode on a Biodex System 3 dynamometer.

    Science.gov (United States)

    Van Driessche, Stijn; Van Roie, Evelien; Vanwanseele, Benedicte; Delecluse, Christophe

    2018-01-01

    Isotonic testing and measures of rapid power production are emerging as functionally relevant test methods for detection of muscle aging. Our objective was to assess reliability of rapid velocity and power measures in older adults using the isotonic mode of an isokinetic dynamometer. Sixty-three participants (aged 65 to 82 years) underwent a test-retest protocol with one week time interval. Isotonic knee extension tests were performed at four different loads: 0%, 25%, 50% and 75% of maximal isometric strength. Peak velocity (pV) and power (pP) were determined as the highest values of the velocity and power curve. Rate of velocity (RVD) and power development (RPD) were calculated as the linear slopes of the velocity- and power-time curve. Relative and absolute measures of test-retest reliability were analyzed using intraclass correlation coefficients (ICC), standard error of measurement (SEM) and Bland-Altman analyses. Overall, reliability was high for pV, pP, RVD and RPD at 0%, 25% and 50% load (ICC: .85 - .98, SEM: 3% - 10%). A trend for increased reliability at lower loads seemed apparent. The tests at 75% load led to range of motion failure and should be avoided. In addition, results demonstrated that caution is advised when interpreting early phase results (first 50ms). To conclude, our results support the use of the isotonic mode of an isokinetic dynamometer for testing rapid power and velocity characteristics in older adults, which is of high clinical relevance given that these muscle characteristics are emerging as the primary outcomes for preventive and rehabilitative interventions in aging research.

  14. Two Year Longitudinal Change and Test-Retest-Precision of Knee Cartilage Morphology in a Pilot Study for the Osteoarthritis Initiative

    Science.gov (United States)

    Eckstein, Felix; Kunz, Manuela; Schutzer, Matt; Hudelmaier, Martin; Jackson, Rebecca D.; Yu, Joseph; Eaton, Charles B.; Schneider, Erika

    2009-01-01

    Objective Fast low angle shot (FLASH) and double echo steady state (DESS) MRI sequences were recently cross-calibrated for quantification of cartilage morphology at 3 Tesla. In this pilot study for the Osteoarthritis Initiative we compare their test-retest precision and sensitivity to longitudinal change. Method 9 participants with mild to moderate clinical OA were imaged at baseline, year 1 and year 2. Coronal 1.5mm FLASH and sagittal 0.7mm DESS sequences were acquired; 1.5mm coronal multiplanar reformats (MPR) were obtained from the DESS. Patellar, femoral and tibial cartilage plates were quantified in paired fashion, with blinding to time point. Results In the weight-bearing femorotibial joint, average precision errors across plates were 1.8% for FLASH, 2.6% for DESS, and 3.0% for MPR-DESS. Volume loss at year 1 was not significant; at year 2 the average change across the femorotibial cartilage plates was −1.7% for FLASH, −2.8% for DESS, and −0.3% for MPR-DESS. Volume change in the lateral tibia (−5.5%; p<0.03), and in the medial (−2.9%; p<0.04) and lateral femorotibial compartment (−3.8%; p<0.03) were significant for DESS. Conclusion FLASH, MPR-DESS and DESS all displayed adequate test-retest precision. Although the comparison between protocols is limited by the small number of participants and by the relatively small longitudinal change in cartilage morphology in this pilot study, the data suggest that significant change can be detected with MRI in a small sample of OA subjects over 2 years. PMID:17560813

  15. Research Review: Test-retest reliability of standardized diagnostic interviews to assess child and adolescent psychiatric disorders: a systematic review and meta-analysis.

    Science.gov (United States)

    Duncan, Laura; Comeau, Jinette; Wang, Li; Vitoroulis, Irene; Boyle, Michael H; Bennett, Kathryn

    2018-02-19

    A better understanding of factors contributing to the observed variability in estimates of test-retest reliability in published studies on standardized diagnostic interviews (SDI) is needed. The objectives of this systematic review and meta-analysis were to estimate the pooled test-retest reliability for parent and youth assessments of seven common disorders, and to examine sources of between-study heterogeneity in reliability. Following a systematic review of the literature, multilevel random effects meta-analyses were used to analyse 202 reliability estimates (Cohen's kappa = ҡ) from 31 eligible studies and 5,369 assessments of 3,344 children and youth. Pooled reliability was moderate at ҡ = .58 (CI 95% 0.53-0.63) and between-study heterogeneity was substantial (Q = 2,063 (df = 201), p reliability varied across informants for specific types of psychiatric disorder (ҡ = .53-.69 for parent vs. ҡ = .39-.68 for youth) with estimates significantly higher for parents on attention deficit hyperactivity disorder, oppositional defiant disorder and the broad groupings of externalizing and any disorder. Reliability was also significantly higher in studies with indicators of poor or fair study methodology quality (sample size reliability of SDIs and the usefulness of these tools in both clinical and research contexts. Potential remedies include the introduction of standardized study and reporting requirements for reliability studies, and exploration of other approaches to assessing and classifying child and adolescent psychiatric disorder. © 2018 Association for Child and Adolescent Mental Health.

  16. Stability of person ability measures in people with acquired brain injury in the use of everyday technology: the test-retest reliability of the Management of Everyday Technology Assessment (META).

    Science.gov (United States)

    Malinowsky, Camilla; Kassberg, Ann-Charlotte; Larsson-Lund, Maria; Kottorp, Anders

    2016-01-01

    To evaluate the test-retest reliability of the Management of Everyday Technology Assessment (META) in a sample of people with acquired brain injury (ABI). The META was administered twice within a two-week period to 25 people with ABI. A Rasch measurement model was used to convert the META ordinal raw scores into equal-interval linear measures of each participant's ability to manage everyday technology (ET). Test-retest reliability of the stability of the person ability measures in the META was examined by a standardized difference Z-test and an intra-class correlations analysis (ICC 1). The results showed that the paired person ability measures generated from the META were stable over the test-retest period for 22 of the 25 subjects. The ICC 1 correlation was 0.63, which indicates good overall reliability. The META demonstrated acceptable test-retest reliability in a sample of people with ABI. The results illustrate the importance of using sufficiently challenging ETs (relative to a person's abilities) to generate stable META measurements over time. Implications for Rehabilitation The findings add evidence regarding the test-retest reliability of the person ability measures generated from the observation assessment META in a sample of people with ABI. The META might support professionals in the evaluation of interventions that are designed to improve clients' performance of activities including the ability to manage ET.

  17. Escala Razões para Fumar Modificada: tradução e adaptação cultural para o português para uso no Brasil e avaliação da confiabilidade teste-reteste Modified Reasons for Smoking Scale: translation to Portuguese, cross-cultural adaptation for use in Brazil and evaluation of test-retest reliability

    Directory of Open Access Journals (Sweden)

    Elisa Sebba Tosta de Souza

    2009-07-01

    Full Text Available OBJETIVO: Traduzir, fazer a adaptação cultural e testar a confiabilidade teste-reteste de uma versão em língua portuguesa da Escala Razões Para Fumar Modificada (ERPFM para uso no Brasil. MÉTODOS: Uma versão em língua inglesa da ERPFM foi traduzida por médicos brasileiros com profundo conhecimento sobre a língua inglesa. Uma versão de consenso foi obtida por grupo multidisciplinar composto por dois pneumologistas, um psiquiatra e um psicólogo. Essa versão foi traduzida de volta ao inglês por um tradutor americano. A avaliação da adaptação cultural da versão final foi efetuada em uma amostra de 20 fumantes saudáveis. A avaliação da confiabilidade teste-reteste foi feita pela aplicação da versão traduzida da escala em 54 fumantes saudáveis em duas ocasiões separadas por 15 dias. RESULTADOS: Essa versão traduzida da ERPFM exibiu excelente identidade cultural, sendo bem compreendida por 95% dos fumantes. Os graus de concordância das respostas em duas ocasiões distintas foram quase perfeito para duas questões, substancial para dez questões, moderado para oito questões e discreto para uma questão. Os valores dos coeficientes de correlação intraclasse dos fatores motivacionais em duas ocasiões, empregando-se modelos teóricos previamente publicados, foram superiores a 0,7 em seis dos sete domínios. CONCLUSÕES: A presente versão da ERPFM exibe identidade cultural e confiabilidade teste-reteste satisfatórias, podendo ser de utilidade no tratamento e na avaliação de tabagistas em nosso meio.OBJECTIVE: To translate the Modified Reasons for Smoking Scale (MRSS to Portuguese, to submit it to cross-cultural adaptation for use in Brazil and to evaluate the test-retest reliability of the translated version. METHODS: An English-language version of the MRSS was translated to Portuguese by Brazilian doctors who have thorough knowledge of the English language. A consensus version was produced by a multidisciplinary group

  18. The Parsing Syllable Envelopes Test for Assessment of Amplitude Modulation Discrimination Skills in Children: Development, Normative Data, and Test-Retest Reliability Studies.

    Science.gov (United States)

    Cameron, Sharon; Chong-White, Nicky; Mealings, Kiri; Beechey, Tim; Dillon, Harvey; Young, Taegan

    2018-02-01

    Intensity peaks and valleys in the acoustic signal are salient cues to syllable structure, which is accepted to be a crucial early step in phonological processing. As such, the ability to detect low-rate (envelope) modulations in signal amplitude is essential to parse an incoming speech signal into smaller phonological units. The Parsing Syllable Envelopes (ParSE) test was developed to quantify the ability of children to recognize syllable boundaries using an amplitude modulation detection paradigm. The envelope of a 750-msec steady-state /a/ vowel is modulated into two or three pseudo-syllables using notches with modulation depths varying between 0% and 100% along an 11-step continuum. In an adaptive three-alternative forced-choice procedure, the participant identified whether one, two, or three pseudo-syllables were heard. Development of the ParSE stimuli and test protocols, and collection of normative and test-retest reliability data. Eleven adults (aged 23 yr 10 mo to 50 yr 9 mo, mean 32 yr 10 mo) and 134 typically developing, primary-school children (aged 6 yr 0 mo to 12 yr 4 mo, mean 9 yr 3 mo). There were 73 males and 72 females. Data were collected using a touchscreen computer. Psychometric functions (PFs) were automatically fit to individual data by the ParSE software. Performance was related to the modulation depth at which syllables can be detected with 88% accuracy (referred to as the upper boundary of the uncertainty region [UBUR]). A shallower PF slope reflected a greater level of uncertainty. Age effects were determined based on raw scores. z Scores were calculated to account for the effect of age on performance. Outliers, and individual data for which the confidence interval of the UBUR exceeded a maximum allowable value, were removed. Nonparametric tests were used as the data were skewed toward negative performance. Across participants, the performance criterion (UBUR) was met with a median modulation depth of 42%. The effect of age on the UBUR was

  19. The Phoneme Identification Test for Assessment of Spectral and Temporal Discrimination Skills in Children: Development, Normative Data, and Test-Retest Reliability Studies.

    Science.gov (United States)

    Cameron, Sharon; Chong-White, Nicky; Mealings, Kiri; Beechey, Tim; Dillon, Harvey; Young, Taegan

    2018-02-01

    Previous research suggests that a proportion of children experiencing reading and listening difficulties may have an underlying primary deficit in the way that the central auditory nervous system analyses the perceptually important, rapidly varying, formant frequency components of speech. The Phoneme Identification Test (PIT) was developed to investigate the ability of children to use spectro-temporal cues to perceptually categorize speech sounds based on their rapidly changing formant frequencies. The PIT uses an adaptive two-alternative forced-choice procedure whereby the participant identifies a synthesized consonant-vowel (CV) (/ba/ or /da/) syllable. CV syllables differed only in the second formant (F2) frequency along an 11-step continuum (between 0% and 100%-representing an ideal /ba/ and /da/, respectively). The CV syllables were presented in either quiet (PIT Q) or noise at a 0 dB signal-to-noise ratio (PIT N). Development of the PIT stimuli and test protocols, and collection of normative and test-retest reliability data. Twelve adults (aged 23 yr 10 mo to 50 yr 9 mo, mean 32 yr 5 mo) and 137 typically developing, primary-school children (aged 6 yr 0 mo to 12 yr 4 mo, mean 9 yr 3 mo). There were 73 males and 76 females. Data were collected using a touchscreen computer. Psychometric functions were automatically fit to individual data by the PIT software. Performance was determined by the width of the continuum for which responses were neither clearly /ba/ nor /da/ (referred to as the uncertainty region [UR]). A shallower psychometric function slope reflected greater uncertainty. Age effects were determined based on raw scores. Z scores were calculated to account for the effect of age on performance. Outliers, and individual data for which the confidence interval of the UR exceeded a maximum allowable value, were removed. Nonparametric tests were used as the data were skewed toward negative performance. Across participants, the median value of the F2 range

  20. Test-retest reliability of the KINARM end-point robot for assessment of sensory, motor and neurocognitive function in young adult athletes.

    Directory of Open Access Journals (Sweden)

    Cameron S Mang

    Full Text Available Current assessment tools for sport-related concussion are limited by a reliance on subjective interpretation and patient symptom reporting. Robotic assessments may provide more objective and precise measures of neurological function than traditional clinical tests.To determine the reliability of assessments of sensory, motor and cognitive function conducted with the KINARM end-point robotic device in young adult elite athletes.Sixty-four randomly selected healthy, young adult elite athletes participated. Twenty-five individuals (25 M, mean age±SD, 20.2±2.1 years participated in a within-season study, where three assessments were conducted within a single season (assessments labeled by session: S1, S2, S3. An additional 39 individuals (28M; 22.8±6.0 years participated in a year-to-year study, where annual pre-season assessments were conducted for three consecutive seasons (assessments labeled by year: Y1, Y2, Y3. Forty-four parameters from five robotic tasks (Visually Guided Reaching, Position Matching, Object Hit, Object Hit and Avoid, and Trail Making B and overall Task Scores describing performance on each task were quantified.Test-retest reliability was determined by intra-class correlation coefficients (ICCs between the first and second, and second and third assessments. In the within-season study, ICCs were ≥0.50 for 68% of parameters between S1 and S2, 80% of parameters between S2 and S3, and for three of the five Task Scores both between S1 and S2, and S2 and S3. In the year-to-year study, ICCs were ≥0.50 for 64% of parameters between Y1 and Y2, 82% of parameters between Y2 and Y3, and for four of the five Task Scores both between Y1 and Y2, and Y2 and Y3.Overall, the results suggest moderate-to-good test-retest reliability for the majority of parameters measured by the KINARM robot in healthy young adult elite athletes. Future work will consider the potential use of this information for clinical assessment of concussion

  1. Test-retest reliability of the novel 5-HT{sub 1B} receptor PET radioligand [{sup 11}C]P943

    Energy Technology Data Exchange (ETDEWEB)

    Saricicek, Aybala [Yale University, Department of Psychiatry, New Haven, CT (United States); Connecticut Mental Health Center, Abraham Ribicoff Research Facilities, New Haven, CT (United States); Izmir Katip Celebi University, Department of Psychiatry, Izmir (Turkey); Chen, Jason; Ruf, Barbara [Yale University, Department of Psychiatry, New Haven, CT (United States); Planeta, Beata; Labaree, David; Gallezot, Jean-Dominique; Huang, Yiyun [Yale University, PET Center, Department of Diagnostic Radiology, New Haven, CT (United States); Subramanyam, Kalyani; Maloney, Kathleen [Yale University, Department of Psychiatry, New Haven, CT (United States); Connecticut Mental Health Center, Abraham Ribicoff Research Facilities, New Haven, CT (United States); Matuskey, David [Yale University, Department of Psychiatry, New Haven, CT (United States); Connecticut Mental Health Center, Abraham Ribicoff Research Facilities, New Haven, CT (United States); Yale University, PET Center, Department of Diagnostic Radiology, New Haven, CT (United States); Deserno, Lorenz [Charite - Universitaetsmedizin Berlin, Department of Psychiatry and Psychotherapy, Campus Charite Mitte, Berlin (Germany); Max-Planck-Institute for Human Cognitive and Brain Sciences, Leipzig, Berlin (Germany); Neumeister, Alexander [Yale University, Department of Psychiatry, New Haven, CT (United States); Mount Sinai School of Medicine, Department of Psychiatry, New York, NY (United States); VA Connecticut Healthcare System, Clinical Neuroscience Division, VA National Center for PTSD, West Haven, CT (United States); Krystal, John H. [Yale University, Department of Psychiatry, New Haven, CT (United States); Connecticut Mental Health Center, Abraham Ribicoff Research Facilities, New Haven, CT (United States); VA Connecticut Healthcare System, Clinical Neuroscience Division, VA National Center for PTSD, West Haven, CT (United States); Carson, Richard E. [Connecticut Mental Health Center, Abraham Ribicoff Research Facilities, New Haven, CT (United States); Bhagwagar, Zubin [Yale University, Department of Psychiatry, New Haven, CT (United States); Connecticut Mental Health Center, Abraham Ribicoff Research Facilities, New Haven, CT (United States); Bristol-Myers Squibb, Wallingford, CT (United States)

    2014-11-27

    [{sup 11}C]P943 is a novel, highly selective 5-HT{sub 1B} PET radioligand. The aim of this study was to determine the test-retest reliability of [{sup 11}C]P943 using two different modeling methods and to perform a power analysis with each quantification technique. Seven healthy volunteers underwent two PET scans on the same day. Regions of interest (ROIs) were the amygdala, hippocampus, pallidum, putamen, insula, frontal, anterior cingulate, parietal, temporal and occipital cortices, and cerebellum. Two multilinear radioligand quantification techniques were used to estimate binding potential: MA1, using arterial input function data, and the second version of the multilinear reference tissue model analysis (MRTM2), using the cerebellum as the reference region. Between-scan percent variability and intraclass correlation coefficients (ICC) were used to assess test-retest reliability. We also performed power analyses to determine the method that would allow the least number of subjects using within-subject or between-subject study designs. A voxel-wise ICC analysis for MRTM2 BP{sub ND} was performed for the whole brain and all the ROIs studied. Mean percent variability between two scans across regions ranged between 0.4 % and 12.4 % for MA1 BP{sub ND}, 0.5 % and 11.5 % for MA1 BP{sub P}, 16.7 % and 28.3 % for MA1 BP{sub F}, and between 0.2 % and 5.4 % for MRTM2 BP{sub ND}. The power analyses showed a greater number of subjects were required using MA1 BP{sub F} compared with other outcome measures for both within-subject and between-subject study designs. ICC values were the highest using MRTM2 BP{sub ND} and the lowest with MA1 BP{sub F} in ten ROIs. Small regions and regions with low binding had lower ICC values than large regions and regions with high binding. Reliable measures of 5-HT{sub 1B} receptor binding can be obtained using the novel PET radioligand [{sup 11}C]P943. Quantification of 5-HT{sub 1B} receptor binding with MRTM2 BP{sub ND} and with MA1 BP{sub P

  2. Evaluation of the Relative Validity and Test-Retest Reliability of a 15-Item Beverage Intake Questionnaire in Children and Adolescents.

    Science.gov (United States)

    Hill, Catelyn E; MacDougall, Carly R; Riebl, Shaun K; Savla, Jyoti; Hedrick, Valisa E; Davy, Brenda M

    2017-11-01

    Added sugar intake, in the form of sugar-sweetened beverages (SSBs), may contribute to weight gain and obesity development in children and adolescents. A valid and reliable brief beverage intake assessment tool for children and adolescents could facilitate research in this area. The purpose of this investigation was to evaluate the relative validity and test-retest reliability of a 15-item beverage intake questionnaire (BEVQ) for assessing usual beverage intake in children and adolescents. This cross-sectional investigation included four study visits within a 2- to 3-week time period. Participants (333 enrolled; 98% completion rate) were children aged 6 to 11 years and adolescents aged 12 to18 years recruited from the New River Valley, VA, region from January 2014 to September 2015. Study visits included assessment of height/weight, health history, and four 24-hour dietary recalls (24HRs). The BEVQ was completed at two visits (BEVQ 1, BEVQ 2). To evaluate relative validity, BEVQ 1 was compared with habitual beverage intake determined by the averaged 24HR. To evaluate test-retest reliability, BEVQ 1 was compared with BEVQ 2. Analyses included descriptive statistics, independent sample t tests, χ 2 tests, one-way analysis of variance, paired sample t tests, and correlational analyses. In the full sample, self-reported water and total SSB intake were not different between BEVQ 1 and 24HR (mean differences 0±1 fl oz and 0±1 fl oz, respectively; both P values >0.05). Reported intake across all beverage categories was significantly correlated between BEVQ 1 and BEVQ 2 (Pbeverages was not different (all P values >0.05) between BEVQ 1 and 24HR (mean differences: whole milk=3±4 kcal, reduced-fat milk=9±5 kcal, and fat-free milk=7±6 kcal, which is 7±15 total beverage kilocalories). In adolescents (n=200), water and SSB kilocalories were not different (both P values >0.05) between BEVQ 1 and 24HR (mean differences: -1±1 fl oz and 12±9 kcal, respectively). A 15

  3. Internal consistency, test-retest reliability and measurement error of the self-report version of the social skills rating system in a sample of Australian adolescents.

    Directory of Open Access Journals (Sweden)

    Sharmila Vaz

    Full Text Available The social skills rating system (SSRS is used to assess social skills and competence in children and adolescents. While its characteristics based on United States samples (US are published, corresponding Australian figures are unavailable. Using a 4-week retest design, we examined the internal consistency, retest reliability and measurement error (ME of the SSRS secondary student form (SSF in a sample of Year 7 students (N = 187, from five randomly selected public schools in Perth, western Australia. Internal consistency (IC of the total scale and most subscale scores (except empathy on the frequency rating scale was adequate to permit independent use. On the importance rating scale, most IC estimates for girls fell below the benchmark. Test-retest estimates of the total scale and subscales were insufficient to permit reliable use. ME of the total scale score (frequency rating for boys was equivalent to the US estimate, while that for girls was lower than the US error. ME of the total scale score (importance rating was larger than the error using the frequency rating scale. The study finding supports the idea of using multiple informants (e.g. teacher and parent reports, not just student as recommended in the manual. Future research needs to substantiate the clinical meaningfulness of the MEs calculated in this study by corroborating them against the respective Minimum Clinically Important Difference (MCID.

  4. Internal consistency, test-retest reliability and measurement error of the self-report version of the social skills rating system in a sample of Australian adolescents.

    Science.gov (United States)

    Vaz, Sharmila; Parsons, Richard; Passmore, Anne Elizabeth; Andreou, Pantelis; Falkmer, Torbjörn

    2013-01-01

    The social skills rating system (SSRS) is used to assess social skills and competence in children and adolescents. While its characteristics based on United States samples (US) are published, corresponding Australian figures are unavailable. Using a 4-week retest design, we examined the internal consistency, retest reliability and measurement error (ME) of the SSRS secondary student form (SSF) in a sample of Year 7 students (N = 187), from five randomly selected public schools in Perth, western Australia. Internal consistency (IC) of the total scale and most subscale scores (except empathy) on the frequency rating scale was adequate to permit independent use. On the importance rating scale, most IC estimates for girls fell below the benchmark. Test-retest estimates of the total scale and subscales were insufficient to permit reliable use. ME of the total scale score (frequency rating) for boys was equivalent to the US estimate, while that for girls was lower than the US error. ME of the total scale score (importance rating) was larger than the error using the frequency rating scale. The study finding supports the idea of using multiple informants (e.g. teacher and parent reports), not just student as recommended in the manual. Future research needs to substantiate the clinical meaningfulness of the MEs calculated in this study by corroborating them against the respective Minimum Clinically Important Difference (MCID).

  5. Test-retest reliability and agreement of the SPI-Questionnaire to detect symptoms of digital ischemia in elite volleyball players.

    Science.gov (United States)

    van de Pol, Daan; Zacharian, Tigran; Maas, Mario; Kuijer, P Paul F M

    2017-06-01

    The Shoulder posterior circumflex humeral artery Pathology and digital Ischemia - questionnaire (SPI-Q) has been developed to enable periodic surveillance of elite volleyball players, who are at risk for digital ischemia. Prior to implementation, assessing reliability is mandatory. Therefore, the test-retest reliability and agreement of the SPI-Q were evaluated among the population at risk. A questionnaire survey was performed with a 2-week interval among 65 elite male volleyball players assessing symptoms of cold, pale and blue digits in the dominant hand during or after practice or competition using a 4-point Likert scale (never, sometimes, often and always). Kappa (κ) and percentage of agreement (POA) were calculated for individual symptoms, and to distinguish symptomatic and asymptomatic players. For the individual symptoms, κ ranged from "poor" (0.25) to "good" (0.63), and POA ranged from "moderate" (78%) to "good" (97%). To classify symptomatic players, the SPI-Q showed "good" reliability (κ = 0.83; 95%CI 0.69-0.97) and "good" agreement (POA = 92%). The current study has proven the SPI-Q to be reliable for detecting elite male indoor volleyball players with symptoms of digital ischemia.

  6. Reproducibility assessment of brain responses to visual food stimuli in adults with overweight and obesity.

    Science.gov (United States)

    Drew Sayer, R; Tamer, Gregory G; Chen, Ningning; Tregellas, Jason R; Cornier, Marc-Andre; Kareken, David A; Talavage, Thomas M; McCrory, Megan A; Campbell, Wayne W

    2016-10-01

    The brain's reward system influences ingestive behavior and subsequently obesity risk. Functional magnetic resonance imaging (fMRI) is a common method for investigating brain reward function. This study sought to assess the reproducibility of fasting-state brain responses to visual food stimuli using BOLD fMRI. A priori brain regions of interest included bilateral insula, amygdala, orbitofrontal cortex, caudate, and putamen. Fasting-state fMRI and appetite assessments were completed by 28 women (n = 16) and men (n = 12) with overweight or obesity on 2 days. Reproducibility was assessed by comparing mean fasting-state brain responses and measuring test-retest reliability of these responses on the two testing days. Mean fasting-state brain responses on day 2 were reduced compared with day 1 in the left insula and right amygdala, but mean day 1 and day 2 responses were not different in the other regions of interest. With the exception of the left orbitofrontal cortex response (fair reliability), test-retest reliabilities of brain responses were poor or unreliable. fMRI-measured responses to visual food cues in adults with overweight or obesity show relatively good mean-level reproducibility but considerable within-subject variability. Poor test-retest reliability reduces the likelihood of observing true correlations and increases the necessary sample sizes for studies. © 2016 The Obesity Society.

  7. Test-Retest Reliability and Minimal Detectable Change of Randomized Dichotic Digits in Learning-Disabled Children: Implications for Dichotic Listening Training.

    Science.gov (United States)

    Mahdavi, Mohammad Ebrahim; Pourbakht, Akram; Parand, Akram; Jalaie, Shohreh

    2018-03-01

    Evaluation of dichotic listening to digits is a common part of many studies for diagnosis and managing auditory processing disorders in children. Previous researchers have verified test-retest relative reliability of dichotic digits results in normal children and adults. However, detecting intervention-related changes in the ear scores after dichotic listening training requires information regarding trial-to-trial typical variation of individual ear scores that is estimated using indices of absolute reliability. Previous studies have not addressed absolute reliability of dichotic listening results. To compare the results of the Persian randomized dichotic digits test (PRDDT) and its relative and absolute indices of reliability between typical achieving (TA) and learning-disabled (LD) children. A repeated measures observational study. Fifteen LD children were recruited from a previously performed study with age range of 7-12 yr. The control group consisted of 15 TA schoolchildren with age range of 8-11 yr. The Persian randomized dichotic digits test was administered on the children under free recall condition in two test sessions 7-12 days apart. We compared the average of the ear scores and ear advantage between TA and LD children. Relative indices of reliability included Pearson's correlation and intraclass correlation (ICC 2,1 ) coefficients and absolute reliability was evaluated by calculation of standard error of measurement (SEM) and minimal detectable change (MDC) using the raw ear scores. The Pearson correlation coefficient indicated that in both groups of children the ear scores of test and retest sessions were strongly and positively (greater than +0.8) correlated. The ear scores showed excellent ICC coefficient of consistency (0.78-0.82) and fair to excellent ICC coefficient of absolute agreement (0.62-0.74) in TA children and excellent ICC coefficients of consistency and absolute agreement in LD children (0.76-0.87). SEM and SEM% of the ear scores in TA

  8. Test-retest reliability of spatial and temporal gait parameters in children with cerebral palsy as measured by an electronic walkway.

    Science.gov (United States)

    Sorsdahl, Anne Brit; Moe-Nilssen, Rolf; Strand, Liv Inger

    2008-01-01

    The purpose of this study was to examine test-retest reliability of seven selected temporal and spatial gait parameters and asymmetry measures in children with cerebral palsy. Seventeen children with CP between 3 and 13 years of age walked at three different speeds across an electronic walkway of 5.2m. The tests were repeated after approximately 25 min. The scores were normalized to a walking speed of 1.1m/s to avoid the confounding effect of gait speed on speed dependent gait parameters. Intraclass correlation coefficients (ICC(1,1) and ICC(3,1)) with 95% confidence intervals, within-subject standard deviation (S(w)) and smallest detectable difference (SDD) were calculated. The relative reliability of cadence, step length, stride length and single stance time was high to excellent (ICC(1,1) between 0.73 and 0.95), while it was poor for step width (ICC(1,1)=0.27 and 0.35). The relative reliability for two calculated asymmetry measures were high for the step length index (ICC(1,1)=0.82) and moderate for the single stance time index (ICC(1,1)=0.49). The absolute reliability values for all gait parameters are reported. Five of seven gait parameters measured by an electronic walkway and normalized to a common walking speed, appear to be highly repeatable in a short-term time span in children with CP who were able to walk without assistive walking devices, provided sufficient cognitive function.

  9. TEST-RETEST RELIABILITY OF THE CLOSED KINETIC CHAIN UPPER EXTREMITY STABILITY TEST (CKCUEST) IN ADOLESCENTS: RELIABILITY OF CKCUEST IN ADOLESCENTS.

    Science.gov (United States)

    de Oliveira, Valéria M A; Pitangui, Ana C R; Nascimento, Vinícius Y S; da Silva, Hítalo A; Dos Passos, Muana H P; de Araújo, Rodrigo C

    2017-02-01

    The Closed Kinetic Chain Upper Extremity Stability Test (CKCUEST) has been proposed as an option to assess upper limb function and stability; however, there are few studies that support the use of this test in adolescents. The purpose of the present study was to investigate the intersession reliability and agreement of three CKCUEST scores in adolescents and establish clinimetric values for this test. Test-retest reliability. Twenty-five healthy adolescents of both sexes were evaluated. The subjects performed two CKCUEST with an interval of one week between the tests. An intraclass correlation coefficient (ICC 3,3 ) two-way mixed model with a 95% interval of confidence was utilized to determine intersession reliability. A Bland-Altman graph was plotted to analyze the agreement between assessments. The presence of systematic error was evaluated by a one-sample t test. The difference between the evaluation and reevaluation was observed using a paired-sample t test. The level of significance was set at 0.05. Standard error of measurements and minimum detectable changes were calculated. The intersession reliability of the average touches score, normalized score, and power score were 0.68, 0.68 and 0.87, the standard error of measurement were 2.17, 1.35 and 6.49, and the minimal detectable change was 6.01, 3.74 and 17.98, respectively. The presence of systematic error (p test with moderate to excellent reliability when used with adolescents. The CKCUEST is a measurement with moderate to excellent reliability for adolescents. 2b.

  10. Test-retest reliability, smallest real difference and concurrent validity of six different balance tests on young people with mild to moderate intellectual disability.

    Science.gov (United States)

    Blomqvist, Sven; Wester, Anita; Sundelin, Gunnevi; Rehn, Börje

    2012-12-01

    Some studies have reported that people with intellectual disability may have reduced balance ability compared with the population in general. However, none of these studies involved adolescents, and the reliability and validity of balance tests in this population are not known. The purpose of this study was to examine the reliability of six different balance tests and to investigate their concurrent validity. Test-retest reliability assessment. All subjects were recruited from a special school for people with intellectual disability in Bollnäs, Sweden. Eighty-nine adolescents (35 females and 54 males) with mild to moderate intellectual disability with a mean age of 18 years (range 16 to 20 years). All subjects followed the same test protocol on two occasions within an 11-day period. Balance test performances. Intraclass correlation coefficients greater than 0.80 were achieved for four of the balance tests: Extended Timed Up and Go Test, Modified Functional Reach Test, One-leg Stance Test and Force Platform Test. The smallest real differences ranged from 12% to 40%; less than 20% is considered to be low. Concurrent validity among these balance tests varied between no and low correlation. The results indicate that these tests could be used to evaluate changes in balance ability over time in people with mild to moderate intellectual disability. The low concurrent validity illustrates the importance of knowing more about the influence of various sensory subsystems that are significant for balance among adolescents with intellectual disability. Copyright © 2011 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.

  11. Characterization of regional left ventricular function in nonhuman primates using magnetic resonance imaging biomarkers: a test-retest repeatability and inter-subject variability study.

    Directory of Open Access Journals (Sweden)

    Smita Sampath

    Full Text Available Pre-clinical animal models are important to study the fundamental biological and functional mechanisms involved in the longitudinal evolution of heart failure (HF. Particularly, large animal models, like nonhuman primates (NHPs, that possess greater physiological, biochemical, and phylogenetic similarity to humans are gaining interest. To assess the translatability of these models into human diseases, imaging biomarkers play a significant role in non-invasive phenotyping, prediction of downstream remodeling, and evaluation of novel experimental therapeutics. This paper sheds insight into NHP cardiac function through the quantification of magnetic resonance (MR imaging biomarkers that comprehensively characterize the spatiotemporal dynamics of left ventricular (LV systolic pumping and LV diastolic relaxation. MR tagging and phase contrast (PC imaging were used to quantify NHP cardiac strain and flow. Temporal inter-relationships between rotational mechanics, myocardial strain and LV chamber flow are presented, and functional biomarkers are evaluated through test-retest repeatability and inter subject variability analyses. The temporal trends observed in strain and flow was similar to published data in humans. Our results indicate a dominant dimension based pumping during early systole, followed by a torsion dominant pumping action during late systole. Early diastole is characterized by close to 65% of untwist, the remainder of which likely contributes to efficient filling during atrial kick. Our data reveal that moderate to good intra-subject repeatability was observed for peak strain, strain-rates, E/circumferential strain-rate (CSR ratio, E/longitudinal strain-rate (LSR ratio, and deceleration time. The inter-subject variability was high for strain dyssynchrony, diastolic strain-rates, peak torsion and peak untwist rate. We have successfully characterized cardiac function in NHPs using MR imaging. Peak strain, average systolic strain

  12. The Test-Retest Reliability OfTthe Onset Of Core And Vasti Eectromyographic Activity While Ascending And Descending Stairs In Healthy Controls Aand patellofemoral Pain Patients

    Directory of Open Access Journals (Sweden)

    Mohammad-Ali Sanjari

    2011-02-01

    Full Text Available Backgroundentity.It is hypothesized to result from abnormal patellar tracking caused by altered motorcontrol. Deficit in neuromotor control of the core may be a remote contributing factor to thedevelopment of PFP. Application of reliable EMG measures would be helpful to handle thistheory. Therefore, the purpose of this study was to determine the test-retest reliability of thecore and vasti EMG onsets, while ascending/descending stairs.: Patellofemoral pain (PFP is a common affliction and complex clinicalMethodsand Core EMG onsets during stair stepping were assessed two times a day. Intraclass correlationcoefficients (ICCs and standard errors of measurement (SEMs were calculated.: Ten males with PFP and ten healthy controls participated in this study. VastiResultsonsets of control cases (ICC 3,1 ≥ 0.70 except Quadratus Lumborum (QL which showeda moderate reliability (ICC for ascending=0.59 and for descending = 0.61. In controls,Vasti in both tasks showed the highest absolute reliability. During ascending, highreliability (ICC ≥ 0.70 in PFP group was demonstrated for all EMG onsets except Gluteusmaximus (GMAX and QL which showed a moderate reliability (ICC = 0.69 and 0.63 respectively.In this group while descending stairs, all EMG onsets showed high relativereliability (ICC ≥ 0.70. Moderate to high absolute reliability was obtained for onset timeswhile ascending/descending stairs in PFP group.: During both ascending/descending, high reliability was found for all EMGConclusionreliability.: Most EMG onsets during stair scending/descending had moderate to high

  13. Assessment of isometric muscle strength and rate of torque development with hand-held dynamometry: Test-retest reliability and relationship with gait velocity after stroke.

    Science.gov (United States)

    Mentiplay, Benjamin F; Tan, Dawn; Williams, Gavin; Adair, Brooke; Pua, Yong-Hao; Bower, Kelly J; Clark, Ross A

    2018-04-27

    Isometric rate of torque development examines how quickly force can be exerted and may resemble everyday task demands more closely than isometric strength. Rate of torque development may provide further insight into the relationship between muscle function and gait following stroke. Aims of this study were to examine the test-retest reliability of hand-held dynamometry to measure isometric rate of torque development following stroke, to examine associations between strength and rate of torque development, and to compare the relationships of strength and rate of torque development to gait velocity. Sixty-three post-stroke adults participated (60 years, 34 male). Gait velocity was assessed using the fast-paced 10 m walk test. Isometric strength and rate of torque development of seven lower-limb muscle groups were assessed with hand-held dynamometry. Intraclass correlation coefficients were calculated for reliability and Spearman's rho correlations were calculated for associations. Regression analyses using partial F-tests were used to compare strength and rate of torque development in their relationship with gait velocity. Good to excellent reliability was shown for strength and rate of torque development (0.82-0.97). Strong associations were found between strength and rate of torque development (0.71-0.94). Despite high correlations between strength and rate of torque development, rate of torque development failed to provide significant value to regression models that already contained strength. Assessment of isometric rate of torque development with hand-held dynamometry is reliable following stroke, however isometric strength demonstrated greater relationships with gait velocity. Further research should examine the relationship between dynamic measures of muscle strength/torque and gait after stroke. Copyright © 2018 Elsevier Ltd. All rights reserved.

  14. Graph Theoretical Analysis of Functional Brain Networks: Test-Retest Evaluation on Short- and Long-Term Resting-State Functional MRI Data

    Science.gov (United States)

    Wang, Jin-Hui; Zuo, Xi-Nian; Gohel, Suril; Milham, Michael P.; Biswal, Bharat B.; He, Yong

    2011-01-01

    Graph-based computational network analysis has proven a powerful tool to quantitatively characterize functional architectures of the brain. However, the test-retest (TRT) reliability of graph metrics of functional networks has not been systematically examined. Here, we investigated TRT reliability of topological metrics of functional brain networks derived from resting-state functional magnetic resonance imaging data. Specifically, we evaluated both short-term (5 months apart) TRT reliability for 12 global and 6 local nodal network metrics. We found that reliability of global network metrics was overall low, threshold-sensitive and dependent on several factors of scanning time interval (TI, long-term>short-term), network membership (NM, networks excluding negative correlations>networks including negative correlations) and network type (NT, binarized networks>weighted networks). The dependence was modulated by another factor of node definition (ND) strategy. The local nodal reliability exhibited large variability across nodal metrics and a spatially heterogeneous distribution. Nodal degree was the most reliable metric and varied the least across the factors above. Hub regions in association and limbic/paralimbic cortices showed moderate TRT reliability. Importantly, nodal reliability was robust to above-mentioned four factors. Simulation analysis revealed that global network metrics were extremely sensitive (but varying degrees) to noise in functional connectivity and weighted networks generated numerically more reliable results in compared with binarized networks. For nodal network metrics, they showed high resistance to noise in functional connectivity and no NT related differences were found in the resistance. These findings provide important implications on how to choose reliable analytical schemes and network metrics of interest. PMID:21818285

  15. Can health workers reliably assess their own work? A test-retest study of bias among data collectors conducting a Lot Quality Assurance Sampling survey in Uganda.

    Science.gov (United States)

    Beckworth, Colin A; Davis, Rosemary H; Faragher, Brian; Valadez, Joseph J

    2015-03-01

    Lot Quality Assurance Sampling (LQAS) is a classification method that enables local health staff to assess health programmes for which they are responsible. While LQAS has been favourably reviewed by the World Bank and World Health Organization (WHO), questions remain about whether using local health staff as data collectors can lead to biased data. In this test-retest research, Pallisa Health District in Uganda is subdivided into four administrative units called supervision areas (SA). Data collectors from each SA conducted an LQAS survey. A week later, the data collectors were swapped to a different SA, outside their area of responsibility, to repeat the LQAS survey with the same respondents. The two data sets were analysed for agreement using Cohens' kappa coefficient and disagreements were analysed. Kappa values ranged from 0.19 to 0.97. On average, there was a moderate degree of agreement for knowledge indicators and a substantial level for practice indicators. Respondents were found to be systematically more knowledgeable on retest indicating bias favouring the retest, although no evidence of bias was found for practices indicators. In this initial study, using local health care providers to collect data did not bias data collection. The bias observed in the knowledge indicators is most likely due to the 'practice effect', whereby respondents increased their knowledge as a result of completing the first survey, as no corresponding effect was seen in the practices indicators. Published by Oxford University Press in association with The London School of Hygiene and Tropical Medicine © The Author 2014; all rights reserved.

  16. Can local staff reliably assess their own programs? A confirmatory test-retest study of Lot Quality Assurance Sampling data collectors in Uganda.

    Science.gov (United States)

    Beckworth, Colin A; Anguyo, Robert; Kyakulaga, Francis Cranmer; Lwanga, Stephen K; Valadez, Joseph J

    2016-08-17

    Data collection techniques that routinely provide health system information at the local level are in demand and needed. LQAS is intended for use by local health teams to collect data at the district and sub-district levels. Our question is whether local health staff produce biased results as they are responsible for implementing the programs they also assess. This test-retest study replicates on a larger scale an earlier LQAS reliability assessment in Uganda. We conducted in two districts an LQAS survey using 15 local health staff as data collectors. A week later, the data collectors swapped districts, where they acted as disinterested non-local data collectors, repeating the LQAS survey with the same respondents. We analysed the resulting two data sets for agreement using Cohens' Kappa. The average Kappa score for the knowledge indicators was k = 0.43 (SD = 0.16) and for practice indicators k = 0.63 (SD = 0.17). These scores show moderate agreement for knowledge indicators and substantial agreement for practice indicators. Analyses confirm that respondents were more knowledgeable on retest; no evidence of bias was found for practice indicators. The findings of this study are remarkably similar to those produced in the first reliability study. There is no evidence that using local healthcare staff to collect LQAS data biases data collection in an LQAS study. The bias observed in the knowledge indicators was most likely due to a 'practice effect', whereby respondents increased their knowledge as a result of completing the first survey; no corresponding effect was seen in the practice indicators.

  17. Test-retest reliability of prefrontal transcranial Direct Current Stimulation (tDCS) effects on functional MRI connectivity in healthy subjects.

    Science.gov (United States)

    Wörsching, Jana; Padberg, Frank; Helbich, Konstantin; Hasan, Alkomiet; Koch, Lena; Goerigk, Stephan; Stoecklein, Sophia; Ertl-Wagner, Birgit; Keeser, Daniel

    2017-07-15

    Transcranial Direct Current Stimulation (tDCS) of the prefrontal cortex (PFC) can be used for probing functional brain connectivity and meets general interest as novel therapeutic intervention in psychiatric and neurological disorders. Along with a more extensive use, it is important to understand the interplay between neural systems and stimulation protocols requiring basic methodological work. Here, we examined the test-retest (TRT) characteristics of tDCS-induced modulations in resting-state functional-connectivity MRI (RS fcMRI). Twenty healthy subjects received 20minutes of either active or sham tDCS of the dorsolateral PFC (2mA, anode over F3 and cathode over F4, international 10-20 system), preceded and ensued by a RS fcMRI (10minutes each). All subject underwent three tDCS sessions with one-week intervals in between. Effects of tDCS on RS fcMRI were determined at an individual as well as at a group level using both ROI-based and independent-component analyses (ICA). To evaluate the TRT reliability of individual active-tDCS and sham effects on RS fcMRI, voxel-wise intra-class correlation coefficients (ICC) of post-tDCS maps between testing sessions were calculated. For both approaches, results revealed low reliability of RS fcMRI after active tDCS (ICC (2,1) = -0.09 - 0.16). Reliability of RS fcMRI (baselines only) was low to moderate for ROI-derived (ICC (2,1) = 0.13 - 0.50) and low for ICA-derived connectivity (ICC (2,1) = 0.19 - 0.34). Thus, for ROI-based analyses, the distribution of voxel-wise ICC was shifted to lower TRT reliability after active, but not after sham tDCS, for which the distribution was similar to baseline. The intra-individual variation observed here resembles variability of tDCS effects in motor regions and may be one reason why in this study robust tDCS effects at a group level were missing. The data can be used for appropriately designing large scale studies investigating methodological issues such as sources of variability and

  18. Test-Retest Variability of Functional and Structural Parameters in Patients with Stargardt Disease Participating in the SAR422459 Gene Therapy Trial.

    Science.gov (United States)

    Parker, Maria A; Choi, Dongseok; Erker, Laura R; Pennesi, Mark E; Yang, Paul; Chegarnov, Elvira N; Steinkamp, Peter N; Schlechter, Catherine L; Dhaenens, Claire-Marie; Mohand-Said, Saddek; Audo, Isabelle; Sahel, Jose; Weleber, Richard G; Wilson, David J

    2016-10-01

    The goal of this analysis was to determine the test-retest variability of functional and structural measures from a cohort of patients with advanced forms of Stargardt Disease (STGD) participating in the SAR422459 (NCT01367444) gene therapy clinical trial. Twenty-two participants, aged 24 to 66, diagnosed with advanced forms of STGD, with at least one pathogenic ABCA4 mutation on each chromosome participating in the SAR422459 (NCT01367444) gene therapy clinical trial, were screened over three visits within 3 weeks or less. Functional visual evaluations included: best-corrected visual acuity (BCVA) Early Treatment Diabetic Retinopathy Study (ETDRS) letter score, semiautomated kinetic perimetry (SKP) using isopters I4e, III4e, and V4e, hill of vision (HOV) calculated from static visual fields (SVF) by using a 184n point centrally condensed grid with the stimulus size V test target. Retinal structural changes such as central macular thickness and macular volume were assessed by spectral-domain optical coherence tomography (SD-OCT). Repeatability coefficients (RC) and 95% confidential intervals (CI) were calculated for each parameter using a hierarchical mixed-effects model and bootstrapping. Criteria for statistically significant changes for various parameters were found to be the following: BCVA letter score (8 letters), SKP isopters I4e, III4e, and V4e (3478.85; 2488.02 and 2622.46 deg 2 , respectively), SVF full volume HOV (V TOT, 14.62 dB-sr), central macular thickness, and macular volume (4.27 μm and 0.15 mm 3 , respectively). This analysis provides important information necessary to determine if significant changes are occurring in structural and functional assessments commonly used to measure disease progression in this cohort of patients with STGD. Moreover, this information is useful for future trials assessing safety and efficacy of treatments in STGD. Determination of variability of functional and structural measures in participants with advanced stages of

  19. An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Word and Passage Reading Fluency Assessments: Grade 3. Technical Report #1218

    Science.gov (United States)

    Park, Bitnara Jasmine; Anderson, Daniel; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald

    2012-01-01

    This technical report is one in a series of five describing the reliability (test/retest and alternate form) and G-Theory/D-Study research on the easyCBM reading measures, grades 1-5. Data were gathered in the spring of 2011 from a convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest.…

  20. Demonstration of the test-retest reliability and sensitivity of the Lower Limb Functional Index-10 as a measure of functional recovery post burn injury: a cross-sectional repeated measures study design.

    Science.gov (United States)

    Ryland, Margaret E; Grisbrook, Tiffany L; Wood, Fiona M; Phillips, Michael; Edgar, Dale W

    2016-01-01

    Lower limb burns can significantly delay recovery of function. Measuring lower limb functional outcomes is challenging in the unique burn patient population and necessitates the use of reliable and valid tools. The aims of this study were to examine the test-retest reliability, sensitivity, and internal consistency of Sections 1 and 3 of the Lower Limb Functional Index-10 (LLFI-10) questionnaire for measuring functional ability in patients with lower limb burns over time. Twenty-nine adult patients who had sustained a lower limb burn injury in the previous 12 months completed the test-retest procedure of the study. In addition, the minimal detectable change (MDC) was calculated for Section 1 and 3 of the LLFI-10. Section 1 is focused on the activity limitations experienced by patients with a lower limb disorder whereas Section 3 involves patients indicating their current percentage of pre-injury duties. Section 1 of the LLFI-10 demonstrated excellent test-retest reliability (intra-class correlation coefficient (ICC) 0.98, 95 % CI 0.96-0.99) whilst Section 3 demonstrated high test-retest reliability (ICC 0.88, 95 % CI 0.79-0.94). MDC scores for Sections 1 and 3 were 1.27 points and 30.22 %, respectively. Internal consistency was demonstrated with a significant negative association (r s  = -0.83) between Sections 1 and 3 of the LLFI-10 (p reliable for measuring functional ability in patients who have sustained lower limb burns in the previous 12 months, and furthermore, Section 1 is sensitive to changes in patient function over time.

  1. Test-retest reliability and construct validity of the ENERGY-parent questionnaire on parenting practices, energy balance-related behaviours and their potential behavioural determinants: the ENERGY-project

    Directory of Open Access Journals (Sweden)

    Singh Amika S

    2012-08-01

    Full Text Available Abstract Background Insight in parental energy balance-related behaviours, their determinants and parenting practices are important to inform childhood obesity prevention. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. The objective of the current study was to examine the test-retest reliability and construct validity of the parent questionnaire used in the ENERGY-project, assessing parental energy balance-related behaviours, their determinants, and parenting practices among parents of 10–12 year old children. Findings We collected data among parents (n = 316 in the test-retest reliability study; n = 109 in the construct validity study of 10–12 year-old children in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent interview was assessed using ICC and percentage agreement. All but one item showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Construct validity appeared to be good to excellent for 92 out of 121 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 29 items, construct validity was moderate for 24 and poor for 5 items. Conclusions The reliability and construct validity of the items of the ENERGY-parent questionnaire on multiple energy balance-related behaviours, their potential determinants, and parenting practices appears to be good. Based on the results of the validity study, we strongly recommend adapting parts of the ENERGY-parent questionnaire if used in future research.

  2. Measurement of central μ-opioid receptor binding in vivo with PET and [11C]carfentanil: a test-retest study in healthy subjects

    International Nuclear Information System (INIS)

    Hirvonen, Jussi; Aalto, Sargo; Maksimow, Anu; Oikonen, Vesa; Naagren, Kjell; Hagelberg, Nora; Scheinin, Harry; Ingman, Kimmo; Virkkala, Jussi

    2009-01-01

    [ 11 C]Carfentanil has been widely used in positron emission tomography (PET) studies for measuring μ-opioid receptor binding in humans, but the reproducibility of the binding parameter estimates is unknown. Eight healthy volunteers were scanned twice during the same day with [ 11 C]carfentanil PET, and binding to receptors was assessed with both reference tissue and arterial plasma input-based models using region of interest (ROI) and voxel-based quantification. The two-tissue compartmental model distribution volume (V T ) was highly reproducible as indicated by low variability (VAR 0.93). BP ND (BP relative to the nondisplaceable tissue compartment) was also highly reproducible (VAR 0.90) both at ROI- and voxel-level, and reference tissue-based models provided stable estimates after 40 min. The reproducibility of [ 11 C]carfentanil binding parameter estimates is excellent with outcome measures based on both arterial plasma and reference tissue input, and a scanning time of 40 min appears sufficient. (orig.)

  3. Short-term test-retest-reliability of conditioned pain modulation using the cold-heat-pain method in healthy subjects and its correlation to parameters of standardized quantitative sensory testing.

    Science.gov (United States)

    Gehling, Julia; Mainka, Tina; Vollert, Jan; Pogatzki-Zahn, Esther M; Maier, Christoph; Enax-Krumova, Elena K

    2016-08-05

    Conditioned Pain Modulation (CPM) is often used to assess human descending pain inhibition. Nine different studies on the test-retest-reliability of different CPM paradigms have been published, but none of them has investigated the commonly used heat-cold-pain method. The results vary widely and therefore, reliability measures cannot be extrapolated from one CPM paradigm to another. Aim of the present study was to analyse the test-retest-reliability of the common heat-cold-pain method and its correlation to pain thresholds. We tested the short-term test-retest-reliability within 40 ± 19.9 h using a cold-water immersion (10 °C, left hand) as conditioning stimulus (CS) and heat pain (43-49 °C, pain intensity 60 ± 5 on the 101-point numeric rating scale, right forearm) as test stimulus (TS) in 25 healthy right-handed subjects (12females, 31.6 ± 14.1 years). The TS was applied 30s before (TSbefore), during (TSduring) and after (TSafter) the 60s CS. The difference between the pain ratings for TSbefore and TSduring represents the early CPM-effect, between TSbefore and TSafter the late CPM-effect. Quantitative sensory testing (QST, DFNS protocol) was performed on both sessions before the CPM assessment. paired t-tests, Intraclass correlation coefficient (ICC), standard error of measurement (SEM), smallest real difference (SRD), Pearson's correlation, Bland-Altman analysis, significance level p Pain ratings during CPM correlated significantly (ICC: 0.411…0.962) between both days, though ratings for TSafter were lower on day 2 (p pain thresholds. The short-term test-retest-reliability of the early CPM-effect using the heat-cold-pain method in healthy subjects achieved satisfying results in terms of the ICC. The SRD of the early CPM effect showed that an individual change of > 20 NRS can be attributed to a real change rather than chance. The late CPM-effect was weaker and not reliable.

  4. Validation and Test-Retest Reliability of New Thermographic Technique Called Thermovision Technique of Dry Needling for Gluteus Minimus Trigger Points in Sciatica Subjects and TrPs-Negative Healthy Volunteers

    Science.gov (United States)

    Rychlik, Michał; Samborski, Włodzimierz

    2015-01-01

    The aim of this study was to assess the validity and test-retest reliability of Thermovision Technique of Dry Needling (TTDN) for the gluteus minimus muscle. TTDN is a new thermography approach used to support trigger points (TrPs) diagnostic criteria by presence of short-term vasomotor reactions occurring in the area where TrPs refer pain. Method. Thirty chronic sciatica patients (n=15 TrP-positive and n=15 TrPs-negative) and 15 healthy volunteers were evaluated by TTDN three times during two consecutive days based on TrPs of the gluteus minimus muscle confirmed additionally by referred pain presence. TTDN employs average temperature (T avr), maximum temperature (T max), low/high isothermal-area, and autonomic referred pain phenomenon (AURP) that reflects vasodilatation/vasoconstriction. Validity and test-retest reliability were assessed concurrently. Results. Two components of TTDN validity and reliability, T avr and AURP, had almost perfect agreement according to κ (e.g., thigh: 0.880 and 0.938; calf: 0.902 and 0.956, resp.). The sensitivity for T avr, T max, AURP, and high isothermal-area was 100% for everyone, but specificity of 100% was for T avr and AURP only. Conclusion. TTDN is a valid and reliable method for T avr and AURP measurement to support TrPs diagnostic criteria for the gluteus minimus muscle when digitally evoked referred pain pattern is present. PMID:26137486

  5. Construct validity, test-retest reliability and internal consistency of the Thai version of the disabilities of the arm, shoulder and hand questionnaire (DASH-TH) in patients with carpal tunnel syndrome.

    Science.gov (United States)

    Buntragulpoontawee, Montana; Phutrit, Suphatha; Tongprasert, Siam; Wongpakaran, Tinakon; Khunachiva, Jeeranan

    2018-03-27

    This study evaluated additional psychometric properties of the Thai version of the disabilities of the arm, shoulder and hand questionnaire (DASH-TH) which included, test-retest reliability, construct validity, internal consistency of in patients with carpal tunnel syndrome. As for determining construct validity, the Thai EuroQOL questionnaire (EQ-5D-5L) was also administered in order to examine convergent and divergent validity. Fifty patients completed both questionnaires. The DASH-TH showed excellent test-retest reliability (intraclass correlation coefficient = 0.811) and internal consistency (Cronbach's alpha = 0.911). The exploratory factor analysis yielded a six-factor solution while the confirmatory factor analysis denoted that the hypothesized model adequately fit the data with a comparative fit index of 0.967 and a Tucker-Lewis index of 0.964. The related subscales between the DASH-TH and the Thai EQ-5D-5L were significantly correlated, indicating the DASH-TH's convergent and discriminant validity. The DASH-TH demonstrated good reliability, internal consistency construct validity, and multidimensionality, in assessing the upper extremity function in carpal tunnel syndrome patients.

  6. A comparison between the original and Tablet-based Symbol Digit Modalities Test in patients with schizophrenia: Test-retest agreement, random measurement error, practice effect, and ecological validity.

    Science.gov (United States)

    Tang, Shih-Fen; Chen, I-Hui; Chiang, Hsin-Yu; Wu, Chien-Te; Hsueh, I-Ping; Yu, Wan-Hui; Hsieh, Ching-Lin

    2017-11-27

    We aimed to compare the test-retest agreement, random measurement error, practice effect, and ecological validity of the original and Tablet-based Symbol Digit Modalities Test (T-SDMT) over five serial assessments, and to examine the concurrent validity of the T-SDMT in patients with schizophrenia. Sixty patients with chronic schizophrenia completed five serial assessments (one week apart) of the SDMT and T-SDMT and one assessment of the Activities of Daily Living Rating Scale III at the first time point. Both measures showed high test-retest agreement, similar levels of random measurement error over five serial assessments. Moreover, the practice effects of the two measures did not reach a plateau phase after five serial assessments in young and middle-aged participants. Nevertheless, only the practice effect of the T-SDMT became trivial after the first assessment. Like the SDMT, the T-SDMT had good ecological validity. The T-SDMT also had good concurrent validity with the SDMT. In addition, only the T-SDMT had discriminative validity to discriminate processing speed in young and middle-aged participants. Compared to the SDMT, the T-SDMT had overall slightly better psychometric properties, so it can be an alternative measure to the SDMT for assessing processing speed in patients with schizophrenia. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Intra-Rater Reproducibility and Validity of Nintendo Wii Balance Testing in Community-Dwelling Older Adults

    DEFF Research Database (Denmark)

    Jørgensen, Martin Grønbech; Laessoe, Uffe; Hendriksen, Carsten

    2014-01-01

    The aims of the current study were to (1) examine the intra-rater inter-session reproducibility of the Nintendo Wii Agility and Stillness tests and (2) explore the concurrent validity in relation to 'gold-standard' force plate analysis. Within-day inter-session reproducibility was examined in 30 ...... older adults (age 71.8±5.1 yrs.). No systematic test-retest differences were found for the Wii Stillness test, however, the Wii Agility test scores differed systematically between test sessions (p...

  8. Psychometric Evaluation of the Brachial Assessment Tool Part 1: Reproducibility.

    Science.gov (United States)

    Hill, Bridget; Williams, Gavin; Olver, John; Ferris, Scott; Bialocerkowski, Andrea

    2018-04-01

    To evaluate reproducibility (reliability and agreement) of the Brachial Assessment Tool (BrAT), a new patient-reported outcome measure for adults with traumatic brachial plexus injury (BPI). Prospective repeated-measure design. Outpatient clinics. Adults with confirmed traumatic BPI (N=43; age range, 19-82y). People with BPI completed the 31-item 4-response BrAT twice, 2 weeks apart. Results for the 3 subscales and summed score were compared at time 1 and time 2 to determine reliability, including systematic differences using paired t tests, test retest using intraclass correlation coefficient model 1,1 (ICC 1,1 ), and internal consistency using Cronbach α. Agreement parameters included standard error of measurement, minimal detectable change, and limits of agreement. BrAT. Test-retest reliability was excellent (ICC 1,1 =.90-.97). Internal consistency was high (Cronbach α=.90-.98). Measurement error was relatively low (standard error of measurement range, 3.1-8.8). A change of >4 for subscale 1, >6 for subscale 2, >4 for subscale 3, and >10 for the summed score is indicative of change over and above measurement error. Limits of agreement ranged from ±4.4 (subscale 3) to 11.61 (summed score). These findings support the use of the BrAT as a reproducible patient-reported outcome measure for adults with traumatic BPI with evidence of appropriate reliability and agreement for both individual and group comparisons. Further psychometric testing is required to establish the construct validity and responsiveness of the BrAT. Copyright © 2017 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

  9. Reproducibility and Reliability of the Quality of Life Questionnaire in Patients With Atrial Fibrillation

    Directory of Open Access Journals (Sweden)

    Rita Simone Lopes Moreira

    2016-03-01

    Full Text Available Abstract Background: Studies have shown the impact of atrial fibrillation (AF on the patients' quality of life. Specific questionnaires enable the evaluation of relevant events. We previously developed a questionnaire to assess the quality of life of patients with AF (AFQLQ version 1, which was reviewed in this study, and new domains were added. Objective: To demonstrate the reproducibility of the AFQLQ version 2 (AFQLQ v.2, which included the domains of fatigue, illness perception and well-being. Methods: We applied 160 questionnaires (AFQLQ v.2 and SF-36 to 40 patients, at baseline and 15 days after, to measure inter- and intraobserver reproducibility. The analysis of quality of life stability was determined by test-retest, applying the Bartko intraclass correlation coefficient (ICC. Internal consistency was assessed by Cronbach's alpha test. Results: The total score of the test-retest (n = 40 had an ICC of 0.98 in the AFQLQ v.2, and of 0.94 in the SF36. In assessing the intra- and interobserver reproducibility of the AFQLQ v.2, the ICC reliability was 0.98 and 0.97, respectively. The internal consistency had a Cronbach's alpha coefficient of 0.82, compatible with good agreement of the AFQLQ v.2. Conclusion: The AFQLQ v.2 performed better than its previous version. Similarly, the domains added contributed to make it more comprehensive and robust to assess the quality of life of patients with AF.

  10. The intersubject and intrasubject reproducibility of FMRI activation during three encoding tasks: implications for clinical applications

    Energy Technology Data Exchange (ETDEWEB)

    Harrington, Greg S. [Virginia Commonwealth University, Department of Radiology, Richmond, VA (United States); Tomaszewski Farias, Sarah [University of California at Davis, Department of Neurology, Sacramento (United States); Buonocore, Michael H. [University of California at Davis, Department of Radiology, Sacramento (United States); Yonelinas, Andrew P. [University of California at Davis, Department of Psychology, Davis (United States)

    2006-07-15

    The goal of the present study was to evaluate the inter- and intrasubject reproducibility of FMRI activation for three memory encoding tasks previously used in the context of presurgical functional mapping. The primary region of interest (ROI) was the medial temporal lobe (MTL). Comparative ROIs included the inferior frontal and fusiform gyri which are less affected by susceptibility-induced signal losses than the MTL regions. Eighteen subjects were scanned using three memory encoding paradigms: word-pair, pattern, and scene encoding. Nine subjects underwent repeat scanning. Intersubject reproducibility of FMRI activation was evaluated by examining the percent of subjects who showed activation within a given ROI and the range to which individual laterality indices (LIs) varied from the mean. Intrasubject test-retest reproducibility was evaluated by examining the LI test-retest correlation, the average difference between LIs from two separate imaging sessions, and concordance ratios of activation volumes (R{sub volume} and R{sub overlap}). For scene encoding the reproducibility of activation volume and LIs within the MTL were as good as or better than the reproducibility within the fusiform and inferior frontal ROIs. For pattern encoding and word-pair encoding, the reproducibility of activation volume and LIs within the MTL tended to be worse compared to the fusiform and inferior frontal ROIs. The differences in FMRI reproducibility appeared more dependent on the task than the susceptibility effects. The results of this study suggest that FMRI-based assessment of the neural substrates of memory using a scene encoding task may be a useful clinical tool. (orig.)

  11. The intersubject and intrasubject reproducibility of FMRI activation during three encoding tasks: implications for clinical applications

    International Nuclear Information System (INIS)

    Harrington, Greg S.; Tomaszewski Farias, Sarah; Buonocore, Michael H.; Yonelinas, Andrew P.

    2006-01-01

    The goal of the present study was to evaluate the inter- and intrasubject reproducibility of FMRI activation for three memory encoding tasks previously used in the context of presurgical functional mapping. The primary region of interest (ROI) was the medial temporal lobe (MTL). Comparative ROIs included the inferior frontal and fusiform gyri which are less affected by susceptibility-induced signal losses than the MTL regions. Eighteen subjects were scanned using three memory encoding paradigms: word-pair, pattern, and scene encoding. Nine subjects underwent repeat scanning. Intersubject reproducibility of FMRI activation was evaluated by examining the percent of subjects who showed activation within a given ROI and the range to which individual laterality indices (LIs) varied from the mean. Intrasubject test-retest reproducibility was evaluated by examining the LI test-retest correlation, the average difference between LIs from two separate imaging sessions, and concordance ratios of activation volumes (R volume and R overlap ). For scene encoding the reproducibility of activation volume and LIs within the MTL were as good as or better than the reproducibility within the fusiform and inferior frontal ROIs. For pattern encoding and word-pair encoding, the reproducibility of activation volume and LIs within the MTL tended to be worse compared to the fusiform and inferior frontal ROIs. The differences in FMRI reproducibility appeared more dependent on the task than the susceptibility effects. The results of this study suggest that FMRI-based assessment of the neural substrates of memory using a scene encoding task may be a useful clinical tool. (orig.)

  12. Reproducibility of a 3-dimensional gyroscope in measuring shoulder anteflexion and abduction

    Directory of Open Access Journals (Sweden)

    Penning Ludo I F

    2012-07-01

    Full Text Available Abstract Background Few studies have investigated the use of a 3-dimensional gyroscope for measuring the range of motion (ROM in the impaired shoulder. Reproducibility of digital inclinometer and visual estimation is poor. This study aims to investigate the reproducibility of a tri axial gyroscope in measurement of anteflexion, abduction and related rotations in the impaired shoulder. Methods Fifty-eight patients with either subacromial impingement (27 or osteoarthritis of the shoulder (31 participated. Active anteflexion, abduction and related rotations were measured with a tri axial gyroscope according to a test retest protocol. Severity of shoulder impairment and patient perceived pain were assessed by the Disability of Arm Shoulder and Hand score (DASH and the Visual Analogue Scale (VAS. VAS scores were recorded before and after testing. Results In two out of three hospitals patients with osteoarthritis (n = 31 were measured, in the third hospital patients with subacromial impingement (n = 27. There were significant differences among hospitals for the VAS and DASH scores measured before and after testing. The mean differences between the test and retest means for anteflexion were −6 degrees (affected side, 9 (contralateral side and for abduction 15 degrees (affected side and 10 degrees (contralateral side. Bland & Altman plots showed that the confidence intervals for the mean differences fall within −6 up to 15 degrees, individual test - retest differences could exceed these limits. A simulation according to ‘Generalizability Theory’ produces very good coefficients for anteflexion and related rotation as a comprehensive measure of reproducibility. Optimal reproducibility is achieved with 2 repetitions for anteflexion. Conclusions Measurements were influenced by patient perceived pain. Differences in VAS and DASH might be explained by different underlying pathology. These differences in shoulder pathology however did not alter

  13. Smallest detectable change and test-retest reliability of a self-reported outcome measure: Results of the Center for Epidemiologic Studies Depression Scale, General Self-Efficacy Scale, and 12-item General Health Questionnaire.

    Science.gov (United States)

    Ohno, Shotaro; Takahashi, Kana; Inoue, Aimi; Takada, Koki; Ishihara, Yoshiaki; Tanigawa, Masaru; Hirao, Kazuki

    2017-12-01

    This study aims to examine the smallest detectable change (SDC) and test-retest reliability of the Center for Epidemiologic Studies Depression Scale (CES-D), General Self-Efficacy Scale (GSES), and 12-item General Health Questionnaire (GHQ-12). We tested 154 young adults at baseline and 2 weeks later. We calculated the intra-class correlation coefficients (ICCs) for test-retest reliability with a two-way random effects model for agreement. We then calculated the standard error of measurement (SEM) for agreement using the ICC formula. The SEM for agreement was used to calculate SDC values at the individual level (SDC ind ) and group level (SDC group ). The study participants included 137 young adults. The ICCs for all self-reported outcome measurement scales exceeded 0.70. The SEM of CES-D was 3.64, leading to an SDC ind of 10.10 points and SDC group of 0.86 points. The SEM of GSES was 1.56, leading to an SDC ind of 4.33 points and SDC group of 0.37 points. The SEM of GHQ-12 with bimodal scoring was 1.47, leading to an SDC ind of 4.06 points and SDC group of 0.35 points. The SEM of GHQ-12 with Likert scoring was 2.44, leading to an SDC ind of 6.76 points and SDC group of 0.58 points. To confirm that the change was not a result of measurement error, a score of self-reported outcome measurement scales would need to change by an amount greater than these SDC values. This has important implications for clinicians and epidemiologists when assessing outcomes. © 2017 John Wiley & Sons, Ltd.

  14. Reproducibility of R-fMRI metrics on the impact of different strategies for multiple comparison correction and sample sizes.

    Science.gov (United States)

    Chen, Xiao; Lu, Bin; Yan, Chao-Gan

    2018-01-01

    Concerns regarding reproducibility of resting-state functional magnetic resonance imaging (R-fMRI) findings have been raised. Little is known about how to operationally define R-fMRI reproducibility and to what extent it is affected by multiple comparison correction strategies and sample size. We comprehensively assessed two aspects of reproducibility, test-retest reliability and replicability, on widely used R-fMRI metrics in both between-subject contrasts of sex differences and within-subject comparisons of eyes-open and eyes-closed (EOEC) conditions. We noted permutation test with Threshold-Free Cluster Enhancement (TFCE), a strict multiple comparison correction strategy, reached the best balance between family-wise error rate (under 5%) and test-retest reliability/replicability (e.g., 0.68 for test-retest reliability and 0.25 for replicability of amplitude of low-frequency fluctuations (ALFF) for between-subject sex differences, 0.49 for replicability of ALFF for within-subject EOEC differences). Although R-fMRI indices attained moderate reliabilities, they replicated poorly in distinct datasets (replicability < 0.3 for between-subject sex differences, < 0.5 for within-subject EOEC differences). By randomly drawing different sample sizes from a single site, we found reliability, sensitivity and positive predictive value (PPV) rose as sample size increased. Small sample sizes (e.g., < 80 [40 per group]) not only minimized power (sensitivity < 2%), but also decreased the likelihood that significant results reflect "true" effects (PPV < 0.26) in sex differences. Our findings have implications for how to select multiple comparison correction strategies and highlight the importance of sufficiently large sample sizes in R-fMRI studies to enhance reproducibility. Hum Brain Mapp 39:300-318, 2018. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

  15. Reliability versus reproducibility

    International Nuclear Information System (INIS)

    Lautzenheiser, C.E.

    1976-01-01

    Defect detection and reproducibility of results are two separate but closely related subjects. It is axiomatic that a defect must be detected from examination to examination or reproducibility of results is very poor. On the other hand, a defect can be detected on each of subsequent examinations for higher reliability and still have poor reproducibility of results

  16. The Need for Reproducibility

    Energy Technology Data Exchange (ETDEWEB)

    Robey, Robert W. [Los Alamos National Laboratory

    2016-06-27

    The purpose of this presentation is to consider issues of reproducibility, specifically it determines whether bitwise reproducible computation is possible, if computational research in DOE improves its publication process, and if reproducible results can be achieved apart from the peer review process?

  17. Reproducibility of graph metrics in fMRI networks

    Directory of Open Access Journals (Sweden)

    Qawi K Telesford

    2010-12-01

    Full Text Available The reliability of graph metrics calculated in network analysis is essential to the interpretation of complex network organization. These graph metrics are used to deduce the small-world properties in networks. In this study, we investigated the test-retest reliability of graph metrics from functional magnetic resonance imaging (fMRI data collected for two runs in 45 healthy older adults. Graph metrics were calculated on data for both runs and compared using intraclass correlation coefficient (ICC statistics and Bland-Altman (BA plots. ICC scores describe the level of absolute agreement between two measurements and provide a measure of reproducibility. For mean graph metrics, ICC scores were high for clustering coefficient (ICC=0.86, global efficiency (ICC=0.83, path length (ICC=0.79, and local efficiency (ICC=0.75; the ICC score for degree was found to be low (ICC=0.29. ICC scores were also used to generate reproducibility maps in brain space to test voxel-wise reproducibility for unsmoothed and smoothed data. Reproducibility was uniform across the brain for global efficiency and path length, but was only high in network hubs for clustering coefficient, local efficiency and degree. BA plots were used to test the measurement repeatability of all graph metrics. All graph metrics fell within the limits for repeatability. Together, these results suggest that with exception of degree, mean graph metrics are reproducible and suitable for clinical studies. Further exploration is warranted to better understand reproducibility across the brain on a voxel-wise basis.

  18. Reproducibility and validity of the Nintendo Wii Balance Board for measuring shoulder sensorimotor control in prone lying

    DEFF Research Database (Denmark)

    Eshøj, Henrik; Juul-Kristensen, Birgit; Gam Bender Jørgensen, René

    2017-01-01

    INTRODUCTION: For the lower limbs, the Nintendo Wii Balance Board (NWBB) has been widely used to measure postural control. However, this has not been performed for upper limb measurements. Further, the NWBB has shown to produce more background noise with decreasing loads, which may be of concern...... when used for upper limb testing. The aim was to investigate reproducibility and validity of the NWBB. METHODS: A test-retest design was performed with 68 subjects completing three different prone lying, upper limb weight-bearing balance tasks on a NWBB: two-arms, eyes closed (1) one-arm, non...

  19. Intra-session test-retest reliability of magnitude and structure of center of pressure from the Nintendo Wii Balance Board™ for a visually impaired and normally sighted population.

    Science.gov (United States)

    Jeter, Pamela E; Wang, Jiangxia; Gu, Jialiang; Barry, Michael P; Roach, Crystal; Corson, Marilyn; Yang, Lindsay; Dagnelie, Gislin

    2015-02-01

    Individuals with visual impairment (VI) have irreparable damage to one of the input streams contributing to postural stability. Here, we evaluated the intra-session test-retest reliability of the Wii Balance Board (WBB) for measuring Center of Pressure (COP) magnitude and structure, i.e. approximate entropy (ApEn) in fourteen legally blind participants and 21 participants with corrected-to-normal vision. Participants completed a validated balance protocol which included four sensory conditions: double-leg standing on a firm surface with eyes open (EO-firm); a firm surface with eyes closed (EC-firm); a foam surface with EO (EO-foam); and a foam surface with EC (EC-foam). Participants performed the full balance protocol twice during the session, separated by a period of 15min, to determine the intraclass correlation coefficient (ICC). Absolute reliability was determined by the standard error of measurement (SEM). The minimal difference (MD) was estimated to determine clinical significance for future studies. COP measures were derived from data sent by the WBB to a laptop via Bluetooth. COP scores increased with the difficulty of sensory condition indicating WBB sensitivity (all pbalance impairment among VI persons. Copyright © 2014 Elsevier B.V. All rights reserved.

  20. Magni Reproducibility Example

    DEFF Research Database (Denmark)

    2016-01-01

    An example of how to use the magni.reproducibility package for storing metadata along with results from a computational experiment. The example is based on simulating the Mandelbrot set.......An example of how to use the magni.reproducibility package for storing metadata along with results from a computational experiment. The example is based on simulating the Mandelbrot set....

  1. Preserve specimens for reproducibility

    Czech Academy of Sciences Publication Activity Database

    Krell, F.-T.; Klimeš, Petr; Rocha, L. A.; Fikáček, M.; Miller, S. E.

    2016-01-01

    Roč. 539, č. 7628 (2016), s. 168 ISSN 0028-0836 Institutional support: RVO:60077344 Keywords : reproducibility * specimen * biodiversity Subject RIV: EH - Ecology, Behaviour Impact factor: 40.137, year: 2016 http://www.nature.com/nature/journal/v539/n7628/full/539168b.html

  2. Reproducibility of skeletal muscle vasodilatation responses to Stroop mental challenge over repeated sessions.

    Science.gov (United States)

    Hamer, Mark; Boutcher, Yati N; Park, Young; Boutcher, Stephen H

    2006-08-01

    Skeletal muscle blood flow responses to stress have implications for psychobiological disease pathways. An important assumption underlying psychophysiological studies relating stress reactivity with disease risk is that individuals are characterized by stable response profiles that can be reliably assessed using acute psychophysiological stress testing. We examined the reproducibility of forearm vasodilatation, blood pressure, and cardiac responses to a 2 min Stroop mental challenge over two repeated stress sessions that were on average 3.6 months apart. Participants were 21 healthy men and women (aged 21.8+/-3.7 years). Vasodilatation, blood pressure and heart rate responses displayed no habituation between sessions, although there was significantly greater cardiac parasympathetic involvement during the second testing session. Significant test-retest correlations between the sessions were observed for both forearm blood flow and heart rate reactivity. These findings demonstrate skeletal muscle vasodilatation responses to repeated stress are robust, so may be a useful psychophysiological indicator in studies of stress reactivity and disease risk.

  3. Portuguese-language version of the Chronic Respiratory Questionnaire: a validity and reproducibility study.

    Science.gov (United States)

    Moreira, Graciane Laender; Pitta, Fábio; Ramos, Dionei; Nascimento, Cinthia Sousa Carvalho; Barzon, Danielle; Kovelis, Demétria; Colange, Ana Lúcia; Brunetto, Antonio Fernando; Ramos, Ercy Mara Cipulo

    2009-08-01

    To determine the validity and reproducibility of a Portuguese-language version of the Chronic Respiratory Questionnaire (CRQ) in patients with COPD. A Portuguese-language version of the CRQ (provided by McMaster University, the holder of the questionnaire copyright) was applied to 50 patients with COPD (70 +/- 8 years of age; 32 males; FEV1 = 47 +/- 18% of predicted) on two occasions, one week apart. The CRQ has four domains (dyspnea, fatigue, emotional function, and mastery) and was applied as an interviewer-administered instrument. The Saint George's Respiratory Questionnaire (SGRQ), already validated for use in Brazil, was used as the criterion for validation. Spirometry and the six-minute walk test (6MWT) were performed to analyze the correlations with the CRQ scores. There were no significant CRQ test-retest differences (p > 0.05 for all domains). The test-retest intraclass correlation coefficient was 0.98, 0.97, 0.98 and 0.95 for the dyspnea, fatigue, emotional function and mastery domains, respectively. The Cronbach's alpha coefficient was 0.91. The CRQ domains correlated significantly with the SGRQ domains (-0.30 < r < -0.67; p < 0.05). There were no significant correlations between spirometric variables and the CRQ domains or between the CRQ domains and the 6MWT, with the exception of the fatigue domain (r = 0.30; p = 0.04). The Portuguese-language version of the CRQ proved to be reproducible and valid for use in Brazilian patients with COPD.

  4. [DIN-compatible vision assessment of increased reproducibility using staircase measurement and maximum likelihood analysis].

    Science.gov (United States)

    Weigmann, U; Petersen, J

    1996-08-01

    Visual acuity determination according to DIN 58,220 does not make full use of the information received about the patient, in contrast to the staircase method. Thus, testing the same number of optotypes, the staircase method should yield more reproducible acuity results. On the other hand, the staircase method gives systematically higher acuity values because it converges on the 48% point of the psychometric function (for Landolt rings in eight positions) and not on the 65% probability, as DIN 58,220 with criterion 3/5 does. This bias can be avoided by means of a modified evaluation. Using the staircase data we performed a maximum likelihood estimate of the psychometric function as a whole and computed the acuity value for 65% probability of correct answers. We determined monocular visual acuity in 102 persons with widely differing visual performance. Each subject underwent four tests in random order, two according to DIN 58,220 and two using the modified staircase method (Landolt rings in eight positions scaled by a factor 1.26; PC monitor with 1024 x 768 pixels; distance 4.5 m). Each test was performed with 25 optotypes. The two procedures provide the same mean visual acuity values (difference less than 0.02 acuity steps). The test-retest results match in 30.4% of DIN repetitions but in 50% of the staircases. The standard deviation of the test-retest difference is 1.41 (DIN) and 1.06 (modified staircase) acuity steps. Thus the standard deviation of the single test is 1.0 (DIN) and 0.75 (modified staircase) acuity steps. The new method provides visual acuity values identical to DIN 58,220 but is superior with respect to reproducibility.

  5. Reproducibility of ultrasonic testing

    International Nuclear Information System (INIS)

    Lecomte, J.-C.; Thomas, Andre; Launay, J.-P.; Martin, Pierre

    The reproducibility of amplitude quotations for both artificial and natural reflectors was studied for several combinations of instrument/search unit, all being of the same type. This study shows that in industrial inspection if a range of standardized equipment is used, a margin of error of about 6 decibels has to be taken into account (confidence interval of 95%). This margin is about 4 to 5 dB for natural or artificial defects located in the central area and about 6 to 7 dB for artificial defects located on the back surface. This lack of reproducibility seems to be attributable first to the search unit and then to the instrument and operator. These results were confirmed by analysis of calibration data obtained from 250 tests performed by 25 operators under shop conditions. The margin of error was higher than the 6 dB obtained in the study [fr

  6. Reproducibility of myelin content-based human habenula segmentation at 3 Tesla.

    Science.gov (United States)

    Kim, Joo-Won; Naidich, Thomas P; Joseph, Joshmi; Nair, Divya; Glasser, Matthew F; O'halloran, Rafael; Doucet, Gaelle E; Lee, Won Hee; Krinsky, Hannah; Paulino, Alejandro; Glahn, David C; Anticevic, Alan; Frangou, Sophia; Xu, Junqian

    2018-03-26

    In vivo morphological study of the human habenula, a pair of small epithalamic nuclei adjacent to the dorsomedial thalamus, has recently gained significant interest for its role in reward and aversion processing. However, segmenting the habenula from in vivo magnetic resonance imaging (MRI) is challenging due to the habenula's small size and low anatomical contrast. Although manual and semi-automated habenula segmentation methods have been reported, the test-retest reproducibility of the segmented habenula volume and the consistency of the boundaries of habenula segmentation have not been investigated. In this study, we evaluated the intra- and inter-site reproducibility of in vivo human habenula segmentation from 3T MRI (0.7-0.8 mm isotropic resolution) using our previously proposed semi-automated myelin contrast-based method and its fully-automated version, as well as a previously published manual geometry-based method. The habenula segmentation using our semi-automated method showed consistent boundary definition (high Dice coefficient, low mean distance, and moderate Hausdorff distance) and reproducible volume measurement (low coefficient of variation). Furthermore, the habenula boundary in our semi-automated segmentation from 3T MRI agreed well with that in the manual segmentation from 7T MRI (0.5 mm isotropic resolution) of the same subjects. Overall, our proposed semi-automated habenula segmentation showed reliable and reproducible habenula localization, while its fully-automated version offers an efficient way for large sample analysis. © 2018 Wiley Periodicals, Inc.

  7. Magnet stability and reproducibility

    CERN Document Server

    Marks, N

    2010-01-01

    Magnet stability and reproducibility have become increasingly important as greater precision and beams with smaller dimension are required for research, medical and other purpose. The observed causes of mechanical and electrical instability are introduced and the engineering arrangements needed to minimize these problems discussed; the resulting performance of a state-of-the-art synchrotron source (Diamond) is then presented. The need for orbit feedback to obtain best possible beam stability is briefly introduced, but omitting any details of the necessary technical equipment, which is outside the scope of the presentation.

  8. Reproducible research in palaeomagnetism

    Science.gov (United States)

    Lurcock, Pontus; Florindo, Fabio

    2015-04-01

    The reproducibility of research findings is attracting increasing attention across all scientific disciplines. In palaeomagnetism as elsewhere, computer-based analysis techniques are becoming more commonplace, complex, and diverse. Analyses can often be difficult to reproduce from scratch, both for the original researchers and for others seeking to build on the work. We present a palaeomagnetic plotting and analysis program designed to make reproducibility easier. Part of the problem is the divide between interactive and scripted (batch) analysis programs. An interactive desktop program with a graphical interface is a powerful tool for exploring data and iteratively refining analyses, but usually cannot operate without human interaction. This makes it impossible to re-run an analysis automatically, or to integrate it into a larger automated scientific workflow - for example, a script to generate figures and tables for a paper. In some cases the parameters of the analysis process itself are not saved explicitly, making it hard to repeat or improve the analysis even with human interaction. Conversely, non-interactive batch tools can be controlled by pre-written scripts and configuration files, allowing an analysis to be 'replayed' automatically from the raw data. However, this advantage comes at the expense of exploratory capability: iteratively improving an analysis entails a time-consuming cycle of editing scripts, running them, and viewing the output. Batch tools also tend to require more computer expertise from their users. PuffinPlot is a palaeomagnetic plotting and analysis program which aims to bridge this gap. First released in 2012, it offers both an interactive, user-friendly desktop interface and a batch scripting interface, both making use of the same core library of palaeomagnetic functions. We present new improvements to the program that help to integrate the interactive and batch approaches, allowing an analysis to be interactively explored and refined

  9. Reproducibility and validity of the Nintendo Wii Balance Board for measuring shoulder sensorimotor control in prone lying.

    Science.gov (United States)

    Eshoj, H; Juul-Kristensen, Birgit; Jørgensen, Rene Gam Bender; Søgaard, Karen

    2017-02-01

    For the lower limbs, the Nintendo Wii Balance Board (NWBB) has been widely used to measure postural control. However, this has not been performed for upper limb measurements. Further, the NWBB has shown to produce more background noise with decreasing loads, which may be of concern when used for upper limb testing. The aim was to investigate reproducibility and validity of the NWBB. A test-retest design was performed with 68 subjects completing three different prone lying, upper limb weight-bearing balance tasks on a NWBB: two-arms, eyes closed (1) one-arm, non-dominant/non-injured (2) and one-arm, dominant/injured (3). Each task was repeated three times over the course of two test sessions with a 30-min break in between. Further, the level of background noise from a NWBB was compared with a force platform through systematic loading of both boards with increasing deadweights ranging from 5 to 90kg. Test-retest reproducibility was high with ICCs ranging from 0.95 to 0.97 (95% CI 0.92 to 0.98). However, systematic bias and tendencies for funnel effects in the Bland Altman plots for both one-armed tests were present. The concurrent validity of the NWBB was low (CCC 0.17 (95% CI 0.12-0.22)) due to large differences between the NWBB and force platform in noise sensitivity at low deadweights (especially below 50kg). The NWBB prone lying, shoulder sensorimotor control test was highly reproducible. Though, concurrent validity of the NWBB was poor compared to a force platform. Further investigation of the impact of the background noise, especially at low loads, is needed. Copyright © 2016 Elsevier B.V. All rights reserved.

  10. Visual Acuity Testing: Feedback Affects Neither Outcome nor Reproducibility, but Leaves Participants Happier.

    Science.gov (United States)

    Bach, Michael; Schäfer, Kerstin

    2016-01-01

    Assessment of visual acuity is a well standardized procedure at least for expert opinions and clinical trials. It is often recommended not giving patients feedback on the correctness of their responses. As this viewpoint has not been quantitatively examined so far, we quantitatively assessed possible effects of feedback on visual acuity testing. In 40 normal participants we presented Landolt Cs in 8 orientations using the automated Freiburg Acuity Test (FrACT, feedback was provided in 2 x 4 conditions: (A) no feedback, (B) acoustic signals indicating correctness, (C)visual indication of correct orientation, and (D) a combination of (B) and (C). After each run the participants judged comfort. Main outcome measures were absolute visual acuity (logMAR), its test-retest agreement (limits of agreement) and participants' comfort estimates on a 5-step symmetric Likert scale. Feedback influenced acuity outcome significantly (p = 0.02), but with a tiny effect size: 0.02 logMAR poorer acuity for (D) compared to (A), even weaker effects for (B) and (C). Test-retest agreement was high (limits of agreement: ± 1.0 lines) and did not depend on feedback (p>0.5). The comfort ranking clearly differed, by 2 steps on the Likert scale: the condition (A)-no feedback-was on average "slightly uncomfortable", the other three conditions were "slightly comfortable" (pFeedback affected neither reproducibility nor the acuity outcome to any relevant extent. The participants, however, reported markedly greater comfort with any kind of feedback. We conclude that systematic feedback (as implemented in FrACT) offers nothing but advantages for routine use.

  11. How reproducible is self-reported information on exposure to smoking, drinking, and dietary patterns? Evidence among Brazilian adults in the Pró-Saúde Study

    Directory of Open Access Journals (Sweden)

    Dóra Chor

    Full Text Available CONTEXT: Epidemiological studies of the validity and reliability of self-reported information on important risk factors for non-communicable chronic diseases are scarce in Brazil. OBJECTIVE: We evaluated the test-retest reliability of information overall and stratified by gender, age and education on active and passive smoking, alcohol intake and aspects of dietary habits. TYPE OF STUDY: Test-retest reliability. SETTING: Universidade do Estado do Rio de Janeiro, Rio de Janeiro, Brazil. PARTICIPANTS: 192 University employees. PROCEDURES: Self-administered questionnaires were completed on two occasions, two weeks apart. MAIN MEASUREMENTS: Kappa Statistics; Intraclass Correlation Coefficient. RESULTS: Information on smoking status and pack-years smoked had almost perfect levels of agreement, respectively, kappa = 0.97 (95% CI, 0.92-1.00, and intraclass correlation coefficient = 0.93 (CI 95%, 0.89-0.96. Characteristics of alcohol intake yielded substantial levels of agreement (kappa ranging from 0.62 to 0.69. The reproducibility of the information on dietary habits varied from 0.67 to 0.79 (kappa. No clear-cut patterns could be identified comparing information by age or gender. There was a slight tendency towards greater reliability among people with higher levels of education. CONCLUSION: The reproducibility of information on smoking, drinking, and dietary patterns ranged from substantial to excellent, as investigated in the Pró-Saúde Study, a longitudinal investigation recently launched in Rio de Janeiro.

  12. Reproducibility and validity of the Shanghai Women's Health Study physical activity questionnaire.

    Science.gov (United States)

    Matthews, Charles E; Shu, Xiao-Ou; Yang, Gong; Jin, Fan; Ainsworth, Barbara E; Liu, Dake; Gao, Yu-Tang; Zheng, Wei

    2003-12-01

    In this investigation, the authors evaluated the reproducibility and validity of the Shanghai Women's Health Study (SWHS) physical activity questionnaire (PAQ), which was administered in a cohort study of approximately 75,000 Chinese women aged 40-70 years. Reproducibility (2-year test-retest) was evaluated using kappa statistics and intraclass correlation coefficients (ICCs). Validity was evaluated by comparing Spearman correlations (r) for the SWHS PAQ with two criterion measures administered over a period of 12 months: four 7-day physical activity logs and up to 28 7-day PAQs. Women were recruited from the SWHS cohort (n = 200). Results indicated that the reproducibility of adolescent and adult exercise participation (kappa = 0.85 and kappa = 0.64, respectively) and years of adolescent exercise and adult exercise energy expenditure (ICC = 0.83 and ICC = 0.70, respectively) was reasonable. Reproducibility values for adult lifestyle activities were lower (ICC = 0.14-0.54). Significant correlations between the PAQ and criterion measures of adult exercise were observed for the first PAQ administration (physical activity log, r = 0.50; 7-day PAQ, r = 0.62) and the second PAQ administration (physical activity log, r = 0.74; 7-day PAQ, r = 0.80). Significant correlations between PAQ lifestyle activities and the 7-day PAQ were also noted (r = 0.33-0.88). These data indicate that the SWHS PAQ is a reproducible and valid measure of exercise behaviors and that it demonstrates utility in stratifying women by levels of important lifestyle activities (e.g., housework, walking, cycling).

  13. Reproducibility of proximal and distal transcutaneous oxygen pressure measurements during exercise in stage 2 arterial claudication.

    Science.gov (United States)

    Bouyé, P; Picquet, J; Jaquinandi, V; Enon, B; Leftheriotis, G; Saumet, J-L; Abraham, P

    2004-06-01

    Although transcutaneous oxygen pressure measurements (tcpO2) are largely used in the investigation of vascular patients, its reproducibility is still debated. Indeed an unpredictable gradient exists between arterial and transcutaneous oxygen pressure. We hypothesised that indices taking into account changes over time and independent of absolute starting values would be more reproducible than other indices. comparative test-retest procedure (1 to 13 days between tests). institutional practice, ambulatory care. 15 subjects with stage 2 claudication. tcpO2 recordings at rest and at exercise during the 2 treadmill tests. calculation of the Delta-from-rest of oxygen pressure index (limb tcpO2 changes minus chest tcpO2 changes), of the resting - or minimal values attained during exercise - of absolute tcpO2 and of the regional perfusion index (regional perfusion index: ration of limb to chest). Both absolute tcpO2 and regional perfusion index at rest showed low reproducibility. During exercise the best reproducibility was attained through Delta-from-rest of oxygen pressure index calculation. Equations from the linear regression analysis (test 2 versus test 1) were 0.88 x -4.2 (r(2)=0.82) at the buttock level and 0.82 x -3.8 (r(2)=0.80) at the calf level. TcpO2 measurement on the calf or buttock during exercise, is a reproducible measurement in patients with vascular claudication, specifically when corrected for exercise-induced systemic pO2 changes trough Delta-from-rest of oxygen pressure calculation.

  14. Reproducibility in a multiprocessor system

    Science.gov (United States)

    Bellofatto, Ralph A; Chen, Dong; Coteus, Paul W; Eisley, Noel A; Gara, Alan; Gooding, Thomas M; Haring, Rudolf A; Heidelberger, Philip; Kopcsay, Gerard V; Liebsch, Thomas A; Ohmacht, Martin; Reed, Don D; Senger, Robert M; Steinmacher-Burow, Burkhard; Sugawara, Yutaka

    2013-11-26

    Fixing a problem is usually greatly aided if the problem is reproducible. To ensure reproducibility of a multiprocessor system, the following aspects are proposed; a deterministic system start state, a single system clock, phase alignment of clocks in the system, system-wide synchronization events, reproducible execution of system components, deterministic chip interfaces, zero-impact communication with the system, precise stop of the system and a scan of the system state.

  15. Prognostic Value and Reproducibility of Pretreatment CT Texture Features in Stage III Non-Small Cell Lung Cancer

    Energy Technology Data Exchange (ETDEWEB)

    Fried, David V. [Department of Radiation Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas (United States); Graduate School of Biomedical Sciences, The University of Texas Health Science Center at Houston, Houston, Texas (United States); Tucker, Susan L. [Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas (United States); Zhou, Shouhao [Division of Quantitative Sciences, Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas (United States); Liao, Zhongxing [Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas (United States); Mawlawi, Osama [Graduate School of Biomedical Sciences, The University of Texas Health Science Center at Houston, Houston, Texas (United States); Ibbott, Geoffrey [Department of Radiation Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas (United States); Graduate School of Biomedical Sciences, The University of Texas Health Science Center at Houston, Houston, Texas (United States); Court, Laurence E., E-mail: LECourt@mdanderson.org [Department of Radiation Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas (United States); Graduate School of Biomedical Sciences, The University of Texas Health Science Center at Houston, Houston, Texas (United States)

    2014-11-15

    Purpose: To determine whether pretreatment CT texture features can improve patient risk stratification beyond conventional prognostic factors (CPFs) in stage III non-small cell lung cancer (NSCLC). Methods and Materials: We retrospectively reviewed 91 cases with stage III NSCLC treated with definitive chemoradiation therapy. All patients underwent pretreatment diagnostic contrast enhanced computed tomography (CE-CT) followed by 4-dimensional CT (4D-CT) for treatment simulation. We used the average-CT and expiratory (T50-CT) images from the 4D-CT along with the CE-CT for texture extraction. Histogram, gradient, co-occurrence, gray tone difference, and filtration-based techniques were used for texture feature extraction. Penalized Cox regression implementing cross-validation was used for covariate selection and modeling. Models incorporating texture features from the 33 image types and CPFs were compared to those with models incorporating CPFs alone for overall survival (OS), local-regional control (LRC), and freedom from distant metastases (FFDM). Predictive Kaplan-Meier curves were generated using leave-one-out cross-validation. Patients were stratified based on whether their predicted outcome was above or below the median. Reproducibility of texture features was evaluated using test-retest scans from independent patients and quantified using concordance correlation coefficients (CCC). We compared models incorporating the reproducibility seen on test-retest scans to our original models and determined the classification reproducibility. Results: Models incorporating both texture features and CPFs demonstrated a significant improvement in risk stratification compared to models using CPFs alone for OS (P=.046), LRC (P=.01), and FFDM (P=.005). The average CCCs were 0.89, 0.91, and 0.67 for texture features extracted from the average-CT, T50-CT, and CE-CT, respectively. Incorporating reproducibility within our models yielded 80.4% (±3.7% SD), 78.3% (±4.0% SD), and 78

  16. Prognostic Value and Reproducibility of Pretreatment CT Texture Features in Stage III Non-Small Cell Lung Cancer

    International Nuclear Information System (INIS)

    Fried, David V.; Tucker, Susan L.; Zhou, Shouhao; Liao, Zhongxing; Mawlawi, Osama; Ibbott, Geoffrey; Court, Laurence E.

    2014-01-01

    Purpose: To determine whether pretreatment CT texture features can improve patient risk stratification beyond conventional prognostic factors (CPFs) in stage III non-small cell lung cancer (NSCLC). Methods and Materials: We retrospectively reviewed 91 cases with stage III NSCLC treated with definitive chemoradiation therapy. All patients underwent pretreatment diagnostic contrast enhanced computed tomography (CE-CT) followed by 4-dimensional CT (4D-CT) for treatment simulation. We used the average-CT and expiratory (T50-CT) images from the 4D-CT along with the CE-CT for texture extraction. Histogram, gradient, co-occurrence, gray tone difference, and filtration-based techniques were used for texture feature extraction. Penalized Cox regression implementing cross-validation was used for covariate selection and modeling. Models incorporating texture features from the 33 image types and CPFs were compared to those with models incorporating CPFs alone for overall survival (OS), local-regional control (LRC), and freedom from distant metastases (FFDM). Predictive Kaplan-Meier curves were generated using leave-one-out cross-validation. Patients were stratified based on whether their predicted outcome was above or below the median. Reproducibility of texture features was evaluated using test-retest scans from independent patients and quantified using concordance correlation coefficients (CCC). We compared models incorporating the reproducibility seen on test-retest scans to our original models and determined the classification reproducibility. Results: Models incorporating both texture features and CPFs demonstrated a significant improvement in risk stratification compared to models using CPFs alone for OS (P=.046), LRC (P=.01), and FFDM (P=.005). The average CCCs were 0.89, 0.91, and 0.67 for texture features extracted from the average-CT, T50-CT, and CE-CT, respectively. Incorporating reproducibility within our models yielded 80.4% (±3.7% SD), 78.3% (±4.0% SD), and 78

  17. Novel Use of the Nintendo Wii Board for Measuring Isometric Lower Limb Strength: A Reproducible and Valid Method in Older Adults.

    Science.gov (United States)

    Gronbech Jorgensen, Martin; Andersen, Stig; Ryg, Jesper; Masud, Tahir

    2015-01-01

    Portable, low-cost, objective and reproducible assessment of muscle strength in the lower limbs is important as it allows clinicians to precisly track progression of patients undergoing rehabilitation. The Nintendo Wii Balance Board (WBB) is portable, inexpensive, durable, available worldwide, and may serve the above function. The purpose of the study was to evaluate (1) reproducibility and (2) concurrent validity of the WBB for measuring isometric muscle strength in the lower limb. A custom hardware and software was developed to utilize the WBB for assessment of isometric muscle strength. Thirty older adults (69.0 ± 4.2 years of age) were studied on two separate occasions on both the WBB and a stationary isometric dynamometer (SID). On each occasion, three recordings were obtained from each device. For the first recording, means and maximum values were used for further analysis. The test-retest reproducibility was examined using intraclass correlation coefficients (ICC), Standard Error of Measurement (SEM), and limits of agreement (LOA). Bland-Altman plots (BAP) and ICC's were used to explore concurrent validity. No systematic difference between test-retest was detected for the WBB. ICC within-device were between 0.90 and 0.96 and between-devices were from 0.80 to 0.84. SEM ranged for the WBB from 9.7 to 13.9%, and for the SID from 11.9 to 13.1%. LOA ranged for the WBB from 20.3 to 28.7% and for the SID from 24.2 to 26.6%. The BAP showed no relationship between the difference and the mean. A high relative and an acceptable absolute reproducibility combined with a good validity was found for the novel method using the WBB for measuring isometric lower limb strength in older adults. Further research using the WBB for assessing lower limb strength should be conducted in different study-populations.

  18. Novel Use of the Nintendo Wii Board for Measuring Isometric Lower Limb Strength: A Reproducible and Valid Method in Older Adults.

    Directory of Open Access Journals (Sweden)

    Martin Gronbech Jorgensen

    Full Text Available Portable, low-cost, objective and reproducible assessment of muscle strength in the lower limbs is important as it allows clinicians to precisly track progression of patients undergoing rehabilitation. The Nintendo Wii Balance Board (WBB is portable, inexpensive, durable, available worldwide, and may serve the above function.The purpose of the study was to evaluate (1 reproducibility and (2 concurrent validity of the WBB for measuring isometric muscle strength in the lower limb.A custom hardware and software was developed to utilize the WBB for assessment of isometric muscle strength. Thirty older adults (69.0 ± 4.2 years of age were studied on two separate occasions on both the WBB and a stationary isometric dynamometer (SID. On each occasion, three recordings were obtained from each device. For the first recording, means and maximum values were used for further analysis. The test-retest reproducibility was examined using intraclass correlation coefficients (ICC, Standard Error of Measurement (SEM, and limits of agreement (LOA. Bland-Altman plots (BAP and ICC's were used to explore concurrent validity.No systematic difference between test-retest was detected for the WBB. ICC within-device were between 0.90 and 0.96 and between-devices were from 0.80 to 0.84. SEM ranged for the WBB from 9.7 to 13.9%, and for the SID from 11.9 to 13.1%. LOA ranged for the WBB from 20.3 to 28.7% and for the SID from 24.2 to 26.6%. The BAP showed no relationship between the difference and the mean.A high relative and an acceptable absolute reproducibility combined with a good validity was found for the novel method using the WBB for measuring isometric lower limb strength in older adults. Further research using the WBB for assessing lower limb strength should be conducted in different study-populations.

  19. Sub-maximal and maximal Yo-Yo intermittent endurance test level 2: heart rate response, reproducibility and application to elite soccer

    DEFF Research Database (Denmark)

    Bradley, Paul S; Mohr, Magni; Bendiksen, Mads

    2011-01-01

    to detect test-retest changes and discriminate between performance for different playing standards and positions in elite soccer. Elite (n = 148) and sub-elite male (n = 14) soccer players carried out the Yo-Yo IE2 test on several occasions over consecutive seasons. Test-retest coefficient of variation (CV......) in Yo-Yo IE2 test performance and heart rate after 6 min were 3.9% (n = 37) and 1.4% (n = 32), respectively. Elite male senior and youth U19 players Yo-Yo IE2 performances were better (P ......The aims of this study were to (1) determine the reproducibility of sub-maximal and maximal versions of the Yo-Yo intermittent endurance test level 2 (Yo-Yo IE2 test), (2) assess the relationship between the Yo-Yo IE2 test and match performance and (3) quantify the sensitivity of the Yo-Yo IE2 test...

  20. Contextual sensitivity in scientific reproducibility.

    Science.gov (United States)

    Van Bavel, Jay J; Mende-Siedlecki, Peter; Brady, William J; Reinero, Diego A

    2016-06-07

    In recent years, scientists have paid increasing attention to reproducibility. For example, the Reproducibility Project, a large-scale replication attempt of 100 studies published in top psychology journals found that only 39% could be unambiguously reproduced. There is a growing consensus among scientists that the lack of reproducibility in psychology and other fields stems from various methodological factors, including low statistical power, researcher's degrees of freedom, and an emphasis on publishing surprising positive results. However, there is a contentious debate about the extent to which failures to reproduce certain results might also reflect contextual differences (often termed "hidden moderators") between the original research and the replication attempt. Although psychologists have found extensive evidence that contextual factors alter behavior, some have argued that context is unlikely to influence the results of direct replications precisely because these studies use the same methods as those used in the original research. To help resolve this debate, we recoded the 100 original studies from the Reproducibility Project on the extent to which the research topic of each study was contextually sensitive. Results suggested that the contextual sensitivity of the research topic was associated with replication success, even after statistically adjusting for several methodological characteristics (e.g., statistical power, effect size). The association between contextual sensitivity and replication success did not differ across psychological subdisciplines. These results suggest that researchers, replicators, and consumers should be mindful of contextual factors that might influence a psychological process. We offer several guidelines for dealing with contextual sensitivity in reproducibility.

  1. Handgrip force steadiness in young and older adults: a reproducibility study.

    Science.gov (United States)

    Blomkvist, Andreas W; Eika, Fredrik; de Bruin, Eling D; Andersen, Stig; Jorgensen, Martin

    2018-04-02

    Force steadiness is a quantitative measure of the ability to control muscle tonus. It is an independent predictor of functional performance and has shown to correlate well with different degrees of motor impairment following stroke. Despite being clinically relevant, few studies have assessed the validity of measuring force steadiness. The aim of this study was to explore the reproducibility of handgrip force steadiness, and to assess age difference in steadiness. Intrarater reproducibility (the degree to which a rating gives consistent result on separate occasions) was investigated in a test-retest design with seven days between sessions. Ten young and thirty older adults were recruited and handgrip steadiness was tested at 5%, 10% and 25% of maximum voluntary contraction (MVC) using Nintendo Wii Balance Board (WBB). Coefficients of variation were calculated from the mean force produced (CVM) and the target force (CVT). Area between the force curve and the target force line (Area) was also calculated. For the older adults we explored reliability using intraclass correlation coefficient (ICC) and agreement using standard error of measurement (SEM), limits of agreement (LOA) and smallest real difference (SRD). A systematic improvement in handgrip steadiness was found between sessions for all measures (CVM, CVT, Area). CVM and CVT at 5% of MVC showed good to high reliability, while Area had poor reliability for all percentages of MVC. Averaged ICC for CVM, CVT and Area was 0.815, 0.806 and 0.464, respectively. Averaged ICC on 5%, 10%, and 25% of MVC was 0.751, 0.667 and 0.668, respectively. Measures of agreement showed similar trends with better results for CVM and CVT than for Area. Young adults had better handgrip steadiness than older adults across all measures. The CVM and CVT measures demonstrated good reproducibility at lower percentages of MVC using the WBB, and could become relevant measures in the clinical setting. The Area measure had poor reproducibility

  2. Reproducibility of Automated Voice Range Profiles, a Systematic Literature Review

    DEFF Research Database (Denmark)

    Printz, Trine; Rosenberg, Tine; Godballe, Christian

    2018-01-01

    literature on test-retest accuracy of the automated voice range profile assessment. Study design: Systematic review. Data sources: PubMed, Scopus, Cochrane Library, ComDisDome, Embase, and CINAHL (EBSCO). Methods: We conducted a systematic literature search of six databases from 1983 to 2016. The following...

  3. Reproducibility of Dual-Microphone Voice Range Profile Equipment

    DEFF Research Database (Denmark)

    Printz, Trine; Pedersen, Ellen Raben; Juhl, Peter

    2017-01-01

    in an anechoic chamber and an office: (a) comparing sound pressure levels (SPLs) from a dual-microphone VRP device, the Voice Profiler, when given the same input repeatedly (test-retest reliability); (b) comparing SPLs from 3 devices when given the same input repeatedly (intervariation); and (c) assessing...

  4. Can radiomics features be reproducibly measured from CBCT images for patients with non-small cell lung cancer?

    Energy Technology Data Exchange (ETDEWEB)

    Fave, Xenia, E-mail: xjfave@mdanderson.org; Fried, David [Department of Radiation Physics, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, Texas 77030 and The University of Texas Graduate School of Biomedical Sciences at Houston, 6767 Bertner Avenue, Houston, Texas 77030 (United States); Mackin, Dennis; Yang, Jinzhong; Zhang, Joy; Balter, Peter; Followill, David [Department of Radiation Physics, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, Texas 77030 (United States); Gomez, Daniel [Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, Texas 77030 (United States); Kyle Jones, A. [Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, Texas 77030 (United States); Stingo, Francesco [Department of Biostatistics, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, Texas 77030 (United States); Fontenot, Jonas [Mary Bird Perkins Cancer Center, 4950 Essen Lane, Baton Rouge, Louisiana 70809 (United States); Court, Laurence [Department of Radiation Physics, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, Texas 77030 and Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, Texas 77030 (United States)

    2015-12-15

    Purpose: Increasing evidence suggests radiomics features extracted from computed tomography (CT) images may be useful in prognostic models for patients with nonsmall cell lung cancer (NSCLC). This study was designed to determine whether such features can be reproducibly obtained from cone-beam CT (CBCT) images taken using medical Linac onboard-imaging systems in order to track them through treatment. Methods: Test-retest CBCT images of ten patients previously enrolled in a clinical trial were retrospectively obtained and used to determine the concordance correlation coefficient (CCC) for 68 different texture features. The volume dependence of each feature was also measured using the Spearman rank correlation coefficient. Features with a high reproducibility (CCC > 0.9) that were not due to volume dependence in the patient test-retest set were further examined for their sensitivity to differences in imaging protocol, level of scatter, and amount of motion by using two phantoms. The first phantom was a texture phantom composed of rectangular cartridges to represent different textures. Features were measured from two cartridges, shredded rubber and dense cork, in this study. The texture phantom was scanned with 19 different CBCT imagers to establish the features’ interscanner variability. The effect of scatter on these features was studied by surrounding the same texture phantom with scattering material (rice and solid water). The effect of respiratory motion on these features was studied using a dynamic-motion thoracic phantom and a specially designed tumor texture insert of the shredded rubber material. The differences between scans acquired with different Linacs and protocols, varying amounts of scatter, and with different levels of motion were compared to the mean intrapatient difference from the test-retest image set. Results: Of the original 68 features, 37 had a CCC >0.9 that was not due to volume dependence. When the Linac manufacturer and imaging protocol

  5. Contextual sensitivity in scientific reproducibility

    Science.gov (United States)

    Van Bavel, Jay J.; Mende-Siedlecki, Peter; Brady, William J.; Reinero, Diego A.

    2016-01-01

    In recent years, scientists have paid increasing attention to reproducibility. For example, the Reproducibility Project, a large-scale replication attempt of 100 studies published in top psychology journals found that only 39% could be unambiguously reproduced. There is a growing consensus among scientists that the lack of reproducibility in psychology and other fields stems from various methodological factors, including low statistical power, researcher’s degrees of freedom, and an emphasis on publishing surprising positive results. However, there is a contentious debate about the extent to which failures to reproduce certain results might also reflect contextual differences (often termed “hidden moderators”) between the original research and the replication attempt. Although psychologists have found extensive evidence that contextual factors alter behavior, some have argued that context is unlikely to influence the results of direct replications precisely because these studies use the same methods as those used in the original research. To help resolve this debate, we recoded the 100 original studies from the Reproducibility Project on the extent to which the research topic of each study was contextually sensitive. Results suggested that the contextual sensitivity of the research topic was associated with replication success, even after statistically adjusting for several methodological characteristics (e.g., statistical power, effect size). The association between contextual sensitivity and replication success did not differ across psychological subdisciplines. These results suggest that researchers, replicators, and consumers should be mindful of contextual factors that might influence a psychological process. We offer several guidelines for dealing with contextual sensitivity in reproducibility. PMID:27217556

  6. Reproducibility of heart rate and perceptual demands of game-based training drills in handball players

    Directory of Open Access Journals (Sweden)

    Gilles Ravier

    2017-12-01

    Full Text Available Game-based training are popular in team-sports; however there is a lack of research specific to team handball. The aim of this study was to assess i the test-retest reliability of heart rate (HR, time spent in HR zone intensities and rating of perceived exertion of a novel small-sided game, ii and whether it is comparable to that of generic intermittent shuttle running and match play with team handball players. Fourteen elite male handball players completed each exercise comprising two periods of 10min interspersed with 2min recovery in separate occasions and repeated them one week apart. Exercises consisted of intermittent 30s-30s shuttle running (ISR, intermittent 30s-30s small-sided game (with 3-a-side field players, 3vs3 and match play (with 6-a-side field players, 6vs6. Mean HR demonstrated high level of reproducibility for the three drills (r = 0.86-0.89, TEM = 2.21-2.63 bpm, CV = 1.23-1.55%. For time spent in heart rate zones TEMs reached up 1.12, 1.40 and 2.48 min for ISR, 6vs6 and 3vs3, respectively. Specifically for HR zone higher than 90% of HRmax, CVs showed wide extent of scores with 9.73 (ISR, 27.39 (6vs6 and 108.29% (3vs3. Mean HR results suggest that physiological response was consistent between sessions. Because of the poor reproducibility for time spent in the target zone higher than 90% of HRmax, the efficiency of both 3vs3 and 6vs6 in improving aerobic power should be analysed with caution. The present results suggest that reproducibility of physiological demand of ball-drills should be considered before prescribing them as conditioning training.

  7. Testing Reproducibility in Earth Sciences

    Science.gov (United States)

    Church, M. A.; Dudill, A. R.; Frey, P.; Venditti, J. G.

    2017-12-01

    Reproducibility represents how closely the results of independent tests agree when undertaken using the same materials but different conditions of measurement, such as operator, equipment or laboratory. The concept of reproducibility is fundamental to the scientific method as it prevents the persistence of incorrect or biased results. Yet currently the production of scientific knowledge emphasizes rapid publication of previously unreported findings, a culture that has emerged from pressures related to hiring, publication criteria and funding requirements. Awareness and critique of the disconnect between how scientific research should be undertaken, and how it actually is conducted, has been prominent in biomedicine for over a decade, with the fields of economics and psychology more recently joining the conversation. The purpose of this presentation is to stimulate the conversation in earth sciences where, despite implicit evidence in widely accepted classifications, formal testing of reproducibility is rare.As a formal test of reproducibility, two sets of experiments were undertaken with the same experimental procedure, at the same scale, but in different laboratories. Using narrow, steep flumes and spherical glass beads, grain size sorting was examined by introducing fine sediment of varying size and quantity into a mobile coarse bed. The general setup was identical, including flume width and slope; however, there were some variations in the materials, construction and lab environment. Comparison of the results includes examination of the infiltration profiles, sediment mobility and transport characteristics. The physical phenomena were qualitatively reproduced but not quantitatively replicated. Reproduction of results encourages more robust research and reporting, and facilitates exploration of possible variations in data in various specific contexts. Following the lead of other fields, testing of reproducibility can be incentivized through changes to journal

  8. The web-based ASSO-food frequency questionnaire for adolescents: relative and absolute reproducibility assessment.

    Science.gov (United States)

    Filippi, Anna Rita; Amodio, Emanuele; Napoli, Giuseppe; Breda, João; Bianco, Antonino; Jemni, Monèm; Censi, Laura; Mammina, Caterina; Tabacchi, Garden

    2014-12-17

    A new food frequency questionnaire (FFQ) has been recently developed within the Italian Adolescents and Surveillance System for the Obesity prevention (ASSO) Project; it was found to be appropriate for ranking adolescents in food and nutrient levels of intake. The aim of this study was to assess the relative and absolute reproducibility of the ASSO-FFQ for 24 food groups, energy and 52 nutrients. A test-retest study was performed on two ASSO-FFQs administered one month apart of each other to 185 adolescents, aged 14-17 and attending secondary schools in Palermo (Italy). Wilcoxon test assessed differences in median daily intakes between the two FFQs. Agreement was evaluated by quintiles comparison and weighted kappa. Intraclass Correlation Coefficients (ICC) and Bland-Altman method assessed the relative and absolute reliability respectively. Significant difference (p food, water, soft drinks, carbohydrates and sugar. The subjects classified into the same or adjacent quintiles for food groups ranged from 62% (white bread) to 91% (soft drinks); for energy and nutrients from 64% (polyunsaturated fatty acids) to 90% (ethanol). Mean values of weighted kappa were 0.47 and 0.48, respectively for food groups and nutrients. Fair to good ICC values (>0.40) were assessed for thirteen food groups, energy and forty-three nutrients. Limits of Agreement were narrow for almost all food groups and all nutrients. The ASSO-FFQ is a reliable instrument for estimating food groups, energy and nutrients intake in adolescents.

  9. Reproducible Bioinformatics Research for Biologists

    Science.gov (United States)

    This book chapter describes the current Big Data problem in Bioinformatics and the resulting issues with performing reproducible computational research. The core of the chapter provides guidelines and summaries of current tools/techniques that a noncomputational researcher would need to learn to pe...

  10. Reproducibility of brain ADC histograms

    International Nuclear Information System (INIS)

    Steens, S.C.A.; Buchem, M.A. van; Admiraal-Behloul, F.; Schaap, J.A.; Hoogenraad, F.G.C.; Wheeler-Kingshott, C.A.M.; Tofts, P.S.; Cessie, S. le

    2004-01-01

    The aim of this study was to assess the effect of differences in acquisition technique on whole-brain apparent diffusion coefficient (ADC) histogram parameters, as well as to assess scan-rescan reproducibility. Diffusion-weighted imaging (DWI) was performed in 7 healthy subjects with b-values 0-800, 0-1000, and 0-1500 s/mm 2 and fluid-attenuated inversion recovery (FLAIR) DWI with b-values 0-1000 s/mm 2 . All sequences were repeated with and without repositioning. The peak location, peak height, and mean ADC of the ADC histograms and mean ADC of a region of interest (ROI) in the white matter were compared using paired-sample t tests. Scan-rescan reproducibility was assessed using paired-sample t tests, and repeatability coefficients were reported. With increasing maximum b-values, ADC histograms shifted to lower values, with an increase in peak height (p<0.01). With FLAIR DWI, the ADC histogram shifted to lower values with a significantly higher, narrower peak (p<0.01), although the ROI mean ADC showed no significant differences. For scan-rescan reproducibility, no significant differences were observed. Different DWI pulse sequences give rise to different ADC histograms. With a given pulse sequence, however, ADC histogram analysis is a robust and reproducible technique. Using FLAIR DWI, the partial-voluming effect of cerebrospinal fluid, and thus its confounding effect on histogram analyses, can be reduced

  11. Reproducibility of a reaming test

    DEFF Research Database (Denmark)

    Pilny, Lukas; Müller, Pavel; De Chiffre, Leonardo

    2012-01-01

    The reproducibility of a reaming test was analysed to document its applicability as a performance test for cutting fluids. Reaming tests were carried out on a drilling machine using HSS reamers. Workpiece material was an austenitic stainless steel, machined using 4.75 m∙min-1 cutting speed and 0......). Process reproducibility was assessed as the ability of different operators to ensure a consistent rating of individual lubricants. Absolute average values as well as experimental standard deviations of the evaluation parameters were calculated, and uncertainty budgeting was performed. Results document...... a built-up edge occurrence hindering a robust evaluation of cutting fluid performance, if the data evaluation is based on surface finish only. Measurements of hole geometry provide documentation to recognize systematic error distorting the performance test....

  12. Reproducibility of a reaming test

    DEFF Research Database (Denmark)

    Pilny, Lukas; Müller, Pavel; De Chiffre, Leonardo

    2014-01-01

    The reproducibility of a reaming test was analysed to document its applicability as a performance test for cutting fluids. Reaming tests were carried out on a drilling machine using HSS reamers. Workpiece material was an austenitic stainless steel, machined using 4.75 m•min−1 cutting speed and 0......). Process reproducibility was assessed as the ability of different operators to ensure a consistent rating of individual lubricants. Absolute average values as well as experimental standard deviations of the evaluation parameters were calculated, and uncertainty budgeting was performed. Results document...... a built–up edge occurrence hindering a robust evaluation of cutting fluid performance, if the data evaluation is based on surface finish only. Measurements of hole geometry provide documentation to recognise systematic error distorting the performance test....

  13. Test-retest reliability of knee kinematics measurement during gait ...

    African Journals Online (AJOL)

    ACLR) is crucial to minimize the risk of joint degeneration. To achieve this, it is essential that the chosen measurement method can accurately assess knee kinematics and detect the changes in multi-planes of motion. However to date, limited ...

  14. Test-retest studies in quantitative sensory testing

    DEFF Research Database (Denmark)

    Werner, M U; Petersen, M A; Bischoff, J M

    2013-01-01

    Quantitative sensory testing (QST) investigates the graded psychophysical response to controlled thermal, mechanical, electrical or chemical stimuli, allowing quantification of clinically relevant perception and pain thresholds. The methods are ubiquitously used in experimental and clinical pain...... research, and therefore, the need for uniform assessment procedures has been emphasised. However, varying consistency and transparency in the statistical methodology seem to occur in the QST literature. Sixteen publications, evaluating aspects of QST variability, from 2010 to 2012, were critically reviewed...

  15. Towards Reproducibility in Computational Hydrology

    Science.gov (United States)

    Hutton, Christopher; Wagener, Thorsten; Freer, Jim; Han, Dawei; Duffy, Chris; Arheimer, Berit

    2017-04-01

    Reproducibility is a foundational principle in scientific research. The ability to independently re-run an experiment helps to verify the legitimacy of individual findings, and evolve (or reject) hypotheses and models of how environmental systems function, and move them from specific circumstances to more general theory. Yet in computational hydrology (and in environmental science more widely) the code and data that produces published results are not regularly made available, and even if they are made available, there remains a multitude of generally unreported choices that an individual scientist may have made that impact the study result. This situation strongly inhibits the ability of our community to reproduce and verify previous findings, as all the information and boundary conditions required to set up a computational experiment simply cannot be reported in an article's text alone. In Hutton et al 2016 [1], we argue that a cultural change is required in the computational hydrological community, in order to advance and make more robust the process of knowledge creation and hypothesis testing. We need to adopt common standards and infrastructures to: (1) make code readable and re-useable; (2) create well-documented workflows that combine re-useable code together with data to enable published scientific findings to be reproduced; (3) make code and workflows available, easy to find, and easy to interpret, using code and code metadata repositories. To create change we argue for improved graduate training in these areas. In this talk we reflect on our progress in achieving reproducible, open science in computational hydrology, which are relevant to the broader computational geoscience community. In particular, we draw on our experience in the Switch-On (EU funded) virtual water science laboratory (http://www.switch-on-vwsl.eu/participate/), which is an open platform for collaboration in hydrological experiments (e.g. [2]). While we use computational hydrology as

  16. Reproducibility of isotope ratio measurements

    International Nuclear Information System (INIS)

    Elmore, D.

    1981-01-01

    The use of an accelerator as part of a mass spectrometer has improved the sensitivity for measuring low levels of long-lived radionuclides by several orders of magnitude. However, the complexity of a large tandem accelerator and beam transport system has made it difficult to match the precision of low energy mass spectrometry. Although uncertainties for accelerator measured isotope ratios as low as 1% have been obtained under favorable conditions, most errors quoted in the literature for natural samples are in the 5 to 20% range. These errors are dominated by statistics and generally the reproducibility is unknown since the samples are only measured once

  17. Evaluation of multichannel reproduced sound

    DEFF Research Database (Denmark)

    Choisel, Sylvain; Wickelmaier, Florian Maria

    2007-01-01

    A study was conducted with the goal of quantifying auditory attributes which underlie listener preference for multichannel reproduced sound. Short musical excerpts were presented in mono, stereo and several multichannel formats to a panel of forty selected listeners. Scaling of auditory attributes......, as well as overall preference, was based on consistency tests of binary paired-comparison judgments and on modeling the choice frequencies using probabilistic choice models. As a result, the preferences of non-expert listeners could be measured reliably at a ratio scale level. Principal components derived...

  18. Reproducibility for Heart Rate Variability Analysis during 6-Min Walk Test in Patients with Heart Failure and Agreement between Devices.

    Science.gov (United States)

    Braga, Lays Magalhães; Prado, Gustavo Faibischew; Umeda, Iracema Ioco Kikuchi; Kawauchi, Tatiana Satie; Taboada, Adriana Marques Fróes; Azevedo, Raymundo Soares; Pereira Filho, Horacio Gomes; Grupi, César José; Souza, Hayala Cristina Cavenague; Moreira, Dalmo Antônio Ribeiro; Nakagawa, Naomi Kondo

    2016-01-01

    Heart rate variability (HRV) analysis is a useful method to assess abnormal functioning in the autonomic nervous system and to predict cardiac events in patients with heart failure (HF). HRV measurements with heart rate monitors have been validated with an electrocardiograph in healthy subjects but not in patients with HF. We explored the reproducibility of HRV in two consecutive six-minute walk tests (6MW), 60-minute apart, using a heart rate monitor (PolarS810i) and a portable electrocardiograph (called Holter) in 50 HF patients (mean age 59 years, NYHA II, left ventricular ejection fraction ~35%). The reproducibility for each device was analysed using a paired t-test or the Wilcoxon signed-rank test. Additionally, we assessed the agreement between the two devices based on the HRV indices at rest, during the 6MW and during recovery using concordance correlation coefficients (CCC), 95% confidence intervals and Bland-Altman plots. The test-retest for the HRV analyses was reproducible using Holter and PolarS810i at rest but not during recovery. In the second 6MW, patients showed significant increases in rMSSD and walking distance. The PolarS810i measurements had remarkably high concordance correlation [0.86reproducibility of HRV at rest in two consecutive 6MW using Holter and PolarS810i. Additionally, PolarS810i produced good agreements in short-term HRV indices based on Holter simultaneous recordings at rest, during the 6MW and recovery in HF patients.

  19. Repeatability and Reproducibility of Retinal Nerve Fiber Layer Parameters Measured by Scanning Laser Polarimetry with Enhanced Corneal Compensation in Normal and Glaucomatous Eyes.

    Science.gov (United States)

    Ara, Mirian; Ferreras, Antonio; Pajarin, Ana B; Calvo, Pilar; Figus, Michele; Frezzotti, Paolo

    2015-01-01

    To assess the intrasession repeatability and intersession reproducibility of peripapillary retinal nerve fiber layer (RNFL) thickness parameters measured by scanning laser polarimetry (SLP) with enhanced corneal compensation (ECC) in healthy and glaucomatous eyes. One randomly selected eye of 82 healthy individuals and 60 glaucoma subjects was evaluated. Three scans were acquired during the first visit to evaluate intravisit repeatability. A different operator obtained two additional scans within 2 months after the first session to determine intervisit reproducibility. The intraclass correlation coefficient (ICC), coefficient of variation (COV), and test-retest variability (TRT) were calculated for all SLP parameters in both groups. ICCs ranged from 0.920 to 0.982 for intravisit measurements and from 0.910 to 0.978 for intervisit measurements. The temporal-superior-nasal-inferior-temporal (TSNIT) average was the highest (0.967 and 0.946) in normal eyes, while nerve fiber indicator (NFI; 0.982) and inferior average (0.978) yielded the best ICC in glaucomatous eyes for intravisit and intervisit measurements, respectively. All COVs were under 10% in both groups, except NFI. TSNIT average had the lowest COV (2.43%) in either type of measurement. Intervisit TRT ranged from 6.48 to 12.84. The reproducibility of peripapillary RNFL measurements obtained with SLP-ECC was excellent, indicating that SLP-ECC is sufficiently accurate for monitoring glaucoma progression.

  20. Reproducible research: a minority opinion

    Science.gov (United States)

    Drummond, Chris

    2018-01-01

    Reproducible research, a growing movement within many scientific fields, including machine learning, would require the code, used to generate the experimental results, be published along with any paper. Probably the most compelling argument for this is that it is simply following good scientific practice, established over the years by the greats of science. The implication is that failure to follow such a practice is unscientific, not a label any machine learning researchers would like to carry. It is further claimed that misconduct is causing a growing crisis of confidence in science. That, without this practice being enforced, science would inevitably fall into disrepute. This viewpoint is becoming ubiquitous but here I offer a differing opinion. I argue that far from being central to science, what is being promulgated is a narrow interpretation of how science works. I contend that the consequences are somewhat overstated. I would also contend that the effort necessary to meet the movement's aims, and the general attitude it engenders would not serve well any of the research disciplines, including our own.

  1. Field assessment of balance in 10 to 14 year old children, reproducibility and validity of the Nintendo Wii board.

    Science.gov (United States)

    Larsen, Lisbeth Runge; Jørgensen, Martin Grønbech; Junge, Tina; Juul-Kristensen, Birgit; Wedderkopp, Niels

    2014-06-10

    Because body proportions in childhood are different to those in adulthood, children have a relatively higher centre of mass location. This biomechanical difference and the fact that children's movements have not yet fully matured result in different sway performances in children and adults. When assessing static balance, it is essential to use objective, sensitive tools, and these types of measurement have previously been performed in laboratory settings. However, the emergence of technologies like the Nintendo Wii Board (NWB) might allow balance assessment in field settings. As the NWB has only been validated and tested for reproducibility in adults, the purpose of this study was to examine reproducibility and validity of the NWB in a field setting, in a population of children. Fifty-four 10-14 year-olds from the CHAMPS-Study DK performed four different balance tests: bilateral stance with eyes open (1), unilateral stance on dominant (2) and non-dominant leg (3) with eyes open, and bilateral stance with eyes closed (4). Three rounds of the four tests were completed with the NWB and with a force platform (AMTI). To assess reproducibility, an intra-day test-retest design was applied with a two-hour break between sessions. Bland-Altman plots supplemented by Minimum Detectable Change (MDC) and concordance correlation coefficient (CCC) demonstrated satisfactory reproducibility for the NWB and the AMTI (MDC: 26.3-28.2%, CCC: 0.76-0.86) using Centre Of Pressure path Length as measurement parameter. Bland-Altman plots demonstrated satisfactory concurrent validity between the NWB and the AMTI, supplemented by satisfactory CCC in all tests (CCC: 0.74-0.87). The ranges of the limits of agreement in the validity study were comparable to the limits of agreement of the reproducibility study. Both NWB and AMTI have satisfactory reproducibility for testing static balance in a population of children. Concurrent validity of NWB compared with AMTI was satisfactory. Furthermore, the

  2. Strategies for the generation of parametric images of [11C]PIB with plasma input functions considering discriminations and reproducibility.

    Science.gov (United States)

    Edison, Paul; Brooks, David J; Turkheimer, Federico E; Archer, Hilary A; Hinz, Rainer

    2009-11-01

    Pittsburgh compound B or [11C]PIB is an amyloid imaging agent which shows a clear differentiation between subjects with Alzheimer's disease (AD) and controls. However the observed signal difference in other forms of dementia such as dementia with Lewy bodies (DLB) is smaller, and mild cognitively impaired (MCI) subjects and some healthy elderly normals may show intermediate levels of [11C]PIB binding. The cerebellum, a commonly used reference region for non-specific tracer uptake in [11C]PIB studies in AD may not be valid in Prion disorders or monogenic forms of AD. The aim of this work was to: 1-compare methods for generating parametric maps of [11C]PIB retention in tissue using a plasma input function in respect of their ability to discriminate between AD subjects and controls and 2-estimate the test-retest reproducibility in AD subjects. 12 AD subjects (5 of which underwent a repeat scan within 6 weeks) and 10 control subjects had 90 minute [11C]PIB dynamic PET scans, and arterial plasma input functions were measured. Parametric maps were generated with graphical analysis of reversible binding (Logan plot), irreversible binding (Patlak plot), and spectral analysis. Between group differentiation was calculated using Student's t-test and comparisons between different methods were made using p values. Reproducibility was assessed by intraclass correlation coefficients (ICC). We found that the 75 min value of the impulse response function showed the best group differentiation and had a higher ICC than volume of distribution maps generated from Logan and spectral analysis. Patlak analysis of [11C]PIB binding was the least reproducible.

  3. Theory of reproducing kernels and applications

    CERN Document Server

    Saitoh, Saburou

    2016-01-01

    This book provides a large extension of the general theory of reproducing kernels published by N. Aronszajn in 1950, with many concrete applications. In Chapter 1, many concrete reproducing kernels are first introduced with detailed information. Chapter 2 presents a general and global theory of reproducing kernels with basic applications in a self-contained way. Many fundamental operations among reproducing kernel Hilbert spaces are dealt with. Chapter 2 is the heart of this book. Chapter 3 is devoted to the Tikhonov regularization using the theory of reproducing kernels with applications to numerical and practical solutions of bounded linear operator equations. In Chapter 4, the numerical real inversion formulas of the Laplace transform are presented by applying the Tikhonov regularization, where the reproducing kernels play a key role in the results. Chapter 5 deals with ordinary differential equations; Chapter 6 includes many concrete results for various fundamental partial differential equations. In Chapt...

  4. Reproducibility of surface roughness in reaming

    DEFF Research Database (Denmark)

    Müller, Pavel; De Chiffre, Leonardo

    An investigation on the reproducibility of surface roughness in reaming was performed to document the applicability of this approach for testing cutting fluids. Austenitic stainless steel was used as a workpiece material and HSS reamers as cutting tools. Reproducibility of the results was evaluat...

  5. Reproducibility principles, problems, practices, and prospects

    CERN Document Server

    Maasen, Sabine

    2016-01-01

    Featuring peer-reviewed contributions from noted experts in their fields of research, Reproducibility: Principles, Problems, Practices, and Prospects presents state-of-the-art approaches to reproducibility, the gold standard sound science, from multi- and interdisciplinary perspectives. Including comprehensive coverage for implementing and reflecting the norm of reproducibility in various pertinent fields of research, the book focuses on how the reproducibility of results is applied, how it may be limited, and how such limitations can be understood or even controlled in the natural sciences, computational sciences, life sciences, social sciences, and studies of science and technology. The book presents many chapters devoted to a variety of methods and techniques, as well as their epistemic and ontological underpinnings, which have been developed to safeguard reproducible research and curtail deficits and failures. The book also investigates the political, historical, and social practices that underlie repro...

  6. Reproducibility indices applied to cervical pressure pain threshold measurements in healthy subjects.

    Science.gov (United States)

    Prushansky, Tamara; Dvir, Zeevi; Defrin-Assa, Ruth

    2004-01-01

    To apply various statistical indices for reproducibility analysis of pressure pain threshold measurements and to derive a preferred pressure pain threshold measurement protocol based on these indices. The pressure pain threshold of 3 pairs of right and left homologous cervical region sites were measured in 20 healthy subjects (10 women, 10 men) using a hand-held pressure algometer. Measurements took place on 2 occasions (test 1 and test 2) separated by a mean interval of 1 week. On each testing session, the site-related pressure pain thresholds were measured 3 times each according to 2 different protocols. Protocol A consisted of a repetitive order, namely 3 consecutive measurements at each site before proceeding to the next, whereas protocol B consisted of an alternate order in which 3 consecutive rounds of all individually tested sites took place. For test 1, protocol A was followed by protocol B with an hour interval. For test 2, the reverse order took place. The findings revealed no significant differences between the two protocols and indicated a significant rise (P test 1 to test 2 in both protocols. Absolute values (mean +/-SD) derived from the entire sample of pressure pain threshold sites ranged from 140 +/- 60 to 198.7 +/- 95 kPa (1.60 +/- 0.6 to 1.99 +/- 0.95 kg/cm, respectively). No significant gender or side differences were noted. Pearson r as well as the intraclass correlation coefficient revealed good to excellent reproducibility for both protocols and for all sites measured: r = 0.79-0.94 and intraclass correlation coefficient(3,3) = 0.85-0.96, respectively. To define site-specific cutoff values indicating change at the 95% confidence level, 1.96*SEM was calculated, and its values ranged from 31.6 to 58.2 kPa, which correspond to 16.8% to 32.8% of the absolute mean values. In addition, the limits of agreement, which depict the individual test-retest differences relative to their mean, indicated a heteroscedastic trend. The two protocols yielded

  7. Reproducibility of automated simplified voxel-based analysis of PET amyloid ligand [11C]PIB uptake using 30-min scanning data

    International Nuclear Information System (INIS)

    Aalto, Sargo; Scheinin, Noora M.; Naagren, Kjell; Rinne, Juha O.; Kemppainen, Nina M.; Kailajaervi, Marita; Leinonen, Mika; Scheinin, Mika

    2009-01-01

    Positron emission tomography (PET) with 11 C-labelled Pittsburgh compound B ([ 11 C]PIB) enables the quantitation of β-amyloid accumulation in the brain of patients with Alzheimer's disease (AD). Voxel-based image analysis techniques conducted in a standard brain space provide an objective, rapid and fully automated method to analyze [ 11 C]PIB PET data. The purpose of this study was to evaluate both region- and voxel-level reproducibility of automated and simplified [ 11 C]PIB quantitation when using only 30 min of imaging data. Six AD patients and four healthy controls were scanned twice with an average interval of 6 weeks. To evaluate the feasibility of short scanning (convenient for AD patients), [ 11 C]PIB uptake was quantitated using 30 min of imaging data (60 to 90 min after tracer injection) for region-to-cerebellum ratio calculations. To evaluate the reproducibility, a test-retest design was used to derive absolute variability (VAR) estimates and intraclass correlation coefficients at both region-of-interest (ROI) and voxel level. The reproducibility both at the region level (VAR 0.9-5.5%) and at the voxel level (VAR 4.2-6.4%) was good to excellent. Based on the variability estimates obtained, power calculations indicated that 90% power to obtain statistically significant difference can be achieved using a sample size of five subjects per group when a 15% change from baseline (increase or decrease) in [ 11 C]PIB accumulation in the frontal cortex is anticipated in one group compared to no change in another group. Our results showed that an automated analysis method based on an efficient scanning protocol provides reproducible results for [ 11 C]PIB uptake and appears suitable for PET studies aiming at the quantitation of amyloid accumulation in the brain of AD patients for the evaluation of progression and treatment effects. (orig.)

  8. Learning Reproducibility with a Yearly Networking Contest

    KAUST Repository

    Canini, Marco

    2017-08-10

    Better reproducibility of networking research results is currently a major goal that the academic community is striving towards. This position paper makes the case that improving the extent and pervasiveness of reproducible research can be greatly fostered by organizing a yearly international contest. We argue that holding a contest undertaken by a plurality of students will have benefits that are two-fold. First, it will promote hands-on learning of skills that are helpful in producing artifacts at the replicable-research level. Second, it will advance the best practices regarding environments, testbeds, and tools that will aid the tasks of reproducibility evaluation committees by and large.

  9. The Economics of Reproducibility in Preclinical Research.

    Directory of Open Access Journals (Sweden)

    Leonard P Freedman

    2015-06-01

    Full Text Available Low reproducibility rates within life science research undermine cumulative knowledge production and contribute to both delays and costs of therapeutic drug development. An analysis of past studies indicates that the cumulative (total prevalence of irreproducible preclinical research exceeds 50%, resulting in approximately US$28,000,000,000 (US$28B/year spent on preclinical research that is not reproducible-in the United States alone. We outline a framework for solutions and a plan for long-term improvements in reproducibility rates that will help to accelerate the discovery of life-saving therapies and cures.

  10. Thou Shalt Be Reproducible! A Technology Perspective

    Directory of Open Access Journals (Sweden)

    Patrick Mair

    2016-07-01

    Full Text Available This article elaborates on reproducibility in psychology from a technological viewpoint. Modernopen source computational environments are shown and explained that foster reproducibilitythroughout the whole research life cycle, and to which emerging psychology researchers shouldbe sensitized, are shown and explained. First, data archiving platforms that make datasets publiclyavailable are presented. Second, R is advocated as the data-analytic lingua franca in psychologyfor achieving reproducible statistical analysis. Third, dynamic report generation environments forwriting reproducible manuscripts that integrate text, data analysis, and statistical outputs such asfigures and tables in a single document are described. Supplementary materials are provided inorder to get the reader started with these technologies.

  11. Task and task-free fMRI reproducibility comparison for motor network identification

    NARCIS (Netherlands)

    Kristo, G.; Rutten, G.J.; Raemaekers, M.; de Gelder, B.; Rombouts, S.A.R.B.; Ramsey, N.F.

    2014-01-01

    Test-retest reliability of individual functional magnetic resonance imaging (fMRI) results is of importance in clinical practice and longitudinal experiments. While several studies have investigated reliability of task-induced motor network activation, less is known about the reliability of the

  12. Examination of reproducibility in microbiological degredation experiments

    DEFF Research Database (Denmark)

    Sommer, Helle Mølgaard; Spliid, Henrik; Holst, Helle

    1998-01-01

    Experimental data indicate that certain microbiological degradation experiments have a limited reproducibility. Nine identical batch experiments were carried out on 3 different days to examine reproducibility. A pure culture, isolated from soil, grew with toluene as the only carbon and energy...... source. Toluene was degraded under aerobic conditions at a constant temperature of 28 degreesC. The experiments were modelled by a Monod model - extended to meet the air/liquid system, and the parameter values were estimated using a statistical nonlinear estimation procedure. Model reduction analysis...... resulted in a simpler model without the biomass decay term. In order to test for model reduction and reproducibility of parameter estimates, a likelihood ratio test was employed. The limited reproducibility for these experiments implied that all 9 batch experiments could not be described by the same set...

  13. Archiving Reproducible Research with R and Dataverse

    DEFF Research Database (Denmark)

    Leeper, Thomas

    2014-01-01

    Reproducible research and data archiving are increasingly important issues in research involving statistical analyses of quantitative data. This article introduces the dvn package, which allows R users to publicly archive datasets, analysis files, codebooks, and associated metadata in Dataverse...

  14. Reproducing Epidemiologic Research and Ensuring Transparency.

    Science.gov (United States)

    Coughlin, Steven S

    2017-08-15

    Measures for ensuring that epidemiologic studies are reproducible include making data sets and software available to other researchers so they can verify published findings, conduct alternative analyses of the data, and check for statistical errors or programming errors. Recent developments related to the reproducibility and transparency of epidemiologic studies include the creation of a global platform for sharing data from clinical trials and the anticipated future extension of the global platform to non-clinical trial data. Government agencies and departments such as the US Department of Veterans Affairs Cooperative Studies Program have also enhanced their data repositories and data sharing resources. The Institute of Medicine and the International Committee of Medical Journal Editors released guidance on sharing clinical trial data. The US National Institutes of Health has updated their data-sharing policies. In this issue of the Journal, Shepherd et al. (Am J Epidemiol. 2017;186:387-392) outline a pragmatic approach for reproducible research with sensitive data for studies for which data cannot be shared because of legal or ethical restrictions. Their proposed quasi-reproducible approach facilitates the dissemination of statistical methods and codes to independent researchers. Both reproducibility and quasi-reproducibility can increase transparency for critical evaluation, further dissemination of study methods, and expedite the exchange of ideas among researchers. © The Author(s) 2017. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  15. Reproducibility of central lumbar vertebral BMD

    International Nuclear Information System (INIS)

    Chan, F.; Pocock, N.; Griffiths, M.; Majerovic, Y.; Freund, J.

    1997-01-01

    Full text: Lumbar vertebral bone mineral density (BMD) using dual X-ray absorptiometry (DXA) has generally been calculated from a region of interest which includes the entire vertebral body. Although this region excludes part of the transverse processes, it does include the outer cortical shell of the vertebra. Recent software has been devised to calculate BMD in a central vertebral region of interest which excludes the outer cortical envelope. Theoretically this area may be more sensitive to detecting osteoporosis which affects trabecular bone to a greater extent than cortical bone. Apart from the sensitivity of BMD estimation, the reproducibility of any measurement is important owing to the slow rate of change of bone mass. We have evaluated the reproducibility of this new vertebral region of interest in 23 women who had duplicate lumbar spine DXA scans performed on the same day. The patients were repositioned between each measurement. Central vertebral analysis was performed for L2-L4 and the reproducibility of area, bone mineral content (BMC) and BMD calculated as the coefficient of variation; these values were compared with those from conventional analysis. Thus we have shown that the reproducibility of the central BMD is comparable to the conventional analysis which is essential if this technique is to provide any additional clinical data. The reasons for the decrease in reproducibility of the area and hence BMC requires further investigation

  16. Enacting the International/Reproducing Eurocentrism

    Directory of Open Access Journals (Sweden)

    Zeynep Gülşah Çapan

    Full Text Available Abstract This article focuses on the way in which Eurocentric conceptualisations of the ‘international’ are reproduced in different geopolitical contexts. Even though the Eurocentrism of International Relations has received growing attention, it has predominantly been concerned with unearthing the Eurocentrism of the ‘centre’, overlooking its varied manifestations in other geopolitical contexts. The article seeks to contribute to discussions about Eurocentrism by examining how different conceptualisations of the international are at work at a particular moment, and how these conceptualisations continue to reproduce Eurocentrism. It will focus on the way in which Eurocentric designations of spatial and temporal hierarchies were reproduced in the context of Turkey through a reading of how the ‘Gezi Park protests’ of 2013 and ‘Turkey’ itself were written into the story of the international.

  17. Reproducibility, controllability, and optimization of LENR experiments

    Energy Technology Data Exchange (ETDEWEB)

    Nagel, David J. [The George Washington University, Washington DC 20052 (United States)

    2006-07-01

    Low-energy nuclear reaction (LENR) measurements are significantly, and increasingly reproducible. Practical control of the production of energy or materials by LENR has yet to be demonstrated. Minimization of costly inputs and maximization of desired outputs of LENR remain for future developments. The paper concludes by underlying that it is now clearly that demands for reproducible experiments in the early years of LENR experiments were premature. In fact, one can argue that irreproducibility should be expected for early experiments in a complex new field. As emphasized in the paper and as often happened in the history of science, experimental and theoretical progress can take even decades. It is likely to be many years before investments in LENR experiments will yield significant returns, even for successful research programs. However, it is clearly that a fundamental understanding of the anomalous effects observed in numerous experiments will significantly increase reproducibility, improve controllability, enable optimization of processes, and accelerate the economic viability of LENR.

  18. Reproducibility, controllability, and optimization of LENR experiments

    International Nuclear Information System (INIS)

    Nagel, David J.

    2006-01-01

    Low-energy nuclear reaction (LENR) measurements are significantly, and increasingly reproducible. Practical control of the production of energy or materials by LENR has yet to be demonstrated. Minimization of costly inputs and maximization of desired outputs of LENR remain for future developments. The paper concludes by underlying that it is now clearly that demands for reproducible experiments in the early years of LENR experiments were premature. In fact, one can argue that irreproducibility should be expected for early experiments in a complex new field. As emphasized in the paper and as often happened in the history of science, experimental and theoretical progress can take even decades. It is likely to be many years before investments in LENR experiments will yield significant returns, even for successful research programs. However, it is clearly that a fundamental understanding of the anomalous effects observed in numerous experiments will significantly increase reproducibility, improve controllability, enable optimization of processes, and accelerate the economic viability of LENR

  19. Undefined cellulase formulations hinder scientific reproducibility.

    Science.gov (United States)

    Himmel, Michael E; Abbas, Charles A; Baker, John O; Bayer, Edward A; Bomble, Yannick J; Brunecky, Roman; Chen, Xiaowen; Felby, Claus; Jeoh, Tina; Kumar, Rajeev; McCleary, Barry V; Pletschke, Brett I; Tucker, Melvin P; Wyman, Charles E; Decker, Stephen R

    2017-01-01

    In the shadow of a burgeoning biomass-to-fuels industry, biological conversion of lignocellulose to fermentable sugars in a cost-effective manner is key to the success of second-generation and advanced biofuel production. For the effective comparison of one cellulase preparation to another, cellulase assays are typically carried out with one or more engineered cellulase formulations or natural exoproteomes of known performance serving as positive controls. When these formulations have unknown composition, as is the case with several widely used commercial products, it becomes impossible to compare or reproduce work done today to work done in the future, where, for example, such preparations may not be available. Therefore, being a critical tenet of science publishing, experimental reproducibility is endangered by the continued use of these undisclosed products. We propose the introduction of standard procedures and materials to produce specific and reproducible cellulase formulations. These formulations are to serve as yardsticks to measure improvements and performance of new cellulase formulations.

  20. Field assessment of balance in 10 to 14 year old children, reproducibility and validity of the Nintendo Wii board

    Science.gov (United States)

    2014-01-01

    Background Because body proportions in childhood are different to those in adulthood, children have a relatively higher centre of mass location. This biomechanical difference and the fact that children’s movements have not yet fully matured result in different sway performances in children and adults. When assessing static balance, it is essential to use objective, sensitive tools, and these types of measurement have previously been performed in laboratory settings. However, the emergence of technologies like the Nintendo Wii Board (NWB) might allow balance assessment in field settings. As the NWB has only been validated and tested for reproducibility in adults, the purpose of this study was to examine reproducibility and validity of the NWB in a field setting, in a population of children. Methods Fifty-four 10–14 year-olds from the CHAMPS-Study DK performed four different balance tests: bilateral stance with eyes open (1), unilateral stance on dominant (2) and non-dominant leg (3) with eyes open, and bilateral stance with eyes closed (4). Three rounds of the four tests were completed with the NWB and with a force platform (AMTI). To assess reproducibility, an intra-day test-retest design was applied with a two-hour break between sessions. Results Bland-Altman plots supplemented by Minimum Detectable Change (MDC) and concordance correlation coefficient (CCC) demonstrated satisfactory reproducibility for the NWB and the AMTI (MDC: 26.3-28.2%, CCC: 0.76-0.86) using Centre Of Pressure path Length as measurement parameter. Bland-Altman plots demonstrated satisfactory concurrent validity between the NWB and the AMTI, supplemented by satisfactory CCC in all tests (CCC: 0.74-0.87). The ranges of the limits of agreement in the validity study were comparable to the limits of agreement of the reproducibility study. Conclusion Both NWB and AMTI have satisfactory reproducibility for testing static balance in a population of children. Concurrent validity of NWB compared

  1. Development of the Japanese version of the Council on Nutrition Appetite Questionnaire and its simplified versions, and evaluation of their reliability, validity, and reproducibility.

    Science.gov (United States)

    Tokudome, Yuko; Okumura, Keiko; Kumagai, Yoshiko; Hirano, Hirohiko; Kim, Hunkyung; Morishita, Shiho; Watanabe, Yutaka

    2017-11-01

    Because few Japanese questionnaires assess the elderly's appetite, there is an urgent need to develop an appetite questionnaire with verified reliability, validity, and reproducibility. We translated and back-translated the Council on Nutrition Appetite Questionnaire (CNAQ), which has eight items, into Japanese (CNAQ-J), as well as the Simplified Nutritional Appetite Questionnaire (SNAQ-J), which includes four CNAQ-J-derived items. Using structural equation modeling, we examined the CNAQ-J structure based on data of 649 Japanese elderly people in 2013, including individuals having a certain degree of cognitive impairment, and we developed the SNAQ for the Japanese elderly (SNAQ-JE) according to an exploratory factor analysis. Confirmatory factor analyses on the appetite questionnaires were conducted to probe fitting to the model. We computed Cronbach's α coefficients and criterion-referenced/-related validity figures examining associations of the three appetite battery scores with body mass index (BMI) values and with nutrition-related questionnaire values. Test-retest reproducibility of appetite tools was scrutinized over an approximately 2-week interval. An exploratory factor analysis demonstrated that the CNAQ-J was constructed of one factor (appetite), yielding the SNAQ-JE, which includes four questions derived from the CNAQ-J. The three appetite instruments showed almost equivalent fitting to the model and reproducibility. The CNAQ-J and SNAQ-JE demonstrated satisfactory reliability and significant criterion-referenced/-related validity values, including BMIs, but the SNAQ-J included a low factor-loading item, exhibited less satisfactory reliability and had a non-significant relationship to BMI. The CNAQ-J and SNAQ-JE may be applied to assess the appetite of Japanese elderly, including persons with some cognitive impairment. Copyright © 2017 The Authors. Production and hosting by Elsevier B.V. All rights reserved.

  2. Reproducibility of somatosensory spatial perceptual maps.

    Science.gov (United States)

    Steenbergen, Peter; Buitenweg, Jan R; Trojan, Jörg; Veltink, Peter H

    2013-02-01

    Various studies have shown subjects to mislocalize cutaneous stimuli in an idiosyncratic manner. Spatial properties of individual localization behavior can be represented in the form of perceptual maps. Individual differences in these maps may reflect properties of internal body representations, and perceptual maps may therefore be a useful method for studying these representations. For this to be the case, individual perceptual maps need to be reproducible, which has not yet been demonstrated. We assessed the reproducibility of localizations measured twice on subsequent days. Ten subjects participated in the experiments. Non-painful electrocutaneous stimuli were applied at seven sites on the lower arm. Subjects localized the stimuli on a photograph of their own arm, which was presented on a tablet screen overlaying the real arm. Reproducibility was assessed by calculating intraclass correlation coefficients (ICC) for the mean localizations of each electrode site and the slope and offset of regression models of the localizations, which represent scaling and displacement of perceptual maps relative to the stimulated sites. The ICCs of the mean localizations ranged from 0.68 to 0.93; the ICCs of the regression parameters were 0.88 for the intercept and 0.92 for the slope. These results indicate a high degree of reproducibility. We conclude that localization patterns of non-painful electrocutaneous stimuli on the arm are reproducible on subsequent days. Reproducibility is a necessary property of perceptual maps for these to reflect properties of a subject's internal body representations. Perceptual maps are therefore a promising method for studying body representations.

  3. [Natural head position's reproducibility on photographs].

    Science.gov (United States)

    Eddo, Marie-Line; El Hayeck, Émilie; Hoyeck, Maha; Khoury, Élie; Ghoubril, Joseph

    2017-12-01

    The purpose of this study is to evaluate the reproducibility of natural head position with time on profile photographs. Our sample is composed of 96 students (20-30 years old) at the department of dentistry of Saint Joseph University in Beirut. Two profile photographs were taken in natural head position about a week apart. No significant differences were found between T0 and T1 (E = 1.065°). Many studies confirmed this reproducibility with time. Natural head position can be adopted as an orientation for profile photographs in orthodontics. © EDP Sciences, SFODF, 2017.

  4. Highly reproducible polyol synthesis for silver nanocubes

    Science.gov (United States)

    Han, Hye Ji; Yu, Taekyung; Kim, Woo-Sik; Im, Sang Hyuk

    2017-07-01

    We could synthesize the Ag nanocubes highly reproducibly by conducting the polyol synthesis using HCl etchant in dark condition because the photodecomposition/photoreduction of AgCl nanoparticles formed at initial reaction stage were greatly depressed and consequently the selective self-nucleation of Ag single crystals and their selective growth reaction could be promoted. Whereas the reproducibility of the formation of Ag nanocubes were very poor when we synthesize the Ag nanocubes in light condition due to the photoreduction of AgCl to Ag.

  5. Reproducible statistical analysis with multiple languages

    DEFF Research Database (Denmark)

    Lenth, Russell; Højsgaard, Søren

    2011-01-01

    This paper describes the system for making reproducible statistical analyses. differs from other systems for reproducible analysis in several ways. The two main differences are: (1) Several statistics programs can be in used in the same document. (2) Documents can be prepared using OpenOffice or ......Office or \\LaTeX. The main part of this paper is an example showing how to use and together in an OpenOffice text document. The paper also contains some practical considerations on the use of literate programming in statistics....

  6. Left ventricular volume measurements with free breathing respiratory self-gated 3-dimensional golden angle radial whole-heart cine imaging - Feasibility and reproducibility.

    Science.gov (United States)

    Holst, Karen; Ugander, Martin; Sigfridsson, Andreas

    2017-11-01

    To develop and evaluate a free breathing respiratory self-gated isotropic resolution technique for left ventricular (LV) volume measurements. A 3D radial trajectory with double golden-angle ordering was used for free-running data acquisition during free breathing in 9 healthy volunteers. A respiratory self-gating signal was extracted from the center of k-space and used with the electrocardiogram to bin all data into 3 respiratory and 25 cardiac phases. 3D image volumes were reconstructed and the LV endocardial border was segmented. LV volume measurements and reproducibility from 3D free breathing cine were compared to conventional 2D breath-held cine. No difference was found between 3D free breathing cine and 2D breath-held cine with regards to LV ejection fraction, stroke volume, end-systolic volume and end-diastolic volume (Pcine and 2D breath-held cine (Pcine and conventional 2D breath-held cine showed similar values and test-retest repeatability for LV volumes in healthy volunteers. 3D free breathing cine enabled retrospective sorting and arbitrary angulation of isotropic data, and could correctly measure LV volumes during free breathing acquisition. Copyright © 2017 Elsevier Inc. All rights reserved.

  7. Reproducing kernel Hilbert spaces of Gaussian priors

    NARCIS (Netherlands)

    Vaart, van der A.W.; Zanten, van J.H.; Clarke, B.; Ghosal, S.

    2008-01-01

    We review definitions and properties of reproducing kernel Hilbert spaces attached to Gaussian variables and processes, with a view to applications in nonparametric Bayesian statistics using Gaussian priors. The rate of contraction of posterior distributions based on Gaussian priors can be described

  8. Reproducibility of the results in ultrasonic testing

    International Nuclear Information System (INIS)

    Chalaye, M.; Launay, J.P.; Thomas, A.

    1980-12-01

    This memorandum reports on the conclusions of the tests carried out in order to evaluate the reproducibility of ultrasonic tests made on welded joints. FRAMATOME have started a study to assess the dispersion of results afforded by the test line and to characterize its behaviour. The tests covered sensors and ultrasonic generators said to be identical to each other (same commercial batch) [fr

  9. Reproducibility in Computational Neuroscience Models and Simulations

    Science.gov (United States)

    McDougal, Robert A.; Bulanova, Anna S.; Lytton, William W.

    2016-01-01

    Objective Like all scientific research, computational neuroscience research must be reproducible. Big data science, including simulation research, cannot depend exclusively on journal articles as the method to provide the sharing and transparency required for reproducibility. Methods Ensuring model reproducibility requires the use of multiple standard software practices and tools, including version control, strong commenting and documentation, and code modularity. Results Building on these standard practices, model sharing sites and tools have been developed that fit into several categories: 1. standardized neural simulators, 2. shared computational resources, 3. declarative model descriptors, ontologies and standardized annotations; 4. model sharing repositories and sharing standards. Conclusion A number of complementary innovations have been proposed to enhance sharing, transparency and reproducibility. The individual user can be encouraged to make use of version control, commenting, documentation and modularity in development of models. The community can help by requiring model sharing as a condition of publication and funding. Significance Model management will become increasingly important as multiscale models become larger, more detailed and correspondingly more difficult to manage by any single investigator or single laboratory. Additional big data management complexity will come as the models become more useful in interpreting experiments, thus increasing the need to ensure clear alignment between modeling data, both parameters and results, and experiment. PMID:27046845

  10. Estimating the reproducibility of psychological science

    NARCIS (Netherlands)

    Aarts, Alexander A.; Anderson, Joanna E.; Anderson, Christopher J.; Attridge, Peter R.; Attwood, Angela; Axt, Jordan; Babel, Molly; Bahnik, Stepan; Baranski, Erica; Barnett-Cowan, Michael; Bartmess, Elizabeth; Beer, Jennifer; Bell, Raoul; Bentley, Heather; Beyan, Leah; Binion, Grace; Borsboom, Denny; Bosch, Annick; Bosco, Frank A.; Bowman, Sara D.; Brandt, Mark J.; Braswell, Erin; Brohmer, Hilmar; Brown, Benjamin T.; Brown, Kristina; Bruening, Jovita; Calhoun-Sauls, Ann; Chagnon, Elizabeth; Callahan, Shannon P.; Chandler, Jesse; Chartier, Christopher R.; Cheung, Felix; Cillessen, Linda; Christopherson, Cody D.; Clay, Russ; Cleary, Hayley; Cloud, Mark D.; Cohn, Michael; Cohoon, Johanna; Columbus, Simon; Cordes, Andreas; Costantini, Giulio; Hartgerink, Chris; Krijnen, Job; Nuijten, Michele B.; van 't Veer, Anna E.; Van Aert, Robbie; van Assen, M.A.L.M.; Wissink, Joeri; Zeelenberg, Marcel

    2015-01-01

    INTRODUCTION Reproducibility is a defining feature of science, but the extent to which it characterizes current research is unknown. Scientific claims should not gain credence because of the status or authority of their originator but by the replicability of their supporting evidence. Even research

  11. Reproducibility, Controllability, and Optimization of Lenr Experiments

    Science.gov (United States)

    Nagel, David J.

    2006-02-01

    Low-energy nuclear reaction (LENR) measurements are significantly and increasingly reproducible. Practical control of the production of energy or materials by LENR has yet to be demonstrated. Minimization of costly inputs and maximization of desired outputs of LENR remain for future developments.

  12. Estimating the reproducibility of psychological science

    NARCIS (Netherlands)

    Anderson, Joanna E.; Aarts, Alexander A.; Anderson, Christopher J.; Attridge, Peter R.; Attwood, Angela; Axt, Jordan; Babel, Molly; Bahník, Štěpán; Baranski, Erica; Barnett-Cowan, Michael; Bartmess, Elizabeth; Beer, Jennifer; Bell, Raoul; Bentley, Heather; Beyan, Leah; Binion, Grace; Borsboom, Denny; Bosch, Annick; Bosco, Frank A.; Bowman, Sara D.; Brandt, Mark J.; Braswell, Erin; Brohmer, Hilmar; Brown, Benjamin T.; Brown, Kristina; Brüning, Jovita; Calhoun-Sauls, Ann; Callahan, Shannon P.; Chagnon, Elizabeth; Chandler, Jesse; Chartier, Christopher R.; Cheung, Felix; Christopherson, Cody D.; Cillessen, Linda; Clay, Russ; Cleary, Hayley; Cloud, Mark D.; Conn, Michael; Cohoon, Johanna; Columbus, Simon; Cordes, Andreas; Costantini, Giulio; Alvarez, Leslie D Cramblet; Cremata, Ed; Crusius, Jan; DeCoster, Jamie; DeGaetano, Michelle A.; Penna, Nicolás Delia; Den Bezemer, Bobby; Deserno, Marie K.; Devitt, Olivia; Dewitte, Laura; Dobolyi, David G.; Dodson, Geneva T.; Donnellan, M. Brent; Donohue, Ryan; Dore, Rebecca A.; Dorrough, Angela; Dreber, Anna; Dugas, Michelle; Dunn, Elizabeth W.; Easey, Kayleigh; Eboigbe, Sylvia; Eggleston, Casey; Embley, Jo; Epskamp, Sacha; Errington, Timothy M.; Estel, Vivien; Farach, Frank J.; Feather, Jenelle; Fedor, Anna; Fernández-Castilla, Belén; Fiedler, Susann; Field, James G.; Fitneva, Stanka A.; Flagan, Taru; Forest, Amanda L.; Forsell, Eskil; Foster, Joshua D.; Frank, Michael C.; Frazier, Rebecca S.; Fuchs, Heather; Gable, Philip; Galak, Jeff; Galliani, Elisa Maria; Gampa, Anup; Garcia, Sara; Gazarian, Douglas; Gilbert, Elizabeth; Giner-Sorolla, Roger; Glöckner, Andreas; Goellner, Lars; Goh, Jin X.; Goldberg, Rebecca; Goodbourn, Patrick T.; Gordon-McKeon, Shauna; Gorges, Bryan; Gorges, Jessie; Goss, Justin; Graham, Jesse; Grange, James A.; Gray, Jeremy; Hartgerink, Chris; Hartshorne, Joshua; Hasselman, Fred; Hayes, Timothy; Heikensten, Emma; Henninger, Felix; Hodsoll, John; Holubar, Taylor; Hoogendoorn, Gea; Humphries, Denise J.; Hung, Cathy O Y; Immelman, Nathali; Irsik, Vanessa C.; Jahn, Georg; Jäkel, Frank; Jekel, Marc; Johannesson, Magnus; Johnson, Larissa G.; Johnson, David J.; Johnson, Kate M.; Johnston, William J.; Jonas, Kai; Joy-Gaba, Jennifer A.; Kappes, Heather Barry; Kelso, Kim; Kidwell, Mallory C.; Kim, Seung Kyung; Kirkhart, Matthew; Kleinberg, Bennett; Knežević, Goran; Kolorz, Franziska Maria; Kossakowski, Jolanda J.; Krause, Robert Wilhelm; Krijnen, Job; Kuhlmann, Tim; Kunkels, Yoram K.; Kyc, Megan M.; Lai, Calvin K.; Laique, Aamir; Lakens, Daniël|info:eu-repo/dai/nl/298811855; Lane, Kristin A.; Lassetter, Bethany; Lazarević, Ljiljana B.; Le Bel, Etienne P.; Lee, Key Jung; Lee, Minha; Lemm, Kristi; Levitan, Carmel A.; Lewis, Melissa; Lin, Lin; Lin, Stephanie; Lippold, Matthias; Loureiro, Darren; Luteijn, Ilse; MacKinnon, Sean; Mainard, Heather N.; Marigold, Denise C.; Martin, Daniel P.; Martinez, Tylar; Masicampo, E. J.; Matacotta, Josh; Mathur, Maya; May, Michael; Mechin, Nicole; Mehta, Pranjal; Meixner, Johannes; Melinger, Alissa; Miller, Jeremy K.; Miller, Mallorie; Moore, Katherine; Möschl, Marcus; Motyl, Matt; Müller, Stephanie M.; Munafo, Marcus; Neijenhuijs, Koen I.; Nervi, Taylor; Nicolas, Gandalf; Nilsonne, Gustav; Nosek, Brian A.; Nuijten, Michèle B.; Olsson, Catherine; Osborne, Colleen; Ostkamp, Lutz; Pavel, Misha; Penton-Voak, Ian S.; Perna, Olivia; Pernet, Cyril; Perugini, Marco; Pipitone, R. Nathan; Pitts, Michael; Plessow, Franziska; Prenoveau, Jason M.; Rahal, Rima Maria; Ratliff, Kate A.; Reinhard, David; Renkewitz, Frank; Ricker, Ashley A.; Rigney, Anastasia; Rivers, Andrew M.; Roebke, Mark; Rutchick, Abraham M.; Ryan, Robert S.; Sahin, Onur; Saide, Anondah; Sandstrom, Gillian M.; Santos, David; Saxe, Rebecca; Schlegelmilch, René; Schmidt, Kathleen; Scholz, Sabine; Seibel, Larissa; Selterman, Dylan Faulkner; Shaki, Samuel; Simpson, William B.; Sinclair, H. Colleen; Skorinko, Jeanine L M; Slowik, Agnieszka; Snyder, Joel S.; Soderberg, Courtney; Sonnleitner, Carina; Spencer, Nick; Spies, Jeffrey R.; Steegen, Sara; Stieger, Stefan; Strohminger, Nina; Sullivan, Gavin B.; Talhelm, Thomas; Tapia, Megan; Te Dorsthorst, Anniek; Thomae, Manuela; Thomas, Sarah L.; Tio, Pia; Traets, Frits; Tsang, Steve; Tuerlinckx, Francis; Turchan, Paul; Valášek, Milan; Van't Veer, Anna E.; Van Aert, Robbie; Van Assen, Marcel|info:eu-repo/dai/nl/407629971; Van Bork, Riet; Van De Ven, Mathijs; Van Den Bergh, Don; Van Der Hulst, Marije; Van Dooren, Roel; Van Doorn, Johnny; Van Renswoude, Daan R.; Van Rijn, Hedderik; Vanpaemel, Wolf; Echeverría, Alejandro Vásquez; Vazquez, Melissa; Velez, Natalia; Vermue, Marieke; Verschoor, Mark; Vianello, Michelangelo; Voracek, Martin; Vuu, Gina; Wagenmakers, Eric Jan; Weerdmeester, Joanneke; Welsh, Ashlee; Westgate, Erin C.; Wissink, Joeri; Wood, Michael; Woods, Andy; Wright, Emily; Wu, Sining; Zeelenberg, Marcel; Zuni, Kellylynn

    2015-01-01

    Reproducibility is a defining feature of science, but the extent to which it characterizes current research is unknown. We conducted replications of 100 experimental and correlational studies published in three psychology journals using high-powered designs and original materials when available.

  13. ITK: Enabling Reproducible Research and Open Science

    Directory of Open Access Journals (Sweden)

    Matthew Michael McCormick

    2014-02-01

    Full Text Available Reproducibility verification is essential to the practice of the scientific method. Researchers report their findings, which are strengthened as other independent groups in the scientific community share similar outcomes. In the many scientific fields where software has become a fundamental tool for capturing and analyzing data, this requirement of reproducibility implies that reliable and comprehensive software platforms and tools should be made available to the scientific community. The tools will empower them and the public to verify, through practice, the reproducibility of observations that are reported in the scientific literature.Medical image analysis is one of the fields in which the use of computational resources, both software and hardware, are an essential platform for performing experimental work. In this arena, the introduction of the Insight Toolkit (ITK in 1999 has transformed the field and facilitates its progress by accelerating the rate at which algorithmic implementations are developed, tested, disseminated and improved. By building on the efficiency and quality of open source methodologies, ITK has provided the medical image community with an effective platform on which to build a daily workflow that incorporates the true scientific practices of reproducibility verification.This article describes the multiple tools, methodologies, and practices that the ITK community has adopted, refined, and followed during the past decade, in order to become one of the research communities with the most modern reproducibility verification infrastructure. For example, 207 contributors have created over 2400 unit tests that provide over 84% code line test coverage. The Insight Journal, an open publication journal associated with the toolkit, has seen over 360,000 publication downloads. The median normalized closeness centrality, a measure of knowledge flow, resulting from the distributed peer code review system was high, 0.46.

  14. A PHYSICAL ACTIVITY QUESTIONNAIRE: REPRODUCIBILITY AND VALIDITY

    Directory of Open Access Journals (Sweden)

    Nicolas Barbosa

    2007-12-01

    Full Text Available This study evaluates the Quantification de L'Activite Physique en Altitude chez les Enfants (QAPACE supervised self-administered questionnaire reproducibility and validity on the estimation of the mean daily energy expenditure (DEE on Bogotá's schoolchildren. The comprehension was assessed on 324 students, whereas the reproducibility was studied on a different random sample of 162 who were exposed twice to it. Reproducibility was assessed using both the Bland-Altman plot and the intra-class correlation coefficient (ICC. The validity was studied in a sample of 18 girls and 18 boys randomly selected, which completed the test - re-test study. The DEE derived from the questionnaire was compared with the laboratory measurement results of the peak oxygen uptake (Peak VO2 from ergo-spirometry and Leger Test. The reproducibility ICC was 0.96 (95% C.I. 0.95-0.97; by age categories 8-10, 0.94 (0.89-0. 97; 11-13, 0.98 (0.96- 0.99; 14-16, 0.95 (0.91-0.98. The ICC between mean TEE as estimated by the questionnaire and the direct and indirect Peak VO2 was 0.76 (0.66 (p<0.01; by age categories, 8-10, 11-13, and 14-16 were 0.89 (0.87, 0.76 (0.78 and 0.88 (0.80 respectively. The QAPACE questionnaire is reproducible and valid for estimating PA and showed a high correlation with the Peak VO2 uptake

  15. Does systematic variation improve the reproducibility of animal experiments?

    NARCIS (Netherlands)

    Jonker, R.M.; Guenther, A.; Engqvist, L.; Schmoll, T.

    2013-01-01

    Reproducibility of results is a fundamental tenet of science. In this journal, Richter et al.1 tested whether systematic variation in experimental conditions (heterogenization) affects the reproducibility of results. Comparing this approach with the current standard of ensuring reproducibility

  16. Reproducibility of scoring emphysema by HRCT

    International Nuclear Information System (INIS)

    Malinen, A.; Partanen, K.; Rytkoenen, H.; Vanninen, R.; Erkinjuntti-Pekkanen, R.

    2002-01-01

    Purpose: We evaluated the reproducibility of three visual scoring methods of emphysema and compared these methods with pulmonary function tests (VC, DLCO, FEV1 and FEV%) among farmer's lung patients and farmers. Material and Methods: Three radiologists examined high-resolution CT images of farmer's lung patients and their matched controls (n=70) for chronic interstitial lung diseases. Intraobserver reproducibility and interobserver variability were assessed for three methods: severity, Sanders' (extent) and Sakai. Pulmonary function tests as spirometry and diffusing capacity were measured. Results: Intraobserver -values for all three methods were good (0.51-0.74). Interobserver varied from 0.35 to 0.72. The Sanders' and the severity methods correlated strongly with pulmonary function tests, especially DLCO and FEV1. Conclusion: The Sanders' method proved to be reliable in evaluating emphysema, in terms of good consistency of interpretation and good correlation with pulmonary function tests

  17. Reproducibility of scoring emphysema by HRCT

    Energy Technology Data Exchange (ETDEWEB)

    Malinen, A.; Partanen, K.; Rytkoenen, H.; Vanninen, R. [Kuopio Univ. Hospital (Finland). Dept. of Clinical Radiology; Erkinjuntti-Pekkanen, R. [Kuopio Univ. Hospital (Finland). Dept. of Pulmonary Diseases

    2002-04-01

    Purpose: We evaluated the reproducibility of three visual scoring methods of emphysema and compared these methods with pulmonary function tests (VC, DLCO, FEV1 and FEV%) among farmer's lung patients and farmers. Material and Methods: Three radiologists examined high-resolution CT images of farmer's lung patients and their matched controls (n=70) for chronic interstitial lung diseases. Intraobserver reproducibility and interobserver variability were assessed for three methods: severity, Sanders' (extent) and Sakai. Pulmonary function tests as spirometry and diffusing capacity were measured. Results: Intraobserver -values for all three methods were good (0.51-0.74). Interobserver varied from 0.35 to 0.72. The Sanders' and the severity methods correlated strongly with pulmonary function tests, especially DLCO and FEV1. Conclusion: The Sanders' method proved to be reliable in evaluating emphysema, in terms of good consistency of interpretation and good correlation with pulmonary function tests.

  18. Reproducibility of the chamber scarification test

    DEFF Research Database (Denmark)

    Andersen, Klaus Ejner

    1996-01-01

    The chamber scarification test is a predictive human skin irritation test developed to rank the irritation potential of products and ingredients meant for repeated use on normal and diseased skin. 12 products or ingredients can be tested simultaneously on the forearm skin of each volunteer....... The test combines with the procedure scratching of the skin at each test site and subsequent closed patch tests with the products, repeated daily for 3 days. The test is performed on groups of human volunteers: a skin irritant substance or products is included in each test as a positive control...... high reproducibility of the test. Further, intra-individual variation in skin reaction to the 2 control products in 26 volunteers, who participated 2x, is shown, which supports the conclusion that the chamber scarification test is a useful short-term human skin irritation test with high reproducibility....

  19. Additive Manufacturing: Reproducibility of Metallic Parts

    Directory of Open Access Journals (Sweden)

    Konda Gokuldoss Prashanth

    2017-02-01

    Full Text Available The present study deals with the properties of five different metals/alloys (Al-12Si, Cu-10Sn and 316L—face centered cubic structure, CoCrMo and commercially pure Ti (CP-Ti—hexagonal closed packed structure fabricated by selective laser melting. The room temperature tensile properties of Al-12Si samples show good consistency in results within the experimental errors. Similar reproducible results were observed for sliding wear and corrosion experiments. The other metal/alloy systems also show repeatable tensile properties, with the tensile curves overlapping until the yield point. The curves may then follow the same path or show a marginal deviation (~10 MPa until they reach the ultimate tensile strength and a negligible difference in ductility levels (of ~0.3% is observed between the samples. The results show that selective laser melting is a reliable fabrication method to produce metallic materials with consistent and reproducible properties.

  20. Reproducibility in cyclostratigraphy: initiating an intercomparison project

    Science.gov (United States)

    Sinnesael, Matthias; De Vleeschouwer, David; Zeeden, Christian; Claeys, Philippe

    2017-04-01

    The study of astronomical climate forcing and the application of cyclostratigraphy have experienced a spectacular growth over the last decades. In the field of cyclostratigraphy a broad range in methodological approaches exist. However, comparative study between the different approaches is lacking. Different cases demand different approaches, but with the growing importance of the field, questions arise about reproducibility, uncertainties and standardization of results. The radioisotopic dating community, in particular, has done far-reaching efforts to improve reproducibility and intercomparison of radioisotopic dates and their errors. To satisfy this need in cyclostratigraphy, we initiate a comparable framework for the community. The aims are to investigate and quantify reproducibility of, and uncertainties related to cyclostratigraphic studies and to provide a platform to discuss the merits and pitfalls of different methodologies, and their applicabilities. With this poster, we ask the feedback from the community on how to design this comparative framework in a useful, meaningful and productive manner. In parallel, we would like to discuss how reproducibility should be tested and what uncertainties should stand for in cyclostratigraphy. On the other hand, we intend to trigger interest for a cyclostratigraphic intercomparison project. This intercomparison project would imply the analysis of artificial and genuine geological records by individual researchers. All participants would be free to determine their method of choice. However, a handful of criterions will be required for an outcome to be comparable. The different results would be compared (e.g. during a workshop or a special session), and the lessons learned from the comparison could potentially be reported in a review paper. The aim of an intercomparison project is not to rank the different methods according to their merits, but to get insight into which specific methods are most suitable for which

  1. A how to guide to reproducible research

    OpenAIRE

    Whitaker, Kirstie

    2018-01-01

    This talk will discuss the perceived and actual barriers experienced by researchers attempting to do reproducible research, and give practical guidance on how they can be overcome. It will include suggestions on how to make your code and data available and usable for others (including a strong suggestion to document both clearly so you don't have to reply to lots of email questions from future users). Specifically it will include a brief guide to version control, collaboration and disseminati...

  2. Bad Behavior: Improving Reproducibility in Behavior Testing.

    Science.gov (United States)

    Andrews, Anne M; Cheng, Xinyi; Altieri, Stefanie C; Yang, Hongyan

    2018-01-24

    Systems neuroscience research is increasingly possible through the use of integrated molecular and circuit-level analyses. These studies depend on the use of animal models and, in many cases, molecular and circuit-level analyses. Associated with genetic, pharmacologic, epigenetic, and other types of environmental manipulations. We illustrate typical pitfalls resulting from poor validation of behavior tests. We describe experimental designs and enumerate controls needed to improve reproducibility in investigating and reporting of behavioral phenotypes.

  3. A Framework for Reproducible Latent Fingerprint Enhancements.

    Science.gov (United States)

    Carasso, Alfred S

    2014-01-01

    Photoshop processing of latent fingerprints is the preferred methodology among law enforcement forensic experts, but that appproach is not fully reproducible and may lead to questionable enhancements. Alternative, independent, fully reproducible enhancements, using IDL Histogram Equalization and IDL Adaptive Histogram Equalization, can produce better-defined ridge structures, along with considerable background information. Applying a systematic slow motion smoothing procedure to such IDL enhancements, based on the rapid FFT solution of a Lévy stable fractional diffusion equation, can attenuate background detail while preserving ridge information. The resulting smoothed latent print enhancements are comparable to, but distinct from, forensic Photoshop images suitable for input into automated fingerprint identification systems, (AFIS). In addition, this progressive smoothing procedure can be reexamined by displaying the suite of progressively smoother IDL images. That suite can be stored, providing an audit trail that allows monitoring for possible loss of useful information, in transit to the user-selected optimal image. Such independent and fully reproducible enhancements provide a valuable frame of reference that may be helpful in informing, complementing, and possibly validating the forensic Photoshop methodology.

  4. Reproducibility of 201Tl myocardial imaging

    International Nuclear Information System (INIS)

    McLaughlin, P.R.; Martin, R.P.; Doherty, P.; Daspit, S.; Goris, M.; Haskell, W.; Lewis, S.; Kriss, J.P.; Harrison, D.C.

    1977-01-01

    Seventy-six thallium-201 myocardial perfusion studies were performed on twenty-five patients to assess their reproducibility and the effect of varying the level of exercise on the results of imaging. Each patient had a thallium-201 study at rest. Fourteen patients had studies on two occasions at maximum exercise, and twelve patients had studies both at light and at maximum exercise. Of 70 segments in the 14 patients assessed on each of two maximum exercise tests, 64 (91 percent) were reproducible. Only 53 percent (16/30) of the ischemic defects present at maximum exercise were seen in the light exercise study in the 12 patients assessed at two levels of exercise. Correlation of perfusion defects with arteriographically proven significant coronary stenosis was good for the left anterior descending and right coronary arteries, but not as good for circumflex artery disease. Thallium-201 myocardial imaging at maximum exercise is reproducible within acceptable limits, but careful attention to exercise technique is essential for valid comparative studies

  5. Standing Together for Reproducibility in Large-Scale Computing: Report on reproducibility@XSEDE

    OpenAIRE

    James, Doug; Wilkins-Diehr, Nancy; Stodden, Victoria; Colbry, Dirk; Rosales, Carlos; Fahey, Mark; Shi, Justin; Silva, Rafael F.; Lee, Kyo; Roskies, Ralph; Loewe, Laurence; Lindsey, Susan; Kooper, Rob; Barba, Lorena; Bailey, David

    2014-01-01

    This is the final report on reproducibility@xsede, a one-day workshop held in conjunction with XSEDE14, the annual conference of the Extreme Science and Engineering Discovery Environment (XSEDE). The workshop's discussion-oriented agenda focused on reproducibility in large-scale computational research. Two important themes capture the spirit of the workshop submissions and discussions: (1) organizational stakeholders, especially supercomputer centers, are in a unique position to promote, enab...

  6. Online dietary intake estimation: reproducibility and validity of the Food4Me food frequency questionnaire against a 4-day weighed food record.

    Science.gov (United States)

    Fallaize, Rosalind; Forster, Hannah; Macready, Anna L; Walsh, Marianne C; Mathers, John C; Brennan, Lorraine; Gibney, Eileen R; Gibney, Michael J; Lovegrove, Julie A

    2014-08-11

    Advances in nutritional assessment are continuing to embrace developments in computer technology. The online Food4Me food frequency questionnaire (FFQ) was created as an electronic system for the collection of nutrient intake data. To ensure its accuracy in assessing both nutrient and food group intake, further validation against data obtained using a reliable, but independent, instrument and assessment of its reproducibility are required. The aim was to assess the reproducibility and validity of the Food4Me FFQ against a 4-day weighed food record (WFR). Reproducibility of the Food4Me FFQ was assessed using test-retest methodology by asking participants to complete the FFQ on 2 occasions 4 weeks apart. To assess the validity of the Food4Me FFQ against the 4-day WFR, half the participants were also asked to complete a 4-day WFR 1 week after the first administration of the Food4Me FFQ. Level of agreement between nutrient and food group intakes estimated by the repeated Food4Me FFQ and the Food4Me FFQ and 4-day WFR were evaluated using Bland-Altman methodology and classification into quartiles of daily intake. Crude unadjusted correlation coefficients were also calculated for nutrient and food group intakes. In total, 100 people participated in the assessment of reproducibility (mean age 32, SD 12 years), and 49 of these (mean age 27, SD 8 years) also took part in the assessment of validity. Crude unadjusted correlations for repeated Food4Me FFQ ranged from .65 (vitamin D) to .90 (alcohol). The mean cross-classification into "exact agreement plus adjacent" was 92% for both nutrient and food group intakes, and Bland-Altman plots showed good agreement for energy-adjusted macronutrient intakes. Agreement between the Food4Me FFQ and 4-day WFR varied, with crude unadjusted correlations ranging from .23 (vitamin D) to .65 (protein, % total energy) for nutrient intakes and .11 (soups, sauces and miscellaneous foods) to .73 (yogurts) for food group intake. The mean cross

  7. Convergence of macrostates under reproducible processes

    International Nuclear Information System (INIS)

    Rau, Jochen

    2010-01-01

    I show that whenever a system undergoes a reproducible macroscopic process the mutual distinguishability of macrostates, as measured by their relative entropy, diminishes. This extends the second law which regards only ordinary entropies, and hence only the distinguishability between macrostates and one specific reference state (equidistribution). The new result holds regardless of whether the process is linear or nonlinear. Its proof hinges on the monotonicity of quantum relative entropy under arbitrary coarse grainings, even those that cannot be represented by trace-preserving completely positive maps.

  8. Open and reproducible global land use classification

    Science.gov (United States)

    Nüst, Daniel; Václavík, Tomáš; Pross, Benjamin

    2015-04-01

    Researchers led by the Helmholtz Centre for Environmental research (UFZ) developed a new world map of land use systems based on over 30 diverse indicators (http://geoportal.glues.geo.tu-dresden.de/stories/landsystemarchetypes.html) of land use intensity, climate and environmental and socioeconomic factors. They identified twelve land system archetypes (LSA) using a data-driven classification algorithm (self-organizing maps) to assess global impacts of land use on the environment, and found unexpected similarities across global regions. We present how the algorithm behind this analysis can be published as an executable web process using 52°North WPS4R (https://wiki.52north.org/bin/view/Geostatistics/WPS4R) within the GLUES project (http://modul-a.nachhaltiges-landmanagement.de/en/scientific-coordination-glues/). WPS4R is an open source collaboration platform for researchers, analysts and software developers to publish R scripts (http://www.r-project.org/) as a geo-enabled OGC Web Processing Service (WPS) process. The interoperable interface to call the geoprocess allows both reproducibility of the analysis and integration of user data without knowledge about web services or classification algorithms. The open platform allows everybody to replicate the analysis in their own environments. The LSA WPS process has several input parameters, which can be changed via a simple web interface. The input parameters are used to configure both the WPS environment and the LSA algorithm itself. The encapsulation as a web process allows integration of non-public datasets, while at the same time the publication requires a well-defined documentation of the analysis. We demonstrate this platform specifically to domain scientists and show how reproducibility and open source publication of analyses can be enhanced. We also discuss future extensions of the reproducible land use classification, such as the possibility for users to enter their own areas of interest to the system and

  9. Reproducibility in Research: Systems, Infrastructure, Culture

    Directory of Open Access Journals (Sweden)

    Tom Crick

    2017-11-01

    Full Text Available The reproduction and replication of research results has become a major issue for a number of scientific disciplines. In computer science and related computational disciplines such as systems biology, the challenges closely revolve around the ability to implement (and exploit novel algorithms and models. Taking a new approach from the literature and applying it to a new codebase frequently requires local knowledge missing from the published manuscripts and transient project websites. Alongside this issue, benchmarking, and the lack of open, transparent and fair benchmark sets present another barrier to the verification and validation of claimed results. In this paper, we outline several recommendations to address these issues, driven by specific examples from a range of scientific domains. Based on these recommendations, we propose a high-level prototype open automated platform for scientific software development which effectively abstracts specific dependencies from the individual researcher and their workstation, allowing easy sharing and reproduction of results. This new e-infrastructure for reproducible computational science offers the potential to incentivise a culture change and drive the adoption of new techniques to improve the quality and efficiency – and thus reproducibility – of scientific exploration.

  10. PSYCHOLOGY. Estimating the reproducibility of psychological science.

    Science.gov (United States)

    2015-08-28

    Reproducibility is a defining feature of science, but the extent to which it characterizes current research is unknown. We conducted replications of 100 experimental and correlational studies published in three psychology journals using high-powered designs and original materials when available. Replication effects were half the magnitude of original effects, representing a substantial decline. Ninety-seven percent of original studies had statistically significant results. Thirty-six percent of replications had statistically significant results; 47% of original effect sizes were in the 95% confidence interval of the replication effect size; 39% of effects were subjectively rated to have replicated the original result; and if no bias in original results is assumed, combining original and replication results left 68% with statistically significant effects. Correlational tests suggest that replication success was better predicted by the strength of original evidence than by characteristics of the original and replication teams. Copyright © 2015, American Association for the Advancement of Science.

  11. Echo Particle Image Velocimetry for Estimation of Carotid Artery Wall Shear Stress: Repeatability, Reproducibility and Comparison with Phase-Contrast Magnetic Resonance Imaging.

    Science.gov (United States)

    Gurung, Arati; Gates, Phillip E; Mazzaro, Luciano; Fulford, Jonathan; Zhang, Fuxing; Barker, Alex J; Hertzberg, Jean; Aizawa, Kunihiko; Strain, William D; Elyas, Salim; Shore, Angela C; Shandas, Robin

    2017-08-01

    Measurement of hemodynamic wall shear stress (WSS) is important in investigating the role of WSS in the initiation and progression of atherosclerosis. Echo particle image velocimetry (echo PIV) is a novel ultrasound-based technique for measuring WSS in vivo that has previously been validated in vitro using the standard optical PIV technique. We evaluated the repeatability and reproducibility of echo PIV for measuring WSS in the human common carotid artery. We measured WSS in 28 healthy participants (18 males and 10 females, mean age: 56 ± 12 y). Echo PIV was highly repeatable, with an intra-observer variability of 1.0 ± 0.1 dyn/cm 2 for peak systolic (maximum), 0.9 dyn/cm 2 for mean and 0.5 dyn/cm 2 for end-diastolic (minimum) WSS measurements. Likewise, echo PIV was reproducible, with a low inter-observer variability (max: 2.0 ± 0.2 dyn/cm 2 , mean: 1.3 ± 0.1 dyn/cm 2 , end-diastolic: 0.7 dyn/cm 2 ) and more variable inter-scan (test-retest) variability (max: 7.1 ± 2.3 dyn/cm 2 , mean: 2.9 ± 0.4 dyn/cm 2 , min: 1.5 ± 0.1 dyn/cm 2 ). We compared echo PIV with the reference method, phase-contrast magnetic resonance imaging (PC-MRI); echo PIV-based WSS measurements agreed qualitatively with PC-MRI measurements (r = 0.89, p PIV vs. PC-MRI): WSS at peak systole: 21 ± 7.0 dyn/cm 2 vs. 15 ± 5.0 dyn/cm 2 ; time-averaged WSS: 8.9 ± 3.0 dyn/cm 2 vs. 7.1 ± 3.0 dyn/cm 2 (p  0.05). For the first time, we report that echo PIV can measure WSS with good repeatability and reproducibility in adult humans with a broad age range. Echo PIV is feasible in humans and offers an easy-to-use, ultrasound-based, quantitative technique for measuring WSS in vivo in humans with good repeatability and reproducibility. Copyright © 2017. Published by Elsevier Inc.

  12. Reproducibility of morphometric X-ray absorptiometry

    International Nuclear Information System (INIS)

    Culton, N.; Pocock, N.

    1999-01-01

    Full text: Morphometric X-ray absorptiometry (MXA) using DXA is potentially a useful clinical tool which may provide additional vertebral fracture information with low radiation exposure. While morphometric analysis is semi-automated, operator intervention is crucial for the accurate positioning of the six data points quantifying the vertebral heights at the anterior, middle and posterior positions. Our study evaluated intra-operator reproducibility of MXA in an elderly patient population and assessed the effect of training and experience on vertebral height precision. Ten patients, with a mean lumbar T score of - 2.07, were studied. Images were processed by a trained operator who had initially only limited morphometric experience. The analysis of the data files were repeated at 2 and 6 weeks, during which time the operator had obtained further experience and training. The intra-operator precision of vertebral height measurements was calculated using the three separate combinations of paired analyses, and expressed as the coefficient of variation. This study confirms the importance of adequate training and attention to detail in MXA analysis. The data indicate that the precision of MXA is adequate for its use in the diagnosis of vertebral fractures, based on a 20% deformity criteria. Use of MXA for monitoring would require approximately an 8% change in vertebral heights to achieve statistical significance

  13. Reproducibility of neuroimaging analyses across operating systems.

    Science.gov (United States)

    Glatard, Tristan; Lewis, Lindsay B; Ferreira da Silva, Rafael; Adalat, Reza; Beck, Natacha; Lepage, Claude; Rioux, Pierre; Rousseau, Marc-Etienne; Sherif, Tarek; Deelman, Ewa; Khalili-Mahani, Najmeh; Evans, Alan C

    2015-01-01

    Neuroimaging pipelines are known to generate different results depending on the computing platform where they are compiled and executed. We quantify these differences for brain tissue classification, fMRI analysis, and cortical thickness (CT) extraction, using three of the main neuroimaging packages (FSL, Freesurfer and CIVET) and different versions of GNU/Linux. We also identify some causes of these differences using library and system call interception. We find that these packages use mathematical functions based on single-precision floating-point arithmetic whose implementations in operating systems continue to evolve. While these differences have little or no impact on simple analysis pipelines such as brain extraction and cortical tissue classification, their accumulation creates important differences in longer pipelines such as subcortical tissue classification, fMRI analysis, and cortical thickness extraction. With FSL, most Dice coefficients between subcortical classifications obtained on different operating systems remain above 0.9, but values as low as 0.59 are observed. Independent component analyses (ICA) of fMRI data differ between operating systems in one third of the tested subjects, due to differences in motion correction. With Freesurfer and CIVET, in some brain regions we find an effect of build or operating system on cortical thickness. A first step to correct these reproducibility issues would be to use more precise representations of floating-point numbers in the critical sections of the pipelines. The numerical stability of pipelines should also be reviewed.

  14. Modeling reproducibility of porescale multiphase flow experiments

    Science.gov (United States)

    Ling, B.; Tartakovsky, A. M.; Bao, J.; Oostrom, M.; Battiato, I.

    2017-12-01

    Multi-phase flow in porous media is widely encountered in geological systems. Understanding immiscible fluid displacement is crucial for processes including, but not limited to, CO2 sequestration, non-aqueous phase liquid contamination and oil recovery. Microfluidic devices and porescale numerical models are commonly used to study multiphase flow in biological, geological, and engineered porous materials. In this work, we perform a set of drainage and imbibition experiments in six identical microfluidic cells to study the reproducibility of multiphase flow experiments. We observe significant variations in the experimental results, which are smaller during the drainage stage and larger during the imbibition stage. We demonstrate that these variations are due to sub-porescale geometry differences in microcells (because of manufacturing defects) and variations in the boundary condition (i.e.,fluctuations in the injection rate inherent to syringe pumps). Computational simulations are conducted using commercial software STAR-CCM+, both with constant and randomly varying injection rate. Stochastic simulations are able to capture variability in the experiments associated with the varying pump injection rate.

  15. Environment and industrial economy: Challenge of reproducibility

    International Nuclear Information System (INIS)

    Rullani, E.

    1992-01-01

    Historically and methodologically counterposed until now, the environmentalist and the economic approach to environmental problems need to be integrated in a new approach that considers, from one side, the relevance of the ecological equilibria for the economic systems and, from the other side, the economic dimension (in terms of investments and transformations in the production system) of any attempt to achieve a better environment. In order to achieve this integration, both approaches are compelled to give up some cultural habits that have characterized them, and have contributed to over-emphasize the opposition between them. The article shows that both approaches can converge into a new one, in which environment is no longer only an holistic, not bargainable, natural external limit to human activity (as in the environmentalist approach), nor simply a scarce and exhaustible resource (as economics tends to consider it); environment should instead become part of the reproducibility sphere, or, in other words, it must be regarded as part of the output that the economic system provides. This new approach, due to scientific and technological advances, is made possible for an increasing class of environmental problems. In order to do this, an evolution is required, that could be able to convert environmental goals into investment and technological innovation goals, and communicate to the firms the value society assigns to environmental resources. This value, the author suggests, should correspond to the reproduction cost. Various examples of this new approach are analyzed and discussed

  16. Regional Reproducibility of BOLD Calibration Parameter M, OEF and Resting-State CMRO2 Measurements with QUO2 MRI.

    Directory of Open Access Journals (Sweden)

    Isabelle Lajoie

    low contrast-to-noise ratio intrinsic to ASL. Reproducibility of the QUO2 derived estimates were computed, yielding a GM intra-subject reproducibility of 3.87% for O2 delivery, 16.8% for the M value, 13.6% for OEF and 15.2% for CMRO2. Although these results focus on the precision of the QUO2 method, rather than the accuracy, the information will be useful for calculation of statistical power in future validation studies and ultimately for research applications of the method. The higher test-retest variability for the more extensively modeled parameters (M, OEF, and CMRO2 highlights the need for further improvement of acquisition methods to reduce noise levels.

  17. Liquid scintigraphic gastric emptying - is it reproducible?

    International Nuclear Information System (INIS)

    Cooper, R.G.; Shuter, B.; Leach, M.; Roach, P.J.

    1999-01-01

    Full text: Radioisotope gastric emptying (GE) studies have been used as a non-invasive technique for motility assessment for many years. In a recent study investigating the correlation of mesenteric vascular changes with GE, six subjects had a repeat study 2-4 months later. Repeat studies were required due to minor technical problems (5 subjects) and a very slow GE (I subject) on the original study. Subjects drank 275 ml of 'Ensure Plus' mixed with 8 MBq 67 Ga-DTPA and were imaged for 2 h while lying supine. GE time-activity curves for each subject were generated and time to half emptying (T l/2 ) calculated. Five of the six subjects had more rapid GE on the second study. Three of the subjects had T l/2 values on their second study which were within ± 15 min of their original T l/2 . The other three subjects had T l/2 values on their second study which were 36 min, 55 min and 280 min (subject K.H.) less than their original T l/2 . Statistical analysis (t-test) was performed on paired T l/2 values. The average T l/2 value was greater in the first study than in the second (149 ± 121 and 86 ± 18 min respectively), although the difference was not statistically significant (P ∼ 0.1). Subjects' anxiety levels were not quantitated during the GE study; however, several major equipment faults occurred during the original study of subject K.H., who became visibly stressed. These results suggest that the reproducibility of GE studies may be influenced by psychological factors

  18. Is my network module preserved and reproducible?

    Directory of Open Access Journals (Sweden)

    Peter Langfelder

    2011-01-01

    Full Text Available In many applications, one is interested in determining which of the properties of a network module change across conditions. For example, to validate the existence of a module, it is desirable to show that it is reproducible (or preserved in an independent test network. Here we study several types of network preservation statistics that do not require a module assignment in the test network. We distinguish network preservation statistics by the type of the underlying network. Some preservation statistics are defined for a general network (defined by an adjacency matrix while others are only defined for a correlation network (constructed on the basis of pairwise correlations between numeric variables. Our applications show that the correlation structure facilitates the definition of particularly powerful module preservation statistics. We illustrate that evaluating module preservation is in general different from evaluating cluster preservation. We find that it is advantageous to aggregate multiple preservation statistics into summary preservation statistics. We illustrate the use of these methods in six gene co-expression network applications including 1 preservation of cholesterol biosynthesis pathway in mouse tissues, 2 comparison of human and chimpanzee brain networks, 3 preservation of selected KEGG pathways between human and chimpanzee brain networks, 4 sex differences in human cortical networks, 5 sex differences in mouse liver networks. While we find no evidence for sex specific modules in human cortical networks, we find that several human cortical modules are less preserved in chimpanzees. In particular, apoptosis genes are differentially co-expressed between humans and chimpanzees. Our simulation studies and applications show that module preservation statistics are useful for studying differences between the modular structure of networks. Data, R software and accompanying tutorials can be downloaded from the following webpage: http

  19. Reproducibility of graph metrics of human brain functional networks.

    Science.gov (United States)

    Deuker, Lorena; Bullmore, Edward T; Smith, Marie; Christensen, Soren; Nathan, Pradeep J; Rockstroh, Brigitte; Bassett, Danielle S

    2009-10-01

    Graph theory provides many metrics of complex network organization that can be applied to analysis of brain networks derived from neuroimaging data. Here we investigated the test-retest reliability of graph metrics of functional networks derived from magnetoencephalography (MEG) data recorded in two sessions from 16 healthy volunteers who were studied at rest and during performance of the n-back working memory task in each session. For each subject's data at each session, we used a wavelet filter to estimate the mutual information (MI) between each pair of MEG sensors in each of the classical frequency intervals from gamma to low delta in the overall range 1-60 Hz. Undirected binary graphs were generated by thresholding the MI matrix and 8 global network metrics were estimated: the clustering coefficient, path length, small-worldness, efficiency, cost-efficiency, assortativity, hierarchy, and synchronizability. Reliability of each graph metric was assessed using the intraclass correlation (ICC). Good reliability was demonstrated for most metrics applied to the n-back data (mean ICC=0.62). Reliability was greater for metrics in lower frequency networks. Higher frequency gamma- and beta-band networks were less reliable at a global level but demonstrated high reliability of nodal metrics in frontal and parietal regions. Performance of the n-back task was associated with greater reliability than measurements on resting state data. Task practice was also associated with greater reliability. Collectively these results suggest that graph metrics are sufficiently reliable to be considered for future longitudinal studies of functional brain network changes.

  20. Precision and reproducibility in AMS radiocarbon measurements.

    Energy Technology Data Exchange (ETDEWEB)

    Hotchkis, M A; Fink, D; Hua, Q; Jacobsen, G E; Lawson, E M; Smith, A M; Tuniz, C [Australian Nuclear Science and Technology Organisation, Lucas Heights, NSW (Australia)

    1997-12-31

    Accelerator Mass Spectrometry (AMS) is a technique by which rare radioisotopes such as {sup 14}C can be measured at environmental levels with high efficiency. Instead of detecting radioactivity, which is very weak for long-lived environmental radioisotopes, atoms are counted directly. The sample is placed in an ion source, from which a negative ion beam of the atoms of interest is extracted, mass analysed, and injected into a tandem accelerator. After stripping to positive charge states in the accelerator HV terminal, the ions are further accelerated, analysed with magnetic and electrostatic devices and counted in a detector. An isotopic ratio is derived from the number of radioisotope atoms counted in a given time and the beam current of a stable isotope of the same element, measured after the accelerator. For radiocarbon, {sup 14}C/{sup 13}C ratios are usually measured, and the ratio of an unknown sample is compared to that of a standard. The achievable precision for such ratio measurements is limited primarily by {sup 14}C counting statistics and also by a variety of factors related to accelerator and ion source stability. At the ANTARES AMS facility at Lucas Heights Research Laboratories we are currently able to measure {sup 14}C with 0.5% precision. In the two years since becoming operational, more than 1000 {sup 14}C samples have been measured. Recent improvements in precision for {sup 14}C have been achieved with the commissioning of a 59 sample ion source. The measurement system, from sample changing to data acquisition, is under common computer control. These developments have allowed a new regime of automated multi-sample processing which has impacted both on the system throughput and the measurement precision. We have developed data evaluation methods at ANTARES which cross-check the self-consistency of the statistical analysis of our data. Rigorous data evaluation is invaluable in assessing the true reproducibility of the measurement system and aids in

  1. Precision and reproducibility in AMS radiocarbon measurements.

    Energy Technology Data Exchange (ETDEWEB)

    Hotchkis, M.A.; Fink, D.; Hua, Q.; Jacobsen, G.E.; Lawson, E. M.; Smith, A.M.; Tuniz, C. [Australian Nuclear Science and Technology Organisation, Lucas Heights, NSW (Australia)

    1996-12-31

    Accelerator Mass Spectrometry (AMS) is a technique by which rare radioisotopes such as {sup 14}C can be measured at environmental levels with high efficiency. Instead of detecting radioactivity, which is very weak for long-lived environmental radioisotopes, atoms are counted directly. The sample is placed in an ion source, from which a negative ion beam of the atoms of interest is extracted, mass analysed, and injected into a tandem accelerator. After stripping to positive charge states in the accelerator HV terminal, the ions are further accelerated, analysed with magnetic and electrostatic devices and counted in a detector. An isotopic ratio is derived from the number of radioisotope atoms counted in a given time and the beam current of a stable isotope of the same element, measured after the accelerator. For radiocarbon, {sup 14}C/{sup 13}C ratios are usually measured, and the ratio of an unknown sample is compared to that of a standard. The achievable precision for such ratio measurements is limited primarily by {sup 14}C counting statistics and also by a variety of factors related to accelerator and ion source stability. At the ANTARES AMS facility at Lucas Heights Research Laboratories we are currently able to measure {sup 14}C with 0.5% precision. In the two years since becoming operational, more than 1000 {sup 14}C samples have been measured. Recent improvements in precision for {sup 14}C have been achieved with the commissioning of a 59 sample ion source. The measurement system, from sample changing to data acquisition, is under common computer control. These developments have allowed a new regime of automated multi-sample processing which has impacted both on the system throughput and the measurement precision. We have developed data evaluation methods at ANTARES which cross-check the self-consistency of the statistical analysis of our data. Rigorous data evaluation is invaluable in assessing the true reproducibility of the measurement system and aids in

  2. Guidelines for Reproducibly Building and Simulating Systems Biology Models.

    Science.gov (United States)

    Medley, J Kyle; Goldberg, Arthur P; Karr, Jonathan R

    2016-10-01

    Reproducibility is the cornerstone of the scientific method. However, currently, many systems biology models cannot easily be reproduced. This paper presents methods that address this problem. We analyzed the recent Mycoplasma genitalium whole-cell (WC) model to determine the requirements for reproducible modeling. We determined that reproducible modeling requires both repeatable model building and repeatable simulation. New standards and simulation software tools are needed to enhance and verify the reproducibility of modeling. New standards are needed to explicitly document every data source and assumption, and new deterministic parallel simulation tools are needed to quickly simulate large, complex models. We anticipate that these new standards and software will enable researchers to reproducibly build and simulate more complex models, including WC models.

  3. Participant Nonnaiveté and the reproducibility of cognitive psychology.

    Science.gov (United States)

    Zwaan, Rolf A; Pecher, Diane; Paolacci, Gabriele; Bouwmeester, Samantha; Verkoeijen, Peter; Dijkstra, Katinka; Zeelenberg, René

    2017-07-25

    Many argue that there is a reproducibility crisis in psychology. We investigated nine well-known effects from the cognitive psychology literature-three each from the domains of perception/action, memory, and language, respectively-and found that they are highly reproducible. Not only can they be reproduced in online environments, but they also can be reproduced with nonnaïve participants with no reduction of effect size. Apparently, some cognitive tasks are so constraining that they encapsulate behavior from external influences, such as testing situation and prior recent experience with the experiment to yield highly robust effects.

  4. Reproducibilidad de test de aceleración y cambio de dirección en fútbol. [Reproducibility of test acceleration and change of direction in football].

    Directory of Open Access Journals (Sweden)

    Julio Calleja-González

    2015-04-01

    Full Text Available El fútbol es un deporte multifacético en donde los factores condicionales y antropométricos son pre-requisitos necesarios para competir a alto nivel. Especialmente la velocidad y la agilidad son necesarias para alcanzar el rendimiento en fútbol. En el estudio participaron 10 jugadores (21,2 ± 2,0 años; 1,81 ± 0,1 m; 73,7 ± 5,9 kg; 22,5 ± 0,8 kg.m-2 con 3 años de experiencia en categoría regional. La reproducibilidad del test de sprint 20 metros y los test de capacidad de cambiar de dirección (MAT y test de sprint con cambios de dirección de 90ºS (90ºS fue analizada mediante el diseñó de test-retest. La reproducibilidad se calculó mediante el Coeficiente de Correlación Intraclase (CCI. La representación de la concordancia entre sesiones mediante el método de Bland y Altman. Los resultados sugieren una reproducibilidad aceptable de los test analizados. Se ha obtenido una asociación moderada entre los dos test de capacidad de cambio de dirección (MAT y 90ºS, r = 0,74. Una baja entre el sprint 20 m y el 90ºS (r = 0,46 y una baja entre el sprint 20 m y el MAT (r = 0,53. Los test de 20 m, MAT y 90ºS mostraron buenos valores de reproducibilidad absoluta y relativa. Podemos afirmar, que se ha obtenido una asociación moderada entre los test de capacidad de cambio de dirección (MAT y 90ºS. Una baja asociación entre el sprint 20 m y el MAT y una asociación baja entre el sprint 20 m y el test de 90ºS. Abstract Soccer is a many-sided sport, where the conditional and anthropometrical factors are necessary pre-requirements to compete in high level. Especially the speed and the agility, they are necessary to reach the yield in soccer. In the study 10 players took part (21,2 ± 2,0 years; 1,81 ± 0,1 m; 73,7 ± 5,9 kg; 22,5 ± 0,8 kg.m-2 with 3 years of experience in regional category. The reproducibility of the 20 meters sprint test and the test of capacity to change direction (MAT and test of sprint with way changes of 90ºS (90

  5. Test-retest of computerized health status questionnaires frequently used in the monitoring of knee osteoarthritis

    DEFF Research Database (Denmark)

    Gudbergsen, Henrik; Bartels, Else M.; Krusager, Peter

    2011-01-01

    ABSTRACT: BACKGROUND: To compare data based on touch screen to data based on traditional paper versions of questionnaires frequently used to examine patient reported outcomes in knee osteoarthritis patients and to examine the impact of patient characteristics on this comparison METHODS: Participa......ABSTRACT: BACKGROUND: To compare data based on touch screen to data based on traditional paper versions of questionnaires frequently used to examine patient reported outcomes in knee osteoarthritis patients and to examine the impact of patient characteristics on this comparison METHODS...... subgroups, completing either the paper or touch screen version first. Mean, mean differences (95% CI), median, median differences and Intraclass Correlation Coefficients (ICCs) were calculated for all questionnaires. RESULTS: ICCs between data based on computerized and paper versions ranged from 0.86 to 0.......99. Analysis revealed a statistically significant difference between versions of the ADL Taxonomy, but not for the remaining questionnaires. Age, computer experience or education-level had no significant impact on the results. The computerized questionnaires were reported to be easier to use. CONCLUSION...

  6. Test-retest reliability of myofascial trigger point detection in hip and thigh areas.

    Science.gov (United States)

    Rozenfeld, E; Finestone, A S; Moran, U; Damri, E; Kalichman, L

    2017-10-01

    Myofascial trigger points (MTrP's) are a primary source of pain in patients with musculoskeletal disorders. Nevertheless, they are frequently underdiagnosed. Reliable MTrP palpation is the necessary for their diagnosis and treatment. The few studies that have looked for intra-tester reliability of MTrPs detection in upper body, provide preliminary evidence that MTrP palpation is reliable. Reliability tests for MTrP palpation on the lower limb have not yet been performed. To evaluate inter- and intra-tester reliability of MTrP recognition in hip and thigh muscles. Reliability study. 21 patients (15 males and 6 females, mean age 21.1 years) referred to the physical therapy clinic, 10 with knee or hip pain and 11 with pain in an upper limb, low back, shin or ankle. Two experienced physical therapists performed the examinations, blinded to the subjects' identity, medical condition and results of the previous MTrP evaluation. Each subject was evaluated four times, twice by each examiner in a random order. Dichotomous findings included a palpable taut band, tenderness, referred pain, and relevance of referred pain to patient's complaint. Based on these, diagnosis of latent MTrP's or active MTrP's was established. The evaluation was performed on both legs and included a total of 16 locations in the following muscles: rectus femoris (proximal), vastus medialis (middle and distal), vastus lateralis (middle and distal) and gluteus medius (anterior, posterior and distal). Inter- and intra-tester reliability (Cohen's kappa (κ)) values for single sites ranged from -0.25 to 0.77. Median intra-tester reliability was 0.45 and 0.46 for latent and active MTrP's, and median inter-tester reliability was 0.51 and 0.64 for latent and active MTrPs, respectively. The examination of the distal vastus medialis was most reliable for latent and active MTrP's (intra-tester k = 0.27-0.77, inter-tester k = 0.77 and intra-tester k = 0.53-0.72, inter-tester k = 0.72, correspondingly). Inter- and intra-tester reliability of active and latent MTrP evaluation was moderate to substantial. Palpation evaluation can be used for clinical diagnosis of MTrP's in the hip and thigh muscles. This study provides evidence that MTrP palpation is a moderately reliable diagnostic tool in the hip and thigh muscles and can be used in clinical practice and research. Copyright © 2017 Elsevier Ltd. All rights reserved.

  7. Test-retest reliability and task order effects of emotional cognitive tests in healthy subjects.

    Science.gov (United States)

    Adams, Thomas; Pounder, Zoe; Preston, Sally; Hanson, Andy; Gallagher, Peter; Harmer, Catherine J; McAllister-Williams, R Hamish

    2016-11-01

    Little is known of the retest reliability of emotional cognitive tasks or the impact of using different tasks employing similar emotional stimuli within a battery. We investigated this in healthy subjects. We found improved overall performance in an emotional attentional blink task (EABT) with repeat testing at one hour and one week compared to baseline, but the impact of an emotional stimulus on performance was unchanged. Similarly, performance on a facial expression recognition task (FERT) was better one week after a baseline test, though the relative effect of specific emotions was unaltered. There was no effect of repeat testing on an emotional word categorising, recall and recognition task. We found no difference in performance in the FERT and EABT irrespective of task order. We concluded that it is possible to use emotional cognitive tasks in longitudinal studies and combine tasks using emotional facial stimuli in a single battery.

  8. Test-retest reliability of joint position and kinesthetic sense in the elbow of healthy subjects

    DEFF Research Database (Denmark)

    Juul-Kristensen, B.; Lund, Hans Aage; Hansen, K.

    2008-01-01

    Proprioception is an important effect measure in neuromuscular function training in physiotherapy. Reliability studies of methods for measuring proprioception are few on joint position sense (JPS) and threshold to detection of a passive movement (TDPM) on the elbow. The aim was to study test-rete...

  9. Wideband Acoustic Immittance: Normative Study and Test-Retest Reliability of Tympanometric Measurements in Adults

    Science.gov (United States)

    Sun, Xiao-Ming

    2016-01-01

    Purpose: The purpose of this study was to present normative data of tympanometric measurements of wideband acoustic immittance and to characterize wideband tympanograms. Method: Data were collected in 84 young adults with strictly defined normal hearing and middle ear status. Energy absorbance (EA) was measured using clicks for 1/12-octave…

  10. Test-Retest Effects in Treatment Studies of Reading Disability: The Devil Is in the Detail

    Science.gov (United States)

    McArthur, Genevieve

    2007-01-01

    Reynolds and Nicolson ("Dyslexia," 2007; 13: 78-96) claim to show that the "dyslexia dyspraxia attention-deficit treatment" (DDAT) benefits children with reading difficulties. However, Rack, Snowling, Hulme, and Gibbs ("Dyslexia," 2007; 13: 97-104) argue that because this study did not include an untrained control group then "all that needs to be…

  11. Test-retest reliability of the soleus H-reflex excitability measured during human walking

    DEFF Research Database (Denmark)

    Simonsen, Erik B; Dyhre-Poulsen, Poul

    2010-01-01

    The purpose of the study was to investigate with what accuracy the soleus H-reflex modulation and excitability could be measured during human walking on two occasions separated by days. The maximal M-wave (Mmax) was measured at rest in the standing position. During treadmill walking every stimulus...... elicited an M-wave of 25+/-10% of Mmax in the soleus muscle and a supra-maximal stimulus elicited a maximal M-wave 60ms after the first stimulus. Both Mmax during rest and during walking were later used for normalization. When normalized to resting Mmax, the peak reflex amplitude during walking was 5...

  12. Test-Retest Intervisit Variability of Functional and Structural Parameters in X-Linked Retinoschisis.

    Science.gov (United States)

    Jeffrey, Brett G; Cukras, Catherine A; Vitale, Susan; Turriff, Amy; Bowles, Kristin; Sieving, Paul A

    2014-09-01

    To examine the variability of four outcome measures that could be used to address safety and efficacy in therapeutic trials with X-linked juvenile retinoschisis. Seven men with confirmed mutations in the RS1 gene were evaluated over four visits spanning 6 months. Assessments included visual acuity, full-field electroretinograms (ERG), microperimetric macular sensitivity, and retinal thickness measured by optical coherence tomography (OCT). Eyes were separated into Better or Worse Eye groups based on acuity at baseline. Repeatability coefficients were calculated for each parameter and jackknife resampling used to derive 95% confidence intervals (CIs). The threshold for statistically significant change in visual acuity ranged from three to eight letters. For ERG a-wave, an amplitude reduction greater than 56% would be considered significant. For other parameters, variabilities were lower in the Worse Eye group, likely a result of floor effects due to collapse of the schisis pockets and/or retinal atrophy. The criteria for significant change (Better/Worse Eye) for three important parameters were: ERG b/a-wave ratio (0.44/0.23), point wise sensitivity (10.4/7.0 dB), and central retinal thickness (31%/18%). The 95% CI range for visual acuity, ERG, retinal sensitivity, and central retinal thickness relative to baseline are described for this cohort of participants with X-linked juvenile retinoschisis (XLRS). A quantitative understanding of the variability of outcome measures is vital to establishing the safety and efficacy limits for therapeutic trials of XLRS patients.

  13. The Test-Retest Reliability of New Generation Power Indices of Wingate All-Out Test

    Directory of Open Access Journals (Sweden)

    Ozgur Ozkaya

    2018-04-01

    Full Text Available Although reliability correlations of traditional power indices of the Wingate test have been well documented, no study has analyzed new generation power indices based on milliseconds obtained from a Peak Bike. The purpose of this study was to investigate the retest reliability of new generation power indices. Thirty-two well-trained male athletes who were specialized in basketball, football, tennis, or track and field volunteered to take part in the study (age: 24.3 ± 2.2 years; body mass: 77 ± 8.3 kg; height: 180.3 ± 6.3 cm. Participants performed two Wingate all-out sessions on two separate days. Intra-class correlation coefficient (ICC, standard error measurement (SEM, smallest real differences (SRD and coefficient of variation (CV scores were analyzed based on the test and retest data. Reliability results of traditional power indices calculated based on 5-s means such as peak power, average power, power drop, and fatigue index ratio were similar with the previous findings in literature (ICC ≥ 0.94; CV ≤ 2.8%; SEM ≤ 12.28; SRD% ≤ 7.7%. New generation power indices such as peak power, average power, lowest power, power drop, fatigue index, power decline, maximum speed as rpm, and amount of total energy expenditure demonstrated high reliability (ICC ≥ 0.94; CV ≤ 4.3%; SEM ≤ 10.36; SRD% ≤ 8.8%. Time to peak power, time at maximum speed, and power at maximum speed showed a moderate level of reliability (ICC ≥ 0.73; CV ≤ 8.9%; SEM ≤ 63.01; SRD% ≤ 22.4%. The results of this study indicate that reliability correlations and SRD% of new generation power and fatigue-related indices are similar with traditional 5-s means. However, new time-related indices are very sensitive and moderately reliable.

  14. Reproducible diagnosis of Chronic Lymphocytic Leukemia by flow cytometry

    DEFF Research Database (Denmark)

    Rawstron, Andy C; Kreuzer, Karl-Anton; Soosapilla, Asha

    2018-01-01

    The diagnostic criteria for CLL rely on morphology and immunophenotype. Current approaches have limitations affecting reproducibility and there is no consensus on the role of new markers. The aim of this project was to identify reproducible criteria and consensus on markers recommended for the di...

  15. Genotypic variability enhances the reproducibility of an ecological study.

    Science.gov (United States)

    Milcu, Alexandru; Puga-Freitas, Ruben; Ellison, Aaron M; Blouin, Manuel; Scheu, Stefan; Freschet, Grégoire T; Rose, Laura; Barot, Sebastien; Cesarz, Simone; Eisenhauer, Nico; Girin, Thomas; Assandri, Davide; Bonkowski, Michael; Buchmann, Nina; Butenschoen, Olaf; Devidal, Sebastien; Gleixner, Gerd; Gessler, Arthur; Gigon, Agnès; Greiner, Anna; Grignani, Carlo; Hansart, Amandine; Kayler, Zachary; Lange, Markus; Lata, Jean-Christophe; Le Galliard, Jean-François; Lukac, Martin; Mannerheim, Neringa; Müller, Marina E H; Pando, Anne; Rotter, Paula; Scherer-Lorenzen, Michael; Seyhun, Rahme; Urban-Mead, Katherine; Weigelt, Alexandra; Zavattaro, Laura; Roy, Jacques

    2018-02-01

    Many scientific disciplines are currently experiencing a 'reproducibility crisis' because numerous scientific findings cannot be repeated consistently. A novel but controversial hypothesis postulates that stringent levels of environmental and biotic standardization in experimental studies reduce reproducibility by amplifying the impacts of laboratory-specific environmental factors not accounted for in study designs. A corollary to this hypothesis is that a deliberate introduction of controlled systematic variability (CSV) in experimental designs may lead to increased reproducibility. To test this hypothesis, we had 14 European laboratories run a simple microcosm experiment using grass (Brachypodium distachyon L.) monocultures and grass and legume (Medicago truncatula Gaertn.) mixtures. Each laboratory introduced environmental and genotypic CSV within and among replicated microcosms established in either growth chambers (with stringent control of environmental conditions) or glasshouses (with more variable environmental conditions). The introduction of genotypic CSV led to 18% lower among-laboratory variability in growth chambers, indicating increased reproducibility, but had no significant effect in glasshouses where reproducibility was generally lower. Environmental CSV had little effect on reproducibility. Although there are multiple causes for the 'reproducibility crisis', deliberately including genetic variability may be a simple solution for increasing the reproducibility of ecological studies performed under stringently controlled environmental conditions.

  16. Participant Nonnaiveté and the reproducibility of cognitive psychology

    NARCIS (Netherlands)

    R.A. Zwaan (Rolf); D. Pecher (Diane); G. Paolacci (Gabriele); S. Bouwmeester (Samantha); P.P.J.L. Verkoeijen (Peter); K. Dijkstra (Katinka); R. Zeelenberg (René)

    2017-01-01

    textabstractMany argue that there is a reproducibility crisis in psychology. We investigated nine well-known effects from the cognitive psychology literature—three each from the domains of perception/action, memory, and language, respectively—and found that they are highly reproducible. Not only can

  17. Reproducing Kernels and Coherent States on Julia Sets

    Energy Technology Data Exchange (ETDEWEB)

    Thirulogasanthar, K., E-mail: santhar@cs.concordia.ca; Krzyzak, A. [Concordia University, Department of Computer Science and Software Engineering (Canada)], E-mail: krzyzak@cs.concordia.ca; Honnouvo, G. [Concordia University, Department of Mathematics and Statistics (Canada)], E-mail: g_honnouvo@yahoo.fr

    2007-11-15

    We construct classes of coherent states on domains arising from dynamical systems. An orthonormal family of vectors associated to the generating transformation of a Julia set is found as a family of square integrable vectors, and, thereby, reproducing kernels and reproducing kernel Hilbert spaces are associated to Julia sets. We also present analogous results on domains arising from iterated function systems.

  18. Reproducing Kernels and Coherent States on Julia Sets

    International Nuclear Information System (INIS)

    Thirulogasanthar, K.; Krzyzak, A.; Honnouvo, G.

    2007-01-01

    We construct classes of coherent states on domains arising from dynamical systems. An orthonormal family of vectors associated to the generating transformation of a Julia set is found as a family of square integrable vectors, and, thereby, reproducing kernels and reproducing kernel Hilbert spaces are associated to Julia sets. We also present analogous results on domains arising from iterated function systems

  19. Completely reproducible description of digital sound data with cellular automata

    International Nuclear Information System (INIS)

    Wada, Masato; Kuroiwa, Jousuke; Nara, Shigetoshi

    2002-01-01

    A novel method of compressive and completely reproducible description of digital sound data by means of rule dynamics of CA (cellular automata) is proposed. The digital data of spoken words and music recorded with the standard format of a compact disk are reproduced completely by this method with use of only two rules in a one-dimensional CA without loss of information

  20. Quantification, Variability, and Reproducibility of Basal Skeletal Muscle Glucose Uptake in Healthy Humans Using 18F-FDG PET/CT.

    Science.gov (United States)

    Gheysens, Olivier; Postnov, Andrey; Deroose, Christophe M; Vandermeulen, Corinne; de Hoon, Jan; Declercq, Ruben; Dennie, Justin; Mixson, Lori; De Lepeleire, Inge; Van Laere, Koen; Klimas, Michael; Chakravarthy, Manu V

    2015-10-01

    The quantification and variability of skeletal muscle glucose utilization (SMGU) in healthy subjects under basal (low insulin) conditions are poorly known. This information is essential early in clinical drug development to effectively interrogate novel pharmacologic interventions that modulate glucose uptake. The aim of this study was to determine test-retest characteristics and variability of SMGU within and between healthy subjects under basal conditions. Furthermore, different kinetic modeling strategies were evaluated to find the best-fitting model to assess SMGU studied by 18F-FDG. Six healthy male volunteers underwent 2 dynamic 18F-FDG PET/CT scans with an interval of 24 h. Subjects were admitted to the clinical unit to minimize variability in daily activities and food intake and restrict physical activity. 18F-FDG PET/CT scans of gluteal and quadriceps muscle area were obtained with arterial input. Regions of interest were drawn over the muscle area to obtain time-activity curves and standardized uptake values (SUVs) between 60 and 90 min. Spectral analysis of the data and kinetic modeling was performed using 2-tissue-irreversible (2T3K), 2-tissue-reversible, and 3-tissue-sequential-irreversible (3T5KS) models. Reproducibility was assessed by intraclass correlation coefficients (ICCs) and within-subject coefficient of variation (WSCV). SUVs in gluteal and quadriceps areas were 0.56±0.09 and 0.64±0.07. ICCs (with 90% confidence intervals in parentheses) were 0.88 (0.64-0.96) and 0.96 (0.82-0.99), respectively, for gluteal and quadriceps muscles, and WSCV for gluteal and quadriceps muscles was 2.2% and 3.6%, respectively. The rate of glucose uptake into muscle was 0.0016±0.0004 mL/mL⋅min, with an ICC of 0.94 (0.93-0.95) and WSCV of 6.6% for the 3T5KS model, whereas an ICC of 0.98 (0.92-1.00) and WSCV of 2.8% was obtained for the 2T3K model. 3T5KS demonstrated the best fit to the measured experimental points. Minimal variability in skeletal muscle glucose

  1. Effect of Initial Conditions on Reproducibility of Scientific Research

    Science.gov (United States)

    Djulbegovic, Benjamin; Hozo, Iztok

    2014-01-01

    Background: It is estimated that about half of currently published research cannot be reproduced. Many reasons have been offered as explanations for failure to reproduce scientific research findings- from fraud to the issues related to design, conduct, analysis, or publishing scientific research. We also postulate a sensitive dependency on initial conditions by which small changes can result in the large differences in the research findings when attempted to be reproduced at later times. Methods: We employed a simple logistic regression equation to model the effect of covariates on the initial study findings. We then fed the input from the logistic equation into a logistic map function to model stability of the results in repeated experiments over time. We illustrate the approach by modeling effects of different factors on the choice of correct treatment. Results: We found that reproducibility of the study findings depended both on the initial values of all independent variables and the rate of change in the baseline conditions, the latter being more important. When the changes in the baseline conditions vary by about 3.5 to about 4 in between experiments, no research findings could be reproduced. However, when the rate of change between the experiments is ≤2.5 the results become highly predictable between the experiments. Conclusions: Many results cannot be reproduced because of the changes in the initial conditions between the experiments. Better control of the baseline conditions in-between the experiments may help improve reproducibility of scientific findings. PMID:25132705

  2. Reproducible and controllable induction voltage adder for scaled beam experiments

    Energy Technology Data Exchange (ETDEWEB)

    Sakai, Yasuo; Nakajima, Mitsuo; Horioka, Kazuhiko [Department of Energy Sciences, Tokyo Institute of Technology, 4259 Nagatsuta, Midori-ku, Yokohama 226-8502 (Japan)

    2016-08-15

    A reproducible and controllable induction adder was developed using solid-state switching devices and Finemet cores for scaled beam compression experiments. A gate controlled MOSFET circuit was developed for the controllable voltage driver. The MOSFET circuit drove the induction adder at low magnetization levels of the cores which enabled us to form reproducible modulation voltages with jitter less than 0.3 ns. Preliminary beam compression experiments indicated that the induction adder can improve the reproducibility of modulation voltages and advance the beam physics experiments.

  3. Reproducibilidad y sensibilidad de un cuestionario de actividad física en población mexicana Reproducibility and sensitivity of a physical activity questionnaire in Mexican people

    Directory of Open Access Journals (Sweden)

    Juan Carlos López-Alvarenga

    2001-08-01

    Full Text Available Objetivo. Determinar si el cuestionario de actividad física (CAF de Laval es reproducible y sensible para detectar diferencias en grupos de mexicanos con peso normal y en obesos. Material y métodos. Estudio efectuado en el Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, entre enero y mayo de 1999, en México, D.F. El CAF se tradujo al castellano y se adaptó a población mexicana. Se midió la reproducibilidad por prueba-reprueba, con cuatro semanas de diferencia (n=30 sujetos con obesidad. Para determinar la sensibilidad del cuestionario se comparó un grupo de jóvenes cadetes (n=18 con otro de jóvenes civiles (n=32. Se utilizó como concordancia el coeficiente de correlación intraclase y se empleó la prueba t de student pareada o para muestras independientes, según fuera necesario. Resultados. El coeficiente de correlación intraclase fue de 0.86. El CAF fue sensible al demostrar diferencias de más de 400 kcal/día (1 674 kJ/día y más de 4 kcal/kg/día (17 kJ/kg/día entre jóvenes con actividad física importante (t de Student. Conclusiones. El CAF es un instrumento sensible y reproducible que puede ser utilizado en población mexicana. El texto completo en inglés de este artículo está disponible en: http://www.insp.mx/salud/index.htmlObjective. To assess the reproducibility and sensitivity of a physical activity questionnaire (PAQ developed at Laval University, to detect differences in lean and obese individuals. Material and Methods. A cross-sectional study was conducted at Mexico's National Institute of Medical Sciences and Nutrition, between January and May 1999. The PAQ was translated into Spanish and adjusted to the Mexican setting. The test-retest method was used to measure reliability, allowing a four-week interval between tests (n=30 overweight subjects. To assess the questionnaire's sensitivity a group of young cadets (n=18 was compared to a group of young civilians (n=32. Concordance was

  4. Reproducibility of corneal, macular and retinal nerve fiber layer ...

    African Journals Online (AJOL)

    side the limits of a consulting room.5. Reproducibility of ... examination, intraocular pressure and corneal thickness ... All OCT measurements were taken between 2 and 5 pm ..... CAS-OCT, Slit-lamp OCT, RTVue-100) have shown ICC.

  5. Beyond Bundles - Reproducible Software Environments with GNU Guix

    CERN Multimedia

    CERN. Geneva; Wurmus, Ricardo

    2018-01-01

    Building reproducible data analysis pipelines and numerical experiments is a key challenge for reproducible science, in which tools to reproduce software environments play a critical role. The advent of “container-based” deployment tools such as Docker and Singularity has made it easier to replicate software environments. These tools are very much about bundling the bits of software binaries in a convenient way, not so much about describing how software is composed. Science is not just about replicating, though—it demands the ability to inspect and to experiment. In this talk we will present GNU Guix, a software management toolkit. Guix departs from container-based solutions in that it enables declarative composition of software environments. It is comparable to “package managers” like apt or yum, but with a significant difference: Guix provides accurate provenance tracking of build artifacts, and bit-reproducible software. We will illustrate the many ways in which Guix can improve how software en...

  6. The reproducibility of random amplified polymorphic DNA (RAPD ...

    African Journals Online (AJOL)

    RAPD) profiles of Streptococcus thermophilus strains by using the polymerase chain reaction (PCR). Several factors can cause the amplification of false and non reproducible bands in the RAPD profiles. We tested three primers, OPI-02 MOD, ...

  7. Systematic heterogenization for better reproducibility in animal experimentation.

    Science.gov (United States)

    Richter, S Helene

    2017-08-31

    The scientific literature is full of articles discussing poor reproducibility of findings from animal experiments as well as failures to translate results from preclinical animal studies to clinical trials in humans. Critics even go so far as to talk about a "reproducibility crisis" in the life sciences, a novel headword that increasingly finds its way into numerous high-impact journals. Viewed from a cynical perspective, Fett's law of the lab "Never replicate a successful experiment" has thus taken on a completely new meaning. So far, poor reproducibility and translational failures in animal experimentation have mostly been attributed to biased animal data, methodological pitfalls, current publication ethics and animal welfare constraints. More recently, the concept of standardization has also been identified as a potential source of these problems. By reducing within-experiment variation, rigorous standardization regimes limit the inference to the specific experimental conditions. In this way, however, individual phenotypic plasticity is largely neglected, resulting in statistically significant but possibly irrelevant findings that are not reproducible under slightly different conditions. By contrast, systematic heterogenization has been proposed as a concept to improve representativeness of study populations, contributing to improved external validity and hence improved reproducibility. While some first heterogenization studies are indeed very promising, it is still not clear how this approach can be transferred into practice in a logistically feasible and effective way. Thus, further research is needed to explore different heterogenization strategies as well as alternative routes toward better reproducibility in animal experimentation.

  8. Shear wave elastography for breast masses is highly reproducible.

    Science.gov (United States)

    Cosgrove, David O; Berg, Wendie A; Doré, Caroline J; Skyba, Danny M; Henry, Jean-Pierre; Gay, Joel; Cohen-Bacrie, Claude

    2012-05-01

    To evaluate intra- and interobserver reproducibility of shear wave elastography (SWE) for breast masses. For intraobserver reproducibility, each observer obtained three consecutive SWE images of 758 masses that were visible on ultrasound. 144 (19%) were malignant. Weighted kappa was used to assess the agreement of qualitative elastographic features; the reliability of quantitative measurements was assessed by intraclass correlation coefficients (ICC). For the interobserver reproducibility, a blinded observer reviewed images and agreement on features was determined. Mean age was 50 years; mean mass size was 13 mm. Qualitatively, SWE images were at least reasonably similar for 666/758 (87.9%). Intraclass correlation for SWE diameter, area and perimeter was almost perfect (ICC ≥ 0.94). Intraobserver reliability for maximum and mean elasticity was almost perfect (ICC = 0.84 and 0.87) and was substantial for the ratio of mass-to-fat elasticity (ICC = 0.77). Interobserver agreement was moderate for SWE homogeneity (κ = 0.57), substantial for qualitative colour assessment of maximum elasticity (κ = 0.66), fair for SWE shape (κ = 0.40), fair for B-mode mass margins (κ = 0.38), and moderate for B-mode mass shape (κ = 0.58), orientation (κ = 0.53) and BI-RADS assessment (κ = 0.59). SWE is highly reproducible for assessing elastographic features of breast masses within and across observers. SWE interpretation is at least as consistent as that of BI-RADS ultrasound B-mode features. • Shear wave ultrasound elastography can measure the stiffness of breast tissue • It provides a qualitatively and quantitatively interpretable colour-coded map of tissue stiffness • Intraobserver reproducibility of SWE is almost perfect while intraobserver reproducibility of SWE proved to be moderate to substantial • The most reproducible SWE features between observers were SWE image homogeneity and maximum elasticity.

  9. Reproducibility of computer-aided detection system in digital mammograms

    International Nuclear Information System (INIS)

    Kim, Seung Ja; Cho, Nariya; Cha, Joo Hee; Chung, Hye Kyung; Lee, Sin Ho; Cho, Kyung Soo; Kim, Sun Mi; Moon, Woo Kyung

    2005-01-01

    To evaluate the reproducibility of the computer-aided detection (CAD) system for digital mammograms. We applied the CAD system (ImageChecker M1000-DM, version 3.1; R2 Technology) to full field digital mammograms. These mammograms were taken twice at an interval of 10-45 days (mean:25 days) for 34 preoperative patients (breast cancer n=27, benign disease n=7, age range:20-66 years, mean age:47.9 years). On the mammograms, lesions were visible in 19 patients and these were depicted as 15 masses and 12 calcification clusters. We analyzed the sensitivity, the false positive rate (FPR) and the reproducibility of the CAD marks. The broader sensitivities of the CAD system were 80% (12 of 15), 67%(10 of 15) for masses and those for calcification clusters were 100% (12 of 12). The strict sensitivities were 50% (15 of 30) and 50% (15 of 30) for masses and 92% (22 of 24) and 79% (19 of 24) for the clusters. The FPR for the masses was 0.21-0.22/image, the FPR for the clusters was 0.03-0.04/image and the total FPR was 0.24-0.26/image. Among 132 mammography images, the identical images regardless of the existence of CAD marks were 59% (78 of 132), and the identical images with CAD marks were 22% (15 of 69). The reproducibility of the CAD marks for the true positive mass was 67% (12 of 18) and 71% (17 of 24) for the true positive cluster. The reproducibility of CAD marks for the false positive mass was 8% (4 of 53), and the reproducibility of CAD marks for the false positive clusters was 14% (1 of 7). The reproducibility of the total mass marks was 23% (16 of 71), and the reproducibility of the total cluster marks was 58% (18 of 31). CAD system showed higher sensitivity and reproducibility of CAD marks for the calcification clusters which are related to breast cancer. Yet the overall reproducibility of CAD marks was low; therefore, the CAD system must be applied considering this limitation

  10. Using prediction markets to estimate the reproducibility of scientific research

    Science.gov (United States)

    Dreber, Anna; Pfeiffer, Thomas; Almenberg, Johan; Isaksson, Siri; Wilson, Brad; Chen, Yiling; Nosek, Brian A.; Johannesson, Magnus

    2015-01-01

    Concerns about a lack of reproducibility of statistically significant results have recently been raised in many fields, and it has been argued that this lack comes at substantial economic costs. We here report the results from prediction markets set up to quantify the reproducibility of 44 studies published in prominent psychology journals and replicated in the Reproducibility Project: Psychology. The prediction markets predict the outcomes of the replications well and outperform a survey of market participants’ individual forecasts. This shows that prediction markets are a promising tool for assessing the reproducibility of published scientific results. The prediction markets also allow us to estimate probabilities for the hypotheses being true at different testing stages, which provides valuable information regarding the temporal dynamics of scientific discovery. We find that the hypotheses being tested in psychology typically have low prior probabilities of being true (median, 9%) and that a “statistically significant” finding needs to be confirmed in a well-powered replication to have a high probability of being true. We argue that prediction markets could be used to obtain speedy information about reproducibility at low cost and could potentially even be used to determine which studies to replicate to optimally allocate limited resources into replications. PMID:26553988

  11. Validation and reproducibility of an Australian caffeine food frequency questionnaire.

    Science.gov (United States)

    Watson, E J; Kohler, M; Banks, S; Coates, A M

    2017-08-01

    The aim of this study was to measure validity and reproducibility of a caffeine food frequency questionnaire (C-FFQ) developed for the Australian population. The C-FFQ was designed to assess average daily caffeine consumption using four categories of food and beverages including; energy drinks; soft drinks/soda; coffee and tea and chocolate (food and drink). Participants completed a seven-day food diary immediately followed by the C-FFQ on two consecutive days. The questionnaire was first piloted in 20 adults, and then, a validity/reproducibility study was conducted (n = 90 adults). The C-FFQ showed moderate correlations (r = .60), fair agreement (mean difference 63 mg) and reasonable quintile rankings indicating fair to moderate agreement with the seven-day food diary. To test reproducibility, the C-FFQ was compared to itself and showed strong correlations (r = .90), good quintile rankings and strong kappa values (κ = 0.65), indicating strong reproducibility. The C-FFQ shows adequate validity and reproducibility and will aid researchers in Australia to quantify caffeine consumption.

  12. Using prediction markets to estimate the reproducibility of scientific research.

    Science.gov (United States)

    Dreber, Anna; Pfeiffer, Thomas; Almenberg, Johan; Isaksson, Siri; Wilson, Brad; Chen, Yiling; Nosek, Brian A; Johannesson, Magnus

    2015-12-15

    Concerns about a lack of reproducibility of statistically significant results have recently been raised in many fields, and it has been argued that this lack comes at substantial economic costs. We here report the results from prediction markets set up to quantify the reproducibility of 44 studies published in prominent psychology journals and replicated in the Reproducibility Project: Psychology. The prediction markets predict the outcomes of the replications well and outperform a survey of market participants' individual forecasts. This shows that prediction markets are a promising tool for assessing the reproducibility of published scientific results. The prediction markets also allow us to estimate probabilities for the hypotheses being true at different testing stages, which provides valuable information regarding the temporal dynamics of scientific discovery. We find that the hypotheses being tested in psychology typically have low prior probabilities of being true (median, 9%) and that a "statistically significant" finding needs to be confirmed in a well-powered replication to have a high probability of being true. We argue that prediction markets could be used to obtain speedy information about reproducibility at low cost and could potentially even be used to determine which studies to replicate to optimally allocate limited resources into replications.

  13. The intra-individual reproducibility of flash-evoked potentials in a sample of children.

    Science.gov (United States)

    Schellberg, D; Gasser, T; Köhler, W

    1987-07-01

    Visual evoked potentials (VEPs) to flash stimuli were recorded twice from 26 children aged 10-13 years, with an intersession interval of about 10 months. Test-retest reliability was poor for recordings taken from scalp locations overlying non-specific cortex and somewhat better for specific cortex. The size of consistency coefficients (i.e. correlations within session) showed that noise and artefacts were not the decisive factors which lower reliability. A comparison with retest correlations of broad band parameters of the EEG at rest for the same sample showed, to our surprise, smaller retest reliability for VEP parameters. Variability of the VEP in children over time seems to be a substantial as its well-known inter-individual variability.

  14. Versão em português do Chronic Respiratory Questionnaire: estudo da validade e reprodutibilidade Portuguese-language version of the Chronic Respiratory Questionnaire: a validity and reproducibility study

    Directory of Open Access Journals (Sweden)

    Graciane Laender Moreira

    2009-08-01

    -minute walk test (6MWT were performed to analyze the correlations with the CRQ scores. RESULTS: There were no significant CRQ test-retest differences (p > 0.05 for all domains. The test-retest intraclass correlation coefficient was 0.98, 0.97, 0.98 and 0.95 for the dyspnea, fatigue, emotional function and mastery domains, respectively. The Cronbach's alpha coefficient was 0.91. The CRQ domains correlated significantly with the SGRQ domains (-0.30 < r < -0.67; p < 0.05. There were no significant correlations between spirometric variables and the CRQ domains or between the CRQ domains and the 6MWT, with the exception of the fatigue domain (r = 0.30; p = 0.04. CONCLUSIONS: The Portuguese-language version of the CRQ proved to be reproducible and valid for use in Brazilian patients with COPD.

  15. The quest for improved reproducibility in MALDI mass spectrometry.

    Science.gov (United States)

    O'Rourke, Matthew B; Djordjevic, Steven P; Padula, Matthew P

    2018-03-01

    Reproducibility has been one of the biggest hurdles faced when attempting to develop quantitative protocols for MALDI mass spectrometry. The heterogeneous nature of sample recrystallization has made automated sample acquisition somewhat "hit and miss" with manual intervention needed to ensure that all sample spots have been analyzed. In this review, we explore the last 30 years of literature and anecdotal evidence that has attempted to address and improve reproducibility in MALDI MS. Though many methods have been attempted, we have discovered a significant publication history surrounding the use of nitrocellulose as a substrate to improve homogeneity of crystal formation and therefore reproducibility. We therefore propose that this is the most promising avenue of research for developing a comprehensive and universal preparation protocol for quantitative MALDI MS analysis. © 2016 Wiley Periodicals, Inc. Mass Spec Rev 37:217-228, 2018. © 2016 Wiley Periodicals, Inc.

  16. Dysplastic naevus: histological criteria and their inter-observer reproducibility.

    Science.gov (United States)

    Hastrup, N; Clemmensen, O J; Spaun, E; Søndergaard, K

    1994-06-01

    Forty melanocytic lesions were examined in a pilot study, which was followed by a final series of 100 consecutive melanocytic lesions, in order to evaluate the inter-observer reproducibility of the histological criteria proposed for the dysplastic naevus. The specimens were examined in a blind fashion by four observers. Analysis by kappa statistics showed poor reproducibility of nuclear features, while reproducibility of architectural features was acceptable, improving in the final series. Consequently, we cannot apply the combined criteria of cytological and architectural features with any confidence in the diagnosis of dysplastic naevus, and, until further studies have documented that architectural criteria alone will suffice in the diagnosis of dysplastic naevus, we, as pathologists, shall avoid this term.

  17. Relevant principal factors affecting the reproducibility of insect primary culture.

    Science.gov (United States)

    Ogata, Norichika; Iwabuchi, Kikuo

    2017-06-01

    The primary culture of insect cells often suffers from problems with poor reproducibility in the quality of the final cell preparations. The cellular composition of the explants (cell number and cell types), surgical methods (surgical duration and surgical isolation), and physiological and genetic differences between donors may be critical factors affecting the reproducibility of culture. However, little is known about where biological variation (interindividual differences between donors) ends and technical variation (variance in replication of culture conditions) begins. In this study, we cultured larval fat bodies from the Japanese rhinoceros beetle, Allomyrina dichotoma, and evaluated, using linear mixed models, the effect of interindividual variation between donors on the reproducibility of the culture. We also performed transcriptome analysis of the hemocyte-like cells mainly seen in the cultures using RNA sequencing and ultrastructural analyses of hemocytes using a transmission electron microscope, revealing that the cultured cells have many characteristics of insect hemocytes.

  18. Reproducibility of frequency-dependent low frequency fluctuations in reaction time over time and across tasks.

    Science.gov (United States)

    Liu, Zan-Zan; Qu, Hui-Jie; Tian, Zhuo-Ling; Han, Meng-Jian; Fan, Yi; Ge, Lie-Zhong; Zang, Yu-Feng; Zhang, Hang

    2017-01-01

    Increased levels of reaction time variability (RTV) are characteristics of sustained attention deficits. The clinical significance of RTV has been widely recognized. However, the reliability of RTV measurements has not been widely studied. The present study aimed to assess the test-retest reliability of RTV conventional measurements, e.g., the standard deviation (SD), the coefficient of variation (CV), and a new measurement called the amplitude of low frequency fluctuation (ALFF) of RT. In addition, we aimed to assess differences and similarities of these measurements between different tasks. Thirty-seven healthy college students participated in 2 tasks, i.e., an Eriksen flanker task (EFT) and a simple reaction task (SRT), twice over a mean interval of 56 days. Conventional measurements of RTV including RT-SD and RT-CV were assessed first. Then the RT time series were converted into frequency domains, and RT-ALFF was further calculated for the whole frequency band (0.0023-0.167 Hz) and for a few sub-frequency bands including Slow-6 (frequency bands (Slow-3), but SRT RT-ALFF values showed slightly higher ICC values than EFT values in lower frequency bands (Slow-5 and Slow-4). 2) RT-ALFF magnitudes in each sub-frequency band were greater for the SRT than those for the EFT. 3) The RT-ALFF in the Slow-4 of the EFT was found to be correlated with the RT-ALFF in the Slow-5 of the SRT for both two visits, but no consistently significant correlation was found between the same frequency bands. These findings reveal good test-retest reliability for conventional measurements and for the RT-ALFF of RTV. The RT-ALFF presented frequency-dependent similarities across tasks. All of our results reveal the presence of different frequency structures between the two tasks, and thus the frequency-dependent characteristics of different tasks deserve more attention in future studies.

  19. Reproducibility of clinical research in critical care: a scoping review.

    Science.gov (United States)

    Niven, Daniel J; McCormick, T Jared; Straus, Sharon E; Hemmelgarn, Brenda R; Jeffs, Lianne; Barnes, Tavish R M; Stelfox, Henry T

    2018-02-21

    The ability to reproduce experiments is a defining principle of science. Reproducibility of clinical research has received relatively little scientific attention. However, it is important as it may inform clinical practice, research agendas, and the design of future studies. We used scoping review methods to examine reproducibility within a cohort of randomized trials examining clinical critical care research and published in the top general medical and critical care journals. To identify relevant clinical practices, we searched the New England Journal of Medicine, The Lancet, and JAMA for randomized trials published up to April 2016. To identify a comprehensive set of studies for these practices, included articles informed secondary searches within other high-impact medical and specialty journals. We included late-phase randomized controlled trials examining therapeutic clinical practices in adults admitted to general medical-surgical or specialty intensive care units (ICUs). Included articles were classified using a reproducibility framework. An original study was the first to evaluate a clinical practice. A reproduction attempt re-evaluated that practice in a new set of participants. Overall, 158 practices were examined in 275 included articles. A reproduction attempt was identified for 66 practices (42%, 95% CI 33-50%). Original studies reported larger effects than reproduction attempts (primary endpoint, risk difference 16.0%, 95% CI 11.6-20.5% vs. 8.4%, 95% CI 6.0-10.8%, P = 0.003). More than half of clinical practices with a reproduction attempt demonstrated effects that were inconsistent with the original study (56%, 95% CI 42-68%), among which a large number were reported to be efficacious in the original study and to lack efficacy in the reproduction attempt (34%, 95% CI 19-52%). Two practices reported to be efficacious in the original study were found to be harmful in the reproduction attempt. A minority of critical care practices with research published

  20. Effective Form of Reproducing the Total Financial Potential of Ukraine

    Directory of Open Access Journals (Sweden)

    Portna Oksana V.

    2015-03-01

    Full Text Available Development of scientific principles of reproducing the total financial potential of the country and its effective form is an urgent problem both in theoretical and practical aspects of the study, the solution of which is intended to ensure the active mobilization and effective use of the total financial potential of Ukraine, and as a result — its expanded reproduction as well, which would contribute to realization of the internal capacities for stabilization of the national economy. The purpose of the article is disclosing the essence of the effective form of reproducing the total financial potential of the country, analyzing the results of reproducing the total financial potential of Ukraine. It has been proved that the basis for the effective form of reproducing the total financial potential of the country is the volume and flow of resources, which are associated with the «real» economy, affect the dynamics of GDP and define it, i.e. resource and process forms of reproducing the total financial potential of Ukraine (which precede the effective one. The analysis of reproducing the total financial potential of Ukraine has shown that in the analyzed period there was an increase in the financial possibilities of the country, but steady dynamics of reduction of the total financial potential was observed. If we consider the amount of resources involved in production, creating a net value added and GDP, it occurs on a restricted basis. Growth of the total financial potential of Ukraine is connected only with extensive quantitative factors rather than intensive qualitative changes.

  1. The MIMIC Code Repository: enabling reproducibility in critical care research.

    Science.gov (United States)

    Johnson, Alistair Ew; Stone, David J; Celi, Leo A; Pollard, Tom J

    2018-01-01

    Lack of reproducibility in medical studies is a barrier to the generation of a robust knowledge base to support clinical decision-making. In this paper we outline the Medical Information Mart for Intensive Care (MIMIC) Code Repository, a centralized code base for generating reproducible studies on an openly available critical care dataset. Code is provided to load the data into a relational structure, create extractions of the data, and reproduce entire analysis plans including research studies. Concepts extracted include severity of illness scores, comorbid status, administrative definitions of sepsis, physiologic criteria for sepsis, organ failure scores, treatment administration, and more. Executable documents are used for tutorials and reproduce published studies end-to-end, providing a template for future researchers to replicate. The repository's issue tracker enables community discussion about the data and concepts, allowing users to collaboratively improve the resource. The centralized repository provides a platform for users of the data to interact directly with the data generators, facilitating greater understanding of the data. It also provides a location for the community to collaborate on necessary concepts for research progress and share them with a larger audience. Consistent application of the same code for underlying concepts is a key step in ensuring that research studies on the MIMIC database are comparable and reproducible. By providing open source code alongside the freely accessible MIMIC-III database, we enable end-to-end reproducible analysis of electronic health records. © The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association.

  2. Language-Agnostic Reproducible Data Analysis Using Literate Programming.

    Science.gov (United States)

    Vassilev, Boris; Louhimo, Riku; Ikonen, Elina; Hautaniemi, Sampsa

    2016-01-01

    A modern biomedical research project can easily contain hundreds of analysis steps and lack of reproducibility of the analyses has been recognized as a severe issue. While thorough documentation enables reproducibility, the number of analysis programs used can be so large that in reality reproducibility cannot be easily achieved. Literate programming is an approach to present computer programs to human readers. The code is rearranged to follow the logic of the program, and to explain that logic in a natural language. The code executed by the computer is extracted from the literate source code. As such, literate programming is an ideal formalism for systematizing analysis steps in biomedical research. We have developed the reproducible computing tool Lir (literate, reproducible computing) that allows a tool-agnostic approach to biomedical data analysis. We demonstrate the utility of Lir by applying it to a case study. Our aim was to investigate the role of endosomal trafficking regulators to the progression of breast cancer. In this analysis, a variety of tools were combined to interpret the available data: a relational database, standard command-line tools, and a statistical computing environment. The analysis revealed that the lipid transport related genes LAPTM4B and NDRG1 are coamplified in breast cancer patients, and identified genes potentially cooperating with LAPTM4B in breast cancer progression. Our case study demonstrates that with Lir, an array of tools can be combined in the same data analysis to improve efficiency, reproducibility, and ease of understanding. Lir is an open-source software available at github.com/borisvassilev/lir.

  3. Reproducibility problems of in-service ultrasonic testing results

    International Nuclear Information System (INIS)

    Honcu, E.

    1974-01-01

    The reproducibility of the results of ultrasonic testing is the basic precondition for its successful application in in-service inspection of changes in the quality of components of nuclear power installations. The results of periodic ultrasonic inspections are not satisfactory from the point of view of reproducibility. Regardless, the ultrasonic pulse-type method is suitable for evaluating the quality of most components of nuclear installations and often the sole method which may be recommended for inspection with regard to its technical and economic aspects. (J.B.)

  4. Reproducibility of esophageal scintigraphy using semi-solid yoghurt

    Energy Technology Data Exchange (ETDEWEB)

    Imai, Yukinori; Kinoshita, Manabu; Asakura, Yasushi; Kakinuma, Tohru; Shimoji, Katsunori; Fujiwara, Kenji; Suzuki, Kenji; Miyamae, Tatsuya [Saitama Medical School, Moroyama (Japan)

    1999-10-01

    Esophageal scintigraphy is a non-invasive method which evaluate esophageal function quantitatively. We applied new technique using semi-solid yoghurt, which can evaluate esophageal function in a sitting position. To evaluate the reproducibility of this method, scintigraphy were performed in 16 healthy volunteers. From the result of four swallows except the first one, the mean coefficients of variation in esophageal transit time and esophageal emptying time were 12.8% and 13.4% respectively (interday variation). As regards the interday variation, this method had also good reproducibility from the result on the 2 separate days. (author)

  5. Reproducing Kernel Method for Solving Nonlinear Differential-Difference Equations

    Directory of Open Access Journals (Sweden)

    Reza Mokhtari

    2012-01-01

    Full Text Available On the basis of reproducing kernel Hilbert spaces theory, an iterative algorithm for solving some nonlinear differential-difference equations (NDDEs is presented. The analytical solution is shown in a series form in a reproducing kernel space, and the approximate solution , is constructed by truncating the series to terms. The convergence of , to the analytical solution is also proved. Results obtained by the proposed method imply that it can be considered as a simple and accurate method for solving such differential-difference problems.

  6. Reproducible and expedient rice regeneration system using in vitro ...

    African Journals Online (AJOL)

    Inevitable prerequisite for expedient regeneration in rice is the selection of totipotent explant and developing an apposite combination of growth hormones. Here, we reported a reproducible regeneration protocol in which basal segments of the stem of the in vitro grown rice plants were used as ex-plant. Using the protocol ...

  7. Composting in small laboratory pilots: Performance and reproducibility

    International Nuclear Information System (INIS)

    Lashermes, G.; Barriuso, E.; Le Villio-Poitrenaud, M.; Houot, S.

    2012-01-01

    Highlights: ► We design an innovative small-scale composting device including six 4-l reactors. ► We investigate the performance and reproducibility of composting on a small scale. ► Thermophilic conditions are established by self-heating in all replicates. ► Biochemical transformations, organic matter losses and stabilisation are realistic. ► The organic matter evolution exhibits good reproducibility for all six replicates. - Abstract: Small-scale reactors ( 2 consumption and CO 2 emissions, and characterising the biochemical evolution of organic matter. A good reproducibility was found for the six replicates with coefficients of variation for all parameters generally lower than 19%. An intense self-heating ensured the existence of a spontaneous thermophilic phase in all reactors. The average loss of total organic matter (TOM) was 46% of the initial content. Compared to the initial mixture, the hot water soluble fraction decreased by 62%, the hemicellulose-like fraction by 68%, the cellulose-like fraction by 50% and the lignin-like fractions by 12% in the final compost. The TOM losses, compost stabilisation and evolution of the biochemical fractions were similar to observed in large reactors or on-site experiments, excluding the lignin degradation, which was less important than in full-scale systems. The reproducibility of the process and the quality of the final compost make it possible to propose the use of this experimental device for research requiring a mass reduction of the initial composted waste mixtures.

  8. Intercenter reproducibility of binary typing for Staphylococcus aureus

    NARCIS (Netherlands)

    van Leeuwen, Willem B.; Snoeijers, Sandor; van der Werken-Libregts, Christel; Tuip, Anita; van der Zee, Anneke; Egberink, Diane; de Proost, Monique; Bik, Elisabeth; Lunter, Bjorn; Kluytmans, Jan; Gits, Etty; van Duyn, Inge; Heck, Max; van der Zwaluw, Kim; Wannet, Wim; Noordhoek, Gerda T.; Mulder, Sije; Renders, Nicole; Boers, Miranda; Zaat, Sebastiaan; van der Riet, Daniëlle; Kooistra, Mirjam; Talens, Adriaan; Dijkshoorn, Lenie; van der Reyden, Tanny; Veenendaal, Dick; Bakker, Nancy; Cookson, Barry; Lynch, Alisson; Witte, Wolfgang; Cuny, Christa; Blanc, Dominique; Vernez, Isabelle; Hryniewicz, Waleria; Fiett, Janusz; Struelens, Marc; Deplano, Ariane; Landegent, Jim; Verbrugh, Henri A.; van Belkum, Alex

    2002-01-01

    The reproducibility of the binary typing (BT) protocol developed for epidemiological typing of Staphylococcus aureus was analyzed in a biphasic multicenter study. In a Dutch multicenter pilot study, 10 genetically unique isolates of methicillin-resistant S. aureus (MRSA) were characterized by the BT

  9. Modeling and evaluating repeatability and reproducibility of ordinal classifications

    NARCIS (Netherlands)

    de Mast, J.; van Wieringen, W.N.

    2010-01-01

    This paper argues that currently available methods for the assessment of the repeatability and reproducibility of ordinal classifications are not satisfactory. The paper aims to study whether we can modify a class of models from Item Response Theory, well established for the study of the reliability

  10. ReproPhylo: An Environment for Reproducible Phylogenomics.

    Directory of Open Access Journals (Sweden)

    Amir Szitenberg

    2015-09-01

    Full Text Available The reproducibility of experiments is key to the scientific process, and particularly necessary for accurate reporting of analyses in data-rich fields such as phylogenomics. We present ReproPhylo, a phylogenomic analysis environment developed to ensure experimental reproducibility, to facilitate the handling of large-scale data, and to assist methodological experimentation. Reproducibility, and instantaneous repeatability, is built in to the ReproPhylo system and does not require user intervention or configuration because it stores the experimental workflow as a single, serialized Python object containing explicit provenance and environment information. This 'single file' approach ensures the persistence of provenance across iterations of the analysis, with changes automatically managed by the version control program Git. This file, along with a Git repository, are the primary reproducibility outputs of the program. In addition, ReproPhylo produces an extensive human-readable report and generates a comprehensive experimental archive file, both of which are suitable for submission with publications. The system facilitates thorough experimental exploration of both parameters and data. ReproPhylo is a platform independent CC0 Python module and is easily installed as a Docker image or a WinPython self-sufficient package, with a Jupyter Notebook GUI, or as a slimmer version in a Galaxy distribution.

  11. Reproducibility of abdominal fat assessment by ultrasound and computed tomography.

    Science.gov (United States)

    Mauad, Fernando Marum; Chagas-Neto, Francisco Abaeté; Benedeti, Augusto César Garcia Saab; Nogueira-Barbosa, Marcello Henrique; Muglia, Valdair Francisco; Carneiro, Antonio Adilton Oliveira; Muller, Enrico Mattana; Elias Junior, Jorge

    2017-01-01

    To test the accuracy and reproducibility of ultrasound and computed tomography (CT) for the quantification of abdominal fat in correlation with the anthropometric, clinical, and biochemical assessments. Using ultrasound and CT, we determined the thickness of subcutaneous and intra-abdominal fat in 101 subjects-of whom 39 (38.6%) were men and 62 (61.4%) were women-with a mean age of 66.3 years (60-80 years). The ultrasound data were correlated with the anthropometric, clinical, and biochemical parameters, as well as with the areas measured by abdominal CT. Intra-abdominal thickness was the variable for which the correlation with the areas of abdominal fat was strongest (i.e., the correlation coefficient was highest). We also tested the reproducibility of ultrasound and CT for the assessment of abdominal fat and found that CT measurements of abdominal fat showed greater reproducibility, having higher intraobserver and interobserver reliability than had the ultrasound measurements. There was a significant correlation between ultrasound and CT, with a correlation coefficient of 0.71. In the assessment of abdominal fat, the intraobserver and interobserver reliability were greater for CT than for ultrasound, although both methods showed high accuracy and good reproducibility.

  12. An empirical analysis of journal policy effectiveness for computational reproducibility.

    Science.gov (United States)

    Stodden, Victoria; Seiler, Jennifer; Ma, Zhaokun

    2018-03-13

    A key component of scientific communication is sufficient information for other researchers in the field to reproduce published findings. For computational and data-enabled research, this has often been interpreted to mean making available the raw data from which results were generated, the computer code that generated the findings, and any additional information needed such as workflows and input parameters. Many journals are revising author guidelines to include data and code availability. This work evaluates the effectiveness of journal policy that requires the data and code necessary for reproducibility be made available postpublication by the authors upon request. We assess the effectiveness of such a policy by ( i ) requesting data and code from authors and ( ii ) attempting replication of the published findings. We chose a random sample of 204 scientific papers published in the journal Science after the implementation of their policy in February 2011. We found that we were able to obtain artifacts from 44% of our sample and were able to reproduce the findings for 26%. We find this policy-author remission of data and code postpublication upon request-an improvement over no policy, but currently insufficient for reproducibility.

  13. Reproducibility of contrast-enhanced transrectal ultrasound of the prostate

    NARCIS (Netherlands)

    Sedelaar, J. P.; Goossen, T. E.; Wijkstra, H.; de la Rosette, J. J.

    2001-01-01

    Transrectal three-dimensional (3-D) contrast-enhanced power Doppler ultrasound (US) is a novel technique for studying possible prostate malignancy. Before studies can be performed to investigate the clinical validity of the technique, reproducibility of the contrast US studies must be proven.

  14. Reproducibility in the assessment of acute pancreatitis with computed tomography

    International Nuclear Information System (INIS)

    Freire Filho, Edison de Oliveira; Vieira, Renata La Rocca; Yamada, Andre Fukunishi; Shigueoka, David Carlos; Bekhor, Daniel; Freire, Maxime Figueiredo de Oliveira; Ajzen, Sergio; D'Ippolito, Giuseppe

    2007-01-01

    Objective: To evaluate the reproducibility of unenhanced and contrast-enhanced computed tomography in the assessment of patients with acute pancreatitis. Materials and methods: Fifty-one unenhanced and contrast-enhanced abdominal computed tomography studies of patients with acute pancreatitis were blindly reviewed by two radiologists (observers 1 and 2). The morphological index was separately calculated for unenhanced and contrast-enhanced computed tomography and the disease severity index was established. Intraobserver and interobserver reproducibility of computed tomography was measured by means of the kappa index (κ). Results: Interobserver agreement was κ 0.666, 0.705, 0.648, 0.547 and 0.631, respectively for unenhanced and contrast-enhanced morphological index, presence of pancreatic necrosis, pancreatic necrosis extension, and disease severity index. Intraobserver agreement (observers 1 and 2, respectively) was κ = 0.796 and 0.732 for unenhanced morphological index; κ 0.725 and 0.802 for contrast- enhanced morphological index; κ = 0.674 and 0.849 for presence of pancreatic necrosis; κ = 0.606 and 0.770 for pancreatic necrosis extension; and κ = 0.801 and 0.687 for disease severity index at computed tomography. Conclusion: Computed tomography for determination of morphological index and disease severity index in the staging of acute pancreatitis is a quite reproducible method. The absence of contrast- enhancement does not affect the computed tomography morphological index reproducibility. (author)

  15. Reproducible positioning in chest X-ray radiography

    International Nuclear Information System (INIS)

    1974-01-01

    A device is described that can be used to ensure reproducibility in the positioning of the patient during X-ray radiography of the thorax. Signals are taken from an electrocardiographic monitor and from a device recording the respiratory cycle. Radiography is performed only when two preselected signals coincide

  16. Reproducibility of Manual Platelet Estimation Following Automated Low Platelet Counts

    Directory of Open Access Journals (Sweden)

    Zainab S Al-Hosni

    2016-11-01

    Full Text Available Objectives: Manual platelet estimation is one of the methods used when automated platelet estimates are very low. However, the reproducibility of manual platelet estimation has not been adequately studied. We sought to assess the reproducibility of manual platelet estimation following automated low platelet counts and to evaluate the impact of the level of experience of the person counting on the reproducibility of manual platelet estimates. Methods: In this cross-sectional study, peripheral blood films of patients with platelet counts less than 100 × 109/L were retrieved and given to four raters to perform manual platelet estimation independently using a predefined method (average of platelet counts in 10 fields using 100× objective multiplied by 20. Data were analyzed using intraclass correlation coefficient (ICC as a method of reproducibility assessment. Results: The ICC across the four raters was 0.840, indicating excellent agreement. The median difference of the two most experienced raters was 0 (range: -64 to 78. The level of platelet estimate by the least-experienced rater predicted the disagreement (p = 0.037. When assessing the difference between pairs of raters, there was no significant difference in the ICC (p = 0.420. Conclusions: The agreement between different raters using manual platelet estimation was excellent. Further confirmation is necessary, with a prospective study using a gold standard method of platelet counts.

  17. Reproducibility of Quantitative Structural and Physiological MRI Measurements

    Science.gov (United States)

    2017-08-09

    project.org/) and SPSS (IBM Corp., Armonk, NY) for data analysis. Mean and confidence inter- vals for each measure are found in Tables 1–7. To assess...visits, and was calculated using a two- way mixed model in SPSS MCV and MRD values closer to 0 are considered to be the most reproducible, and ICC

  18. Reproducibility of abdominal fat assessment by ultrasound and computed tomography

    Energy Technology Data Exchange (ETDEWEB)

    Mauad, Fernando Marum; Chagas-Neto, Francisco Abaete; Benedeti, Augusto Cesar Garcia Saab; Nogueira-Barbosa, Marcello Henrique; Muglia, Valdair Francisco; Carneiro, Antonio Adilton Oliveira; Muller, Enrico Mattana; Elias Junior, Jorge, E-mail: fernando@fatesa.edu.br [Faculdade de Tecnologia em Saude (FATESA), Ribeirao Preto, SP (Brazil); Universidade de Fortaleza (UNIFOR), Fortaleza, CE (Brazil). Departmento de Radiologia; Universidade de Sao Paulo (FMRP/USP), Ribeirao Preto, SP (Brazil). Faculdade de Medicina. Departmento de Medicina Clinica; Universidade de Sao Paulo (FFCLRP/USP), Ribeirao Preto, SP (Brazil). Faculdade de Filosofia, Ciencias e Letras; Hospital Mae de Deus, Porto Alegre, RS (Brazil)

    2017-05-15

    Objective: To test the accuracy and reproducibility of ultrasound and computed tomography (CT) for the quantification of abdominal fat in correlation with the anthropometric, clinical, and biochemical assessments. Materials and Methods: Using ultrasound and CT, we determined the thickness of subcutaneous and intra-abdominal fat in 101 subjects-of whom 39 (38.6%) were men and 62 (61.4%) were women-with a mean age of 66.3 years (60-80 years). The ultrasound data were correlated with the anthropometric, clinical, and biochemical parameters, as well as with the areas measured by abdominal CT. Results: Intra-abdominal thickness was the variable for which the correlation with the areas of abdominal fat was strongest (i.e., the correlation coefficient was highest). We also tested the reproducibility of ultrasound and CT for the assessment of abdominal fat and found that CT measurements of abdominal fat showed greater reproducibility, having higher intraobserver and interobserver reliability than had the ultrasound measurements. There was a significant correlation between ultrasound and CT, with a correlation coefficient of 0.71. Conclusion: In the assessment of abdominal fat, the intraobserver and interobserver reliability were greater for CT than for ultrasound, although both methods showed high accuracy and good reproducibility. (author)

  19. High Reproducibility of ELISPOT Counts from Nine Different Laboratories

    DEFF Research Database (Denmark)

    Sundararaman, Srividya; Karulin, Alexey Y; Ansari, Tameem

    2015-01-01

    The primary goal of immune monitoring with ELISPOT is to measure the number of T cells, specific for any antigen, accurately and reproducibly between different laboratories. In ELISPOT assays, antigen-specific T cells secrete cytokines, forming spots of different sizes on a membrane with variable...

  20. Reproducibility of the Pleth Variability Index in premature infants

    NARCIS (Netherlands)

    Den Boogert, W.J. (Wilhelmina J.); H.A. van Elteren (Hugo); T.G. Goos (Tom); I.K.M. Reiss (Irwin); R.C.J. de Jonge (Rogier); V.J. van den Berg (Victor J.)

    2017-01-01

    textabstractThe aim was to assess the reproducibility of the Pleth Variability Index (PVI), developed for non-invasive monitoring of peripheral perfusion, in preterm neonates below 32 weeks of gestational age. Three PVI measurements were consecutively performed in stable, comfortable preterm

  1. Reproducibility of the Pleth Variability Index in premature infants

    NARCIS (Netherlands)

    Den Boogert, Wilhelmina J.; Van Elteren, Hugo A.; Goos, T.G.; Reiss, Irwin K.M.; De Jonge, Rogier C.J.; van Den Berg, Victor J.

    2017-01-01

    The aim was to assess the reproducibility of the Pleth Variability Index (PVI), developed for non-invasive monitoring of peripheral perfusion, in preterm neonates below 32 weeks of gestational age. Three PVI measurements were consecutively performed in stable, comfortable preterm neonates in the

  2. Annotating with Propp's Morphology of the Folktale: Reproducibility and Trainability

    NARCIS (Netherlands)

    Fisseni, B.; Kurji, A.; Löwe, B.

    2014-01-01

    We continue the study of the reproducibility of Propp’s annotations from Bod et al. (2012). We present four experiments in which test subjects were taught Propp’s annotation system; we conclude that Propp’s system needs a significant amount of training, but that with sufficient time investment, it

  3. Exploring the Coming Repositories of Reproducible Experiments: Challenges and Opportunities

    DEFF Research Database (Denmark)

    Freire, Juliana; Bonnet, Philippe; Shasha, Dennis

    2011-01-01

    Computational reproducibility efforts in many communities will soon give rise to validated software and data repositories of high quality. A scientist in a field may want to query the components of such repositories to build new software workflows, perhaps after adding the scientist’s own algorithms...

  4. Reproducibility of airway luminal size in asthma measured by HRCT.

    Science.gov (United States)

    Brown, Robert H; Henderson, Robert J; Sugar, Elizabeth A; Holbrook, Janet T; Wise, Robert A

    2017-10-01

    Brown RH, Henderson RJ, Sugar EA, Holbrook JT, Wise RA, on behalf of the American Lung Association Airways Clinical Research Centers. Reproducibility of airway luminal size in asthma measured by HRCT. J Appl Physiol 123: 876-883, 2017. First published July 13, 2017; doi:10.1152/japplphysiol.00307.2017.-High-resolution CT (HRCT) is a well-established imaging technology used to measure lung and airway morphology in vivo. However, there is a surprising lack of studies examining HRCT reproducibility. The CPAP Trial was a multicenter, randomized, three-parallel-arm, sham-controlled 12-wk clinical trial to assess the use of a nocturnal continuous positive airway pressure (CPAP) device on airway reactivity to methacholine. The lack of a treatment effect of CPAP on clinical or HRCT measures provided an opportunity for the current analysis. We assessed the reproducibility of HRCT imaging over 12 wk. Intraclass correlation coefficients (ICCs) were calculated for individual airway segments, individual lung lobes, both lungs, and air trapping. The ICC [95% confidence interval (CI)] for airway luminal size at total lung capacity ranged from 0.95 (0.91, 0.97) to 0.47 (0.27, 0.69). The ICC (95% CI) for airway luminal size at functional residual capacity ranged from 0.91 (0.85, 0.95) to 0.32 (0.11, 0.65). The ICC measurements for airway distensibility index and wall thickness were lower, ranging from poor (0.08) to moderate (0.63) agreement. The ICC for air trapping at functional residual capacity was 0.89 (0.81, 0.94) and varied only modestly by lobe from 0.76 (0.61, 0.87) to 0.95 (0.92, 0.97). In stable well-controlled asthmatic subjects, it is possible to reproducibly image unstimulated airway luminal areas over time, by region, and by size at total lung capacity throughout the lungs. Therefore, any changes in luminal size on repeat CT imaging are more likely due to changes in disease state and less likely due to normal variability. NEW & NOTEWORTHY There is a surprising lack

  5. Audiovisual biofeedback improves diaphragm motion reproducibility in MRI

    Science.gov (United States)

    Kim, Taeho; Pollock, Sean; Lee, Danny; O’Brien, Ricky; Keall, Paul

    2012-01-01

    Purpose: In lung radiotherapy, variations in cycle-to-cycle breathing results in four-dimensional computed tomography imaging artifacts, leading to inaccurate beam coverage and tumor targeting. In previous studies, the effect of audiovisual (AV) biofeedback on the external respiratory signal reproducibility has been investigated but the internal anatomy motion has not been fully studied. The aim of this study is to test the hypothesis that AV biofeedback improves diaphragm motion reproducibility of internal anatomy using magnetic resonance imaging (MRI). Methods: To test the hypothesis 15 healthy human subjects were enrolled in an ethics-approved AV biofeedback study consisting of two imaging sessions spaced ∼1 week apart. Within each session MR images were acquired under free breathing and AV biofeedback conditions. The respiratory signal to the AV biofeedback system utilized optical monitoring of an external marker placed on the abdomen. Synchronously, serial thoracic 2D MR images were obtained to measure the diaphragm motion using a fast gradient-recalled-echo MR pulse sequence in both coronal and sagittal planes. The improvement in the diaphragm motion reproducibility using the AV biofeedback system was quantified by comparing cycle-to-cycle variability in displacement, respiratory period, and baseline drift. Additionally, the variation in improvement between the two sessions was also quantified. Results: The average root mean square error (RMSE) of diaphragm cycle-to-cycle displacement was reduced from 2.6 mm with free breathing to 1.6 mm (38% reduction) with the implementation of AV biofeedback (p-value biofeedback (p-value biofeedback (p-value = 0.012). The diaphragm motion reproducibility improvements with AV biofeedback were consistent with the abdominal motion reproducibility that was observed from the external marker motion variation. Conclusions: This study was the first to investigate the potential of AV biofeedback to improve the motion

  6. Reproducibility of the 133Xe inhalation technique in resting studies: task order and sex related effects in healthy young adults

    International Nuclear Information System (INIS)

    Warach, S.; Gur, R.C.; Gur, R.E.; Skolnick, B.E.; Obrist, W.D.; Reivich, M.

    1987-01-01

    Repeated applications of the 133 Xe inhalation technique for measuring regional CBF (rCBF) were made during consecutive resting conditions in a sample of young healthy subjects. Subjects were grouped by order and by sex [nine had resting studies as the initial two measurements in a series of four measurement (six men, three women) and six had these measurements later (two men, four women)]. Three flow parameters were examined: f1 (fast flow) and IS (initial slope) for gray matter CBF, and CBF-15 for mean CBF (gray and white matter over 15-min integration), as well as w1, the percentage of tissue with fast clearing characteristics. With all groups combined, there were no significant differences between the two resting measurements, and high test-retest correlations were obtained for the flow parameters and w1. Analyses by order and sex grouping revealed, for the flow parameters, significant interactions of test-retest difference with order. Repeated initial studies showed reduced CBF from the first to second measurement, whereas resting studies performed later in the series showed no reduction. Interactions for test-retest difference with sex indicated that reduced CBF in serial measures was more pronounced for women. No hemispheric or regional specificity to account for these effects was found. Correction for PaCO 2 differences did not alter these results. The results resemble data regarding habituation effects measured for other psychophysiologic measures, and suggest that reduction in CBF for consecutive measurements made on the same day may reflect habituation. This underscores the importance of controlling for effects of habituation on serial measurements of CBF and metabolism

  7. Reproducible analyses of microbial food for advanced life support systems

    Science.gov (United States)

    Petersen, Gene R.

    1988-01-01

    The use of yeasts in controlled ecological life support systems (CELSS) for microbial food regeneration in space required the accurate and reproducible analysis of intracellular carbohydrate and protein levels. The reproducible analysis of glycogen was a key element in estimating overall content of edibles in candidate yeast strains. Typical analytical methods for estimating glycogen in Saccharomyces were not found to be entirely aplicable to other candidate strains. Rigorous cell lysis coupled with acid/base fractionation followed by specific enzymatic glycogen analyses were required to obtain accurate results in two strains of Candida. A profile of edible fractions of these strains was then determined. The suitability of yeasts as food sources in CELSS food production processes is discussed.

  8. Tools for Reproducibility and Extensibility in Scientific Research

    CERN Multimedia

    CERN. Geneva

    2018-01-01

    Open inquiry through reproducing results is fundamental to the scientific process. Contemporary research relies on software engineering pipelines to collect, process, and analyze data. The open source projects within Project Jupyter facilitate these objectives by bringing software engineering within the context of scientific communication. We will highlight specific projects that are computational building blocks for scientific communication, starting with the Jupyter Notebook. We will also explore applications of projects that build off of the Notebook such as Binder, JupyterHub, and repo2docker. We will discuss how these projects can individually and jointly improve reproducibility in scientific communication. Finally, we will demonstrate applications of Jupyter software that allow researchers to build upon the code of other scientists, both to extend their work and the work of others.    There will be a follow-up demo session in the afternoon, hosted by iML. Details can be foun...

  9. MASSIVE DATA, THE DIGITIZATION OF SCIENCE, AND REPRODUCIBILITY OF RESULTS

    CERN Multimedia

    CERN. Geneva

    2010-01-01

    As the scientific enterprise becomes increasingly computational and data-driven, the nature of the information communicated must change. Without inclusion of the code and data with published computational results, we are engendering a credibility crisis in science. Controversies such as ClimateGate, the microarray-based drug sensitivity clinical trials under investigation at Duke University, and retractions from prominent journals due to unverified code suggest the need for greater transparency in our computational science. In this talk I argue that the scientific method be restored to (1) a focus on error control as central to scientific communication and (2) complete communication of the underlying methodology producing the results, ie. reproducibility. I outline barriers to these goals based on recent survey work (Stodden 2010), and suggest solutions such as the “Reproducible Research Standard” (Stodden 2009), giving open licensing options designed to create an intellectual property framework for scien...

  10. Reproducibility of Mammography Units, Film Processing and Quality Imaging

    International Nuclear Information System (INIS)

    Gaona, Enrique

    2003-01-01

    The purpose of this study was to carry out an exploratory survey of the problems of quality control in mammography and processors units as a diagnosis of the current situation of mammography facilities. Measurements of reproducibility, optical density, optical difference and gamma index are included. Breast cancer is the most frequently diagnosed cancer and is the second leading cause of cancer death among women in the Mexican Republic. Mammography is a radiographic examination specially designed for detecting breast pathology. We found that the problems of reproducibility of AEC are smaller than the problems of processors units because almost all processors fall outside of the acceptable variation limits and they can affect the mammography quality image and the dose to breast. Only four mammography units agree with the minimum score established by ACR and FDA for the phantom image

  11. Reproducibility of CT bone dosimetry: Operator versus automated ROI definition

    International Nuclear Information System (INIS)

    Louis, O.; Luypaert, R.; Osteaux, M.; Kalender, W.

    1988-01-01

    Intrasubject reproducibility with repeated determination of vertebral mineral density from a given set of CT images was investigated. The region of interest (ROI) in 10 patient scans was selected by four independent operators either manually or with an automated procedure separating cortical and spongeous bone, the operators being requested to interact in ROI selection. The mean intrasubject variation was found to be much lower with the automated process (0.3 to 0.6%) than with the conventional method (2.5 to 5.2%). In a second study, 10 patients were examined twice to determine the reproducibility of CT slice selection by the operator. The errors were of the same order of magnitude as in ROI selection. (orig.)

  12. Timbral aspects of reproduced sound in small rooms. I

    DEFF Research Database (Denmark)

    Bech, Søren

    1995-01-01

    , has been simulated using an electroacoustic setup. The model included the direct sound, 17 individual reflections, and the reverberant field. The threshold of detection and just-noticeable differences for an increase in level were measured for individual reflections using eight subjects for noise......This paper reports some of the influences of individual reflections on the timbre of reproduced sound. A single loudspeaker with frequency-independent directivity characteristics, positioned in a listening room of normal size with frequency-independent absorption coefficients of the room surfaces...... and speech. The results have shown that the first-order floor and ceiling reflections are likely to individually contribute to the timbre of reproduced speech. For a noise signal, additional reflections from the left sidewall will contribute individually. The level of the reverberant field has been found...

  13. Transition questions in clinical practice - validity and reproducibility

    DEFF Research Database (Denmark)

    Lauridsen, Henrik Hein

    2008-01-01

    Transition questions in CLINICAL practice - validity and reproducibility Lauridsen HH1, Manniche C3, Grunnet-Nilsson N1, Hartvigsen J1,2 1   Clinical Locomotion Science, Institute of Sports Science and Clinical Biomechanics, University of Southern Denmark, Odense, Denmark. e-mail: hlauridsen......@health.sdu.dk 2   Nordic Institute of Chiropractic and Clinical Biomechanics, Part of Clinical Locomotion Science, Odense, Denmark 3   Backcenter Funen, Part of Clinical Locomotion Science, Ringe, Denmark   Abstract  Understanding a change score is indispensable for interpretation of results from clinical studies...... are reproducible in patients with low back pain and/or leg pain. Despite critique of several biases, our results have reinforced the construct validity of TQ’s as an outcome measure since only 1 hypothesis was rejected. On the basis of our findings we have outlined a proposal for a standardised use of transition...

  14. Properties of galaxies reproduced by a hydrodynamic simulation

    Science.gov (United States)

    Vogelsberger, M.; Genel, S.; Springel, V.; Torrey, P.; Sijacki, D.; Xu, D.; Snyder, G.; Bird, S.; Nelson, D.; Hernquist, L.

    2014-05-01

    Previous simulations of the growth of cosmic structures have broadly reproduced the `cosmic web' of galaxies that we see in the Universe, but failed to create a mixed population of elliptical and spiral galaxies, because of numerical inaccuracies and incomplete physical models. Moreover, they were unable to track the small-scale evolution of gas and stars to the present epoch within a representative portion of the Universe. Here we report a simulation that starts 12 million years after the Big Bang, and traces 13 billion years of cosmic evolution with 12 billion resolution elements in a cube of 106.5 megaparsecs a side. It yields a reasonable population of ellipticals and spirals, reproduces the observed distribution of galaxies in clusters and characteristics of hydrogen on large scales, and at the same time matches the `metal' and hydrogen content of galaxies on small scales.

  15. LHC Orbit Correction Reproducibility and Related Machine Protection

    CERN Document Server

    Baer, T; Schmidt, R; Wenninger, J

    2012-01-01

    The Large Hadron Collider (LHC) has an unprecedented nominal stored beam energy of up to 362 MJ per beam. In order to ensure an adequate machine protection by the collimation system, a high reproducibility of the beam position at collimators and special elements like the final focus quadrupoles is essential. This is realized by a combination of manual orbit corrections, feed forward and real time feedback. In order to protect the LHC against inconsistent orbit corrections, which could put the machine in a vulnerable state, a novel software-based interlock system for orbit corrector currents was developed. In this paper, the principle of the new interlock system is described and the reproducibility of the LHC orbit correction is discussed against the background of this system.

  16. Towards reproducibility of research by reuse of IT best practices

    CERN Multimedia

    CERN. Geneva

    2013-01-01

    Reproducibility of any research gives much higher credibility both to research results and to the researchers. This is true for any kind of research including computer science, where a lot of tools and approaches have been developed to ensure reproducibility. In this talk I will focus on basic and seemingly simple principles, which sometimes look too obvious to follow, but help researchers build beautiful and reliable systems that produce consistent, measurable results. My talk will cover, among other things, the problem of embedding machine learning techniques into analysis strategy. I will also speak about the most common pitfalls in this process and how to avoid them. In addition, I will demonstrate the research environment based on the principles that I will have outlined. About the speaker Andrey Ustyuzhanin (36) is Head of CERN partnership program at Yandex. He is involved in the development of event indexing and event filtering services which Yandex has been providing for the LHCb experiment sinc...

  17. Reproducibility of Computer-Aided Detection Marks in Digital Mammography

    International Nuclear Information System (INIS)

    Kim, Seung Ja; Moon, Woo Kyung; Cho, Nariya; Kim, Sun Mi; Im, Jung Gi; Cha, Joo Hee

    2007-01-01

    To evaluate the performance and reproducibility of a computeraided detection (CAD) system in mediolateral oblique (MLO) digital mammograms taken serially, without release of breast compression. A CAD system was applied preoperatively to the fulfilled digital mammograms of two MLO views taken without release of breast compression in 82 patients (age range: 33 83 years; mean age: 49 years) with previously diagnosed breast cancers. The total number of visible lesion components in 82 patients was 101: 66 masses and 35 microcalcifications. We analyzed the sensitivity and reproducibility of the CAD marks. The sensitivity of the CAD system for first MLO views was 71% (47/66) for masses and 80% (28/35) for microcalcifications. The sensitivity of the CAD system for second MLO views was 68% (45/66) for masses and 17% (6/35) for microcalcifications. In 84 ipsilateral serial MLO image sets (two patients had bilateral cancers), identical images, regardless of the existence of CAD marks, were obtained for 35% (29/84) and identical images with CAD marks were obtained for 29% (23/78). Identical images, regardless of the existence of CAD marks, for contralateral MLO images were 65% (52/80) and identical images with CAD marks were obtained for 28% (11/39). The reproducibility of CAD marks for the true positive masses in serial MLO views was 84% (42/50) and that for the true positive microcalcifications was 0% (0/34). The CAD system in digital mammograms showed a high sensitivity for detecting masses and microcalcifications. However, reproducibility of microcalcification marks was very low in MLO views taken serially without release of breast compression. Minute positional change and patient movement can alter the images and result in a significant effect on the algorithm utilized by the CAD for detecting microcalcifications

  18. The reproducibility of single photon absorptiometry in a clinical setting

    International Nuclear Information System (INIS)

    Valkema, R.; Blokland, J.A.K.; Pauwels, E.K.J.; Papapoulos, S.E.; Bijvoet, O.L.M.

    1989-01-01

    The reproducibility of single photon absorptiometry (SPA) results for detection of changes in bone mineral content (BMC) was evaluated in a clinical setting. During a period of 18 months with 4 different sources, the calibration scans of an aluminium standard had a variation of less than 1% unless the activity of the 125 I source was low. The calibration procedure was performed weekly and this was sufficient to correct for drift of the system. The short term reproducibility in patients was assessed with 119 duplicate measurements made in direct succession. The best reproducibility (CV=1.35%) was found for fat corrected BMC results expressed in g/cm, obtained at the site proximal to the 8 mm space between the radius and ulna. Analysis of all SPA scans made during 1 year (487 scans) showed a failure of the automatic procedure to detect the space of 8 mm between the forearm bones in 19 scans (3.9%). A space adjacent to the ulnar styloid was taken as the site for the first scan in these examinations. This problem may be recognized and corrected relatively easy. A significant correlation was found between BMC at the lower arm and BMC of the lumbar spine assessed with dual photon absorptiometry. However, the error of estimation of proximal BMC (SEE=20%) and distal BMC (SEE=19.4%) made these measurements of little value to predict BMC at the lumbar spine in individuals. The short term reproducibility in patients combined with long term stability of the equipment in our clinical setting showed that SPA is a reliable technique to assess changes in bone mass at the lower arm of 4% between 2 measurements with a confidence level of 95%. (orig.)

  19. Automated Generation of Technical Documentation and Provenance for Reproducible Research

    Science.gov (United States)

    Jolly, B.; Medyckyj-Scott, D.; Spiekermann, R.; Ausseil, A. G.

    2017-12-01

    Data provenance and detailed technical documentation are essential components of high-quality reproducible research, however are often only partially addressed during a research project. Recording and maintaining this information during the course of a project can be a difficult task to get right as it is a time consuming and often boring process for the researchers involved. As a result, provenance records and technical documentation provided alongside research results can be incomplete or may not be completely consistent with the actual processes followed. While providing access to the data and code used by the original researchers goes some way toward enabling reproducibility, this does not count as, or replace, data provenance. Additionally, this can be a poor substitute for good technical documentation and is often more difficult for a third-party to understand - particularly if they do not understand the programming language(s) used. We present and discuss a tool built from the ground up for the production of well-documented and reproducible spatial datasets that are created by applying a series of classification rules to a number of input layers. The internal model of the classification rules required by the tool to process the input data is exploited to also produce technical documentation and provenance records with minimal additional user input. Available provenance records that accompany input datasets are incorporated into those that describe the current process. As a result, each time a new iteration of the analysis is performed the documentation and provenance records are re-generated to provide an accurate description of the exact process followed. The generic nature of this tool, and the lessons learned during its creation, have wider application to other fields where the production of derivative datasets must be done in an open, defensible, and reproducible way.

  20. Towards reproducible MSMS data preprocessing, quality control and quantification

    OpenAIRE

    Gatto, Laurent; Lilley, Kathryn S.

    2010-01-01

    The development of MSnbase aims at providing researchers dealing with labelled quantitative proteomics data with a transparent, portable, extensible and open-source collaborative framework to easily manipulate and analyse MS2-level raw tandem mass spectrometry data. The implementation in R gives users and developers a great variety of powerful tools to be used in a controlled and reproducible way. Furthermore, MSnbase has been developed following an object-oriented programming paradigm: all i...

  1. Cuban strategy for reproducing, preserving and developing nuclear knowledge

    International Nuclear Information System (INIS)

    Elias Hardy, L.L.; Guzman Martinez, F.; Rodriguez Hoyos, O.E.; Lopez Nunez, A.F.

    2006-01-01

    One of the problems in the changing world is the preservation of knowledge for the next human generation, and nuclear knowledge is not an exception. Cuba has worked for reproducing, preserving, developing and capturing nuclear knowledge, mainly through a higher education centre, the Higher Institute of Nuclear Sciences and Technologies. This institute is a component of a national network in the preparation of manpower not only for nuclear activities but also for environmental and managerial activities too. (author)

  2. Regulating Ultrasound Cavitation in order to Induce Reproducible Sonoporation

    Science.gov (United States)

    Mestas, J.-L.; Alberti, L.; El Maalouf, J.; Béra, J.-C.; Gilles, B.

    2010-03-01

    Sonoporation would be linked to cavitation, which generally appears to be a non reproducible and unstationary phenomenon. In order to obtain an acceptable trade-off between cell mortality and transfection, a regulated cavitation generator based on an acoustical cavitation measurement was developed and tested. The medium to be sonicated is placed in a sample tray. This tray is immersed in in degassed water and positioned above the face of a flat ultrasonic transducer (frequency: 445 kHz; intensity range: 0.08-1.09 W/cm2). This technical configuration was admitted to be conducive to standing-wave generation through reflection at the air/medium interface in the well thus enhancing the cavitation phenomenon. Laterally to the transducer, a homemade hydrophone was oriented to receive the acoustical signal from the bubbles. From this spectral signal recorded at intervals of 5 ms, a cavitation index was calculated as the mean of the cavitation spectrum integration in a logarithmic scale, and the excitation power is automatically corrected. The device generates stable and reproducible cavitation level for a wide range of cavitation setpoint from stable cavitation condition up to full-developed inertial cavitation. For the ultrasound intensity range used, the time delay of the response is lower than 200 ms. The cavitation regulation device was evaluated in terms of chemical bubble collapse effect. Hydroxyl radical production was measured on terephthalic acid solutions. In open loop, the results present a great variability whatever the excitation power. On the contrary the closed loop allows a great reproducibility. This device was implemented for study of sonodynamic effect. The regulation provides more reproducible results independent of cell medium and experimental conditions (temperature, pressure). Other applications of this regulated cavitation device concern internalization of different particles (Quantum Dot) molecules (SiRNA) or plasmids (GFP, DsRed) into different

  3. Serous tubal intraepithelial carcinoma: diagnostic reproducibility and its implications.

    Science.gov (United States)

    Carlson, Joseph W; Jarboe, Elke A; Kindelberger, David; Nucci, Marisa R; Hirsch, Michelle S; Crum, Christopher P

    2010-07-01

    Serous tubal intraepithelial carcinoma (STIC) is detected in between 5% and 7% of women undergoing risk-reduction salpingooophorectomy for mutations in the BRCA1 or 2 genes (BRCA+), and seems to play a role in the pathogenesis of many ovarian and "primary peritoneal" serous carcinomas. The recognition of STIC is germane to the management of BRCA+ women; however, the diagnostic reproducibility of STIC is unknown. Twenty-one cases were selected and classified as STIC or benign, using both hematoxylin and eosin and immunohistochemical stains for p53 and MIB-1. Digital images of 30 hematoxylin and eosin-stained STICs (n=14) or benign tubal epithelium (n=16) were photographed and randomized for blind digital review in a Powerpoint format by 6 experienced gynecologic pathologists and 6 pathology trainees. A generalized kappa statistic for multiple raters was calculated for all groups. For all reviewers, the kappa was 0.333, indicating poor reproducibility; kappa was 0.453 for the experienced gynecologic pathologists (fair-to-good reproducibility), and kappa=0.253 for the pathology residents (poor reproducibility). In the experienced group, 3 of 14 STICs were diagnosed by all 6 reviewers, and 9 of 14 by a majority of the reviewers. These results show that interobserver concordance in the recognition of STIC in high-quality digital images is at best fair-to-good for even experienced gynecologic pathologists, and a proportion cannot be consistently identified even among experienced observers. In view of these findings, a diagnosis of STIC should be corroborated by a second pathologist, if feasible.

  4. Adaptive Learning in Cartesian Product of Reproducing Kernel Hilbert Spaces

    OpenAIRE

    Yukawa, Masahiro

    2014-01-01

    We propose a novel adaptive learning algorithm based on iterative orthogonal projections in the Cartesian product of multiple reproducing kernel Hilbert spaces (RKHSs). The task is estimating/tracking nonlinear functions which are supposed to contain multiple components such as (i) linear and nonlinear components, (ii) high- and low- frequency components etc. In this case, the use of multiple RKHSs permits a compact representation of multicomponent functions. The proposed algorithm is where t...

  5. Reproducibility Test for Thermoluminescence Dosimeter (TLD) Using TLD Radpro

    International Nuclear Information System (INIS)

    Nur Khairunisa Zahidi; Ahmad Bazlie Abdul Kadir; Faizal Azrin Abdul Razalim

    2016-01-01

    Thermoluminescence dosimeters (TLD) as one type of dosimeter which are often used to substitute the film badge. Like a film badge, it is worn for a period of time and then must be processed to determine the dose received. This study was to test the reproducibility of TLD using Radpro reader. This study aimed to determine the dose obtained by TLD-100 chips when irradiated with Co-60 gamma source and to test the effectiveness of TLD Radpro reader as a machine to analyse the TLD. Ten chips of TLD -100 were irradiated using Eldorado machine with Co-60 source at a distance of 5 meters from the source with 2 mSv dose exposure. After the irradiation process, TLD-100 chips were read using the TLD Radpro reader. These steps will be repeated for nine times to obtain reproducibility coefficient, r i . The readings of dose obtained from experiment was almost equivalent to the actual dose. Results shows that the average value obtained for reproducibility coefficient, r i is 6.39 % which is less than 10 %. As conclusion, the dose obtained from experiment considered accurate because its value were almost equivalent to the actual dose and TLD Radpro was verified as a good reader to analyse the TLD. (author)

  6. Reproducibility of gene expression across generations of Affymetrix microarrays

    Directory of Open Access Journals (Sweden)

    Haslett Judith N

    2003-06-01

    Full Text Available Abstract Background The development of large-scale gene expression profiling technologies is rapidly changing the norms of biological investigation. But the rapid pace of change itself presents challenges. Commercial microarrays are regularly modified to incorporate new genes and improved target sequences. Although the ability to compare datasets across generations is crucial for any long-term research project, to date no means to allow such comparisons have been developed. In this study the reproducibility of gene expression levels across two generations of Affymetrix GeneChips® (HuGeneFL and HG-U95A was measured. Results Correlation coefficients were computed for gene expression values across chip generations based on different measures of similarity. Comparing the absolute calls assigned to the individual probe sets across the generations found them to be largely unchanged. Conclusion We show that experimental replicates are highly reproducible, but that reproducibility across generations depends on the degree of similarity of the probe sets and the expression level of the corresponding transcript.

  7. Reproducibility of the Portuguese version of the PEDro Scale

    Directory of Open Access Journals (Sweden)

    Silvia Regina Shiwa

    2011-10-01

    Full Text Available The objective of this study was to test the inter-rater reproducibility of the Portuguese version of the PEDro Scale. Seven physiotherapists rated the methodological quality of 50 reports of randomized controlled trials written in Portuguese indexed on the PEDro database. Each report was also rated using the English version of the PEDro Scale. Reproducibility was evaluated by comparing two separate ratings of reports written in Portuguese and comparing the Portuguese PEDro score with the English version of the scale. Kappa coefficients ranged from 0.53 to 1.00 for individual item and an intraclass correlation coefficient (ICC of 0.82 for the total PEDro score was observed. The standard error of the measurement of the scale was 0.58. The Portuguese version of the scale was comparable with the English version, with an ICC of 0.78. The inter-rater reproducibility of the Brazilian Portuguese PEDro Scale is adequate and similar to the original English version.

  8. Can cancer researchers accurately judge whether preclinical reports will reproduce?

    Directory of Open Access Journals (Sweden)

    Daniel Benjamin

    2017-06-01

    Full Text Available There is vigorous debate about the reproducibility of research findings in cancer biology. Whether scientists can accurately assess which experiments will reproduce original findings is important to determining the pace at which science self-corrects. We collected forecasts from basic and preclinical cancer researchers on the first 6 replication studies conducted by the Reproducibility Project: Cancer Biology (RP:CB to assess the accuracy of expert judgments on specific replication outcomes. On average, researchers forecasted a 75% probability of replicating the statistical significance and a 50% probability of replicating the effect size, yet none of these studies successfully replicated on either criterion (for the 5 studies with results reported. Accuracy was related to expertise: experts with higher h-indices were more accurate, whereas experts with more topic-specific expertise were less accurate. Our findings suggest that experts, especially those with specialized knowledge, were overconfident about the RP:CB replicating individual experiments within published reports; researcher optimism likely reflects a combination of overestimating the validity of original studies and underestimating the difficulties of repeating their methodologies.

  9. On the origin of reproducible sequential activity in neural circuits

    Science.gov (United States)

    Afraimovich, V. S.; Zhigulin, V. P.; Rabinovich, M. I.

    2004-12-01

    Robustness and reproducibility of sequential spatio-temporal responses is an essential feature of many neural circuits in sensory and motor systems of animals. The most common mathematical images of dynamical regimes in neural systems are fixed points, limit cycles, chaotic attractors, and continuous attractors (attractive manifolds of neutrally stable fixed points). These are not suitable for the description of reproducible transient sequential neural dynamics. In this paper we present the concept of a stable heteroclinic sequence (SHS), which is not an attractor. SHS opens the way for understanding and modeling of transient sequential activity in neural circuits. We show that this new mathematical object can be used to describe robust and reproducible sequential neural dynamics. Using the framework of a generalized high-dimensional Lotka-Volterra model, that describes the dynamics of firing rates in an inhibitory network, we present analytical results on the existence of the SHS in the phase space of the network. With the help of numerical simulations we confirm its robustness in presence of noise in spite of the transient nature of the corresponding trajectories. Finally, by referring to several recent neurobiological experiments, we discuss possible applications of this new concept to several problems in neuroscience.

  10. Aveiro method in reproducing kernel Hilbert spaces under complete dictionary

    Science.gov (United States)

    Mai, Weixiong; Qian, Tao

    2017-12-01

    Aveiro Method is a sparse representation method in reproducing kernel Hilbert spaces (RKHS) that gives orthogonal projections in linear combinations of reproducing kernels over uniqueness sets. It, however, suffers from determination of uniqueness sets in the underlying RKHS. In fact, in general spaces, uniqueness sets are not easy to be identified, let alone the convergence speed aspect with Aveiro Method. To avoid those difficulties we propose an anew Aveiro Method based on a dictionary and the matching pursuit idea. What we do, in fact, are more: The new Aveiro method will be in relation to the recently proposed, the so called Pre-Orthogonal Greedy Algorithm (P-OGA) involving completion of a given dictionary. The new method is called Aveiro Method Under Complete Dictionary (AMUCD). The complete dictionary consists of all directional derivatives of the underlying reproducing kernels. We show that, under the boundary vanishing condition, bring available for the classical Hardy and Paley-Wiener spaces, the complete dictionary enables an efficient expansion of any given element in the Hilbert space. The proposed method reveals new and advanced aspects in both the Aveiro Method and the greedy algorithm.

  11. Validity and reproducibility of a Spanish dietary history.

    Directory of Open Access Journals (Sweden)

    Pilar Guallar-Castillón

    Full Text Available To assess the validity and reproducibility of food and nutrient intake estimated with the electronic diet history of ENRICA (DH-E, which collects information on numerous aspects of the Spanish diet.The validity of food and nutrient intake was estimated using Pearson correlation coefficients between the DH-E and the mean of seven 24-hour recalls collected every 2 months over the previous year. The reproducibility was estimated using intraclass correlation coefficients between two DH-E made one year apart.The correlations coefficients between the DH-E and the mean of seven 24-hour recalls for the main food groups were cereals (r = 0.66, meat (r = 0.66, fish (r = 0.42, vegetables (r = 0.62 and fruits (r = 0.44. The mean correlation coefficient for all 15 food groups considered was 0.53. The correlations for macronutrients were: energy (r = 0.76, proteins (r= 0.58, lipids (r = 0.73, saturated fat (r = 0.73, monounsaturated fat (r = 0.59, polyunsaturated fat (r = 0.57, and carbohydrates (r = 0.66. The mean correlation coefficient for all 41 nutrients studied was 0.55. The intraclass correlation coefficient between the two DH-E was greater than 0.40 for most foods and nutrients.The DH-E shows good validity and reproducibility for estimating usual intake of foods and nutrients.

  12. The Reproducibility of Nuclear Morphometric Measurements in Invasive Breast Carcinoma

    Directory of Open Access Journals (Sweden)

    Pauliina Kronqvist

    1997-01-01

    Full Text Available The intraobserver and interobserver reproducibility of computerized nuclear morphometry was determined in repeated measurements of 212 samples of invasive breast cancer. The influence of biological variation and the selection of the measurement area was also tested. Morphometrically determined mean nuclear profile area (Pearson’s r 0.89, grading efficiency (GE 0.95 and standard deviation (SD of nuclear profile area (Pearson’s r 0.84, GE 0.89 showed high reproducibility. In this respect, nuclear morphometry equals with other established methods of quantitative pathology and exceeds the results of subjective grading of nuclear atypia in invasive breast cancer. A training period of eight days was sufficient to produce clear improvement in consistency of nuclear morphometry results. By estimating the sources of variation it could be shown that the variation associated with the measurement procedure itself is small. Instead, sample associated variation is responsible for the majority of variation in the measurements (82.9% in mean nuclear profile area and 65.9% in SD of nuclear profile area. This study points out that when standardized methods are applied computerized morphometry is a reproducible and reliable method of assessing nuclear atypia in invasive breast cancer. For further improvement special emphasize should be put on sampling rules of selecting the microscope fields and measurement areas.

  13. Inter-examiner reproducibility of tests for lumbar motor control

    Directory of Open Access Journals (Sweden)

    Elkjaer Arne

    2011-05-01

    Full Text Available Abstract Background Many studies show a relation between reduced lumbar motor control (LMC and low back pain (LBP. However, test circumstances vary and during test performance, subjects may change position. In other words, the reliability - i.e. reproducibility and validity - of tests for LMC should be based on quantitative data. This has not been considered before. The aim was to analyse the reproducibility of five different quantitative tests for LMC commonly used in daily clinical practice. Methods The five tests for LMC were: repositioning (RPS, sitting forward lean (SFL, sitting knee extension (SKE, and bent knee fall out (BKFO, all measured in cm, and leg lowering (LL, measured in mm Hg. A total of 40 subjects (14 males, 26 females 25 with and 15 without LBP, with a mean age of 46.5 years (SD 14.8, were examined independently and in random order by two examiners on the same day. LBP subjects were recruited from three physiotherapy clinics with a connection to the clinic's gym or back-school. Non-LBP subjects were recruited from the clinic's staff acquaintances, and from patients without LBP. Results The means and standard deviations for each of the tests were 0.36 (0.27 cm for RPS, 1.01 (0.62 cm for SFL, 0.40 (0.29 cm for SKE, 1.07 (0.52 cm for BKFO, and 32.9 (7.1 mm Hg for LL. All five tests for LMC had reproducibility with the following ICCs: 0.90 for RPS, 0.96 for SFL, 0.96 for SKE, 0.94 for BKFO, and 0.98 for LL. Bland and Altman plots showed that most of the differences between examiners A and B were less than 0.20 cm. Conclusion These five tests for LMC displayed excellent reproducibility. However, the diagnostic accuracy of these tests needs to be addressed in larger cohorts of subjects, establishing values for the normal population. Also cut-points between subjects with and without LBP must be determined, taking into account age, level of activity, degree of impairment and participation in sports. Whether reproducibility of these

  14. Cervical vertebrae maturation method morphologic criteria: poor reproducibility.

    Science.gov (United States)

    Nestman, Trenton S; Marshall, Steven D; Qian, Fang; Holton, Nathan; Franciscus, Robert G; Southard, Thomas E

    2011-08-01

    The cervical vertebrae maturation (CVM) method has been advocated as a predictor of peak mandibular growth. A careful review of the literature showed potential methodologic errors that might influence the high reported reproducibility of the CVM method, and we recently established that the reproducibility of the CVM method was poor when these potential errors were eliminated. The purpose of this study was to further investigate the reproducibility of the individual vertebral patterns. In other words, the purpose was to determine which of the individual CVM vertebral patterns could be classified reliably and which could not. Ten practicing orthodontists, trained in the CVM method, evaluated the morphology of cervical vertebrae C2 through C4 from 30 cephalometric radiographs using questions based on the CVM method. The Fleiss kappa statistic was used to assess interobserver agreement when evaluating each cervical vertebrae morphology question for each subject. The Kendall coefficient of concordance was used to assess the level of interobserver agreement when determining a "derived CVM stage" for each subject. Interobserver agreement was high for assessment of the lower borders of C2, C3, and C4 that were either flat or curved in the CVM method, but interobserver agreement was low for assessment of the vertebral bodies of C3 and C4 when they were either trapezoidal, rectangular horizontal, square, or rectangular vertical; this led to the overall poor reproducibility of the CVM method. These findings were reflected in the Fleiss kappa statistic. Furthermore, nearly 30% of the time, individual morphologic criteria could not be combined to generate a final CVM stage because of incompatible responses to the 5 questions. Intraobserver agreement in this study was only 62%, on average, when the inconclusive stagings were excluded as disagreements. Intraobserver agreement was worse (44%) when the inconclusive stagings were included as disagreements. For the group of subjects

  15. Assessment of precision and reproducibility of a new myograph

    Directory of Open Access Journals (Sweden)

    Piepenbrock Siegfried

    2007-12-01

    Full Text Available Abstract Background The physiological characteristics of muscle activity and the assessment of muscle strength represent important diagnostic information. There are many devices that measure muscle force in humans, but some require voluntary contractions, which are difficult to assess in weak or unconscious patients who are unable to complete a full range of voluntary force assessment tasks. Other devices, which obtain standard muscle contractions by electric stimulations, do not have the technology required to induce and measure reproducible valid contractions at the optimum muscle length. Methods In our study we used a newly developed diagnostic device which measures accurately the reproducibility and time-changed-variability of the muscle force in an individual muscle. A total of 500 in-vivo measurements of supra-maximal isometric single twitch contractions were carried out on the musculus adductor pollicis of 5 test subjects over 10 sessions, with ten repetitions per session. The same protocol was performed on 405 test subjects with two repetitions each to determine a reference-interval on healthy subjects. Results Using our test setting, we found a high reproducibility of the muscle contractions of each test subject. The precision of the measurements performed with our device was 98.74%. Only two consecutive measurements are needed in order to assess a real, representative individual value of muscle force. The mean value of the force of contraction was 9.51 N and the 95% reference interval was 4.77–14.25 N. Conclusion The new myograph is a highly reliable measuring device with which the adductor pollicis can be investigated at the optimum length. It has the potential to become a reliable and valid tool for diagnostic in the clinical setting and for monitoring neuromuscular diseases.

  16. Efficient and reproducible mammalian cell bioprocesses without probes and controllers?

    Science.gov (United States)

    Tissot, Stéphanie; Oberbek, Agata; Reclari, Martino; Dreyer, Matthieu; Hacker, David L; Baldi, Lucia; Farhat, Mohamed; Wurm, Florian M

    2011-07-01

    Bioprocesses for recombinant protein production with mammalian cells are typically controlled for several physicochemical parameters including the pH and dissolved oxygen concentration (DO) of the culture medium. Here we studied whether these controls are necessary for efficient and reproducible bioprocesses in an orbitally shaken bioreactor (OSR). Mixing, gas transfer, and volumetric power consumption (P(V)) were determined in both a 5-L OSR and a 3-L stirred-tank bioreactor (STR). The two cultivation systems had a similar mixing intensity, but the STR had a lower volumetric mass transfer coefficient of oxygen (k(L)a) and a higher P(V) than the OSR. Recombinant CHO cell lines expressing either tumor necrosis factor receptor as an Fc fusion protein (TNFR:Fc) or an anti-RhesusD monoclonal antibody were cultivated in the two systems. The 5-L OSR was operated in an incubator shaker with 5% CO(2) in the gas environment but without pH and DO control whereas the STR was operate