significant statistical association: Topics by WorldWideScience.org

Sample records for significant statistical association

Understanding the Sampling Distribution and Its Use in Testing Statistical Significance.

Science.gov (United States)

Breunig, Nancy A.

Despite the increasing criticism of statistical significance testing by researchers, particularly in the publication of the 1994 American Psychological Association's style manual, statistical significance test results are still popular in journal articles. For this reason, it remains important to understand the logic of inferential statistics. A…
Statistical significance of cis-regulatory modules

Directory of Open Access Journals (Sweden)

Smith Andrew D

2007-01-01

Background: It is becoming increasingly important for researchers to be able to scan through large genomic regions for transcription factor binding sites or clusters of binding sites forming cis-regulatory modules. Correspondingly, there has been a push to develop algorithms for the rapid detection and assessment of cis-regulatory modules. While various algorithms for this purpose have been introduced, most are not well suited for rapid, genome scale scanning. Results: We introduce methods designed for the detection and statistical evaluation of cis-regulatory modules, modeled as either clusters of individual binding sites or as combinations of sites with constrained organization. In order to determine the statistical significance of module sites, we first need a method to determine the statistical significance of single transcription factor binding site matches. We introduce a straightforward method of estimating the statistical significance of single site matches using a database of known promoters to produce data structures that can be used to estimate p-values for binding site matches. We next introduce a technique to calculate the statistical significance of the arrangement of binding sites within a module using a max-gap model. If the module scanned for has defined organizational parameters, the probability of the module is corrected to account for organizational constraints. The statistical significance of single site matches and the architecture of sites within the module can be combined to provide an overall estimation of statistical significance of cis-regulatory module sites. Conclusion: The methods introduced in this paper allow for the detection and statistical evaluation of single transcription factor binding sites and cis-regulatory modules. The features described are implemented in the Search Tool for Occurrences of Regulatory Motifs (STORM) and MODSTORM software.
Test for the statistical significance of differences between ROC curves

International Nuclear Information System (INIS)

Metz, C.E.; Kronman, H.B.

1979-01-01

A test for the statistical significance of observed differences between two measured Receiver Operating Characteristic (ROC) curves has been designed and evaluated. The set of observer response data for each ROC curve is assumed to be independent and to arise from a ROC curve having a form which, in the absence of statistical fluctuations in the response data, graphs as a straight line on double normal-deviate axes. To test the significance of an apparent difference between two measured ROC curves, maximum likelihood estimates of the two parameters of each curve and the associated parameter variances and covariance are calculated from the corresponding set of observer response data. An approximate Chi-square statistic with two degrees of freedom is then constructed from the differences between the parameters estimated for each ROC curve and from the variances and covariances of these estimates. This statistic is known to be truly Chi-square distributed only in the limit of large numbers of trials in the observer performance experiments. Performance of the statistic for data arising from a limited number of experimental trials was evaluated. Independent sets of rating scale data arising from the same underlying ROC curve were paired, and the fraction of differences found (falsely) significant was compared to the significance level, α, used with the test. Although test performance was found to be somewhat dependent on both the number of trials in the data and the position of the underlying ROC curve in the ROC space, the results for various significance levels showed the test to be reliable under practical experimental conditions
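
The statistic described in this abstract can be illustrated with a minimal Python sketch. It assumes each ROC curve is summarised by two estimated parameters with a 2x2 covariance matrix, and that the two data sets are independent so the covariance of the parameter difference is the sum of the two matrices. The function name and the input layout are hypothetical, and the maximum likelihood fitting step is not shown.

```python
import math


def chi2_two_param_difference(est1, cov1, est2, cov2):
    """Approximate chi-square (2 df) test for two independent parameter pairs.

    est1, est2: (a, b) parameter estimates for each curve.
    cov1, cov2: 2x2 covariance matrices of those estimates.
    Returns (statistic, p_value).
    """
    d0 = est1[0] - est2[0]
    d1 = est1[1] - est2[1]
    # Independent data sets: covariance of the difference is the sum.
    s00 = cov1[0][0] + cov2[0][0]
    s01 = cov1[0][1] + cov2[0][1]
    s11 = cov1[1][1] + cov2[1][1]
    det = s00 * s11 - s01 * s01
    if det <= 0:
        raise ValueError("covariance of the difference must be positive definite")
    # Quadratic form d^T S^{-1} d written out for the 2x2 case.
    stat = (d0 * d0 * s11 - 2.0 * d0 * d1 * s01 + d1 * d1 * s00) / det
    # Survival function of a chi-square variable with 2 df is exp(-x / 2).
    p_value = math.exp(-stat / 2.0)
    return stat, p_value
```

As the abstract notes, the chi-square approximation holds only for large numbers of trials, so a p-value from a small experiment should be read with that caveat in mind.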
The thresholds for statistical and clinical significance

DEFF Research Database (Denmark)

Jakobsen, Janus Christian; Gluud, Christian; Winkel, Per

2014-01-01

BACKGROUND: Thresholds for statistical significance are insufficiently demonstrated by 95% confidence intervals or P-values when assessing results from randomised clinical trials. First, a P-value only shows the probability of getting a result assuming that the null hypothesis is true and does...... not reflect the probability of getting a result assuming an alternative hypothesis to the null hypothesis is true. Second, a confidence interval or a P-value showing significance may be caused by multiplicity. Third, statistical significance does not necessarily result in clinical significance. Therefore...... of the probability that a given trial result is compatible with a 'null' effect (corresponding to the P-value) divided by the probability that the trial result is compatible with the intervention effect hypothesised in the sample size calculation; (3) adjust the confidence intervals and the statistical significance...
The insignificance of statistical significance testing

Science.gov (United States)

Johnson, Douglas H.

1999-01-01

Despite their use in scientific journals such as The Journal of Wildlife Management, statistical hypothesis tests add very little value to the products of research. Indeed, they frequently confuse the interpretation of data. This paper describes how statistical hypothesis tests are often viewed, and then contrasts that interpretation with the correct one. I discuss the arbitrariness of P-values, conclusions that the null hypothesis is true, power analysis, and distinctions between statistical and biological significance. Statistical hypothesis testing, in which the null hypothesis about the properties of a population is almost always known a priori to be false, is contrasted with scientific hypothesis testing, which examines a credible null hypothesis about phenomena in nature. More meaningful alternatives are briefly outlined, including estimation and confidence intervals for determining the importance of factors, decision theory for guiding actions in the face of uncertainty, and Bayesian approaches to hypothesis testing and other statistical practices.
Significance levels for studies with correlated test statistics.

Science.gov (United States)

Shi, Jianxin; Levinson, Douglas F; Whittemore, Alice S

2008-07-01

When testing large numbers of null hypotheses, one needs to assess the evidence against the global null hypothesis that none of the hypotheses is false. Such evidence typically is based on the test statistic of the largest magnitude, whose statistical significance is evaluated by permuting the sample units to simulate its null distribution. Efron (2007) has noted that correlation among the test statistics can induce substantial interstudy variation in the shapes of their histograms, which may cause misleading tail counts. Here, we show that permutation-based estimates of the overall significance level also can be misleading when the test statistics are correlated. We propose that such estimates be conditioned on a simple measure of the spread of the observed histogram, and we provide a method for obtaining conditional significance levels. We justify this conditioning using the conditionality principle described by Cox and Hinkley (1974). Application of the method to gene expression data illustrates the circumstances when conditional significance levels are needed.
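
For orientation, the ordinary (unconditional) permutation estimate that this abstract takes as its starting point might be sketched as follows. This is not the conditional method the authors propose. The choice of statistic (largest absolute difference in group means), the function names, and the 0/1 label coding are all assumptions made for illustration.

```python
import random
import statistics


def max_abs_mean_difference(rows, labels):
    """Largest absolute difference in group means over all features."""
    best = 0.0
    for j in range(len(rows[0])):
        group1 = [r[j] for r, lab in zip(rows, labels) if lab == 1]
        group0 = [r[j] for r, lab in zip(rows, labels) if lab == 0]
        diff = abs(statistics.fmean(group1) - statistics.fmean(group0))
        best = max(best, diff)
    return best


def permutation_global_p(rows, labels, n_perm=999, seed=0):
    """Permutation p-value for the global null, based on the max statistic.

    Sample units (rows) keep their feature values; only the group
    labels are shuffled, which preserves correlation among features.
    """
    rng = random.Random(seed)
    observed = max_abs_mean_difference(rows, labels)
    shuffled = list(labels)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if max_abs_mean_difference(rows, shuffled) >= observed:
            exceed += 1
    # Add-one correction so the estimate is never exactly zero.
    return (exceed + 1) / (n_perm + 1)
```

The abstract's point is that an estimate of this kind can mislead when the test statistics are correlated, which is why the authors condition on the spread of the observed histogram.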
Interpreting Statistical Significance Test Results: A Proposed New "What If" Method.

Science.gov (United States)

Kieffer, Kevin M.; Thompson, Bruce

As the 1994 publication manual of the American Psychological Association emphasized, "p" values are affected by sample size. As a result, it can be helpful to interpret the results of statistical significance tests in a sample size context by conducting so-called "what if" analyses. However, these methods can be inaccurate…
Caveats for using statistical significance tests in research assessments

DEFF Research Database (Denmark)

Schneider, Jesper Wiborg

2013-01-01

This article raises concerns about the advantages of using statistical significance tests in research assessments as has recently been suggested in the debate about proper normalization procedures for citation indicators by Opthof and Leydesdorff (2010). Statistical significance tests are highly controversial and numerous criticisms have been leveled against their use. Based on examples from articles by proponents of the use of statistical significance tests in research assessments, we address some of the numerous problems with such tests. The issues specifically discussed are the ritual practice...... argue that applying statistical significance tests and mechanically adhering to their results are highly problematic and detrimental to critical thinking. We claim that the use of such tests does not provide any advantages in relation to deciding whether differences between citation indicators...
Sibling Competition & Growth Tradeoffs. Biological vs. Statistical Significance.

Science.gov (United States)

Kramer, Karen L; Veile, Amanda; Otárola-Castillo, Erik

2016-01-01

Early childhood growth has many downstream effects on future health and reproduction and is an important measure of offspring quality. While a tradeoff between family size and child growth outcomes is theoretically predicted in high-fertility societies, empirical evidence is mixed. This is often attributed to phenotypic variation in parental condition. However, inconsistent study results may also arise because family size confounds the potentially differential effects that older and younger siblings can have on young children's growth. Additionally, inconsistent results might reflect that the biological significance associated with different growth trajectories is poorly understood. This paper addresses these concerns by tracking children's monthly gains in height and weight from weaning to age five in a high fertility Maya community. We predict that: 1) as an aggregate measure family size will not have a major impact on child growth during the post weaning period; 2) competition from young siblings will negatively impact child growth during the post weaning period; 3) however because of their economic value, older siblings will have a negligible effect on young children's growth. Accounting for parental condition, we use linear mixed models to evaluate the effects that family size, younger and older siblings have on children's growth. Congruent with our expectations, it is younger siblings who have the most detrimental effect on children's growth. While we find statistical evidence of a quantity/quality tradeoff effect, the biological significance of these results is negligible in early childhood. Our findings help to resolve why quantity/quality studies have had inconsistent results by showing that sibling competition varies with sibling age composition, not just family size, and that biological significance is distinct from statistical significance.
Sibling Competition & Growth Tradeoffs. Biological vs. Statistical Significance.

Directory of Open Access Journals (Sweden)

Karen L Kramer

Early childhood growth has many downstream effects on future health and reproduction and is an important measure of offspring quality. While a tradeoff between family size and child growth outcomes is theoretically predicted in high-fertility societies, empirical evidence is mixed. This is often attributed to phenotypic variation in parental condition. However, inconsistent study results may also arise because family size confounds the potentially differential effects that older and younger siblings can have on young children's growth. Additionally, inconsistent results might reflect that the biological significance associated with different growth trajectories is poorly understood. This paper addresses these concerns by tracking children's monthly gains in height and weight from weaning to age five in a high fertility Maya community. We predict that: 1) as an aggregate measure family size will not have a major impact on child growth during the post weaning period; 2) competition from young siblings will negatively impact child growth during the post weaning period; 3) however because of their economic value, older siblings will have a negligible effect on young children's growth. Accounting for parental condition, we use linear mixed models to evaluate the effects that family size, younger and older siblings have on children's growth. Congruent with our expectations, it is younger siblings who have the most detrimental effect on children's growth. While we find statistical evidence of a quantity/quality tradeoff effect, the biological significance of these results is negligible in early childhood. Our findings help to resolve why quantity/quality studies have had inconsistent results by showing that sibling competition varies with sibling age composition, not just family size, and that biological significance is distinct from statistical significance.
Statistically significant relational data mining

Energy Technology Data Exchange (ETDEWEB)

Berry, Jonathan W.; Leung, Vitus Joseph; Phillips, Cynthia Ann; Pinar, Ali; Robinson, David Gerald; Berger-Wolf, Tanya; Bhowmick, Sanjukta; Casleton, Emily; Kaiser, Mark; Nordman, Daniel J.; Wilson, Alyson G.

2014-02-01

This report summarizes the work performed under the project "Statistically significant relational data mining." The goal of the project was to add more statistical rigor to the fairly ad hoc area of data mining on graphs. Our goal was to develop better algorithms and better ways to evaluate algorithm quality. We concentrated on algorithms for community detection, approximate pattern matching, and graph similarity measures. Approximate pattern matching involves finding an instance of a relatively small pattern, expressed with tolerance, in a large graph of data observed with uncertainty. This report gathers the abstracts and references for the eight refereed publications that have appeared as part of this work. We then archive three pieces of research that have not yet been published. The first is theoretical and experimental evidence that a popular statistical measure for comparison of community assignments favors over-resolved communities over approximations to a ground truth. The second comprises statistically motivated methods for measuring the quality of an approximate match of a small pattern in a large graph. The third is a new probabilistic random graph model. Statisticians favor these models for graph analysis. The new local structure graph model overcomes some of the issues with popular models such as exponential random graph models and latent variable models.
Common pitfalls in statistical analysis: "P" values, statistical significance and confidence intervals

Directory of Open Access Journals (Sweden)

Priya Ranganathan

2015-01-01

In the second part of a series on pitfalls in statistical analysis, we look at various ways in which a statistically significant study result can be expressed. We debunk some of the myths regarding the 'P' value, explain the importance of 'confidence intervals' and clarify the importance of including both values in a paper.
Health significance and statistical uncertainty. The value of P-value.

Science.gov (United States)

Consonni, Dario; Bertazzi, Pier Alberto

2017-10-27

The P-value is widely used as a summary statistic of scientific results. Unfortunately, there is a widespread tendency to dichotomize its value in "P<0.05" ("statistically significant") and "P>0.05" ("statistically not significant"), with the former implying a "positive" result and the latter a "negative" one. The aim is to show the unsuitability of such an approach when evaluating the effects of environmental and occupational risk factors. We provide examples of distorted use of the P-value and of the negative consequences for science and public health of such a black-and-white vision. The rigid interpretation of the P-value as a dichotomy favors the confusion between health relevance and statistical significance, discourages thoughtful thinking, and distracts attention from what really matters, the health significance. A much better way to express and communicate scientific results involves reporting effect estimates (e.g., risks, risk ratios or risk differences) and their confidence intervals (CI), which summarize and convey both health significance and statistical uncertainty. Unfortunately, many researchers do not usually consider the whole interval of the CI but only examine whether it includes the null value, thereby degrading this procedure to the same P-value dichotomy (statistical significance or not). In reporting statistical results of scientific research, present effect estimates with their confidence intervals and do not qualify the P-value as "significant" or "not significant".
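
The recommendation to report an effect estimate with its confidence interval can be made concrete with a small Python sketch. It assumes a simple two-group cohort layout and uses the standard log-scale (Katz) approximation for the interval of a risk ratio. The function name and the counts in the usage note are hypothetical and are not taken from the paper.

```python
import math


def risk_ratio_with_ci(exposed_cases, exposed_total,
                       unexposed_cases, unexposed_total, z=1.96):
    """Risk ratio with an approximate 95% CI (log method).

    Assumes all four counts are positive; z=1.96 gives a 95% interval.
    Returns (risk_ratio, lower, upper).
    """
    risk_exposed = exposed_cases / exposed_total
    risk_unexposed = unexposed_cases / unexposed_total
    rr = risk_exposed / risk_unexposed
    # Standard error of log(RR) under the usual large-sample approximation.
    se_log_rr = math.sqrt(
        1.0 / exposed_cases - 1.0 / exposed_total
        + 1.0 / unexposed_cases - 1.0 / unexposed_total
    )
    lower = math.exp(math.log(rr) - z * se_log_rr)
    upper = math.exp(math.log(rr) + z * se_log_rr)
    return rr, lower, upper
```

With hypothetical counts of 30 cases among 100 exposed and 20 among 100 unexposed, `risk_ratio_with_ci(30, 100, 20, 100)` gives a risk ratio of 1.5 with an interval that spans 1. In the spirit of the abstract, the whole interval is the result to report, not merely whether it includes the null value.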
Recent Literature on Whether Statistical Significance Tests Should or Should Not Be Banned.

Science.gov (United States)

Deegear, James

This paper summarizes the literature regarding statistical significance testing with an emphasis on recent literature in various disciplines and literature exploring why researchers have demonstrably failed to be influenced by the American Psychological Association publication manual's encouragement to report effect sizes. Also considered are…
HPV-Associated Cancers Statistics

Science.gov (United States)

Statistical significance versus clinical relevance.

Science.gov (United States)

van Rijn, Marieke H C; Bech, Anneke; Bouyer, Jean; van den Brand, Jan A J G

2017-04-01

In March this year, the American Statistical Association (ASA) posted a statement on the correct use of P-values, in response to a growing concern that the P-value is commonly misused and misinterpreted. We aim to translate these warnings given by the ASA into a language more easily understood by clinicians and researchers without a deep background in statistics. Moreover, we intend to illustrate the limitations of P-values, even when used and interpreted correctly, and bring more attention to the clinical relevance of study findings using two recently reported studies as examples. We argue that P-values are often misinterpreted. A common mistake is saying that P < 0.05 means that the null hypothesis is false, and P ≥0.05 means that the null hypothesis is true. The correct interpretation of a P-value of 0.05 is that if the null hypothesis were indeed true, a similar or more extreme result would occur 5% of the times upon repeating the study in a similar sample. In other words, the P-value informs about the likelihood of the data given the null hypothesis and not the other way around. A possible alternative related to the P-value is the confidence interval (CI). It provides more information on the magnitude of an effect and the imprecision with which that effect was estimated. However, there is no magic bullet to replace P-values and stop erroneous interpretation of scientific results. Scientists and readers alike should make themselves familiar with the correct, nuanced interpretation of statistical tests, P-values and CIs. © The Author 2017. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.
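
The interpretation given in this abstract (if the null hypothesis were true, a result at least this extreme would occur about 5% of the time) can be checked with a small simulation. The sketch below assumes normally distributed data with known standard deviation and a two-sided z-test. Every name and setting in it is illustrative and does not come from the paper.

```python
import math
import random


def two_sided_z_p_value(sample, sigma=1.0):
    """Two-sided p-value for H0: mean = 0, with known sigma."""
    n = len(sample)
    z = (sum(sample) / n) / (sigma / math.sqrt(n))
    # 2 * (1 - Phi(|z|)) expressed through the complementary error function.
    return math.erfc(abs(z) / math.sqrt(2.0))


def fraction_significant_under_null(n_studies=5000, n_per_study=30,
                                    alpha=0.05, seed=1):
    """Share of simulated null studies with p < alpha."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_studies):
        # Every study is drawn with a true mean of exactly zero.
        sample = [rng.gauss(0.0, 1.0) for _ in range(n_per_study)]
        if two_sided_z_p_value(sample) < alpha:
            hits += 1
    return hits / n_studies
```

The returned fraction should land close to `alpha`, which illustrates that the P-value describes the data given the null hypothesis and not the probability that the null hypothesis is true.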
Common pitfalls in statistical analysis: “P” values, statistical significance and confidence intervals

Science.gov (United States)

Ranganathan, Priya; Pramesh, C. S.; Buyse, Marc

2015-01-01

In the second part of a series on pitfalls in statistical analysis, we look at various ways in which a statistically significant study result can be expressed. We debunk some of the myths regarding the 'P' value, explain the importance of 'confidence intervals' and clarify the importance of including both values in a paper. PMID: 25878958
Statistics for X-chromosome associations.

Science.gov (United States)

Özbek, Umut; Lin, Hui-Min; Lin, Yan; Weeks, Daniel E; Chen, Wei; Shaffer, John R; Purcell, Shaun M; Feingold, Eleanor

2018-06-13

In a genome-wide association study (GWAS), association between genotype and phenotype at autosomal loci is generally tested by regression models. However, X-chromosome data are often excluded from published analyses of autosomes because of the difference between males and females in number of X chromosomes. Failure to analyze X-chromosome data at all is obviously less than ideal, and can lead to missed discoveries. Even when X-chromosome data are included, they are often analyzed with suboptimal statistics. Several mathematically sensible statistics for X-chromosome association have been proposed. The optimality of these statistics, however, is based on very specific simple genetic models. In addition, while previous simulation studies of these statistics have been informative, they have focused on single-marker tests and have not considered the types of error that occur even under the null hypothesis when the entire X chromosome is scanned. In this study, we comprehensively tested several X-chromosome association statistics using simulation studies that include the entire chromosome. We also considered a wide range of trait models for sex differences and phenotypic effects of X inactivation. We found that models that do not incorporate a sex effect can have large type I error in some cases. We also found that many of the best statistics perform well even when there are modest deviations, such as trait variance differences between the sexes or small sex differences in allele frequencies, from assumptions. © 2018 WILEY PERIODICALS, INC.
How to construct the statistic network? An association network of herbaceous

Directory of Open Access Journals (Sweden)

WenJun Zhang

2012-06-01

In the present study I defined a new type of network, the statistic network. The statistic network is a weighted and non-deterministic network. In the statistic network, a connection value, i.e., connection weight, represents connection strength and connection likelihood between two nodes and its absolute value falls in the interval (0,1]. The connection value is expressed as a statistical measure such as correlation coefficient, association coefficient, or Jaccard coefficient, etc. In addition, all connections of the statistic network can be statistically tested for their validity. A connection is true if the connection value is statistically significant. If all connection values of a node are not statistically significant, it is an isolated node. An isolated node has no connection to other nodes in the statistic network. Positive and negative connection values denote distinct connection types (positive or negative association or interaction). In the statistic network, two nodes with the greater connection value will show a more similar trend in the change of their states. At any time we can obtain a sample network of the statistic network. A sample network is a non-weighted and deterministic network. The statistic network, in particular the plant association network that is constructed from field sampling, is mostly an information network. Most of the interspecific relationships in a plant community are competition and cooperation. Therefore, in comparison to animal networks, the methodology of the statistic network is more suitable for constructing plant association networks. Some conclusions were drawn from this study: (1) in the plant association network, most connections are weak and positive interactions. The association network constructed from Spearman rank correlation has the most connections, and isolated taxa are fewer. From net linear correlation, linear correlation, to Spearman rank correlation, the practical number of connections and connectance in the
Swiss solar power statistics 2007 - Significant expansion

International Nuclear Information System (INIS)

Hostettler, T.

2008-01-01

This article presents and discusses the 2007 statistics for solar power in Switzerland. A significant number of new installations is noted, as are the high production figures from newer installations. The basics behind the compilation of the Swiss solar power statistics are briefly reviewed and an overview for the period 1989 to 2007 is presented which includes figures on the number of photovoltaic plants in service and installed peak power. Typical production figures in kilowatt-hours (kWh) per installed kilowatt-peak power (kWp) are presented and discussed for installations of various sizes. Increased production after inverter replacement in older installations is noted. Finally, the general political situation in Switzerland as far as solar power is concerned is briefly discussed, as are international developments.

A robust statistical method for association-based eQTL analysis.

Directory of Open Access Journals (Sweden)

Ning Jiang

It has been well established that the theoretical kernel of the recently surging genome-wide association study (GWAS) is statistical inference of linkage disequilibrium (LD) between a tested genetic marker and a putative locus affecting a disease trait. However, LD analysis is vulnerable to several confounding factors, of which population stratification is the most prominent. Whilst many methods have been proposed to correct for the influence either through predicting the structure parameters or correcting inflation in the test statistic due to the stratification, these may not be feasible or may impose further statistical problems in practical implementation. We propose here a novel statistical method to control spurious LD in GWAS from population structure by incorporating a control marker into testing for significance of genetic association of a polymorphic marker with phenotypic variation of a complex trait. The method avoids the need for structure prediction, which may be infeasible or inadequate in practice, and accounts properly for a varying effect of population stratification on different regions of the genome under study. Utility and statistical properties of the new method were tested through an intensive computer simulation study and an association-based genome-wide mapping of expression quantitative trait loci in genetically divergent human populations. The analyses show that the new method confers an improved statistical power for detecting genuine genetic association in subpopulations and an effective control of spurious associations stemming from population structure when compared with two other popularly implemented methods in the literature of GWAS.
On detection and assessment of statistical significance of Genomic Islands

Directory of Open Access Journals (Sweden)

Chaudhuri Probal

2008-04-01

Background: Many of the available methods for detecting Genomic Islands (GIs) in prokaryotic genomes use markers such as transposons, proximal tRNAs, flanking repeats etc., or they use other supervised techniques requiring training datasets. Most of these methods are primarily based on the biases in GC content or codon and amino acid usage of the islands. However, these methods either do not use any formal statistical test of significance or use statistical tests for which the critical values and the P-values are not adequately justified. We propose a method, which is unsupervised in nature and uses Monte-Carlo statistical tests based on randomly selected segments of a chromosome. Such tests are supported by precise statistical distribution theory, and consequently, the resulting P-values are quite reliable for making the decision. Results: Our algorithm (named Design-Island, an acronym for Detection of Statistically Significant Genomic Island) runs in two phases. Some 'putative GIs' are identified in the first phase, and those are refined into smaller segments containing horizontally acquired genes in the refinement phase. This method is applied to Salmonella typhi CT18 genome leading to the discovery of several new pathogenicity, antibiotic resistance and metabolic islands that were missed by earlier methods. Many of these islands contain mobile genetic elements like phage-mediated genes, transposons, integrase and IS elements confirming their horizontal acquirement. Conclusion: The proposed method is based on statistical tests supported by precise distribution theory and reliable P-values along with a technique for visualizing statistically significant islands. The performance of our method is better than many other well known methods in terms of their sensitivity and accuracy, and in terms of specificity, it is comparable to other methods.
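
The general idea of a Monte Carlo test based on randomly selected segments can be sketched in a few lines of Python. This is not the Design-Island algorithm. The test statistic (absolute deviation of a segment's GC fraction from the genome-wide GC fraction), the function names, and the add-one p-value estimate are assumptions chosen only to illustrate the principle.

```python
import random


def gc_fraction(seq):
    """Fraction of G or C bases in an upper-case DNA string."""
    return sum(1 for base in seq if base in "GC") / len(seq)


def monte_carlo_p_value(genome, start, length, n_draws=999, seed=0):
    """Monte Carlo p-value for the GC deviation of one candidate segment.

    The candidate genome[start:start + length] is compared with
    n_draws segments of the same length drawn uniformly at random
    from the same genome.
    """
    rng = random.Random(seed)
    genome_gc = gc_fraction(genome)
    observed = abs(gc_fraction(genome[start:start + length]) - genome_gc)
    exceed = 0
    for _ in range(n_draws):
        s = rng.randrange(0, len(genome) - length + 1)
        deviation = abs(gc_fraction(genome[s:s + length]) - genome_gc)
        if deviation >= observed:
            exceed += 1
    # Add-one correction: the candidate counts as one of the draws.
    return (exceed + 1) / (n_draws + 1)
```

A small p-value marks a segment whose composition is unusual relative to the rest of the chromosome. A real analysis would also need to correct for the many candidate segments tested.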
The distribution of P-values in medical research articles suggested selective reporting associated with statistical significance.

Science.gov (United States)

Perneger, Thomas V; Combescure, Christophe

2017-07-01

Published P-values provide a window into the global enterprise of medical research. The aim of this study was to use the distribution of published P-values to estimate the relative frequencies of null and alternative hypotheses and to seek irregularities suggestive of publication bias. This cross-sectional study included P-values published in 120 medical research articles in 2016 (30 each from the BMJ, JAMA, Lancet, and New England Journal of Medicine). The observed distribution of P-values was compared with expected distributions under the null hypothesis (i.e., uniform between 0 and 1) and the alternative hypothesis (strictly decreasing from 0 to 1). P-values were categorized according to conventional levels of statistical significance and in one-percent intervals. Among 4,158 recorded P-values, 26.1% were highly significant (P values values equal to 1, and (3) about twice as many P-values less than 0.05 compared with those more than 0.05. The latter finding was seen in both randomized trials and observational studies, and in most types of analyses, excepting heterogeneity tests and interaction tests. Under plausible assumptions, we estimate that about half of the tested hypotheses were null and the other half were alternative. This analysis suggests that statistical tests published in medical journals are not a random sample of null and alternative hypotheses but that selective reporting is prevalent. In particular, significant results are about twice as likely to be reported as nonsignificant results. Copyright © 2017 Elsevier Inc. All rights reserved.
Increasing the statistical significance of entanglement detection in experiments.

Science.gov (United States)

Jungnitsch, Bastian; Niekamp, Sönke; Kleinmann, Matthias; Gühne, Otfried; Lu, He; Gao, Wei-Bo; Chen, Yu-Ao; Chen, Zeng-Bing; Pan, Jian-Wei

2010-05-28

Entanglement is often verified by a violation of an inequality like a Bell inequality or an entanglement witness. Considerable effort has been devoted to the optimization of such inequalities in order to obtain a high violation. We demonstrate theoretically and experimentally that such an optimization does not necessarily lead to a better entanglement test, if the statistical error is taken into account. Theoretically, we show for different error models that reducing the violation of an inequality can improve the significance. Experimentally, we observe this phenomenon in a four-photon experiment, testing the Mermin and Ardehali inequality for different levels of noise. Furthermore, we provide a way to develop entanglement tests with high statistical significance.
Testing the Difference of Correlated Agreement Coefficients for Statistical Significance

Science.gov (United States)

Gwet, Kilem L.

2016-01-01

This article addresses the problem of testing the difference between two correlated agreement coefficients for statistical significance. A number of authors have proposed methods for testing the difference between two correlated kappa coefficients, which require either the use of resampling methods or the use of advanced statistical modeling…
Past and future American Psychological Association guidelines for statistical practice

NARCIS (Netherlands)

Finch, S; Thomason, N; Cumming, G

2002-01-01

We review the publication guidelines of the American Psychological Association (APA) since 1929 and document their advice for authors about statistical practice. Although the advice has been extended with each revision of the guidelines, it has largely focused on null hypothesis significance testing
Statistical Significance for Hierarchical Clustering

Science.gov (United States)

Kimes, Patrick K.; Liu, Yufeng; Hayes, D. Neil; Marron, J. S.

2017-01-01

Summary Cluster analysis has proved to be an invaluable tool for the exploratory and unsupervised analysis of high dimensional datasets. Among methods for clustering, hierarchical approaches have enjoyed substantial popularity in genomics and other fields for their ability to simultaneously uncover multiple layers of clustering structure. A critical and challenging question in cluster analysis is whether the identified clusters represent important underlying structure or are artifacts of natural sampling variation. Few approaches have been proposed for addressing this problem in the context of hierarchical clustering, for which the problem is further complicated by the natural tree structure of the partition, and the multiplicity of tests required to parse the layers of nested clusters. In this paper, we propose a Monte Carlo based approach for testing statistical significance in hierarchical clustering which addresses these issues. The approach is implemented as a sequential testing procedure guaranteeing control of the family-wise error rate. Theoretical justification is provided for our approach, and its power to detect true clustering structure is illustrated through several simulation studies and applications to two cancer gene expression datasets. PMID:28099990
Significant Association of Streptococcus bovis with Malignant Gastrointestinal Diseases

Directory of Open Access Journals (Sweden)

Salah Shanan

2011-01-01

Streptococcus bovis is a Gram-positive bacterium causing serious human infections, including endocarditis and bacteremia, and is usually associated with underlying disease. The aims of the current study were to compare the prevalence of the bacterium in malignant and nonmalignant gastrointestinal diseases and to determine the susceptibility of the isolated strains to different antimicrobial agents. The results showed that the difference in prevalence of S. bovis in stool specimens between patients with malignant and with nonmalignant gastrointestinal diseases was statistically significant. This result may support the idea that there is a correlation between S. bovis and malignant gastrointestinal diseases.
Identification of sequence motifs significantly associated with antisense activity

Directory of Open Access Journals (Sweden)

Peek Andrew S

2007-06-01

Background: Predicting the suppression activity of antisense oligonucleotide sequences is the main goal of the rational design of nucleic acids. To create an effective predictive model, it is important to know what properties of an oligonucleotide sequence associate significantly with antisense activity. Also, for the model to be efficient we must know what properties do not associate significantly and can be omitted from the model. This paper will discuss the results of a randomization procedure to find motifs that associate significantly with either high or low antisense suppression activity, analysis of their properties, as well as the results of support vector machine modelling using these significant motifs as features. Results: We discovered 155 motifs that associate significantly with high antisense suppression activity and 202 motifs that associate significantly with low suppression activity. The motifs range in length from 2 to 5 bases, contain several motifs that have been previously discovered as associating highly with antisense activity, and have thermodynamic properties consistent with previous work associating thermodynamic properties of sequences with their antisense activity. Statistical analysis revealed no correlation between a motif's position within an antisense sequence and that sequence's antisense activity. Also, many significant motifs existed as subwords of other significant motifs. Support vector regression experiments indicated that the feature set of significant motifs increased correlation compared to all possible motifs as well as several subsets of the significant motifs. Conclusion: The thermodynamic properties of the significantly associated motifs support existing data correlating the thermodynamic properties of the antisense oligonucleotide with antisense efficiency, reinforcing our hypothesis that antisense suppression is strongly associated with probe/target thermodynamics, as there are no enzymatic
Statistical significance of trends in monthly heavy precipitation over the US

KAUST Repository

Mahajan, Salil

2011-05-11

Trends in monthly heavy precipitation, defined by a return period of one year, are assessed for statistical significance in observations and Global Climate Model (GCM) simulations over the contiguous United States using Monte Carlo non-parametric and parametric bootstrapping techniques. The results from the two Monte Carlo approaches are found to be similar to each other, and also to the traditional non-parametric Kendall's τ test, implying the robustness of the approach. Two different observational data-sets are employed to test for trends in monthly heavy precipitation and are found to exhibit consistent results. Both data-sets demonstrate upward trends, one of which is found to be statistically significant at the 95% confidence level. Upward trends similar to observations are observed in some climate model simulations of the twentieth century, but their statistical significance is marginal. For projections of the twenty-first century, a statistically significant upward trend is observed in most of the climate models analyzed. The change in the simulated precipitation variance appears to be more important in the twenty-first century projections than changes in the mean precipitation. Stochastic fluctuations of the climate system are found to dominate monthly heavy precipitation, as some GCM simulations show a downward trend even in the twenty-first century projections when the greenhouse gas forcings are strong. © 2011 Springer-Verlag.
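A minimal sketch of how a trend in a short series might be checked by Monte Carlo resampling. It is not the authors' bootstrap procedure: it swaps in a simple permutation test around Kendall's τ, assumes the values are exchangeable under the null (no serial correlation), and the names `kendall_tau` and `trend_p_value` are illustrative only.

```python
import random


def kendall_tau(values):
    """Kendall's tau-a between time order and values (tied pairs contribute zero)."""
    n = len(values)
    s = 0
    for i in range(n - 1):
        for j in range(i + 1, n):
            diff = values[j] - values[i]
            s += (diff > 0) - (diff < 0)  # +1 concordant, -1 discordant
    return s / (n * (n - 1) / 2)


def trend_p_value(values, n_shuffles=2000, seed=0):
    """Two-sided Monte Carlo p-value for 'no trend', obtained by shuffling time order."""
    rng = random.Random(seed)
    observed = abs(kendall_tau(values))
    shuffled = list(values)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(shuffled)
        if abs(kendall_tau(shuffled)) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_shuffles + 1)
```

Calling `trend_p_value(series)` on a list of yearly values returns the share of shuffled series whose |τ| is at least as large as the observed one. Real precipitation series are usually autocorrelated, so a block-based resampling scheme would likely be needed in practice.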
Increasing the statistical significance of entanglement detection in experiments

Energy Technology Data Exchange (ETDEWEB)

Jungnitsch, Bastian; Niekamp, Soenke; Kleinmann, Matthias; Guehne, Otfried [Institut fuer Quantenoptik und Quanteninformation, Innsbruck (Austria); Lu, He; Gao, Wei-Bo; Chen, Zeng-Bing [Hefei National Laboratory for Physical Sciences at Microscale and Department of Modern Physics, University of Science and Technology of China, Hefei (China); Chen, Yu-Ao; Pan, Jian-Wei [Hefei National Laboratory for Physical Sciences at Microscale and Department of Modern Physics, University of Science and Technology of China, Hefei (China); Physikalisches Institut, Universitaet Heidelberg (Germany)

2010-07-01

Entanglement is often verified by a violation of an inequality like a Bell inequality or an entanglement witness. Considerable effort has been devoted to the optimization of such inequalities in order to obtain a high violation. We demonstrate theoretically and experimentally that such an optimization does not necessarily lead to a better entanglement test, if the statistical error is taken into account. Theoretically, we show for different error models that reducing the violation of an inequality can improve the significance. We show this to be the case for an error model in which the variance of an observable is interpreted as its error and for the standard error model in photonic experiments. Specifically, we demonstrate that the Mermin inequality yields a Bell test which is statistically more significant than the Ardehali inequality in the case of a photonic four-qubit state that is close to a GHZ state. Experimentally, we observe this phenomenon in a four-photon experiment, testing the above inequalities for different levels of noise.
Reporting effect sizes as a supplement to statistical significance ...

African Journals Online (AJOL)

The purpose of the article is to review the statistical significance reporting practices in reading instruction studies and to provide guidelines for when to calculate and report effect sizes in educational research. A review of six readily accessible (online) and accredited journals publishing research on reading instruction ...
Funding source and primary outcome changes in clinical trials registered on ClinicalTrials.gov are associated with the reporting of a statistically significant primary outcome: a cross-sectional study [v2; ref status: indexed, http://f1000r.es/5bj

Directory of Open Access Journals (Sweden)

Sreeram V Ramagopalan

2015-04-01

Background: We and others have shown a significant proportion of interventional trials registered on ClinicalTrials.gov have their primary outcomes altered after the listed study start and completion dates. The objectives of this study were to investigate whether changes made to primary outcomes are associated with the likelihood of reporting a statistically significant primary outcome on ClinicalTrials.gov. Methods: A cross-sectional analysis of all interventional clinical trials registered on ClinicalTrials.gov as of 20 November 2014 was performed. The main outcome was any change made to the initially listed primary outcome and the time of the change in relation to the trial start and end date. Findings: 13,238 completed interventional trials were registered with ClinicalTrials.gov that also had study results posted on the website. 2555 (19.3%) had one or more statistically significant primary outcomes. Statistical analysis showed that registration year, funding source and primary outcome change after trial completion were associated with reporting a statistically significant primary outcome. Conclusions: Funding source and primary outcome change after trial completion are associated with a statistically significant primary outcome report on ClinicalTrials.gov.
Your Chi-Square Test Is Statistically Significant: Now What?

Science.gov (United States)

Sharpe, Donald

2015-01-01

Applied researchers have employed chi-square tests for more than one hundred years. This paper addresses the question of how one should follow a statistically significant chi-square test result in order to determine the source of that result. Four approaches were evaluated: calculating residuals, comparing cells, ransacking, and partitioning. Data…
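As a rough illustration of the first follow-up approach named above (calculating residuals), the sketch below computes Pearson's chi-square statistic and the adjusted standardized residual of each cell. The function name and the example table are assumptions made for illustration, not taken from the paper.

```python
import math


def chi_square_with_residuals(table):
    """Return (chi2, adjusted standardized residuals) for an r x c table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    residuals = []
    for i, row in enumerate(table):
        res_row = []
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
            # Adjusted residual: (O - E) / sqrt(E * (1 - row share) * (1 - column share))
            denom = math.sqrt(
                expected * (1 - row_totals[i] / n) * (1 - col_totals[j] / n)
            )
            res_row.append((observed - expected) / denom)
        residuals.append(res_row)
    return chi2, residuals


# Hypothetical 2 x 2 table of counts.
chi2, residuals = chi_square_with_residuals([[10, 20], [20, 10]])
```

Cells whose adjusted residual is large in absolute value (a common rule of thumb is beyond about ±2) are the ones contributing most to a significant overall result. In a 2 × 2 table every squared adjusted residual equals the chi-square statistic.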
Sunspot activity and influenza pandemics: a statistical assessment of the purported association.

Science.gov (United States)

Towers, S

2017-10-01

Since 1978, a series of papers in the literature have claimed to find a significant association between sunspot activity and the timing of influenza pandemics. This paper examines these analyses, and attempts to recreate the three most recent statistical analyses by Ertel (1994), Tapping et al. (2001), and Yeung (2006), which all have purported to find a significant relationship between sunspot numbers and pandemic influenza. As will be discussed, each analysis had errors in the data. In addition, in each analysis arbitrary selections or assumptions were also made, and the authors did not assess the robustness of their analyses to changes in those arbitrary assumptions. Varying the arbitrary assumptions to other, equally valid, assumptions negates the claims of significance. Indeed, an arbitrary selection made in one of the analyses appears to have resulted in almost maximal apparent significance; changing it only slightly yields a null result. This analysis applies statistically rigorous methodology to examine the purported sunspot/pandemic link, using more statistically powerful un-binned analysis methods, rather than relying on arbitrarily binned data. The analyses are repeated using both the Wolf and Group sunspot numbers. In all cases, no statistically significant evidence of any association was found. However, while the focus in this particular analysis was on the purported relationship of influenza pandemics to sunspot activity, the faults found in the past analyses are common pitfalls; inattention to analysis reproducibility and robustness assessment are common problems in the sciences, that are unfortunately not noted often enough in review.
Confidence intervals permit, but don't guarantee, better inference than statistical significance testing

Directory of Open Access Journals (Sweden)

Melissa Coulson

2010-07-01

A statistically significant result and a non-significant result may differ little, although significance status may tempt an interpretation of difference. Two studies are reported that compared interpretation of such results presented using null hypothesis significance testing (NHST) or confidence intervals (CIs). Authors of articles published in psychology, behavioural neuroscience, and medical journals were asked, via email, to interpret two fictitious studies that found similar results, one statistically significant, and the other non-significant. Responses from 330 authors varied greatly, but interpretation was generally poor, whether results were presented as CIs or using NHST. However, when interpreting CIs, respondents who mentioned NHST were 60% likely to conclude, unjustifiably, that the two results conflicted, whereas those who interpreted CIs without reference to NHST were 95% likely to conclude, justifiably, that the two results were consistent. Findings were generally similar for all three disciplines. An email survey of academic psychologists confirmed that CIs elicit better interpretations if NHST is not invoked. Improved statistical inference can result from encouragement of meta-analytic thinking and use of CIs but, for full benefit, such highly desirable statistical reform also requires that researchers interpret CIs without recourse to NHST.
Testing statistical significance scores of sequence comparison methods with structure similarity

Directory of Open Access Journals (Sweden)

Leunissen Jack AM

2006-10-01

Background: In the past years the Smith-Waterman sequence comparison algorithm has gained popularity due to improved implementations and rapidly increasing computing power. However, the quality and sensitivity of a database search is not only determined by the algorithm but also by the statistical significance testing for an alignment. The e-value is the most commonly used statistical validation method for sequence database searching. The CluSTr database and the Protein World database have been created using an alternative statistical significance test: a Z-score based on Monte-Carlo statistics. Several papers have described the superiority of the Z-score as compared to the e-value, using simulated data. We were interested in whether this could be validated when applied to existing, evolutionarily related protein sequences. Results: All experiments are performed on the ASTRAL SCOP database. The Smith-Waterman sequence comparison algorithm with both e-value and Z-score statistics is evaluated, using ROC, CVE and AP measures. The BLAST and FASTA algorithms are used as reference. We find that two out of three Smith-Waterman implementations with e-value are better at predicting structural similarities between proteins than the Smith-Waterman implementation with Z-score. SSEARCH especially has very high scores. Conclusion: The compute-intensive Z-score does not have a clear advantage over the e-value. The Smith-Waterman implementations give generally better results than their heuristic counterparts. We recommend using the SSEARCH algorithm combined with e-values for pairwise sequence comparisons.
Statistical testing of association between menstruation and migraine.

Science.gov (United States)

Barra, Mathias; Dahl, Fredrik A; Vetvik, Kjersti G

2015-02-01

To repair and refine a previously proposed method for statistical analysis of association between migraine and menstruation. Menstrually related migraine (MRM) affects about 20% of female migraineurs in the general population. The exact pathophysiological link from menstruation to migraine is hypothesized to be through fluctuations in female reproductive hormones, but the exact mechanisms remain unknown. Therefore, the main diagnostic criterion today is concurrency of migraine attacks with menstruation. Methods aiming to exclude spurious associations are wanted, so that further research into these mechanisms can be performed on a population with a true association. The statistical method is based on a simple two-parameter null model of MRM (which allows for simulation modeling), and Fisher's exact test (with mid-p correction) applied to standard 2 × 2 contingency tables derived from the patients' headache diaries. Our method is a corrected version of a previously published flawed framework. To our best knowledge, no other published methods for establishing a menstruation-migraine association by statistical means exist today. The probabilistic methodology shows good performance when subjected to receiver operator characteristic curve analysis. Quick reference cutoff values for the clinical setting were tabulated for assessing association given a patient's headache history. In this paper, we correct a proposed method for establishing association between menstruation and migraine by statistical methods. We conclude that the proposed standard of 3-cycle observations prior to setting an MRM diagnosis should be extended with at least one perimenstrual window to obtain sufficient information for statistical processing. © 2014 American Headache Society.
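To make the test statistic named above concrete, here is a minimal sketch of a one-sided Fisher's exact test with mid-p correction on a 2 × 2 table. The table layout (rows: days inside or outside the perimenstrual window; columns: migraine or no migraine), the choice of a one-sided test, and the function name `fisher_mid_p` are assumptions for illustration. The paper's two-parameter null model and its cutoff tables are not reproduced.

```python
from math import comb


def fisher_mid_p(a, b, c, d):
    """One-sided (excess in cell a) Fisher exact mid-p value for [[a, b], [c, d]]."""
    row1, col1, n = a + b, a + c, a + b + c + d

    def pmf(k):
        # Hypergeometric probability of k in the top-left cell with margins fixed.
        return comb(col1, k) * comb(n - col1, row1 - k) / comb(n, row1)

    k_max = min(row1, col1)
    upper_tail = sum(pmf(k) for k in range(a + 1, k_max + 1))
    # Mid-p: count only half the probability of the observed table itself.
    return upper_tail + 0.5 * pmf(a)


# Hypothetical diary summary: 3 of 4 window days with migraine, 1 of 4 other days.
p_mid = fisher_mid_p(3, 1, 1, 3)
```

The mid-p variant is less conservative than the ordinary exact p-value, which would add the full probability of the observed table rather than half of it.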
A Note on Comparing the Power of Test Statistics at Low Significance Levels.

Science.gov (United States)

Morris, Nathan; Elston, Robert

2011-01-01

It is an obvious fact that the power of a test statistic is dependent upon the significance (alpha) level at which the test is performed. It is perhaps a less obvious fact that the relative performance of two statistics in terms of power is also a function of the alpha level. Through numerous personal discussions, we have noted that even some competent statisticians have the mistaken intuition that relative power comparisons at traditional levels such as α = 0.05 will be roughly similar to relative power comparisons at very low levels, such as the level α = 5 × 10⁻⁸, which is commonly used in genome-wide association studies. In this brief note, we demonstrate that this notion is in fact quite wrong, especially with respect to comparing tests with differing degrees of freedom. In fact, at very low alpha levels the cost of additional degrees of freedom is often comparatively low. Thus we recommend that statisticians exercise caution when interpreting the results of power comparison studies which use alpha levels that will not be used in practice.
Statistical significance of epidemiological data. Seminar: Evaluation of epidemiological studies

International Nuclear Information System (INIS)

Weber, K.H.

1993-01-01

In stochastic damages, the numbers of events, e.g. the persons who are affected by or have died of cancer, and thus the relative frequencies (incidence or mortality) are binomially distributed random variables. Their statistical fluctuations can be characterized by confidence intervals. For epidemiologic questions, especially for the analysis of stochastic damages in the low dose range, the following issues are interesting: - Is a sample (a group of persons) with a definite observed damage frequency part of the whole population? - Is an observed frequency difference between two groups of persons random or statistically significant? - Is an observed increase or decrease of the frequencies with increasing dose random or statistically significant and how large is the regression coefficient (= risk coefficient) in this case? These problems can be solved by statistical tests. So-called distribution-free tests and tests which are not bound to the supposition of normal distribution are of particular interest, such as: - χ² independence test (test in contingency tables); - Fisher-Yates test; - trend test according to Cochran; - rank correlation test given by Spearman. These tests are explained in terms of selected epidemiologic data, e.g. of leukaemia clusters, of the cancer mortality of the Japanese A-bomb survivors especially in the low dose range as well as on the sample of the cancer mortality in the high background area in Yangjiang (China). (orig.) [de

IGESS: a statistical approach to integrating individual-level genotype data and summary statistics in genome-wide association studies.

Science.gov (United States)

Dai, Mingwei; Ming, Jingsi; Cai, Mingxuan; Liu, Jin; Yang, Can; Wan, Xiang; Xu, Zongben

2017-09-15

Results from genome-wide association studies (GWAS) suggest that a complex phenotype is often affected by many variants with small effects, known as 'polygenicity'. Tens of thousands of samples are often required to ensure statistical power of identifying these variants with small effects. However, it is often the case that a research group can only get approval for the access to individual-level genotype data with a limited sample size (e.g. a few hundreds or thousands). Meanwhile, summary statistics generated using single-variant-based analysis are becoming publicly available. The sample sizes associated with the summary statistics datasets are usually quite large. How to make the most efficient use of existing abundant data resources largely remains an open question. In this study, we propose a statistical approach, IGESS, to increase the statistical power of identifying risk variants and improve the accuracy of risk prediction by integrating individual-level genotype data and summary statistics. An efficient algorithm based on variational inference is developed to handle the genome-wide analysis. Through comprehensive simulation studies, we demonstrated the advantages of IGESS over the methods which take either individual-level data or summary statistics data as input. We applied IGESS to perform integrative analysis of Crohn's disease from WTCCC and summary statistics from other studies. IGESS was able to significantly increase the statistical power of identifying risk variants and improve the risk prediction accuracy from 63.2% (±0.4%) to 69.4% (±0.1%) using about 240 000 variants. The IGESS software is available at https://github.com/daviddaigithub/IGESS. Contact: zbxu@xjtu.edu.cn or xwan@comp.hkbu.edu.hk or eeyang@hkbu.edu.hk. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Statistical Significance and Effect Size: Two Sides of a Coin.

Science.gov (United States)

Fan, Xitao

This paper suggests that statistical significance testing and effect size are two sides of the same coin; they complement each other, but do not substitute for one another. Good research practice requires that both should be taken into consideration to make sound quantitative decisions. A Monte Carlo simulation experiment was conducted, and a…
Publication of statistically significant research findings in prosthodontics & implant dentistry in the context of other dental specialties.

Science.gov (United States)

Papageorgiou, Spyridon N; Kloukos, Dimitrios; Petridis, Haralampos; Pandis, Nikolaos

2015-10-01

To assess the hypothesis that there is excessive reporting of statistically significant studies published in prosthodontic and implantology journals, which could indicate selective publication. The last 30 issues of 9 journals in prosthodontics and implant dentistry were hand-searched for articles with statistical analyses. The percentages of significant and non-significant results were tabulated by parameter of interest. Univariable/multivariable logistic regression analyses were applied to identify possible predictors of reporting statistically significant findings. The results of this study were compared with similar studies in dentistry with random-effects meta-analyses. Of the 2323 included studies, 71% reported statistically significant results, with the significant results ranging from 47% to 86%. Multivariable modeling identified that geographical area and involvement of a statistician were predictors of statistically significant results. Compared to interventional studies, the odds that in vitro and observational studies would report statistically significant results were increased by 1.20 times (OR: 2.20, 95% CI: 1.66-2.92) and 0.35 times (OR: 1.35, 95% CI: 1.05-1.73), respectively. The probability of statistically significant results from randomized controlled trials was significantly lower compared to various study designs (difference: 30%, 95% CI: 11-49%). Likewise the probability of statistically significant results in prosthodontics and implant dentistry was lower compared to other dental specialties, but this result did not reach statistical significance (P>0.05). The majority of studies identified in the fields of prosthodontics and implant dentistry presented statistically significant results. The same trend existed in publications of other specialties in dentistry. Copyright © 2015 Elsevier Ltd. All rights reserved.
Significant Statistics: Viewed with a Contextual Lens

Science.gov (United States)

Tait-McCutcheon, Sandi

2010-01-01

This paper examines the pedagogical and organisational changes three lead teachers made to their statistics teaching and learning programs. The lead teachers posed the research question: What would the effect of contextually integrating statistical investigations and literacies into other curriculum areas be on student achievement? By finding the…
"What If" Analyses: Ways to Interpret Statistical Significance Test Results Using EXCEL or "R"

Science.gov (United States)

Ozturk, Elif

2012-01-01

The present paper aims to review two motivations to conduct "what if" analyses using Excel and "R" to understand the statistical significance tests through the sample size context. "What if" analyses can be used to teach students what statistical significance tests really do and in applied research either prospectively to estimate what sample size…
Determining the significance of associations between two series of discrete events: bootstrap methods

Energy Technology Data Exchange (ETDEWEB)

Niehof, Jonathan T.; Morley, Steven K.

2012-01-01

We review and develop techniques to determine associations between series of discrete events. The bootstrap, a nonparametric statistical method, allows the determination of the significance of associations with minimal assumptions about the underlying processes. We find the key requirement for this method: one of the series must be widely spaced in time to guarantee the theoretical applicability of the bootstrap. If this condition is met, the calculated significance passes a reasonableness test. We conclude with some potential future extensions and caveats on the applicability of these methods. The techniques presented have been implemented in a Python-based software toolkit.
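The general idea can be sketched as follows, under loudly stated assumptions: association is measured here as the number of event pairs that fall within a time window of each other, and a percentile bootstrap over the events of the widely spaced series gives an interval for that count. This is an illustration only, not the authors' toolkit. The names `association_number` and `bootstrap_interval` and all parameters are hypothetical.

```python
import bisect
import random


def association_number(series_a, sorted_b, half_window, lag=0.0):
    """Count pairs (a, b) with |b - (a + lag)| <= half_window; sorted_b must be sorted."""
    total = 0
    for t in series_a:
        lo = bisect.bisect_left(sorted_b, t + lag - half_window)
        hi = bisect.bisect_right(sorted_b, t + lag + half_window)
        total += hi - lo
    return total


def bootstrap_interval(series_a, series_b, half_window, lag=0.0,
                       n_boot=1000, level=0.95, seed=0):
    """Percentile bootstrap interval, resampling the events of series_a with replacement.

    Assumes the events in series_a are spaced widely enough that each one's
    contribution to the count can be treated as independent.
    """
    rng = random.Random(seed)
    sorted_b = sorted(series_b)
    stats = sorted(
        association_number(rng.choices(series_a, k=len(series_a)),
                           sorted_b, half_window, lag)
        for _ in range(n_boot)
    )
    lo_idx = int((1 - level) / 2 * n_boot)
    hi_idx = int((1 + level) / 2 * n_boot) - 1
    return stats[lo_idx], stats[hi_idx]
```

One plausible use is to compare the interval at zero lag with the counts obtained at large lags, where no physical association is expected. If the zero-lag interval sits clearly above those background counts, the association is unlikely to be a chance coincidence.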
Statistical vs. Economic Significance in Economics and Econometrics: Further comments on McCloskey & Ziliak

DEFF Research Database (Denmark)

Engsted, Tom

I comment on the controversy between McCloskey & Ziliak and Hoover & Siegler on statistical versus economic significance, in the March 2008 issue of the Journal of Economic Methodology. I argue that while McCloskey & Ziliak are right in emphasizing 'real error', i.e. non-sampling error that cannot...... be eliminated through specification testing, they fail to acknowledge those areas in economics, e.g. rational expectations macroeconomics and asset pricing, where researchers clearly distinguish between statistical and economic significance and where statistical testing plays a relatively minor role in model...
Distinguishing between statistical significance and practical/clinical meaningfulness using statistical inference.

Science.gov (United States)

Wilkinson, Michael

2014-03-01

Decisions about support for predictions of theories in light of data are made using statistical inference. The dominant approach in sport and exercise science is the Neyman-Pearson (N-P) significance-testing approach. When applied correctly it provides a reliable procedure for making dichotomous decisions for accepting or rejecting zero-effect null hypotheses with known and controlled long-run error rates. Type I and type II error rates must be specified in advance and the latter controlled by conducting an a priori sample size calculation. The N-P approach does not provide the probability of hypotheses or indicate the strength of support for hypotheses in light of data, yet many scientists believe it does. Outcomes of analyses allow conclusions only about the existence of non-zero effects, and provide no information about the likely size of true effects or their practical/clinical value. Bayesian inference can show how much support data provide for different hypotheses, and how personal convictions should be altered in light of data, but the approach is complicated by formulating probability distributions about prior subjective estimates of population effects. A pragmatic solution is magnitude-based inference, which allows scientists to estimate the true magnitude of population effects and how likely they are to exceed an effect magnitude of practical/clinical importance, thereby integrating elements of subjective Bayesian-style thinking. While this approach is gaining acceptance, progress might be hastened if scientists appreciate the shortcomings of traditional N-P null hypothesis significance testing.
Codon Deviation Coefficient: a novel measure for estimating codon usage bias and its statistical significance

Directory of Open Access Journals (Sweden)

Zhang Zhang

2012-03-01

Background: Genetic mutation, selective pressure for translational efficiency and accuracy, level of gene expression, and protein function through natural selection are all believed to lead to codon usage bias (CUB). Therefore, informative measurement of CUB is of fundamental importance to making inferences regarding gene function and genome evolution. However, extant measures of CUB have not fully accounted for the quantitative effect of background nucleotide composition and have not statistically evaluated the significance of CUB in sequence analysis. Results: Here we propose a novel measure--Codon Deviation Coefficient (CDC)--that provides an informative measurement of CUB and its statistical significance without requiring any prior knowledge. Unlike previous measures, CDC estimates CUB by accounting for background nucleotide compositions tailored to codon positions and adopts bootstrapping to assess the statistical significance of CUB for any given sequence. We evaluate CDC by examining its effectiveness on simulated sequences and empirical data and show that CDC outperforms extant measures by achieving a more informative estimation of CUB and its statistical significance. Conclusions: As validated by both simulated and empirical data, CDC provides a highly informative quantification of CUB and its statistical significance, useful for determining comparative magnitudes and patterns of biased codon usage for genes or genomes with diverse sequence compositions.
Statistics Refresher for Molecular Imaging Technologists, Part 2: Accuracy of Interpretation, Significance, and Variance.

Science.gov (United States)

Farrell, Mary Beth

2018-06-01

This article is the second part of a continuing education series reviewing basic statistics that nuclear medicine and molecular imaging technologists should understand. In this article, the statistics for evaluating interpretation accuracy, significance, and variance are discussed. Throughout the article, actual statistics are pulled from the published literature. We begin by explaining 2 methods for quantifying interpretive accuracy: interreader and intrareader reliability. Agreement among readers can be expressed simply as a percentage. However, the Cohen κ-statistic is a more robust measure of agreement that accounts for chance. The higher the κ-statistic is, the higher is the agreement between readers. When 3 or more readers are being compared, the Fleiss κ-statistic is used. Significance testing determines whether the difference between 2 conditions or interventions is meaningful. Statistical significance is usually expressed using a number called a probability (P) value. Calculation of the P value is beyond the scope of this review. However, knowing how to interpret P values is important for understanding the scientific literature. Generally, a P value of less than 0.05 is considered significant and indicates that the results of the experiment are due to more than just chance. Variance, standard deviation (SD), confidence interval, and standard error (SE) explain the dispersion of data around a mean of a sample drawn from a population. SD is commonly reported in the literature. A small SD indicates that there is not much variation in the sample data. Many biologic measurements fall into what is referred to as a normal distribution taking the shape of a bell curve. In a normal distribution, 68% of the data will fall within 1 SD, 95% will fall within 2 SDs, and 99.7% will fall within 3 SDs. Confidence interval defines the range of possible values within which the population parameter is likely to lie and gives an idea of the precision of the statistic being
Systematic reviews of anesthesiologic interventions reported as statistically significant

DEFF Research Database (Denmark)

Imberger, Georgina; Gluud, Christian; Boylan, John

2015-01-01

statistically significant meta-analyses of anesthesiologic interventions, we used TSA to estimate power and imprecision in the context of sparse data and repeated updates. METHODS: We conducted a search to identify all systematic reviews with meta-analyses that investigated an intervention that may......: From 11,870 titles, we found 682 systematic reviews that investigated anesthesiologic interventions. In the 50 sampled meta-analyses, the median number of trials included was 8 (interquartile range [IQR], 5-14), the median number of participants was 964 (IQR, 523-1736), and the median number...
Using the Bootstrap Method for a Statistical Significance Test of Differences between Summary Histograms

Science.gov (United States)

Xu, Kuan-Man

2006-01-01

A new method is proposed to compare statistical differences between summary histograms, which are the histograms summed over a large ensemble of individual histograms. It consists of choosing a distance statistic for measuring the difference between summary histograms and using a bootstrap procedure to calculate the statistical significance level. Bootstrapping is an approach to statistical inference that makes few assumptions about the underlying probability distribution that describes the data. Three distance statistics are compared in this study. They are the Euclidean distance, the Jeffries-Matusita distance and the Kuiper distance. The data used in testing the bootstrap method are satellite measurements of cloud systems called cloud objects. Each cloud object is defined as a contiguous region/patch composed of individual footprints or fields of view. A histogram of measured values over footprints is generated for each parameter of each cloud object and then summary histograms are accumulated over all individual histograms in a given cloud-object size category. The results of statistical hypothesis tests using all three distances as test statistics are generally similar, indicating the validity of the proposed method. The Euclidean distance is determined to be most suitable after comparing the statistical tests of several parameters with distinct probability distributions among three cloud-object size categories. Impacts on the statistical significance levels resulting from differences in the total lengths of satellite footprint data between two size categories are also discussed.
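The procedure described in this abstract (a distance statistic between two summary histograms plus a bootstrap to obtain a significance level) can be sketched as follows. This is a minimal illustration under stated assumptions, not the author's implementation: it assumes each category is given as a 2-D array of per-object histogram counts, uses the Euclidean distance, and resamples individual histograms with replacement from the pooled ensemble to approximate the null hypothesis of no difference. All function names are hypothetical.

```python
import numpy as np


def summary_histogram(hists):
    """Sum individual histograms (rows) and normalize to unit total."""
    total = hists.sum(axis=0)
    return total / total.sum()


def euclidean_distance(h1, h2):
    """Euclidean distance between two normalized histograms."""
    return float(np.sqrt(np.sum((h1 - h2) ** 2)))


def bootstrap_pvalue(hists_a, hists_b, n_boot=2000, seed=0):
    """Return (observed distance, bootstrap p-value).

    Assumption: under the null hypothesis both categories share one
    population, so resamples are drawn from the pooled histograms.
    """
    rng = np.random.default_rng(seed)
    observed = euclidean_distance(summary_histogram(hists_a),
                                  summary_histogram(hists_b))
    pooled = np.vstack([hists_a, hists_b])
    n_a, n_b = len(hists_a), len(hists_b)
    exceed = 0
    for _ in range(n_boot):
        resample_a = pooled[rng.integers(0, len(pooled), size=n_a)]
        resample_b = pooled[rng.integers(0, len(pooled), size=n_b)]
        d = euclidean_distance(summary_histogram(resample_a),
                               summary_histogram(resample_b))
        if d >= observed:
            exceed += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return observed, (exceed + 1) / (n_boot + 1)
```

Swapping `euclidean_distance` for another distance function (for example a Jeffries-Matusita or Kuiper distance) would leave the resampling loop unchanged, which is the main reason for passing around normalized summary histograms here.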
P-Value, a true test of statistical significance? a cautionary note ...

African Journals Online (AJOL)

While it's not the intention of the founders of significance testing and hypothesis testing to have the two ideas intertwined as if they are complementary, the inconvenient marriage of the two practices into one coherent, convenient, incontrovertible and misinterpreted practice has dotted our standard statistics textbooks and ...
Codon Deviation Coefficient: A novel measure for estimating codon usage bias and its statistical significance

KAUST Repository

Zhang, Zhang

2012-03-22

Background: Genetic mutation, selective pressure for translational efficiency and accuracy, level of gene expression, and protein function through natural selection are all believed to lead to codon usage bias (CUB). Therefore, informative measurement of CUB is of fundamental importance to making inferences regarding gene function and genome evolution. However, extant measures of CUB have not fully accounted for the quantitative effect of background nucleotide composition and have not statistically evaluated the significance of CUB in sequence analysis. Results: Here we propose a novel measure--Codon Deviation Coefficient (CDC)--that provides an informative measurement of CUB and its statistical significance without requiring any prior knowledge. Unlike previous measures, CDC estimates CUB by accounting for background nucleotide compositions tailored to codon positions and adopts bootstrapping to assess the statistical significance of CUB for any given sequence. We evaluate CDC by examining its effectiveness on simulated sequences and empirical data and show that CDC outperforms extant measures by achieving a more informative estimation of CUB and its statistical significance. Conclusions: As validated by both simulated and empirical data, CDC provides a highly informative quantification of CUB and its statistical significance, useful for determining comparative magnitudes and patterns of biased codon usage for genes or genomes with diverse sequence compositions. © 2012 Zhang et al; licensee BioMed Central Ltd.
Measuring individual significant change on the Beck Depression Inventory-II through IRT-based statistics.

NARCIS (Netherlands)

Brouwer, D.; Meijer, R.R.; Zevalkink, D.J.

2013-01-01

Several researchers have emphasized that item response theory (IRT)-based methods should be preferred over classical approaches in measuring change for individual patients. In the present study we discuss and evaluate the use of IRT-based statistics to measure statistically significant individual
Strategies for Testing Statistical and Practical Significance in Detecting DIF with Logistic Regression Models

Science.gov (United States)

Fidalgo, Angel M.; Alavi, Seyed Mohammad; Amirian, Seyed Mohammad Reza

2014-01-01

This study examines three controversial aspects in differential item functioning (DIF) detection by logistic regression (LR) models: first, the relative effectiveness of different analytical strategies for detecting DIF; second, the suitability of the Wald statistic for determining the statistical significance of the parameters of interest; and…
Thyroid Autoimmunity and Behçet’s Disease: Is There a Significant Association?

Directory of Open Access Journals (Sweden)

Filiz Cebeci

2013-01-01

Full Text Available Background. Behçet’s disease (BD) could be regarded as an autoimmune disease in many aspects. Autoimmune thyroid disease (ATD) is frequently accompanied by various other autoimmune diseases. Nevertheless, there are still not enough data showing an association between BD and ATD. In addition, no controlled study indexed in PubMed evaluates thyroid autoimmunity using antithyroid peroxidase antibody in a large series of patients with BD. Methods. We aimed to investigate the frequency of ATD in patients with BD. The study included 124 patients with BD and 99 age- and sex-matched healthy volunteers. Results. Autoimmune thyroiditis was noted in 21 cases (16.9%) with BD. In the control group, 22 cases (22.22%) were diagnosed as autoimmune thyroiditis. There was no difference between the groups with respect to thyroid autoantibodies. There were no statistically significant differences between baseline TSH levels of the BD patients and of the controls. Statistically, the mean serum free T4 levels of the patients with BD were higher than those of the controls. Conclusions. No association could be found between BD and ATD. Therefore, it is not of significance to investigate thyroid autoimmunity in BD.
Thresholds for statistical and clinical significance in systematic reviews with meta-analytic methods

DEFF Research Database (Denmark)

Jakobsen, Janus Christian; Wetterslev, Jorn; Winkel, Per

2014-01-01

BACKGROUND: Thresholds for statistical significance when assessing meta-analysis results are being insufficiently demonstrated by traditional 95% confidence intervals and P-values. Assessment of intervention effects in systematic reviews with meta-analysis deserves greater rigour. METHODS......: Methodologies for assessing statistical and clinical significance of intervention effects in systematic reviews were considered. Balancing simplicity and comprehensiveness, an operational procedure was developed, based mainly on The Cochrane Collaboration methodology and the Grading of Recommendations...... Assessment, Development, and Evaluation (GRADE) guidelines. RESULTS: We propose an eight-step procedure for better validation of meta-analytic results in systematic reviews (1) Obtain the 95% confidence intervals and the P-values from both fixed-effect and random-effects meta-analyses and report the most...
Statistical testing and power analysis for brain-wide association study.

Science.gov (United States)

Gong, Weikang; Wan, Lin; Lu, Wenlian; Ma, Liang; Cheng, Fan; Cheng, Wei; Grünewald, Stefan; Feng, Jianfeng

2018-04-05

The identification of connexel-wise associations, which involves examining functional connectivities between pairwise voxels across the whole brain, is both statistically and computationally challenging. Although such a connexel-wise methodology has recently been adopted by brain-wide association studies (BWAS) to identify connectivity changes in several mental disorders, such as schizophrenia, autism and depression, multiple-comparison correction and power analysis methods designed specifically for connexel-wise analysis are still lacking. Therefore, we herein report the development of a rigorous statistical framework for connexel-wise significance testing based on Gaussian random field theory. It includes controlling the family-wise error rate (FWER) of multiple hypothesis tests using topological inference methods, and calculating power and sample size for a connexel-wise study. Our theoretical framework can control the false-positive rate accurately, as validated empirically using two resting-state fMRI datasets. Compared with Bonferroni correction and false discovery rate (FDR), it can reduce the false-positive rate and increase statistical power by appropriately utilizing the spatial information of fMRI data. Importantly, our method bypasses the need for non-parametric permutation to correct for multiple comparisons; thus, it can efficiently tackle large datasets with high-resolution fMRI images. The utility of our method is shown in a case-control study. Our approach can identify altered functional connectivities in a major depression disorder dataset, whereas existing methods fail. A software package is available at https://github.com/weikanggong/BWAS. Copyright © 2018 Elsevier B.V. All rights reserved.
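For readers unfamiliar with the two baseline corrections this abstract compares against, the sketch below shows Bonferroni control of the family-wise error rate and the Benjamini-Hochberg step-up procedure for the false discovery rate. It is a generic illustration, not code from the BWAS package, and it does not implement the random-field approach the authors propose. Function names and the example p-values are assumptions chosen for illustration.

```python
import numpy as np


def bonferroni(pvals, alpha=0.05):
    """Reject H0 where p <= alpha / m (controls the family-wise error rate)."""
    p = np.asarray(pvals, dtype=float)
    return p <= alpha / p.size


def benjamini_hochberg(pvals, alpha=0.05):
    """Benjamini-Hochberg step-up procedure (controls the FDR)."""
    p = np.asarray(pvals, dtype=float)
    m = p.size
    order = np.argsort(p)
    thresholds = alpha * np.arange(1, m + 1) / m
    below = p[order] <= thresholds
    reject = np.zeros(m, dtype=bool)
    if below.any():
        # Largest rank k whose sorted p-value sits under its threshold;
        # every hypothesis ranked at or below k is rejected.
        k = int(np.max(np.nonzero(below)[0]))
        reject[order[:k + 1]] = True
    return reject


# Made-up p-values, for illustration only.
pvals = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
print(bonferroni(pvals).sum(), benjamini_hochberg(pvals).sum())
```

Neither procedure uses the spatial structure of the data, which is the property the abstract says its topological inference method exploits.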
An Entropy-Based Statistic for Genomewide Association Studies

OpenAIRE

Zhao, Jinying; Boerwinkle, Eric; Xiong, Momiao

2005-01-01

Efficient genotyping methods and the availability of a large collection of single-nucleotide polymorphisms provide valuable tools for genetic studies of human disease. The standard χ2 statistic for case-control studies, which uses a linear function of allele frequencies, has limited power when the number of marker loci is large. We introduce a novel test statistic for genetic association studies that uses Shannon entropy and a nonlinear function of allele frequencies to amplify the difference...

A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data.

Science.gov (United States)

Luo, Li; Zhu, Yun; Xiong, Momiao

2012-06-01

The genome-wide association studies (GWAS) designed for next-generation sequencing data involve testing association of genomic variants, including common, low frequency, and rare variants. The current strategies for association studies are well developed for identifying association of common variants with the common diseases, but may be ill-suited when large amounts of allelic heterogeneity are present in sequence data. Recently, group tests that analyze their collective frequency differences between cases and controls shift the current variant-by-variant analysis paradigm for GWAS of common variants to the collective test of multiple variants in the association analysis of rare variants. However, group tests ignore differences in genetic effects among SNPs at different genomic locations. As an alternative to group tests, we developed a novel genome-information content-based statistic for testing association of the entire allele frequency spectrum of genomic variation with the diseases. To evaluate the performance of the proposed statistic, we use large-scale simulations based on whole genome low coverage pilot data in the 1000 Genomes Project to calculate the type 1 error rates and power of seven alternative statistics: a genome-information content-based statistic, the generalized T², collapsing method, multivariate and collapsing (CMC) method, individual χ² test, weighted-sum statistic, and variable threshold statistic. Finally, we apply the seven statistics to a published resequencing dataset from ANGPTL3, ANGPTL4, ANGPTL5, and ANGPTL6 genes in the Dallas Heart Study. We report that the genome-information content-based statistic has significantly improved type 1 error rates and higher power than the other six statistics in both simulated and empirical datasets.
von Neumann entropy associated with the haldane exclusion statistics

International Nuclear Information System (INIS)

Rajagopal, A.K.

1995-01-01

We obtain the von Neumann entropy per state of the Haldane exclusion statistics with parameter g in terms of the mean occupation number bar n{wlnw-(1+w)ln(1+w)}, where w=(1-bar n). This reduces correctly to the well known expressions in the limiting cases of Bose (g=0) and Fermi (g=1) statistics. We have derived the second and third order fluctuations in the occupation numbers for arbitrary g. An elegant general duality relationship between the w factor associated with the particle and that associated with the hole at the reciprocal g is deduced along with the attendant relationship between the two respective entropies
statistical fluid theory for associating fluids containing alternating ...

Indian Academy of Sciences (India)

Statistical associating fluid theory of homonuclear dimerized chain fluids and homonuclear ... The proposed models account for the appropriate .... where gHNM(1,1) is the expression for the contact value of the correlation func- tion of two ...
Intensive inpatient treatment for bulimia nervosa: Statistical and clinical significance of symptom changes.

Science.gov (United States)

Diedrich, Alice; Schlegl, Sandra; Greetfeld, Martin; Fumi, Markus; Voderholzer, Ulrich

2018-03-01

This study examines the statistical and clinical significance of symptom changes during an intensive inpatient treatment program with a strong psychotherapeutic focus for individuals with severe bulimia nervosa. 295 consecutively admitted bulimic patients were administered the Structured Interview for Anorexic and Bulimic Syndromes-Self-Rating (SIAB-S), the Eating Disorder Inventory-2 (EDI-2), the Brief Symptom Inventory (BSI), and the Beck Depression Inventory-II (BDI-II) at treatment intake and discharge. Results indicated statistically significant symptom reductions with large effect sizes regarding severity of binge eating and compensatory behavior (SIAB-S), overall eating disorder symptom severity (EDI-2), overall psychopathology (BSI), and depressive symptom severity (BDI-II) even when controlling for antidepressant medication. The majority of patients showed either reliable (EDI-2: 33.7%, BSI: 34.8%, BDI-II: 18.1%) or even clinically significant symptom changes (EDI-2: 43.2%, BSI: 33.9%, BDI-II: 56.9%). Patients with clinically significant improvement were less distressed at intake and less likely to suffer from a comorbid borderline personality disorder when compared with those who did not improve to a clinically significant extent. Findings indicate that intensive psychotherapeutic inpatient treatment may be effective in about 75% of severely affected bulimic patients. For the remaining non-responding patients, inpatient treatment might be improved through an even stronger focus on the reduction of comorbid borderline personality traits.
Statistical Measure Of Association Between Smoking And Lung ...

African Journals Online (AJOL)

Statistical Measure Of Association Between Smoking And Lung Cancer In Abakaliki, Ebonyi State Nigeria. ... East African Journal of Public Health ... To investigate the havoc caused by all these on people, questionnaire was distributed among smokers and non smokers in various areas of specialization and habitations.
Cloud-based solution to identify statistically significant MS peaks differentiating sample categories.

Science.gov (United States)

Ji, Jun; Ling, Jeffrey; Jiang, Helen; Wen, Qiaojun; Whitin, John C; Tian, Lu; Cohen, Harvey J; Ling, Xuefeng B

2013-03-23

Mass spectrometry (MS) has evolved to become the primary high-throughput tool for proteomics-based biomarker discovery. Multiple challenges in protein MS data analysis remain: large-scale and complex data set management; MS peak identification and indexing; and high-dimensional peak differential analysis with the concurrent statistical tests based false discovery rate (FDR). "Turnkey" solutions are needed for biomarker investigations to rapidly process MS data sets to identify statistically significant peaks for subsequent validation. Here we present an efficient and effective solution, which provides experimental biologists easy access to "cloud" computing capabilities to analyze MS data. The web portal can be accessed at http://transmed.stanford.edu/ssa/. The presented web application supports online uploading and analysis of large-scale MS data with a simple user interface. This bioinformatic tool will facilitate the discovery of potential protein biomarkers using MS.
Nursing students' attitudes toward statistics: Effect of a biostatistics course and association with examination performance.

Science.gov (United States)

Kiekkas, Panagiotis; Panagiotarou, Aliki; Malja, Alvaro; Tahirai, Daniela; Zykai, Rountina; Bakalis, Nick; Stefanopoulos, Nikolaos

2015-12-01

Although statistical knowledge and skills are necessary for promoting evidence-based practice, health sciences students have expressed anxiety about statistics courses, which may hinder their learning of statistical concepts. To evaluate the effects of a biostatistics course on nursing students' attitudes toward statistics and to explore the association between these attitudes and their performance in the course examination. One-group quasi-experimental pre-test/post-test design. Undergraduate nursing students of the fifth or higher semester of studies, who attended a biostatistics course. Participants were asked to complete the pre-test and post-test forms of The Survey of Attitudes Toward Statistics (SATS)-36 scale at the beginning and end of the course respectively. Pre-test and post-test scale scores were compared, while correlations between post-test scores and participants' examination performance were estimated. Among 156 participants, post-test scores of the overall SATS-36 scale and of the Affect, Cognitive Competence, Interest and Effort components were significantly higher than pre-test ones, indicating that the course was followed by more positive attitudes toward statistics. Among 104 students who participated in the examination, higher post-test scores of the overall SATS-36 scale and of the Affect, Difficulty, Interest and Effort components were significantly but weakly correlated with higher examination performance. Students' attitudes toward statistics can be improved through appropriate biostatistics courses, while positive attitudes contribute to higher course achievements and possibly to improved statistical skills in later professional life. Copyright © 2015 Elsevier Ltd. All rights reserved.
Power, effects, confidence, and significance: an investigation of statistical practices in nursing research.

Science.gov (United States)

Gaskin, Cadeyrn J; Happell, Brenda

2014-05-01

improvement. Most importantly, researchers should abandon the misleading practice of interpreting the results from inferential tests based solely on whether they are statistically significant (or not) and, instead, focus on reporting and interpreting effect sizes, confidence intervals, and significance levels. Nursing researchers also need to conduct and report a priori power analyses, and to address the issue of Type I experiment-wise error inflation in their studies. Crown Copyright © 2013. Published by Elsevier Ltd. All rights reserved.
Examining reproducibility in psychology : A hybrid method for combining a statistically significant original study and a replication

NARCIS (Netherlands)

Van Aert, R.C.M.; Van Assen, M.A.L.M.

2018-01-01

The unrealistically high rate of positive results within psychology has increased the attention to replication research. However, researchers who conduct a replication and want to statistically combine the results of their replication with a statistically significant original study encounter
A tutorial on hunting statistical significance by chasing N

Directory of Open Access Journals (Sweden)

Denes Szucs

2016-09-01

Full Text Available There is increasing concern about the replicability of studies in psychology and cognitive neuroscience. Hidden data dredging (also called p-hacking) is a major contributor to this crisis because it substantially increases Type I error resulting in a much larger proportion of false positive findings than the usually expected 5%. In order to build better intuition to avoid, detect and criticise some typical problems, here I systematically illustrate the large impact of some easy to implement and so, perhaps frequent data dredging techniques on boosting false positive findings. I illustrate several forms of two special cases of data dredging. First, researchers may violate the data collection stopping rules of null hypothesis significance testing by repeatedly checking for statistical significance with various numbers of participants. Second, researchers may group participants post-hoc along potential but unplanned independent grouping variables. The first approach 'hacks' the number of participants in studies, the second approach 'hacks' the number of variables in the analysis. I demonstrate the high amount of false positive findings generated by these techniques with data from true null distributions. I also illustrate that it is extremely easy to introduce strong bias into data by very mild selection and re-testing. Similar, usually undocumented data dredging steps can easily lead to having 20-50%, or more false positives.
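The first form of data dredging described in this abstract, repeatedly testing as participants are added, can be illustrated with a small simulation. The sketch below is not the tutorial's own code. It assumes data drawn from a true null distribution (standard normal, known unit variance), a two-sided z-test, and a check for significance after every 10 added participants up to 100. The function name and all parameter values are illustrative.

```python
import numpy as np
from statistics import NormalDist


def false_positive_rates(n_sims=2000, n_start=10, n_max=100, step=10,
                         alpha=0.05, seed=0):
    """Compare a single fixed-N test with repeated 'peeking' under a true null.

    Returns (fixed_rate, peeking_rate). Assumes (n_max - n_start) is a
    multiple of step, so that the last look uses all n_max observations.
    """
    rng = np.random.default_rng(seed)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    data = rng.standard_normal((n_sims, n_max))   # true effect is exactly zero
    looks = np.arange(n_start, n_max + 1, step)   # sample sizes at each check
    running_sum = np.cumsum(data, axis=1)
    z = running_sum[:, looks - 1] / np.sqrt(looks)  # z statistic at each look
    significant = np.abs(z) > z_crit
    fixed = significant[:, -1].mean()         # test once, at the planned N
    peeking = significant.any(axis=1).mean()  # stop at the first p < alpha
    return float(fixed), float(peeking)


fixed, peeking = false_positive_rates()
print(f"fixed N: {fixed:.3f}  optional stopping: {peeking:.3f}")
```

Under these assumptions the fixed-N rate should stay near the nominal 5%, while the optional-stopping rate is inflated several-fold even though every individual test uses the same threshold.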
Evaluation and application of summary statistic imputation to discover new height-associated loci.

Science.gov (United States)

Rüeger, Sina; McDaid, Aaron; Kutalik, Zoltán

2018-05-01

As most of the heritability of complex traits is attributed to common and low frequency genetic variants, imputing them by combining genotyping chips and large sequenced reference panels is the most cost-effective approach to discover the genetic basis of these traits. Association summary statistics from genome-wide meta-analyses are available for hundreds of traits. Updating these to ever-increasing reference panels is very cumbersome as it requires reimputation of the genetic data, rerunning the association scan, and meta-analysing the results. A much more efficient method is to directly impute the summary statistics, termed summary statistics imputation, which we improved to accommodate variable sample size across SNVs. Its performance relative to genotype imputation and practical utility has not yet been fully investigated. To this end, we compared the two approaches on real (genotyped and imputed) data from 120K samples from the UK Biobank and show that genotype imputation boasts a 3- to 5-fold lower root-mean-square error, and better distinguishes true associations from null ones. We observed the largest differences in power for variants with low minor allele frequency and low imputation quality. For fixed false positive rates of 0.001, 0.01, 0.05, using summary statistics imputation yielded a decrease in statistical power by 9, 43 and 35%, respectively. To test its capacity to discover novel associations, we applied summary statistics imputation to the GIANT height meta-analysis summary statistics covering HapMap variants, and identified 34 novel loci, 19 of which replicated using data in the UK Biobank. Additionally, we successfully replicated 55 out of the 111 variants published in an exome chip study. Our study demonstrates that summary statistics imputation is a very efficient and cost-effective way to identify and fine-map trait-associated loci. Moreover, the ability to impute summary statistics is important for follow-up analyses, such as Mendelian
Probability, statistics, and associated computing techniques

International Nuclear Information System (INIS)

James, F.

1983-01-01

This chapter attempts to explore the extent to which it is possible for the experimental physicist to find optimal statistical techniques to provide a unique and unambiguous quantitative measure of the significance of raw data. Discusses statistics as the inverse of probability; normal theory of parameter estimation; normal theory (Gaussian measurements); the universality of the Gaussian distribution; real-life resolution functions; combination and propagation of uncertainties; the sum or difference of 2 variables; local theory, or the propagation of small errors; error on the ratio of 2 discrete variables; the propagation of large errors; confidence intervals; classical theory; Bayesian theory; use of the likelihood function; the second derivative of the log-likelihood function; multiparameter confidence intervals; the method of MINOS; least squares; the Gauss-Markov theorem; maximum likelihood for uniform error distribution; the Chebyshev fit; the parameter uncertainties; the efficiency of the Chebyshev estimator; error symmetrization; robustness vs. efficiency; testing of hypotheses (e.g., the Neyman-Pearson test); goodness-of-fit; distribution-free tests; comparing two one-dimensional distributions; comparing multidimensional distributions; and permutation tests for comparing two point sets
A weighted U-statistic for genetic association analyses of sequencing data.

Science.gov (United States)

Wei, Changshuai; Li, Ming; He, Zihuai; Vsevolozhskaya, Olga; Schaid, Daniel J; Lu, Qing

2014-12-01

With advancements in next-generation sequencing technology, a massive amount of sequencing data is generated, which offers a great opportunity to comprehensively investigate the role of rare variants in the genetic etiology of complex diseases. Nevertheless, the high-dimensional sequencing data poses a great challenge for statistical analysis. The association analyses based on traditional statistical methods suffer substantial power loss because of the low frequency of genetic variants and the extremely high dimensionality of the data. We developed a Weighted U Sequencing test, referred to as WU-SEQ, for the high-dimensional association analysis of sequencing data. Based on a nonparametric U-statistic, WU-SEQ makes no assumption of the underlying disease model and phenotype distribution, and can be applied to a variety of phenotypes. Through simulation studies and an empirical study, we showed that WU-SEQ outperformed a commonly used sequence kernel association test (SKAT) method when the underlying assumptions were violated (e.g., the phenotype followed a heavy-tailed distribution). Even when the assumptions were satisfied, WU-SEQ still attained comparable performance to SKAT. Finally, we applied WU-SEQ to sequencing data from the Dallas Heart Study (DHS), and detected an association between ANGPTL 4 and very low density lipoprotein cholesterol. © 2014 WILEY PERIODICALS, INC.
A critical discussion of null hypothesis significance testing and statistical power analysis within psychological research

DEFF Research Database (Denmark)

Jones, Allan; Sommerlund, Bo

2007-01-01

The uses of null hypothesis significance testing (NHST) and statistical power analysis within psychological research are critically discussed. The article looks at the problems of relying solely on NHST when dealing with small and large sample sizes. The use of power-analysis in estimating...... the potential error introduced by small and large samples is advocated. Power analysis is not recommended as a replacement to NHST but as an additional source of information about the phenomena under investigation. Moreover, the importance of conceptual analysis in relation to statistical analysis of hypothesis...
Confounding and Statistical Significance of Indirect Effects: Childhood Adversity, Education, Smoking, and Anxious and Depressive Symptomatology

Directory of Open Access Journals (Sweden)

Mashhood Ahmed Sheikh

2017-08-01

mediate the association between childhood adversity and ADS in adulthood. However, when education was excluded as a mediator-response confounding variable, the indirect effect of childhood adversity on ADS in adulthood was statistically significant (p < 0.05). This study shows that a careful inclusion of potential confounding variables is important when assessing mediation.
Statistical significance estimation of a signal within the GooFit framework on GPUs

Directory of Open Access Journals (Sweden)

Cristella Leonardo

2017-01-01

Full Text Available In order to test the computing capabilities of GPUs with respect to traditional CPU cores, a high-statistics toy Monte Carlo technique has been implemented in both the ROOT/RooFit and GooFit frameworks with the aim of estimating the statistical significance of the structure observed by CMS close to the kinematical boundary of the J/ψϕ invariant mass in the three-body decay B+ → J/ψϕK+. GooFit is an open data analysis tool under development that interfaces ROOT/RooFit to the CUDA platform on nVidia GPUs. The optimized GooFit application running on GPUs hosted by servers in the Bari Tier2 provides striking speed-up performances with respect to the RooFit application parallelised on multiple CPUs by means of the PROOF-Lite tool. The considerable resulting speed-up, evident when comparing concurrent GooFit processes allowed by CUDA Multi Process Service and a RooFit/PROOF-Lite process with multiple CPU workers, is presented and discussed in detail. By means of GooFit it has also been possible to explore the behaviour of a likelihood ratio test statistic in different situations in which the Wilks Theorem may or may not apply because its regularity conditions are not satisfied.
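The toy Monte Carlo idea in this abstract can be illustrated on a deliberately simple model where Wilks' theorem does hold. The sketch below is an assumption-laden teaching example in NumPy, not GooFit or RooFit code: each toy is a sample from a unit-variance Gaussian, the null hypothesis fixes the mean at zero, and the likelihood ratio statistic reduces to n times the squared sample mean, which follows a chi-square distribution with one degree of freedom under the null. Function names are hypothetical.

```python
import numpy as np


def toy_lr_statistics(n_toys=20000, n_events=50, seed=1):
    """Generate -2 ln(likelihood ratio) for null-hypothesis toys.

    Model assumed here: x ~ N(mu, 1), H0: mu = 0, H1: mu free.
    For this model -2 ln(L0 / Lmax) equals n_events * mean(x)**2.
    """
    rng = np.random.default_rng(seed)
    samples = rng.standard_normal((n_toys, n_events))
    return n_events * samples.mean(axis=1) ** 2


def toy_pvalue(q_observed, q_toys):
    """Fraction of null toys at least as extreme as the observed statistic."""
    return (np.sum(q_toys >= q_observed) + 1) / (len(q_toys) + 1)


q_toys = toy_lr_statistics()
# 3.841 is the 95% quantile of a chi-square distribution with 1 degree of
# freedom, so about 5% of toys should exceed it if Wilks' theorem applies.
print(np.mean(q_toys > 3.841), toy_pvalue(9.0, q_toys))
```

When the regularity conditions fail (for example a parameter on a boundary, or a nuisance parameter undefined under the null), the toy distribution no longer matches the chi-square reference, and the toy-based p-value is the one to trust. That is why large numbers of toys, and hence fast fitting, matter in that setting.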
Is statistical significance clinically important?--A guide to judge the clinical relevance of study findings

NARCIS (Netherlands)

Sierevelt, Inger N.; van Oldenrijk, Jakob; Poolman, Rudolf W.

2007-01-01

In this paper we describe several issues that influence the reporting of statistical significance in relation to clinical importance, since misinterpretation of p values is a common issue in orthopaedic literature. Orthopaedic research is tormented by the risks of false-positive (type I error) and
A powerful score-based test statistic for detecting gene-gene co-association.

Science.gov (United States)

Xu, Jing; Yuan, Zhongshang; Ji, Jiadong; Zhang, Xiaoshuai; Li, Hongkai; Wu, Xuesen; Xue, Fuzhong; Liu, Yanxun

2016-01-29

The genetic variants identified by genome-wide association studies (GWAS) can account for only a small proportion of the total heritability of complex diseases. The existence of gene-gene joint effects, which contain the main effects and their co-association, is one possible explanation for the "missing heritability" problem. Gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, due not only to the traditional interaction under a nearly independent condition but also to the correlation between genes. Generally, genes tend to work collaboratively within a specific pathway or network contributing to the disease, and the specific disease-associated loci will often be highly correlated (e.g. single nucleotide polymorphisms (SNPs) in linkage disequilibrium). Therefore, we proposed a novel score-based statistic (SBS) as a gene-based method for detecting gene-gene co-association. Various simulations illustrate that, under different sample sizes, marginal effects of causal SNPs and co-association levels, the proposed SBS performs better than other existing methods, including the single SNP-based and principal component analysis (PCA)-based logistic regression models, the statistics based on canonical correlations (CCU), kernel canonical correlation analysis (KCCU), partial least squares path modeling (PLSPM) and the delta-square (δ²) statistic. The real data analysis of rheumatoid arthritis (RA) further confirmed its advantages in practice. SBS is a powerful and efficient gene-based method for detecting gene-gene co-association.
Novel loci and pathways significantly associated with longevity

DEFF Research Database (Denmark)

Zeng, Yi; Nie, Chao; Min, Junxia

2016-01-01

Only two genome-wide significant loci associated with longevity have been identified so far, probably because of insufficient sample sizes of centenarians, whose genomes may harbor genetic variants associated with health and longevity. Here we report a genome-wide association study (GWAS) of Han ...
Statistical significance of theoretical predictions: A new dimension in nuclear structure theories (I)

International Nuclear Information System (INIS)

DUDEK, J; SZPAK, B; FORNAL, B; PORQUET, M-G

2011-01-01

In this and the follow-up article we briefly discuss what we believe represents one of the most serious problems in contemporary nuclear structure: the question of statistical significance of parametrizations of nuclear microscopic Hamiltonians and the implied predictive power of the underlying theories. In the present Part I, we introduce the main lines of reasoning of the so-called Inverse Problem Theory, an important sub-field in the contemporary Applied Mathematics, here illustrated on the example of the Nuclear Mean-Field Approach.

Statistical Significance of the Contribution of Variables to the PCA Solution: An Alternative Permutation Strategy

Science.gov (United States)

Linting, Marielle; van Os, Bart Jan; Meulman, Jacqueline J.

2011-01-01

In this paper, the statistical significance of the contribution of variables to the principal components in principal components analysis (PCA) is assessed nonparametrically by the use of permutation tests. We compare a new strategy to a strategy used in previous research consisting of permuting the columns (variables) of a data matrix…
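The column-permutation strategy that this abstract attributes to previous research can be sketched as follows. This is a minimal illustration under assumptions, not the authors' new strategy (which the truncated abstract does not describe): it tests only the variance accounted for by the first principal component, uses a plain power iteration instead of a PCA library, and all function names and the simulated data are hypothetical.

```python
import math
import random


def correlation_matrix(columns):
    """Pearson correlation matrix of equally long columns (no constant columns)."""
    n = len(columns[0])
    standardized = []
    for col in columns:
        mean = sum(col) / n
        sd = math.sqrt(sum((x - mean) ** 2 for x in col) / n)
        standardized.append([(x - mean) / sd for x in col])
    return [[sum(a * b for a, b in zip(u, v)) / n for v in standardized]
            for u in standardized]


def leading_eigenvalue(matrix, rng, iterations=200):
    """Largest eigenvalue of a symmetric positive semidefinite matrix by power iteration."""
    vector = [rng.gauss(0.0, 1.0) for _ in matrix]  # random start avoids a degenerate direction
    eigenvalue = 0.0
    for _ in range(iterations):
        product = [sum(m * v for m, v in zip(row, vector)) for row in matrix]
        eigenvalue = math.sqrt(sum(x * x for x in product))
        vector = [x / eigenvalue for x in product]
    return eigenvalue


def permutation_p_value(columns, n_permutations=500, seed=0):
    """Permute every column independently to break the correlations between variables."""
    rng = random.Random(seed)
    observed = leading_eigenvalue(correlation_matrix(columns), rng)
    exceed = 0
    for _ in range(n_permutations):
        shuffled = [rng.sample(col, len(col)) for col in columns]
        if leading_eigenvalue(correlation_matrix(shuffled), rng) >= observed:
            exceed += 1
    return observed, (exceed + 1) / (n_permutations + 1)


if __name__ == "__main__":
    data_rng = random.Random(42)
    factor = [data_rng.gauss(0, 1) for _ in range(60)]
    # Four simulated variables that share one common factor plus independent noise.
    columns = [[f + data_rng.gauss(0, 1) for f in factor] for _ in range(4)]
    print(permutation_p_value(columns))
```

Because each column is shuffled separately, the marginal distribution of every variable is preserved while the between-variable structure is destroyed, which is what makes the permuted eigenvalues a usable null reference.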
THE MILKY WAY PROJECT: A STATISTICAL STUDY OF MASSIVE STAR FORMATION ASSOCIATED WITH INFRARED BUBBLES

International Nuclear Information System (INIS)

Kendrew, S.; Robitaille, T. P.; Simpson, R.; Lintott, C. J.; Bressert, E.; Povich, M. S.; Sherman, R.; Schawinski, K.; Wolf-Chase, G.

2012-01-01

The Milky Way Project citizen science initiative recently increased the number of known infrared bubbles in the inner Galactic plane by an order of magnitude compared to previous studies. We present a detailed statistical analysis of this data set with the Red MSX Source (RMS) catalog of massive young stellar sources to investigate the association of these bubbles with massive star formation. We particularly address the question of massive triggered star formation near infrared bubbles. We find a strong positional correlation of massive young stellar objects (MYSOs) and H II regions with Milky Way Project bubbles at separations of <2 bubble radii. As bubble sizes increase, a statistically significant overdensity of massive young sources emerges in the region of the bubble rims, possibly indicating the occurrence of triggered star formation. Based on numbers of bubble-associated RMS sources, we find that 67% ± 3% of MYSOs and (ultra-)compact H II regions appear to be associated with a bubble. We estimate that approximately 22% ± 2% of massive young stars may have formed as a result of feedback from expanding H II regions. Using MYSO-bubble correlations, we serendipitously recovered the location of the recently discovered massive cluster Mercer 81, suggesting the potential of such analyses for discovery of heavily extincted distant clusters.
THE MILKY WAY PROJECT: A STATISTICAL STUDY OF MASSIVE STAR FORMATION ASSOCIATED WITH INFRARED BUBBLES

Energy Technology Data Exchange (ETDEWEB)

Kendrew, S.; Robitaille, T. P. [Max-Planck-Institut fuer Astronomie, Koenigstuhl 17, D-69117 Heidelberg (Germany); Simpson, R.; Lintott, C. J. [Department of Astrophysics, University of Oxford, Denys Wilkinson Building, Keble Road, Oxford OX1 3RH (United Kingdom); Bressert, E. [School of Physics, University of Exeter, Stocker Road, Exeter EX4 4QL (United Kingdom); Povich, M. S. [Department of Astronomy and Astrophysics, Pennsylvania State University, 525 Davey Laboratory, University Park, PA 16802 (United States); Sherman, R. [Department of Astronomy and Astrophysics, University of Chicago, 5640 S. Ellis Avenue, Chicago, IL 60637 (United States); Schawinski, K. [Yale Center for Astronomy and Astrophysics, Yale University, P.O. Box 208121, New Haven, CT 06520 (United States); Wolf-Chase, G., E-mail: kendrew@mpia.de [Astronomy Department, Adler Planetarium, 1300 S. Lake Shore Drive, Chicago, IL 60605 (United States)

2012-08-10

The Milky Way Project citizen science initiative recently increased the number of known infrared bubbles in the inner Galactic plane by an order of magnitude compared to previous studies. We present a detailed statistical analysis of this data set with the Red MSX Source (RMS) catalog of massive young stellar sources to investigate the association of these bubbles with massive star formation. We particularly address the question of massive triggered star formation near infrared bubbles. We find a strong positional correlation of massive young stellar objects (MYSOs) and H II regions with Milky Way Project bubbles at separations of <2 bubble radii. As bubble sizes increase, a statistically significant overdensity of massive young sources emerges in the region of the bubble rims, possibly indicating the occurrence of triggered star formation. Based on numbers of bubble-associated RMS sources, we find that 67% ± 3% of MYSOs and (ultra-)compact H II regions appear to be associated with a bubble. We estimate that approximately 22% ± 2% of massive young stars may have formed as a result of feedback from expanding H II regions. Using MYSO-bubble correlations, we serendipitously recovered the location of the recently discovered massive cluster Mercer 81, suggesting the potential of such analyses for discovery of heavily extincted distant clusters.
ClusterSignificance: A bioconductor package facilitating statistical analysis of class cluster separations in dimensionality reduced data

DEFF Research Database (Denmark)

Serviss, Jason T.; Gådin, Jesper R.; Eriksson, Per

2017-01-01

, e.g. genes in a specific pathway, alone can separate samples into these established classes. Despite this, the evaluation of class separations is often subjective and performed via visualization. Here we present the ClusterSignificance package; a set of tools designed to assess the statistical...... significance of class separations downstream of dimensionality reduction algorithms. In addition, we demonstrate the design and utility of the ClusterSignificance package and utilize it to determine the importance of long non-coding RNA expression in the identity of multiple hematological malignancies....
Statistical learning and selective inference.

Science.gov (United States)

Taylor, Jonathan; Tibshirani, Robert J

2015-06-23

We describe the problem of "selective inference." This addresses the following challenge: Having mined a set of data to find potential associations, how do we properly assess the strength of these associations? The fact that we have "cherry-picked"--searched for the strongest associations--means that we must set a higher bar for declaring significant the associations that we see. This challenge becomes more important in the era of big data and complex statistical modeling. The cherry tree (dataset) can be very large and the tools for cherry picking (statistical learning methods) are now very sophisticated. We describe some recent new developments in selective inference and illustrate their use in forward stepwise regression, the lasso, and principal components analysis.
How significant is the ‘significant other’? Associations between significant others’ health behaviors and attitudes and young adults’ health outcomes

Directory of Open Access Journals (Sweden)

Berge Jerica M

2012-04-01

Background: Having a significant other has been shown to be protective against physical and psychological health conditions for adults. Less is known about the period of emerging young adulthood and associations between significant others’ weight and weight-related health behaviors (e.g. healthy dietary intake, the frequency of physical activity, weight status). This study examined the association between significant others’ health attitudes and behaviors regarding eating and physical activity and young adults’ weight status, dietary intake, and physical activity. Methods: This study uses data from Project EAT-III, a population-based cohort study with emerging young adults from diverse ethnic and socioeconomic backgrounds (n = 1212). Logistic regression models examining cross-sectional associations, adjusted for sociodemographics and health behaviors five years earlier, were used to estimate predicted probabilities and calculate prevalence differences. Results: Young adult women whose significant others had health promoting attitudes/behaviors were significantly less likely to be overweight/obese and were more likely to eat ≥ 5 fruits/vegetables per day and engage in ≥ 3.5 hours/week of physical activity, compared to women whose significant others did not have health promoting behaviors/attitudes. Young adult men whose significant other had health promoting behaviors/attitudes were more likely to engage in ≥ 3.5 hours/week of physical activity compared to men whose significant others did not have health promoting behaviors/attitudes. Conclusions: Findings suggest the protective nature of the significant other with regard to weight-related health behaviors of young adults, particularly for young adult women. Obesity prevention efforts should consider the importance of including the significant other in intervention efforts with young adult women and potentially men.
Statistical significance versus clinical importance: trials on exercise therapy for chronic low back pain as example.

NARCIS (Netherlands)

van Tulder, M.W.; Malmivaara, A.; Hayden, J.; Koes, B.

2007-01-01

STUDY DESIGN. Critical appraisal of the literature. OBJECTIVES. The objective of this study was to assess if results of back pain trials are statistically significant and clinically important. SUMMARY OF BACKGROUND DATA. There seems to be a discrepancy between conclusions reported by authors and
Statistical power of model selection strategies for genome-wide association studies.

Directory of Open Access Journals (Sweden)

Zheyang Wu

2009-07-01

Genome-wide association studies (GWAS) aim to identify genetic variants related to diseases by examining the associations between phenotypes and hundreds of thousands of genotyped markers. Because many genes are potentially involved in common diseases and a large number of markers are analyzed, it is crucial to devise an effective strategy to identify truly associated variants that have individual and/or interactive effects, while controlling false positives at the desired level. Although a number of model selection methods have been proposed in the literature, including marginal search, exhaustive search, and forward search, their relative performance has only been evaluated through limited simulations due to the lack of an analytical approach to calculating the power of these methods. This article develops a novel statistical approach for power calculation, derives accurate formulas for the power of different model selection strategies, and then uses the formulas to evaluate and compare these strategies in genetic model spaces. In contrast to previous studies, our theoretical framework allows for random genotypes, correlations among test statistics, and a false-positive control based on GWAS practice. After the accuracy of our analytical results is validated through simulations, they are utilized to systematically evaluate and compare the performance of these strategies in a wide class of genetic models. For a specific genetic model, our results clearly reveal how different factors, such as effect size, allele frequency, and interaction, jointly affect the statistical power of each strategy. An example is provided for the application of our approach to empirical research. The statistical approach used in our derivations is general and can be employed to address the model selection problems in other random predictor settings. We have developed an R package markerSearchPower to implement our formulas, which can be downloaded from the
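To make the trade-off between the number of markers and power concrete, the following Python sketch computes the power of a single two-sided z-test under a Bonferroni-corrected threshold. It is a deliberately simplified textbook calculation, not the formulas derived in the article and not the markerSearchPower package: it assumes independent tests and a fixed non-centrality parameter, and the function name and arguments are hypothetical.

```python
from statistics import NormalDist


def marginal_power(noncentrality, n_markers, family_alpha=0.05):
    """Power of one two-sided z-test at a Bonferroni-corrected level.

    noncentrality: expected value of the z statistic for the causal marker (assumed known).
    n_markers: number of markers tested, used only for the Bonferroni correction.
    """
    std_normal = NormalDist()
    per_test_alpha = family_alpha / n_markers
    critical = std_normal.inv_cdf(1.0 - per_test_alpha / 2.0)
    upper_tail = 1.0 - std_normal.cdf(critical - noncentrality)
    lower_tail = std_normal.cdf(-critical - noncentrality)
    return upper_tail + lower_tail


if __name__ == "__main__":
    for ncp in (3.0, 4.0, 5.0, 6.0):
        print(ncp, round(marginal_power(ncp, n_markers=500_000), 4))
```

With a non-centrality of zero the function returns the per-test significance level itself, which is a quick sanity check on the threshold.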
To Be or Not to Be Associated: Power study of four statistical modeling approaches to identify parasite associations in cross-sectional studies

Directory of Open Access Journals (Sweden)

Elise Vaumourin

2014-05-01

A growing number of studies are reporting simultaneous infections by parasites in many different hosts. Detecting whether these parasites are significantly associated is important in medicine and epidemiology. Numerous approaches to detect associations are available, but only a few provide statistical tests. Furthermore, they generally test for an overall association and do not identify which parasite is associated with which other one. Here, we developed a new approach, the association screening approach, to detect both the overall presence and the details of multi-parasite associations. We studied the power of this new approach and of three other known ones (i.e. the generalized chi-square, the network and the multinomial GLM approaches) to identify parasite associations due either to parasite interactions or to confounding factors. We applied these four approaches to detect associations within two populations of multi-infected hosts: (1) rodents infected with Bartonella sp., Babesia microti and Anaplasma phagocytophilum and (2) a bovine population infected with Theileria sp. and Babesia sp. We found that the best power is obtained with the screening model and the generalized chi-square test. It was not possible to differentiate between associations due to confounding factors and those due to parasite interactions. The screening approach significantly identified associations between Bartonella doshiae and B. microti, and between T. parva, T. mutans and T. velifera. Thus, the screening approach was relevant to test the overall presence of parasite associations and identify the parasite combinations that are significantly over- or under-represented. Whether the associations are due to real biological interactions or confounding factors should be further investigated. Nevertheless, in the age of genomics and the advent of new technologies, it is a considerable asset to speed up research focusing on the mechanisms driving interactions
Significant association between renal function and amyloid-positive area in renal biopsy specimens in AL amyloidosis

Directory of Open Access Journals (Sweden)

Kuroda Takeshi

2012-09-01

Background: The kidney is a major target organ for systemic amyloidosis, which often causes renal manifestations including proteinuria and elevated serum creatinine (Cr). The correlation between the amount of amyloid deposits and clinical parameters is not known. The aim of this study was to clarify the correlation between the amyloid area in whole renal biopsy specimens and clinical parameters. Methods: Fifty-eight patients with an established diagnosis of AL amyloidosis participated in the study. All patients showed amyloid deposits in renal biopsies. We retrospectively investigated the correlation between clinical data and the amyloid-occupied area in whole renal biopsy specimens. Results: The area occupied by amyloid was less than 10% in 57 of the 58 patients, and was under 2% in 40. For statistical analyses, %amyloid-positive areas were transformed to common logarithmic values (Log10%amyloid). Cr showed a significant correlation with Log10%amyloid, and estimated glomerular filtration rate (eGFR) showed a significant negative correlation. Patient age, creatinine clearance (Ccr), blood urea nitrogen, and urinary protein were not significantly correlated with Log10%amyloid. The correlation with other clinical factors such as sex and serum concentrations of total protein, albumin, immunoglobulins, and complement was evaluated. None of these factors significantly correlated with Log10%amyloid. According to sex- and age-adjusted multiple linear regression analysis, Log10%amyloid had a significant positive association with Cr and a significant negative association with eGFR. Conclusion: There is a significant association between the amyloid-positive area in renal tissue and renal function, especially Cr and eGFR. The levels of Cr and eGFR may be markers of the amount of amyloid in renal tissue.
Statistical significance approximation in local trend analysis of high-throughput time-series data using the theory of Markov chains.

Science.gov (United States)

Xia, Li C; Ai, Dongmei; Cram, Jacob A; Liang, Xiaoyi; Fuhrman, Jed A; Sun, Fengzhu

2015-09-21

Local trend (i.e. shape) analysis of time series data reveals co-changing patterns in dynamics of biological systems. However, slow permutation procedures to evaluate the statistical significance of local trend scores have limited its applications to high-throughput time series data analysis, e.g., data from the next generation sequencing technology based studies. By extending the theories for the tail probability of the range of sum of Markovian random variables, we propose formulae for approximating the statistical significance of local trend scores. Using simulations and real data, we show that the approximate p-value is close to that obtained using a large number of permutations (starting at time points >20 with no delay and >30 with delay of at most three time steps) in that the non-zero decimals of the p-values obtained by the approximation and the permutations are mostly the same when the approximate p-value is less than 0.05. In addition, the approximate p-value is slightly larger than that based on permutations, making hypothesis testing based on the approximate p-value conservative. The approximation enables efficient calculation of p-values for pairwise local trend analysis, making large scale all-versus-all comparisons possible. We also propose a hybrid approach by integrating the approximation and permutations to obtain accurate p-values for significantly associated pairs. We further demonstrate its use with the analysis of the Plymouth Marine Laboratory (PML) microbial community time series from high-throughput sequencing data and find interesting organism co-occurrence dynamic patterns. The software tool is integrated into the eLSA software package that now provides accelerated local trend and similarity analysis pipelines for time series data. The package is freely available from the eLSA website: http://bitbucket.org/charade/elsa.
Indirectional statistics and the significance of an asymmetry discovered by Birch

International Nuclear Information System (INIS)

Kendall, D.G.; Young, G.A.

1984-01-01

Birch (1982, Nature, 298, 451) reported an apparent 'statistical asymmetry of the Universe'. The authors here develop 'indirectional analysis' as a technique for investigating statistical effects of this kind and conclude that the reported effect (whatever may be its origin) is strongly supported by the observations. The estimated pole of the asymmetry is at RA 13h 30m, Dec. -37°. The angular error in its estimation is unlikely to exceed 20-30°. (author)
A PLSPM-Based Test Statistic for Detecting Gene-Gene Co-Association in Genome-Wide Association Study with Case-Control Design

Science.gov (United States)

Zhang, Xiaoshuai; Yang, Xiaowei; Yuan, Zhongshang; Liu, Yanxun; Li, Fangyu; Peng, Bin; Zhu, Dianwen; Zhao, Jinghua; Xue, Fuzhong

2013-01-01

For genome-wide association data analysis, two genes in any pathway, two SNPs in the two linked gene regions respectively or in the two linked exons respectively within one gene are often correlated with each other. We therefore proposed the concept of gene-gene co-association, which refers to the effects not only due to the traditional interaction under nearly independent condition but the correlation between two genes. Furthermore, we constructed a novel statistic for detecting gene-gene co-association based on Partial Least Squares Path Modeling (PLSPM). Through simulation, the relationship between traditional interaction and co-association was highlighted under three different types of co-association. Both simulation and real data analysis demonstrated that the proposed PLSPM-based statistic has better performance than single SNP-based logistic model, PCA-based logistic model, and other gene-based methods. PMID:23620809
Expression of tumor necrosis factor receptor-associated protein 1 and its clinical significance in kidney cancer.

Science.gov (United States)

Si, Tong; Yang, Guosheng; Qiu, Xiaofu; Luo, Youhua; Liu, Baichuan; Wang, Bingwei

2015-01-01

To investigate the expression and clinical significance of TRAP1 (tumor necrosis factor receptor-associated protein 1) in kidney cancer. TRAP1 expression was detected in kidney cancer and normal kidney tissues by qRT-PCR and immunohistochemistry (IHC). Then, the correlation of TRAP1 expression with clinicopathological characteristics and patients' prognosis was evaluated in kidney cancer. IHC results revealed that the high-expression rates of TRAP1 in kidney cancer tissues and normal kidney tissues were 51.3% (41/80) and 23.3% (7/30), respectively, and the difference was statistically significant (P=0.01). Also, the TRAP1 mRNA level in kidney cancer was found by qRT-PCR to be significantly greater than that in normal kidney. In addition, TRAP1 expression in kidney cancer significantly correlated with lymph node metastasis and clinical stage (Pkidney cancer and correlates with patients' prognosis, which may serve as a potential marker for the diagnosis and treatment of kidney cancer.
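The abstract does not say which test produced P = 0.01 for the difference in high-expression rates. As a plausibility check only, the Python sketch below applies a Pearson chi-square test without continuity correction to the reported counts (41 of 80 tumor samples versus 7 of 30 normal samples). The choice of test is an assumption of this sketch, and the function name is hypothetical.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the 2x2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # For one degree of freedom the upper tail probability is erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


if __name__ == "__main__":
    # Rows: kidney cancer, normal kidney. Columns: high expression, not high.
    chi2, p = chi_square_2x2(41, 80 - 41, 7, 30 - 7)
    print(f"chi2 = {chi2:.3f}, p = {p:.4f}")
```

Under this assumed test the p-value comes out slightly below 0.01, which agrees with the reported value after rounding; a different test (for example one with a continuity correction) would give a somewhat different number.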
Statistics associated with an elemental analysis system of particles induced by X-ray emission

International Nuclear Information System (INIS)

Romo K, C.M.

1987-01-01

In quantitative elemental analysis by X-ray techniques, one has to use spectral data that present fluctuations of a statistical nature in both the energy and the number of counts accumulated. Processing the data to obtain a quantitative result requires detailed knowledge of the associated statistical distributions. In this work, 1) the statistics associated with the photon counting of the system and 2) the distribution of the results as a function of the energy are analyzed. The first is important for the definition of the expected values and uncertainties and for spectra simulation (Mukoyama, 1975). The second is fundamental for determining the contribution of each spectral line. (M.R.)
A functional U-statistic method for association analysis of sequencing data.

Science.gov (United States)

Jadhav, Sneha; Tong, Xiaoran; Lu, Qing

2017-11-01

Although sequencing studies hold great promise for uncovering novel variants predisposing to human diseases, the high dimensionality of the sequencing data brings tremendous challenges to data analysis. Moreover, for many complex diseases (e.g., psychiatric disorders) multiple related phenotypes are collected. These phenotypes can be different measurements of an underlying disease, or measurements characterizing multiple related diseases for studying common genetic mechanisms. Although jointly analyzing these phenotypes could potentially increase the power of identifying disease-associated genes, the different types of phenotypes pose challenges for association analysis. To address these challenges, we propose a nonparametric method, the functional U-statistic method (FU), for multivariate analysis of sequencing data. It first constructs smooth functions from individuals' sequencing data, and then tests the association of these functions with multiple phenotypes by using a U-statistic. The method provides a general framework for analyzing various types of phenotypes (e.g., binary and continuous phenotypes) with unknown distributions. Fitting the genetic variants within a gene using a smoothing function also allows us to capture complexities of gene structure (e.g., linkage disequilibrium, LD), which could potentially increase the power of association analysis. Through simulations, we compared our method to the multivariate outcome score test (MOST), and found that our test attained better performance than MOST. In a real data application, we applied our method to the sequencing data from the Minnesota Twin Study (MTS) and found potential associations of several nicotine receptor subunit (CHRN) genes, including CHRNB3, with nicotine dependence and/or alcohol dependence.
Significant association of RNF213 p.R4810K, a moyamoya susceptibility variant, with coronary artery disease.

Science.gov (United States)

Morimoto, Takaaki; Mineharu, Yohei; Ono, Koh; Nakatochi, Masahiro; Ichihara, Sahoko; Kabata, Risako; Takagi, Yasushi; Cao, Yang; Zhao, Lanying; Kobayashi, Hatasu; Harada, Kouji H; Takenaka, Katsunobu; Funaki, Takeshi; Yokota, Mitsuhiro; Matsubara, Tatsuaki; Yamamoto, Ken; Izawa, Hideo; Kimura, Takeshi; Miyamoto, Susumu; Koizumi, Akio

2017-01-01

The genetic architecture of coronary artery disease has not been fully elucidated, especially in Asian countries. Moyamoya disease is a progressive cerebrovascular disease that is reported to be complicated by coronary artery disease. Because most Japanese patients with moyamoya disease carry the p.R4810K variant of the ring finger 213 gene (RNF213), this may also be a risk factor for coronary artery disease; however, this possibility has never been tested. We genotyped the RNF213 p.R4810K variant in 956 coronary artery disease patients and 716 controls and tested the association between p.R4810K and coronary artery disease. We also validated the association in an independent population of 311 coronary artery disease patients and 494 controls. In the replication study, the p.R4810K genotypes were imputed from genome-wide genotyping data based on the 1000 Genomes Project. We used multivariate logistic regression analyses to adjust for well-known risk factors such as dyslipidemia and smoking habits. In the primary study population, the frequency of the minor variant allele was significantly higher in patients with coronary artery disease than in controls (2.04% vs. 0.98%), with an odds ratio of 2.11 (p = 0.017). Under a dominant model, after adjustment for risk factors, the association remained significant, with an odds ratio of 2.90 (95% confidence interval: 1.37-6.61; p = 0.005). In the replication study, the association was significant after adjustment for age and sex (odds ratio = 4.99; 95% confidence interval: 1.16-21.53; p = 0.031), although it did not reach statistical significance when further adjusted for risk factors (odds ratio = 3.82; 95% confidence interval: 0.87-16.77; p = 0.076). The RNF213 p.R4810K variant appears to be significantly associated with coronary artery disease in the Japanese population.
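The unadjusted odds ratio of 2.11 reported for the primary study population can be approximately reproduced from the allele frequencies. The Python sketch below does this arithmetic. The allele counts are back-calculated assumptions, not figures stated in the abstract: it assumes two alleles per genotyped person (1912 patient alleles and 1432 control alleles) and picks the minor-allele counts (39 and 14) that round to the reported 2.04% and 0.98%. The function name is hypothetical.

```python
def allele_odds_ratio(case_minor, case_total, control_minor, control_total):
    """Odds ratio of carrying the minor allele, cases versus controls (allele counts)."""
    case_odds = case_minor / (case_total - case_minor)
    control_odds = control_minor / (control_total - control_minor)
    return case_odds / control_odds


if __name__ == "__main__":
    case_alleles = 2 * 956      # assumed: every patient contributes two alleles
    control_alleles = 2 * 716   # assumed: every control contributes two alleles
    case_minor = 39             # assumed count; 39 / 1912 is about 2.04%
    control_minor = 14          # assumed count; 14 / 1432 is about 0.98%
    odds_ratio = allele_odds_ratio(case_minor, case_alleles, control_minor, control_alleles)
    print(f"allelic odds ratio = {odds_ratio:.2f}")
```

The adjusted odds ratios in the abstract (2.90 and 3.82) come from multivariate logistic regression under a dominant model and cannot be recovered from allele frequencies alone.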
The Association of Academic Health Sciences Libraries Annual Statistics: a thematic history.

Science.gov (United States)

Shedlock, James; Byrd, Gary D

2003-04-01

The Annual Statistics of Medical School Libraries in the United States and Canada (Annual Statistics) is the most recognizable achievement of the Association of Academic Health Sciences Libraries in its history to date. This article gives a thematic history of the Annual Statistics, emphasizing the leadership role of editors and Editorial Boards, the need for cooperation and membership support to produce comparable data useful for everyday management of academic medical center libraries and the use of technology as a tool for data gathering and publication. The Annual Statistics' origin is recalled, and survey features and content are related to the overall themes. The success of the Annual Statistics is evident in the leadership skills of the first editor, Richard Lyders, executive director of the Houston Academy of Medicine-Texas Medical Center Library. The history shows the development of a survey instrument that strives to produce reliable and valid data for a diverse group of libraries while reflecting the many complex changes in the library environment. The future of the Annual Statistics is assured by the anticipated changes facing academic health sciences libraries, namely the need to reflect the transition from a physical environment to an electronic operation.
Rapid Classification and Identification of Multiple Microorganisms with Accurate Statistical Significance via High-Resolution Tandem Mass Spectrometry.

Science.gov (United States)

Alves, Gelio; Wang, Guanghui; Ogurtsov, Aleksey Y; Drake, Steven K; Gucek, Marjan; Sacks, David B; Yu, Yi-Kuo

2018-06-05

Rapid and accurate identification and classification of microorganisms is of paramount importance to public health and safety. With the advance of mass spectrometry (MS) technology, the speed of identification can be greatly improved. However, the increasing number of microbes sequenced is complicating correct microbial identification even in a simple sample due to the large number of candidates present. To properly untwine candidate microbes in samples containing one or more microbes, one needs to go beyond apparent morphology or simple "fingerprinting"; to correctly prioritize the candidate microbes, one needs to have accurate statistical significance in microbial identification. We meet these challenges by using peptide-centric representations of microbes to better separate them and by augmenting our earlier analysis method that yields accurate statistical significance. Here, we present an updated analysis workflow that uses tandem MS (MS/MS) spectra for microbial identification or classification. We have demonstrated, using 226 MS/MS publicly available data files (each containing from 2500 to nearly 100,000 MS/MS spectra) and 4000 additional MS/MS data files, that the updated workflow can correctly identify multiple microbes at the genus and often the species level for samples containing more than one microbe. We have also shown that the proposed workflow computes accurate statistical significances, i.e., E values for identified peptides and unified E values for identified microbes. Our updated analysis workflow MiCId, a freely available software for Microorganism Classification and Identification, is available for download at https://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads.html .
Significant Association of HLA-DQ5 with Autoimmune Hepatitis in Taiwan

Directory of Open Access Journals (Sweden)

Lok-Beng Koay

2007-12-01

Full Text Available Genetic predisposition is known to be an important etiopathogenic factor of autoimmune hepatitis (AIH). HLA antigens associated with AIH have been well studied in Western countries and Japan, but there are no HLA typing data of AIH patients in Taiwan. We therefore investigated HLA phenotypes and their association with AIH patients and compared the results with those of normal subjects and patients with chronic liver disease. Group 1 consisted of 22 AIH patients. All were born in Taiwan with no history of blood transfusion. Group 2 consisted of 19 chronic liver disease patients. Group 3 consisted of 81 unrelated healthy subjects who were normal blood donors. All three groups were tested for HLA phenotypes (HLA-A, B, C, DR, DQ) using the polymerase chain reaction-sequence specific probe method. The statistical method used was Fisher's exact test. We found that HLA-DQ5 was significantly more frequent in the AIH group compared to the control group (RR, 2.03; p = 0.034). Low frequencies of A1 (n = 2/22), B8 (n = 1/22) and DR3 (n = 0/22) were noted compared to results from the West; only HLA-DR4 showed a higher rate in our AIH patients (n = 8/22). This is a preliminary report of our study of HLA antigens in AIH patients. Further investigation to characterize AIH patients into HLA allelic subgroups is being done.
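
The abstract above names Fisher's exact test but does not report the underlying carrier counts. The following is a minimal sketch of a two-sided Fisher's exact test on a 2x2 table, written with the Python standard library only. The group sizes (22 AIH patients, 81 healthy subjects) come from the abstract; the carrier counts are hypothetical placeholders chosen for illustration and are not the study's data, so the resulting p-value should not be compared with the reported one.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x carriers in row 1 with all margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum every table that is no more probable than the observed one.
    return sum(prob(x) for x in range(lowest, highest + 1)
               if prob(x) <= p_observed * (1 + 1e-9))

# HYPOTHETICAL carrier counts (not taken from the study):
aih_carriers, aih_total = 10, 22          # group size 22 is from the abstract
control_carriers, control_total = 18, 81  # group size 81 is from the abstract

p_value = fisher_exact_two_sided(
    aih_carriers, aih_total - aih_carriers,
    control_carriers, control_total - control_carriers,
)
print(f"two-sided Fisher exact p = {p_value:.4f}")
```

The two-sided rule used here (summing all tables at most as probable as the observed one) is one common convention; other software may define the two-sided p-value differently.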

Assessing Statistically Significant Heavy-Metal Concentrations in Abandoned Mine Areas via Hot Spot Analysis of Portable XRF Data.

Science.gov (United States)

Kim, Sung-Min; Choi, Yosoon

2017-06-18

To develop appropriate measures to prevent soil contamination in abandoned mining areas, an understanding of the spatial variation of the potentially toxic trace elements (PTEs) in the soil is necessary. For the purpose of effective soil sampling, this study uses hot spot analysis, which calculates a z-score based on the Getis-Ord Gi* statistic to identify a statistically significant hot spot sample. To constitute a statistically significant hot spot, a feature with a high value should also be surrounded by other features with high values. Using relatively cost- and time-effective portable X-ray fluorescence (PXRF) analysis, sufficient input data are acquired from the Busan abandoned mine and used for hot spot analysis. To calibrate the PXRF data, which have a relatively low accuracy, the PXRF analysis data are transformed using the inductively coupled plasma atomic emission spectrometry (ICP-AES) data. The transformed PXRF data of the Busan abandoned mine are classified into four groups according to their normalized content and z-scores: high content with a high z-score (HH), high content with a low z-score (HL), low content with a high z-score (LH), and low content with a low z-score (LL). The HL and LH cases may be due to measurement errors. Additional or complementary surveys are required for the areas surrounding these suspect samples or for significant hot spot areas. The soil sampling is conducted according to a four-phase procedure in which the hot spot analysis and proposed group classification method are employed to support the development of a sampling plan for the following phase. Overall, 30, 50, 80, and 100 samples are investigated and analyzed in phases 1-4, respectively. The method implemented in this case study may be utilized in the field for the assessment of statistically significant soil contamination and the identification of areas for which an additional survey is required.
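
The hot spot classification described above can be sketched in a few lines. The example below computes the Getis-Ord Gi* z-score with binary distance-band weights (each sample counts itself and every sample within a fixed radius) and then assigns the HH/HL/LH/LL labels. The grid, the concentration values, the radius, the content cutoff (the sample mean) and the z cutoff of 1.96 are all assumptions made for illustration; the abstract does not state the study's weighting scheme or thresholds.

```python
from math import hypot, sqrt

def gi_star_z_scores(points, values, radius):
    """Getis-Ord Gi* z-score per sample, binary weights within `radius` (self included)."""
    n = len(values)
    mean = sum(values) / n
    std = sqrt(max(0.0, sum(v * v for v in values) / n - mean * mean))
    scores = []
    for xi, yi in points:
        weights = [1.0 if hypot(xi - xj, yi - yj) <= radius else 0.0
                   for xj, yj in points]
        w_sum = sum(weights)
        w_sq_sum = sum(w * w for w in weights)
        numerator = sum(w * v for w, v in zip(weights, values)) - mean * w_sum
        denominator = std * sqrt((n * w_sq_sum - w_sum ** 2) / (n - 1))
        # Degenerate cases (no variance, or every sample is a neighbour) get z = 0.
        scores.append(numerator / denominator if denominator > 0 else 0.0)
    return scores

def classify(value, z, value_cutoff, z_cutoff=1.96):
    """Two-letter label: first letter is content (H/L), second is z-score (H/L)."""
    return ("H" if value > value_cutoff else "L") + ("H" if z > z_cutoff else "L")

# HYPOTHETICAL 4 x 4 sampling grid: a cluster of high readings in one corner
# and a single isolated high reading in the opposite corner.
points = [(i, j) for i in range(4) for j in range(4)]
values = [10.0] * 16
for idx in (0, 1, 4, 5, 15):
    values[idx] = 100.0

z_scores = gi_star_z_scores(points, values, radius=1.0)
cutoff = sum(values) / len(values)  # assumed content cutoff: the sample mean
labels = [classify(v, z, cutoff) for v, z in zip(values, z_scores)]
for point, value, z, label in zip(points, values, z_scores, labels):
    print(point, value, round(z, 2), label)
```

In this toy grid the clustered high readings come out as hot spots, while the isolated high reading has a high content but a low z-score, which is the kind of sample the abstract flags as a possible measurement error.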
Assessing Statistically Significant Heavy-Metal Concentrations in Abandoned Mine Areas via Hot Spot Analysis of Portable XRF Data

Directory of Open Access Journals (Sweden)

Sung-Min Kim

2017-06-01

Full Text Available To develop appropriate measures to prevent soil contamination in abandoned mining areas, an understanding of the spatial variation of the potentially toxic trace elements (PTEs) in the soil is necessary. For the purpose of effective soil sampling, this study uses hot spot analysis, which calculates a z-score based on the Getis-Ord Gi* statistic to identify a statistically significant hot spot sample. To constitute a statistically significant hot spot, a feature with a high value should also be surrounded by other features with high values. Using relatively cost- and time-effective portable X-ray fluorescence (PXRF) analysis, sufficient input data are acquired from the Busan abandoned mine and used for hot spot analysis. To calibrate the PXRF data, which have a relatively low accuracy, the PXRF analysis data are transformed using the inductively coupled plasma atomic emission spectrometry (ICP-AES) data. The transformed PXRF data of the Busan abandoned mine are classified into four groups according to their normalized content and z-scores: high content with a high z-score (HH), high content with a low z-score (HL), low content with a high z-score (LH), and low content with a low z-score (LL). The HL and LH cases may be due to measurement errors. Additional or complementary surveys are required for the areas surrounding these suspect samples or for significant hot spot areas. The soil sampling is conducted according to a four-phase procedure in which the hot spot analysis and proposed group classification method are employed to support the development of a sampling plan for the following phase. Overall, 30, 50, 80, and 100 samples are investigated and analyzed in phases 1–4, respectively. The method implemented in this case study may be utilized in the field for the assessment of statistically significant soil contamination and the identification of areas for which an additional survey is required.
Insertion/Deletion Within the KDM6A Gene Is Significantly Associated With Litter Size in Goat

Science.gov (United States)

Cui, Yang; Yan, Hailong; Wang, Ke; Xu, Han; Zhang, Xuelian; Zhu, Haijing; Liu, Jinwang; Qu, Lei; Lan, Xianyong; Pan, Chuanying

2018-01-01

A previous whole-genome association analysis identified lysine demethylase 6A (KDM6A), which encodes a type of histone demethylase, as a candidate gene associated with goat fecundity. KDM6A gene knockout in mice disrupts gametophyte development, suggesting that it has a critical role in reproduction. In this study, goat KDM6A mRNA expression profiles were determined, insertion/deletion (indel) variants in the gene identified, the effect of indel variants on KDM6A gene expression assessed, and their association with first-born litter size analyzed in 2326 healthy female Shaanbei white cashmere goats. KDM6A mRNA was expressed in all tissues tested (heart, liver, spleen, lung, kidney, muscle, brain, skin and testis); the expression levels in testes at different developmental stages [1-week-old (wk), 2, 3 wk, 1-month-old (mo), 1.5 and 2 mo] indicated a potential association with the mitosis-to-meiosis transition, implying that KDM6A may have an essential role in goat fertility. Meanwhile, two novel intronic indels of 16 bp and 5 bp were identified. Statistical analysis revealed that only the 16 bp indel was associated with first-born litter size (P < 0.01), and the average first-born litter size of individuals with an insertion/insertion genotype was higher than that of those with the deletion/deletion genotype (P < 0.05). There was also a significant difference in genotype distributions of the 16 bp indel between mothers of single-lamb and multi-lamb litters in the studied goat population (P = 0.001). Consistently, the 16 bp indel also had a significant effect on KDM6A gene expression. Additionally, there was no significant linkage disequilibrium (LD) between these two indel loci, consistent with the association analysis results. Together, these findings suggest that the 16 bp indel in KDM6A may be useful for marker-assisted selection (MAS) of goats. PMID:29616081
Association testing for next-generation sequencing data using score statistics

DEFF Research Database (Denmark)

Skotte, Line; Korneliussen, Thorfinn Sand; Albrechtsen, Anders

2012-01-01

computationally feasible due to the use of score statistics. As part of the joint likelihood, we model the distribution of the phenotypes using a generalized linear model framework, which works for both quantitative and discrete phenotypes. Thus, the method presented here is applicable to case-control studies...... of genotype calls into account have been proposed; most require numerical optimization which for large-scale data is not always computationally feasible. We show that using a score statistic for the joint likelihood of observed phenotypes and observed sequencing data provides an attractive approach...... to association testing for next-generation sequencing data. The joint model accounts for the genotype classification uncertainty via the posterior probabilities of the genotypes given the observed sequencing data, which gives the approach higher power than methods based on called genotypes. This strategy remains...
Breast-cancer-associated metastasis is significantly increased in a model of autoimmune arthritis.

Science.gov (United States)

Das Roy, Lopamudra; Pathangey, Latha B; Tinder, Teresa L; Schettini, Jorge L; Gruber, Helen E; Mukherjee, Pinku

2009-01-01

Sites of chronic inflammation are often associated with the establishment and growth of various malignancies including breast cancer. A common inflammatory condition in humans is autoimmune arthritis (AA) that causes inflammation and deformity of the joints. Other systemic effects associated with arthritis include increased cellular infiltration and inflammation of the lungs. Several studies have reported statistically significant risk ratios between AA and breast cancer. Despite this knowledge, available for a decade, it has never been questioned if the site of chronic inflammation linked to AA creates a milieu that attracts tumor cells to home and grow in the inflamed bones and lungs which are frequent sites of breast cancer metastasis. To determine if chronic inflammation induced by autoimmune arthritis contributes to increased breast cancer-associated metastasis, we generated mammary gland tumors in SKG mice that were genetically prone to develop AA. Two breast cancer cell lines, one highly metastatic (4T1) and the other non-metastatic (TUBO) were used to generate the tumors in the mammary fat pad. Lung and bone metastasis and the associated inflammatory milieu were evaluated in the arthritic versus the non-arthritic mice. We report a three-fold increase in lung metastasis and a significant increase in the incidence of bone metastasis in the pro-arthritic and arthritic mice compared to non-arthritic control mice. We also report that the metastatic breast cancer cells augment the severity of arthritis resulting in a vicious cycle that increases both bone destruction and metastasis. Enhanced neutrophilic and granulocytic infiltration in lungs and bone of the pro-arthritic and arthritic mice and subsequent increase in circulating levels of proinflammatory cytokines, such as macrophage colony stimulating factor (M-CSF), interleukin-17 (IL-17), interleukin-6 (IL-6), vascular endothelial growth factor (VEGF), and tumor necrosis factor-alpha (TNF-alpha) may contribute
Breast cancer-associated metastasis is significantly increased in a model of autoimmune arthritis

Science.gov (United States)

Das Roy, Lopamudra; Pathangey, Latha B; Tinder, Teresa L; Schettini, Jorge L; Gruber, Helen E; Mukherjee, Pinku

2009-01-01

Introduction Sites of chronic inflammation are often associated with the establishment and growth of various malignancies including breast cancer. A common inflammatory condition in humans is autoimmune arthritis (AA) that causes inflammation and deformity of the joints. Other systemic effects associated with arthritis include increased cellular infiltration and inflammation of the lungs. Several studies have reported statistically significant risk ratios between AA and breast cancer. Despite this knowledge, available for a decade, it has never been questioned if the site of chronic inflammation linked to AA creates a milieu that attracts tumor cells to home and grow in the inflamed bones and lungs which are frequent sites of breast cancer metastasis. Methods To determine if chronic inflammation induced by autoimmune arthritis contributes to increased breast cancer-associated metastasis, we generated mammary gland tumors in SKG mice that were genetically prone to develop AA. Two breast cancer cell lines, one highly metastatic (4T1) and the other non-metastatic (TUBO) were used to generate the tumors in the mammary fat pad. Lung and bone metastasis and the associated inflammatory milieu were evaluated in the arthritic versus the non-arthritic mice. Results We report a three-fold increase in lung metastasis and a significant increase in the incidence of bone metastasis in the pro-arthritic and arthritic mice compared to non-arthritic control mice. We also report that the metastatic breast cancer cells augment the severity of arthritis resulting in a vicious cycle that increases both bone destruction and metastasis. Enhanced neutrophilic and granulocytic infiltration in lungs and bone of the pro-arthritic and arthritic mice and subsequent increase in circulating levels of proinflammatory cytokines, such as macrophage colony stimulating factor (M-CSF), interleukin-17 (IL-17), interleukin-6 (IL-6), vascular endothelial growth factor (VEGF), and tumor necrosis factor
Statistical analysis and data management

International Nuclear Information System (INIS)

Anon.

1981-01-01

This report provides an overview of the history of the WIPP Biology Program. The recommendations of the American Institute of Biological Sciences (AIBS) for the WIPP biology program are summarized. The data sets available for statistical analyses and problems associated with these data sets are also summarized. Biological studies base maps are presented. A statistical model is presented to evaluate any correlation between climatological data and small mammal captures. No statistically significant relationship was found between variance in small mammal captures on Dr. Gennaro's 90m x 90m grid and precipitation records from the Duval Potash Mine.
Statistically significant faunal differences among Middle Ordovician age, Chickamauga Group bryozoan bioherms, central Alabama

Energy Technology Data Exchange (ETDEWEB)

Crow, C.J.

1985-01-01

Middle Ordovician age Chickamauga Group carbonates crop out along the Birmingham and Murphrees Valley anticlines in central Alabama. The macrofossil contents on exposed surfaces of seven bioherms have been counted to determine their various paleontologic characteristics. Twelve groups of organisms are present in these bioherms. Dominant organisms include bryozoans, algae, brachiopods, sponges, pelmatozoans, stromatoporoids and corals. Minor accessory fauna include predators, scavengers and grazers such as gastropods, ostracods, trilobites, cephalopods and pelecypods. Vertical and horizontal niche zonation has been detected for some of the bioherm dwelling fauna. No one bioherm of those studied exhibits all 12 groups of organisms; rather, individual bioherms display various subsets of the total diversity. Statistical treatment (G-test) of the diversity data indicates a lack of statistical homogeneity of the bioherms, both within and between localities. Between-locality population heterogeneity can be ascribed to differences in biologic responses to such gross environmental factors as water depth and clarity, and energy levels. At any one locality, gross aspects of the paleoenvironments are assumed to have been more uniform. Significant differences among bioherms at any one locality may have resulted from patchy distribution of species populations, differential preservation and other factors.
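
The G-test of homogeneity mentioned above compares observed counts with the counts expected if every bioherm shared the same faunal composition. A minimal sketch follows; the count table is hypothetical (the abstract reports no counts), and only the statistic and its degrees of freedom are computed, since the Python standard library has no chi-square distribution function.

```python
from math import log

def g_test(table):
    """G statistic and degrees of freedom for an r x c table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    g = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            if observed == 0:
                continue  # a zero cell contributes nothing to the sum
            expected = row_totals[i] * col_totals[j] / grand_total
            g += observed * log(observed / expected)
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return 2.0 * g, dof

# HYPOTHETICAL counts: rows are bioherms, columns are organism groups
# (for example bryozoans, algae, brachiopods). Not data from the study.
counts = [
    [40, 12, 8],
    [15, 30, 5],
    [22, 18, 20],
]
g_value, dof = g_test(counts)
print(f"G = {g_value:.2f} on {dof} degrees of freedom")
```

The statistic would then be compared with a chi-square distribution on the stated degrees of freedom; a large G relative to that reference suggests the bioherms are not homogeneous.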
Detecting Statistically Significant Communities of Triangle Motifs in Undirected Networks

Science.gov (United States)

2016-04-26

Systems, Statistics & Management Science, University of Alabama, USA. DISTRIBUTION A: Distribution approved for public release. Contents: Summary ... Application to Real Networks; 2012 FBS Football Schedule Network ... football schedule network ... Stem plot of degree-ordered vertices versus the degree for college football
Socioeconomic Status Is Significantly Associated with the Dietary Intakes of Folate and Depression Scales in Japanese Workers (J-HOPE Study)

Directory of Open Access Journals (Sweden)

Takuro Shimbo

2013-02-01

Full Text Available The association of socioeconomic status (SES) with nutrient intake attracts public attention worldwide. In the current study, we examined the associations of SES with dietary intake of folate and health outcomes in general Japanese workers. This Japanese occupational cohort consisted of 2266 workers. SES was assessed by a self-administered questionnaire. Intakes of all nutrients were assessed with a validated, brief and self-administered diet history questionnaire (BDHQ). The degree of depressive symptoms was measured by the validated Japanese version of the K6 scale. Multiple linear regression and stratified analysis were used to evaluate the associations of intake with the confounding factors. Path analysis was conducted to describe the impacts of intake on health outcomes. Education levels and household incomes were significantly associated with intake of folate and depression scales (p < 0.05). After adjusting for age, sex and total energy intake, years of education significantly affected the folate intake (β = 0.117, p < 0.001). The structural equation model (SEM) shows that the indirect effect of folate intake is statistically significant and strong (p < 0.05, 56% of direct effect) in the pathway of education level to depression scale. Our study shows both education and income are significantly associated with depression scales in Japanese workers, and the effort to increase the folate intake may alleviate the harms of social disparities on mental health.
Statistical significant changes in ground thermal conditions of alpine Austria during the last decade

Science.gov (United States)

Kellerer-Pirklbauer, Andreas

2016-04-01

Longer data series (e.g. >10 a) of ground temperatures in alpine regions are helpful to improve the understanding regarding the effects of present climate change on distribution and thermal characteristics of seasonal frost- and permafrost-affected areas. Beginning in 2004 - and more intensively since 2006 - a permafrost and seasonal frost monitoring network was established in Central and Eastern Austria by the University of Graz. This network consists of c.60 ground temperature (surface and near-surface) monitoring sites which are located at 1922-3002 m a.s.l., at latitude 46°55'-47°22'N and at longitude 12°44'-14°41'E. These data allow conclusions about general ground thermal conditions, potential permafrost occurrence, trend during the observation period, and regional pattern of changes. Calculations and analyses of several different temperature-related parameters were accomplished. At an annual scale a region-wide statistical significant warming during the observation period was revealed by e.g. an increase in mean annual temperature values (mean, maximum) or the significant lowering of the surface frost number (F+). At a seasonal scale no significant trend of any temperature-related parameter was in most cases revealed for spring (MAM) and autumn (SON). Winter (DJF) shows only a weak warming. In contrast, the summer (JJA) season reveals in general a significant warming as confirmed by several different temperature-related parameters such as e.g. mean seasonal temperature, number of thawing degree days, number of freezing degree days, or days without night frost. On a monthly basis August shows the statistically most robust and strongest warming of all months, although regional differences occur. Despite the fact that the general ground temperature warming during the last decade is confirmed by the field data in the study region, complications in trend analyses arise by temperature anomalies (e.g. warm winter 2006/07) or substantial variations in the winter
Male infertility is significantly associated with multiple deletions in an 8.7-kb segment of sperm mtDNA in Pakistan.

Science.gov (United States)

Mughal, Irfan Afzal; Irfan, Asma; Jahan, Sarwat; Hameed, Abdul

2017-06-12

This study aimed to find a link between sperm mitochondrial DNA mutations and male infertility in Pakistan. DNA from semen samples was extracted and amplified by PCR using 8.7-kb deletion-specific primers. The PCR products were separated on agarose gel, visualized under UV-illumination, and then photographed. The results were genotyped and the data were analyzed using SPSS. Deletion analysis of the 8.7-kb fragment by long PCR revealed multiple deletions. The frequency of deletion was much higher in infertile groups as compared to the control group. Further, on comparison between different subtypes of infertile groups, the deletions were highest in the oligoasthenoteratozoospermia (OAT) group. The statistical analysis of case and control groups showed a significant association of the 8.7-kb deletion with human male infertile groups (P = 0.031), and particularly a very significant association with the OAT subgroup (P = 0.019). A significant association has been found between human male infertility and mtDNA deletions in an 8.7-kb segment of sperm mtDNA in a Pakistani population.
Conceptual and statistical problems associated with the use of diversity indices in ecology.

Science.gov (United States)

Barrantes, Gilbert; Sandoval, Luis

2009-09-01

Diversity indices, particularly the Shannon-Wiener index, have extensively been used in analyzing patterns of diversity at different geographic and ecological scales. These indices have serious conceptual and statistical problems which make comparisons of species richness or species abundances across communities nearly impossible. There is often no single statistical method that retains all information needed to answer even a simple question. However, multivariate analyses could be used instead of diversity indices, such as cluster analyses or multiple regressions. More complex multivariate analyses, such as Canonical Correspondence Analysis, provide very valuable information on environmental variables associated with the presence and abundance of the species in a community. In addition, particular hypotheses associated with changes in species richness across localities, or change in abundance of one, or a group of species can be tested using univariate, bivariate, and/or rarefaction statistical tests. The rarefaction method has proved to be robust to standardize all samples to a common size. Even the simplest method as reporting the number of species per taxonomic category possibly provides more information than a diversity index value.
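
To make the contrast above concrete, the sketch below computes the Shannon-Wiener index alongside individual-based rarefaction, which standardizes samples to a common size. The rarefaction formula used is the standard hypergeometric expectation of species richness in a random subsample; the two abundance vectors are hypothetical and only illustrate that communities of very different sample size can be compared at the same number of individuals.

```python
from math import comb, log

def shannon_index(abundances):
    """Shannon-Wiener index H' = -sum(p_i * ln p_i) over species with count > 0."""
    total = sum(abundances)
    return -sum((n / total) * log(n / total) for n in abundances if n > 0)

def rarefied_richness(abundances, sample_size):
    """Expected number of species in a random subsample of `sample_size` individuals."""
    total = sum(abundances)
    if sample_size > total:
        raise ValueError("sample_size cannot exceed the number of individuals")
    all_subsamples = comb(total, sample_size)
    # For each species: 1 - P(no individual of that species is drawn).
    return sum(1 - comb(total - n, sample_size) / all_subsamples
               for n in abundances if n > 0)

# HYPOTHETICAL abundance counts per species for two communities.
community_a = [50, 30, 10, 5, 3, 1, 1]  # 100 individuals, 7 species
community_b = [12, 10, 9, 9]            # 40 individuals, 4 species

common_size = min(sum(community_a), sum(community_b))
for name, community in (("A", community_a), ("B", community_b)):
    print(name,
          "H' =", round(shannon_index(community), 3),
          "expected species at n =", common_size, ":",
          round(rarefied_richness(community, common_size), 2))
```

Rarefying to the smaller sample's size keeps the comparison about richness at equal sampling effort, which a single index value does not guarantee.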
Contributions to statistics

CERN Document Server

Mahalanobis, P C

1965-01-01

Contributions to Statistics focuses on the processes, methodologies, and approaches involved in statistics. The book is presented to Professor P. C. Mahalanobis on the occasion of his 70th birthday. The selection first offers information on the recovery of ancillary information and combinatorial properties of partially balanced designs and association schemes. Discussions focus on combinatorial applications of the algebra of association matrices, sample size analogy, association matrices and the algebra of association schemes, and conceptual statistical experiments. The book then examines latt
The Leu72Met polymorphism of the ghrelin gene is significantly associated with binge eating disorder.

Science.gov (United States)

Monteleone, Palmiero; Tortorella, Alfonso; Castaldo, Eloisa; Di Filippo, Carmela; Maj, Mario

2007-02-01

The pathophysiological mechanisms underlying binge eating disorder are poorly understood. Evidence exists for the fact that abnormalities in peptides involved in the regulation of appetite, including ghrelin, may play a role in binge eating behavior. Genes involved in the ghrelin physiology may therefore contribute to the biological vulnerability to binge eating disorder. We examined whether two polymorphisms of the ghrelin gene, the G152A (Arg51Gln) and C214A (Leu72Met), were associated with binge eating disorder. Ninety obese or nonobese women with binge eating disorder and 119 normal weight women were genotyped at the ghrelin gene. Statistical analyses showed that the Leu72Met ghrelin gene variant was significantly more frequent in binge eating disorder patients (chi2=5.940; d.f.=1, P=0.01) and was associated with a moderate, but significant risk to develop binge eating disorder (odds ratio=2.725, 95% confidence interval: 1.168-6.350). Although these data should be regarded as preliminary because of the small sample size, they suggest that the Leu72Met ghrelin gene variant may contribute to the genetic susceptibility to binge eating disorder.
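
The odds ratio and confidence interval reported above can be checked for internal consistency without the raw genotype counts. The sketch below assumes the interval is a Wald interval computed on the log-odds scale (an assumption; the abstract does not say how it was derived), recovers the standard error of the log odds ratio from the interval width, and derives the corresponding z statistic and two-sided p-value.

```python
from math import erfc, log, sqrt
from statistics import NormalDist

# Values reported in the abstract.
odds_ratio = 2.725
ci_lower, ci_upper = 1.168, 6.350

# ASSUMPTION: a 95% Wald interval, exp(ln(OR) +/- z_crit * SE).
z_crit = NormalDist().inv_cdf(0.975)
se_log_or = (log(ci_upper) - log(ci_lower)) / (2 * z_crit)

# Under that assumption the point estimate is the geometric mean of the bounds.
implied_or = sqrt(ci_lower * ci_upper)

z_stat = log(odds_ratio) / se_log_or
p_two_sided = erfc(z_stat / sqrt(2))  # equals 2 * (1 - Phi(z)) for z >= 0

print(f"SE of ln(OR) = {se_log_or:.3f}")
print(f"OR implied by the interval = {implied_or:.3f}")
print(f"z = {z_stat:.2f}, two-sided p = {p_two_sided:.3f}")
```

Because the interval excludes 1, the Wald p-value falls below 0.05, in line with the significance reported for the chi-square test; the two tests are not identical, so exact agreement between their p-values is not expected.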
Conducting tests for statistically significant differences using forest inventory data

Science.gov (United States)

James A. Westfall; Scott A. Pugh; John W. Coulston

2013-01-01

Many forest inventory and monitoring programs are based on a sample of ground plots from which estimates of forest resources are derived. In addition to evaluating metrics such as number of trees or amount of cubic wood volume, it is often desirable to make comparisons between resource attributes. To properly conduct statistical tests for differences, it is imperative...
Comparison of statistical tests for association between rare variants and binary traits.

OpenAIRE

Bacanu, SA; Nelson, MR; Whittaker, JC

2012-01-01

Genome-wide association studies have found thousands of common genetic variants associated with a wide variety of diseases and other complex traits. However, a large portion of the predicted genetic contribution to many traits remains unknown. One plausible explanation is that some of the missing variation is due to the effects of rare variants. Nonetheless, the statistical analysis of rare variants is challenging. A commonly used method is to contrast, within the same region (gene), the fr...
Insertion/Deletion Within the KDM6A Gene Is Significantly Associated With Litter Size in Goat

Directory of Open Access Journals (Sweden)

Yang Cui

2018-03-01

Full Text Available A previous whole-genome association analysis identified lysine demethylase 6A (KDM6A), which encodes a type of histone demethylase, as a candidate gene associated with goat fecundity. KDM6A gene knockout in mice disrupts gametophyte development, suggesting that it has a critical role in reproduction. In this study, goat KDM6A mRNA expression profiles were determined, insertion/deletion (indel) variants in the gene identified, the effect of indel variants on KDM6A gene expression assessed, and their association with first-born litter size analyzed in 2326 healthy female Shaanbei white cashmere goats. KDM6A mRNA was expressed in all tissues tested (heart, liver, spleen, lung, kidney, muscle, brain, skin and testis); the expression levels in testes at different developmental stages [1-week-old (wk), 2, 3 wk, 1-month-old (mo), 1.5 and 2 mo] indicated a potential association with the mitosis-to-meiosis transition, implying that KDM6A may have an essential role in goat fertility. Meanwhile, two novel intronic indels of 16 bp and 5 bp were identified. Statistical analysis revealed that only the 16 bp indel was associated with first-born litter size (P < 0.01), and the average first-born litter size of individuals with an insertion/insertion genotype was higher than that of those with the deletion/deletion genotype (P < 0.05). There was also a significant difference in genotype distributions of the 16 bp indel between mothers of single-lamb and multi-lamb litters in the studied goat population (P = 0.001). Consistently, the 16 bp indel also had a significant effect on KDM6A gene expression. Additionally, there was no significant linkage disequilibrium (LD) between these two indel loci, consistent with the association analysis results. Together, these findings suggest that the 16 bp indel in KDM6A may be useful for marker-assisted selection (MAS) of goats.
Combining Multiple Hypothesis Testing with Machine Learning Increases the Statistical Power of Genome-wide Association Studies

Science.gov (United States)

Mieth, Bettina; Kloft, Marius; Rodríguez, Juan Antonio; Sonnenburg, Sören; Vobruba, Robin; Morcillo-Suárez, Carlos; Farré, Xavier; Marigorta, Urko M.; Fehr, Ernst; Dickhaus, Thorsten; Blanchard, Gilles; Schunk, Daniel; Navarro, Arcadi; Müller, Klaus-Robert

2016-01-01

The standard approach to the analysis of genome-wide association studies (GWAS) is based on testing each position in the genome individually for statistical significance of its association with the phenotype under investigation. To improve the analysis of GWAS, we propose a combination of machine learning and statistical testing that takes correlation structures within the set of SNPs under investigation in a mathematically well-controlled manner into account. The novel two-step algorithm, COMBI, first trains a support vector machine to determine a subset of candidate SNPs and then performs hypothesis tests for these SNPs together with an adequate threshold correction. Applying COMBI to data from a WTCCC study (2007) and measuring performance as replication by independent GWAS published within the 2008–2015 period, we show that our method outperforms ordinary raw p-value thresholding as well as other state-of-the-art methods. COMBI presents higher power and precision than the examined alternatives while yielding fewer false (i.e. non-replicated) and more true (i.e. replicated) discoveries when its results are validated on later GWAS studies. More than 80% of the discoveries made by COMBI upon WTCCC data have been validated by independent studies. Implementations of the COMBI method are available as a part of the GWASpi toolbox 2.0. PMID:27892471
Relationships between Association of Research Libraries (ARL) Statistics and Bibliometric Indicators: A Principal Components Analysis

Science.gov (United States)

Hendrix, Dean

2010-01-01

This study analyzed 2005-2006 Web of Science bibliometric data from institutions belonging to the Association of Research Libraries (ARL) and corresponding ARL statistics to find any associations between indicators from the two data sets. Principal components analysis on 36 variables from 103 universities revealed obvious associations between…

Development of free statistical software enabling researchers to calculate confidence levels, clinical significance curves and risk-benefit contours

International Nuclear Information System (INIS)

Shakespeare, T.P.; Mukherjee, R.K.; Gebski, V.J.

2003-01-01

Confidence levels, clinical significance curves, and risk-benefit contours are tools improving analysis of clinical studies and minimizing misinterpretation of published results; however, no software has been available for their calculation. The objective was to develop software to help clinicians utilize these tools. Excel 2000 spreadsheets were designed using only built-in functions, without macros. The workbook was protected and encrypted so that users can modify only input cells. The workbook has 4 spreadsheets for use in studies comparing two patient groups. Sheet 1 comprises instructions and graphic examples for use. Sheet 2 allows the user to input the main study results (e.g. survival rates) into a 2-by-2 table. Confidence intervals (95%), p-value and the confidence level for Treatment A being better than Treatment B are automatically generated. An additional input cell allows the user to determine the confidence associated with a specified level of benefit. For example, if the user wishes to know the confidence that Treatment A is at least 10% better than B, 10% is entered. Sheet 2 automatically displays clinical significance curves, graphically illustrating confidence levels for all possible benefits of one treatment over the other. Sheet 3 allows input of toxicity data, and calculates the confidence that one treatment is more toxic than the other. It also determines the confidence that the relative toxicity of the most effective arm does not exceed user-defined tolerability. Sheet 4 automatically calculates risk-benefit contours, displaying the confidence associated with a specified scenario of minimum benefit and maximum risk of one treatment arm over the other. The spreadsheet is freely downloadable at www.ontumor.com/professional/statistics.htm. A simple, self-explanatory, freely available spreadsheet calculator was developed using Excel 2000. The incorporated decision-making tools can be used for data analysis and improve the reporting of results of any
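
The "confidence that Treatment A is at least X% better than B" described above can be illustrated with a short calculation. The sketch below assumes a normal approximation to the difference between two independent proportions; the abstract does not state which formula the spreadsheet uses, so this is one plausible reading rather than a reproduction of it. The survival counts are hypothetical, and the function name `confidence_of_benefit` is introduced here for illustration.

```python
from math import sqrt
from statistics import NormalDist

def confidence_of_benefit(events_a, n_a, events_b, n_b, min_benefit=0.0):
    """Confidence that rate A exceeds rate B by at least `min_benefit`.

    ASSUMPTION: normal approximation to the difference of two independent
    proportions; requires at least one rate strictly between 0 and 1.
    """
    p_a, p_b = events_a / n_a, events_b / n_b
    se = sqrt(p_a * (1 - p_a) / n_a + p_b * (1 - p_b) / n_b)
    return NormalDist().cdf((p_a - p_b - min_benefit) / se)

# HYPOTHETICAL 2-by-2 table: survivors out of patients treated in each arm.
survivors_a, patients_a = 60, 100
survivors_b, patients_b = 40, 100

# One point on the curve: confidence that A is at least 10 points better than B.
at_least_10 = confidence_of_benefit(survivors_a, patients_a,
                                    survivors_b, patients_b, min_benefit=0.10)
print(f"confidence A is >= 10% better: {at_least_10:.3f}")

# A clinical significance curve: the same confidence over a range of benefits.
curve = [(b / 100, confidence_of_benefit(survivors_a, patients_a,
                                         survivors_b, patients_b, b / 100))
         for b in range(-10, 41, 5)]
for benefit, confidence in curve:
    print(f"benefit >= {benefit:+.2f}: confidence {confidence:.3f}")
```

Plotting `curve` gives the shape the abstract calls a clinical significance curve: confidence is close to 1 for small or negative benefits and falls toward 0 as the required benefit grows.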
Genome-wide association study identifies TF as a significant modifier gene of iron metabolism in HFE hemochromatosis.

Science.gov (United States)

de Tayrac, Marie; Roth, Marie-Paule; Jouanolle, Anne-Marie; Coppin, Hélène; le Gac, Gérald; Piperno, Alberto; Férec, Claude; Pelucchi, Sara; Scotet, Virginie; Bardou-Jacquet, Edouard; Ropert, Martine; Bouvet, Régis; Génin, Emmanuelle; Mosser, Jean; Deugnier, Yves

2015-03-01

Hereditary hemochromatosis (HH) is the most common form of genetic iron loading disease. It is mainly related to the homozygous C282Y/C282Y mutation in the HFE gene that is, however, a necessary but not a sufficient condition to develop clinical and even biochemical HH. This suggests that modifier genes are likely involved in the expressivity of the disease. Our aim was to identify such modifier genes. We performed a genome-wide association study (GWAS) using DNA collected from 474 unrelated C282Y homozygotes. Associations were examined for both quantitative iron burden indices and clinical outcomes with 534,213 single nucleotide polymorphisms (SNP) genotypes, with replication analyses in an independent sample of 748 C282Y homozygotes from four different European centres. One SNP met genome-wide statistical significance for association with transferrin concentration (rs3811647, GWAS p value of 7×10^-9 and replication p value of 5×10^-13). This SNP, located within intron 11 of the TF gene, had a pleiotropic effect on serum iron (GWAS p value of 4.9×10^-6 and replication p value of 3.2×10^-6). Both serum transferrin and iron levels were associated with serum ferritin levels, amount of iron removed and global clinical stage (pHFE-associated HH (HFE-HH) patients, identified the rs3811647 polymorphism in the TF gene as the only SNP significantly associated with iron metabolism through serum transferrin and iron levels. Because these two outcomes were clearly associated with the biochemical and clinical expression of the disease, an indirect link between the rs3811647 polymorphism and the phenotypic presentation of HFE-HH is likely. Copyright © 2014 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.
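For orientation, the notion of genome-wide significance in this record can be sketched with a Bonferroni correction over the number of SNPs tested. The abstract does not say which threshold the authors applied, so the 0.05 family-wise error rate and the Bonferroni rule below are assumptions of this example; only the SNP count and the p values come from the abstract.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests


N_SNPS = 534_213  # number of SNP genotypes reported in the abstract
threshold = bonferroni_threshold(0.05, N_SNPS)  # 0.05 is an assumed level

# GWAS-stage p values reported for rs3811647.
reported = {
    "transferrin concentration": 7e-9,
    "serum iron": 4.9e-6,
}
for trait, p_value in reported.items():
    verdict = "below" if p_value < threshold else "above"
    print(f"{trait}: p = {p_value:g} is {verdict} the threshold {threshold:.2e}")
```

Under these assumptions only the transferrin association falls below the per-test threshold, which is consistent with the abstract describing one genome-wide significant SNP and a separate pleiotropic effect on serum iron.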
Communicating Hydrocephalus Associated with Small- to Medium-Sized Vestibular Schwannomas: Clinical Significance of the Tumor Apparent Diffusion Coefficient Map.

Science.gov (United States)

Taniguchi, Masaaki; Nakai, Tomoaki; Kohta, Masaaki; Kimura, Hidehito; Kohmura, Eiji

2016-10-01

The etiology of hydrocephalus associated with the small- to medium-sized vestibular schwannomas is still controversial. We investigated tumor-specific factors related to the association of hydrocephalus with small- to medium-sized vestibular schwannomas. Among the 77 patients with vestibular schwannoma smaller than 30 mm, 9 patients demonstrated associated communicating hydrocephalus. Patient medical records, radiologic data, and histopathologic specimens were reviewed retrospectively. The age of the patients, and size, mean apparent diffusion coefficient (ADC) value, and histologic features of the tumors were compared with those of patients without hydrocephalus. The symptoms related to hydrocephalus improved in all patients after tumor removal. Both the mean size and ADC values exhibited a statistically significant difference between the tumors with and without hydrocephalus (P hydrocephalus. The increased tumor ADC value was considered to be the result of degenerative change and suggested the involvement of protein sloughing in the etiology of the associated hydrocephalus. Copyright © 2016 Elsevier Inc. All rights reserved.
Statistical Reporting Errors and Collaboration on Statistical Analyses in Psychological Science.

Science.gov (United States)

Veldkamp, Coosje L S; Nuijten, Michèle B; Dominguez-Alvarez, Linda; van Assen, Marcel A L M; Wicherts, Jelte M

2014-01-01

Statistical analysis is error prone. A best practice for researchers using statistics would therefore be to share data among co-authors, allowing double-checking of executed tasks just as co-pilots do in aviation. To document the extent to which this 'co-piloting' currently occurs in psychology, we surveyed the authors of 697 articles published in six top psychology journals and asked them whether they had collaborated on four aspects of analyzing data and reporting results, and whether the described data had been shared between the authors. We acquired responses for 49.6% of the articles and found that co-piloting on statistical analysis and reporting results is quite uncommon among psychologists, while data sharing among co-authors seems reasonably but not completely standard. We then used an automated procedure to study the prevalence of statistical reporting errors in the articles in our sample and examined the relationship between reporting errors and co-piloting. Overall, 63% of the articles contained at least one p-value that was inconsistent with the reported test statistic and the accompanying degrees of freedom, and 20% of the articles contained at least one p-value that was inconsistent to such a degree that it may have affected decisions about statistical significance. Overall, the probability that a given p-value was inconsistent was over 10%. Co-piloting was not found to be associated with reporting errors.
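The kind of automated consistency check described in this record can be sketched as follows. This is a simplified illustration and not the authors' procedure: it handles only z statistics (the standard library has no t or F distribution, so degrees of freedom are not used), and the rounding rule, the function names and the example values are assumptions.

```python
import math


def two_sided_p_from_z(z):
    """Two-sided p-value for a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2.0))


def check_reported_p(z, reported_p, decimals=2, alpha=0.05):
    """Compare a reported p-value with the one recomputed from z.

    Returns (recomputed_p, consistent, decision_error), where
    decision_error marks an inconsistency that also changes the
    significance decision at the given alpha.
    """
    recomputed = two_sided_p_from_z(z)
    consistent = round(recomputed, decimals) == round(reported_p, decimals)
    decision_error = (not consistent) and (
        (recomputed < alpha) != (reported_p < alpha)
    )
    return recomputed, consistent, decision_error


# Hypothetical reported results: (z statistic, reported p).
for z, p in [(2.50, 0.01), (1.50, 0.03), (2.20, 0.04)]:
    recomputed, ok, gross = check_reported_p(z, p)
    print(f"z={z:.2f} reported p={p} recomputed p={recomputed:.4f} "
          f"consistent={ok} decision_error={gross}")
```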
Modeling of aqueous electrolyte solutions with perturbed-chain statistical associated fluid theory

DEFF Research Database (Denmark)

Cameretti, Luca F.; Sadowski, Gabriele; Mollerup, Jørgen

2005-01-01

The vapor pressures and liquid densities of single-salt electrolyte solutions containing NaCl, LiCl, KCl, NaBr, LiBr, KBr, NaI, LiI, KI, Li2SO4, Na2SO4, and K2SO4 were modeled with an equation of state based on perturbed-chain statistical associated fluid theory (PC-SAFT). The PC-SAFT model...
Seizure-associated aphasia has good lateralizing but poor localizing significance.

Science.gov (United States)

Loesch, Anna Mira; Steger, Hannah; Losher, Claudia; Hartl, Elisabeth; Rémi, Jan; Vollmar, Christian; Noachtar, Soheyl

2017-09-01

To investigate the occurrence of ictal and postictal aphasia in different focal epilepsy syndromes. We retrospectively analyzed the video-electroencephalographic monitoring data of 1,118 patients with focal epilepsy for seizure-associated aphasia (SAA). Statistical analysis included chi-square analysis and Fisher's exact test. We identified 102 of 1,118 patients (9.1%) in whom ictal or postictal aphasia (SAA) was part of their recorded seizures (n = 59 of 102; 57.8%) or who reported aphasia by history (n = 43; 42.2% only reported aphasia by history). Postictal aphasia was present in 18 patients (30.5%). Six of the 59 patients had both ictal and postictal aphasia (10.2%). SAA occurred either with left hemisphere seizure onset or with seizures spreading from the right to the left hemisphere. SAA was most common in patients with parieto-occipital epilepsy (10.9%; five of 46 patients), followed by patients with temporal (6.7%; 28 of 420 patients), focal (not further localized; 4.8%; 22 of 462 patients), and frontal epilepsy (2.1%; four of 190 patients; p = 0.04). SAA was more common in parieto-occipital epilepsy than in frontal epilepsy (p = 0.02). In contrast, there was no significant difference in SAA between temporal and parieto-occipital epilepsy (p = 0.36). SAA has a high lateralizing but limited localizing value, as it often reflects spread of epileptic activity into speech-harboring brain regions. Wiley Periodicals, Inc. © 2017 International League Against Epilepsy.
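Fisher's exact test, one of the two tests named in this record, can be sketched for a 2-by-2 table using only the standard library. The two-sided rule used here (summing all tables no more probable than the observed one) is a common convention and an assumption of this example; the abstract does not say which variant or software was used, so the printed value is not expected to reproduce the reported p-values exactly.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table with the same
    margins that is no more probable than the observed table.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(k):
        # Probability that the top-left cell equals k given fixed margins.
        return comb(row1, k) * comb(row2, col1 - k) / denom

    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    p_observed = prob(a)
    tolerance = 1 + 1e-9  # guards against floating-point ties
    total = sum(prob(k) for k in range(lowest, highest + 1)
                if prob(k) <= p_observed * tolerance)
    return min(1.0, total)


# Counts from the abstract: SAA in 5 of 46 parieto-occipital patients
# versus 4 of 190 frontal patients.
p_value = fisher_exact_two_sided(5, 46 - 5, 4, 190 - 4)
print(f"two-sided Fisher exact p = {p_value:.4f}")
```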
Sigsearch: a new term for post hoc unplanned search for statistically significant relationships with the intent to create publishable findings.

Science.gov (United States)

Hashim, Muhammad Jawad

2010-09-01

Post-hoc secondary data analysis with no prespecified hypotheses has been discouraged by textbook authors and journal editors alike. Unfortunately no single term describes this phenomenon succinctly. I would like to coin the term "sigsearch" to define this practice and bring it within the teaching lexicon of statistics courses. Sigsearch would include any unplanned, post-hoc search for statistical significance using multiple comparisons of subgroups. It would also include data analysis with outcomes other than the prespecified primary outcome measure of a study as well as secondary data analyses of earlier research.
Significant association of interleukin-4 gene intron 3 VNTR polymorphism with susceptibility to knee osteoarthritis.

Science.gov (United States)

Yigit, Serbulent; Inanir, Ahmet; Tekcan, Akın; Tural, Ercan; Ozturk, Gokhan Tuna; Kismali, Gorkem; Karakus, Nevin

2014-03-01

Interleukin-4 (IL-4) is a strong chondroprotective cytokine and polymorphisms within this gene may be a risk factor for osteoarthritis (OA). We aimed to investigate genotype and allele frequencies of the IL-4 gene intron 3 variable number of tandem repeats (VNTR) polymorphism in patients with knee OA in a Turkish population. The study included 202 patients with knee OA and 180 healthy controls. Genomic DNA was isolated and the IL-4 gene 70 bp VNTR polymorphism was determined by polymerase chain reaction (PCR) with specific primers followed by restriction fragment length polymorphism (RFLP) analysis. Our results show that there was a statistically significant difference between knee OA patients and the control group with respect to IL-4 genotype distribution and allele frequencies (p=0.000, OR: 0.20, 95% CI: 0.10-0.41, OR: 0.22, 95% CI: 0.12-0.42, respectively). Our findings suggest that the IL-4 gene intron 3 VNTR polymorphism is associated with susceptibility to knee OA. As a result, the IL-4 gene intron 3 VNTR polymorphism could be a genetic marker for OA in a Turkish study population. This is the first association study that evaluates the association between the IL-4 gene VNTR polymorphism and knee OA. Crown Copyright © 2013. Published by Elsevier B.V. All rights reserved.
Canadian Petroleum Association statistical handbook

International Nuclear Information System (INIS)

1992-04-01

Statistical data are presented for the Canadian oil and gas industry for 1991, with some historical and background data included. Tables are provided on land sales and holdings, drilling completions, reserves, production, inventories, production capacity, cash expenditures, value of sales, prices, consumption, sales, refinery capacity and utilization, refinery yields, pipelines, imports and exports, National Energy Board licenses and orders, electricity generation capacity, and supply and disposal of electric energy. 112 tabs
Hippocampal Structure Predicts Statistical Learning and Associative Inference Abilities during Development.

Science.gov (United States)

Schlichting, Margaret L; Guarino, Katharine F; Schapiro, Anna C; Turk-Browne, Nicholas B; Preston, Alison R

2017-01-01

Despite the importance of learning and remembering across the lifespan, little is known about how the episodic memory system develops to support the extraction of associative structure from the environment. Here, we relate individual differences in volumes along the hippocampal long axis to performance on statistical learning and associative inference tasks-both of which require encoding associations that span multiple episodes-in a developmental sample ranging from ages 6 to 30 years. Relating age to volume, we found dissociable patterns across the hippocampal long axis, with opposite nonlinear volume changes in the head and body. These structural differences were paralleled by performance gains across the age range on both tasks, suggesting improvements in the cross-episode binding ability from childhood to adulthood. Controlling for age, we also found that smaller hippocampal heads were associated with superior behavioral performance on both tasks, consistent with this region's hypothesized role in forming generalized codes spanning events. Collectively, these results highlight the importance of examining hippocampal development as a function of position along the hippocampal axis and suggest that the hippocampal head is particularly important in encoding associative structure across development.
Intelligent system for statistically significant expertise knowledge on the basis of the model of self-organizing nonequilibrium dissipative system

Directory of Open Access Journals (Sweden)

E. A. Tatokchin

2017-01-01

Full Text Available The development of modern educational technologies, driven by the broad introduction of computer testing and distance education, makes it necessary to revise the methods used to examine pupils. This work shows the need for a transition to mathematical criteria for examining knowledge that are free of subjectivity. The article reviews the problems that arise in carrying out this task and proposes approaches to solving them. The greatest attention is paid to the problem of objectively transforming the expert's rated estimates onto the scale of student estimates. The discussion concludes that the solution to this problem lies in the creation of specialized intelligent systems. The intelligent system is built on a mathematical model of a self-organizing nonequilibrium dissipative system, which here is a group of students. The article assumes that dissipation is provided by the constant influx of new test items from the expert, and nonequilibrium by the individual psychological characteristics of the students in the group. As a result, the system should self-organize into stable patterns. Such a pattern makes it possible, relying on large amounts of data, to obtain a statistically significant assessment of a student. To justify the proposed approach, the work presents a statistical analysis of the results of testing a large sample of students (> 90). The conclusions of this analysis made it possible to develop an intelligent system for the statistically significant examination of student performance. It is based on a data clustering algorithm (k-means) over three key parameters. It is shown that this approach allows a dynamic and objective expert evaluation to be created.
The Importance of Integrating Clinical Relevance and Statistical Significance in the Assessment of Quality of Care--Illustrated Using the Swedish Stroke Register.

Directory of Open Access Journals (Sweden)

Anita Lindmark

Full Text Available When profiling hospital performance, quality indicators are commonly evaluated through hospital-specific adjusted means with confidence intervals. When identifying deviations from a norm, large hospitals can have statistically significant results even for clinically irrelevant deviations while important deviations in small hospitals can remain undiscovered. We have used data from the Swedish Stroke Register (Riksstroke) to illustrate the properties of a benchmarking method that integrates considerations of both clinical relevance and level of statistical significance. The performance measure used was case-mix adjusted risk of death or dependency in activities of daily living within 3 months after stroke. A hospital was labeled as having outlying performance if its case-mix adjusted risk exceeded a benchmark value with a specified statistical confidence level. The benchmark was expressed relative to the population risk and should reflect the clinically relevant deviation that is to be detected. A simulation study based on Riksstroke patient data from 2008-2009 was performed to investigate the effect of the choice of the statistical confidence level and benchmark value on the diagnostic properties of the method. Simulations were based on 18,309 patients in 76 hospitals. The widely used setting, comparing 95% confidence intervals to the national average, resulted in low sensitivity (0.252) and high specificity (0.991). There were large variations in sensitivity and specificity for different requirements of statistical confidence. Lowering statistical confidence improved sensitivity with a relatively smaller loss of specificity. Variations due to different benchmark values were smaller, especially for sensitivity. This allows the choice of a clinically relevant benchmark to be driven by clinical factors without major concerns about sufficiently reliable evidence. The study emphasizes the importance of combining clinical relevance and level of statistical
The Importance of Integrating Clinical Relevance and Statistical Significance in the Assessment of Quality of Care--Illustrated Using the Swedish Stroke Register.

Science.gov (United States)

Lindmark, Anita; van Rompaye, Bart; Goetghebeur, Els; Glader, Eva-Lotta; Eriksson, Marie

2016-01-01

When profiling hospital performance, quality indicators are commonly evaluated through hospital-specific adjusted means with confidence intervals. When identifying deviations from a norm, large hospitals can have statistically significant results even for clinically irrelevant deviations while important deviations in small hospitals can remain undiscovered. We have used data from the Swedish Stroke Register (Riksstroke) to illustrate the properties of a benchmarking method that integrates considerations of both clinical relevance and level of statistical significance. The performance measure used was case-mix adjusted risk of death or dependency in activities of daily living within 3 months after stroke. A hospital was labeled as having outlying performance if its case-mix adjusted risk exceeded a benchmark value with a specified statistical confidence level. The benchmark was expressed relative to the population risk and should reflect the clinically relevant deviation that is to be detected. A simulation study based on Riksstroke patient data from 2008-2009 was performed to investigate the effect of the choice of the statistical confidence level and benchmark value on the diagnostic properties of the method. Simulations were based on 18,309 patients in 76 hospitals. The widely used setting, comparing 95% confidence intervals to the national average, resulted in low sensitivity (0.252) and high specificity (0.991). There were large variations in sensitivity and specificity for different requirements of statistical confidence. Lowering statistical confidence improved sensitivity with a relatively smaller loss of specificity. Variations due to different benchmark values were smaller, especially for sensitivity. This allows the choice of a clinically relevant benchmark to be driven by clinical factors without major concerns about sufficiently reliable evidence. The study emphasizes the importance of combining clinical relevance and level of statistical confidence when
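The labeling rule and the diagnostic properties discussed in this record can be sketched as below. This is an illustrative reading, not the authors' method: it assumes a one-sided normal lower confidence bound on the adjusted risk, and all function names, parameters and numbers are hypothetical.

```python
from statistics import NormalDist


def is_outlier(risk, std_err, population_risk,
               benchmark_ratio=1.0, confidence=0.95):
    """Flag a hospital whose adjusted risk exceeds the benchmark.

    The benchmark is benchmark_ratio * population_risk, and the hospital
    is flagged when the one-sided lower confidence bound of its risk
    lies above that value (a normal approximation, assumed here).
    """
    z = NormalDist().inv_cdf(confidence)
    lower_bound = risk - z * std_err
    return lower_bound > benchmark_ratio * population_risk


def sensitivity_specificity(flagged, truly_outlying):
    """Sensitivity and specificity of flags against known true status.

    Assumes at least one truly outlying and one non-outlying hospital.
    """
    pairs = list(zip(flagged, truly_outlying))
    true_pos = sum(1 for f, t in pairs if f and t)
    true_neg = sum(1 for f, t in pairs if not f and not t)
    positives = sum(1 for _, t in pairs if t)
    negatives = len(pairs) - positives
    return true_pos / positives, true_neg / negatives


# Hypothetical hospital: adjusted risk 0.30 (SE 0.02), population risk 0.25.
print(is_outlier(0.30, 0.02, 0.25))                        # versus the average
print(is_outlier(0.30, 0.02, 0.25, benchmark_ratio=1.1))   # stricter benchmark
print(is_outlier(0.30, 0.02, 0.25, benchmark_ratio=1.1, confidence=0.80))
```

In a simulation where the true status of each hospital is known, `sensitivity_specificity` can be applied to the flags produced under each choice of `confidence` and `benchmark_ratio` to compare settings.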
The Role of Statistics in Business and Industry

CERN Document Server

Hahn, Gerald J

2011-01-01

An insightful guide to the use of statistics for solving key problems in modern-day business and industry. This book has been awarded the Technometrics Ziegel Prize for the best book reviewed by the journal in 2010. Technometrics is a journal of statistics for the physical, chemical and engineering sciences, published jointly by the American Society for Quality and the American Statistical Association. Criteria for the award include that the book brings together in one volume a body of material previously only available in scattered research articles and having the potential to significantly im
Estimates of statistical significance for comparison of individual positions in multiple sequence alignments

Directory of Open Access Journals (Sweden)

Sadreyev Ruslan I

2004-08-01

Full Text Available Abstract Background Profile-based analysis of multiple sequence alignments (MSA) allows for accurate comparison of protein families. Here, we address the problems of detecting statistically confident dissimilarities between (1) MSA position and a set of predicted residue frequencies, and (2) between two MSA positions. These problems are important for (i) evaluation and optimization of methods predicting residue occurrence at protein positions; (ii) detection of potentially misaligned regions in automatically produced alignments and their further refinement; and (iii) detection of sites that determine functional or structural specificity in two related families. Results For problems (1) and (2), we propose analytical estimates of P-value and apply them to the detection of significant positional dissimilarities in various experimental situations. (a) We compare structure-based predictions of residue propensities at a protein position to the actual residue frequencies in the MSA of homologs. (b) We evaluate our method by the ability to detect erroneous position matches produced by an automatic sequence aligner. (c) We compare MSA positions that correspond to residues aligned by automatic structure aligners. (d) We compare MSA positions that are aligned by high-quality manual superposition of structures. Detected dissimilarities reveal shortcomings of the automatic methods for residue frequency prediction and alignment construction. For the high-quality structural alignments, the dissimilarities suggest sites of potential functional or structural importance. Conclusion The proposed computational method is of significant potential value for the analysis of protein families.
Test the Overall Significance of p-values by Using Joint Tail Probability of Ordered p-values as Test Statistic

NARCIS (Netherlands)

Fang, Yongxiang; Wit, Ernst

2008-01-01

Fisher’s combined probability test is the most commonly used method to test the overall significance of a set of independent p-values. However, it is obvious that Fisher’s statistic is more sensitive to smaller p-values than to larger p-values, and a small p-value may overrule the other p-values
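Fisher's combined probability test referred to in this record uses the statistic X = -2 Σ ln(p_i), which follows a chi-square distribution with 2k degrees of freedom when the k p-values are independent and the null hypothesis holds. The sketch below implements that standard test, not the joint tail probability statistic the authors propose; the function names and example p-values are hypothetical. It relies on the closed-form chi-square tail for even degrees of freedom, so no external library is needed.

```python
import math


def chi2_sf_even_df(x, half_df):
    """Upper tail of a chi-square with 2 * half_df degrees of freedom.

    For even degrees of freedom the tail has the closed form
    exp(-x/2) * sum_{i < half_df} (x/2)^i / i!.
    """
    half_x = x / 2.0
    term = 1.0
    total = 1.0
    for i in range(1, half_df):
        term *= half_x / i
        total += term
    return math.exp(-half_x) * total


def fisher_combined_p(p_values):
    """Combined p-value of independent p-values by Fisher's method."""
    statistic = -2.0 * sum(math.log(p) for p in p_values)
    return chi2_sf_even_df(statistic, len(p_values))


# One very small p-value dominates two unremarkable ones.
print(fisher_combined_p([1e-6, 0.9, 0.9]))
# Three moderate p-values of the same size.
print(fisher_combined_p([0.2, 0.2, 0.2]))
```

The first call illustrates the sensitivity noted in the abstract: a single very small p-value can drive the combined result even when the remaining p-values show no evidence against the null hypothesis.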
GWAPower: a statistical power calculation software for genome-wide association studies with quantitative traits.

Science.gov (United States)

Feng, Sheng; Wang, Shengchu; Chen, Chia-Cheng; Lan, Lan

2011-01-21

In designing genome-wide association (GWA) studies it is important to calculate statistical power. General statistical power calculation procedures for quantitative measures often require information concerning summary statistics of distributions such as mean and variance. However, with genetic studies, the effect size of quantitative traits is traditionally expressed as heritability, a quantity defined as the amount of phenotypic variation in the population that can be ascribed to the genetic variants among individuals. Heritability is hard to transform into summary statistics. Therefore, general power calculation procedures cannot be used directly in GWA studies. The development of appropriate statistical methods and a user-friendly software package to address this problem would be welcomed. This paper presents GWAPower, a statistical software package of power calculation designed for GWA studies with quantitative traits, where genetic effect is defined as heritability. Based on several popular one-degree-of-freedom genetic models, this method avoids the need to specify the non-centrality parameter of the F-distribution under the alternative hypothesis. Therefore, it can use heritability information directly without approximation. In GWAPower, the power calculation can be easily adjusted for adding covariates and linkage disequilibrium information. An example is provided to illustrate GWAPower, followed by discussions. GWAPower is a user-friendly free software package for calculating statistical power based on heritability in GWA studies with quantitative traits. The software is freely available at: http://dl.dropbox.com/u/10502931/GWAPower.zip.
Improved score statistics for meta-analysis in single-variant and gene-level association studies.

Science.gov (United States)

Yang, Jingjing; Chen, Sai; Abecasis, Gonçalo

2018-06-01

Meta-analysis is now an essential tool for genetic association studies, allowing them to combine large studies and greatly accelerating the pace of genetic discovery. Although the standard meta-analysis methods perform equivalently to the more cumbersome joint analysis under ideal settings, they result in substantial power loss under unbalanced settings with various case-control ratios. Here, we investigate the power loss problem of the standard meta-analysis methods for unbalanced studies, and further propose novel meta-analysis methods performing equivalently to the joint analysis under both balanced and unbalanced settings. We derive improved meta-score-statistics that can accurately approximate the joint-score-statistics with combined individual-level data, for both linear and logistic regression models, with and without covariates. In addition, we propose a novel approach to adjust for population stratification by correcting for known population structures through minor allele frequencies. In the simulated gene-level association studies under unbalanced settings, our method recovered up to 85% power loss caused by the standard methods. We further showed the power gain of our methods in gene-level tests with 26 unbalanced studies of age-related macular degeneration. In addition, we took the meta-analysis of three unbalanced studies of type 2 diabetes as an example to discuss the challenges of meta-analyzing multi-ethnic samples. In summary, our improved meta-score-statistics with corrections for population stratification can be used to construct both single-variant and gene-level association studies, providing a useful framework for ensuring well-powered, convenient, cross-study analyses. © 2018 WILEY PERIODICALS, INC.
Polymorphisms in the TLR4 and TLR5 gene are significantly associated with inflammatory bowel disease in German shepherd dogs.

Science.gov (United States)

Kathrani, Aarti; House, Arthur; Catchpole, Brian; Murphy, Angela; German, Alex; Werling, Dirk; Allenspach, Karin

2010-12-23

Inflammatory bowel disease (IBD) is considered to be the most common cause of vomiting and diarrhoea in dogs, and the German shepherd dog (GSD) is particularly susceptible. The exact aetiology of IBD is unknown; however, associations have been identified between specific single-nucleotide polymorphisms (SNPs) in Toll-like receptors (TLRs) and human IBD. To date, no genetic studies have been undertaken in canine IBD. The aim of this study was to investigate whether polymorphisms in canine TLR 2, 4 and 5 genes are associated with IBD in GSDs. Mutational analysis of TLR2, TLR4 and TLR5 was performed in 10 unrelated GSDs with IBD. Four non-synonymous SNPs (T23C, G1039A, A1571T and G1807A) were identified in the TLR4 gene, and three non-synonymous SNPs (G22A, C100T and T1844C) were identified in the TLR5 gene. The non-synonymous SNPs identified in TLR4 and TLR5 were evaluated further in a case-control study using a SNaPSHOT multiplex reaction. Sequencing information from 55 unrelated GSDs with IBD was compared to a control group consisting of 61 unrelated GSDs. The G22A SNP in TLR5 was significantly associated with IBD in GSDs, whereas the remaining two SNPs were found to be significantly protective for IBD. Furthermore, the two SNPs in TLR4 (A1571T and G1807A) were in complete linkage disequilibrium, and were also significantly associated with IBD. The TLR5 risk haplotype (ACC) without the two associated TLR4 SNP alleles was significantly associated with IBD; however, the presence of the two TLR4 SNP risk alleles without the TLR5 risk haplotype was not statistically associated with IBD. Our study suggests that the three TLR5 SNPs and two TLR4 SNPs (A1571T and G1807A) could play a role in the pathogenesis of IBD in GSDs. Further studies are required to confirm the functional importance of these polymorphisms in the pathogenesis of this disease.
Statistical determination of significant curved I-girder bridge seismic response parameters

Science.gov (United States)

Seo, Junwon

2013-06-01

Curved steel bridges are commonly used at interchanges in transportation networks and more of these structures continue to be designed and built in the United States. Though the use of these bridges continues to increase in locations that experience high seismicity, the effects of curvature and other parameters on their seismic behaviors have been neglected in current risk assessment tools. These tools can evaluate the seismic vulnerability of a transportation network using fragility curves. One critical component of fragility curve development for curved steel bridges is the completion of sensitivity analyses that help identify influential parameters related to their seismic response. In this study, an accessible inventory of existing curved steel girder bridges located primarily in the Mid-Atlantic United States (MAUS) was used to establish statistical characteristics used as inputs for a seismic sensitivity study. Critical seismic response quantities were captured using 3D nonlinear finite element models. Influential parameters from these quantities were identified using statistical tools that incorporate experimental Plackett-Burman Design (PBD), which included Pareto optimal plots and prediction profiler techniques. The findings revealed that the potential variation in the influential parameters included number of spans, radius of curvature, maximum span length, girder spacing, and cross-frame spacing. These parameters showed varying levels of influence on the critical bridge response.

Integration of association statistics over genomic regions using Bayesian adaptive regression splines

Directory of Open Access Journals (Sweden)

Zhang Xiaohua

2003-11-01

Full Text Available Abstract In the search for genetic determinants of complex disease, two approaches to association analysis are most often employed, testing single loci or testing a small group of loci jointly via haplotypes for their relationship to disease status. It is still debatable which of these approaches is more favourable, and under what conditions. The former has the advantage of simplicity but suffers severely when alleles at the tested loci are not in linkage disequilibrium (LD) with liability alleles; the latter should capture more of the signal encoded in LD, but is far from simple. The complexity of haplotype analysis could be especially troublesome for association scans over large genomic regions, which, in fact, is becoming the standard design. For these reasons, the authors have been evaluating statistical methods that bridge the gap between single-locus and haplotype-based tests. In this article, they present one such method, which uses non-parametric regression techniques embodied by Bayesian adaptive regression splines (BARS). For a set of markers falling within a common genomic region and a corresponding set of single-locus association statistics, the BARS procedure integrates these results into a single test by examining the class of smooth curves consistent with the data. The non-parametric BARS procedure generally finds no signal when no liability allele exists in the tested region (i.e., it achieves the specified size of the test) and it is sensitive enough to pick up signals when a liability allele is present. The BARS procedure provides a robust and potentially powerful alternative to classical tests of association, diminishes the multiple testing problem inherent in those tests and can be applied to a wide range of data types, including genotype frequencies estimated from pooled samples.
Statistical theory applications and associated computer codes

International Nuclear Information System (INIS)

Prince, A.

1980-01-01

The general format is along the same lines as that used in the O.M. Session, i.e. an introduction to the nature of the physical problems and methods of solution based on the statistical model of the nucleus. Both binary and higher multiple reactions are considered. The computer codes used in this session are a combination of optical model and statistical theory. As with the O.M. sessions, the preparation of input and analysis of output are thoroughly examined. Again, comparison with experimental data serves to demonstrate the validity of the results and possible areas for improvement. (author)
metaCCA: summary statistics-based multivariate meta-analysis of genome-wide association studies using canonical correlation analysis.

Science.gov (United States)

Cichonska, Anna; Rousu, Juho; Marttinen, Pekka; Kangas, Antti J; Soininen, Pasi; Lehtimäki, Terho; Raitakari, Olli T; Järvelin, Marjo-Riitta; Salomaa, Veikko; Ala-Korpela, Mika; Ripatti, Samuli; Pirinen, Matti

2016-07-01

A dominant approach to genetic association studies is to perform univariate tests between genotype-phenotype pairs. However, analyzing related traits together increases statistical power, and certain complex associations become detectable only when several variants are tested jointly. Currently, modest sample sizes of individual cohorts, and restricted availability of individual-level genotype-phenotype data across the cohorts limit conducting multivariate tests. We introduce metaCCA, a computational framework for summary statistics-based analysis of a single or multiple studies that allows multivariate representation of both genotype and phenotype. It extends the statistical technique of canonical correlation analysis to the setting where original individual-level records are not available, and employs a covariance shrinkage algorithm to achieve robustness. Multivariate meta-analysis of two Finnish studies of nuclear magnetic resonance metabolomics by metaCCA, using standard univariate output from the program SNPTEST, shows an excellent agreement with the pooled individual-level analysis of original data. Motivated by strong multivariate signals in the lipid genes tested, we envision that multivariate association testing using metaCCA has a great potential to provide novel insights from already published summary statistics from high-throughput phenotyping technologies. Code is available at https://github.com/aalto-ics-kepaco. Contact: anna.cichonska@helsinki.fi or matti.pirinen@helsinki.fi. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Statistical analogues of thermodynamic extremum principles

Science.gov (United States)

Ramshaw, John D.

2018-05-01

As shown by Jaynes, the canonical and grand canonical probability distributions of equilibrium statistical mechanics can be simply derived from the principle of maximum entropy, in which the statistical entropy $S = -k_B \sum_i p_i \log p_i$ is maximised subject to constraints on the mean values of the energy E and/or number of particles N in a system of fixed volume V. The Lagrange multipliers associated with those constraints are then found to be simply related to the temperature T and chemical potential μ. Here we show that the constrained maximisation of S is equivalent to, and can therefore be replaced by, the essentially unconstrained minimisation of the obvious statistical analogues of the Helmholtz free energy F = E − TS and the grand potential J = F − μN. Those minimisations are more easily performed than the maximisation of S because they formally eliminate the constraints on the mean values of E and N and their associated Lagrange multipliers. This procedure significantly simplifies the derivation of the canonical and grand canonical probability distributions, and shows that the well known extremum principles for the various thermodynamic potentials possess natural statistical analogues which are equivalent to the constrained maximisation of S.
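The free-energy route described in this abstract can be sketched for the canonical case as follows. This is an illustrative derivation, not the paper's own: it assumes discrete microstates with energies $E_i$ (a symbol introduced here), treats T as a fixed parameter, and keeps only the normalisation condition $\sum_i p_i = 1$, enforced with a multiplier $\lambda$.

```latex
\begin{align}
F[p] &= \sum_i p_i E_i + k_B T \sum_i p_i \log p_i, \\
0 &= \frac{\partial}{\partial p_i}\Big(F[p] + \lambda \sum_j p_j\Big)
   = E_i + k_B T\,(\log p_i + 1) + \lambda, \\
p_i &= \frac{e^{-E_i/k_B T}}{Z}, \qquad Z = \sum_j e^{-E_j/k_B T}, \\
F_{\min} &= \sum_i p_i E_i + k_B T \sum_i p_i \Big(-\frac{E_i}{k_B T} - \log Z\Big) = -k_B T \log Z .
\end{align}
```

No multiplier for the mean energy appears; the temperature enters directly through the definition of F, which is the simplification the abstract refers to.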
Youth Sports Safety Statistics

Science.gov (United States)

... 6):794-799. 31 American Heart Association. CPR statistics. www.heart.org/HEARTORG/CPRAndECC/WhatisCPR/CPRFactsandStats/CPRpercent20Statistics_ ... Mental Health Services Administration, Center for Behavioral Health Statistics and Quality. (January 10, 2013). The DAWN Report: ...
Statistically significant dependence of the Xaa-Pro peptide bond conformation on secondary structure and amino acid sequence

Directory of Open Access Journals (Sweden)

Leitner Dietmar

2005-04-01

Full Text Available Abstract Background A reliable prediction of the Xaa-Pro peptide bond conformation would be a useful tool for many protein structure calculation methods. We have analyzed the Protein Data Bank and show that the combined use of sequential and structural information has a predictive value for the assessment of the cis versus trans peptide bond conformation of Xaa-Pro within proteins. For the analysis of the data sets different statistical methods such as the calculation of the Chou-Fasman parameters and occurrence matrices were used. Furthermore we analyzed the relationship between the relative solvent accessibility and the relative occurrence of prolines in the cis and in the trans conformation. Results One of the main results of the statistical investigations is the ranking of the secondary structure and sequence information with respect to the prediction of the Xaa-Pro peptide bond conformation. We observed a significant impact of secondary structure information on the occurrence of the Xaa-Pro peptide bond conformation, while the sequence information of amino acids neighboring proline is of little predictive value for the conformation of this bond. Conclusion In this work, we present an extensive analysis of the occurrence of the cis and trans proline conformation in proteins. Based on the data set, we derived patterns and rules for a possible prediction of the proline conformation. Upon adoption of the Chou-Fasman parameters, we are able to derive statistically relevant correlations between the secondary structure of amino acid fragments and the Xaa-Pro peptide bond conformation.
A novel statistic for genome-wide interaction analysis.

Directory of Open Access Journals (Sweden)

Xuesen Wu

2010-09-01

Full Text Available Although great progress in genome-wide association studies (GWAS) has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interaction analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked). The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001 […] statistic is able to search for significant interactions between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.
Heart Disease and Stroke Statistics

Science.gov (United States)

... Media for Heart.org Heart and Stroke Association Statistics Each year, the American Heart Association, in conjunction ... health and disease in the population. Heart & Stroke Statistics FAQs What is Prevalence? Prevalence is an estimate ...
CONFIDENCE LEVELS AND/VS. STATISTICAL HYPOTHESIS TESTING IN STATISTICAL ANALYSIS. CASE STUDY

Directory of Open Access Journals (Sweden)

ILEANA BRUDIU

2009-05-01

Full Text Available Parameter estimation with confidence intervals and statistical hypothesis testing are used in statistical analysis to draw conclusions about a population from a sample extracted from it. The case study presented in this paper aims to highlight the importance of the sample size used in the study and how it is reflected in the results obtained when using confidence intervals and hypothesis testing for pregnant subjects. Whereas statistical hypothesis testing gives only a "yes" or "no" answer to certain questions, estimation with confidence intervals provides more information than a test statistic: it shows the high degree of uncertainty arising from small samples and from findings in the "marginally significant" or "almost significant" range (p very close to 0.05).
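The dependence of interval width on sample size discussed in this record can be illustrated with a minimal sketch. It assumes a normal-approximation interval for a mean with a known standard deviation; the function name and the numbers are hypothetical and are not taken from the case study.

```python
import math

def mean_ci_halfwidth(sd, n, z=1.96):
    """Half-width of an approximate 95% confidence interval for a mean.

    Assumes a normal approximation (z = 1.96); for small samples a
    t quantile would give a somewhat wider interval.
    """
    return z * sd / math.sqrt(n)

# Hypothetical standard deviation of 10 units at three sample sizes.
for n in (10, 40, 160):
    print(n, round(mean_ci_halfwidth(10.0, n), 2))
```

Quadrupling the sample size halves the interval width, which is why a "marginally significant" result from a small sample carries so much uncertainty.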
Statistical study of seismicity associated with geothermal reservoirs in California

Energy Technology Data Exchange (ETDEWEB)

Hadley, D.M.; Cavit, D.S.

1982-01-01

Statistical methods are outlined to separate spatially, temporally, and magnitude-dependent portions of both the random and non-random components of the seismicity. The methodology employed compares the seismicity distributions with a generalized Poisson distribution. Temporally related events are identified by the distribution of the interoccurrence times. The regions studied to date include the Imperial Valley, Coso, The Geysers, Lassen, and the San Jacinto fault. The spatial characteristics of the random and clustered components of the seismicity are diffuse and appear unsuitable for defining the areal extent of the reservoir. However, from the temporal characteristics of the seismicity associated with these regions a general discriminant was constructed that combines several physical parameters for identifying the presence of a geothermal system.
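One simple way to look at interoccurrence times, sketched here as an illustration rather than as the authors' procedure: for a homogeneous Poisson process the gaps between events are exponentially distributed, so their coefficient of variation (CV) is about 1, while temporally clustered events give a CV well above 1. The function name and the event times in the usage note are hypothetical.

```python
import statistics

def interoccurrence_cv(event_times):
    """Coefficient of variation of the gaps between consecutive events.

    A value near 1 is what a homogeneous Poisson process would give;
    values well above 1 suggest temporal clustering.
    """
    ts = sorted(event_times)
    gaps = [b - a for a, b in zip(ts, ts[1:])]
    return statistics.pstdev(gaps) / statistics.mean(gaps)
```

For example, `interoccurrence_cv([0, 0.1, 0.2, 10, 10.1, 10.2])` describes two tight bursts separated by a long quiet interval and returns a value greater than 1.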
A novel variational Bayes multiple locus Z-statistic for genome-wide association studies with Bayesian model averaging

Science.gov (United States)

Logsdon, Benjamin A.; Carty, Cara L.; Reiner, Alexander P.; Dai, James Y.; Kooperberg, Charles

2012-01-01

Motivation: For many complex traits, including height, the majority of variants identified by genome-wide association studies (GWAS) have small effects, leaving a significant proportion of the heritable variation unexplained. Although many penalized multiple regression methodologies have been proposed to increase the power to detect associations for complex genetic architectures, they generally lack mechanisms for false-positive control and diagnostics for model over-fitting. Our methodology is the first penalized multiple regression approach that explicitly controls Type I error rates and provides model over-fitting diagnostics through a novel normally distributed statistic defined for every marker within the GWAS, based on results from a variational Bayes spike regression algorithm. Results: We compare the performance of our method to the lasso and single marker analysis on simulated data and demonstrate that our approach has superior performance in terms of power and Type I error control. In addition, using the Women's Health Initiative (WHI) SNP Health Association Resource (SHARe) GWAS of African-Americans, we show that our method has power to detect additional novel associations with body height. These findings replicate by reaching a stringent cutoff of marginal association in a larger cohort. Availability: An R-package, including an implementation of our variational Bayes spike regression (vBsr) algorithm, is available at http://kooperberg.fhcrc.org/soft.html. Contact: blogsdon@fhcrc.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22563072
NCK2 Is Significantly Associated with Opiates Addiction in African-Origin Men

Directory of Open Access Journals (Sweden)

Zhifa Liu

2013-01-01

Full Text Available Substance dependence is a complex environmental and genetic disorder with significant social and medical concerns. Understanding the etiology of substance dependence is imperative to the development of effective treatment and prevention strategies. To this end, substantial effort has been made to identify genes underlying substance dependence, and in recent years, genome-wide association studies (GWASs) have led to discoveries of numerous genetic variants for complex diseases including substance dependence. Most of the GWAS discoveries were only based on single nucleotide polymorphisms (SNPs) and a single dichotomized outcome. By employing both SNP- and gene-based methods of analysis, we identified a strong (odds ratio = 13.87) and significant (P value = 1.33E−11) association of an SNP in the NCK2 gene on chromosome 2 with opiates addiction in African-origin men. Codependence analysis also identified a genome-wide significant association between NCK2 and comorbidity of substance dependence (P value = 3.65E−08) in African-origin men. Furthermore, we observed that the association between the NCK2 gene (P value = 3.12E−10) and opiates addiction reached the gene-based genome-wide significant level. In summary, our findings provided the first evidence for the involvement of NCK2 in the susceptibility to opiates addiction and further revealed the racial and gender specificities of its impact.
Polymorphisms in the Tlr4 and Tlr5 Gene Are Significantly Associated with Inflammatory Bowel Disease in German Shepherd Dogs

Science.gov (United States)

Kathrani, Aarti; House, Arthur; Catchpole, Brian; Murphy, Angela; German, Alex; Werling, Dirk; Allenspach, Karin

2010-01-01

Inflammatory bowel disease (IBD) is considered to be the most common cause of vomiting and diarrhoea in dogs, and the German shepherd dog (GSD) is particularly susceptible. The exact aetiology of IBD is unknown, however associations have been identified between specific single-nucleotide polymorphisms (SNPs) in Toll-like receptors (TLRs) and human IBD. However, to date, no genetic studies have been undertaken in canine IBD. The aim of this study was to investigate whether polymorphisms in canine TLR 2, 4 and 5 genes are associated with IBD in GSDs. Mutational analysis of TLR2, TLR4 and TLR5 was performed in 10 unrelated GSDs with IBD. Four non-synonymous SNPs (T23C, G1039A, A1571T and G1807A) were identified in the TLR4 gene, and three non-synonymous SNPs (G22A, C100T and T1844C) were identified in the TLR5 gene. The non-synonymous SNPs identified in TLR4 and TLR5 were evaluated further in a case-control study using a SNaPSHOT multiplex reaction. Sequencing information from 55 unrelated GSDs with IBD were compared to a control group consisting of 61 unrelated GSDs. The G22A SNP in TLR5 was significantly associated with IBD in GSDs, whereas the remaining two SNPs were found to be significantly protective for IBD. Furthermore, the two SNPs in TLR4 (A1571T and G1807A) were in complete linkage disequilibrium, and were also significantly associated with IBD. The TLR5 risk haplotype (ACC) without the two associated TLR4 SNP alleles was significantly associated with IBD, however the presence of the two TLR4 SNP risk alleles without the TLR5 risk haplotype was not statistically associated with IBD. Our study suggests that the three TLR5 SNPs and two TLR4 SNPs; A1571T and G1807A could play a role in the pathogenesis of IBD in GSDs. Further studies are required to confirm the functional importance of these polymorphisms in the pathogenesis of this disease. PMID:21203467
The significance of FM associations for women with FM.

Science.gov (United States)

Juuso, Päivi; Söderberg, Siv; Olsson, Malin; Skär, Lisa

2014-01-01

Living with fibromyalgia (FM) means living with a long-term pain syndrome that is invisible to others. Support and understanding from others seem to be important to managing the affected daily life. The aim of this study was to describe the significance of FM associations for women with FM. Data collection was carried out through focus group discussions with seventeen women with FM. Data were analyzed through thematic content analysis. The findings show that women experienced associations for people with FM as important as they gave access to contacts with others with similar experiences. Their need of togetherness was fulfilled at the association and they described being strengthened by the support received. Because of the lack of information and knowledge about FM, the association was described as an important venue for getting and mediating information about the illness. At the association the women seem to be empowered, which increases their ability to manage their daily lives despite the limitations imposed by FM. Healthcare personnel could not satisfy the women's needs and to manage to support women with FM. There is a need for communication based on a shared understanding between the women and healthcare personnel. This study highlighted the need for communication based on a shared understanding between people with chronic illness and healthcare personnel to support and strengthen women with FM in their daily lives. The FM associations meet the needs for togetherness, confirmation, and information that the women with FM in this study described and healthcare personnel could not satisfy. Healthcare personnel can learn from FM associations how to empower women with FM in their everyday lives.
Radiotherapy is associated with significant improvement in local and regional control in Merkel cell carcinoma

International Nuclear Information System (INIS)

Kang, Susan H; Haydu, Lauren E; Goh, Robin Yeong Hong; Fogarty, Gerald B

2012-01-01

Merkel cell carcinoma (MCC) is a rare tumour of skin. This study is a retrospective audit of patients with MCC from St Vincent’s and Mater Hospital, Sydney, Australia. The aim of this study was to investigate the influence of radiotherapy (RT) on the local and regional control of MCC lesions and survival of patients with MCC. We searched the databases in anatomical pathology, RT and surgery for patients having a diagnosis of MCC between 1996 and 2007. Patient, tumour and treatment characteristics were collected and analysed. Univariate survival analysis of categorical variables was conducted with the Kaplan-Meier method together with the Log-Rank test for statistical significance. Continuous variables were assessed using the Cox regression method. Multivariate analysis was performed for significant univariate results. Sixty-seven patients were found. Sixty-two who were stage I-III and were treated with radical intent were analysed. 68% were male. The median age was 74 years. Forty-two cases (68%) were stage I or II, and 20 cases (32%) were stage III. For the subset of 42 stage I and II patients, those that had RT to their primary site had a 2-year local recurrence free survival of 89% compared with 36% for patients not receiving RT (p<0.001). The cumulative 2-year regional recurrence free survival for patients having adjuvant regional RT was 84% compared with 43% for patients not receiving this treatment (p<0.001). Immune status at initial surgery was a significant predictor for OS and MCCSS. In a multivariate analysis combining macroscopic size (mm) and immune status at initial surgery, only immune status remained a significant predictor of overall survival (HR=2.096, 95% CI: 1.002-4.385, p=0.049). RT is associated with significant improvement in local and regional control in Merkel cell carcinoma. Immunosuppression is an important factor in overall survival.
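The Kaplan-Meier method named in this abstract can be sketched in a few lines. This is a minimal product-limit estimator written for illustration only; the function name is hypothetical, the follow-up times in the usage note are invented rather than study data, and the log-rank test and Cox regression are not covered.

```python
def kaplan_meier(times, events):
    """Product-limit survival estimate.

    times  -- follow-up time for each patient
    events -- 1 if the event (e.g. recurrence) was observed, 0 if censored
    Returns a list of (time, survival probability) at each event time.
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        n_events = 0
        n_leaving = 0
        # Group all patients who share this follow-up time.
        while i < len(data) and data[i][0] == t:
            n_events += data[i][1]
            n_leaving += 1
            i += 1
        if n_events:
            surv *= 1 - n_events / n_at_risk
            curve.append((t, surv))
        # Both events and censored patients leave the risk set.
        n_at_risk -= n_leaving
    return curve
```

With four hypothetical patients, `kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1])` steps down only at the three event times; the patient censored at time 2 shrinks the risk set without lowering the curve.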
Tumor-Associated Macrophages Provide Significant Prognostic Information in Urothelial Bladder Cancer.

Directory of Open Access Journals (Sweden)

Minna M Boström

Full Text Available Inflammation is an important feature of carcinogenesis. Tumor-associated macrophages (TAMs) can be associated with either poor or improved prognosis, depending on their properties and polarization. Current knowledge of the prognostic significance of TAMs in bladder cancer is limited and was investigated in this study. We analyzed 184 urothelial bladder cancer patients undergoing transurethral resection of a bladder tumor or radical cystectomy. CD68 (pan-macrophage marker), MAC387 (polarized towards type 1 macrophages), and CLEVER-1/Stabilin-1 (type 2 macrophages and lymphatic/blood vessels) were detected immunohistochemically. The median follow-up time was 6.0 years. High macrophage counts associated with a higher pT category and grade. Among patients undergoing transurethral resection, all studied markers apart from CLEVER-1/Stabilin-1 were associated with increased risk of progression and poorer disease-specific and overall survival in univariate analyses. High levels of two macrophage markers (CD68/MAC387+/+ or CD68/CLEVER-1+/+ groups) had an independent prognostic role after transurethral resection in multivariate analyses. In the cystectomy cohort, MAC387, alone and in combination with CD68, was associated with poorer survival in univariate analyses, but none of the markers were independent predictors of outcome in multivariate analyses. In conclusion, this study demonstrates that macrophage phenotypes provide significant independent prognostic information, particularly in bladder cancers undergoing transurethral resection.
THE ANALYSIS OF STATISTICAL DATA ON MALIGNANT NEOPLASMS ASSOCIATED WITH HUMAN PAPILLOMAVIRUS

Directory of Open Access Journals (Sweden)

A. A. Kostin

2016-01-01

Full Text Available This study of statistical data presents, for the first time in Russia, an analysis of the morbidity and mortality of patients with malignant neoplasms that may be associated with human papillomavirus (HPV): cervical cancer, cancer of the vulva and vagina, cancer of the penis, cancer of the rectum, anal canal and rectosigmoid junction, and cancer of the pharynx and larynx.
Fusing Data Mining, Machine Learning and Traditional Statistics to Detect Biomarkers Associated with Depression.

Science.gov (United States)

Dipnall, Joanna F; Pasco, Julie A; Berk, Michael; Williams, Lana J; Dodd, Seetal; Jacka, Felice N; Meyer, Denny

2016-01-01

Atheoretical large-scale data mining techniques using machine learning algorithms have promise in the analysis of large epidemiological datasets. This study illustrates the use of a hybrid methodology for variable selection that took account of missing data and complex survey design to identify key biomarkers associated with depression from a large epidemiological study. The study used a three-step methodology amalgamating multiple imputation, a machine learning boosted regression algorithm and logistic regression, to identify key biomarkers associated with depression in the National Health and Nutrition Examination Study (2009-2010). Depression was measured using the Patient Health Questionnaire-9 and 67 biomarkers were analysed. Covariates in this study included gender, age, race, smoking, food security, Poverty Income Ratio, Body Mass Index, physical activity, alcohol use, medical conditions and medications. The final imputed weighted multiple logistic regression model included possible confounders and moderators. After the creation of 20 imputation data sets from multiple chained regression sequences, machine learning boosted regression initially identified 21 biomarkers associated with depression. Using traditional logistic regression methods, including controlling for possible confounders and moderators, a final set of three biomarkers were selected. The final three biomarkers from the novel hybrid variable selection methodology were red cell distribution width (OR 1.15; 95% CI 1.01, 1.30), serum glucose (OR 1.01; 95% CI 1.00, 1.01) and total bilirubin (OR 0.12; 95% CI 0.05, 0.28). Significant interactions were found between total bilirubin with Mexican American/Hispanic group (p = 0.016), and current smokers (p […] machine learning algorithm with traditional statistical modelling, accounted for missing data and complex survey sampling methodology and was demonstrated to be a useful tool for detecting three biomarkers associated with depression for future
A scan statistic to extract causal gene clusters from case-control genome-wide rare CNV data

Directory of Open Access Journals (Sweden)

Scherer Stephen W

2011-05-01

Full Text Available Abstract Background Several statistical tests have been developed for analyzing genome-wide association data by incorporating gene pathway information in terms of gene sets. Using these methods, hundreds of gene sets are typically tested, and the tested gene sets often overlap. This overlapping greatly increases the probability of generating false positives, and the results obtained are difficult to interpret, particularly when many gene sets show statistical significance. Results We propose a flexible statistical framework to circumvent these problems. Inspired by spatial scan statistics for detecting clustering of disease occurrence in the field of epidemiology, we developed a scan statistic to extract disease-associated gene clusters from a whole gene pathway. Extracting one or a few significant gene clusters from a global pathway limits the overall false positive probability, which results in increased statistical power, and facilitates the interpretation of test results. In the present study, we applied our method to genome-wide association data for rare copy-number variations, which have been strongly implicated in common diseases. Application of our method to a simulated dataset demonstrated the high accuracy of this method in detecting disease-associated gene clusters in a whole gene pathway. Conclusions The scan statistic approach proposed here shows a high level of accuracy in detecting gene clusters in a whole gene pathway. This study has provided a sound statistical framework for analyzing genome-wide rare CNV data by incorporating topological information on the gene pathway.
Significant association between asthma risk and the GSTM1 and GSTT1 deletion polymorphisms: an updated meta-analysis of case-control studies.

Science.gov (United States)

Liang, Siqiao; Wei, Xuan; Gong, Chen; Wei, Jinmei; Chen, Zhangrong; Chen, Xiaoli; Wang, Zhibo; Deng, Jingmin

2013-07-01

Polymorphisms in GSTM1 and GSTT1 may be associated with asthma risk, yet several studies and meta-analyses have reported inconclusive results. Therefore, an updated meta-analysis was conducted. Literature searches were performed using the Pubmed, Embase and Web of Science databases until October 2012. Variant 'null' genotype was compared with wild-type 'present' in the pooled data. All statistical analyses were performed using STATA 11.0. A total of 26 case-control studies were suitable for inclusion in the meta-analysis. In the overall population, a significant association was found for both the GSTM1 (odds ratio (OR) = 1.452; 95% confidence interval (CI): 1.192-1.770) and GSTT1 polymorphism (OR = 1.792; 95% CI:1.293-2.483). For subgroup analysis by age, GSTM1 significantly increased risk for both children (OR = 1.368; 95% CI: 1.051-1.781) and adults (OR = 1.859; 95% CI: 1.183-2.921). For GSTT1, a significant association was only found in the adult population (OR = 2.312; 95%CI: 1.204-4.439). Based on subgroup analysis by ethnicity, a significant association for GSTM1 was found in Europe (OR = 1.303; 95% CI: 1.018-1.667), Africa (OR = 2.175; 95%CI: 1.560-3.031) and Latin America (OR = 2.265; 95%CI: 1.375-3.729). For GSTT1, significantly increased risk was found only for Asian (OR = 2.105; 95% CI: 1.101-4.025) and Russian (OR = 2.747; 95% CI: 1.071-7.046) populations. This meta-analysis provides evidence that GSTM1 and GSTT1 polymorphisms may be risk factors for asthma. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.
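A quick plausibility check on odds ratios such as those reported above: a Wald-type 95% confidence interval is symmetric on the log scale, so the geometric midpoint of its limits should reproduce the point estimate, and the width yields the standard error of the log odds ratio. The sketch below assumes that interval type and a z value of 1.96; the function name is hypothetical and the code is not part of the original meta-analysis.

```python
import math

def log_or_summary(lower, upper, z=1.96):
    """Recover the odds ratio and SE(log OR) from a 95% CI.

    Assumes the interval was built as exp(log OR +/- z * SE).
    """
    log_or = (math.log(lower) + math.log(upper)) / 2
    se = (math.log(upper) - math.log(lower)) / (2 * z)
    return math.exp(log_or), se

# Overall GSTM1 result as reported: OR = 1.452, 95% CI 1.192-1.770.
or_mid, se = log_or_summary(1.192, 1.770)
print(round(or_mid, 3), round(se, 3))
```

The recovered midpoint agrees with the reported GSTM1 estimate to within rounding, and the same check applied to the overall GSTT1 interval (1.293-2.483) lands close to the reported 1.792.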

Statistical reporting inconsistencies in experimental philosophy

Science.gov (United States)

Colombo, Matteo; Duev, Georgi; Nuijten, Michèle B.; Sprenger, Jan

2018-01-01

Experimental philosophy (x-phi) is a young field of research in the intersection of philosophy and psychology. It aims to make progress on philosophical questions by using experimental methods traditionally associated with the psychological and behavioral sciences, such as null hypothesis significance testing (NHST). Motivated by recent discussions about a methodological crisis in the behavioral sciences, questions have been raised about the methodological standards of x-phi. Here, we focus on one aspect of this question, namely the rate of inconsistencies in statistical reporting. Previous research has examined the extent to which published articles in psychology and other behavioral sciences present statistical inconsistencies in reporting the results of NHST. In this study, we used the R package statcheck to detect statistical inconsistencies in x-phi, and compared rates of inconsistencies in psychology and philosophy. We found that rates of inconsistencies in x-phi are lower than in the psychological and behavioral sciences. From the point of view of statistical reporting consistency, x-phi seems to do no worse, and perhaps even better, than psychological science. PMID:29649220
[Delirium in stroke patients : Critical analysis of statistical procedures for the identification of risk factors].

Science.gov (United States)

Nydahl, P; Margraf, N G; Ewers, A

2017-04-01

Delirium is a relevant complication following an acute stroke. It is a multifactor occurrence with numerous interacting risk factors that alternately influence each other. The risk factors of delirium in stroke patients are often based on limited clinical studies. The statistical procedures and clinical relevance of delirium related risk factors in adult stroke patients should therefore be questioned. This secondary analysis includes clinically relevant studies that give evidence for the clinical relevance and statistical significance of delirium-associated risk factors in stroke patients. The quality of the reporting of regression analyses was assessed using Ottenbacher's quality criteria. The delirium-associated risk factors identified were examined with regard to statistical significance using the Bonferroni method of multiple testing for forming incorrect positive hypotheses. This was followed by a literature-based discussion on clinical relevance. Nine clinical studies were included. None of the studies fulfilled all the prerequisites and assumptions given for the reporting of regression analyses according to Ottenbacher. Of the 108 delirium-associated risk factors, a total of 48 (44.4%) were significant, whereby a total of 28 (58.3%) were false positive after Bonferroni correction. Following a literature-based discussion on clinical relevance, the assumption of statistical significance and clinical relevance could be found for only four risk factors (dementia or cognitive impairment, total anterior infarct, severe infarct and infections). The statistical procedures used in the existing literature are questionable, as are their results. A post-hoc analysis and critical appraisal reduced the number of possible delirium-associated risk factors to just a few clinically relevant factors.
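The Bonferroni step described in this abstract amounts to comparing each p-value with the significance level divided by the number of tests. A minimal sketch follows; the function name and the p-values are hypothetical, and only the count of 108 tested risk factors is taken from the abstract.

```python
def bonferroni_significant(p_values, alpha=0.05, m=None):
    """Flag p-values that remain significant after Bonferroni correction.

    m is the total number of hypotheses tested; it defaults to the
    number of p-values supplied.
    """
    if m is None:
        m = len(p_values)
    threshold = alpha / m
    return [p < threshold for p in p_values]

# Hypothetical p-values judged against 108 tested risk factors.
print(bonferroni_significant([0.0001, 0.01, 0.04], m=108))
```

With 108 tests the per-test threshold falls to roughly 0.00046, so results that were nominally significant at 0.05 can easily lose significance, as the abstract reports for 28 of the 48 factors.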
Determining coding CpG islands by identifying regions significant for pattern statistics on Markov chains.

Science.gov (United States)

Singer, Meromit; Engström, Alexander; Schönhuth, Alexander; Pachter, Lior

2011-09-23

Recent experimental and computational work confirms that CpGs can be unmethylated inside coding exons, thereby showing that codons may be subjected to both genomic and epigenomic constraint. It is therefore of interest to identify coding CpG islands (CCGIs) that are regions inside exons enriched for CpGs. The difficulty in identifying such islands is that coding exons exhibit sequence biases determined by codon usage and constraints that must be taken into account. We present a method for finding CCGIs that showcases a novel approach we have developed for identifying regions of interest that are significant (with respect to a Markov chain) for the counts of any pattern. Our method begins with the exact computation of tail probabilities for the number of CpGs in all regions contained in coding exons, and then applies a greedy algorithm for selecting islands from among the regions. We show that the greedy algorithm provably optimizes a biologically motivated criterion for selecting islands while controlling the false discovery rate. We applied this approach to the human genome (hg18) and annotated CpG islands in coding exons. The statistical criterion we apply to evaluating islands reduces the number of false positives in existing annotations, while our approach to defining islands reveals significant numbers of undiscovered CCGIs in coding exons. Many of these appear to be examples of functional epigenetic specialization in coding exons.
New scanning technique using Adaptive Statistical Iterative Reconstruction (ASIR) significantly reduced the radiation dose of cardiac CT.

Science.gov (United States)

Tumur, Odgerel; Soon, Kean; Brown, Fraser; Mykytowycz, Marcus

2013-06-01

The aims of our study were to evaluate the effect of application of Adaptive Statistical Iterative Reconstruction (ASIR) algorithm on the radiation dose of coronary computed tomography angiography (CCTA) and its effects on image quality of CCTA and to evaluate the effects of various patient and CT scanning factors on the radiation dose of CCTA. This was a retrospective study that included 347 consecutive patients who underwent CCTA at a tertiary university teaching hospital between 1 July 2009 and 20 September 2011. Analysis was performed comparing patient demographics, scan characteristics, radiation dose and image quality in two groups of patients in whom conventional Filtered Back Projection (FBP) or ASIR was used for image reconstruction. There were 238 patients in the FBP group and 109 patients in the ASIR group. There was no difference between the groups in the use of prospective gating, scan length or tube voltage. In ASIR group, significantly lower tube current was used compared with FBP group, 550 mA (450-600) vs. 650 mA (500-711.25) (median (interquartile range)), respectively, P<0.001. There was 27% effective radiation dose reduction in the ASIR group compared with FBP group, 4.29 mSv (2.84-6.02) vs. 5.84 mSv (3.88-8.39) (median (interquartile range)), respectively, P<0.001. Although ASIR was associated with increased image noise compared with FBP (39.93 ± 10.22 vs. 37.63 ± 18.79 (mean ± standard deviation), respectively, P<001), it did not affect the signal intensity, signal-to-noise ratio, contrast-to-noise ratio or the diagnostic quality of CCTA. Application of ASIR reduces the radiation dose of CCTA without affecting the image quality. © 2013 The Authors. Journal of Medical Imaging and Radiation Oncology © 2013 The Royal Australian and New Zealand College of Radiologists.
Using the longest significance run to estimate region-specific p-values in genetic association mapping studies

Directory of Open Access Journals (Sweden)

Yang Hsin-Chou

2008-05-01

Full Text Available Abstract Background Association testing is a powerful tool for identifying disease susceptibility genes underlying complex diseases. Technological advances have yielded a dramatic increase in the density of available genetic markers, necessitating an increase in the number of association tests required for the analysis of disease susceptibility genes. As such, multiple-tests corrections have become a critical issue. However, the conventional statistical corrections on locus-specific multiple tests usually result in lower power as the number of markers increases. Alternatively, we propose here the application of the longest significant run (LSR) method to estimate a region-specific p-value to provide an index for the most likely candidate region. Results An advantage of the LSR method relative to procedures based on genotypic data is that only p-value data are needed and hence it can be applied extensively to different study designs. In this study the proposed LSR method was compared with commonly used methods such as Bonferroni's method and the FDR controlling method. We found that while all methods provide good control over the false positive rate, LSR has much better power and false discovery rate. In the authentic analysis on psoriasis and asthma disease data, the LSR method successfully identified important candidate regions and replicated the results of previous association studies. Conclusion The proposed LSR method provides an efficient exploratory tool for the analysis of sequences of dense genetic markers. Our results show that the LSR method has better power and a lower false discovery rate compared with the locus-specific multiple tests.
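The run statistic that gives the method its name can be sketched as follows. This is only an illustration of finding the longest stretch of consecutive markers with p-values below a cutoff; the function name, the 0.05 cutoff and the input values are assumptions, and the abstract does not describe how the authors turn the run length into a region-specific p-value.

```python
def longest_significant_run(p_values, alpha=0.05):
    """Return (length, start_index) of the longest run of consecutive
    markers whose p-values are below alpha. start_index is None if no
    marker is significant."""
    best_len, best_start = 0, None
    run_len, run_start = 0, None
    for i, p in enumerate(p_values):
        if p < alpha:
            if run_len == 0:
                run_start = i  # a new run begins at this marker
            run_len += 1
            if run_len > best_len:
                best_len, best_start = run_len, run_start
        else:
            run_len = 0  # a non-significant marker ends the current run
    return best_len, best_start


# Hypothetical locus-specific p-values ordered along a chromosome region.
marker_p = [0.2, 0.01, 0.03, 0.5, 0.001, 0.02, 0.04, 0.6]
run_length, run_start = longest_significant_run(marker_p)
```

Because the function needs only an ordered list of p-values, it reflects the advantage noted in the abstract that no genotype data are required.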
Properties of permutation-based gene tests and controlling type 1 error using a summary statistic based gene test.

Science.gov (United States)

Swanson, David M; Blacker, Deborah; Alchawa, Taofik; Ludwig, Kerstin U; Mangold, Elisabeth; Lange, Christoph

2013-11-07

The advent of genome-wide association studies has led to many novel disease-SNP associations, opening the door to focused study on their biological underpinnings. Because of the importance of analyzing these associations, numerous statistical methods have been devoted to them. However, fewer methods have attempted to associate entire genes or genomic regions with outcomes, which is potentially more useful knowledge from a biological perspective and those methods currently implemented are often permutation-based. One property of some permutation-based tests is that their power varies as a function of whether significant markers are in regions of linkage disequilibrium (LD) or not, which we show from a theoretical perspective. We therefore develop two methods for quantifying the degree of association between a genomic region and outcome, both of whose power does not vary as a function of LD structure. One method uses dimension reduction to "filter" redundant information when significant LD exists in the region, while the other, called the summary-statistic test, controls for LD by scaling marker Z-statistics using knowledge of the correlation matrix of markers. An advantage of this latter test is that it does not require the original data, but only their Z-statistics from univariate regressions and an estimate of the correlation structure of markers, and we show how to modify the test to protect the type 1 error rate when the correlation structure of markers is misspecified. We apply these methods to sequence data of oral cleft and compare our results to previously proposed gene tests, in particular permutation-based ones. We evaluate the versatility of the modification of the summary-statistic test since the specification of correlation structure between markers can be inaccurate. We find a significant association in the sequence data between the 8q24 region and oral cleft using our dimension reduction approach and a borderline significant association using the
Conversion factors and oil statistics

International Nuclear Information System (INIS)

Karbuz, Sohbet

2004-01-01

World oil statistics, in scope and accuracy, are often far from perfect. They can easily lead to misguided conclusions regarding the state of market fundamentals. Without proper attention directed at statistic caveats, the ensuing interpretation of oil market data opens the door to unnecessary volatility, and can distort perception of market fundamentals. Among the numerous caveats associated with the compilation of oil statistics, conversion factors, used to produce aggregated data, play a significant role. Interestingly enough, little attention is paid to conversion factors, i.e. to the relation between different units of measurement for oil. Additionally, the underlying information regarding the choice of a specific factor when trying to produce measurements of aggregated data remains scant. The aim of this paper is to shed some light on the impact of conversion factors for two commonly encountered issues, mass to volume equivalencies (barrels to tonnes) and for broad energy measures encountered in world oil statistics. This paper will seek to demonstrate how inappropriate and misused conversion factors can yield wildly varying results and ultimately distort oil statistics. Examples will show that while discrepancies in commonly used conversion factors may seem trivial, their impact on the assessment of a world oil balance is far from negligible. A unified and harmonised convention for conversion factors is necessary to achieve accurate comparisons and aggregate oil statistics for the benefit of both end-users and policy makers
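A small sketch of the mass-to-volume issue raised in this abstract: the number of barrels per tonne depends on the density of the crude, so a single fixed factor misstates aggregates. The two densities below are hypothetical round values chosen for illustration, not figures from the paper; only the barrel volume (42 US gallons) is a fixed definition.

```python
LITRES_PER_BARREL = 158.987294928  # 42 US gallons, exact by definition


def barrels_per_tonne(density_kg_per_m3):
    """Conversion factor from tonnes to barrels for a given density."""
    tonnes_per_barrel = (LITRES_PER_BARREL / 1000.0) * density_kg_per_m3 / 1000.0
    return 1.0 / tonnes_per_barrel


def barrels_to_tonnes(barrels, density_kg_per_m3):
    """Mass in tonnes of a volume given in barrels."""
    return barrels / barrels_per_tonne(density_kg_per_m3)


# Hypothetical densities for a lighter and a heavier crude (assumptions).
light = barrels_per_tonne(820.0)
heavy = barrels_per_tonne(920.0)

# Mass difference for the same one million barrels under the two densities.
gap_tonnes = barrels_to_tonnes(1_000_000, 920.0) - barrels_to_tonnes(1_000_000, 820.0)
```

Under these assumed densities the factor ranges from roughly 6.8 to 7.7 barrels per tonne, and one million barrels differ by almost 16,000 tonnes depending on which factor is applied.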
Confidence intervals for effect sizes: compliance and clinical significance in the Journal of Consulting and Clinical Psychology.

Science.gov (United States)

Odgaard, Eric C; Fowler, Robert L

2010-06-01

In 2005, the Journal of Consulting and Clinical Psychology (JCCP) became the first American Psychological Association (APA) journal to require statistical measures of clinical significance, plus effect sizes (ESs) and associated confidence intervals (CIs), for primary outcomes (La Greca, 2005). As this represents the single largest editorial effort to improve statistical reporting practices in any APA journal in at least a decade, in this article we investigate the efficacy of that change. All intervention studies published in JCCP in 2003, 2004, 2007, and 2008 were reviewed. Each article was coded for method of clinical significance, type of ES, and type of associated CI, broken down by statistical test (F, t, chi-square, r/R(2), and multivariate modeling). By 2008, clinical significance compliance was 75% (up from 31%), with 94% of studies reporting some measure of ES (reporting improved for individual statistical tests ranging from eta(2) = .05 to .17, with reasonable CIs). Reporting of CIs for ESs also improved, although only to 40%. Also, the vast majority of reported CIs used approximations, which become progressively less accurate for smaller sample sizes and larger ESs (cf. Algina & Kessleman, 2003). Changes are near asymptote for ESs and clinical significance, but CIs lag behind. As CIs for ESs are required for primary outcomes, we show how to compute CIs for the vast majority of ESs reported in JCCP, with an example of how to use CIs for ESs as a method to assess clinical significance.
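As an illustration of the approximate CIs the abstract refers to, the following sketch computes a large-sample normal-approximation interval for Cohen's d. This is a commonly used textbook approximation and an assumption on our part; it is not the procedure the authors present, and, as the abstract cautions, approximations of this kind lose accuracy for small samples and large effect sizes.

```python
from math import sqrt
from statistics import NormalDist


def cohens_d_ci(d, n1, n2, level=0.95):
    """Approximate confidence interval for Cohen's d from two group sizes.

    Uses the large-sample standard error
    sqrt((n1 + n2) / (n1 * n2) + d**2 / (2 * (n1 + n2))).
    """
    se = sqrt((n1 + n2) / (n1 * n2) + d ** 2 / (2 * (n1 + n2)))
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return d - z * se, d + z * se


# Hypothetical example: d = 0.5 with 30 participants per group.
low, high = cohens_d_ci(0.5, 30, 30)
```

For this hypothetical input the interval runs from just below zero to about 1.01, which shows how a moderate point estimate can still be compatible with a negligible effect.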
Childhood-compared to adolescent-onset bipolar disorder has more statistically significant clinical correlates.

Science.gov (United States)

Holtzman, Jessica N; Miller, Shefali; Hooshmand, Farnaz; Wang, Po W; Chang, Kiki D; Hill, Shelley J; Rasgon, Natalie L; Ketter, Terence A

2015-07-01

The strengths and limitations of considering childhood- and adolescent-onset bipolar disorder (BD) separately versus together remain to be established. We assessed this issue. BD patients referred to the Stanford Bipolar Disorder Clinic during 2000-2011 were assessed with the Systematic Treatment Enhancement Program for BD Affective Disorders Evaluation. Patients with childhood- and adolescent-onset were compared to those with adult-onset for 7 unfavorable bipolar illness characteristics with replicated associations with early-onset patients. Among 502 BD outpatients, those with childhood- (<13 years, N=110) and adolescent- (13-18 years, N=218) onset had significantly higher rates for 4/7 unfavorable illness characteristics, including lifetime comorbid anxiety disorder, at least ten lifetime mood episodes, lifetime alcohol use disorder, and prior suicide attempt, than those with adult-onset (>18 years, N=174). Childhood- but not adolescent-onset BD patients also had significantly higher rates of first-degree relative with mood disorder, lifetime substance use disorder, and rapid cycling in the prior year. Patients with pooled childhood/adolescent- compared to adult-onset had significantly higher rates for 5/7 of these unfavorable illness characteristics, while patients with childhood- compared to adolescent-onset had significantly higher rates for 4/7 of these unfavorable illness characteristics. Caucasian, insured, suburban, low substance abuse, American specialty clinic-referred sample limits generalizability. Onset age is based on retrospective recall. Childhood- compared to adolescent-onset BD was more robustly related to unfavorable bipolar illness characteristics, so pooling these groups attenuated such relationships. Further study is warranted to determine the extent to which adolescent-onset BD represents an intermediate phenotype between childhood- and adult-onset BD. Copyright © 2015 Elsevier B.V. All rights reserved.
On the Use of Running Trends as Summary Statistics for Univariate Time Series and Time Series Association

OpenAIRE

Trottini, Mario; Vigo, Isabel; Belda, Santiago

2015-01-01

Given a time series, running trends analysis (RTA) involves evaluating least squares trends over overlapping time windows of L consecutive time points, with overlap by all but one observation. This produces a new series called the “running trends series,” which is used as summary statistics of the original series for further analysis. In recent years, RTA has been widely used in climate applied research as summary statistics for time series and time series association. There is no doubt that ...
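The construction described above can be sketched in a few lines. This is a minimal illustration assuming equally spaced observations, so that time can be indexed 0, 1, ..., L-1 within each window; the function names are our own.

```python
def ols_slope(window):
    """Least squares slope of the values in `window` against 0..len-1."""
    n = len(window)
    x_mean = (n - 1) / 2
    y_mean = sum(window) / n
    sxy = sum((i - x_mean) * (y - y_mean) for i, y in enumerate(window))
    sxx = sum((i - x_mean) ** 2 for i in range(n))
    return sxy / sxx


def running_trends(series, L):
    """Slopes over all windows of L consecutive points.

    Consecutive windows overlap by all but one observation, so a series
    of length n yields n - L + 1 trend values.
    """
    if L < 2 or L > len(series):
        raise ValueError("L must satisfy 2 <= L <= len(series)")
    return [ols_slope(series[s:s + L]) for s in range(len(series) - L + 1)]


# Hypothetical series: a straight line, then a quadratic.
linear_trends = running_trends([0, 1, 2, 3, 4, 5], 3)
quadratic_trends = running_trends([1, 4, 9, 16], 3)
```

A straight line gives a constant running trends series, while the quadratic gives increasing slopes, which is the sense in which the running trends summarize local behavior of the original series.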
A test statistic in the complex Wishart distribution and its application to change detection in polarimetric SAR data

DEFF Research Database (Denmark)

Conradsen, Knut; Nielsen, Allan Aasbjerg; Schou, Jesper

2003-01-01

. Based on this distribution, a test statistic for equality of two such matrices and an associated asymptotic probability for obtaining a smaller value of the test statistic are derived and applied successfully to change detection in polarimetric SAR data. In a case study, EMISAR L-band data from April 17...... to HH, VV, or HV data alone, the derived test statistic reduces to the well-known gamma likelihood-ratio test statistic. The derived test statistic and the associated significance value can be applied as a line or edge detector in fully polarimetric SAR data also....
An Exploration of the Perceived Usefulness of the Introductory Statistics Course and Students’ Intentions to Further Engage in Statistics

Directory of Open Access Journals (Sweden)

Rossi Hassad

2018-01-01

Full Text Available Students' attitude, including perceived usefulness, is generally associated with academic success. The related research in statistics education has focused almost exclusively on the role of attitude in explaining and predicting academic learning outcomes, hence there is a paucity of research evidence on how attitude (particularly perceived usefulness) impacts students' intentions to use and stay engaged in statistics beyond the introductory course. This study explored the relationship between college students' perception of the usefulness of an introductory statistics course, their beliefs about where statistics will be most useful, and their intentions to take another statistics course. A cross-sectional study of 106 students was conducted. The mean rating for usefulness was 4.7 (out of 7), with no statistically significant differences based on gender and age. Sixty-four percent reported that they would consider taking another statistics course, and this subgroup rated the course as more useful (p = .01). The majority (67%) reported that statistics would be most useful for either graduate school or research, whereas 14% indicated their job, and 19% were undecided. The "undecided" students had the lowest mean rating for usefulness of the course (p = .001). Addressing data, in the context of real-world problem-solving and decision-making, could facilitate students to better appreciate the usefulness and practicality of statistics. Qualitative research methods could help to elucidate these findings.
Nonparametric statistics for social and behavioral sciences

CERN Document Server

Kraska-Miller, M

2013-01-01

Introduction to Research in Social and Behavioral Sciences; Basic Principles of Research; Planning for Research; Types of Research Designs; Sampling Procedures; Validity and Reliability of Measurement Instruments; Steps of the Research Process; Introduction to Nonparametric Statistics; Data Analysis; Overview of Nonparametric Statistics and Parametric Statistics; Overview of Parametric Statistics; Overview of Nonparametric Statistics; Importance of Nonparametric Methods; Measurement Instruments; Analysis of Data to Determine Association and Agreement; Pearson Chi-Square Test of Association and Independence; Contingency
Ventilator-associated pneumonia: clinical significance and implications for nursing.

Science.gov (United States)

Grap, M J; Munro, C L

1997-01-01

Pneumonia is the second most common nosocomial infection in the United States and the leading cause of death from nosocomial infections. Intubation and mechanical ventilation greatly increase the risk of bacterial pneumonia. Ventilator-associated pneumonia (VAP) occurs in a patient treated with mechanical ventilation, and it is neither present nor developing at the time of intubation; it is a serious problem--with significant morbidity and mortality rates. Aspiration of bacteria from the oropharynx, leakage of contaminated secretions around the endotracheal tube, patient position, and cross-contamination from respiratory equipment and health care providers are important factors in the development of VAP. Nurses caring for patients treated with mechanical ventilation must recognize risk factors and include strategies for reducing these factors as part of their nursing care. This article summarizes the literature related to VAP: its incidence, associated factors, diagnosis, and current therapies, with an emphasis on nursing implications in the care of these patients.
Test the Overall Significance of p-values by Using Joint Tail Probability of Ordered p-values as Test Statistic

OpenAIRE

Fang, Yongxiang; Wit, Ernst

2008-01-01

Fisher’s combined probability test is the most commonly used method to test the overall significance of a set of independent p-values. However, it is very obvious that Fisher’s statistic is more sensitive to smaller p-values than to larger p-values, and a small p-value may overrule the other p-values and decide the test result. This is, in some cases, viewed as a flaw. In order to overcome this flaw and improve the power of the test, the joint tail probability of a set of p-values is proposed as a ...
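For reference, Fisher's combined probability test mentioned at the start of this abstract can be sketched as below. The statistic is -2 times the sum of the log p-values, which under the null hypothesis follows a chi-square distribution with 2k degrees of freedom for k independent p-values; the closed-form tail used here holds because the degrees of freedom are even. The example p-values are hypothetical, and the sketch does not implement the joint tail probability statistic the authors propose.

```python
from math import exp, log


def fisher_combined(p_values):
    """Return (statistic, combined p-value) for Fisher's method.

    The chi-square survival function with 2k degrees of freedom is
    exp(-x/2) * sum_{i=0}^{k-1} (x/2)**i / i!.
    """
    if not p_values or any(p <= 0.0 or p > 1.0 for p in p_values):
        raise ValueError("p-values must lie in (0, 1]")
    k = len(p_values)
    stat = -2.0 * sum(log(p) for p in p_values)
    half = stat / 2.0
    term, total = 1.0, 1.0
    for i in range(1, k):
        term *= half / i  # builds (x/2)**i / i! incrementally
        total += term
    return stat, exp(-half) * total


# With a single p-value the method returns that p-value unchanged.
_, single = fisher_combined([0.05])
_, pair = fisher_combined([0.5, 0.5])
# One very small p-value among otherwise large ones (hypothetical values).
_, dominated = fisher_combined([0.001, 0.8, 0.9, 0.7])
```

In the last call the combined p-value is driven almost entirely by the single small p-value, which is the sensitivity the abstract describes.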
New scanning technique using Adaptive Statistical Iterative Reconstruction (ASIR) significantly reduced the radiation dose of cardiac CT

International Nuclear Information System (INIS)

Tumur, Odgerel; Soon, Kean; Brown, Fraser; Mykytowycz, Marcus

2013-01-01

The aims of our study were to evaluate the effect of application of Adaptive Statistical Iterative Reconstruction (ASIR) algorithm on the radiation dose of coronary computed tomography angiography (CCTA) and its effects on image quality of CCTA and to evaluate the effects of various patient and CT scanning factors on the radiation dose of CCTA. This was a retrospective study that included 347 consecutive patients who underwent CCTA at a tertiary university teaching hospital between 1 July 2009 and 20 September 2011. Analysis was performed comparing patient demographics, scan characteristics, radiation dose and image quality in two groups of patients in whom conventional Filtered Back Projection (FBP) or ASIR was used for image reconstruction. There were 238 patients in the FBP group and 109 patients in the ASIR group. There was no difference between the groups in the use of prospective gating, scan length or tube voltage. In ASIR group, significantly lower tube current was used compared with FBP group, 550mA (450–600) vs. 650mA (500–711.25) (median (interquartile range)), respectively, P<0.001. There was 27% effective radiation dose reduction in the ASIR group compared with FBP group, 4.29mSv (2.84–6.02) vs. 5.84mSv (3.88–8.39) (median (interquartile range)), respectively, P<0.001. Although ASIR was associated with increased image noise compared with FBP (39.93±10.22 vs. 37.63±18.79 (mean ±standard deviation), respectively, P<001), it did not affect the signal intensity, signal-to-noise ratio, contrast-to-noise ratio or the diagnostic quality of CCTA. Application of ASIR reduces the radiation dose of CCTA without affecting the image quality.
Esomeprazole use is independently associated with significant reduction of BMD: 1-year prospective comparative safety study of four proton pump inhibitors.

Science.gov (United States)

Bahtiri, Elton; Islami, Hilmi; Hoxha, Rexhep; Qorraj-Bytyqi, Hasime; Rexhepi, Sylejman; Hoti, Kreshnik; Thaçi, Kujtim; Thaçi, Shpetim; Karakulak, Çağla

2016-09-01

Because of the efficacy of proton pump inhibitors (PPIs), their use is increasing dramatically. The risk of adverse effects of short-term PPI therapy is low, but there are important safety concerns for potential adverse effects of prolonged PPI therapy. Findings from studies assessing the association between PPI use and bone mineral density (BMD) and/or fracture risk are contradictory. The aim of this study was to prospectively assess potential association of PPI treatment with the 12-month change in BMD of the lumbar spine, femur neck, and total hip. The study was performed in 200 PPI users and 50 PPI nonusers. Lumbar spine (L1-L4), femur neck, and total hip BMD were measured by dual-energy X-ray absorptiometry at the baseline and at 12 months. A total of 209 subjects completed the entire 12 months of the study and were included in the final analysis. A Wilcoxon signed-rank test showed that at 12 months PPI use was associated with statistically significant reductions in femur neck and total hip T scores (Z = -2.764, p = 0.005 and Z = -3.281, p = 0.001, respectively). A multiple linear regression analysis showed that only esomeprazole added significantly to the prediction of total lumbar spine and femur neck T scores (p = 0.048 and p = 0.037, respectively). Compared with the baseline, 12 months of PPI treatment resulted in lower femur neck and total hip BMD T scores. Among the four PPIs studied, esomeprazole was independently associated with significant reduction of BMD, whereas omeprazole had no effects on BMD. Considering the widespread use of PPIs, BMD screening should be considered in the case of prolonged PPI use.
Application of pedagogy reflective in statistical methods course and practicum statistical methods

Science.gov (United States)

Julie, Hongki

2017-08-01

The courses Elementary Statistics, Statistical Methods and Statistical Methods Practicum aim to equip Mathematics Education students with descriptive and inferential statistics. Understanding descriptive and inferential statistics is important for students in the Mathematics Education Department, especially for those whose final project involves quantitative research. In quantitative research, students are required to present and describe quantitative data appropriately, to draw conclusions from their data, and to establish relationships between the independent and dependent variables defined in their research. In practice, when students carried out final projects involving quantitative research, it was not rare to find them making mistakes in drawing conclusions and in choosing the hypothesis testing procedure. As a result, they reached incorrect conclusions, which is a serious mistake in quantitative research. Several results were obtained from implementing reflective pedagogy in the teaching and learning process of the Statistical Methods and Statistical Methods Practicum courses, namely: 1. Twenty-two students passed the course and one student did not. 2. The grade achieved by the most students was A, obtained by 18 students. 3. According to all students, they could develop their critical stance and build care for one another through the learning process in this course. 4. All students agreed that, through the learning process they underwent in the course, they could build care for one another.
Lack of significant associations with early career performance suggest no link between the DMRT3 "Gait Keeper" mutation and precocity in Coldblooded trotters.

Directory of Open Access Journals (Sweden)

Kim Jäderkvist Fegraeus

Full Text Available The Swedish-Norwegian Coldblooded trotter (CBT) is a local breed in Sweden and Norway mainly used for harness racing. Previous studies have shown that a mutation from cytosine (C) to adenine (A) in the doublesex and mab-3 related transcription factor 3 (DMRT3) gene has a major impact on harness racing performance of different breeds. An association of the DMRT3 mutation with early career performance has also been suggested. The aim of the current study was to investigate this proposed association in a randomly selected group of CBTs. 769 CBTs (485 raced, 284 unraced) were genotyped for the DMRT3 mutation. The association with racing performance was investigated for 13 performance traits and three different age intervals: 3 years, 3 to 6 years, and 7 to 10 years of age, using the statistical software R. Each performance trait was analyzed for association with DMRT3 using linear models. The results suggest no association of the DMRT3 mutation with precocity (i.e. performance at 3 years of age). Only two traits (race time and number of disqualifications) were significantly different between the genotypes, with AA horses having the fastest times and CC horses having the highest number of disqualifications at 3 years of age. The frequency of the AA genotype was significantly lower in the raced CBT sample compared with the unraced sample and less than 50% of the AA horses participated in a race. For the age intervals 3 to 6 and 7 to 10 years the AA horses also failed to demonstrate significantly better performance than the other genotypes. Although suggested as the most favorable genotype for racing performance in Standardbreds and Finnhorses across all ages, the AA genotype does not appear to be associated with superior performance, early or late, in the racing career of CBTs.

Fusing Data Mining, Machine Learning and Traditional Statistics to Detect Biomarkers Associated with Depression

Science.gov (United States)

Dipnall, Joanna F.

2016-01-01

Background Atheoretical large-scale data mining techniques using machine learning algorithms have promise in the analysis of large epidemiological datasets. This study illustrates the use of a hybrid methodology for variable selection that took account of missing data and complex survey design to identify key biomarkers associated with depression from a large epidemiological study. Methods The study used a three-step methodology amalgamating multiple imputation, a machine learning boosted regression algorithm and logistic regression, to identify key biomarkers associated with depression in the National Health and Nutrition Examination Study (2009–2010). Depression was measured using the Patient Health Questionnaire-9 and 67 biomarkers were analysed. Covariates in this study included gender, age, race, smoking, food security, Poverty Income Ratio, Body Mass Index, physical activity, alcohol use, medical conditions and medications. The final imputed weighted multiple logistic regression model included possible confounders and moderators. Results After the creation of 20 imputation data sets from multiple chained regression sequences, machine learning boosted regression initially identified 21 biomarkers associated with depression. Using traditional logistic regression methods, including controlling for possible confounders and moderators, a final set of three biomarkers were selected. The final three biomarkers from the novel hybrid variable selection methodology were red cell distribution width (OR 1.15; 95% CI 1.01, 1.30), serum glucose (OR 1.01; 95% CI 1.00, 1.01) and total bilirubin (OR 0.12; 95% CI 0.05, 0.28). Significant interactions were found between total bilirubin with Mexican American/Hispanic group (p = 0.016), and current smokers (p<0.001). Conclusion The systematic use of a hybrid methodology for variable selection, fusing data mining techniques using a machine learning algorithm with traditional statistical modelling, accounted for missing data and
Fusing Data Mining, Machine Learning and Traditional Statistics to Detect Biomarkers Associated with Depression.

Directory of Open Access Journals (Sweden)

Joanna F Dipnall

Full Text Available Atheoretical large-scale data mining techniques using machine learning algorithms have promise in the analysis of large epidemiological datasets. This study illustrates the use of a hybrid methodology for variable selection that took account of missing data and complex survey design to identify key biomarkers associated with depression from a large epidemiological study. The study used a three-step methodology amalgamating multiple imputation, a machine learning boosted regression algorithm and logistic regression, to identify key biomarkers associated with depression in the National Health and Nutrition Examination Study (2009-2010). Depression was measured using the Patient Health Questionnaire-9 and 67 biomarkers were analysed. Covariates in this study included gender, age, race, smoking, food security, Poverty Income Ratio, Body Mass Index, physical activity, alcohol use, medical conditions and medications. The final imputed weighted multiple logistic regression model included possible confounders and moderators. After the creation of 20 imputation data sets from multiple chained regression sequences, machine learning boosted regression initially identified 21 biomarkers associated with depression. Using traditional logistic regression methods, including controlling for possible confounders and moderators, a final set of three biomarkers were selected. The final three biomarkers from the novel hybrid variable selection methodology were red cell distribution width (OR 1.15; 95% CI 1.01, 1.30), serum glucose (OR 1.01; 95% CI 1.00, 1.01) and total bilirubin (OR 0.12; 95% CI 0.05, 0.28). Significant interactions were found between total bilirubin with Mexican American/Hispanic group (p = 0.016), and current smokers (p<0.001). The systematic use of a hybrid methodology for variable selection, fusing data mining techniques using a machine learning algorithm with traditional statistical modelling, accounted for missing data and complex survey sampling
The expression and significance of P-glycoprotein, lung resistance protein and multidrug resistance-associated protein in gastric cancer

Directory of Open Access Journals (Sweden)

Li Yan

2009-11-01

Full Text Available Abstract Background To detect the expression of the multidrug resistance molecules P-glycoprotein (P-gp), lung resistance protein (LRP) and multidrug resistance-associated protein (MRP), and to analyze the relationship between them and clinicopathological features. Methods The expressions of P-gp, LRP and MRP in formalin-fixed paraffin-embedded tissue sections from 59 gastric cancer patients were determined by a labelled Streptavidin-Peroxidase (SP) immunohistochemical technique, and the results were analyzed in correlation with clinicopathological data. None of these patients received chemotherapy prior to surgery. Results The positive rates of P-gp, LRP and MRP were 86.4%, 84.7% and 27.1%, respectively. The difference between the positive rates of P-gp and MRP was statistically significant, as was the difference between the expression of MRP and LRP. No significant difference was observed between P-gp and LRP, but a positive correlation between the expression of P-gp and LRP was found. No significant correlation between the expression of P-gp, LRP or MRP and the grade of differentiation was observed. The expression of P-gp was positively correlated with clinical stage (r = 0.742), but the difference in the expression of P-gp between stages was not significant. Conclusion The expressions of P-gp, LRP and MRP in patients with gastric cancer without prior chemotherapy are high, indicating that innate drug resistance may exist in gastric cancer.
Paradigms and pragmatism: approaches to medical statistics.

Science.gov (United States)

Healy, M J

2000-01-01

Until recently, the dominant philosophy of science was that due to Karl Popper, with its doctrine that the proper task of science was the formulation of hypotheses followed by attempts at refuting them. In spite of the close analogy with significance testing, these ideas do not fit well with the practice of medical statistics. The same can be said of the later philosophy of Thomas Kuhn, who maintains that science proceeds by way of revolutionary upheavals separated by periods of relatively pedestrian research which are governed by what Kuhn refers to as paradigms. Though there have been paradigm shifts in the history of statistics, a degree of continuity can also be discerned. A current paradigm shift is embodied in the spread of Bayesian ideas. It may be that a future paradigm will emphasise the pragmatic approach to statistics that is associated with the name of Daniel Schwartz.
Sequence data and association statistics from 12,940 type 2 diabetes cases and controls

DEFF Research Database (Denmark)

Jason, Flannick; Fuchsberger, Christian; Mahajan, Anubha

2017-01-01

variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1-5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced...... individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics...... from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D....
Periodontitis is associated with significant hepatic fibrosis in patients with non-alcoholic fatty liver disease.

Science.gov (United States)

Alazawi, William; Bernabe, Eduardo; Tai, David; Janicki, Tomasz; Kemos, Polychronis; Samsuddin, Salma; Syn, Wing-Kin; Gillam, David; Turner, Wendy

2017-01-01

Non-alcoholic fatty liver disease (NAFLD) has a bidirectional association with metabolic syndrome. It affects up to 30% of the general population, 70% of individuals with diabetes and 90% with obesity. The main histological hallmark of progressive NAFLD is fibrosis. There is a bidirectional epidemiological link between periodontitis and metabolic syndrome. NAFLD, periodontitis and diabetes share common risk factors, are characterised by inflammation and associated with changes in commensal bacteria. Therefore we tested the hypothesis that periodontitis is associated with NAFLD and with significant fibrosis in two study groups. We analyzed data from a population-based survey and a patient-based study. NHANES III participants with abdominal ultrasound and sociodemographic, clinical, and oral examination data were extracted and appropriate weighting applied. In a separate patient-based study, consenting patients with biopsy-proved NAFLD (or with liver indices too mild to justify biopsy) underwent dental examination. Basic Periodontal Examination score was recorded. In NHANES, periodontitis was significantly associated with steatosis in 8172 adults even after adjusting for sociodemographic factors. However, associations were fully explained after accounting for features of metabolic syndrome. In the patient-based study, periodontitis was significantly more common in patients with biopsy-proven NASH and any fibrosis (F0-F4) than without NASH (p = 0.009). Periodontitis was more common in patients with NASH and significant fibrosis (F2-4) than mild or no fibrosis (F0-1, p = 0.04). Complementary evidence from an epidemiological survey and a clinical study show that NAFLD is associated with periodontitis and that the association is stronger with significant liver fibrosis.
After statistics reform : Should we still teach significance testing?

NARCIS (Netherlands)

A. Hak (Tony)

2014-01-01

In the longer term null hypothesis significance testing (NHST) will disappear because p-values are not informative and not replicable. Should we continue to teach in the future the procedures of then abolished routines (i.e., NHST)? Three arguments are discussed for not teaching NHST in
Worry, Intolerance of Uncertainty, and Statistics Anxiety

Science.gov (United States)

Williams, Amanda S.

2013-01-01

Statistics anxiety is a problem for most graduate students. This study investigates the relationship between intolerance of uncertainty, worry, and statistics anxiety. Intolerance of uncertainty was significantly related to worry, and worry was significantly related to three types of statistics anxiety. Six types of statistics anxiety were…
Reducing statistics anxiety and enhancing statistics learning achievement: effectiveness of a one-minute strategy.

Science.gov (United States)

Chiou, Chei-Chang; Wang, Yu-Min; Lee, Li-Tze

2014-08-01

Statistical knowledge is widely used in academia; however, statistics teachers struggle with the issue of how to reduce students' statistics anxiety and enhance students' statistics learning. This study assesses the effectiveness of a "one-minute paper strategy" in reducing students' statistics-related anxiety and in improving students' statistics-related achievement. Participants were 77 undergraduates from two classes enrolled in applied statistics courses. An experiment was implemented according to a pretest/posttest comparison group design. The quasi-experimental design showed that the one-minute paper strategy significantly reduced students' statistics anxiety and improved students' statistics learning achievement. The strategy was a better instructional tool than the textbook exercise for reducing students' statistics anxiety and improving students' statistics achievement.
Evaluation of significantly modified water bodies in Vojvodina by using multivariate statistical techniques

Directory of Open Access Journals (Sweden)

Vujović Svetlana R.

2013-01-01

Full Text Available This paper illustrates the utility of multivariate statistical techniques for the analysis and interpretation of water quality data sets and the identification of pollution sources/factors, with a view to obtaining better information about water quality and designing a monitoring network for effective management of water resources. Multivariate statistical techniques, such as factor analysis (FA)/principal component analysis (PCA) and cluster analysis (CA), were applied for the evaluation of variations and for the interpretation of a water quality data set of natural water bodies obtained during the 2010 monitoring year for 13 parameters at 33 different sites. FA/PCA attempts to explain the correlations between the observations in terms of the underlying factors, which are not directly observable. Factor analysis is applied to physico-chemical parameters of natural water bodies with the aim of classification and data summation as well as segmentation of heterogeneous data sets into smaller homogeneous subsets. Factor loadings were categorized as strong and moderate, corresponding to absolute loading values of >0.75 and 0.75-0.50, respectively. Four principal factors were obtained with eigenvalues >1, summing to more than 78% of the total variance in the water data sets, which is adequate to give good prior information regarding data structure. Each factor that is significantly related to specific variables represents a different dimension of water quality. The first factor F1 accounts for 28% of the total variance and represents the hydrochemical dimension of water quality. The second factor F2 accounts for 18% of the total variance and may be taken as a factor of water eutrophication. The third factor F3 accounts for 17% of the total variance and represents the influence of point sources of pollution on water quality. The fourth factor F4 accounts for 13% of the total variance and may be taken as an ecological dimension of water quality. Cluster analysis (CA) is an
Testing earthquake prediction algorithms: Statistically significant advance prediction of the largest earthquakes in the Circum-Pacific, 1992-1997

Science.gov (United States)

Kossobokov, V.G.; Romashkova, L.L.; Keilis-Borok, V. I.; Healy, J.H.

1999-01-01

Algorithms M8 and MSc (i.e., the Mendocino Scenario) were used in a real-time intermediate-term research prediction of the strongest earthquakes in the Circum-Pacific seismic belt. Predictions are made by M8 first. Then, the areas of alarm are reduced by MSc at the cost that some earthquakes are missed in the second approximation of prediction. In 1992-1997, five earthquakes of magnitude 8 and above occurred in the test area: all of them were predicted by M8, and MSc correctly identified the locations of four of them. The space-time volume of the alarms is 36% and 18%, respectively, when estimated with a normalized product measure of empirical distribution of epicenters and uniform time. The statistical significance of the achieved results is beyond 99% both for M8 and MSc. For magnitude 7.5+, 10 out of 19 earthquakes were predicted by M8 in 40% and five were predicted by M8-MSc in 13% of the total volume considered. This implies a significance level of 81% for M8 and 92% for M8-MSc. The lower significance levels might result from a global change in seismic regime in 1993-1996, when the rate of the largest events doubled and all of them became exclusively normal or reversed faults. The predictions are fully reproducible; the algorithms M8 and MSc in complete formal definitions were published before we started our experiment [Keilis-Borok, V.I., Kossobokov, V.G., 1990. Premonitory activation of seismic flow: Algorithm M8, Phys. Earth and Planet. Inter. 61, 73-83; Kossobokov, V.G., Keilis-Borok, V.I., Smith, S.W., 1990. Localization of intermediate-term earthquake prediction, J. Geophys. Res., 95, 19763-19772; Healy, J.H., Kossobokov, V.G., Dewey, J.W., 1992. A test to evaluate the earthquake prediction algorithm, M8. U.S. Geol. Surv. OFR 92-401]. M8 is available from the IASPEI Software Library [Healy, J.H., Keilis-Borok, V.I., Lee, W.H.K. (Eds.), 1997. Algorithms for Earthquake Statistics and Prediction, Vol. 6. IASPEI Software Library]. © 1999 Elsevier
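The significance figures quoted in this abstract are consistent with a simple binomial null model, sketched here in Python. The model is an assumption that the abstract does not spell out: each target earthquake is taken to fall inside an alarm independently, with probability equal to the alarm's share of the space-time volume. The function name `significance` is hypothetical.

```python
from math import comb


def significance(n_events: int, n_hits: int, alarm_fraction: float) -> float:
    """Return 1 - P(at least n_hits of n_events fall inside alarms by chance).

    Assumed null model (not stated in the abstract): each event lands in an
    alarm independently with probability alarm_fraction.
    """
    p_chance = sum(
        comb(n_events, k)
        * alarm_fraction**k
        * (1 - alarm_fraction) ** (n_events - k)
        for k in range(n_hits, n_events + 1)
    )
    return 1.0 - p_chance


# Figures taken from the abstract: (events, events predicted, alarm volume)
cases = {
    "M8, magnitude 8+": (5, 5, 0.36),
    "MSc, magnitude 8+": (5, 4, 0.18),
    "M8, magnitude 7.5+": (19, 10, 0.40),
    "M8-MSc, magnitude 7.5+": (19, 5, 0.13),
}

for label, args in cases.items():
    print(f"{label}: {significance(*args):.1%}")
```

Under this assumption both magnitude 8 cases come out above 99%, and the magnitude 7.5+ cases land close to the reported 81% and 92%. Small differences are to be expected, since the alarm volumes are quoted as rounded percentages.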
Significant association between parathyroid hormone and uric acid level in men

Directory of Open Access Journals (Sweden)

Chin KY

2015-08-01

Full Text Available Kok-Yong Chin,1 Soelaiman Ima Nirwana,1 Wan Zurinah Wan Ngah2 1Department of Pharmacology, 2Department of Biochemistry, Faculty of Medicine, Universiti Kebangsaan Malaysia Medical Centre, Kuala Lumpur, Malaysia Background: Previous reports of patients undergoing parathyroidectomy and of patients receiving teriparatide as antiosteoporotic treatment have suggested a plausible relationship between parathyroid hormone (PTH) and uric acid. However, similar data at population level were lacking. The current study aimed to determine the relationship between PTH and uric acid in a group of apparently healthy Malaysian men. Methods: A cross-sectional study was conducted among 380 Malay and Chinese men aged 20 years and above, residing in the Klang Valley, Malaysia. Their body anthropometry was measured, and their fasting blood samples were collected for biochemical analysis. The relationship between PTH and uric acid was analyzed using regression analysis. Results: Increased serum PTH level was significantly associated with increased serum uric acid level (β=0.165; P=0.001). Increased PTH level was also significantly associated with the condition of hyperuricemia in the study population (odds ratio [OR], 1.045; 95% confidence interval [CI], 1.017–1.075; P=0.002). All analyses were adjusted for age, body mass index, vitamin D, total calcium, inorganic phosphate, blood urea nitrogen and creatinine levels. Conclusion: There is a significant positive relationship between PTH level and uric acid level in Malaysian men. This relationship and its clinical significance should be further investigated in a larger longitudinal study. Keywords: hyperuricemia, Asian, cross-sectional study, uric acid, urate
Significance of genetic variants in DLC1 and their association with hepatocellular carcinoma

Science.gov (United States)

XIE, CHENG-RONG; SUN, HONG-GUANG; SUN, YU; ZHAO, WEN-XIU; ZHANG, SHENG; WANG, XIAO-MIN; YIN, ZHEN-YU

2015-01-01

DLC1 has been shown to be downregulated or absent in hepatocellular carcinoma (HCC) and is associated with tumorigenesis and development. However, only a small number of studies have focused on genetic variations of DLC1. The present study performed exon sequencing for the DLC1 gene in HCC tissue samples from 105 patients to identify functional genetic variation of DLC1 and its association with HCC susceptibility, clinicopathological features and prognosis. A novel missense mutation and four non-synonymous single nucleotide polymorphisms (SNPs; rs3816748, rs11203495, rs3816747 and rs532841) were identified. A significant correlation of rs3816747 polymorphisms with HCC susceptibility was identified. Compared to individuals with the GG genotype of rs3816747, those with the GA (odds ratio (OR)=0.486; P=0.037) or GA+AA genotype (OR=0.51; P=0.039) were associated with a significantly decreased HCC risk. Furthermore, patients with the GC+CC genotype of rs3816748, the TC+CC genotype of rs11203495 or the GA+AA genotype of rs3816747 had small-sized tumors compared with those carrying the wild-type genotype. No significant association of DLC1 SNPs with the patients' prognosis was found. These results indicated that genetic variations in the DLC1 gene may confer a risk for HCC. PMID:26095787
Periodontitis is associated with significant hepatic fibrosis in patients with non-alcoholic fatty liver disease.

Directory of Open Access Journals (Sweden)

William Alazawi

Full Text Available Non-alcoholic fatty liver disease (NAFLD) has a bidirectional association with metabolic syndrome. It affects up to 30% of the general population, 70% of individuals with diabetes and 90% with obesity. The main histological hallmark of progressive NAFLD is fibrosis. There is a bidirectional epidemiological link between periodontitis and metabolic syndrome. NAFLD, periodontitis and diabetes share common risk factors, are characterised by inflammation and associated with changes in commensal bacteria. Therefore we tested the hypothesis that periodontitis is associated with NAFLD and with significant fibrosis in two study groups. We analyzed data from a population-based survey and a patient-based study. NHANES III participants with abdominal ultrasound and sociodemographic, clinical, and oral examination data were extracted and appropriate weighting applied. In a separate patient-based study, consenting patients with biopsy-proved NAFLD (or with liver indices too mild to justify biopsy) underwent dental examination. Basic Periodontal Examination score was recorded. In NHANES, periodontitis was significantly associated with steatosis in 8172 adults even after adjusting for sociodemographic factors. However, associations were fully explained after accounting for features of metabolic syndrome. In the patient-based study, periodontitis was significantly more common in patients with biopsy-proven NASH and any fibrosis (F0-F4) than without NASH (p = 0.009). Periodontitis was more common in patients with NASH and significant fibrosis (F2-4) than mild or no fibrosis (F0-1, p = 0.04). Complementary evidence from an epidemiological survey and a clinical study show that NAFLD is associated with periodontitis and that the association is stronger with significant liver fibrosis.
Are Statistics Labs Worth the Effort?--Comparison of Introductory Statistics Courses Using Different Teaching Methods

Directory of Open Access Journals (Sweden)

Jose H. Guardiola

2010-01-01

Full Text Available This paper compares the academic performance of students in three similar elementary statistics courses taught by the same instructor, but with the lab component differing among the three. One course is traditionally taught without a lab component; the second with a lab component using scenarios and an extensive use of technology, but without explicit coordination between lab and lecture; and the third using a lab component with an extensive use of technology that carefully coordinates the lab with the lecture. Extensive use of technology means, in this context, using Minitab software in the lab section, doing homework and quizzes using MyMathlab ©, and emphasizing interpretation of computer output during lectures. Initially, an online instrument based on Gardner’s multiple intelligences theory, is given to students to try to identify students’ learning styles and intelligence types as covariates. An analysis of covariance is performed in order to compare differences in achievement. In this study there is no attempt to measure difference in student performance across the different treatments. The purpose of this study is to find indications of associations among variables that support the claim that statistics labs could be associated with superior academic achievement in one of these three instructional environments. Also, this study tries to identify individual student characteristics that could be associated with superior academic performance. This study did not find evidence of any individual student characteristics that could be associated with superior achievement. The response variable was computed as percentage of correct answers for the three exams during the semester added together. The results of this study indicate a significant difference across these three different instructional methods, showing significantly higher mean scores for the response variable on students taking the lab component that was carefully coordinated with
A novel complete-case analysis to determine statistical significance between treatments in an intention-to-treat population of randomized clinical trials involving missing data.

Science.gov (United States)

Liu, Wei; Ding, Jinhui

2018-04-01

The application of the principle of the intention-to-treat (ITT) to the analysis of clinical trials is challenged in the presence of missing outcome data. The consequences of stopping an assigned treatment in a withdrawn subject are unknown. It is difficult to make a single assumption about missing mechanisms for all clinical trials because there are complicated reactions in the human body to drugs due to the presence of complex biological networks, leading to data missing randomly or non-randomly. Currently there is no statistical method that can tell whether a difference between two treatments in the ITT population of a randomized clinical trial with missing data is significant at a pre-specified level. Making no assumptions about the missing mechanisms, we propose a generalized complete-case (GCC) analysis based on the data of completers. An evaluation of the impact of missing data on the ITT analysis reveals that a statistically significant GCC result implies a significant treatment effect in the ITT population at a pre-specified significance level unless, relative to the comparator, the test drug is poisonous to the non-completers as documented in their medical records. Applications of the GCC analysis are illustrated using literature data, and its properties and limits are discussed.
Analysis of statistical misconception in terms of statistical reasoning

Science.gov (United States)

Maryati, I.; Priatna, N.

2018-05-01

Reasoning skills are needed by everyone in the era of globalization, because every person has to be able to manage and use information that can easily be obtained from all over the world. Statistical reasoning skill is the ability to collect, group, process, interpret, and draw conclusions from information. This skill can be developed through various levels of education. However, the skill is low because many people, students included, assume that statistics is just the ability to count and use formulas. Students still have a negative attitude toward courses related to research. The purpose of this research is to analyze students' misconceptions in a descriptive statistics course in relation to statistical reasoning skill. The observation was done by analyzing the results of a misconception test and a statistical reasoning skill test, and by observing the effect of students' misconceptions on statistical reasoning skill. The sample of this research was 32 students of a mathematics education department who had taken a descriptive statistics course. The mean score on the misconception test was 49.7 with a standard deviation of 10.6, whereas the mean score on the statistical reasoning skill test was 51.8 with a standard deviation of 8.5. If the minimum score for meeting the standard of course competence is 65, the students' mean scores are below the standard. The results of the misconception study highlighted which subtopics should be given attention. Based on the assessment results, students' misconceptions occurred in: 1) writing mathematical sentences and symbols well, 2) understanding basic definitions, 3) determining the concept to be used in solving a problem. Statistical reasoning skill was assessed by measuring reasoning about: 1) data, 2) representation, 3) statistical format, 4) probability, 5) samples, and 6) association.
ER, p53 and MIB-1 are significantly associated with malignant phyllodes tumor

Directory of Open Access Journals (Sweden)

Nurhayati H Munawer

2012-12-01

Full Text Available Background: Phyllodes tumors (PT) are rare. We evaluated the expression status of ER, Bcl2, p53, and MIB-1 protein in these tumors. Methods: One hundred and ninety-three tumors were examined using immunohistochemistry on tissue microarray. Results: ERβ (p <0.001) and p53 (p=0.006) in the stromal component were associated with tumor size. p53 expression was significantly associated with both epithelial and stromal components of malignant PTs (p<0.05). In PT, the decreased expressions of p53 and MIB-1 were significantly different with positive Bcl2 protein expression in epithelial component (p=0.000). Besides, MIB-1 was also found to be associated with ERα and ERβ in stromal component (p=0.000). Conclusion: The expression of p53 with tumor size and histological grade in PTs may increase risk for malignancy.
Significant association between polymorphism of the erythropoietin gene promoter and myelodysplastic syndrome

Directory of Open Access Journals (Sweden)

O'Brien Susan

2010-11-01

Full Text Available Abstract Background Myelodysplastic syndrome (MDS) may be induced by certain mutagenic environmental or chemotherapeutic toxins; however, the role of susceptibility genes remains unclear. The G/G genotype of the single-nucleotide polymorphism (SNP) rs1617640 in the erythropoietin (EPO) promoter has been shown to be associated with decreased EPO expression. We examined the association of rs1617640 genotype with MDS. Methods We genotyped the EPO rs1617640 SNP in 189 patients with MDS, 257 with acute myeloid leukemia (AML), 106 with acute lymphoblastic leukemia, 97 with chronic lymphocytic leukemia, 353 with chronic myeloid leukemia, and 95 healthy controls. Results The G/G genotype was significantly more common in MDS patients (47/187; 25.1%) than in controls (6/95; 6.3%) or in patients with other leukemias (101/813; 12.4%) (all P P = 0.03. Time to neutrophil recovery after therapy was significantly longer in MDS patients with the G/G genotype (P = 0.02). Conclusions These findings suggest a strong association between the rs1617640 G/G genotype and MDS. Further studies are warranted to investigate the utility of screening for this marker in individuals exposed to environmental toxins or chemotherapy.
Statistical Viewer: a tool to upload and integrate linkage and association data as plots displayed within the Ensembl genome browser

Directory of Open Access Journals (Sweden)

Hauser Elizabeth R

2005-04-01

Full Text Available Abstract Background To facilitate efficient selection and the prioritization of candidate complex disease susceptibility genes for association analysis, increasingly comprehensive annotation tools are essential to integrate, visualize and analyze vast quantities of disparate data generated by genomic screens, public human genome sequence annotation and ancillary biological databases. We have developed a plug-in package for Ensembl called "Statistical Viewer" that facilitates the analysis of genomic features and annotation in the regions of interest defined by linkage analysis. Results Statistical Viewer is an add-on package to the open-source Ensembl Genome Browser and Annotation System that displays disease study-specific linkage and/or association data as 2 dimensional plots in new panels in the context of Ensembl's Contig View and Cyto View pages. An enhanced upload server facilitates the upload of statistical data, as well as additional feature annotation to be displayed in DAS tracks, in the form of Excel files. The Statistical View panel, drawn directly under the ideogram, illustrates LOD score values for markers from a study of interest that are plotted against their position in base pairs. A module called "Get Map" easily converts the genetic locations of markers to genomic coordinates. The graph, placed under the corresponding ideogram, features a synchronized vertical sliding selection box that is seamlessly integrated into Ensembl's Contig- and Cyto- View pages to choose the region to be displayed in Ensembl's "Overview" and "Detailed View" panels. To resolve Association and Fine mapping data plots, a "Detailed Statistic View" plot corresponding to the "Detailed View" may be displayed underneath. Conclusion Features mapping to regions of linkage are accentuated when Statistic View is used in conjunction with the Distributed Annotation System (DAS) to display supplemental laboratory information such as differentially expressed disease

Intuitive introductory statistics

CERN Document Server

Wolfe, Douglas A

2017-01-01

This textbook is designed to give an engaging introduction to statistics and the art of data analysis. The unique scope includes, but also goes beyond, classical methodology associated with the normal distribution. What if the normal model is not valid for a particular data set? This cutting-edge approach provides the alternatives. It is an introduction to the world and possibilities of statistics that uses exercises, computer analyses, and simulations throughout the core lessons. These elementary statistical methods are intuitive. Counting and ranking features prominently in the text. Nonparametric methods, for instance, are often based on counts and ranks and are very easy to integrate into an introductory course. The ease of computation with advanced calculators and statistical software, both of which factor into this text, allows important techniques to be introduced earlier in the study of statistics. This book's novel scope also includes measuring symmetry with Walsh averages, finding a nonp...
Low statistical power in biomedical science: a review of three human research domains

Science.gov (United States)

Dumas-Mallet, Estelle; Button, Katherine S.; Boraud, Thomas; Gonon, Francois

2017-01-01

Studies with low statistical power increase the likelihood that a statistically significant finding represents a false positive result. We conducted a review of meta-analyses of studies investigating the association of biological, environmental or cognitive parameters with neurological, psychiatric and somatic diseases, excluding treatment studies, in order to estimate the average statistical power across these domains. Taking the effect size indicated by a meta-analysis as the best estimate of the likely true effect size, and assuming a threshold for declaring statistical significance of 5%, we found that approximately 50% of studies have statistical power in the 0–10% or 11–20% range, well below the minimum of 80% that is often considered conventional. Studies with low statistical power appear to be common in the biomedical sciences, at least in the specific subject areas captured by our search strategy. However, we also observe evidence that this depends in part on research methodology, with candidate gene studies showing very low average power and studies using cognitive/behavioural measures showing high average power. This warrants further investigation. PMID:28386409
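To make the notion of statistical power in this abstract concrete, here is a minimal Python sketch of a power calculation. The choice of a two-sided, two-sample z-test with equal group sizes and a standardized effect size is an illustrative assumption; the review covers many study designs and does not prescribe this formula. The function name `two_sample_power` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_sample_power(effect_size: float, n_per_group: int, alpha: float = 0.05) -> float:
    """Approximate power of a two-sided, two-sample z-test.

    effect_size is a standardized mean difference (Cohen's d). Equal group
    sizes and known unit variance are assumed for simplicity.
    """
    z = NormalDist()
    z_crit = z.inv_cdf(1 - alpha / 2)          # critical value for the 5% threshold
    shift = effect_size * sqrt(n_per_group / 2)  # expected z under the alternative
    # Probability of landing in either rejection region
    return z.cdf(shift - z_crit) + z.cdf(-shift - z_crit)


# A medium effect with 64 subjects per group sits near the conventional 80%,
# while a small effect with 20 per group falls in the low-power range.
print(f"d=0.5, n=64: {two_sample_power(0.5, 64):.2f}")
print(f"d=0.2, n=20: {two_sample_power(0.2, 20):.2f}")
```

With a true effect of zero the function returns the significance threshold itself, which is a quick sanity check on the formula.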
Evaluation of the truncated perturbed chain-polar statistical associating fluid theory for complex mixture fluid phase equilibria

DEFF Research Database (Denmark)

Karakatsani, Eirini; Kontogeorgis, Georgios; Economou, Ioannis

2006-01-01

Perturbed chain-statistical associating fluid theory (PC-SAFT) was extended rigorously to polar fluids based on the theory of Stell and co-workers [Mol. Phys. 1977, 33, 987]. The new PC-PSAFT was simplified to truncated PC-PSAFT (tPC-PSAFT) so that it can be practical for real polar fluid...
Confidence Intervals for Effect Sizes: Compliance and Clinical Significance in the "Journal of Consulting and Clinical Psychology"

Science.gov (United States)

Odgaard, Eric C.; Fowler, Robert L.

2010-01-01

Objective: In 2005, the "Journal of Consulting and Clinical Psychology" ("JCCP") became the first American Psychological Association (APA) journal to require statistical measures of clinical significance, plus effect sizes (ESs) and associated confidence intervals (CIs), for primary outcomes (La Greca, 2005). As this represents the single largest…
Ureaplasma parvum and Mycoplasma genitalium are found to be significantly associated with microscopy-confirmed urethritis in a routine genitourinary medicine setting.

Science.gov (United States)

Cox, Ciara; McKenna, James P; Watt, Alison P; Coyle, Peter V

2016-09-01

Inflammation of the urethra defined by an excess of polymorphonuclear leukocytes in the absence of sexually transmitted Chlamydia trachomatis and Neisseria gonorrhoeae is called non-chlamydial non-gonococcal urethritis (NCNGU). Although Mycoplasma genitalium is now recognised as causing a sexually transmitted infection, the clinical significance of the other Mollicute species is less clear. This study used specific real-time quantitative polymerase chain reaction assays to detect and quantify four Mollicute species, M. genitalium, M. hominis, Ureaplasma urealyticum and U. parvum, in urine specimens from men with and without NCNGU. A total of 165 urine specimens from male patients attending a genitourinary medicine clinic were eligible for the study, with microscopy-confirmed (≥5 polymorphonuclear leukocytes in urethral swab) NCNGU in 75 (45.5%) and non-confirmed NCNGU in 90 (54.5%). Chi-squared statistical analysis indicated a significantly higher prevalence of U. parvum (17.3% vs. 5.6%; p = 0.03) and M. genitalium (12% vs. 0%; p < 0.001) in NCNGU. In a subset analysis, M. genitalium was also significantly (p = 0.03) higher in men who have sex with men (MSM; 13.5%) compared to non-MSM (3.1%). No significant associations were reported for U. urealyticum and M. hominis. In conclusion, this study supports a clinically significant role in NCNGU for both U. parvum and M. genitalium. © The Author(s) 2015.
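The U. parvum comparison in this abstract can be approximated with a 2x2 chi-squared test, sketched here in Python. Two things are assumptions rather than facts from the abstract: the counts (13 of 75 and 5 of 90) are back-calculated from the reported percentages, and Yates' continuity correction is applied, since the abstract says only "Chi-squared". The function name `chi2_yates_2x2` is hypothetical.

```python
from math import erfc, sqrt


def chi2_yates_2x2(a: int, b: int, c: int, d: int) -> tuple[float, float]:
    """Chi-squared test with Yates' continuity correction for [[a, b], [c, d]].

    Returns (statistic, p_value) for 1 degree of freedom.
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    # Closed-form Yates-corrected statistic for a 2x2 table
    stat = n * max(abs(a * d - b * c) - n / 2, 0) ** 2 / (row1 * row2 * col1 * col2)
    # Survival function of chi-squared with 1 df: P(Z^2 > x) = erfc(sqrt(x / 2))
    return stat, erfc(sqrt(stat / 2))


# Assumed counts: U. parvum positive/negative in confirmed vs. non-confirmed NCNGU
stat, p = chi2_yates_2x2(13, 62, 5, 85)
print(f"chi2 = {stat:.2f}, p = {p:.3f}")
```

With these assumed counts the p-value comes out near the reported 0.03. The same correction does not obviously reproduce the M. genitalium figure, so the authors' exact procedure remains uncertain.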
Significant association between ERCC2 and MTHR polymorphisms and breast cancer susceptibility in Moroccan population: genotype and haplotype analysis in a case-control study.

Science.gov (United States)

Hardi, Hanaa; Melki, Rahma; Boughaleb, Zouhour; El Harroudi, Tijani; Aissaoui, Souria; Boukhatem, Noureddine

2018-03-15

Genetic determinants of breast cancer (BC) remain largely unknown in the majority of Moroccan patients. The purpose of this study was to explore the association of ERCC2 and MTHFR polymorphisms with genetic susceptibility to breast cancer in the Moroccan population. We genotyped ERCC2 polymorphisms (rs1799793 (G934A) and rs13181 (A2251C)) and MTHFR polymorphisms (rs1801133 (C677T) and rs1801131 (A1298C)) using TaqMan SNP Genotyping Assays. Genotypes were compared in 151 BC cases and 156 population-matched controls. Allelic, genotypic and haplotype associations with the risk and clinicopathological features of BC were assessed using logistic regression analyses. ERCC2-rs1799793-AA genotype was associated with high risk of BC compared to wild type genotype (recessive model: OR: 2.90, 95% CI: 1.34-6.26, p = 0.0069) even after Bonferroni correction (p < 0.0125). MTHFR rs1801133-TT genotype was associated with increased risk of BC (recessive model, OR: 2.49, 95% CI: 1.17-5.29, p = 0.017) but the association became non-significant after Bonferroni correction. For the rest of the SNPs, no statistical associations with BC risk were detected. Significant association with clinical features was detected for MTHFR-rs1801133-TC genotype with early age at diagnosis and familial BC. Following Bonferroni correction, only association with familial BC remained significant. MTHFR-rs1801131-CC genotype was associated with sporadic BC. ERCC2-rs1799793-AA genotype correlated with ER+ and PR+ breast cancer. ERCC2-rs13181-CA genotype was significantly associated with large tumors (T ≥ 3) in BC patients. None of these associations passed Bonferroni correction. Haplotype analysis showed that ERCC2 A-C haplotype was significantly associated with increased BC risk (OR: 3.71, 95% CI: 1.7-8.12, p = 0.0002 and p = 0.0008 before and after Bonferroni correction, respectively) and positive expression of ER and PR in BC patients. ERCC2 G-C haplotype was correlated with PR negative and
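The Bonferroni threshold quoted in the abstract above (0.0125) is consistent with dividing α = 0.05 by four tests, presumably one per genotyped SNP. The following is a minimal sketch under that assumption; the helper names are hypothetical, and the Wald-type back-calculation of a p-value from an odds ratio and its 95% CI is a common approximation, not necessarily the authors' logistic-regression output.

```python
import math

ALPHA = 0.05
N_TESTS = 4  # assumption: one test per genotyped SNP

def bonferroni_threshold(alpha, m):
    """Per-test significance threshold under Bonferroni correction."""
    return alpha / m

def p_from_or_ci(odds_ratio, lo, hi, z_crit=1.96):
    """Approximate two-sided Wald p-value from an odds ratio and its 95% CI."""
    se = (math.log(hi) - math.log(lo)) / (2 * z_crit)  # SE of log(OR)
    z = math.log(odds_ratio) / se
    return math.erfc(abs(z) / math.sqrt(2))

threshold = bonferroni_threshold(ALPHA, N_TESTS)

# p-values as reported in the abstract
reported = {"ERCC2 rs1799793-AA": 0.0069, "MTHFR rs1801133-TT": 0.017}
survives = {name: p < threshold for name, p in reported.items()}

# Consistency check: recover the ERCC2 p-value from OR 2.90 (95% CI 1.34-6.26)
p_ercc2 = p_from_or_ci(2.90, 1.34, 6.26)

print(threshold, survives, round(p_ercc2, 4))
```

Under these assumptions the ERCC2 genotype stays below the corrected threshold while the MTHFR genotype does not, matching the abstract's description. Multiplying the haplotype p-value of 0.0002 by four likewise gives the reported corrected value of 0.0008.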
Statistical characterization report for Single-Shell Tank 241-T-107

International Nuclear Information System (INIS)

Cromar, R.D.; Wilmarth, S.R.; Jensen, L.

1994-01-01

This report contains the results of the statistical analysis of data from three core samples obtained from single-shell tank 241-T-107 (T-107). Four specific topics are addressed. They are summarized below. Section 3.0 contains mean concentration estimates of analytes found in T-107. The estimates of "error" associated with the concentration estimates are given as 95% confidence intervals (CI) on the mean. The results given are based on three types of samples: core composite samples, core segment samples, and drainable liquid samples. Section 4.0 contains estimates of the spatial variability (variability between cores and between segments) and the analytical variability (variability between the primary and the duplicate analysis). Statistical tests were performed to test the hypothesis that the between cores and the between segments spatial variability is zero. The results of the tests are as follows. Based on the core composite data, the between cores variance is significantly different from zero for 35 out of 74 analytes; i.e., for 53% of the analytes there is no statistically significant difference between the concentration means for two cores. Based on core segment data, the between segments variance is significantly different from zero for 22 out of 24 analytes and the between cores variance is significantly different from zero for 4 out of 24 analytes; i.e., for 8% of the analytes there is no statistically significant difference between segment means and for 83% of the analytes there is no difference between the means from the three cores. Section 5.0 contains the results of the application of multiple comparison methods to the core composite data, the core segment data, and the drainable liquid data. Section 6.0 contains the results of a statistical test conducted to determine the 222-S Analytical Laboratory's ability to homogenize solid core segments
Statistical Methods in Psychology Journals.

Science.gov (United States)

Wilkinson, Leland

1999-01-01

Proposes guidelines for revising the American Psychological Association (APA) publication manual or other APA materials to clarify the application of statistics in research reports. The guidelines are intended to induce authors and editors to recognize the thoughtless application of statistical methods. Contains 54 references. (SLD)
Technical issues relating to the statistical parametric mapping of brain SPECT studies

International Nuclear Information System (INIS)

Hatton, R.L.; Cordato, N.; Hutton, B.F.; Lau, Y.H.; Evans, S.G.

2000-01-01

Full text: Statistical Parametric Mapping (SPM) is a software tool designed for the statistical analysis of functional neuroimages, specifically Positron Emission Tomography and functional Magnetic Resonance Imaging, and more recently SPECT. This review examines some problems associated with the analysis of SPECT. A comparison of a patient group with normal studies revealed factors that could influence results, some that commonly occur, others that require further exploration. To optimise the differences between two groups of subjects, both spatial variability and differences in global activity must be minimised. The choice and effectiveness of co-registration method and approach to normalisation of activity concentration can affect the optimisation. A small number of subject scans were identified as possessing truncated data resulting in edge effects that could adversely influence the analysis. Other problems included unusual areas of significance possibly related to reconstruction methods and the geometry associated with nonparallel collimators. Areas of extracerebral significance are a point of concern and may result from scatter effects or misregistration. Difficulties in patient positioning, due to postural limitations, can lead to resolution differences. SPM has been used to assess areas of statistical significance arising from these technical factors, as opposed to areas of true clinical significance when comparing subject groups. This contributes to a better understanding of the effects of technical factors so that these may be eliminated, minimised, or incorporated in the study design. Copyright (2000) The Australian and New Zealand Society of Nuclear Medicine Inc
Quality of statistical reporting in developmental disability journals.

Science.gov (United States)

Namasivayam, Aravind K; Yan, Tina; Wong, Wing Yiu Stephanie; van Lieshout, Pascal

2015-12-01

Null hypothesis significance testing (NHST) dominates quantitative data analysis, but its use is controversial and has been heavily criticized. The American Psychological Association has advocated the reporting of effect sizes (ES), confidence intervals (CIs), and statistical power analysis to complement NHST results to provide a more comprehensive understanding of research findings. The aim of this paper is to carry out a sample survey of statistical reporting practices in two journals with the highest h5-index scores in the areas of developmental disability and rehabilitation. Using a checklist that includes critical recommendations by American Psychological Association, we examined 100 randomly selected articles out of 456 articles reporting inferential statistics in the year 2013 in the Journal of Autism and Developmental Disorders (JADD) and Research in Developmental Disabilities (RDD). The results showed that for both journals, ES were reported only half the time (JADD 59.3%; RDD 55.87%). These findings are similar to psychology journals, but are in stark contrast to ES reporting in educational journals (73%). Furthermore, a priori power and sample size determination (JADD 10%; RDD 6%), along with reporting and interpreting precision measures (CI: JADD 13.33%; RDD 16.67%), were the least reported metrics in these journals, but not dissimilar to journals in other disciplines. To advance the science in developmental disability and rehabilitation and to bridge the research-to-practice divide, reforms in statistical reporting, such as providing supplemental measures to NHST, are clearly needed.
Characterization of microbial associations with methanotrophic archaea and sulfate-reducing bacteria through statistical comparison of nested Magneto-FISH enrichments.

Science.gov (United States)

Trembath-Reichert, Elizabeth; Case, David H; Orphan, Victoria J

2016-01-01

Methane seep systems along continental margins host diverse and dynamic microbial assemblages, sustained in large part through the microbially mediated process of sulfate-coupled Anaerobic Oxidation of Methane (AOM). This methanotrophic metabolism has been linked to consortia of anaerobic methane-oxidizing archaea (ANME) and sulfate-reducing bacteria (SRB). These two groups are the focus of numerous studies; however, less is known about the wide diversity of other seep associated microorganisms. We selected a hierarchical set of FISH probes targeting a range of Deltaproteobacteria diversity. Using the Magneto-FISH enrichment technique, we then magnetically captured CARD-FISH hybridized cells and their physically associated microorganisms from a methane seep sediment incubation. DNA from nested Magneto-FISH experiments was analyzed using Illumina tag 16S rRNA gene sequencing (iTag). Enrichment success and potential bias with iTag was evaluated in the context of full-length 16S rRNA gene clone libraries, CARD-FISH, functional gene clone libraries, and iTag mock communities. We determined commonly used Earth Microbiome Project (EMP) iTAG primers introduced bias in some common methane seep microbial taxa that reduced the ability to directly compare OTU relative abundances within a sample, but comparison of relative abundances between samples (in nearly all cases) and whole community-based analyses were robust. The iTag dataset was subjected to statistical co-occurrence measures of the most abundant OTUs to determine which taxa in this dataset were most correlated across all samples. Many non-canonical microbial partnerships were statistically significant in our co-occurrence network analysis, most of which were not recovered with conventional clone library sequencing, demonstrating the utility of combining Magneto-FISH and iTag sequencing methods for hypothesis generation of associations within complex microbial communities. Network analysis pointed to many co
The MAX Statistic is Less Powerful for Genome Wide Association Studies Under Most Alternative Hypotheses.

Science.gov (United States)

Shifflett, Benjamin; Huang, Rong; Edland, Steven D

2017-01-01

Genotypic association studies are prone to inflated type I error rates if multiple hypothesis testing is performed, e.g., sequentially testing for recessive, multiplicative, and dominant risk. Alternatives to multiple hypothesis testing include the model independent genotypic χ² test, the efficiency robust MAX statistic, which corrects for multiple comparisons but with some loss of power, or a single Armitage test for multiplicative trend, which has optimal power when the multiplicative model holds but with some loss of power when dominant or recessive models underlie the genetic association. We used Monte Carlo simulations to describe the relative performance of these three approaches under a range of scenarios. All three approaches maintained their nominal type I error rates. The genotypic χ² and MAX statistics were more powerful when testing a strictly recessive genetic effect or when testing a dominant effect when the allele frequency was high. The Armitage test for multiplicative trend was most powerful for the broad range of scenarios where heterozygote risk is intermediate between recessive and dominant risk. Moreover, all tests had limited power to detect recessive genetic risk unless the sample size was large, and conversely all tests were relatively well powered to detect dominant risk. Taken together, these results suggest the general utility of the multiplicative trend test when the underlying genetic model is unknown.
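For readers unfamiliar with the trend test discussed in the abstract above, the following is a minimal sketch of the standard Cochran-Armitage statistic for a 2x3 genotype table with additive scores (0, 1, 2). It is a textbook formulation, not the authors' simulation code; the function name and the genotype counts are hypothetical and serve only to illustrate the arithmetic.

```python
import math

def armitage_trend(cases, controls, scores=(0, 1, 2)):
    """Cochran-Armitage trend test for a 2x3 genotype table.

    cases, controls: counts per genotype (e.g. aa, Aa, AA).
    Returns the z statistic and a two-sided normal p-value.
    """
    R, S = sum(cases), sum(controls)
    N = R + S
    totals = [r + s for r, s in zip(cases, controls)]
    # T = sum_i t_i * (r_i * S - s_i * R)
    T = sum(t * (r * S - s * R) for t, r, s in zip(scores, cases, controls))
    sum_t_n = sum(t * n for t, n in zip(scores, totals))
    sum_t2_n = sum(t * t * n for t, n in zip(scores, totals))
    # Var(T) under the null hypothesis of no association
    var = (R * S / N) * (N * sum_t2_n - sum_t_n ** 2)
    z = T / math.sqrt(var)
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p

# Hypothetical genotype counts, for illustration only
z, p = armitage_trend(cases=(30, 50, 20), controls=(50, 40, 10))
print(f"z = {z:.3f}, p = {p:.4f}")
```

The squared statistic equals N times the squared Pearson correlation between genotype score and case status, which is why the test is most sensitive when risk changes steadily with the number of risk alleles.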
Clinical Significance of Immunophenotypic Markers in Pediatric T-cell Acute Lymphoblastic Leukemia

International Nuclear Information System (INIS)

SIDHOM, I.; SHAABAN, Kh.; SOLIMAN, S.; HAMDY, N.; YASSIN, D.; SALEM, Sh.; HASSANEIN, H.; MANSOUR, M.T.; EZZAT, S.; EL-ANWAR, W.

2008-01-01

Background: Cell-marker profiling has led to conflicting conclusions about its prognostic significance in T-ALL. Aim: To investigate the prevalence of the expression of CD34, CD10 and myeloid associated antigens (CD13/CD33) in childhood T-ALL and to relate their presence to initial clinical and biologic features and early response to therapy. Patients and Methods: This study included 67 consecutive patients with newly diagnosed T-ALL recruited from the Children's Cancer Hospital in Egypt during the time period from July 2007 to June 2008. Immunophenotypic markers and minimal residual disease (MRD) were studied by five-color flow cytometry. Results: The frequency of CD34 was 34.9%, CD10 33.3%, while CD13/CD33 was 18.8%. No significant association was encountered between CD34, CD10 or myeloid antigen positivity and the presenting clinical features such as age, sex, TLC and CNS leukemia. Only CD10+ expression had a significant association with initial CNS involvement (p=0.039). CD34 and CD13/CD33 expression was significantly associated with T-cell maturation stages (p<0.05). No relationship was observed for age, TLC, gender, NCI risk or CNS involvement with early response to therapy illustrated by BM as well as MRD day 15 and day 42. CD34+, CD13/CD33+ and early T-cell stage had high MRD levels on day 15 that were statistically highly significant (p<0.01), but CD10+ had a statistically significant lower MRD level on day 15 (p=0.049). However, only CD34 retained its significance at an MRD cut-off level of 0.01%. Conclusion: CD34, CD10, CD13/CD33 expression, as well as T-cell maturation stages, may have prognostic significance in pediatric T-ALL as they have a significant impact on early clearance of leukemic cells detected by MRD day 15.
Expression and prognostic significance of lysozyme in male breast cancer

International Nuclear Information System (INIS)

Serra, Carlos; Baltasar, Aniceto; Medrano, Justo; Vizoso, Francisco; Alonso, Lorena; Rodríguez, Juan C; González, Luis O; Fernández, María; Lamelas, María L; Sánchez, Luis M; García-Muñiz, José L

2002-01-01

Lysozyme, one of the major protein components of human milk that is also synthesized by a significant percentage of breast carcinomas, is associated with lesions that have a favorable outcome in female breast cancer. Here we evaluate the expression and prognostic value of lysozyme in male breast cancer (MBC). Lysozyme expression was examined by immunohistochemical methods in a series of 60 MBC tissue sections and in 15 patients with gynecomastia. Staining was quantified using the HSCORE (histological score) system, which considers both the intensity and the percentage of cells staining at each intensity. Prognostic value of lysozyme was retrospectively evaluated by multivariate analysis taking into account conventional prognostic factors. Lysozyme immunostaining was negative in all cases of gynecomastia. A total of 27 of 60 MBC sections (45%) stained positively for this protein, but there were clear differences among them with regard to the intensity and percentage of stained cells. Statistical analysis showed that lysozyme HSCORE values in relation to age, tumor size, nodal status, histological grade, estrogen receptor status, metastasis and histological type did not increase the statistical significance. Univariate analysis confirmed that both nodal involvement and lysozyme values were significant predictors of short-term relapse-free survival. Multivariate analysis, according to Cox's regression model, also showed that nodal status and lysozyme levels were significant independent indicators of short-term relapse-free survival. Tumor expression of lysozyme is associated with lesions that have an unfavorable outcome in male breast cancer. This milk protein may be a new prognostic factor in patients with breast cancer
Evaluating statistical and clinical significance of intervention effects in single-case experimental designs: an SPSS method to analyze univariate data.

Science.gov (United States)

Maric, Marija; de Haan, Else; Hogendoorn, Sanne M; Wolters, Lidewij H; Huizenga, Hilde M

2015-03-01

Single-case experimental designs are useful methods in clinical research practice to investigate individual client progress. Their proliferation might have been hampered by methodological challenges such as the difficulty applying existing statistical procedures. In this article, we describe a data-analytic method to analyze univariate (i.e., one symptom) single-case data using the common package SPSS. This method can help the clinical researcher to investigate whether an intervention works as compared with a baseline period or another intervention type, and to determine whether symptom improvement is clinically significant. First, we describe the statistical method in a conceptual way and show how it can be implemented in SPSS. Simulation studies were performed to determine the number of observation points required per intervention phase. Second, to illustrate this method and its implications, we present a case study of an adolescent with anxiety disorders treated with cognitive-behavioral therapy techniques in an outpatient psychotherapy clinic, whose symptoms were regularly assessed before each session. We provide a description of the data analyses and results of this case study. Finally, we discuss the advantages and shortcomings of the proposed method. Copyright © 2014. Published by Elsevier Ltd.
Prognostic significance of obstructive uropathy in advanced prostate cancer.

Science.gov (United States)

Oefelein, Michael G

2004-06-01

To report the incidence and prognostic implications of obstructive uropathy (OU) in patients with advanced prostate cancer receiving androgen deprivation therapy and to define the impact initial local therapy has on the development of OU in patients with prostate cancer who develop recurrence and begin androgen deprivation therapy. From a population of 260 patients with advanced prostate cancer diagnosed between 1986 and 2003, OU was identified in 51 patients. The OU treatment options included ureteral stent, percutaneous nephrostomy, transurethral resection of the prostate, Foley catheter placement, and urinary diversion. Overall survival and the factors that influenced survival were calculated using standard statistical methods. OU was diagnosed in 15 (16%) of 80 patients who received local therapy with curative intent and in whom local therapy subsequently failed and in 36 (19%) of 180 patients who had never received local therapy (P = 0.7, chi-square test). Of these 51 patients, 39 had bladder neck obstruction and 16 had ureteral obstruction. Overall survival was significantly worse for the men with OU compared with those without OU (41 versus 54 months). OU was associated with tumor stage and androgen-insensitive prostate cancer. OU results in significantly reduced survival in men with prostate cancer. In a select group of patients with prostate cancer with progression after local therapy (primarily radiotherapy), no statistically significant reduction in the development of OU was observed relative to patients matched for stage, grade, and pretreatment prostate-specific antigen level treated with androgen deprivation therapy alone. Aggressive advanced stage and hormone-insensitive disease are variables associated with OU.
Performance studies of GooFit on GPUs vs RooFit on CPUs while estimating the statistical significance of a new physical signal

Science.gov (United States)

Di Florio, Adriano

2017-10-01

In order to test the computing capabilities of GPUs with respect to traditional CPU cores a high-statistics toy Monte Carlo technique has been implemented both in ROOT/RooFit and GooFit frameworks with the purpose to estimate the statistical significance of the structure observed by CMS close to the kinematical boundary of the J/ψϕ invariant mass in the three-body decay B⁺ → J/ψϕK⁺. GooFit is a data analysis open tool under development that interfaces ROOT/RooFit to CUDA platform on NVIDIA GPU. The optimized GooFit application running on GPUs hosted by servers in the Bari Tier2 provides striking speed-up performances with respect to the RooFit application parallelised on multiple CPUs by means of PROOF-Lite tool. The considerable resulting speed-up, evident when comparing concurrent GooFit processes allowed by CUDA Multi Process Service and a RooFit/PROOF-Lite process with multiple CPU workers, is presented and discussed in detail. By means of GooFit it has also been possible to explore the behaviour of a likelihood ratio test statistic in different situations in which the Wilks Theorem may or may not apply because its regularity conditions are not satisfied.
Wind energy statistics

International Nuclear Information System (INIS)

Holttinen, H.; Tammelin, B.; Hyvoenen, R.

1997-01-01

The recording, analyzing and publishing of statistics of wind energy production has been reorganized in cooperation between VTT Energy, the Finnish Meteorological Institute (FMI Energy) and the Finnish Wind Energy Association (STY), and supported by the Ministry of Trade and Industry (KTM). VTT Energy has developed a database that contains both monthly data and information on the wind turbines, sites and operators involved. The monthly production figures together with component failure statistics are collected from the operators by VTT Energy, which produces the final wind energy statistics to be published in Tuulensilmae and reported to energy statistics in Finland and abroad (Statistics Finland, Eurostat, IEA). To be able to verify the annual and monthly wind energy potential against the average wind energy climate, a production index is adopted. The index gives the expected wind energy production at various areas in Finland calculated using real wind speed observations, air density and a power curve for a typical 500 kW wind turbine. FMI Energy has produced the average figures for four weather stations using the data from 1985-1996, and produces the monthly figures. (orig.)
Quantum formalism for classical statistics

Science.gov (United States)

Wetterich, C.

2018-06-01

In static classical statistical systems the problem of information transport from a boundary to the bulk finds a simple description in terms of wave functions or density matrices. While the transfer matrix formalism is a type of Heisenberg picture for this problem, we develop here the associated Schrödinger picture that keeps track of the local probabilistic information. The transport of the probabilistic information between neighboring hypersurfaces obeys a linear evolution equation, and therefore the superposition principle for the possible solutions. Operators are associated to local observables, with rules for the computation of expectation values similar to quantum mechanics. We discuss how non-commutativity naturally arises in this setting. Also other features characteristic of quantum mechanics, such as complex structure, change of basis or symmetry transformations, can be found in classical statistics once formulated in terms of wave functions or density matrices. We construct for every quantum system an equivalent classical statistical system, such that time in quantum mechanics corresponds to the location of hypersurfaces in the classical probabilistic ensemble. For suitable choices of local observables in the classical statistical system one can, in principle, compute all expectation values and correlations of observables in the quantum system from the local probabilistic information of the associated classical statistical system. Realizing a static memory material as a quantum simulator for a given quantum system is not a matter of principle, but rather of practical simplicity.

Statistical identification of gene association by CID in application of constructing ER regulatory network

Directory of Open Access Journals (Sweden)

Lien Huang-Chun

2009-03-01

Full Text Available Abstract Background A variety of high-throughput techniques are now available for constructing comprehensive gene regulatory networks in systems biology. In this study, we report a new statistical approach for facilitating in silico inference of regulatory network structure. The new measure of association, coefficient of intrinsic dependence (CID), is model-free and can be applied to both continuous and categorical distributions. When given two variables X and Y, CID answers whether Y is dependent on X by examining the conditional distribution of Y given X. In this paper, we apply CID to analyze the regulatory relationships between transcription factors (TFs) (X) and their downstream genes (Y) based on clinical data. More specifically, we use estrogen receptor α (ERα) as the variable X, and the analyses are based on 48 clinical breast cancer gene expression arrays (48A). Results The analytical utility of CID was evaluated in comparison with four commonly used statistical methods, Galton-Pearson's correlation coefficient (GPCC), Student's t-test (STT), coefficient of determination (CoD), and mutual information (MI). When being compared to GPCC, CoD, and MI, CID reveals its preferential ability to discover the regulatory association where distribution of the mRNA expression levels on X and Y does not fit linear models. On the other hand, when CID is used to measure the association of a continuous variable (Y) against a discrete variable (X), it shows similar performance as compared to STT, and appears to outperform CoD and MI. In addition, this study established a two-layer transcriptional regulatory network to exemplify the usage of CID, in combination with GPCC, in deciphering gene networks based on gene expression profiles from patient arrays. Conclusion CID is shown to provide useful information for identifying associations between genes and transcription factors of interest in patient arrays. When coupled with the relationships detected by GPCC, the
The significance of reduced respiratory chain enzyme activities: clinical, biochemical and radiological associations.

Science.gov (United States)

Mordekar, S R; Guthrie, P; Bonham, J R; Olpin, S E; Hargreaves, I; Baxter, P S

2006-03-01

Mitochondrial diseases are an important group of neurometabolic disorders in children with varied clinical presentations and diagnosis that can be difficult to confirm. To report the significance of reduced respiratory chain enzyme (RCE) activity in muscle biopsy samples from children. Retrospective odds ratio was used to compare clinical and biochemical features, DNA studies, neuroimaging, and muscle biopsies in 18 children with and 48 without reduced RCE activity. Children with reduced RCE activity were significantly more likely to have consanguineous parents, to present with acute encephalopathy and lactic acidaemia and/or within the first year of life; to have an axonal neuropathy, CSF lactate >4 mmol/l; and/or to have signal change in the basal ganglia. There were positive associations with a maternal family history of possible mitochondrial cytopathy; a presentation with failure to thrive and lactic acidaemia, ragged red fibres, reduced fibroblast fatty acid oxidation and with an abnormal allopurinol loading test. There was no association with ophthalmic abnormalities, deafness, epilepsy or myopathy. The association of these clinical, biochemical and radiological features with reduced RCE activity suggests a possible causative link.
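The abstract above compares features between 18 children with and 48 without reduced RCE activity using odds ratios but gives no per-feature counts. As a purely illustrative sketch, the code below computes an odds ratio with a Woolf (log-based) 95% confidence interval for one invented 2x2 table. The counts and the function name are hypothetical, and the choice of interval is an assumption, since the abstract does not say how intervals were obtained.

```python
import math

def odds_ratio_ci(a, b, c, d, z_crit=1.96):
    """Odds ratio and Woolf 95% CI for the 2x2 table [[a, b], [c, d]].

    Rows are groups; columns are feature present / feature absent.
    All four cells must be non-zero.
    """
    odds_ratio = (a * d) / (b * c)
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # SE of log(OR)
    lo = math.exp(math.log(odds_ratio) - z_crit * se)
    hi = math.exp(math.log(odds_ratio) + z_crit * se)
    return odds_ratio, lo, hi

# Hypothetical counts: feature present in 9 of 18 children with reduced RCE
# activity and in 6 of 48 children without it.
odds_ratio, lo, hi = odds_ratio_ci(9, 9, 6, 42)
print(f"OR = {odds_ratio:.2f}, 95% CI {lo:.2f}-{hi:.2f}")
```

An interval that excludes 1 corresponds to a significant association at the 5% level. With groups this small the interval is wide, so the point estimate should be read with caution.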
Robust statistical methods for significance evaluation and applications in cancer driver detection and biomarker discovery

DEFF Research Database (Denmark)

Madsen, Tobias

2017-01-01

In the present thesis I develop, implement and apply statistical methods for detecting genomic elements implicated in cancer development and progression. This is done in two separate bodies of work. The first uses the somatic mutation burden to distinguish cancer driver mutations from passenger m...
Allergic Contact Dermatitis Is Associated with Significant Oxidative Stress

Directory of Open Access Journals (Sweden)

S. Kaur

2014-01-01

Full Text Available Background. Research has confirmed the involvement of oxidative stress (OxS) in allergic contact dermatitis whilst other inflammation-related biomarkers have been less studied. Objective. To evaluate systemic levels of selected inflammatory markers, OxS indices and adipokines as well as their associations in allergic contact dermatitis. Methods. In 40 patients, interleukin-6 (IL-6), monocyte chemoattractant protein-1 (MCP-1), and IL-10 levels were measured in sera with the Evidence Investigator Cytokine & Growth factors High-Sensitivity Array, total peroxide concentration (TPX) and total antioxidant capacity (TAC) by means of spectrophotometry, and the plasma concentrations of adiponectin and leptin by the quantitative sandwich enzyme immunoassay technique. Results. TNF-α level (P < 0.01) and TPX (P < 0.0001) were increased whilst IL-10 (P < 0.05) and TAC (P < 0.0001) were decreased in the patients as compared to controls. Correlation and multiple linear regression analysis identified both TPX and TAC (inversely) as possible independent markers for evaluating allergic contact dermatitis. Adiponectin level in patients was increased (P < 0.0001), but neither adiponectin nor leptin correlated significantly with the biomarkers of inflammation or OxS. Conclusion. OxS parameters, especially TPX and OSI, reflect the degree of systemic inflammation associated with allergic contact dermatitis in the best way. The relation between OxS and adiponectin level warrants further studies.
Statistics in the 21st century

CERN Document Server

Wells, Martin T

2001-01-01

Exactly what is the state of the art in statistics as we move forward into the 21st century? What promises, what trends does its future hold? Through the reflections of 70 of the world's leading statistical methodologists, researchers, theorists, and practitioners, Statistics in the 21st Century answers those questions. Originally published in the Journal of the American Statistical Association, this collection of vignettes examines our statistical past, comments on our present, and speculates on our future. Although the coverage is broad and the topics diverse, it reveals the essential intell
Significance evaluation in factor graphs

DEFF Research Database (Denmark)

Madsen, Tobias; Hobolth, Asger; Jensen, Jens Ledet

2017-01-01

in genomics and the multiple-testing issues accompanying them, accurate significance evaluation is of great importance. We here address the problem of evaluating statistical significance of observations from factor graph models. Results Two novel numerical approximations for evaluation of statistical...... significance are presented. First a method using importance sampling. Second a saddlepoint approximation based method. We develop algorithms to efficiently compute the approximations and compare them to naive sampling and the normal approximation. The individual merits of the methods are analysed both from....... Conclusions The applicability of saddlepoint approximation and importance sampling is demonstrated on known models in the factor graph framework. Using the two methods we can substantially improve computational cost without compromising accuracy. This contribution allows analyses of large datasets...
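The abstract above contrasts importance sampling with naive sampling for evaluating small p-values. The following is a minimal sketch of that contrast, not the authors' factor-graph algorithm: it assumes a toy setting in which the test statistic is standard normal, and the function names and the shifted-normal proposal are illustrative choices.

```python
import math
import random


def exact_tail(threshold):
    """Exact P(Z > threshold) for a standard normal Z."""
    return 0.5 * math.erfc(threshold / math.sqrt(2.0))


def naive_tail_estimate(threshold, n, rng):
    """Naive Monte Carlo: fraction of standard normal draws above the threshold."""
    hits = sum(1 for _ in range(n) if rng.gauss(0.0, 1.0) > threshold)
    return hits / n


def importance_tail_estimate(threshold, n, rng):
    """Importance sampling with proposal N(threshold, 1).

    Each draw x is weighted by the density ratio
    phi(x) / phi(x - threshold) = exp(-threshold * x + threshold**2 / 2).
    """
    total = 0.0
    for _ in range(n):
        x = rng.gauss(threshold, 1.0)
        if x > threshold:
            total += math.exp(-threshold * x + 0.5 * threshold ** 2)
    return total / n


if __name__ == "__main__":
    rng = random.Random(1)
    t = 4.0
    print("exact     ", exact_tail(t))
    print("naive     ", naive_tail_estimate(t, 20000, rng))
    print("importance", importance_tail_estimate(t, 20000, rng))
```

With a tail probability of about 3e-5, the naive estimator usually sees zero or one exceedance in 20,000 draws, whereas the shifted proposal places roughly half of its draws in the tail and corrects for this through the weights.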
Assessment of significant psychological distress at the end of pregnancy and associated factors.

Science.gov (United States)

Lorén-Guerrero, L; Gascón-Catalán, A; Pasierb, D; Romero-Cardiel, M A

2018-06-01

This study aimed to assess the prevalence of mental distress at the end of pregnancy and after birth and the impact of selected socio-demographic and obstetric factors. This was a cross-sectional study. The sample consisted of 351 puerperal women aged 18 and over. Sociodemographic and obstetric variables were collected; significant psychological distress was assessed with the General Health Questionnaire (GHQ-28). Multivariable logistic regressions were used to investigate associations. The prevalence of significant mental distress amounted to 81.2%, mostly related to social relationships and anxiety. Women who reported more stress during pregnancy also had significantly increased emotional distress before the birth as well as during early puerperium, with increased somatic symptoms (p Psychological distress at the end of a full-term pregnancy and in the postpartum period occurs frequently and was associated mainly with stress experienced during pregnancy and parity. It is advisable to perform a proper assessment of stress and significant psychological distress at the early stage of pregnancy and repeatedly later on until delivery. Information and support from professionals can help to decrease and prevent their negative impact on maternal and fetal health, as observed in the current evidence.
Estimating Effect Sizes and Expected Replication Probabilities from GWAS Summary Statistics

DEFF Research Database (Denmark)

Holland, Dominic; Wang, Yunpeng; Thompson, Wesley K

2016-01-01

Genome-wide Association Studies (GWAS) result in millions of summary statistics ("z-scores") for single nucleotide polymorphism (SNP) associations with phenotypes. These rich datasets afford deep insights into the nature and extent of genetic contributions to complex phenotypes such as psychiatric......-scores, as such knowledge would enhance causal SNP and gene discovery, help elucidate mechanistic pathways, and inform future study design. Here we present a parsimonious methodology for modeling effect sizes and replication probabilities, relying only on summary statistics from GWAS substudies, and a scheme allowing...... for estimating the degree of polygenicity of the phenotype and predicting the proportion of chip heritability explainable by genome-wide significant SNPs in future studies with larger sample sizes. We apply the model to recent GWAS of schizophrenia (N = 82,315) and putamen volume (N = 12,596), with approximately...
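As a small illustration of how GWAS z-scores relate to significance calls, the sketch below converts z-scores to two-sided p-values and flags genome-wide significant SNPs. It is not the authors' effect-size model; the 5e-8 threshold is the conventional genome-wide cutoff and is an assumption here, as are the function names.

```python
import math

# Conventional genome-wide significance threshold (assumed; not stated in the abstract).
GENOME_WIDE_ALPHA = 5e-8


def two_sided_p(z):
    """Two-sided p-value of a z-score under the standard normal null."""
    return math.erfc(abs(z) / math.sqrt(2.0))


def genome_wide_significant(z_scores, alpha=GENOME_WIDE_ALPHA):
    """Return the indices of z-scores whose two-sided p-value is below alpha."""
    return [i for i, z in enumerate(z_scores) if two_sided_p(z) < alpha]


if __name__ == "__main__":
    example = [0.5, 5.0, 6.0, -5.6]  # hypothetical z-scores
    for z in example:
        print(z, two_sided_p(z))
    print(genome_wide_significant(example))
```

Under this threshold a SNP needs an absolute z-score of roughly 5.45 or more, which is why larger samples, which inflate z-scores for true associations, increase the number of significant SNPs.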
Are studies reporting significant results more likely to be published?

Science.gov (United States)

Koletsi, Despina; Karagianni, Anthi; Pandis, Nikolaos; Makou, Margarita; Polychronopoulou, Argy; Eliades, Theodore

2009-11-01

Our objective was to assess the hypothesis that there are variations of the proportion of articles reporting a significant effect, with a higher percentage of those articles published in journals with impact factors. The contents of 5 orthodontic journals (American Journal of Orthodontics and Dentofacial Orthopedics, Angle Orthodontist, European Journal of Orthodontics, Journal of Orthodontics, and Orthodontics and Craniofacial Research), published between 2004 and 2008, were hand-searched. Articles with statistical analysis of data were included in the study and classified into 4 categories: behavior and psychology, biomaterials and biomechanics, diagnostic procedures and treatment, and craniofacial growth, morphology, and genetics. In total, 2622 articles were examined, with 1785 included in the analysis. Univariate and multivariate logistic regression analyses were applied with statistical significance as the dependent variable, and whether the journal had an impact factor, the subject, and the year were the independent predictors. A higher percentage of articles showed significant results relative to those without significant associations (on average, 88% vs 12%) for those journals. Overall, these journals published significantly more studies with significant results, ranging from 75% to 90% (P = 0.02). Multivariate modeling showed that journals with impact factors had a 100% increased probability of publishing a statistically significant result compared with journals with no impact factor (odds ratio [OR], 1.99; 95% CI, 1.19-3.31). Compared with articles on biomaterials and biomechanics, all other subject categories showed lower probabilities of significant results. Nonsignificant findings in behavior and psychology and diagnosis and treatment were 1.8 (OR, 1.75; 95% CI, 1.51-2.67) and 3.5 (OR, 3.50; 95% CI, 2.27-5.37) times more likely to be published, respectively. Journals seem to prefer reporting significant results; this might be because of authors
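An odds ratio compares odds rather than probabilities, so the change in probability implied by an odds ratio such as 1.99 depends on the baseline rate. The sketch below makes that conversion explicit; the baseline of 0.80 is a hypothetical value chosen for illustration and is not taken from the study.

```python
def probability_from_odds_ratio(baseline_probability, odds_ratio):
    """Probability in the comparison group implied by a baseline probability and an odds ratio."""
    if not 0.0 < baseline_probability < 1.0:
        raise ValueError("baseline_probability must lie strictly between 0 and 1")
    if odds_ratio <= 0.0:
        raise ValueError("odds_ratio must be positive")
    baseline_odds = baseline_probability / (1.0 - baseline_probability)
    new_odds = odds_ratio * baseline_odds
    return new_odds / (1.0 + new_odds)


if __name__ == "__main__":
    baseline = 0.80  # hypothetical share of significant results in the reference group
    print(probability_from_odds_ratio(baseline, 1.99))
```

With this hypothetical baseline, an odds ratio of 1.99 moves the probability from 0.80 to about 0.89: the odds roughly double, but the probability does not.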
Preovulatory progesterone concentration associates significantly to follicle number and LH concentration but not to pregnancy rate

DEFF Research Database (Denmark)

Yding Andersen, Claus; Bungum, Leif; Nyboe Andersen, Anders

2011-01-01

Using data from a large prospective randomized controlled trial that evaluated the effect of recombinant LH (rLH) co-administration for ovarian stimulation, the present study assessed whether progesterone concentration on the day of human chorionic gonadotrophin (HCG) administration was associated...... with or without rLH administration from day 6 of stimulation. There was no significant association between the late-follicular-phase progesterone concentration and the clinical pregnancy rate. However, progesterone concentration was strongly associated with the number of follicles and retrieved oocytes. Late......-follicular-phase LH concentration also showed a significant positive association with progesterone concentration (P = 0.018). Administration of rLH during ovarian stimulation did not affect progesterone concentration. The present study does not support an association between progesterone concentration on the day...
Significant variables associated with epilepsy

International Nuclear Information System (INIS)

Cheema, F.A.; Qayyum, K.; Ahmad, N.; Makhdoomi, A.; Safdar, A.; Asif, A.; Chaudhry, H.R.

2003-01-01

Objective: To study the characteristics of epileptic patients and the risk factors contributing to the development of epilepsy. Results: The majority of the subjects were single (77.84%); 25.95% were first-born among their siblings, 50.63% belonged to a low social class, and 25.31% were unemployed. The major risk factors were a family history of illness (23.52%) and a medical problem around birth (12.66%). A family history of illness, a medical problem around birth and advanced maternal age at birth were associated with early onset of epilepsy. Vulnerability to epilepsy was also increased among hospital deliveries. Conclusion: Although the present study has identified various risk factors, the results need to be confirmed through case-control studies. (author)
Use of statistical procedures in Brazilian and international dental journals.

Science.gov (United States)

Ambrosano, Gláucia Maria Bovi; Reis, André Figueiredo; Giannini, Marcelo; Pereira, Antônio Carlos

2004-01-01

A descriptive survey was performed in order to assess the statistical content and quality of Brazilian and international dental journals, and compare their evolution throughout the last decades. The authors identified the reporting and accuracy of statistical techniques in 1000 papers published from 1970 to 2000 in seven dental journals: three Brazilian (Brazilian Dental Journal, Revista de Odontologia da Universidade de Sao Paulo and Revista de Odontologia da UNESP) and four international journals (Journal of the American Dental Association, Journal of Dental Research, Caries Research and Journal of Periodontology). Papers were divided into two time periods: from 1970 to 1989, and from 1990 to 2000. A slight increase in the number of articles that presented some form of statistical technique was noticed for Brazilian journals (from 61.0 to 66.7%), whereas for international journals, a significant increase was observed (65.8 to 92.6%). In addition, a decrease in the number of statistical errors was verified. The most commonly used statistical tests as well as the most frequent errors found in dental journals were assessed. Hopefully, this investigation will encourage dental educators to better plan the teaching of biostatistics, and to improve the statistical quality of submitted manuscripts.
Dengue hemorrhagic fever and typhoid fever association based on spatial standpoint using scan statistics in DKI Jakarta

Science.gov (United States)

Hervind, Widyaningsih, Y.

2017-07-01

Concurrent infection with multiple infectious agents may occur in one patient, and is reported frequently for dengue hemorrhagic fever (DHF) and typhoid fever. This paper describes the association between DHF and typhoid from a spatial point of view. Because data on dengue and typhoid co-infection are scarce, the data used are the numbers of patients with each disease in every district (kecamatan) of Jakarta in 2014 and 2015, obtained from the Jakarta surveillance website. Poisson spatial scan statistics are used to detect DHF and typhoid hotspot districts in Jakarta separately. After the hotspots are obtained, Fisher's exact test is applied to validate the association between the two diseases' hotspots. The results show that the hotspots of DHF and typhoid are located around central Jakarta. Further analysis uses Poisson space-time scan statistics to reveal hotspots in both space and time. DHF and typhoid fever are more likely to occur from January until May, in an area relatively similar to the purely spatial result. Preventive action could be taken especially in the hotspot areas, and further study is required to identify the causes based on the characteristics of the hotspot areas.
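The hotspot comparison described above can be sketched as a one-sided Fisher's exact test on a 2x2 table of districts, classified by membership in the DHF hotspot and in the typhoid hotspot. The counts in the usage example are hypothetical and are not the study's data; the function name is illustrative.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact test for positive association in the table [[a, b], [c, d]].

    Returns P(X >= a), where X is hypergeometric with the table's margins fixed.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denominator = comb(row1 + row2, col1)
    upper = min(row1, col1)
    tail = sum(comb(row1, k) * comb(row2, col1 - k) for k in range(a, upper + 1))
    return tail / denominator


if __name__ == "__main__":
    # Hypothetical district counts:
    #                      typhoid hotspot   not typhoid hotspot
    # DHF hotspot                 8                  2
    # not DHF hotspot             4                 30
    print(fisher_exact_greater(8, 2, 4, 30))
```

A small p-value here would indicate that districts in one disease's hotspot fall in the other's hotspot more often than expected if the two classifications were independent.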
Mitochondrial DNA content in embryo culture medium is significantly associated with human embryo fragmentation.

Science.gov (United States)

Stigliani, S; Anserini, P; Venturini, P L; Scaruffi, P

2013-10-01

Is the amount of cell-free DNA released by human embryos into culture medium correlated with embryo morphological features? The mitochondrial DNA (mtDNA) content of culture medium is significantly associated with the fragmentation rate on Days 2 and 3 of embryo development, whether the oocyte came from women ≤ 35 or >35 years old. Cellular fragmentation is often utilized as one of the morphological parameters for embryo quality assessment. The amount of cellular fragments is considered to be an important morphological parameter for embryo implantation potential. It has been hypothesized that fragments are apoptotic bodies or anuclear cytoplasmatic pieces of blastomeres, although no definitive conclusion has been drawn about their pathogenesis. Human fertilized oocytes were individually cultured from Day 1 to Days 2 and 3. A total of 800 samples (166 spent media from Day 2 and 634 from Day 3) were enrolled into the present study. Double-stranded DNA (dsDNA) was quantified in 800 spent embryo culture media by Pico Green dye fluorescence assay. After DNA purification, genomic DNA (gDNA) and mtDNA were profiled by specific quantitative PCR. Statistical analyses defined correlations among DNA contents, embryo morphology and maternal age. Different independent tests confirmed the presence of DNA in embryo culture medium and, for the first time, we demonstrate that both gDNA and mtDNA are detectable in the secretome. The amount of DNA is larger in embryos with bad quality cleavage compared with high-grade embryos, suggesting that the DNA profile of culture medium is an objective marker for embryo quality assessment. In particular, DNA profiles are significantly associated with fragmentation feature (total dsDNA: P = 0.0010; mtDNA: P = 0.0247) and advanced maternal age. It is necessary to establish whether DNA profiling of spent embryo culture medium is a robust onsite test that can improve the prediction of blastulation, implantation and/or pregnancy rate. The
The Application of Strength of Association Statistics to the Item Analysis of an In-Training Examination in Diagnostic Radiology.

Science.gov (United States)

Diamond, James J.; McCormick, Janet

1986-01-01

Using item responses from an in-training examination in diagnostic radiology, the application of a strength of association statistic to the general problem of item analysis is illustrated. Criteria for item selection, general issues of reliability, and error of measurement are discussed. (Author/LMO)
Levels of pregnancy-associated plasma protein-A in patients with coronary heart diseases and clinic significance

International Nuclear Information System (INIS)

Wang Lingyan; Cai Gaojun; Zhang Wenwei; Wang Wenzhi; Sun Wenwei; Yan Weiqun

2006-01-01

Objective: To explore the relationship between pregnancy-associated plasma protein-A (PAPP-A) and the occurrence and development of cardiovascular diseases, and lipids. Methods: 75 patients with coronary disease were divided into acute myocardial infarction (n=32), unstable angina pectoris (n=22) and stable angina pectoris (n=21) groups, and 60 subjects without coronary diseases were used as controls. Serum PAPP-A, IL-6, IL-10 and lipids were measured in all patients and controls by enzymatically amplified two-step sandwich-type immunoassay, double-antibody radioimmunoassay, ABC-HRP, and an automated biochemical analyzer. Results: (1) The level of PAPP-A in acute coronary syndrome (ACS, including acute myocardial infarction and unstable angina pectoris) patients was significantly higher than that in stable angina pectoris patients and controls (P<0.05). (2) There were significant associations between PAPP-A and serum total cholesterol and ApoA1/ApoB (r=0.348, 0.420, P<0.05). (3) The levels of IL-6 and IL-10 in coronary heart disease patients were significantly higher than those in controls (P<0.05), and the differences among acute myocardial infarction, unstable angina pectoris and stable angina pectoris patients were significant (P<0.05). There were significant associations between PAPP-A and IL-6 and IL-10 (Spearman r = 0.446, 0.523, P<0.05). Conclusion: PAPP-A is significantly associated with the occurrence and development of coronary heart disease, and is probably a marker of unstable plaque in coronary heart disease. (authors)
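The abstract reports Spearman correlations between PAPP-A and the interleukins. As a minimal sketch of how such a coefficient is obtained, the code below ranks both variables (averaging tied ranks) and computes the Pearson correlation of the ranks. The function names and the example values are hypothetical, not the study's measurements.

```python
import math


def average_ranks(values):
    """Ranks starting at 1, with tied values sharing the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rank correlation: the Pearson correlation of the two rank vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length, at least 2")
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(x)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0.0 or var_y == 0.0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    papp_a = [4.1, 6.3, 5.0, 9.8, 7.2]  # hypothetical serum levels
    il_6 = [2.0, 3.5, 3.1, 6.0, 3.3]    # hypothetical serum levels
    print(spearman_rho(papp_a, il_6))
```

Because only ranks enter the calculation, the coefficient is unaffected by monotone transformations of either marker, which suits skewed biomarker distributions.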
Identifying clusters of active transportation using spatial scan statistics.

Science.gov (United States)

Huang, Lan; Stinchcomb, David G; Pickle, Linda W; Dill, Jennifer; Berrigan, David

2009-08-01

There is an intense interest in the possibility that neighborhood characteristics influence active transportation such as walking or biking. The purpose of this paper is to illustrate how a spatial cluster identification method can evaluate the geographic variation of active transportation and identify neighborhoods with unusually high/low levels of active transportation. Self-reported walking/biking prevalence, demographic characteristics, street connectivity variables, and neighborhood socioeconomic data were collected from respondents to the 2001 California Health Interview Survey (CHIS; N=10,688) in Los Angeles County (LAC) and San Diego County (SDC). Spatial scan statistics were used to identify clusters of high or low prevalence (with and without age-adjustment) and the quantity of time spent walking and biking. The data, a subset from the 2001 CHIS, were analyzed in 2007-2008. Geographic clusters of significantly high or low prevalence of walking and biking were detected in LAC and SDC. Structural variables such as street connectivity and shorter block lengths are consistently associated with higher levels of active transportation, but associations between active transportation and socioeconomic variables at the individual and neighborhood levels are mixed. Only one cluster with less time spent walking and biking among walkers/bikers was detected in LAC, and this was of borderline significance. Age-adjustment affects the clustering pattern of walking/biking prevalence in LAC, but not in SDC. The use of spatial scan statistics to identify significant clustering of health behaviors such as active transportation adds to the more traditional regression analysis that examines associations between behavior and environmental factors by identifying specific geographic areas with unusual levels of the behavior independent of predefined administrative units.
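To make the scan-statistic idea concrete, the sketch below scores candidate zones with the log-likelihood ratio of Kulldorff's Poisson model and picks the highest-scoring zone. The abstract does not state which probability model was used, so the Poisson form is an assumption, and the zone names and counts are hypothetical. In practice the significance of the top zone is judged by Monte Carlo replication under the null, which is omitted here.

```python
import math


def poisson_scan_llr(cases_in, expected_in, total_cases):
    """Log-likelihood ratio for an elevated rate inside a candidate zone.

    expected_in is the expected case count in the zone under the null hypothesis,
    scaled so that expected counts over the whole study area sum to total_cases.
    Zones without an excess of cases score 0.
    """
    if not 0.0 < expected_in < total_cases:
        raise ValueError("expected_in must lie strictly between 0 and total_cases")
    if cases_in <= expected_in:
        return 0.0
    cases_out = total_cases - cases_in
    expected_out = total_cases - expected_in
    llr = cases_in * math.log(cases_in / expected_in)
    if cases_out > 0:
        llr += cases_out * math.log(cases_out / expected_out)
    return llr


def most_likely_cluster(zones, total_cases):
    """Return (name, llr) for the zone with the largest log-likelihood ratio.

    zones maps a zone name to a (cases_in, expected_in) pair.
    """
    scored = {name: poisson_scan_llr(c, e, total_cases) for name, (c, e) in zones.items()}
    best = max(scored, key=scored.get)
    return best, scored[best]


if __name__ == "__main__":
    zones = {"A": (20, 10.0), "B": (12, 11.0), "C": (5, 9.0)}  # hypothetical zones
    print(most_likely_cluster(zones, 100))
```

Scanning for low-prevalence clusters, as the study also does, would use the same expression with the condition reversed so that zones with a deficit of cases are scored instead.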
Statistical lamb wave localization based on extreme value theory

Science.gov (United States)

Harley, Joel B.

2018-04-01

Guided wave localization methods based on delay-and-sum imaging, matched field processing, and other techniques have been designed and researched to create images that locate and describe structural damage. The maximum value of these images typically represents an estimated damage location. Yet, it is often unclear if this maximum value, or any other value in the image, is a statistically significant indicator of damage. Furthermore, there are currently few, if any, approaches to assess the statistical significance of guided wave localization images. As a result, we present statistical delay-and-sum and statistical matched field processing localization methods to create statistically significant images of damage. Our framework uses constant rate of false alarm statistics and extreme value theory to detect damage with little prior information. We demonstrate our methods with in situ guided wave data from an aluminum plate to detect two 0.75 cm diameter holes. Our results show an expected improvement in statistical significance as the number of sensors increases. With seventeen sensors, both methods successfully detect damage with statistical significance.
Enrichment of statistical power for genome-wide association studies

Science.gov (United States)

The inheritance of most human diseases and agriculturally important traits is controlled by many genes with small effects. Identifying these genes, while simultaneously controlling false positives, is challenging. Among available statistical methods, the mixed linear model (MLM) has been the most fl...
The Relationship between Test Anxiety and Academic Performance of Students in Vital Statistics Course

Directory of Open Access Journals (Sweden)

Shirin Iranfar

2013-12-01

Introduction: Test anxiety is a common phenomenon among students and is one of the problems of the educational system. The present study was conducted to investigate test anxiety in a vital statistics course and its association with the academic performance of students at Kermanshah University of Medical Sciences. This study was descriptive-analytical, and the study sample included students of the nursing and midwifery, paramedicine and health faculties who had taken the vital statistics course and were selected through the census method. The Sarason questionnaire was used to assess test anxiety. Data were analyzed by descriptive and inferential statistics. The findings indicated no significant correlation between test anxiety and the score in the vital statistics course.

Significant association of SREBP-2 genetic polymorphisms with avascular necrosis in the Korean population

Directory of Open Access Journals (Sweden)

Park Eui

2008-10-01

Background: It is known that steroid usage and alcohol abuse are major etiological factors in the development of avascular necrosis (AVN), a bone disease that produces osteonecrosis of the femoral head. The facilitation of fat biosynthesis by steroids and alcohol disrupts the blood supply into the femoral head. SREBP-2 plays a central role in the maintenance of lipid homeostasis through stimulating expression of genes associated with cholesterol biosynthetic pathways. The aim of this study was to examine the association between the polymorphisms of the SREBP-2 gene and AVN susceptibility in the Korean population. Methods: Four single nucleotide polymorphisms (SNPs) in the SREBP-2 gene, IVS1+8408 T>C (rs2267439), IVS3-342 G>T (rs2269657), IVS11+414 G>A (rs1052717) and IVS12-1667 G>A (rs2267443), were selected from public databases and genotyped in 443 AVN patients and 273 control subjects by using single-base extension (SBE) genotyping. Results: The minor allele (C) frequency of rs2267439 showed a significant protective effect on AVN (P = 0.01; OR, 0.75; 95% CI, 0.604–0.935), and the genotype frequencies of this polymorphism were also different from the controls in all alternative analysis models (P range, 0.009–0.03; OR, 0.647–0.744). In contrast, rs1052717 and rs2267443 polymorphisms were significantly associated with AVN risk. Further analysis based on pathological etiology showed that the genotypes of rs2267439, rs1052717 and rs2267443 were also significantly associated with AVN susceptibility in each subgroup. Conclusion: This study is the first report to evaluate the association between SREBP-2 gene polymorphisms and the susceptibility of AVN in the Korean population.
Statistical Model of Extreme Shear

DEFF Research Database (Denmark)

Larsen, Gunner Chr.; Hansen, Kurt Schaldemose

2004-01-01

In order to continue cost-optimisation of modern large wind turbines, it is important to continuously increase the knowledge on wind field parameters relevant to design loads. This paper presents a general statistical model that offers site-specific prediction of the probability density function...... by a model that, on a statistically consistent basis, describes the most likely spatial shape of an extreme wind shear event. Predictions from the model have been compared with results from an extreme value data analysis, based on a large number of high-sampled full-scale time series measurements...... are consistent, given the inevitable uncertainties associated with the model as well as with the extreme value data analysis. Keywords: Statistical model, extreme wind conditions, statistical analysis, turbulence, wind loading, wind shear, wind turbines.
The significance of Good Chair as part of children’s school and home environment in the preventive treatment of body statistics distortions

OpenAIRE

Mirosław Mrozkowiak; Hanna Żukowska

2015-01-01

Mrozkowiak Mirosław, Żukowska Hanna. Znaczenie Dobrego Krzesła, jako elementu szkolnego i domowego środowiska ucznia, w profilaktyce zaburzeń statyki postawy ciała = The significance of Good Chair as part of children’s school and home environment in the preventive treatment of body statistics distortions. Journal of Education, Health and Sport. 2015;5(7):179-215. ISSN 2391-8306. DOI 10.5281/zenodo.19832 http://ojs.ukw.edu.pl/index.php/johs/article/view/2015%3B5%287%29%3A179-215 https:...
A Bayesian Framework for Multiple Trait Colocalization from Summary Association Statistics.

Science.gov (United States)

Giambartolomei, Claudia; Zhenli Liu, Jimmy; Zhang, Wen; Hauberg, Mads; Shi, Huwenbo; Boocock, James; Pickrell, Joe; Jaffe, Andrew E; Pasaniuc, Bogdan; Roussos, Panos

2018-03-19

Most genetic variants implicated in complex diseases by genome-wide association studies (GWAS) are non-coding, making it challenging to understand the causative genes involved in disease. Integrating external information such as quantitative trait locus (QTL) mapping of molecular traits (e.g., expression, methylation) is a powerful approach to identify the subset of GWAS signals explained by regulatory effects. In particular, expression QTLs (eQTLs) help pinpoint the responsible gene among the GWAS regions that harbor many genes, while methylation QTLs (mQTLs) help identify the epigenetic mechanisms that impact gene expression which in turn affect disease risk. In this work we propose multiple-trait-coloc (moloc), a Bayesian statistical framework that integrates GWAS summary data with multiple molecular QTL data to identify regulatory effects at GWAS risk loci. We applied moloc to schizophrenia (SCZ) and eQTL/mQTL data derived from human brain tissue and identified 52 candidate genes that influence SCZ through methylation. Our method can be applied to any GWAS and relevant functional data to help prioritize disease associated genes. moloc is available for download as an R package (https://github.com/clagiamba/moloc). We also developed a web site to visualize the biological findings (icahn.mssm.edu/moloc). The browser allows searches by gene, methylation probe, and scenario of interest. Contact: claudia.giambartolomei@gmail.com. Supplementary data are available at Bioinformatics online.
Can a significance test be genuinely Bayesian?

OpenAIRE

Pereira, Carlos A. de B.; Stern, Julio Michael; Wechsler, Sergio

2008-01-01

The Full Bayesian Significance Test, FBST, is extensively reviewed. Its test statistic, a genuine Bayesian measure of evidence, is discussed in detail. Its behavior in some problems of statistical inference like testing for independence in contingency tables is discussed.
Brain edema associated with intracranial meningiomas

International Nuclear Information System (INIS)

Asahi, Minoru; Kikuchi, Haruhiko; Hirai, Osamu

1992-01-01

Brain edema associated with intracranial meningiomas was investigated in 80 patients, excluding recurrent cases. Statistically significant positive correlations with the degree of edema were found for large tumors, convexity or parasagittal locations, venous outflow disturbance, and evidence of cortical disruption or peritumoral enhancement visualized on computed tomography or magnetic resonance imaging. Patients with a short clinical history and with angiographic evidence of hypervascularity tended to have edema, but this was not statistically significant. It is concluded that various factors are responsible for the edema associated with meningiomas and that it would be hard to determine the most important cause, since each factor plays a part in edema production, spread, and resolution. (author)
Elevated myeloid-derived suppressor cells in pancreatic, esophageal and gastric cancer are an independent prognostic factor and are associated with significant elevation of the Th2 cytokine interleukin-13.

Science.gov (United States)

Gabitass, Rachel F; Annels, Nicola E; Stocken, Deborah D; Pandha, Hardev A; Middleton, Gary W

2011-10-01

We undertook a comprehensive analysis of circulating myeloid-derived suppressor cells (MDSCs) and T regulatory cells (Tregs) in pancreatic, esophageal and gastric cancer patients and investigated whether MDSCs are an independent prognostic factor for survival. We evaluated a series of plasma cytokines and in particular re-evaluated the Th2 cytokine interleukin-13 (IL-13). Peripheral blood was collected from 131 cancer patients (46 pancreatic, 60 esophageal and 25 gastric) and 54 healthy controls. PBMC were harvested with subsequent flow cytometric analysis of MDSC (HLADR(-) Lin1(low/-) CD33(+) CD11b(+)) and Treg (CD4(+) CD25(+) CD127(low/-) FoxP3(+)) percentages. Plasma IL-2, IL-4, IL-5, IL-6, IL-10, IL-12 (p70), IL-13, IL-17, G-CSF, IFN-γ, TNF-α and VEGF levels were analyzed by the Bio-Plex cytokine assay. Plasma arginase I levels were analyzed by ELISA. MDSCs and Tregs were statistically significantly elevated in pancreatic, esophageal and gastric cancer compared with controls, and MDSC numbers correlated with Treg levels. Increasing MDSC percentage was associated with increased risk of death, and in a multivariate analysis, MDSC level was an independent prognostic factor for survival. A unit increase in MDSC percentage was associated with a 22% increased risk of death (hazard ratio 1.22, 95% confidence interval 1.06-1.41). Arginase I levels were also statistically significantly elevated in upper gastrointestinal cancer patients compared with controls. There was Th2 skewing for cytokine production in all three diseases, and importantly there were significant elevations of the pivotal Th2 cytokine interleukin-13, an increase that correlated with MDSC levels.
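The reported hazard ratio of 1.22 applies to a one-unit increase in MDSC percentage. Assuming the usual Cox-model form, in which the log hazard is linear in the covariate (an assumption; the abstract does not describe the model in that detail), the ratio for a larger difference follows by exponentiation, and the confidence limits scale the same way. The function name is illustrative.

```python
import math


def scale_hazard_ratio(hr_per_unit, units):
    """Hazard ratio for a difference of `units`, assuming a log-linear covariate effect."""
    if hr_per_unit <= 0.0:
        raise ValueError("hr_per_unit must be positive")
    return math.exp(units * math.log(hr_per_unit))


if __name__ == "__main__":
    # Point estimate and 95% confidence limits as reported in the abstract.
    for label, hr in (("estimate", 1.22), ("lower", 1.06), ("upper", 1.41)):
        print(label, scale_hazard_ratio(hr, 5))
```

For a five-unit difference this gives a hazard ratio of about 2.7, which shows how a modest per-unit effect compounds across the observed range of a continuous marker.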
Statistical Significance, Effect Size Reporting, and Confidence Intervals: Best Reporting Strategies

Science.gov (United States)

Capraro, Robert M.

2004-01-01

With great interest the author read the May 2002 editorial in the "Journal for Research in Mathematics Education (JRME)" (King, 2002) regarding changes to the 5th edition of the "Publication Manual of the American Psychological Association" (APA, 2001). Of special note to him, and of great import to the field of mathematics education research, are…
Fisher statistics for analysis of diffusion tensor directional information.

Science.gov (United States)

Hutchinson, Elizabeth B; Rutecki, Paul A; Alexander, Andrew L; Sutula, Thomas P

2012-04-30

A statistical approach is presented for the quantitative analysis of diffusion tensor imaging (DTI) directional information using Fisher statistics, which were originally developed for the analysis of vectors in the field of paleomagnetism. In this framework, descriptive and inferential statistics have been formulated based on the Fisher probability density function, a spherical analogue of the normal distribution. The Fisher approach was evaluated for investigation of rat brain DTI maps to characterize tissue orientation in the corpus callosum, fornix, and hilus of the dorsal hippocampal dentate gyrus, and to compare directional properties in these regions following status epilepticus (SE) or traumatic brain injury (TBI) with values in healthy brains. Direction vectors were determined for each region of interest (ROI) for each brain sample and Fisher statistics were applied to calculate the mean direction vector and variance parameters in the corpus callosum, fornix, and dentate gyrus of normal rats and rats that experienced TBI or SE. Hypothesis testing was performed by calculation of Watson's F-statistic and associated p-value giving the likelihood that grouped observations were from the same directional distribution. In the fornix and midline corpus callosum, no directional differences were detected between groups, however in the hilus, significant (pstatistical comparison of tissue structural orientation. Copyright © 2012 Elsevier B.V. All rights reserved.
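The descriptive part of Fisher statistics can be sketched as follows: normalise each direction vector, sum them, and read off the mean direction, the resultant length R, and the standard estimate of the precision parameter, k = (N - 1) / (N - R). This is a minimal illustration rather than the authors' pipeline. It assumes the vectors have already been oriented into a common hemisphere, since diffusion tensor eigenvectors have no intrinsic sign, and the function name and example vectors are hypothetical.

```python
import math


def fisher_mean_direction(vectors):
    """Return (mean_direction, resultant_length, kappa_estimate) for 3-D direction vectors."""
    if len(vectors) < 2:
        raise ValueError("need at least two vectors")
    sx = sy = sz = 0.0
    for x, y, z in vectors:
        norm = math.sqrt(x * x + y * y + z * z)
        if norm == 0.0:
            raise ValueError("zero-length vector has no direction")
        sx += x / norm
        sy += y / norm
        sz += z / norm
    n = len(vectors)
    resultant = math.sqrt(sx * sx + sy * sy + sz * sz)
    if resultant == 0.0:
        raise ValueError("directions cancel; mean direction is undefined")
    mean = (sx / resultant, sy / resultant, sz / resultant)
    spread = n - resultant
    # Perfectly aligned vectors give R = N, i.e. unbounded precision.
    kappa = (n - 1) / spread if spread > 1e-12 else math.inf
    return mean, resultant, kappa


if __name__ == "__main__":
    roi_vectors = [(0.9, 0.1, 0.1), (1.0, 0.0, 0.2), (0.8, 0.2, 0.0)]  # hypothetical
    print(fisher_mean_direction(roi_vectors))
```

A resultant length close to N (large k) indicates tightly clustered orientations within a region of interest, while a small value indicates dispersed orientations.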
Sequence data and association statistics from 12,940 type 2 diabetes cases and controls.

Science.gov (United States)

Flannick, Jason; Fuchsberger, Christian; Mahajan, Anubha; Teslovich, Tanya M; Agarwala, Vineeta; Gaulton, Kyle J; Caulkins, Lizz; Koesterer, Ryan; Ma, Clement; Moutsianas, Loukas; McCarthy, Davis J; Rivas, Manuel A; Perry, John R B; Sim, Xueling; Blackwell, Thomas W; Robertson, Neil R; Rayner, N William; Cingolani, Pablo; Locke, Adam E; Tajes, Juan Fernandez; Highland, Heather M; Dupuis, Josee; Chines, Peter S; Lindgren, Cecilia M; Hartl, Christopher; Jackson, Anne U; Chen, Han; Huyghe, Jeroen R; van de Bunt, Martijn; Pearson, Richard D; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M; Gamazon, Eric R; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A; Below, Jennifer E; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L; Pasko, Dorota; Parker, Stephen C J; Varga, Tibor V; Green, Todd; Beer, Nicola L; Day-Williams, Aaron G; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F; Han, Bok-Ghee; Jenkinson, Christopher P; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C Y; Palmer, Nicholette D; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D; Neale, Benjamin M; Purcell, Shaun; Butterworth, Adam S; Howson, Joanna M M; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K L; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H T; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E; Rybin, Dennis; Farook, Vidya S; Fowler, Sharon P; Freedman, Barry I; 
Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; Loh, Marie; Musani, Solomon K; Puppala, Sobha; Scott, William R; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C; Mangino, Massimo; Bonnycastle, Lori L; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L; Herder, Christian; Groves, Christopher J; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A; Doney, Alex S F; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeri; Hollensted, Mette; Jørgensen, Marit E; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H; Stirrups, Kathleen; Wood, Andrew R; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N A; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M; Syvänen, Ann-Christine; Bergman, Richard N; Bharadwaj, Dwaipayan; Bottinger, Erwin P; Cho, Yoon Shin; Chandak, Giriraj R; Chan, Juliana Cn; Chia, Kee Seng; Daly, Mark J; Ebrahim, Shah B; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A; Lehman, Donna M; Jia, Weiping; Ma, Ronald C W; Pollin, Toni I; 
Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J F; Small, Kerrin S; Ried, Janina S; DeFronzo, Ralph A; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R; Gloyn, Anna L; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D; Hattersley, Andrew T; Bowden, Donald W; Collins, Francis S; Atzmon, Gil; Chambers, John C; Spector, Timothy D; Laakso, Markku; Strom, Tim M; Bell, Graeme I; Blangero, John; Duggirala, Ravindranath; Tai, E Shyong; McVean, Gilean; Hanis, Craig L; Wilson, James G; Seielstad, Mark; Frayling, Timothy M; Meigs, James B; Cox, Nancy J; Sladek, Rob; Lander, Eric S; Gabriel, Stacey; Mohlke, Karen L; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Scott, Laura J; Morris, Andrew P; Kang, Hyun Min; Altshuler, David; Burtt, Noël P; Florez, Jose C; Boehnke, Michael; McCarthy, Mark I

2017-12-19

To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1-5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D.
Substorm associated radar auroral surges: a statistical study and possible generation model

Directory of Open Access Journals (Sweden)

B. A. Shand

Substorm-associated radar auroral surges (SARAS) are a short-lived (15–90 minutes) and spatially localised (~5° of latitude) perturbation of the plasma convection pattern observed within the auroral E-region. The understanding of such phenomena has important ramifications for the investigation of the larger scale plasma convection and ultimately the coupling of the solar wind, magnetosphere and ionosphere system. A statistical investigation is undertaken of SARAS, observed by the Sweden And Britain Radar Experiment (SABRE), in order to provide a more extensive examination of the local time occurrence and propagation characteristics of the events. The statistical analysis has determined a local time occurrence of observations between 1420 MLT and 2200 MLT with a maximum occurrence centred around 1700 MLT. The propagation velocity of the SARAS feature through the SABRE field of view was found to be predominantly L-shell aligned with a velocity centred around 1750 m s^–1 and within the range 500 m s^–1 to 3500 m s^–1. This comprehensive examination of the SARAS provides the opportunity to discuss, qualitatively, a possible generation mechanism for SARAS based on a proposed model for the production of a similar phenomenon referred to as sub-auroral ion drifts (SAIDs). The results of the comparison suggest that SARAS may result from a similar geophysical mechanism to that which produces SAID events, but probably occur at a different time in the evolution of the event.

Key words. Substorms · Auroral surges · Plasma convection · Sub-auroral ion drifts
Significant association of serum creatinine with HbA1C in impaired glucose tolerant Pakistani subjects.

Science.gov (United States)

Farasat, Tasnim; Sharif, Saima; Naz, Shagufta; Fazal, Sabiha

2015-01-01

The present study was conducted to assess the serum concentration of creatinine and determine its relationship with potential risk factors of diabetes in subjects with impaired glucose tolerance (IGT). This cross-sectional study was conducted on 100 IGT patients who attended Amin Hayat diabetic center in Lahore from January 2011 to June 2011. Patients of both sexes aged 34-67 years were included in the study. Demographic parameters such as age, BMI, WHR, blood pressure, personal history and socioeconomic status were recorded. An oral glucose tolerance test was performed. The biochemical parameters, including HbA1c, lipid profile, urea, uric acid, creatinine and bilirubin level, were measured by chemistry analyzer. A strong correlation between creatinine and HbA1c was observed. The level of creatinine was also significantly associated with age in IGT subjects. Creatinine was non-significantly correlated with cholesterol, LDL-Chol and TG, while it was significantly negatively associated with BMI, fasting blood glucose and HDL-Chol. The present study concluded that serum creatinine is significantly associated with HbA1c, BMI and HDL cholesterol.
Statistical-Dynamical Seasonal Forecasts of Central-Southwest Asian Winter Precipitation.

Science.gov (United States)

Tippett, Michael K.; Goddard, Lisa; Barnston, Anthony G.

2005-06-01

Interannual precipitation variability in central-southwest (CSW) Asia has been associated with East Asian jet stream variability and western Pacific tropical convection. However, atmospheric general circulation models (AGCMs) forced by observed sea surface temperature (SST) poorly simulate the region's interannual precipitation variability. The statistical-dynamical approach uses statistical methods to correct systematic deficiencies in the response of AGCMs to SST forcing. Statistical correction methods linking model-simulated Indo-west Pacific precipitation and observed CSW Asia precipitation result in modest, but statistically significant, cross-validated simulation skill in the northeast part of the domain for the period from 1951 to 1998. The statistical-dynamical method is also applied to recent (winter 1998/99 to 2002/03) multimodel, two-tier December-March precipitation forecasts initiated in October. This period includes 4 yr (winter of 1998/99 to 2001/02) of severe drought. Tercile probability forecasts are produced using ensemble-mean forecasts and forecast error estimates. The statistical-dynamical forecasts show enhanced probability of below-normal precipitation for the four drought years and capture the return to normal conditions in part of the region during the winter of 2002/03. "May Kabul be without gold, but not without snow." (Traditional Afghan proverb)
Clinical analysis and prognostic significance of haemophagocytic lymphohistiocytosis-associated anaplastic large cell lymphoma in children.

Science.gov (United States)

Pasqualini, Claudia; Minard-Colin, Veronique; Saada, Veronique; Lamant, Laurence; Delsol, Georges; Patte, Catherine; Le Deley, Marie-Cécile; Valteau-Couanet, Dominique; Brugières, Laurence

2014-04-01

Haemophagocytic lymphohistiocytosis (HLH) has been rarely described in children treated for an anaplastic large-cell lymphoma (ALCL). We evaluated the incidence, the clinical and histological characteristics and the prognosis of HLH-associated ALCL. The medical, biological, cytological and histological data of patients treated for ALK-positive ALCL in the paediatric department of a single institution between 1975 and 2008 were analysed and assessed for HLH according to the diagnostic criteria of the Histiocyte Society. Data concerning a series of 50 consecutive children with ALCL were reviewed. HLH-associated ALCL was observed in 12% of the patients. Lung involvement was significantly more frequent in HLH-associated ALCL patients than in the group without HLH (P = 0·004), as well as central nervous system (CNS) and bone marrow involvement (P = 0·001 and P = 0·007 respectively). The histological subtype in children with HLH-associated ALCL did not differ from that of the group without HLH. There was no significant difference between the two groups in 5-year EFS and OS (P = 0·91 and P > 0·99 respectively). In conclusion, HLH is not rare in paediatric ALCL. Despite a high incidence of visceral, CNS and bone marrow involvement, HLH does not seem to exert a significant impact on outcome in children treated for ALCL. © 2014 John Wiley & Sons Ltd.
A study of statistics anxiety levels of graduate dental hygiene students.

Science.gov (United States)

Welch, Paul S; Jacks, Mary E; Smiley, Lynn A; Walden, Carolyn E; Clark, William D; Nguyen, Carol A

2015-02-01

In light of increased emphasis on evidence-based practice in the profession of dental hygiene, it is important that today's dental hygienist comprehend statistical measures to fully understand research articles, and thereby apply scientific evidence to practice. Therefore, the purpose of this study was to investigate statistics anxiety among graduate dental hygiene students in the U.S. A web-based, anonymous self-report survey was emailed to directors of 17 MSDH programs in the U.S. with a request to distribute it to graduate students. The survey collected data on statistics anxiety, sociodemographic characteristics and evidence-based practice. Statistics anxiety was assessed using the Statistical Anxiety Rating Scale. Study significance level was α=0.05. Only 8 of the 17 invited programs participated in the study. Statistical Anxiety Rating Scale data revealed graduate dental hygiene students experience low to moderate levels of statistics anxiety. Specifically, the level of anxiety on the Interpretation Anxiety factor indicated this population could struggle with making sense of scientific research. A decisive majority (92%) of students indicated statistics is essential for evidence-based practice and should be a required course for all dental hygienists. This study served to identify statistics anxiety in a previously unexplored population. The findings should be useful in both theory building and in practical applications. Furthermore, the results can be used to direct future research. Copyright © 2015 The American Dental Hygienists’ Association.
Fundamentals of modern statistical methods substantially improving power and accuracy

CERN Document Server

Wilcox, Rand R

2001-01-01

Conventional statistical methods have a very serious flaw. They routinely miss differences among groups or associations among variables that are detected by more modern techniques - even under very small departures from normality. Hundreds of journal articles have described the reasons standard techniques can be unsatisfactory, but simple, intuitive explanations are generally unavailable. Improved methods have been derived, but they are far from obvious or intuitive based on the training most researchers receive. Situations arise where even highly nonsignificant results become significant when analyzed with more modern methods. Without assuming any prior training in statistics, Part I of this book describes basic statistical principles from a point of view that makes their shortcomings intuitive and easy to understand. The emphasis is on verbal and graphical descriptions of concepts. Part II describes modern methods that address the problems covered in Part I. Using data from actual studies, many examples are include...
Experimental statistics for biological sciences.

Science.gov (United States)

Bang, Heejung; Davidian, Marie

2010-01-01

In this chapter, we cover basic and fundamental principles and methods in statistics - from "What are Data and Statistics?" to "ANOVA and linear regression," which are the basis of any statistical thinking and undertaking. Readers can easily find the selected topics in most introductory statistics textbooks, but we have tried to assemble and structure them in a succinct and reader-friendly manner in a stand-alone chapter. This text has long been used in real classroom settings for both undergraduate and graduate students who do or do not major in statistical sciences. We hope that from this chapter, readers would understand the key statistical concepts and terminologies, how to design a study (experimental or observational), how to analyze the data (e.g., describe the data and/or estimate the parameter(s) and make inference), and how to interpret the results. This text would be most useful if it is used as a supplemental material, while the readers take their own statistical courses or it would serve as a great reference text associated with a manual for any statistical software as a self-teaching guide.
Statistics for experimentalists

CERN Document Server

Cooper, B E

2014-01-01

Statistics for Experimentalists aims to provide experimental scientists with a working knowledge of statistical methods and search approaches to the analysis of data. The book first elaborates on probability and continuous probability distributions. Discussions focus on properties of continuous random variables and normal variables, independence of two random variables, central moments of a continuous distribution, prediction from a normal distribution, binomial probabilities, and multiplication of probabilities and independence. The text then examines estimation and tests of significance. Topics include estimators and estimates, expected values, minimum variance linear unbiased estimators, sufficient estimators, methods of maximum likelihood and least squares, and the test of significance method. The manuscript ponders on distribution-free tests, Poisson process and counting problems, correlation and function fitting, balanced incomplete randomized block designs and the analysis of covariance, and experiment...
Statistical investigation of expected wave energy and its reliability

International Nuclear Information System (INIS)

Ozger, M.; Altunkaynak, A.; Sen, Z.

2004-01-01

The statistical behavior of wave energy at a single site is derived by considering simultaneous variations in the period and wave height. In this paper, the general wave power formulation is derived by using the theory of perturbation. This method leads to a general formulation of the wave power expectation and other statistical parameter expressions, such as standard deviation and coefficient of variation. The statistical parameters, namely the mean value and variance of wave energy, are found in terms of the simple statistical parameters of period, significant wave height and zero up-crossing period. The elegance of these parameters is that they are distribution free. These parameters provide a means for defining the wave energy distribution function by employing Chebyshev's inequality. Subsequently, an approximate probability distribution function of the wave energy is also derived for assessment of risk and reliability associated with wave energy. Necessary simple charts are given for risk and reliability assessments. Two procedures are presented for such assessments in wave energy calculations and the applications of these procedures are provided for wave energy potential assessment in the regions of the Pacific Ocean off the west coast of the U.S.
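The distribution-free reasoning mentioned in the abstract can be illustrated with Chebyshev's inequality, P(|X - μ| ≥ kσ) ≤ 1/k². The sketch below is only an illustration of that inequality, not the paper's formulation: the function name `chebyshev_interval` and the numbers in the usage line are hypothetical, and the mean and standard deviation of wave energy are assumed to be already known.

```python
def chebyshev_interval(mean, std, k):
    """Return (low, high, min_probability) for the band mean ± k * std.

    By Chebyshev's inequality, any distribution with this mean and standard
    deviation places at least 1 - 1/k**2 of its probability inside the band.
    """
    if std < 0 or k <= 0:
        raise ValueError("std must be non-negative and k must be positive")
    # For k <= 1 the inequality gives no information, so the bound is 0.
    min_probability = max(0.0, 1.0 - 1.0 / (k * k))
    return mean - k * std, mean + k * std, min_probability


# Hypothetical values: mean wave energy 10.0 and standard deviation 2.0
# (arbitrary units), with a band of two standard deviations.
low, high, reliability = chebyshev_interval(10.0, 2.0, 2.0)
```

The complement, 1 minus the returned probability, is an upper bound on the risk of falling outside the band; the bound is conservative because it holds for every distribution.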
Evaluating and Reporting Statistical Power in Counseling Research

Science.gov (United States)

Balkin, Richard S.; Sheperis, Carl J.

2011-01-01

Despite recommendations from the "Publication Manual of the American Psychological Association" (6th ed.) to include information on statistical power when publishing quantitative results, authors seldom include analysis or discussion of statistical power. The rationale for discussing statistical power is addressed, approaches to using "G*Power" to…
A weighted U statistic for association analyses considering genetic heterogeneity.

Science.gov (United States)

Wei, Changshuai; Elston, Robert C; Lu, Qing

2016-07-20

Converging evidence suggests that common complex diseases with the same or similar clinical manifestations could have different underlying genetic etiologies. While current research interests have shifted toward uncovering rare variants and structural variations predisposing to human diseases, the impact of heterogeneity in genetic studies of complex diseases has been largely overlooked. Most of the existing statistical methods assume the disease under investigation has a homogeneous genetic effect and could, therefore, have low power if the disease undergoes heterogeneous pathophysiological and etiological processes. In this paper, we propose a heterogeneity-weighted U (HWU) method for association analyses considering genetic heterogeneity. HWU can be applied to various types of phenotypes (e.g., binary and continuous) and is computationally efficient for high-dimensional genetic data. Through simulations, we showed the advantage of HWU when the underlying genetic etiology of a disease was heterogeneous, as well as the robustness of HWU against different model assumptions (e.g., phenotype distributions). Using HWU, we conducted a genome-wide analysis of nicotine dependence from the Study of Addiction: Genetics and Environments dataset. The genome-wide analysis of nearly one million genetic markers took 7 h, identifying heterogeneous effects of two new genes (i.e., CYP3A5 and IKBKB) on nicotine dependence. Copyright © 2016 John Wiley & Sons, Ltd.
Game Related Statistics Discriminating Between Starters and Nonstarters Players in Women’s National Basketball Association League (WNBA)

Science.gov (United States)

Gòmez, Miguel-Ángel; Lorenzo, Alberto; Ortega, Enrique; Sampaio, Jaime; Ibàñez, Sergio-José

2009-01-01

The aim of the present study was to identify the game-related statistics that discriminate between starters and nonstarters in women’s basketball when related to winning or losing games and best or worst teams. The sample comprised all 216 regular season games from the 2005 Women’s National Basketball Association League (WNBA). The game-related statistics included were 2- and 3-point field-goals (both successful and unsuccessful), free-throws (both successful and unsuccessful), defensive and offensive rebounds, assists, blocks, fouls, steals, turnovers and minutes played. Results from multivariate analysis showed that when best teams won, the discriminant game-related statistics were successful 2-point field-goals (SC = 0.47), successful free-throws (SC = 0.44), fouls (SC = -0.41), assists (SC = 0.37), and defensive rebounds (SC = 0.37). When the worst teams won, the discriminant game-related statistics were successful 2-point field-goals (SC = 0.37), successful free-throws (SC = 0.45), assists (SC = 0.58), and steals (SC = 0.35). The results showed that successful 2-point field-goals, successful free-throws and assists were the most powerful variables discriminating between starters and nonstarters. These specific characteristics helped to point out the importance of starters’ shooting and passing ability during competitions. Key points: The players’ game-related statistical profile varied according to team status, game outcome and team quality in women’s basketball. The results of this work help to point out the differences in player performance in women’s basketball compared with men’s basketball. The results obtained underline the importance of starters’ and nonstarters’ contribution to team performance in different game contexts. Results showed the power of successful 2-point field-goals, successful free-throws and assists in discriminating between starters and nonstarters in all the analyses. PMID:24149538
On two methods of statistical image analysis

NARCIS (Netherlands)

Missimer, J; Knorr, U; Maguire, RP; Herzog, H; Seitz, RJ; Tellman, L; Leenders, K.L.

1999-01-01

The computerized brain atlas (CBA) and statistical parametric mapping (SPM) are two procedures for voxel-based statistical evaluation of PET activation studies. Each includes spatial standardization of image volumes, computation of a statistic, and evaluation of its significance. In addition,
An evaluation of the statistical significance of the association between northward turnings of the interplanetary magnetic field and substorm expansion onsets

Science.gov (United States)

Hsu, Tung-Shin; McPherron, R. L.

2002-11-01

An outstanding problem in magnetospheric physics is deciding whether substorms are always triggered by external changes in the interplanetary magnetic field (IMF) or solar wind plasma, or whether they sometimes occur spontaneously. Over the past decade, arguments have been made on both sides of this issue. In fact, there is considerable evidence that some substorms are triggered. However, equally persuasive examples of substorms with no obvious trigger have been found. Because of conflicting views on this subject, further work is required to determine whether there is a physical relation between IMF triggers and substorm onset. In the work reported here a list of substorm onsets was created using two independent substorm signatures: sudden changes in the slope of the AL index and the start of a Pi 2 pulsation burst. Possible IMF triggers were determined from ISEE-2 observations. With the ISEE spacecraft near local noon immediately upstream of the bow shock, there can be little question about propagation delay to the magnetopause or whether a particular IMF feature hits the subsolar magnetopause. Thus it eliminates the objections that the calculated arrival time is subject to a large error or that the solar wind monitor missed a potential trigger incident at the subsolar point. Using a less familiar technique, statistics of point process, we find that the time delay between substorm onsets and the propagated arrival time of IMF triggers are clustered around zero. We estimate for independent processes that the probability of this clustering by chance alone is about 10^–11. If we take into account the requirement that the IMF must have been southward prior to the onset, then the probability of clustering is higher, ~10^–5, but still extremely unlikely. Thus it is not possible to ascribe the apparent relation between IMF northward turnings and substorm onset to coincidence.
OPATs: Omnibus P-value association tests.

Science.gov (United States)

Chen, Chia-Wei; Yang, Hsin-Chou

2017-07-10

Combining statistical significances (P-values) from a set of single-locus association tests in genome-wide association studies is a proof-of-principle method for identifying disease-associated genomic segments, functional genes and biological pathways. We review P-value combinations for genome-wide association studies and introduce an integrated analysis tool, Omnibus P-value Association Tests (OPATs), which provides popular analysis methods of P-value combinations. The software OPATs programmed in R and R graphical user interface features a user-friendly interface. In addition to analysis modules for data quality control and single-locus association tests, OPATs provides three types of set-based association test: window-, gene- and biopathway-based association tests. P-value combinations with or without threshold and rank truncation are provided. The significance of a set-based association test is evaluated by using resampling procedures. Performance of the set-based association tests in OPATs has been evaluated by simulation studies and real data analyses. These set-based association tests help boost the statistical power, alleviate the multiple-testing problem, reduce the impact of genetic heterogeneity, increase the replication efficiency of association tests and facilitate the interpretation of association signals by streamlining the testing procedures and integrating the genetic effects of multiple variants in genomic regions of biological relevance. In summary, P-value combinations facilitate the identification of marker sets associated with disease susceptibility and uncover missing heritability in association studies, thereby establishing a foundation for the genetic dissection of complex diseases and traits. OPATs provides an easy-to-use and statistically powerful analysis tool for P-value combinations. OPATs, examples, and user guide can be downloaded from http://www.stat.sinica.edu.tw/hsinchou/genetics/association/OPATs.htm. © The Author 2017
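As an illustration of what a P-value combination computes, the sketch below implements Fisher's classical method, one of the simplest such combinations. It is not taken from OPATs: the function name `fisher_combined_pvalue` is hypothetical, and the closed-form chi-square tail used here assumes the P-values are independent, which linked markers generally are not (one reason set-based tests such as those described above rely on resampling instead).

```python
import math


def fisher_combined_pvalue(pvalues):
    """Combine independent P-values with Fisher's method.

    The statistic X = -2 * sum(ln p_i) follows a chi-square distribution with
    2k degrees of freedom under the null. For even degrees of freedom the
    upper tail has the closed form exp(-X/2) * sum_{i<k} (X/2)**i / i!.
    """
    if not pvalues:
        raise ValueError("at least one P-value is required")
    if any(not (0.0 < p <= 1.0) for p in pvalues):
        raise ValueError("P-values must lie in (0, 1]")

    k = len(pvalues)
    half_stat = -sum(math.log(p) for p in pvalues)  # equals X / 2

    # Accumulate the series 1 + (X/2) + (X/2)**2 / 2! + ... up to k terms.
    term = 1.0
    series = 1.0
    for i in range(1, k):
        term *= half_stat / i
        series += term
    return math.exp(-half_stat) * series


combined = fisher_combined_pvalue([0.05, 0.05])
```

Two marginal results of 0.05 combine to a smaller overall P-value, which is the sense in which combining evidence across markers can boost power.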
FEATURES OF THE CLINICAL SIGNIFICANCE OF POLYMORPHIC VARIANTS OF ENOS AND AGTR2 GENES IN PATIENTS WITH CAD

Directory of Open Access Journals (Sweden)

A. L. Khokhlov

2016-01-01

Coronary heart disease (CHD) is a major cause of mortality. The morphological substrate of CHD in most cases is atherosclerosis, which is associated with polymorphisms of the structural genes eNOS and AGTR2. The aim of the study was to determine the prevalence of eNOS and AGTR2 gene polymorphisms in patients with coronary artery disease and the association of these polymorphisms with CHD. The study involved 187 patients aged 36 to 86 years (62.2±11.2) with different forms of CHD (stable and unstable angina, myocardial infarction) and 45 people without CHD. Gene polymorphisms were determined by real-time PCR on an IQ 5 nucleic acid analyzer (Bio-Rad). Statistical analysis was performed using Statistica 10.0. The study revealed a significant difference in the frequency of the homozygous AA variant of the AGTR2 gene between patients with myocardial infarction and the comparison group. The AA variant of AGTR2 was associated with earlier onset of coronary artery disease, whereas in carriers of the GA variant of AGTR2, CHD began significantly later than in carriers of the GG and AA variants. In carriers of the TT variant of the eNOS gene, CHD began earlier, at an age that differed significantly from the age of onset in carriers of the GG and GT variants. A positive correlation was found between the polymorphic allele of the AGTR2 gene and the presence of arterial hypertension in patients with coronary artery disease. Carriage of the T allele of the eNOS gene was associated with earlier onset of hypertension, and the polymorphic allele of the AGTR2 gene was associated with the need for higher doses of the ACE inhibitor perindopril.
Statistics Using Just One Formula

Science.gov (United States)

Rosenthal, Jeffrey S.

2018-01-01

This article advocates that introductory statistics be taught by basing all calculations on a single simple margin-of-error formula and deriving all of the standard introductory statistical concepts (confidence intervals, significance tests, comparisons of means and proportions, etc) from that one formula. It is argued that this approach will…
Statistics Anxiety among Postgraduate Students

Science.gov (United States)

Koh, Denise; Zawi, Mohd Khairi

2014-01-01

Most postgraduate programmes that have research components require students to take at least one course in research statistics. Not all postgraduate programmes are science based; a significant number of postgraduate students from the social sciences will be taking statistics courses as they try to complete their…
Robust inference from multiple test statistics via permutations: a better alternative to the single test statistic approach for randomized trials.

Science.gov (United States)

Ganju, Jitendra; Yu, Xinxin; Ma, Guoguang Julie

2013-01-01

Formal inference in randomized clinical trials is based on controlling the type I error rate associated with a single pre-specified statistic. The deficiency of using just one method of analysis is that it depends on assumptions that may not be met. For robust inference, we propose pre-specifying multiple test statistics and relying on the minimum p-value for testing the null hypothesis of no treatment effect. The null hypothesis associated with the various test statistics is that the treatment groups are indistinguishable. The critical value for hypothesis testing comes from permutation distributions. Rejection of the null hypothesis when the smallest p-value is less than the critical value controls the type I error rate at its designated value. Even if one of the candidate test statistics has low power, the adverse effect on the power of the minimum p-value statistic is not much. Its use is illustrated with examples. We conclude that it is better to rely on the minimum p-value rather than a single statistic particularly when that single statistic is the logrank test, because of the cost and complexity of many survival trials. Copyright © 2013 John Wiley & Sons, Ltd.
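The minimum p-value idea described above can be sketched in a few lines of Python. This is an illustrative sketch, not the authors' implementation: the function name is hypothetical, and the two candidate statistics (absolute difference in means and in medians) are assumptions chosen for simplicity rather than the statistics used in the paper.

```python
import random
import statistics

def min_p_permutation_test(x, y, n_perm=999, seed=0):
    """Permutation test based on the smallest p-value over several statistics."""
    stat_fns = [
        lambda a, b: abs(statistics.mean(a) - statistics.mean(b)),
        lambda a, b: abs(statistics.median(a) - statistics.median(b)),
    ]
    rng = random.Random(seed)
    pooled = list(x) + list(y)
    n = len(x)
    # Row 0 holds the observed statistics; the remaining rows are permutations.
    rows = [[f(list(x), list(y)) for f in stat_fns]]
    for _ in range(n_perm):
        rng.shuffle(pooled)
        rows.append([f(pooled[:n], pooled[n:]) for f in stat_fns])

    def min_p(row):
        # Permutation p-value of each statistic, then the minimum over statistics.
        return min(
            sum(r[j] >= row[j] for r in rows) / len(rows)
            for j in range(len(stat_fns))
        )

    observed = min_p(rows[0])
    # Calibrate the minimum p-value against its own permutation distribution,
    # which is what keeps the type I error rate at its nominal level.
    return sum(min_p(r) <= observed for r in rows) / len(rows)
```

The second pass over the permutations is the key design choice: comparing the raw minimum p-value with 0.05 directly would inflate the type I error rate.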
688,112 statistical results : Content mining psychology articles for statistical test results

NARCIS (Netherlands)

Hartgerink, C.H.J.

2016-01-01

In this data deposit, I describe a dataset that is the result of content mining 167,318 published articles for statistical test results reported according to the standards prescribed by the American Psychological Association (APA). Articles published by the APA, Springer, Sage, and Taylor & Francis
Stress Exposure in Significant Relationships Is Associated with Lymph Node Status in Breast Cancer.

Directory of Open Access Journals (Sweden)

Chiara Renzi

Life stress exposure may impact on health and disease. Previous literature showed that stressful life events are associated with cancer incidence, survival and mortality. In animal models, patterns of maternal care have been shown to critically affect stress sensitivity and immunity trajectories later in life, by modifying DNA methylation during critical periods early in life. However, the role of parental care in breast cancer progression and survival has only limitedly been explored. Here, we investigated whether these factors may be linked to biological prognostic variables. One hundred twenty-three women hospitalized for surgery of primary breast cancer completed a questionnaire assessing parental bonding. Stressful events throughout the life span were also assessed. We found that the absence of optimal parental relationships is significantly associated with an increased risk of lymph node involvement, adjusting for confounders, while cumulative stress in the area of sentimental relationships is borderline significantly associated with the same prognostic factor. Our results suggest that parental bonding and sentimental relations may have a role in breast cancer progression. These variables represent an important evolutionary aspect which may modulate cancer progression through psycho-physiological stress pathways and influence the immune system.
A statistical approach to instrument calibration

Science.gov (United States)

Robert R. Ziemer; David Strauss

1978-01-01

Summary - It has been found that two instruments will yield different numerical values when used to measure identical points. A statistical approach is presented that can be used to approximate the error associated with the calibration of instruments. Included are standard statistical tests that can be used to determine if a number of successive calibrations of the...
Quantum mechanics from classical statistics

International Nuclear Information System (INIS)

Wetterich, C.

2010-01-01

Quantum mechanics can emerge from classical statistics. A typical quantum system describes an isolated subsystem of a classical statistical ensemble with infinitely many classical states. The state of this subsystem can be characterized by only a few probabilistic observables. Their expectation values define a density matrix if they obey a 'purity constraint'. Then all the usual laws of quantum mechanics follow, including Heisenberg's uncertainty relation, entanglement and a violation of Bell's inequalities. No concepts beyond classical statistics are needed for quantum physics - the differences are only apparent and result from the particularities of those classical statistical systems which admit a quantum mechanical description. Born's rule for quantum mechanical probabilities follows from the probability concept for a classical statistical ensemble. In particular, we show how the non-commuting properties of quantum operators are associated to the use of conditional probabilities within the classical system, and how a unitary time evolution reflects the isolation of the subsystem. As an illustration, we discuss a classical statistical implementation of a quantum computer.
Associations of geomagnetic activity with plasma sheet thinning and expansion: A statistical study

International Nuclear Information System (INIS)

Hones, E.W. Jr.; Pytte, T.; West, H.I. Jr.

1984-01-01

Associations of geomagnetic activity in the auroral zone with thinnings and expansions of the magnetotail plasma sheet are examined statistically in this paper. We first identified many plasma sheet thinnings and expansions in plasma and particle data from VELA satellites and from OGO 5 without reference to the ground magnetic data. These events were grouped according to the location of the detecting satellite in the magnetotail. For each such group the times of thinning or expansion were then used as fiducial times in a superposed-epoch analysis of the geomagnetic AL index values that were recorded in 8-hour intervals centered on the event times. The results show that many plasma sheet thinnings and expansions are related to discrete negative bay structures that are the classical signature of substorms. Furthermore, they support earlier findings that plasma sheet thinning and expansion at the VELA orbit (r ≈ 18 R_E) tend to be associated with the onset of the auroral zone negative bay and the beginning of its subsidence, respectively. Earthward of r ≈ 13-15 R_E, plasma sheet expansion occurs near the time of the onset of the negative bay, again in agreement with earlier findings. A large fraction of plasma sheet expansions to half thicknesses of ≳ 6 R_E at the VELA orbit are associated not with a baylike geomagnetic disturbance but with subsidence of a prolonged interval of disturbance. The study also shows that many plasma sheet expansions are related simply to generally enhanced geomagnetic activity showing no baylike or other distinctive features.
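For readers unfamiliar with superposed-epoch analysis, the following Python sketch shows the basic operation: windows of an index series centred on each fiducial time are stacked and averaged. The function name and the regularly sampled list representation are assumptions for illustration; the paper's own data handling is not described in the abstract.

```python
def superposed_epoch(series, event_indices, half_window):
    """Average a regularly sampled series in windows centred on each event."""
    width = 2 * half_window + 1
    sums = [0.0] * width
    used = 0
    for idx in event_indices:
        start, stop = idx - half_window, idx + half_window + 1
        if start < 0 or stop > len(series):
            continue  # skip events whose window runs off the record
        for k, value in enumerate(series[start:stop]):
            sums[k] += value
        used += 1
    if used == 0:
        raise ValueError("no event has a complete window")
    return [s / used for s in sums]
```

Features that recur at a fixed offset from the fiducial time, such as a negative bay, survive the averaging, while unrelated fluctuations tend to cancel.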
An Efficient Stepwise Statistical Test to Identify Multiple Linked Human Genetic Variants Associated with Specific Phenotypic Traits.

Directory of Open Access Journals (Sweden)

Iksoo Huh

Recent advances in genotyping methodologies have allowed genome-wide association studies (GWAS) to accurately identify genetic variants that associate with common or pathological complex traits. Although most GWAS have focused on associations with single genetic variants, joint identification of multiple genetic variants, and how they interact, is essential for understanding the genetic architecture of complex phenotypic traits. Here, we propose an efficient stepwise method based on the Cochran-Mantel-Haenszel test (for stratified categorical data) to identify causal joint multiple genetic variants in GWAS. This method combines the CMH statistic with a stepwise procedure to detect multiple genetic variants associated with specific categorical traits, using a series of associated I × J contingency tables and a null hypothesis of no phenotype association. Through a new stratification scheme based on the sum of minor allele count criteria, we make the method more feasible for GWAS data having sample sizes of several thousands. We also examine the properties of the proposed stepwise method via simulation studies, and show that the stepwise CMH test performs better than other existing methods (e.g., logistic regression and detection of associations by Markov blanket) for identifying multiple genetic variants. Finally, we apply the proposed approach to two genomic sequencing datasets to detect linked genetic variants associated with bipolar disorder and obesity, respectively.
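To make the CMH statistic concrete, here is a minimal Python sketch for the simplest case of stratified 2 × 2 tables. It is a simplification under stated assumptions: the paper works with I × J tables and a stepwise procedure, neither of which is reproduced here, and the function name and input layout are hypothetical. No continuity correction is applied.

```python
import math

def cmh_test(tables):
    """Cochran-Mantel-Haenszel test for a list of 2x2 tables ((a, b), (c, d)).

    Returns the chi-square statistic (1 df) and its p-value.
    """
    diff, var = 0.0, 0.0
    for (a, b), (c, d) in tables:
        n = a + b + c + d
        if n < 2:
            continue  # a stratum with fewer than two subjects carries no information
        expected = (a + b) * (a + c) / n
        diff += a - expected
        var += (a + b) * (c + d) * (a + c) * (b + d) / (n * n * (n - 1))
    if var == 0:
        raise ValueError("all strata are degenerate")
    stat = diff * diff / var
    # Survival function of a chi-square variable with one degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value
```

Each stratum contributes its own observed-minus-expected count, so an association is tested while holding the stratifying variable fixed.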
Quantum Statistical Operator and Classically Chaotic Hamiltonian ...

African Journals Online (AJOL)

Quantum Statistical Operator and Classically Chaotic Hamiltonian System. ... Journal of the Nigerian Association of Mathematical Physics ... In a Hamiltonian system von Neumann Statistical Operator is used to tease out the quantum consequence of (classical) chaos engendered by the nonlinear coupling of system to its ...
Testing multiple statistical hypotheses resulted in spurious associations: a study of astrological signs and health.

Science.gov (United States)

Austin, Peter C; Mamdani, Muhammad M; Juurlink, David N; Hux, Janet E

2006-09-01

To illustrate how multiple hypotheses testing can produce associations with no clinical plausibility. We conducted a study of all 10,674,945 residents of Ontario aged between 18 and 100 years in 2000. Residents were randomly assigned to equally sized derivation and validation cohorts and classified according to their astrological sign. Using the derivation cohort, we searched through 223 of the most common diagnoses for hospitalization until we identified two for which subjects born under one astrological sign had a significantly higher probability of hospitalization compared to subjects born under the remaining signs combined (P<0.05). We tested these 24 associations in the independent validation cohort. Residents born under Leo had a higher probability of gastrointestinal hemorrhage (P=0.0447), while Sagittarians had a higher probability of humerus fracture (P=0.0123) compared to all other signs combined. After adjusting the significance level to account for multiple comparisons, none of the identified associations remained significant in either the derivation or validation cohort. Our analyses illustrate how the testing of multiple, non-prespecified hypotheses increases the likelihood of detecting implausible associations. Our findings have important implications for the analysis and interpretation of clinical studies.
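The effect described in this abstract follows from simple arithmetic, sketched below in Python. The 24 tests and the two reported P-values come from the abstract; the use of a Bonferroni correction is an assumption made for illustration, since the abstract does not name the adjustment the authors applied, and the function names are hypothetical.

```python
def familywise_error_rate(n_tests, alpha=0.05):
    """Chance of at least one false positive among independent null tests."""
    return 1.0 - (1.0 - alpha) ** n_tests

def bonferroni_significant(pvalues, n_tests, alpha=0.05):
    """Flag P-values that survive a Bonferroni-adjusted threshold."""
    threshold = alpha / n_tests
    return [p < threshold for p in pvalues]

# With 24 independent tests at the 0.05 level, a spurious finding is likely.
print(round(familywise_error_rate(24), 3))
# Neither reported association survives the adjusted threshold of 0.05 / 24.
print(bonferroni_significant([0.0447, 0.0123], n_tests=24))
```

Under these assumptions the chance of at least one nominally significant result among 24 true null hypotheses is roughly 71%.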
Statistics of Local Extremes

DEFF Research Database (Denmark)

Larsen, Gunner Chr.; Bierbooms, W.; Hansen, Kurt Schaldemose

2003-01-01

A theoretical expression for the probability density function associated with local extremes of a stochastic process is presented. The expression is basically based on the lower four statistical moments and a bandwidth parameter. The theoretical expression is subsequently verified by comparison with simulated...
Confidence Intervals: From tests of statistical significance to confidence intervals, range hypotheses and substantial effects

Directory of Open Access Journals (Sweden)

Dominic Beaulieu-Prévost

2006-03-01

For the last 50 years of research in quantitative social sciences, the empirical evaluation of scientific hypotheses has been based on the rejection or not of the null hypothesis. However, more than 300 articles demonstrated that this method was problematic. In summary, null hypothesis testing (NHT) is unfalsifiable, its results depend directly on sample size and the null hypothesis is both improbable and not plausible. Consequently, alternatives to NHT such as confidence intervals (CI) and measures of effect size are starting to be used in scientific publications. The purpose of this article is, first, to provide the conceptual tools necessary to implement an approach based on confidence intervals, and second, to briefly demonstrate why such an approach is an interesting alternative to an approach based on NHT. As demonstrated in the article, the proposed CI approach avoids most problems related to a NHT approach and can often improve the scientific and contextual relevance of the statistical interpretations by testing range hypotheses instead of a point hypothesis and by defining the minimal value of a substantial effect. The main advantage of such a CI approach is that it replaces the notion of statistical power by an easily interpretable three-value logic (probable presence of a substantial effect, probable absence of a substantial effect and probabilistic undetermination). The demonstration includes a complete example.
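One plausible reading of the three-value logic can be sketched in Python as follows. The decision rule is an assumption for illustration, since the abstract does not spell out the article's exact criteria: an effect is called substantial when the whole confidence interval lies at or beyond a minimal value in either direction, absent when the whole interval lies strictly inside it, and undetermined otherwise. The function and parameter names are hypothetical.

```python
def classify_effect(ci_low, ci_high, minimal_effect):
    """Three-value reading of a confidence interval for an effect size.

    minimal_effect is the smallest absolute effect considered substantial.
    """
    if minimal_effect <= 0:
        raise ValueError("minimal_effect must be positive")
    if ci_low >= minimal_effect or ci_high <= -minimal_effect:
        return "probable presence of a substantial effect"
    if -minimal_effect < ci_low and ci_high < minimal_effect:
        return "probable absence of a substantial effect"
    return "undetermined"
```

For example, with a minimal substantial effect of 0.2, an interval of (0.25, 0.60) indicates probable presence, (-0.05, 0.10) probable absence, and (0.10, 0.50) remains undetermined.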

Statistics 101 for Radiologists.

Science.gov (United States)

Anvari, Arash; Halpern, Elkan F; Samir, Anthony E

2015-10-01

Diagnostic tests have wide clinical applications, including screening, diagnosis, measuring treatment effect, and determining prognosis. Interpreting diagnostic test results requires an understanding of key statistical concepts used to evaluate test efficacy. This review explains descriptive statistics and discusses probability, including mutually exclusive and independent events and conditional probability. In the inferential statistics section, a statistical perspective on study design is provided, together with an explanation of how to select appropriate statistical tests. Key concepts in recruiting study samples are discussed, including representativeness and random sampling. Variable types are defined, including predictor, outcome, and covariate variables, and the relationship of these variables to one another. In the hypothesis testing section, we explain how to determine if observed differences between groups are likely to be due to chance. We explain type I and II errors, statistical significance, and study power, followed by an explanation of effect sizes and how confidence intervals can be used to generalize observed effect sizes to the larger population. Statistical tests are explained in four categories: t tests and analysis of variance, proportion analysis tests, nonparametric tests, and regression techniques. We discuss sensitivity, specificity, accuracy, receiver operating characteristic analysis, and likelihood ratios. Measures of reliability and agreement, including κ statistics, intraclass correlation coefficients, and Bland-Altman graphs and analysis, are introduced. © RSNA, 2015.
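Several of the diagnostic measures named in this review follow directly from a 2 × 2 table of test result against true disease status. The Python sketch below uses the standard definitions; the function name and the counts in the usage note are hypothetical and not taken from the review.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard accuracy measures from a 2x2 diagnostic table.

    tp/fn: diseased subjects testing positive/negative;
    fp/tn: non-diseased subjects testing positive/negative.
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
        # Undefined (division by zero) when specificity is exactly 1.
        "lr_positive": sensitivity / (1.0 - specificity),
        # Undefined (division by zero) when specificity is exactly 0.
        "lr_negative": (1.0 - sensitivity) / specificity,
    }
```

With 90 true positives, 10 false negatives, 20 false positives and 80 true negatives, sensitivity is 0.90, specificity 0.80, accuracy 0.85, and the positive and negative likelihood ratios are about 4.5 and 0.125.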
Statistics in the pharmacy literature.

Science.gov (United States)

Lee, Charlene M; Soin, Herpreet K; Einarson, Thomas R

2004-09-01

Research in statistical methods is essential for maintenance of high quality of the published literature. To update previous reports of the types and frequencies of statistical terms and procedures in research studies of selected professional pharmacy journals. We obtained all research articles published in 2001 in 6 journals: American Journal of Health-System Pharmacy, The Annals of Pharmacotherapy, Canadian Journal of Hospital Pharmacy, Formulary, Hospital Pharmacy, and Journal of the American Pharmaceutical Association. Two independent reviewers identified and recorded descriptive and inferential statistical terms/procedures found in the methods, results, and discussion sections of each article. Results were determined by tallying the total number of times, as well as the percentage, that each statistical term or procedure appeared in the articles. One hundred forty-four articles were included. Ninety-eight percent employed descriptive statistics; of these, 28% used only descriptive statistics. The most common descriptive statistical terms were percentage (90%), mean (74%), standard deviation (58%), and range (46%). Sixty-nine percent of the articles used inferential statistics, the most frequent being chi(2) (33%), Student's t-test (26%), Pearson's correlation coefficient r (18%), ANOVA (14%), and logistic regression (11%). Statistical terms and procedures were found in nearly all of the research articles published in pharmacy journals. Thus, pharmacy education should aim to provide current and future pharmacists with an understanding of the common statistical terms and procedures identified to facilitate the appropriate appraisal and consequential utilization of the information available in research articles.
Preventing statistical errors in scientific journals.

NARCIS (Netherlands)

Nuijten, M.B.

2016-01-01

There is evidence for a high prevalence of statistical reporting errors in psychology and other scientific fields. These errors display a systematic preference for statistically significant results, distorting the scientific literature. There are several possible causes for this systematic error
Statistics: The stethoscope of a thinking urologist

Directory of Open Access Journals (Sweden)

Arun S Sivanandam

2009-01-01

Understanding statistical terminology and the ability to appraise clinical research findings and statistical tests are critical to the practice of evidence-based medicine. Urologists require statistics in their toolbox of skills in order to successfully sift through increasingly complex studies and realize the drawbacks of statistical tests. Currently, the level of evidence in urology literature is low and the majority of research abstracts published for the American Urological Association (AUA) meetings lag behind for full-text publication because of a lack of statistical reporting. Underlying these issues is a distinct deficiency in solid comprehension of statistics in the literature and a discomfort with the application of statistics for clinical decision-making. This review examines the plight of statistics in urology and investigates the reason behind the white-coat aversion to biostatistics. Resources such as evidence-based medicine websites, primers in statistics, and guidelines for statistical reporting exist for quick reference by urologists. Ultimately, educators should take charge of monitoring statistical knowledge among trainees by bolstering competency requirements and creating sustained opportunities for statistics and methodology exposure.
Basics of statistical physics

CERN Document Server

Müller-Kirsten, Harald J W

2013-01-01

Statistics links microscopic and macroscopic phenomena, and requires for this reason a large number of microscopic elements like atoms. The results are values of maximum probability or of averaging. This introduction to statistical physics concentrates on the basic principles, and attempts to explain these in simple terms supplemented by numerous examples. These basic principles include the difference between classical and quantum statistics, a priori probabilities as related to degeneracies, the vital aspect of indistinguishability as compared with distinguishability in classical physics, the differences between conserved and non-conserved elements, the different ways of counting arrangements in the three statistics (Maxwell-Boltzmann, Fermi-Dirac, Bose-Einstein), the difference between maximization of the number of arrangements of elements, and averaging in the Darwin-Fowler method. Significant applications to solids, radiation and electrons in metals are treated in separate chapters, as well as Bose-Eins...
Renyi statistics in equilibrium statistical mechanics

International Nuclear Information System (INIS)

Parvan, A.S.; Biro, T.S.

2010-01-01

The Renyi statistics in the canonical and microcanonical ensembles is examined both in general and in particular for the ideal gas. In the microcanonical ensemble the Renyi statistics is equivalent to the Boltzmann-Gibbs statistics. By the exact analytical results for the ideal gas, it is shown that in the canonical ensemble, taking the thermodynamic limit, the Renyi statistics is also equivalent to the Boltzmann-Gibbs statistics. Furthermore it satisfies the requirements of the equilibrium thermodynamics, i.e. the thermodynamical potential of the statistical ensemble is a homogeneous function of first degree of its extensive variables of state. We conclude that the Renyi statistics arrives at the same thermodynamical relations, as those stemming from the Boltzmann-Gibbs statistics in this limit.
Polarimetric Segmentation Using Wishart Test Statistic

DEFF Research Database (Denmark)

Skriver, Henning; Schou, Jesper; Nielsen, Allan Aasbjerg

2002-01-01

A newly developed test statistic for equality of two complex covariance matrices following the complex Wishart distribution and an associated asymptotic probability for the test statistic has been used in a segmentation algorithm. The segmentation algorithm is based on the MUM (merge using moments) approach, which is a merging algorithm for single channel SAR images. The polarimetric version described in this paper uses the above-mentioned test statistic for merging. The segmentation algorithm has been applied to polarimetric SAR data from the Danish dual-frequency, airborne polarimetric SAR, EMISAR...
A significant association between BDNF promoter methylation and the risk of drug addiction.

Science.gov (United States)

Xu, Xuting; Ji, Huihui; Liu, Guili; Wang, Qinwen; Liu, Huifen; Shen, Wenwen; Li, Longhui; Xie, Xiaohu; Zhou, Wenhua; Duan, Shiwei

2016-06-10

As a member of the neurotrophic factor family, brain derived neurotrophic factor (BDNF) plays an important role in the survival and differentiation of neurons. The aim of our work was to evaluate the role of BDNF promoter methylation in drug addiction. A total of 60 drug abusers (30 heroin and 30 methylamphetamine addicts) and 52 healthy age- and gender-matched controls were recruited for the current case control study. Bisulfite pyrosequencing technology was used to determine the methylation levels of five CpGs (CpG1-5) on the BDNF promoter. Among the five CpGs, CpG5 methylation was significantly lower in drug abusers than controls. Moreover, significant associations were found between CpG5 methylation and addictive phenotypes including tension-anxiety, anger-hostility, fatigue-inertia, and depression-dejection. In addition, luciferase assay showed that the DNA fragment of BDNF promoter played a key role in the regulation of gene expression. Our results suggest that BDNF promoter methylation is associated with drug addiction, although further studies are needed to understand the mechanisms by which BDNF promoter methylation contributes to the pathophysiology of drug addiction. Copyright © 2016. Published by Elsevier B.V.
Hormone replacement therapy is associated with gastro-oesophageal reflux disease: a retrospective cohort study

Directory of Open Access Journals (Sweden)

Close Helen

2012-05-01

Background: Oestrogen and progestogen have the potential to influence gastro-intestinal motility; both are key components of hormone replacement therapy (HRT). Results of observational studies in women taking HRT rely on self-reporting of gastro-oesophageal symptoms and the aetiology of gastro-oesophageal reflux disease (GORD) remains unclear. This study investigated the association between HRT and GORD in menopausal women using validated general practice records. Methods: 51,182 menopausal women were identified using the UK General Practice Research Database between 1995 and 2004. Of these, 8,831 were matched with and without hormone use. Odds ratios (ORs) were calculated for GORD and proton-pump inhibitor (PPI) use in hormone and non-hormone users, adjusting for age, co-morbidities, and co-pharmacy. Results: In unadjusted analysis, all forms of hormone use (oestrogen-only, tibolone, combined HRT and progestogen) were statistically significantly associated with GORD. In adjusted models, this association remained statistically significant for oestrogen-only treatment (OR 1.49; 1.18–1.89). Unadjusted analysis showed a statistically significant association between PPI use and oestrogen-only and combined HRT treatment. When adjusted for covariates, oestrogen-only treatment was significant (OR 1.34; 95% CI 1.03–1.74). Findings from the adjusted model demonstrated the greater use of PPI by progestogen users (OR 1.50; 1.01–2.22). Conclusions: This first large cohort study of the association between GORD and HRT found a statistically significant association between oestrogen-only hormone and GORD and PPI use. This should be further investigated using prospective follow-up to validate the strength of association and describe its clinical significance.
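Odds ratios with 95% confidence intervals, as reported above, can be computed from a 2 × 2 table as in the Python sketch below. This covers only the unadjusted case with a Wald interval; the study's adjusted estimates come from regression models that are not reproduced here, and the function name and the counts in the usage note are hypothetical.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    a, b: exposed subjects with and without the outcome;
    c, d: unexposed subjects with and without the outcome.
    All four counts must be positive.
    """
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    low = math.exp(math.log(odds_ratio) - z * se_log_or)
    high = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, low, high
```

For hypothetical counts of 20 and 80 among exposed and 10 and 90 among unexposed subjects, the odds ratio is 2.25, but the interval (about 0.99 to 5.09) includes 1, so the association would not be statistically significant at the 5% level.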
Learning Predictive Statistics: Strategies and Brain Mechanisms.

Science.gov (United States)

Wang, Rui; Shen, Yuan; Tino, Peter; Welchman, Andrew E; Kourtzi, Zoe

2017-08-30

When immersed in a new environment, we are challenged to decipher initially incomprehensible streams of sensory information. However, quite rapidly, the brain finds structure and meaning in these incoming signals, helping us to predict and prepare ourselves for future actions. This skill relies on extracting the statistics of event streams in the environment that contain regularities of variable complexity from simple repetitive patterns to complex probabilistic combinations. Here, we test the brain mechanisms that mediate our ability to adapt to the environment's statistics and predict upcoming events. By combining behavioral training and multisession fMRI in human participants (male and female), we track the corticostriatal mechanisms that mediate learning of temporal sequences as they change in structure complexity. We show that learning of predictive structures relates to individual decision strategy; that is, selecting the most probable outcome in a given context (maximizing) versus matching the exact sequence statistics. These strategies engage distinct human brain regions: maximizing engages dorsolateral prefrontal, cingulate, sensory-motor regions, and basal ganglia (dorsal caudate, putamen), whereas matching engages occipitotemporal regions (including the hippocampus) and basal ganglia (ventral caudate). Our findings provide evidence for distinct corticostriatal mechanisms that facilitate our ability to extract behaviorally relevant statistics to make predictions. SIGNIFICANCE STATEMENT Making predictions about future events relies on interpreting streams of information that may initially appear incomprehensible. Past work has studied how humans identify repetitive patterns and associative pairings. However, the natural environment contains regularities that vary in complexity from simple repetition to complex probabilistic combinations. Here, we combine behavior and multisession fMRI to track the brain mechanisms that mediate our ability to adapt to
Assessing attitudes towards statistics among medical students: psychometric properties of the Serbian version of the Survey of Attitudes Towards Statistics (SATS).

Directory of Open Access Journals (Sweden)

Dejana Stanisavljevic

BACKGROUND: Medical statistics has become important and relevant for future doctors, enabling them to practice evidence based medicine. Recent studies report that students' attitudes towards statistics play an important role in their statistics achievements. The aim of the study was to test the psychometric properties of the Serbian version of the Survey of Attitudes Towards Statistics (SATS) in order to acquire a valid instrument to measure attitudes inside the Serbian educational context. METHODS: The validation study was performed on a cohort of 417 medical students who were enrolled in an obligatory introductory statistics course. The SATS adaptation was based on an internationally accepted methodology for translation and cultural adaptation. Psychometric properties of the Serbian version of the SATS were analyzed through the examination of factorial structure and internal consistency. RESULTS: Most medical students held positive attitudes towards statistics. The average total SATS score was above neutral (4.3±0.8), and varied from 1.9 to 6.2. Confirmatory factor analysis validated the six-factor structure of the questionnaire (Affect, Cognitive Competence, Value, Difficulty, Interest and Effort). Values for fit indices TLI (0.940) and CFI (0.961) were above the cut-off of ≥0.90. The RMSEA value of 0.064 (0.051-0.078) was below the suggested value of ≤0.08. Cronbach's alpha of the entire scale was 0.90, indicating scale reliability. In a multivariate regression model, self-rating of ability in mathematics and current grade point average were significantly associated with the total SATS score after adjusting for age and gender. CONCLUSION: Present study provided the evidence for the appropriate metric properties of the Serbian version of SATS. Confirmatory factor analysis validated the six-factor structure of the scale. The SATS might be reliable and a valid instrument for identifying medical students' attitudes towards statistics in the
Assessing attitudes towards statistics among medical students: psychometric properties of the Serbian version of the Survey of Attitudes Towards Statistics (SATS).

Science.gov (United States)

Stanisavljevic, Dejana; Trajkovic, Goran; Marinkovic, Jelena; Bukumiric, Zoran; Cirkovic, Andja; Milic, Natasa

2014-01-01

Medical statistics has become important and relevant for future doctors, enabling them to practice evidence based medicine. Recent studies report that students' attitudes towards statistics play an important role in their statistics achievements. The aim of the study was to test the psychometric properties of the Serbian version of the Survey of Attitudes Towards Statistics (SATS) in order to acquire a valid instrument to measure attitudes inside the Serbian educational context. The validation study was performed on a cohort of 417 medical students who were enrolled in an obligatory introductory statistics course. The SATS adaptation was based on an internationally accepted methodology for translation and cultural adaptation. Psychometric properties of the Serbian version of the SATS were analyzed through the examination of factorial structure and internal consistency. Most medical students held positive attitudes towards statistics. The average total SATS score was above neutral (4.3±0.8), and varied from 1.9 to 6.2. Confirmatory factor analysis validated the six-factor structure of the questionnaire (Affect, Cognitive Competence, Value, Difficulty, Interest and Effort). Values for fit indices TLI (0.940) and CFI (0.961) were above the cut-off of ≥0.90. The RMSEA value of 0.064 (0.051-0.078) was below the suggested value of ≤0.08. Cronbach's alpha of the entire scale was 0.90, indicating scale reliability. In a multivariate regression model, self-rating of ability in mathematics and current grade point average were significantly associated with the total SATS score after adjusting for age and gender. Present study provided the evidence for the appropriate metric properties of the Serbian version of SATS. Confirmatory factor analysis validated the six-factor structure of the scale. The SATS might be reliable and a valid instrument for identifying medical students' attitudes towards statistics in the Serbian educational context.
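Cronbach's alpha, the internal-consistency measure reported above, can be computed from item-level responses with the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The Python sketch below is illustrative only; the function name and input layout are assumptions, and no SATS data are reproduced.

```python
import statistics

def cronbach_alpha(rows):
    """Cronbach's alpha from item scores.

    rows: one list of item scores per respondent, all of equal length.
    Requires at least two items, two respondents and non-constant totals.
    """
    k = len(rows[0])
    item_variances = [statistics.variance([r[i] for r in rows]) for i in range(k)]
    total_variance = statistics.variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)
```

Alpha approaches 1 when the items rise and fall together across respondents, which is the sense in which a value of 0.90 is read as indicating scale reliability.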
Addressing issues associated with evaluating prediction models for survival endpoints based on the concordance statistic.

Science.gov (United States)

Wang, Ming; Long, Qi

2016-09-01

Prediction models for disease risk and prognosis play an important role in biomedical research, and evaluating their predictive accuracy in the presence of censored data is of substantial interest. The standard concordance (c) statistic has been extended to provide a summary measure of predictive accuracy for survival models. Motivated by a prostate cancer study, we address several issues associated with evaluating survival prediction models based on the c-statistic, with a focus on estimators using the technique of inverse probability of censoring weighting (IPCW). Compared to the existing work, we provide complete results on the asymptotic properties of the IPCW estimators under the assumption of coarsening at random (CAR), and propose a sensitivity analysis under the mechanism of noncoarsening at random (NCAR). In addition, we extend the IPCW approach as well as the sensitivity analysis to high-dimensional settings. The predictive accuracy of prediction models for cancer recurrence after prostatectomy is assessed by applying the proposed approaches. We find that the estimated predictive accuracy for the models in consideration is sensitive to the NCAR assumption, and thus identify the best predictive model. Finally, we further evaluate the performance of the proposed methods in both settings of low-dimensional and high-dimensional data under CAR and NCAR through simulations. © 2016, The International Biometric Society.
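As a point of reference for the concordance statistic discussed in this abstract, the following is a minimal sketch of the plain (unweighted) Harrell-type c-statistic for right-censored data. It is not the IPCW estimator studied by the authors; the function name `harrell_c` and the toy data are assumptions made for illustration only.

```python
def harrell_c(times, events, risk_scores):
    """Unweighted concordance for right-censored data.

    times: observed follow-up times
    events: 1 if the event was observed, 0 if censored
    risk_scores: higher score means higher predicted risk
    """
    concordant = 0.0
    usable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # A pair is usable when subject i has an observed event
            # strictly before subject j's follow-up time.
            if events[i] == 1 and times[i] < times[j]:
                usable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5  # ties in predicted risk count half
    if usable == 0:
        raise ValueError("no usable pairs")
    return concordant / usable


# Hypothetical toy data, not from the prostate cancer study.
toy_times = [2, 4, 6, 8]
toy_events = [1, 1, 0, 1]
toy_scores = [0.9, 0.7, 0.4, 0.2]
c_index = harrell_c(toy_times, toy_events, toy_scores)
```

An IPCW version would additionally weight each usable pair by the inverse of an estimated censoring survival probability, which is where the CAR and NCAR assumptions in the abstract enter.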
Statistical Association Criteria in Forensic Psychiatry–A criminological evaluation of casuistry

Science.gov (United States)

Gheorghiu, V; Buda, O; Popescu, I; Trandafir, MS

2011-01-01

Purpose. Identification of potential shared primary psychoprophylaxis and crime prevention is measured by analyzing the rate of commitments for patients who were subjects of forensic examination. Material and method. The study is a retrospective, document-based statistical analysis. The sample consists of 770 initial examination reports completed during 2007 and primarily analyzed in order to summarize the data within the National Institute of Forensic Medicine, Bucharest, Romania (INML), with one of the group variables being 'particularities of the psychiatric patient history', containing the items 'forensic onset', 'commitments within the last year prior to the examination' and 'absence of commitments within the last year prior to the examination'. The method used was the Kendall bivariate correlation. For this study, the authors separately analyze only the two items regarding commitments, using other correlation alternatives and more elaborate statistical analyses, i.e. recording of the standard case study variables, Kendall bivariate correlation, cross tabulation, factor analysis and hierarchical cluster analysis. Results. The results are varied, ranging from theoretically presumed clinical nosography (such as schizophrenia or manic depression) to non-presumed (conduct disorders) or unexpected behavioral acts, and are therefore difficult to interpret. Conclusions. The features of the sample were taken into consideration, as well as the results of the previous standard correlation for the whole sample. The authors emphasize the role of medical security measures that are actually applied in therapeutic management in general and in risk and second offence management in particular, as well as the role of forensic psychiatric examinations in the detection of certain aspects related to the monitoring of mental patients. PMID:21505571
Statistical power and utility of meta-analysis methods for cross-phenotype genome-wide association studies.

Science.gov (United States)

Zhu, Zhaozhong; Anttila, Verneri; Smoller, Jordan W; Lee, Phil H

2018-01-01

Advances in recent genome wide association studies (GWAS) suggest that pleiotropic effects on human complex traits are widespread. A number of classic and recent meta-analysis methods have been used to identify genetic loci with pleiotropic effects, but the overall performance of these methods is not well understood. In this work, we use extensive simulations and case studies of GWAS datasets to investigate the power and type-I error rates of ten meta-analysis methods. We specifically focus on three conditions commonly encountered in the studies of multiple traits: (1) extensive heterogeneity of genetic effects; (2) characterization of trait-specific association; and (3) inflated correlation of GWAS due to overlapping samples. Although the statistical power is highly variable under distinct study conditions, we found the superior power of several methods under diverse heterogeneity. In particular, classic fixed-effects model showed surprisingly good performance when a variant is associated with more than a half of study traits. As the number of traits with null effects increases, ASSET performed the best along with competitive specificity and sensitivity. With opposite directional effects, CPASSOC featured the first-rate power. However, caution is advised when using CPASSOC for studying genetically correlated traits with overlapping samples. We conclude with a discussion of unresolved issues and directions for future research.
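The classic fixed-effects model mentioned in this abstract can be sketched as an inverse-variance weighted combination of per-study (or per-trait) effect estimates. This is a generic textbook sketch, assuming independent estimates; it does not handle the overlapping-sample correlation the abstract warns about, and the function name and example numbers are hypothetical.

```python
import math


def fixed_effects_meta(betas, ses):
    """Inverse-variance weighted fixed-effects meta-analysis.

    betas: effect estimates; ses: their standard errors.
    Returns (pooled beta, pooled SE, z statistic, two-sided p-value).
    Assumes the input estimates are independent.
    """
    weights = [1.0 / se ** 2 for se in ses]
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / sum(weights)
    pooled_se = math.sqrt(1.0 / sum(weights))
    z = pooled_beta / pooled_se
    # Two-sided p-value under a standard normal reference distribution
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


# Hypothetical effect estimates for one variant across two traits.
beta, se, z, p = fixed_effects_meta([0.1, 0.3], [0.1, 0.1])
```

Because every estimate is pulled towards a single shared effect, this model gains power when most traits carry a real effect in the same direction and loses it when effects are heterogeneous or opposite in sign, which matches the behaviour the abstract describes.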
Statistical Engine Knock Control

DEFF Research Database (Denmark)

Stotsky, Alexander A.

2008-01-01

A new statistical concept of the knock control of a spark ignition automotive engine is proposed. The control aim is associated with the statistical hypothesis test which compares the threshold value to the average value of the maximal amplitude of the knock sensor signal at a given frequency....... Control algorithm which is used for minimization of the regulation error realizes a simple count-up-count-down logic. A new adaptation algorithm for the knock detection threshold is also developed. Confidence interval method is used as the basis for adaptation. A simple statistical model...... which includes generation of the amplitude signals, a threshold value determination and a knock sound model is developed for evaluation of the control concept....
Electricity statistics for Finland 1997; Saehkoetilasto 1997

Energy Technology Data Exchange (ETDEWEB)

Kangas, H; Savolainen, T [Adato Energia Oy, Helsinki (Finland)

1998-12-01

Until 1995, electricity statistics were collected and handled by the Electrical Inspectorate in accordance with the law on electric utilities and facilities. In 1996 the work was done by the Finnish Electricity Association, commissioned by the Ministry of Trade and Industry. Since 1996 the collection and handling of the information has been based on the Electricity Market Act. The information is mainly submitted by the producers and distributors of electricity and has been processed since 1997 by Adato Energia Oy, which is owned jointly by the Finnish Energy Industries Federation, the Finnish District Heating Association and the Finnish Electricity Association. This arrangement is based on a mutual contract between Statistics Finland, Adato Energia Oy, the Finnish Energy Industries Federation and the Finnish Electricity Association. The Electricity Statistics for Finland 1997 contains several summaries of consumption and production. There are also summaries of the networks, the effects of electricity, the capacities of electricity, the fuels used in production and the dwellings heated by electric power. As before, a list of names, addresses, persons and telephone numbers is available. Additionally, a list of the power consumption in all Finnish communes and a glossary in three languages (Finnish, Swedish and English) are included.
Statistical analysis of grapevine mortality associated with esca or Eutypa dieback foliar expression

Directory of Open Access Journals (Sweden)

Lucia GUERIN-DUBRANA

2013-09-01

Full Text Available Esca and Eutypa dieback are two major wood diseases of grapevine in France. Their widespread distribution in vineyards leads to vine decline and to a loss in productivity. However, little is known about the temporal dynamics of these diseases at plant level, or about the relationship between foliar expression of the diseases and vine death. To investigate the latter question, the vines of six vineyards cv. Cabernet Sauvignon in the Bordeaux region were surveyed by recording foliar symptoms, dead arms and dead plants from 2004 to 2010. In 2008, 2009 and 2010, approximately five percent of the asymptomatic vines died, but the percentage of dead vines which had previously expressed esca foliar symptoms was higher and varied between vineyards. A logistic regression model was used to determine the previous years of symptomatic expression associated with vine mortality. Mortality from esca was always associated with foliar symptom expression in the year preceding vine death. One or two other earlier years of expression frequently represented additional risk factors. The Eutypa dieback symptom was also a risk factor for death, greater than or equal to that of esca. The study of the internal necroses of vines expressing esca or Eutypa dieback is discussed in the light of these statistical results.
Statistics: a Bayesian perspective

National Research Council Canada - National Science Library

Berry, Donald A

1996-01-01

...: it is the only introductory textbook based on Bayesian ideas, it combines concepts and methods, it presents statistics as a means of integrating data into the significant process, it develops ideas...
Pharmacogenetics of efficacy and safety of HCV treatment in HCV-HIV coinfected patients: significant associations with IL28B and SOCS3 gene variants.

Directory of Open Access Journals (Sweden)

Francesc Vidal

Full Text Available This was a safety and efficacy pharmacogenetic study of a previously performed randomized trial which compared the effectiveness of treatment of hepatitis C virus infection with pegylated interferon alpha (pegIFNα) 2a vs. 2b, both with ribavirin, for 48 weeks, in HCV-HIV coinfected patients. The study groups comprised 99 patients (efficacy pharmacogenetic substudy) and 114 patients (safety pharmacogenetic substudy). Polymorphisms in the following candidate genes were assessed: IL28B, IL6, IL10, TNFα, IFNγ, CCL5, MxA, OAS1, SOCS3, CTLA4 and ITPA. Genotyping was carried out using Sequenom iPLEX-Gold, a single-base extension polymerase chain reaction. Efficacy end-points assessed were: rapid, early and sustained virological response (RVR, EVR and SVR, respectively). Safety end-points assessed were: anemia, neutropenia, thrombocytopenia, flu-like syndrome, gastrointestinal disturbances and depression. The chi-square test, Student's t test, Mann-Whitney U test and logistic regression were used for statistical analyses. As far as efficacy is concerned, IL28B and CTLA4 gene polymorphisms were associated with RVR (p<0.05 for both comparisons). Nevertheless, only polymorphism in the IL28B gene was associated with SVR (p = 0.004). In the multivariate analysis, the only gene independently associated with SVR was IL28B (OR 2.61, 95%CI 1.2-5.6, p = 0.01). With respect to safety, there were no significant associations between flu-like syndrome or depression and the genetic variants studied. Gastrointestinal disturbances were associated with ITPA gene polymorphism (p = 0.04). Anemia was associated with OAS1 and CTLA4 gene polymorphisms (p = 0.049 and p = 0.045, respectively); neutropenia and thrombocytopenia were associated with SOCS3 gene polymorphism (p = 0.02 and p = 0.002, respectively). In the multivariate analysis, the associations of the SOCS3 gene polymorphism with neutropenia (OR 0.26, 95%CI 0.09-0.75, p = 0.01) and thrombocytopenia (OR

Transformation of Summary Statistics from Linear Mixed Model Association on All-or-None Traits to Odds Ratio.

Science.gov (United States)

Lloyd-Jones, Luke R; Robinson, Matthew R; Yang, Jian; Visscher, Peter M

2018-04-01

Genome-wide association studies (GWAS) have identified thousands of loci that are robustly associated with complex diseases. The use of linear mixed model (LMM) methodology for GWAS is becoming more prevalent due to its ability to control for population structure and cryptic relatedness and to increase power. The odds ratio (OR) is a common measure of the association of a disease with an exposure (e.g., a genetic variant) and is readily available from logistic regression. However, when the LMM is applied to all-or-none traits it provides estimates of genetic effects on the observed 0-1 scale, a different scale to that in logistic regression. This limits the comparability of results across studies, for example in a meta-analysis, and makes the interpretation of the magnitude of an effect from an LMM GWAS difficult. In this study, we derived transformations from the genetic effects estimated under the LMM to the OR that only rely on summary statistics. To test the proposed transformations, we used real genotypes from two large, publicly available data sets to simulate all-or-none phenotypes for a set of scenarios that differ in underlying model, disease prevalence, and heritability. Furthermore, we applied these transformations to GWAS summary statistics for type 2 diabetes generated from 108,042 individuals in the UK Biobank. In both simulation and real-data application, we observed very high concordance between the transformed OR from the LMM and either the simulated truth or estimates from logistic regression. The transformations derived and validated in this study improve the comparability of results from prospective and already performed LMM GWAS on complex diseases by providing a reliable transformation to a common comparative scale for the genetic effects. Copyright © 2018 by the Genetics Society of America.
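To make the scale problem in this abstract concrete, the sketch below applies a simple first-order approximation that is often quoted for converting an effect on the observed 0-1 scale to a log odds ratio: log(OR) ≈ beta / (k(1 - k)), where k is the fraction of cases in the sample. This is an assumption-laden shortcut for small effects and is not the transformation derived in the study; the function name and inputs are hypothetical.

```python
import math


def lmm_beta_to_or(beta_01, case_fraction):
    """Approximate odds ratio from a 0-1 scale effect estimate.

    beta_01: per-allele effect on the observed 0-1 (linear) scale
    case_fraction: proportion of cases in the analysed sample
    Uses the first-order approximation log(OR) ~ beta / (k * (1 - k)),
    which is only reasonable for small effects.
    """
    if not 0.0 < case_fraction < 1.0:
        raise ValueError("case_fraction must lie strictly between 0 and 1")
    return math.exp(beta_01 / (case_fraction * (1.0 - case_fraction)))


# Hypothetical inputs: effect of 0.009 on the 0-1 scale, 10% cases.
approx_or = lmm_beta_to_or(0.009, 0.1)
```

The approximation ignores allele frequency, which is one reason a summary-statistic transformation of the kind the abstract describes is needed for accurate results.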
Statistical Analysis and Evaluation of the Depth of the Ruts on Lithuanian State Significance Roads

Directory of Open Access Journals (Sweden)

Erinijus Getautis

2011-04-01

Full Text Available The aim of this work is to gather information about rut depth on national flexible-pavement roads, to determine its statistical dispersion index, and to determine whether it meets the applicable requirements. An analysis of scientific works on the appearance of ruts in asphalt and their influence on driving is presented. Dynamic models of ruts in asphalt are presented as well. Experimental data on rut depth dispersion on the Vilnius-Kaunas national highway of Lithuania are reported. Conclusions are formulated and presented. Article in Lithuanian
The Future of Statistics as a Discipline.

Science.gov (United States)

1981-09-01

University Department of Statistics, Tallahassee, Florida 32306 ... delivered at the ... Annual Meeting of the ...from the real world. While the academicians too often fail to enrich their instruction and research with real-life problems, practitioners do not...of the American Statistical Association, 75, 575-582. Box, George E. P. (1979), "Some Problems of Statistics and Everyday Life," Journal of the American
Atrial fibrillation and acute myocardial infarction without significant coronary stenoses associated with subclinical hyperthyroidism and erythrocytosis.

Science.gov (United States)

Patanè, Salvatore; Marte, Filippo

2010-11-05

Subclinical hyperthyroidism is an increasingly recognized entity that is defined as normal serum free thyroxine and free triiodothyronine levels with a thyroid-stimulating hormone level suppressed below the normal range and usually undetectable. It has been reported that subclinical hyperthyroidism is not associated with CHD or mortality from cardiovascular causes but is sufficient to induce arrhythmias including atrial fibrillation and atrial flutter. Moreover, increased factor X activity in patients with subclinical hyperthyroidism represents a potential hypercoagulable state. Acute myocardial infarction with normal coronary arteries has also been reported in association with iatrogenic hyperthyroidism and with a myocardial bridge. Acute myocardial infarction without significant coronary stenoses has also been reported in association with subclinical hyperthyroidism. Furthermore, it has been reported that at highly increased hematocrit levels patients may experience hyperviscosity symptoms. We present a case of atrial fibrillation and acute myocardial infarction without significant coronary stenoses associated with subclinical hyperthyroidism and erythrocytosis. This case also focuses attention on the importance of a correct evaluation of subclinical hyperthyroidism. Copyright © 2008 Elsevier Ireland Ltd. All rights reserved.
Perceived Statistical Knowledge Level and Self-Reported Statistical Practice Among Academic Psychologists

Directory of Open Access Journals (Sweden)

Laura Badenes-Ribera

2018-06-01

Full Text Available Introduction: Publications arguing against the null hypothesis significance testing (NHST) procedure and in favor of good statistical practices have increased. The most frequently mentioned alternatives to NHST are effect size statistics (ES), confidence intervals (CIs), and meta-analyses. A recent survey conducted in Spain found that academic psychologists have poor knowledge about effect size statistics, confidence intervals, and graphic displays for meta-analyses, which might lead to a misinterpretation of the results. In addition, it also found that, although the use of ES is becoming generalized, the same thing is not true for CIs. Finally, academics with greater knowledge about ES statistics presented a profile closer to good statistical practice and research design. Our main purpose was to analyze the extension of these results to a different geographical area through a replication study. Methods: For this purpose, we elaborated an on-line survey that included the same items as the original research, and we asked academic psychologists to indicate their level of knowledge about ES, their CIs, and meta-analyses, and how they use them. The sample consisted of 159 Italian academic psychologists (54.09% women, mean age of 47.65 years). The mean number of years in the position of professor was 12.90 (SD = 10.21). Results: As in the original research, the results showed that, although the use of effect size estimates is becoming generalized, an under-reporting of CIs for ES persists. The most frequent ES statistics mentioned were Cohen's d and R²/η², which can have outliers or show non-normality or violate statistical assumptions. In addition, academics showed poor knowledge about meta-analytic displays (e.g., forest plot and funnel plot) and quality checklists for studies. Finally, academics with higher-level knowledge about ES statistics seem to have a profile closer to good statistical practices. Conclusions: Changing statistical practice is not
New Variant of the Universal Constants in the Perturbed Chain-Statistical Associating Fluid Theory Equation of State

DEFF Research Database (Denmark)

Liang, Xiaodong; Kontogeorgis, Georgios

2015-01-01

The Perturbed Chain-Statistical Associating Fluid Theory Equation of State (PC-SAFT EOS) has been successfully applied to model phase behavior of various types of systems, while it is also well-known that the PC-SAFT EOS has difficulties in describing some second-order derivative properties...... resolved the mostly criticized numerical pitfall, that is, the presence of more than three volume roots at real application conditions. Finally, the possibility of using the original PC-SAFT EOS parameters with the new universal constants has been investigated for the phase equilibria of the systems...
Combinatorial interpretation of Haldane-Wu fractional exclusion statistics.

Science.gov (United States)

Aringazin, A K; Mazhitov, M I

2002-08-01

Assuming that the maximal allowed number of identical particles in a state is an integer parameter, q, we derive the statistical weight and analyze the associated equation that defines the statistical distribution. The derived distribution covers Fermi-Dirac and Bose-Einstein ones in the particular cases q = 1 and q → ∞ (n_i/q → 1), respectively. We show that the derived statistical weight provides a natural combinatorial interpretation of Haldane-Wu fractional exclusion statistics, and present exact solutions of the distribution equation.
Gibbs' theorem for open systems with incomplete statistics

International Nuclear Information System (INIS)

Bagci, G.B.

2009-01-01

Gibbs' theorem, which is originally intended for canonical ensembles with complete statistics has been generalized to open systems with incomplete statistics. As a result of this generalization, it is shown that the stationary equilibrium distribution of inverse power law form associated with the incomplete statistics has maximum entropy even for open systems with energy or matter influx. The renormalized entropy definition given in this paper can also serve as a measure of self-organization in open systems described by incomplete statistics.
Understanding Statistics - Cancer Statistics

Science.gov (United States)

Annual reports of U.S. cancer statistics including new cases, deaths, trends, survival, prevalence, lifetime risk, and progress toward Healthy People targets, plus statistical summaries for a number of common cancer types.
Applied statistics in ecology: common pitfalls and simple solutions

Science.gov (United States)

E. Ashley Steel; Maureen C. Kennedy; Patrick G. Cunningham; John S. Stanovick

2013-01-01

The most common statistical pitfalls in ecological research are those associated with data exploration, the logic of sampling and design, and the interpretation of statistical results. Although one can find published errors in calculations, the majority of statistical pitfalls result from incorrect logic or interpretation despite correct numerical calculations. There...
Family-based Association Analyses of Imputed Genotypes Reveal Genome-Wide Significant Association of Alzheimer’s disease with OSBPL6, PTPRG and PDCL3

Science.gov (United States)

Herold, Christine; Hooli, Basavaraj V.; Mullin, Kristina; Liu, Tian; Roehr, Johannes T; Mattheisen, Manuel; Parrado, Antonio R.; Bertram, Lars; Lange, Christoph; Tanzi, Rudolph E.

2015-01-01

The genetic basis of Alzheimer's disease (AD) is complex and heterogeneous. Over 200 highly penetrant pathogenic variants in the genes APP, PSEN1 and PSEN2 cause a subset of early-onset familial Alzheimer's disease (EOFAD). On the other hand, susceptibility to late-onset forms of AD (LOAD) is indisputably associated with the ε4 allele in the gene APOE, and more recently with variants in more than two dozen additional genes identified in large-scale genome-wide association study (GWAS) and meta-analysis reports. Taken together, however, although the heritability of AD is estimated to be as high as 80%, a large proportion of the underlying genetic factors still remain to be elucidated. In this study we performed a systematic family-based genome-wide association and meta-analysis on close to 15 million imputed variants from three large collections of AD families (~3,500 subjects from 1,070 families). Using a multivariate phenotype combining affection status and onset age, meta-analysis of the association results revealed three single nucleotide polymorphisms (SNPs) that achieved genome-wide significance for association with AD risk: rs7609954 in the gene PTPRG (P-value = 3.98 × 10^-8), rs1347297 in the gene OSBPL6 (P-value = 4.53 × 10^-8), and rs1513625 near PDCL3 (P-value = 4.28 × 10^-8). In addition, rs72953347 in OSBPL6 (P-value = 6.36 × 10^-7) and two SNPs in the gene CDKAL1 showed marginally significant association with LOAD (rs10456232, P-value: 4.76 × 10^-7; rs62400067, P-value: 3.54 × 10^-7). In summary, family-based GWAS meta-analysis of imputed SNPs revealed novel genomic variants in (or near) PTPRG, OSBPL6, and PDCL3 that influence risk for AD with genome-wide significance. PMID:26830138
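The split between "genome-wide significant" and "marginally significant" SNPs in this abstract can be reproduced with a small sketch. The P-values below are the ones reported in the abstract; the threshold of 5 × 10^-8 is the conventional genome-wide cut-off and is an assumption here, since the abstract does not state which threshold the authors used.

```python
# Conventional genome-wide significance threshold (assumed, not stated
# in the abstract); roughly a Bonferroni correction of 0.05 for about
# one million independent common-variant tests.
GENOME_WIDE_ALPHA = 5e-8

# P-values as reported in the abstract.
reported_p = {
    "rs7609954 (PTPRG)": 3.98e-8,
    "rs1347297 (OSBPL6)": 4.53e-8,
    "rs1513625 (near PDCL3)": 4.28e-8,
    "rs72953347 (OSBPL6)": 6.36e-7,
    "rs10456232 (CDKAL1)": 4.76e-7,
    "rs62400067 (CDKAL1)": 3.54e-7,
}

genome_wide = sorted(s for s, p in reported_p.items() if p < GENOME_WIDE_ALPHA)
marginal = sorted(s for s, p in reported_p.items() if p >= GENOME_WIDE_ALPHA)

for snp in genome_wide:
    print(f"{snp}: genome-wide significant (P = {reported_p[snp]:.2e})")
```

Under that assumed threshold, the three SNPs in or near PTPRG, OSBPL6 and PDCL3 pass and the remaining three do not, which agrees with how the abstract groups them.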
Correction of the significance level when attempting multiple transformations of an explanatory variable in generalized linear models

Science.gov (United States)

2013-01-01

Background In statistical modeling, finding the most favorable coding for an explanatory quantitative variable involves many tests. This process raises a multiple testing problem and requires correction of the significance level. Methods For each coding, a test on the nullity of the coefficient associated with the new coded variable is computed. The selected coding corresponds to that associated with the largest test statistic (or equivalently the smallest p-value). In the context of the Generalized Linear Model, Liquet and Commenges (Stat Probability Lett, 71:33–38, 2005) proposed an asymptotic correction of the significance level. This procedure, based on the score test, has been developed for dichotomous and Box-Cox transformations. In this paper, we suggest the use of resampling methods to estimate the significance level for categorical transformations with more than two levels and, by definition, those that involve more than one parameter in the model. The categorical transformation is a more flexible way to explore the unknown shape of the effect between an explanatory and a dependent variable. Results The simulations we ran in this study showed good performance of the proposed methods. These methods were illustrated using the data from a study of the relationship between cholesterol and dementia. Conclusion The algorithms were implemented using R, and the associated CPMCGLM R package is available on the CRAN. PMID:23758852
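The resampling idea in this abstract can be sketched with a permutation test on the maximum statistic over several candidate codings. The sketch below is a simplified stand-in, not the CPMCGLM algorithm: it uses dichotomous cutpoints and an absolute Pearson correlation instead of a GLM score test, and all function names and data are hypothetical.

```python
import random
from statistics import mean, pstdev


def abs_corr(x, y):
    """Absolute Pearson correlation; 0.0 if either input is constant."""
    sx, sy = pstdev(x), pstdev(y)
    if sx == 0 or sy == 0:
        return 0.0
    mx, my = mean(x), mean(y)
    cov = mean([(a - mx) * (b - my) for a, b in zip(x, y)])
    return abs(cov / (sx * sy))


def max_stat(x, y, cutpoints):
    """Largest statistic over all candidate dichotomous codings of x."""
    return max(abs_corr([1.0 if v > c else 0.0 for v in x], y) for c in cutpoints)


def permutation_adjusted_p(x, y, cutpoints, n_perm=500, seed=0):
    """P-value for the best coding, corrected for having tried them all."""
    rng = random.Random(seed)
    observed = max_stat(x, y, cutpoints)
    y_perm = list(y)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(y_perm)  # break any link between x and y
        if max_stat(x, y_perm, cutpoints) >= observed:
            exceed += 1
    # Add-one correction keeps the estimate strictly positive
    return (exceed + 1) / (n_perm + 1)


# Hypothetical data with a step effect at x = 20.
x = list(range(40))
y = [0.0] * 20 + [1.0] * 20
p_adj = permutation_adjusted_p(x, y, cutpoints=[9.5, 19.5, 29.5])
```

Because the maximum is recomputed over every coding in each permuted data set, the reference distribution already accounts for the selection of the best coding, so no separate Bonferroni-style adjustment is needed.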
The prognostic significance of UCA1 for predicting clinical outcome in patients with digestive system malignancies.

Science.gov (United States)

Liu, Fang-Teng; Dong, Qing; Gao, Hui; Zhu, Zheng-Ming

2017-06-20

Urothelial Carcinoma Associated 1 (UCA1) was originally identified as an lncRNA in bladder cancer. Previous studies have reported that UCA1 plays a significant role in various types of cancer. This study aimed to clarify the prognostic value of UCA1 in digestive system cancers. Fifteen studies, comprising 1441 patients with digestive system cancers, were included in the meta-analysis. The pooled results of 14 studies indicated that high expression of UCA1 was significantly associated with poorer OS in patients with digestive system cancers (HR: 1.89, 95 % CI: 1.52-2.26). In addition, UCA1 could serve as an independent prognostic factor for predicting OS of patients (HR: 1.85, 95 % CI: 1.45-2.25). The pooled results of 3 studies indicated a significant association between UCA1 and DFS in patients with digestive system cancers (HR = 2.50; 95 % CI = 1.30-3.69). Statistical significance was also observed in subgroup meta-analysis. Furthermore, the clinicopathological values of UCA1 were discussed in esophageal cancer, colorectal cancer and pancreatic cancer. A comprehensive search was performed for studies evaluating the prognostic value of UCA1 in digestive system cancers. The databases searched included PubMed, Web of Science, Embase, Chinese National Knowledge Infrastructure and the Wanfang database. Quantitative meta-analysis was performed with standard statistical methods and the prognostic significance of UCA1 in digestive system cancers was quantified. An elevated level of UCA1 indicated a poor clinical outcome for patients with digestive system cancers. It may serve as a new biomarker related to prognosis in digestive system cancers.
Expression of HIWI in human esophageal squamous cell carcinoma is significantly associated with poorer prognosis

Directory of Open Access Journals (Sweden)

Shou Chengcao

2009-12-01

Full Text Available Abstract Background HIWI, the human homologue of the Piwi family, is present in CD34+ hematopoietic stem cells and germ cells, but not in well-differentiated cell populations, indicating that HIWI may play an important role in determining or maintaining the stemness of these cells. That HIWI expression has been detected in several types of tumours may suggest its association with clinical outcome in cancer patients. Methods Using real-time PCR, western blot, immunocytochemistry and immunohistochemistry, the expression of HIWI in three esophageal squamous cancer cell lines (KYSE70, KYSE140 and KYSE450) was characterized. Then, we investigated HIWI expression in a series of 153 esophageal squamous cell carcinomas using immunohistochemistry and explored its association with clinicopathological features. Results The expression of HIWI was observed in tumour cell nuclei and/or cytoplasm in 137 (89.5%) cases; 16 (10.5%) cases were negative in both nuclei and cytoplasm. 86 (56.2%) were strongly positive in cytoplasm, while 49 (32.0%) were strongly positive in nuclei. The expression level of HIWI in the cytoplasm of esophageal cancer cells was significantly associated with histological grade (P = 0.011), T stage (P = 0.035), and clinical outcome (P ...). Conclusion The expression of HIWI in the cytoplasm of esophageal cancer cells is significantly associated with higher histological grade, clinical stage and poorer clinical outcome, indicating its possible involvement in cancer development.
Statistics from dynamics in curved spacetime

International Nuclear Information System (INIS)

Parker, L.; Wang, Y.

1989-01-01

We consider quantum fields of spin 0, 1/2, 1, 3/2, and 2 with a nonzero mass in curved spacetime. We show that the dynamical Bogolubov transformations associated with gravitationally induced particle creation imply the connection between spin and statistics: By embedding two flat regions in a curved spacetime, we find that only when one imposes Bose-Einstein statistics for an integer-spin field and Fermi-Dirac statistics for a half-integer-spin field in the first flat region is the same type of statistics propagated from the first to the second flat region. This derivation of the flat-spacetime spin-statistics theorem makes use of curved-spacetime dynamics and does not reduce to any proof given in flat spacetime. We also show in the same manner that parastatistics, up to the fourth order, are consistent with the dynamical evolution of curved spacetime
Preovulatory progesterone concentration associates significantly to follicle number and LH concentration but not to pregnancy rate

DEFF Research Database (Denmark)

Yding Andersen, Claus; Bungum, Leif; Nyboe Andersen, Anders

2011-01-01

Using data from a large prospective randomized controlled trial that evaluated the effect of recombinant LH (rLH) co-administration for ovarian stimulation, the present study assessed whether progesterone concentration on the day of human chorionic gonadotrophin (HCG) administration was associated...... with or without rLH administration from day 6 of stimulation. There was no significant association between the late-follicular-phase progesterone concentration and the clinical pregnancy rate. However, progesterone concentration was strongly associated with the number of follicles and retrieved oocytes. Late...
Statistical properties of turbulent transport and fluctuations in tokamak and stellarator devices

Energy Technology Data Exchange (ETDEWEB)

Hidalgo, C; Pedrosa, M A; Milligen, B Van; Sanchez, E; Balbin, R; Garcia-Cortes, I [Euratom-CIEMAT Association, Madrid (Spain); Bleuel, J; Giannone, L.; Niedermeyer, H [Euratom-IPP Association, Garching (Germany)

1997-05-01

The statistical properties of fluctuations and turbulent transport have been studied in the plasma boundary region of stellarator (TJ-IU, W7-AS) and tokamak (TJ-I) devices. The local flux probability distribution function shows the bursty character of the flux and presents a systematic change as a function of the radial location. There exist large amplitude transport bursts that account for a significant part of the total flux. There is a strong similarity between the statistical properties of the turbulent fluxes in different devices. The value of the radial coherence associated with fluctuations and turbulent transport is strongly intermittent. This result emphasizes the importance of measurements with time resolution in understanding the interplay between the edge and the core regions in the plasma. For measurements in the plasma edge region of the TJ-IU torsatron, the turbulent flux does not, in general, show a larger radial coherence than the one associated with the fluctuations. (author). 14 refs, 6 figs.
The significance of platelet-associated immunoglobulin G in non-thrombocytopenic patients with systemic lupus erythematosus

DEFF Research Database (Denmark)

Sørensen, P G; Mickley, H; Fristed, P

1985-01-01

The possible pathogenetic significance of platelet-associated immunoglobulin G in systemic lupus erythematosus (SLE) has been studied, using a semiquantitative immunofluorescence technique. The study included 22 patients suffering from SLE during the period 1973-81. Thirteen patients had various ...
[Big data in official statistics].

Science.gov (United States)

Zwick, Markus

2015-08-01

The concept of "big data" stands to change the face of official statistics over the coming years, having an impact on almost all aspects of data production. The tasks of future statisticians will not necessarily be to produce new data, but rather to identify and make use of existing data to adequately describe social and economic phenomena. Until big data can be used correctly in official statistics, a lot of questions need to be answered and problems solved: the quality of data, data protection, privacy, and the sustainable availability are some of the more pressing issues to be addressed. The essential skills of official statisticians will undoubtedly change, and this implies a number of challenges to be faced by statistical education systems, in universities, and inside the statistical offices. The national statistical offices of the European Union have concluded a concrete strategy for exploring the possibilities of big data for official statistics, by means of the Big Data Roadmap and Action Plan 1.0. This is an important first step and will have a significant influence on implementing the concept of big data inside the statistical offices of Germany.
Expression features and prognostic significance of Yes-associated protein in hepatocellular carcinoma and cholangiocellular carcinoma

Directory of Open Access Journals (Sweden)

WANG Chun

2017-07-01

Objective: To investigate the expression of Yes-associated protein (YAP) in hepatocellular carcinoma (HCC) and cholangiocellular carcinoma (CC) and its association with clinical prognosis. Methods: Samples were collected from 190 patients who were treated in The Second Hospital Affiliated to Chongqing Medical University from July 2004 to July 2009, among whom 110 had HCC and 80 had CC. The difference in YAP expression and its association were analyzed in both groups, and patients' prognosis was compared between the two groups. The chi-square test was used to investigate the association between YAP expression and clinicopathological features of HCC and CC, and the Kaplan-Meier method and the log-rank test were used to assess tumor-free survival rate and overall survival rate. A univariate Cox regression analysis was used to evaluate the influence of YAP expression on the prognosis of patients with HCC and CC. Results: The CC group had higher expression of YAP than the HCC group (68.7% vs 56.3%, P=0.036). High YAP expression in HCC and CC was significantly associated with tumor size (P<0.001 and P=0.024), alpha fetoprotein (P=0.009 and 0.034), liver cirrhosis (P=0.032 and 0.006), vascular invasion (P=0.011 and 0.028), and intrahepatic metastasis (P=0.049 and 0.030). In both groups, the patients with high YAP expression had significantly lower tumor-free survival rate and overall survival rate than those with low YAP expression (all P<0.05). Multivariate analysis showed that high YAP expression is an adverse prognostic factor for tumor-free survival and overall survival in both groups (all P<0.05). Conclusion: High YAP expression is frequently found in patients with HCC and CC, and high YAP expression is associated with low survival rate.
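The chi-square test named in the abstract above can be illustrated with a minimal Python sketch for a 2x2 table. The counts below are hypothetical and are not the study's data; the function name `chi_square_2x2` is also an assumption made for illustration. The sketch uses the uncorrected Pearson statistic (no Yates continuity correction) and the identity that, for one degree of freedom, the upper-tail p-value equals `erfc(sqrt(chi2 / 2))`.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]].

    Returns (chi2, p_value) with 1 degree of freedom.
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("every row and column total must be non-zero")
    chi2 = n * (a * d - b * c) ** 2 / denom
    # For 1 degree of freedom, P(X > chi2) = erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical counts: rows = high / low marker expression,
# columns = feature present / feature absent.
chi2, p = chi_square_2x2(30, 10, 20, 20)
print(f"chi2 = {chi2:.3f}, p = {p:.4f}")
```

With small expected cell counts, Fisher's exact test is usually the safer choice than this approximation.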

Significant association of inflammation grade with the number of Langerhans cells in odontogenic keratocysts

Directory of Open Access Journals (Sweden)

Chun-Han Chang

2017-10-01

Conclusion: A significant association of inflammation grade with the number of LCs in OKCs is found. The paucity of finding LCs in the lining epithelia of OKCs without inflammation indicates the loss of immunosurveillance ability against the OKC lining epithelial cells; this can explain why OKCs have aggressive clinical behavior, a great growth potential, and a high recurrence rate.
Fade statistics of M-turbulent optical links

DEFF Research Database (Denmark)

Jurado-Navas, Antonio; Maria Garrido-Balsells, Jose; Castillo-Vazquez, Miguel

2017-01-01

A new and generalized statistical model, called Malaga or simply M distribution, has been derived recently to characterize the irradiance fluctuations of an unbounded optical wavefront propagating through a turbulent medium under all irradiance fluctuation conditions. The aforementioned model...... extends and unifies in a simple analytical closed-form expression most of the proposed statistical models for free-space optical (FSO) communications widely employed until now in the scientific literature. Based on that M model, we have studied some important features associated to its fade statistics...
Distinguish Dynamic Basic Blocks by Structural Statistical Testing

DEFF Research Database (Denmark)

Petit, Matthieu; Gotlieb, Arnaud

Statistical testing aims at generating random test data that respect selected probabilistic properties. A probability distribution is associated with the program input space in order to achieve the statistical test purpose: to test the most frequent usage of software or to maximize the probability of...... control flow path) during the test data selection. We implemented this algorithm in a statistical test data generator for Java programs. A first experimental validation is presented...
A Novel Texture Classification Procedure by using Association Rules

Directory of Open Access Journals (Sweden)

L. Jaba Sheela

2008-11-01

Texture can be defined as a local statistical pattern of texture primitives in the observer’s domain of interest. Texture classification aims to assign texture labels to unknown textures, according to training samples and classification rules. Association rules have been used in various applications during the past decades. Association rules capture both structural and statistical information, and automatically identify the structures that occur most frequently and relationships that have significant discriminative power. So, association rules can be adapted to capture frequently occurring local structures in textures. This paper describes the use of association rules for the texture classification problem. The experimental studies performed show the effectiveness of the association rules. The overall success rate is about 98%.
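As a rough illustration of the two quantities on which association rules are usually ranked, the following Python sketch computes support and confidence over a handful of item sets. Everything in it is an assumption made for illustration: the feature names, the idea of treating each image window as one "transaction", and the function names do not come from the paper.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = frozenset(itemset)
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimated P(consequent | antecedent) for the rule antecedent -> consequent."""
    antecedent = frozenset(antecedent)
    both = antecedent | frozenset(consequent)
    sup_a = support(transactions, antecedent)
    if sup_a == 0:
        return 0.0
    return support(transactions, both) / sup_a


# Hypothetical data: each set holds the quantised local features of one window.
windows = [
    frozenset({"edge_h", "bright"}),
    frozenset({"edge_h", "bright", "coarse"}),
    frozenset({"edge_v", "dark"}),
    frozenset({"edge_h", "bright"}),
]

print(support(windows, {"edge_h", "bright"}))       # 3 of 4 windows
print(confidence(windows, {"edge_h"}, {"bright"}))  # every edge_h window is bright
```

Rules with high support describe frequently occurring local structures, while high confidence indicates a reliable co-occurrence; a classifier would typically keep rules that pass thresholds on both.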
Brownian quasi-particles in statistical physics

International Nuclear Information System (INIS)

Tellez-Arenas, A.; Fronteau, J.; Combis, P.

1979-01-01

The idea of a Brownian quasi-particle and the associated differentiable flow (with nonselfadjoint forces) are used here in the context of a stochastic description of the approach towards statistical equilibrium. We show that this quasi-particle flow acquires, at equilibrium, the principal properties of a conservative Hamiltonian flow. Thus the model of Brownian quasi-particles permits us to establish a link between the stochastic description and the Gibbs description of statistical equilibrium
National Statistical Commission and Indian Official Statistics*

Indian Academy of Sciences (India)

IAS Admin

a good collection of official statistics of that time. With more .... statistical agencies and institutions to provide details of statistical activities .... ing several training programmes. .... ful completion of Indian Statistical Service examinations, the.
Quantifying the Clinical Significance of Cannabis Withdrawal

Science.gov (United States)

Allsop, David J.; Copeland, Jan; Norberg, Melissa M.; Fu, Shanlin; Molnar, Anna; Lewis, John; Budney, Alan J.

2012-01-01

Background and Aims: Questions over the clinical significance of cannabis withdrawal have hindered its inclusion as a discrete cannabis induced psychiatric condition in the Diagnostic and Statistical Manual of Mental Disorders (DSM IV). This study aims to quantify functional impairment to normal daily activities from cannabis withdrawal, and looks at the factors predicting functional impairment. In addition the study tests the influence of functional impairment from cannabis withdrawal on cannabis use during and after an abstinence attempt. Methods and Results: A volunteer sample of 49 non-treatment seeking cannabis users who met DSM-IV criteria for dependence provided daily withdrawal-related functional impairment scores during a one-week baseline phase and two weeks of monitored abstinence from cannabis with a one month follow up. Functional impairment from withdrawal symptoms was strongly associated with symptom severity (p = 0.0001). Participants with more severe cannabis dependence before the abstinence attempt reported greater functional impairment from cannabis withdrawal (p = 0.03). Relapse to cannabis use during the abstinence period was associated with greater functional impairment from a subset of withdrawal symptoms in high dependence users. Higher levels of functional impairment during the abstinence attempt predicted higher levels of cannabis use at one month follow up (p = 0.001). Conclusions: Cannabis withdrawal is clinically significant because it is associated with functional impairment to normal daily activities, as well as relapse to cannabis use. Sample size in the relapse group was small and the use of a non-treatment seeking population requires findings to be replicated in clinical samples. Tailoring treatments to target withdrawal symptoms contributing to functional impairment during a quit attempt may improve treatment outcomes. PMID:23049760
Quantifying the clinical significance of cannabis withdrawal.

Directory of Open Access Journals (Sweden)

David J Allsop

Questions over the clinical significance of cannabis withdrawal have hindered its inclusion as a discrete cannabis induced psychiatric condition in the Diagnostic and Statistical Manual of Mental Disorders (DSM IV). This study aims to quantify functional impairment to normal daily activities from cannabis withdrawal, and looks at the factors predicting functional impairment. In addition the study tests the influence of functional impairment from cannabis withdrawal on cannabis use during and after an abstinence attempt. A volunteer sample of 49 non-treatment seeking cannabis users who met DSM-IV criteria for dependence provided daily withdrawal-related functional impairment scores during a one-week baseline phase and two weeks of monitored abstinence from cannabis with a one month follow up. Functional impairment from withdrawal symptoms was strongly associated with symptom severity (p=0.0001). Participants with more severe cannabis dependence before the abstinence attempt reported greater functional impairment from cannabis withdrawal (p=0.03). Relapse to cannabis use during the abstinence period was associated with greater functional impairment from a subset of withdrawal symptoms in high dependence users. Higher levels of functional impairment during the abstinence attempt predicted higher levels of cannabis use at one month follow up (p=0.001). Cannabis withdrawal is clinically significant because it is associated with functional impairment to normal daily activities, as well as relapse to cannabis use. Sample size in the relapse group was small and the use of a non-treatment seeking population requires findings to be replicated in clinical samples. Tailoring treatments to target withdrawal symptoms contributing to functional impairment during a quit attempt may improve treatment outcomes.
Official Statistics and Statistics Education: Bridging the Gap

Directory of Open Access Journals (Sweden)

Gal Iddo

2017-03-01

This article aims to challenge official statistics providers and statistics educators to ponder on how to help non-specialist adult users of statistics develop those aspects of statistical literacy that pertain to official statistics. We first document the gap in the literature in terms of the conceptual basis and educational materials needed for such an undertaking. We then review skills and competencies that may help adults to make sense of statistical information in areas of importance to society. Based on this review, we identify six elements related to official statistics about which non-specialist adult users should possess knowledge in order to be considered literate in official statistics: (1) the system of official statistics and its work principles; (2) the nature of statistics about society; (3) indicators; (4) statistical techniques and big ideas; (5) research methods and data sources; and (6) awareness and skills for citizens’ access to statistical reports. Based on this ad hoc typology, we discuss directions that official statistics providers, in cooperation with statistics educators, could take in order to (1) advance the conceptualization of skills needed to understand official statistics, and (2) expand educational activities and services, specifically by developing a collaborative digital textbook and a modular online course, to improve public capacity for understanding of official statistics.
TREAT (TREe-based Association Test)

Science.gov (United States)

TREAT is an R package for detecting complex joint effects in case-control studies. The test statistic is derived from a tree-structure model by recursively partitioning the data. An ultra-fast algorithm is designed to evaluate the significance of the association between a candidate gene and disease outcome.
Epilepsy and occupational accidents in Brazil: a national statistics study.

Science.gov (United States)

Lunardi, Mariana dos Santos; Soliman, Lucas Alexandre Pedrollo; Pauli, Carla; Lin, Katia

2011-01-01

Epilepsy may restrict the patient's daily life. It causes lower quality of life and increased risk for work-related accidents (WRA). The aim of this study is to analyze the implantation of the Epidemiologic and Technical Security System Nexus (ETSSN) and WRA patterns among patients with epilepsy. Data regarding WRA, between 1999 and 2008, on the historical database of WRA Infolog Statistical Yearbook from Brazilian Ministry of Social Security were reviewed. There was a significant increase of reported cases during the ten year period, mainly after the establishment of the ETSSN. The increased granted benefits evidenced the epidemiologic association between epilepsy and WRA. ETSSN possibly raised the registration of occupational accidents and granted benefits. However, the real number of WRA may remain underestimated due to informal economy and house workers' accidents which are usually not included in the official statistics in Brazil.
Application of extended statistical combination of uncertainties methodology for digital nuclear power plants

Energy Technology Data Exchange (ETDEWEB)

In, Wang Ki; Uh, Keun Sun; Chul, Kim Heui [Korea Atomic Energy Research Institute, Taejon (Korea, Republic of)

1995-02-01

A technically more direct statistical combination of uncertainties methodology, extended SCU (XSCU), was applied to statistically combine the uncertainties associated with the DNBR alarm setpoint and the DNBR trip setpoint of digital nuclear power plants. The modified SCU (MSCU) methodology is currently used as the USNRC approved design methodology to perform the same function. In this report, the MSCU and XSCU methodologies were compared in terms of the total uncertainties and the net margins to the DNBR alarm and trip setpoints. The MSCU methodology resulted in the small total penalties due to a significantly negative bias which are quite large. However, the XSCU methodology gave virtually unbiased total uncertainties. The net margins to the DNBR alarm and trip setpoints by the MSCU methodology agree with those by the XSCU methodology within statistical variations. (Author) 12 refs., 17 figs., 5 tabs.
Expression of HIWI in human esophageal squamous cell carcinoma is significantly associated with poorer prognosis

International Nuclear Information System (INIS)

He, Wei; Wang, Zhihui; Wang, Qi; Fan, Qingxia; Shou, Chengcao; Wang, Junsheng; Giercksky, Karl-Erik; Nesland, Jahn M; Suo, Zhenhe

2009-01-01

HIWI, the human homologue of the Piwi family, is present in CD34+ hematopoietic stem cells and germ cells, but not in well-differentiated cell populations, indicating that HIWI may play an important role in determining or maintaining stemness of these cells. That HIWI expression has been detected in several tumour types may suggest its association with clinical outcome in cancer patients. With the methods of real-time PCR, western blot, immunocytochemistry and immunohistochemistry, the expression of HIWI in three esophageal squamous cancer cell lines KYSE70, KYSE140 and KYSE450 has been characterized. Then, we investigated HIWI expression in a series of 153 esophageal squamous cell carcinomas using immunohistochemistry and explored its association with clinicopathological features. The expression of HIWI was observed in tumour cell nuclei or/and cytoplasm in 137 (89.5%) cases; 16 (10.5%) cases were negative in both nuclei and cytoplasm. 86 (56.2%) were strongly positive in cytoplasm, while 49 (32.0%) were strongly positive in nuclei. The expression level of HIWI in cytoplasm of esophageal cancer cells was significantly associated with histological grade (P = 0.011), T stage (P = 0.035), and clinical outcome (P < 0.001), while there was no correlation between the nuclear HIWI expression and clinicopathological features. The expression of HIWI in the cytoplasm of esophageal cancer cells is significantly associated with higher histological grade, clinical stage and poorer clinical outcome, indicating its possible involvement in cancer development.
Genetic polymorphisms associated with breast cancer in Malaysian cohort.

Science.gov (United States)

Chahil, Jagdish Kaur; Munretnam, Khamsigan; Samsudin, Nurulhafizah; Lye, Say Hean; Hashim, Nikman Adli Nor; Ramzi, Nurul Hanis; Velapasamy, Sharmila; Wee, Ler Lian; Alex, Livy

2015-04-01

Genome-wide association studies have discovered multiple single nucleotide polymorphisms (SNPs) associated with the risk of common diseases. The objective of this study was to demonstrate the replication of previously published SNPs that showed statistical significance for breast cancer in the Malaysian population. In this case-control study, 80 subjects for each group were recruited from various hospitals in Malaysia. A total of 768 SNPs were genotyped and analyzed to distinguish risk and protective alleles. A total of three SNPs were found to be associated with increased risk of breast cancer while six SNPs showed a protective effect. All nine were statistically significant SNPs (p ≤ 0.01); five SNPs from previous studies were successfully replicated in our study. Significant modifiable (diet) and non-modifiable (family history of breast cancer in a first degree relative) risk factors were also observed. We identified nine SNPs from this study as conferring either susceptibility to or protection from breast cancer, which may serve as potential markers in risk prediction.
ER, p53 and MIB-1 are significantly associated with malignant phyllodes tumor.

Science.gov (United States)

Munawer, Nurhayati H; Md Zin, Reena; Md Ali, Siti-Aishah; Muhammad, Rohaizak; Ali, Jasmi; Das, Srijit

2012-01-01

Fibroadenomas (FA) are common while phyllodes tumors (PT) are rare and both tumors are composed of epithelial and stromal components. We evaluated the expression status of ER, Bcl2, p53, and MIB-1 protein in these tumors. One hundred and ninety-three tumors comprising 117 FAs and 76 PTs were examined using immunohistochemistry on tissue microarray. The mean age of patients with FA was 28.5 years while the mean ages of patients with benign, borderline and malignant PTs were 41.7, 48.6 and 42.1 years, respectively. Also all types of PTs were large (>5 cm). ER showed a strong nuclear staining in the epithelial component of all tumors while ERβ immunoreactivity was detected in both the epithelial and stromal components of FA and PT. ER/β (pcomponent were associated with tumor size. p53 expression was significantly associated with both the epithelial and stromal components of malignant PTs (pcomponent (p=0.000). In addition, MIB-1 was also found to be associated with ER and ERβ in the stromal component (p=0.000). The expression of p53 with tumor size and histological grade in PT may increase the risk for malignancy.
A statistical approach to plasma profile analysis

International Nuclear Information System (INIS)

Kardaun, O.J.W.F.; McCarthy, P.J.; Lackner, K.; Riedel, K.S.

1990-05-01

A general statistical approach to the parameterisation and analysis of tokamak profiles is presented. The modelling of the profile dependence on both the radius and the plasma parameters is discussed, and pertinent, classical as well as robust, methods of estimation are reviewed. Special attention is given to statistical tests for discriminating between the various models, and to the construction of confidence intervals for the parameterised profiles and the associated global quantities. The statistical approach is shown to provide a rigorous approach to the empirical testing of plasma profile invariance. (orig.)
Clinical and diagnostic significance of urinary neutrophil gelatinase-associated lipocalin-2 measurement in children with microbial inflammatory kidney and urinary tract diseases

Directory of Open Access Journals (Sweden)

A. V. Eremeeva

2015-01-01

Objective: to study the clinical and diagnostic significance of urinary neutrophil gelatinase-associated lipocalin-2 (NGAL) measurement in children with urinary tract infection (n=15) and pyelonephritis (n=15). The patients' age was 1 to 16 years (mean age, 7.32±4.52 years). The diagnosis was verified on the basis of clinical and laboratory findings and medical history and instrumental examination data. Urinary NGAL levels were measured by enzyme immunoassay (a BioVendor Laboratory Medicine kit) and calculated with reference to mg of creatinine. Urinary NGAL levels were established to depend on the degree of renal parenchymal damage. The investigation showed a relationship between the excretion of NGAL during the acute phase of pyelonephritis and the detection of renal scarring, as evidenced by static DMSA nephroscintigraphy. The acute pyelonephritis group exhibited a moderate direct correlation between the renal excretion of NGAL and the degree of leukocytosis and the blood levels of C-reactive protein. The findings allow recommendations for measuring urinary NGAL levels as an additional noninvasive marker for the early detection of renal parenchymal damage.
A New Statistical Approach to Characterize Chemical-Elicited Behavioral Effects in High-Throughput Studies Using Zebrafish.

Directory of Open Access Journals (Sweden)

Guozhu Zhang

Zebrafish have become an important alternative model for characterizing chemical bioactivity, partly due to the efficiency at which systematic, high-dimensional data can be generated. However, these new data present analytical challenges associated with scale and diversity. We developed a novel, robust statistical approach to characterize chemical-elicited effects in behavioral data from high-throughput screening (HTS) of all 1,060 Toxicity Forecaster (ToxCast™) chemicals across 5 concentrations at 120 hours post-fertilization (hpf). Taking advantage of the immense scale of data for a global view, we show that this new approach reduces bias introduced by extreme values yet allows for diverse response patterns that confound the application of traditional statistics. We have also shown that, as a summary measure of response for local tests of chemical-associated behavioral effects, it achieves a significant reduction in coefficient of variation compared to many traditional statistical modeling methods. This effective increase in signal-to-noise ratio augments statistical power and is observed across experimental periods (light/dark conditions) that display varied distributional response patterns. Finally, we integrated results with data from concomitant developmental endpoint measurements to show that appropriate statistical handling of HTS behavioral data can add important biological context that informs mechanistic hypotheses.
Statistical Optics

Science.gov (United States)

Goodman, Joseph W.

2000-07-01

The Wiley Classics Library consists of selected books that have become recognized classics in their respective fields. With these new unabridged and inexpensive editions, Wiley hopes to extend the life of these important works by making them available to future generations of mathematicians and scientists. Currently available in the Series: T. W. Anderson The Statistical Analysis of Time Series T. S. Arthanari & Yadolah Dodge Mathematical Programming in Statistics Emil Artin Geometric Algebra Norman T. J. Bailey The Elements of Stochastic Processes with Applications to the Natural Sciences Robert G. Bartle The Elements of Integration and Lebesgue Measure George E. P. Box & Norman R. Draper Evolutionary Operation: A Statistical Method for Process Improvement George E. P. Box & George C. Tiao Bayesian Inference in Statistical Analysis R. W. Carter Finite Groups of Lie Type: Conjugacy Classes and Complex Characters R. W. Carter Simple Groups of Lie Type William G. Cochran & Gertrude M. Cox Experimental Designs, Second Edition Richard Courant Differential and Integral Calculus, Volume I Richard Courant Differential and Integral Calculus, Volume II Richard Courant & D. Hilbert Methods of Mathematical Physics, Volume I Richard Courant & D. Hilbert Methods of Mathematical Physics, Volume II D. R. Cox Planning of Experiments Harold S. M. Coxeter Introduction to Geometry, Second Edition Charles W. Curtis & Irving Reiner Representation Theory of Finite Groups and Associative Algebras Charles W. Curtis & Irving Reiner Methods of Representation Theory with Applications to Finite Groups and Orders, Volume I Charles W. Curtis & Irving Reiner Methods of Representation Theory with Applications to Finite Groups and Orders, Volume II Cuthbert Daniel Fitting Equations to Data: Computer Analysis of Multifactor Data, Second Edition Bruno de Finetti Theory of Probability, Volume I Bruno de Finetti Theory of Probability, Volume 2 W. Edwards Deming Sample Design in Business Research
[Clinical significance of drug resistance-associated mutations in treatment of hepatitis C with direct-acting antiviral agents].

Science.gov (United States)

Li, Z; Chen, Z W; Ren, H; Hu, P

2017-03-20

Direct-acting antiviral agents (DAAs) achieve a high sustained virologic response rate in the treatment of chronic hepatitis C virus infection. However, drug resistance-associated mutations play an important role in treatment failure and have attracted more and more attention. This article elaborates on the clinical significance of drug resistance-associated mutations from the aspects of their definition, association with genotype, known drug resistance-associated mutations and their prevalence rates, the impact of drug resistance-associated mutations on treatment naive and treatment-experienced patients, and the role of clinical detection, in order to provide a reference for clinical regimens with DAAs and help to achieve higher sustained virologic response rates.

Fracture risk associated with use of antiepileptic drugs.

Science.gov (United States)

Vestergaard, Peter; Rejnmark, Lars; Mosekilde, Leif

2004-11-01

To assess fracture risk associated with different antiepileptic drugs (AEDs). An increased fracture risk has been reported in patients with epilepsy. Classical AEDs have been associated with decreased bone mineral density. The effects of newer AEDs are unknown. We undertook a population-based pharmacoepidemiologic case-control study with any fracture as outcome and use of AEDs as exposure variables (124,655 fracture cases and 373,962 controls). All AEDs were associated with an increased fracture risk in an unadjusted analysis. After adjustment for prior fracture, use (ever) of corticosteroids, comorbidity, social variables, and diagnosis of epilepsy, carbamazepine [CBZ; odds ratio (OR), 1.18; 95% confidence interval (CI), 1.10-1.26], oxcarbazepine (OXC; 1.14, 1.03-1.26), clonazepam (CZP; 1.27, 1.15-1.41), phenobarbital (PB; 1.79, 1.64-1.95), and valproate (VPA; 1.15, 1.05-1.26) were statistically significantly associated with risk of any fracture. Ethosuximide (0.75, 0.37-1.52), lamotrigine (1.04, 0.91-1.19), phenytoin (1.20, 1.00-1.43), primidone (1.18, 0.95-1.48), tiagabine (0.75, 0.40-1.41), topiramate (1.39, 0.99-1.96), and vigabatrin (0.93, 0.70-1.22) were not statistically significantly associated with fracture risk after adjustment for confounders. The relative increase was modest and in the same range for the significant and nonsignificant results. CBZ, PB, OXC, and VPA displayed a dose-response relation. Fracture risk was more increased by liver enzyme-inducing AEDs (OR, 1.38; 95% CI, 1.31-1.45) than by noninducing AEDs (1.19; 95% CI, 1.11-1.27). A very limited increased fracture risk is present in users of CBZ, CZP, OXC, PB, and VPA. A limited significant increase cannot be excluded for the other AEDs because of the statistical power.
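The link between the reported confidence intervals and the word "significantly" can be made concrete with a short Python sketch. It assumes the intervals are ordinary Wald intervals, symmetric on the log-odds scale with a critical value of 1.96; the abstract does not state this, and the function names are hypothetical. The three drug entries use the odds ratios and intervals quoted above.

```python
import math


def se_from_ci(lower, upper, z_crit=1.96):
    """Standard error of log(OR), assuming a Wald interval on the log scale."""
    return (math.log(upper) - math.log(lower)) / (2 * z_crit)


def z_statistic(odds_ratio, lower, upper):
    """Approximate z statistic for the null hypothesis OR = 1."""
    return math.log(odds_ratio) / se_from_ci(lower, upper)


def ci_excludes_one(lower, upper):
    """True when the interval lies strictly above or strictly below 1."""
    return lower > 1.0 or upper < 1.0


# (odds ratio, lower bound, upper bound) as quoted in the abstract.
reported = {
    "carbamazepine": (1.18, 1.10, 1.26),
    "phenytoin": (1.20, 1.00, 1.43),
    "topiramate": (1.39, 0.99, 1.96),
}

for drug, (or_, lo, hi) in reported.items():
    print(f"{drug}: z = {z_statistic(or_, lo, hi):.2f}, "
          f"interval excludes 1: {ci_excludes_one(lo, hi)}")
```

Under these assumptions the sketch also shows why a larger point estimate, such as that for topiramate, can still be non-significant: its interval is much wider, which is the statistical power issue the authors mention.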
Study designs, use of statistical tests, and statistical analysis software choice in 2015: Results from two Pakistani monthly Medline indexed journals.

Science.gov (United States)

Shaikh, Masood Ali

2017-09-01

Assessment of research articles in terms of study designs used, statistical tests applied and the use of statistical analysis programmes helps determine the research activity profile and trends in the country. In this descriptive study, all original articles published by the Journal of Pakistan Medical Association (JPMA) and the Journal of the College of Physicians and Surgeons Pakistan (JCPSP) in the year 2015 were reviewed in terms of study designs used, application of statistical tests, and the use of statistical analysis programmes. JPMA and JCPSP published 192 and 128 original articles, respectively, in the year 2015. Results of this study indicate that the cross-sectional study design, bivariate inferential statistical analysis entailing comparison between two variables/groups, and the statistical software programme SPSS were the most common study design, inferential statistical analysis, and statistical analysis software programme, respectively. These results echo the previously published assessment of these two journals for the year 2014.
Mining, Validation, and Clinical Significance of Colorectal Cancer (CRC)-Associated lncRNAs.

Science.gov (United States)

Sun, Xiangwei; Hu, Yingying; Zhang, Liang; Hu, Changyuan; Guo, Gangqiang; Mao, Chenchen; Xu, Jianfeng; Ye, Sisi; Huang, Guanli; Xue, Xiangyang; Guo, Aizhen; Shen, Xian

2016-01-01

Colorectal cancer (CRC) is one of the deadliest tumours, but its pathogenesis remains unclear. The involvement of differentially expressed long non-coding RNAs (lncRNAs) in CRC tumorigenesis makes them suitable tumour biomarkers. Here, we screened 150 cases of CRC and 85 cases of paracancerous tissues in the GEO database for differentially expressed lncRNAs. The levels of lncRNA candidates in 84 CRC and paracancerous tissue samples were validated by qRT-PCR and their clinical significance was analyzed. We identified 15 lncRNAs with differential expression in CRC tumours; among them, AK098081 was significantly up-regulated, whereas AK025209, BC040303, BC037331, AK026659, and CR749831 were down-regulated in CRC. In a receiver operating characteristic curve analysis, the area under the curve for the six lncRNAs was 0.914. High expression of AK098081 and low expression of BC040303, CR749831, and BC037331 indicated poor CRC differentiation. CRC patients with lymph node metastasis had lower expression of BC037331. In addition, the group with high AK098081 expression presented significantly lower overall survival and disease-free survival rates than the low-expression group, confirming AK098081 as an independent risk factor for CRC patients. In conclusion, we have identified multiple CRC-associated lncRNAs from microarray expression profiles that can serve as novel biomarkers for the diagnosis and prognosis of CRC.
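The area under the ROC curve quoted above has a simple interpretation: it is the probability that a randomly chosen tumour sample receives a higher score than a randomly chosen non-tumour sample. The Python sketch below computes it directly from that definition. The scores are hypothetical and the function name `auc` is an assumption; the abstract does not describe how the six lncRNAs were combined into one score.

```python
def auc(pos_scores, neg_scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties count as half a correct pair (the Mann-Whitney convention).
    """
    if not pos_scores or not neg_scores:
        raise ValueError("both groups need at least one score")
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


# Hypothetical combined scores, not data from the study.
tumour = [0.9, 0.8, 0.4]
paracancerous = [0.5, 0.3, 0.2]
print(f"AUC = {auc(tumour, paracancerous):.3f}")
```

The pairwise loop is quadratic in the number of samples, which is fine for a few hundred samples; a rank-based formula is the usual choice for larger data sets.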
Review of the Statistical Techniques in Medical Sciences | Okeh ...

African Journals Online (AJOL)

... medical researcher in selecting the appropriate statistical techniques. Of course, all statistical techniques have certain underlying assumptions, which must be checked before the technique is applied. Keywords: Variable, Prospective Studies, Retrospective Studies, Statistical significance. Bio-Research Vol. 6 (1) 2008: pp.
Prognostic significance of macrophage invasion in hilar cholangiocarcinoma

International Nuclear Information System (INIS)

Atanasov, Georgi; Hau, Hans-Michael; Dietel, Corinna; Benzing, Christian; Krenzien, Felix; Brandl, Andreas; Wiltberger, Georg; Matia, Ivan; Prager, Isabel; Schierle, Katrin; Robson, Simon C.; Reutzel-Selke, Anja; Pratschke, Johann; Schmelzle, Moritz; Jonas, Sven

2015-01-01

Tumor-associated macrophages (TAMs) promote tumor progression and have an effect on survival in human cancer. However, little is known regarding their influence on tumor progression and prognosis in human hilar cholangiocarcinoma. We analyzed surgically resected tumor specimens of hilar cholangiocarcinoma (n = 47) for distribution and localization of TAMs, as defined by expression of CD68. Abundance of TAMs was correlated with clinicopathologic characteristics, tumor recurrence and patients’ survival. Statistical analysis was performed using SPSS software. Patients with a high density of TAMs in the tumor invasive front (TIF) showed significantly higher local and overall tumor recurrence (both p < 0.05). Furthermore, a high density of TAMs was associated with decreased overall (one-year 83.6% vs. 75.1%; three-year 61.3% vs. 42.4%; both p < 0.05) and recurrence-free survival (one-year 93.9% vs. 57.4%; three-year 59.8% vs. 26.2%; both p < 0.05). TAMs in the TIF and tumor recurrence were confirmed as the only independent prognostic variables in the multivariate survival analysis (all p < 0.05). Overall survival and recurrence-free survival of patients with hilar cholangiocarcinoma were significantly improved in patients with low levels of TAMs in the area of the TIF, compared to those with a high density of TAMs. These observations suggest their utilization as valuable prognostic markers in routine histopathologic evaluation, and might indicate future therapeutic approaches targeting TAMs.
On the statistical assessment of classifiers using DNA microarray data

Directory of Open Access Journals (Sweden)

Carella M

2006-08-01

Background. In this paper we present a method for the statistical assessment of cancer predictors which make use of gene expression profiles. The methodology is applied to a new data set of microarray gene expression data collected in Casa Sollievo della Sofferenza Hospital, Foggia – Italy. The data set is made up of normal (22) and tumor (25) specimens extracted from 25 patients affected by colon cancer. We propose to give answers to some questions which are relevant for the automatic diagnosis of cancer such as: Is the size of the available data set sufficient to build accurate classifiers? What is the statistical significance of the associated error rates? In what ways can accuracy be considered dependent on the adopted classification scheme? How many genes are correlated with the pathology and how many are sufficient for an accurate colon cancer classification? The method we propose answers these questions whilst avoiding the potential pitfalls hidden in the analysis and interpretation of microarray data. Results. We estimate the generalization error, evaluated through the Leave-K-Out Cross Validation error, for three different classification schemes by varying the number of training examples and the number of the genes used. The statistical significance of the error rate is measured by using a permutation test. We provide a statistical analysis in terms of the frequencies of the genes involved in the classification. Using the whole set of genes, we found that the Weighted Voting Algorithm (WVA) classifier learns the distinction between normal and tumor specimens with 25 training examples, providing e = 21% (p = 0.045) as an error rate. This remains constant even when the number of examples increases. Moreover, Regularized Least Squares (RLS) and Support Vector Machines (SVM) classifiers can learn with only 15 training examples, with an error rate of e = 19% (p = 0.035) and e = 18% (p = 0.037) respectively. Moreover, the error rate
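The idea of attaching a p-value to a cross-validated error rate by a permutation test can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the authors' method: it uses leave-one-out cross-validation (the K = 1 case), a simple nearest-centroid classifier as a stand-in for WVA, RLS or SVM, and synthetic data. All function names and the data are hypothetical.

```python
import random


def nearest_centroid_predict(train_x, train_y, x):
    """Predict the label whose class centroid is closest to x (stand-in classifier)."""
    best_label, best_dist = None, float("inf")
    for label in sorted(set(train_y)):
        rows = [r for r, y in zip(train_x, train_y) if y == label]
        centroid = [sum(col) / len(rows) for col in zip(*rows)]
        dist = sum((a - b) ** 2 for a, b in zip(x, centroid))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def loo_error(xs, ys):
    """Leave-one-out cross-validation error rate."""
    wrong = 0
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        if nearest_centroid_predict(train_x, train_y, xs[i]) != ys[i]:
            wrong += 1
    return wrong / len(xs)


def permutation_p_value(xs, ys, n_perm=200, seed=0):
    """Return (observed error, p-value): how often shuffled labels do at least as well."""
    rng = random.Random(seed)
    observed = loo_error(xs, ys)
    hits = 0
    for _ in range(n_perm):
        shuffled = ys[:]
        rng.shuffle(shuffled)  # break any real link between profiles and labels
        if loo_error(xs, shuffled) <= observed:
            hits += 1
    # The +1 terms count the observed labelling itself as one permutation.
    return observed, (hits + 1) / (n_perm + 1)


# Synthetic stand-in for expression profiles: 12 "normal" and 12 "tumor" samples.
data_rng = random.Random(1)
samples = [[data_rng.gauss(0.0, 1.0) for _ in range(5)] for _ in range(12)]
samples += [[data_rng.gauss(2.0, 1.0) for _ in range(5)] for _ in range(12)]
labels = [0] * 12 + [1] * 12

error, p_value = permutation_p_value(samples, labels)
print(f"LOO error = {error:.2f}, permutation p = {p_value:.3f}")
```

A small p-value here means that an error rate as low as the observed one is rarely reached when the class labels carry no information, which is the sense in which an error rate such as e = 21% can be called statistically significant.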
Spectral and cross-spectral analysis of uneven time series with the smoothed Lomb-Scargle periodogram and Monte Carlo evaluation of statistical significance

Science.gov (United States)

Pardo-Igúzquiza, Eulogio; Rodríguez-Tovar, Francisco J.

2012-12-01

Many spectral analysis techniques have been designed assuming sequences taken with a constant sampling interval. However, there are empirical time series in the geosciences (sediment cores, fossil abundance data, isotope analysis, …) that do not follow regular sampling because of missing data, gapped data, random sampling or incomplete sequences, among other reasons. In general, interpolating an uneven series in order to obtain a succession with a constant sampling interval alters the spectral content of the series. In such cases it is preferable to follow an approach that works with the uneven data directly, avoiding the need for an explicit interpolation step. The Lomb-Scargle periodogram is a popular choice in such circumstances, as there are programs available in the public domain for its computation. One new computer program for spectral analysis improves the standard Lomb-Scargle periodogram approach in two ways: (1) It explicitly adjusts the statistical significance to any bias introduced by variance reduction smoothing, and (2) it uses a permutation test to evaluate confidence levels, which is better suited than parametric methods when neighbouring frequencies are highly correlated. Another novel program for cross-spectral analysis offers the advantage of estimating the Lomb-Scargle cross-periodogram of two uneven time series defined on the same interval, and it evaluates the confidence levels of the estimated cross-spectra by a non-parametric computer intensive permutation test. Thus, the cross-spectrum, the squared coherence spectrum, the phase spectrum, and the Monte Carlo statistical significance of the cross-spectrum and the squared-coherence spectrum can be obtained. Both of the programs are written in ANSI Fortran 77, in view of its simplicity and compatibility. The program code is of public domain, provided on the website of the journal (http://www.iamg.org/index.php/publisher/articleview/frmArticleID/112/). Different examples (with simulated and
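To make the approach concrete, the following is a minimal sketch of a classical (unsmoothed) Lomb-Scargle periodogram for unevenly sampled data, with a permutation test on the highest peak. It is not the authors' Fortran 77 program and omits their smoothing step; using the maximum power over all frequencies as the test statistic, the synthetic series, and all names are assumptions made for illustration.

```python
import math
import random


def lomb_scargle(times, values, freqs):
    """Classical normalised Lomb-Scargle power at each frequency (cycles per time unit)."""
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values) / (n - 1)
    centred = [v - mean for v in values]
    power = []
    for f in freqs:
        w = 2.0 * math.pi * f
        # Time offset tau makes the sine and cosine terms orthogonal.
        s2 = sum(math.sin(2.0 * w * t) for t in times)
        c2 = sum(math.cos(2.0 * w * t) for t in times)
        tau = math.atan2(s2, c2) / (2.0 * w)
        cs = [math.cos(w * (t - tau)) for t in times]
        sn = [math.sin(w * (t - tau)) for t in times]
        yc = sum(a * b for a, b in zip(centred, cs))
        ys = sum(a * b for a, b in zip(centred, sn))
        cc = sum(c * c for c in cs)
        ss = sum(s * s for s in sn)
        power.append((yc * yc / cc + ys * ys / ss) / (2.0 * var))
    return power


def peak_permutation_test(times, values, freqs, n_perm=100, seed=0):
    """p-value of the highest peak: shuffle values over the fixed sampling times."""
    rng = random.Random(seed)
    observed = max(lomb_scargle(times, values, freqs))
    shuffled = list(values)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if max(lomb_scargle(times, shuffled, freqs)) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)


# Synthetic uneven series: a 0.1 cycles/unit sinusoid plus noise at random times.
data_rng = random.Random(42)
times = sorted(data_rng.uniform(0.0, 100.0) for _ in range(60))
values = [math.sin(2.0 * math.pi * 0.1 * t) + data_rng.gauss(0.0, 0.5) for t in times]
freqs = [0.01 + 0.005 * k for k in range(49)]

power = lomb_scargle(times, values, freqs)
peak_freq = freqs[power.index(max(power))]
p_value = peak_permutation_test(times, values, freqs)
print(f"peak at f = {peak_freq:.3f}, permutation p = {p_value:.3f}")
```

Shuffling the values while keeping the sampling times fixed preserves the uneven sampling pattern, so the null distribution of the peak height reflects the same gaps and irregularities as the observed series.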
Atypical Squamous Cells of Undetermined Significance: Bethesda Classification and Association with Human Papillomavirus

Directory of Open Access Journals (Sweden)

Ana Cristina Macêdo Barcelos

2011-01-01

Introduction. To analyze patients with atypical squamous cells of undetermined significance (ASCUS) through a cytology review and the presence of microbiological agents, with consideration of colposcopy and semiannual tracking. Methods. 103 women with ASCUS were reviewed and reclassified as normal/inflammatory, ASCUS, low-grade squamous intraepithelial lesion (LSIL), or high-grade squamous intraepithelial lesion (HSIL). If ASCUS was confirmed, it was subclassified as reactive or neoplastic ASCUS, as ASC-US or ASC-H, and according to the Regione Emilia Romagna Screening Protocol. Patients underwent a colposcopic examination, and tests for Candida sp., bacterial vaginosis, Trichomonas vaginalis, and human papillomavirus (HPV) were performed. Results. Upon review, ASCUS was diagnosed in 70/103 (67.9%) patients: 38 (54.2%) reactive ASCUS and 32 (45.71%) neoplastic ASCUS; 62 (88.5%) ASC-US and 8 (11.41%) ASC-H. ASCUS by Regione Protocol categories 1-5, respectively: 15 (21.4%), 19 (27.1%), 3 (27.1%), 16 (22.8%), and 1 (1.4%). A higher number of cases of cervical intraepithelial neoplasia (CIN) II/III was found in the biopsies of patients with ASC-H compared to ASC-US (P=.0021). High-risk HPV test and presence of CIN II/III are more frequent in ASC-H than ASC-US (P=.031). Conclusions. ASC-H is associated with clinically significant disease. High-risk HPV-positive status in the triage for colposcopy of patients with ASC-US is associated with an increase in CIN.
Significant Locus and Metabolic Genetic Correlations Revealed in Genome-Wide Association Study of Anorexia Nervosa.

Science.gov (United States)

Duncan, Laramie; Yilmaz, Zeynep; Gaspar, Helena; Walters, Raymond; Goldstein, Jackie; Anttila, Verneri; Bulik-Sullivan, Brendan; Ripke, Stephan; Thornton, Laura; Hinney, Anke; Daly, Mark; Sullivan, Patrick F; Zeggini, Eleftheria; Breen, Gerome; Bulik, Cynthia M

2017-09-01

The authors conducted a genome-wide association study of anorexia nervosa and calculated genetic correlations with a series of psychiatric, educational, and metabolic phenotypes. Following uniform quality control and imputation procedures using the 1000 Genomes Project (phase 3) in 12 case-control cohorts comprising 3,495 anorexia nervosa cases and 10,982 controls, the authors performed standard association analysis followed by a meta-analysis across cohorts. Linkage disequilibrium score regression was used to calculate genome-wide common variant heritability (single-nucleotide polymorphism [SNP]-based heritability [h²_SNP]), partitioned heritability, and genetic correlations (r_g) between anorexia nervosa and 159 other phenotypes. Results were obtained for 10,641,224 SNPs and insertion-deletion variants with minor allele frequencies >1% and imputation quality scores >0.6. The h²_SNP of anorexia nervosa was 0.20 (SE=0.02), suggesting that a substantial fraction of the twin-based heritability arises from common genetic variation. The authors identified one genome-wide significant locus on chromosome 12 (rs4622308) in a region harboring a previously reported type 1 diabetes and autoimmune disorder locus. Significant positive genetic correlations were observed between anorexia nervosa and schizophrenia, neuroticism, educational attainment, and high-density lipoprotein cholesterol, and significant negative genetic correlations were observed between anorexia nervosa and body mass index, insulin, glucose, and lipid phenotypes. Anorexia nervosa is a complex heritable phenotype for which this study has uncovered the first genome-wide significant locus. Anorexia nervosa also has large and significant genetic correlations with both psychiatric phenotypes and metabolic traits. The study results encourage a reconceptualization of this frequently lethal disorder as one with both psychiatric and metabolic etiology.
Healthcare-Associated Infections (HAIs) Data and Statistics

Science.gov (United States)

... 2016 2015 HAI Data Report 2015 SIRs Using Historical Baselines 2014 HAI Progress Report FAQs: 2014 HAI ... National 2015 Standardized Infection Ratios (SIRs) Calculated Using Historical Baselines CDC’s annual National and State Healthcare-Associated ...
An overview of recent developments in genomics and associated statistical methods.

Science.gov (United States)

Bickel, Peter J; Brown, James B; Huang, Haiyan; Li, Qunhua

2009-11-13

The landscape of genomics has changed drastically in the last two decades. Increasingly inexpensive sequencing has shifted the primary focus from the acquisition of biological sequences to the study of biological function. Assays have been developed to study many intricacies of biological systems, and publicly available databases have given rise to integrative analyses that combine information from many sources to draw complex conclusions. Such research was the focus of the recent workshop at the Isaac Newton Institute, 'High dimensional statistics in biology'. Many computational methods from modern genomics and related disciplines were presented and discussed. Using, as much as possible, the material from these talks, we give an overview of modern genomics: from the essential assays that make data-generation possible, to the statistical methods that yield meaningful inference. We point to current analytical challenges, where novel methods, or novel applications of extant methods, are presently needed.
Lagged Associations of Metropolitan Statistical Area- and State-Level Income Inequality with Cognitive Function: The Health and Retirement Study.

Science.gov (United States)

Kim, Daniel; Griffin, Beth Ann; Kabeto, Mohammed; Escarce, José; Langa, Kenneth M; Shih, Regina A

2016-01-01

Much variation in individual-level cognitive function in late life remains unexplained, with little exploration of area-level/contextual factors to date. Income inequality is a contextual factor that may plausibly influence cognitive function. In a nationally representative cohort of older Americans from the Health and Retirement Study, we examined state- and metropolitan statistical area (MSA)-level income inequality as predictors of individual-level cognitive function measured by the 27-point Telephone Interview for Cognitive Status (TICS-m) scale. We modeled latency periods of 8-20 years, and controlled for state-/MSA-level and individual-level factors. Higher MSA-level income inequality predicted lower cognitive function 16-18 years later. Using a 16-year lag, living in an MSA in the highest income inequality quartile predicted a 0.9-point lower TICS-m score (β = -0.86; 95% CI = -1.41, -0.31), roughly equivalent to the magnitude associated with five years of aging. We observed no associations for state-level income inequality. The findings were robust to sensitivity analyses using propensity score methods. Among older Americans, MSA-level income inequality appears to influence cognitive function nearly two decades later. Policies reducing income inequality levels within cities may help address the growing burden of declining cognitive function among older populations within the United States.
Probability and logical structure of statistical theories

International Nuclear Information System (INIS)

Hall, M.J.W.

1988-01-01

A characterization of statistical theories is given which incorporates both classical and quantum mechanics. It is shown that each statistical theory induces an associated logic and joint probability structure, and simple conditions are given for the structure to be of a classical or quantum type. This provides an alternative to the quantum logic approach to axiomatic quantum mechanics. The Bell inequalities may be derived for those statistical theories that have a classical structure and satisfy a locality condition weaker than factorizability. The relation of these inequalities to the issue of hidden variable theories for quantum mechanics is discussed and clarified.
Fermions from classical statistics

International Nuclear Information System (INIS)

Wetterich, C.

2010-01-01

We describe fermions in terms of a classical statistical ensemble. The states τ of this ensemble are characterized by a sequence of values one or zero or a corresponding set of two-level observables. Every classical probability distribution can be associated with a quantum state for fermions. If the time evolution of the classical probabilities p_τ amounts to a rotation of the wave function q_τ(t) = ±√(p_τ(t)), we infer the unitary time evolution of a quantum system of fermions according to a Schroedinger equation. We establish how such classical statistical ensembles can be mapped to Grassmann functional integrals. Quantum field theories for fermions arise for a suitable time evolution of classical probabilities for generalized Ising models.
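The relation between classical probabilities and a real wave function can be illustrated with a toy two-state ensemble. This is only a sketch of the general idea that a rotation of q_τ = ±√p_τ keeps the probabilities p_τ = q_τ² normalised; the two-state setup, the sign choice, the rotation angle, and all names are assumptions and not the paper's fermion construction.

```python
import math


def wave_function(probs, signs):
    """Build q_tau = sign * sqrt(p_tau) from classical probabilities."""
    return [s * math.sqrt(p) for p, s in zip(probs, signs)]


def rotate(q, theta):
    """Rotate a two-component real wave function by the angle theta."""
    c, s = math.cos(theta), math.sin(theta)
    return [c * q[0] - s * q[1], s * q[0] + c * q[1]]


p0 = [0.8, 0.2]                      # initial classical probabilities
q0 = wave_function(p0, [+1, -1])     # one possible choice of signs
q1 = rotate(q0, 0.3)                 # "time evolution" as a rotation
p1 = [x * x for x in q1]             # evolved classical probabilities

print("evolved probabilities:", [round(p, 4) for p in p1])
print("sum:", round(sum(p1), 12))
```

Because a rotation preserves the length of q, the evolved values p1 are non-negative and still sum to one, so they remain a valid probability distribution.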
2015 ICSA/Graybill Applied Statistics Symposium

CERN Document Server

Wang, Bushi; Hu, Xiaowen; Chen, Kun; Liu, Ray

2016-01-01

The papers in this volume represent a broad, applied swath of advanced contributions to the 2015 ICSA/Graybill Applied Statistics Symposium of the International Chinese Statistical Association, held at Colorado State University in Fort Collins. The contributions cover topics that range from statistical applications in business and finance to applications in clinical trials and biomarker analysis. Each paper was peer-reviewed by at least two referees and also by an editor. The conference was attended by over 400 participants from academia, industry, and government agencies around the world, including from North America, Asia, and Europe. Focuses on statistical applications from clinical trials, biomarker analysis, and personalized medicine to applications in finance and business analytics. A unique selection of papers from broad and multi-disciplinary critical hot topics - from academic, government, and industry perspectives - to appeal to a wide variety of applied research interests. All papers feature origina...
Statistical thermodynamics

International Nuclear Information System (INIS)

Lim, Gyeong Hui

2008-03-01

This book consists of 15 chapters: basic concepts and meaning of statistical thermodynamics; Maxwell-Boltzmann statistics; ensembles; thermodynamic functions and fluctuations; statistical dynamics of independent-particle systems; ideal molecular systems; chemical equilibrium and chemical reaction rates in ideal gas mixtures; classical statistical thermodynamics; the ideal lattice model; lattice statistics and nonideal lattice models; imperfect gas theory of liquids; theory of solutions; statistical thermodynamics of interfaces; statistical thermodynamics of polymer (macromolecular) systems; and quantum statistics.
Association of Placebo, Indomethacin, Ibuprofen, and Acetaminophen With Closure of Hemodynamically Significant Patent Ductus Arteriosus in Preterm Infants: A Systematic Review and Meta-analysis.

Science.gov (United States)

Mitra, Souvik; Florez, Ivan D; Tamayo, Maria E; Mbuagbaw, Lawrence; Vanniyasingam, Thuva; Veroniki, Areti Angeliki; Zea, Adriana M; Zhang, Yuan; Sadeghirad, Behnam; Thabane, Lehana

2018-03-27

Despite increasing emphasis on conservative management of patent ductus arteriosus (PDA) in preterm infants, different pharmacotherapeutic interventions are used to treat those developing a hemodynamically significant PDA. To estimate the relative likelihood of hemodynamically significant PDA closure with common pharmacotherapeutic interventions and to compare adverse event rates. The databases of MEDLINE, Embase, and the Cochrane Central Register of Controlled Trials were searched from inception until August 15, 2015, and updated on December 31, 2017, along with conference proceedings up to December 2017. Randomized clinical trials that enrolled preterm infants with a gestational age younger than 37 weeks treated with intravenous or oral indomethacin, ibuprofen, or acetaminophen vs each other, placebo, or no treatment for a clinically or echocardiographically diagnosed hemodynamically significant PDA were included. Data were independently extracted in pairs by 6 reviewers and synthesized with Bayesian random-effects network meta-analyses. The primary outcome was hemodynamically significant PDA closure; secondary outcomes included surgical closure, mortality, necrotizing enterocolitis, and intraventricular hemorrhage. In 68 randomized clinical trials of 4802 infants, 14 different variations of indomethacin, ibuprofen, or acetaminophen were used as treatment modalities. The overall PDA closure rate was 67.4% (2867 of 4256 infants). A high dose of oral ibuprofen was associated with significantly higher odds of PDA closure vs a standard dose of intravenous ibuprofen (odds ratio [OR], 3.59; 95% credible interval [CrI], 1.64-8.17; absolute risk difference, 199 [95% CrI, 95-258] more per 1000 infants) and a standard dose of intravenous indomethacin (OR, 2.35 [95% CrI, 1.08-5.31]; absolute risk difference, 124 [95% CrI, 14-188] more per 1000 infants). Based on the ranking statistics, a high dose of oral ibuprofen ranked as the best pharmacotherapeutic option for PDA closure (mean surface under the
Genome-wide association study of CSF levels of 59 alzheimer's disease candidate proteins: significant associations with proteins involved in amyloid processing and inflammation.

Science.gov (United States)

Kauwe, John S K; Bailey, Matthew H; Ridge, Perry G; Perry, Rachel; Wadsworth, Mark E; Hoyt, Kaitlyn L; Staley, Lyndsay A; Karch, Celeste M; Harari, Oscar; Cruchaga, Carlos; Ainscough, Benjamin J; Bales, Kelly; Pickering, Eve H; Bertelsen, Sarah; Fagan, Anne M; Holtzman, David M; Morris, John C; Goate, Alison M

2014-10-01

Cerebrospinal fluid (CSF) 42 amino acid species of amyloid beta (Aβ42) and tau levels are strongly correlated with the presence of Alzheimer's disease (AD) neuropathology including amyloid plaques and neurodegeneration and have been successfully used as endophenotypes for genetic studies of AD. Additional CSF analytes may also serve as useful endophenotypes that capture other aspects of AD pathophysiology. Here we have conducted a genome-wide association study of CSF levels of 59 AD-related analytes. All analytes were measured using the Rules Based Medicine Human DiscoveryMAP Panel, which includes analytes relevant to several disease-related processes. Data from two independently collected and measured datasets, the Knight Alzheimer's Disease Research Center (ADRC) and Alzheimer's Disease Neuroimaging Initiative (ADNI), were analyzed separately, and combined results were obtained using meta-analysis. We identified genetic associations with CSF levels of 5 proteins (Angiotensin-converting enzyme (ACE), Chemokine (C-C motif) ligand 2 (CCL2), Chemokine (C-C motif) ligand 4 (CCL4), Interleukin 6 receptor (IL6R) and Matrix metalloproteinase-3 (MMP3)) with study-wide significant p-values (p […] processing and pro-inflammatory signaling. SNPs associated with ACE, IL6R and MMP3 protein levels are located within the coding regions of the corresponding structural gene. The SNPs associated with CSF levels of CCL4 and CCL2 are located in known chemokine binding proteins. The genetic associations reported here are novel and suggest mechanisms for genetic control of CSF and plasma levels of these disease-related proteins. Significant SNPs in ACE and MMP3 also showed association with AD risk. Our findings suggest that these proteins/pathways may be valuable therapeutic targets for AD. Robust associations in cognitively normal individuals suggest that these SNPs also influence regulation of these proteins more generally and may therefore be relevant to other diseases.
Relational coordination is associated with productivity in general practice: a survey and register based study

DEFF Research Database (Denmark)

Lundstrøm, Sanne Lykke; Edwards, Kasper; Reventlow, Susanne

2014-01-01

In this paper we investigate the association between relational coordination among the practice team in general practice and the number of consultations performed in a general practice per staff member, i.e. a proxy for productivity. We measured relational coordination using the Relational Coordination Survey and combined the results with register data. We found that relational coordination was statistically significantly associated with the number of consultations per staff member per year. We later divided consultations into three types: face-to-face, email and phone consultations. We found a statistically significant association between relational coordination and the number of face-to-face consultations per staff member per year.
The Neutrophil/Lymphocyte Ratio at Diagnosis Is Significantly Associated with Survival in Metastatic Pancreatic Cancer Patients

Directory of Open Access Journals (Sweden)

Matteo Piciucchi

2017-03-01

Different inflammation-based scores such as the neutrophil/lymphocyte ratio (NLR), the Onodera Prognostic Nutritional Index (PNI), the Glasgow Prognostic Score, the platelet/lymphocyte ratio, and the C-reactive protein/albumin ratio have been found to be significantly associated with pancreatic cancer (PDAC) prognosis. However, most studies have investigated patients undergoing surgery, and few of them have compared these scores. We aimed at evaluating the association between inflammatory-based scores and PDAC prognosis. In a single-center cohort study, inflammatory-based scores were assessed at diagnosis and their prognostic relevance, as well as that of clinicopathological variables, was evaluated through multiple logistic regression and survival probability analysis. In 206 patients, age, male sex, tumor size, presence of distant metastasis, access to chemotherapy, and an NLR > 5, but not other scores, were associated with overall survival (OS) at multivariate analysis. Patients with an NLR < 5 had a median survival of 12 months compared to 4 months in those with an NLR > 5. In the 81 patients with distant metastasis at diagnosis, an NLR > 5 was the only variable significantly associated with survival. Among patients with metastatic disease who received chemotherapy, the median survival was 3 months in patients with an NLR > 5 and 7 months in those with an NLR < 5. The NLR might drive therapeutic options in PDAC patients, especially in the setting of metastatic disease.

Spatio-temporal statistical models with applications to atmospheric processes

International Nuclear Information System (INIS)

Wikle, C.K.

1996-01-01

This doctoral dissertation is presented as three self-contained papers. An introductory chapter considers traditional spatio-temporal statistical methods used in the atmospheric sciences from a statistical perspective. Although this section is primarily a review, many of the statistical issues discussed have not previously been considered in the context of these methods, and several open questions are posed. The first paper attempts to determine a means of characterizing the semiannual oscillation (SAO) spatial variation in the northern hemisphere extratropical height field. It was discovered that the midlatitude SAO in 500hPa geopotential height could be explained almost entirely as a result of spatial and temporal asymmetries in the annual variation of stationary eddies. It was concluded that the mechanism for the SAO in the northern hemisphere is a result of land-sea contrasts. The second paper examines the seasonal variability of mixed Rossby-gravity waves (MRGW) in the lower stratosphere over the equatorial Pacific. Advanced cyclostationary time series techniques were used for analysis. It was found that there are significant twice-yearly peaks in MRGW activity. Analyses also suggested a convergence of horizontal momentum flux associated with these waves. In the third paper, a new spatio-temporal statistical model is proposed that attempts to consider the influence of both temporal and spatial variability. This method is mainly concerned with prediction in space and time, and provides a spatially descriptive and temporally dynamic model.
Significant association of periodontal disease with anti-citrullinated peptide antibody in a Japanese healthy population - The Nagahama study.

Science.gov (United States)

Terao, Chikashi; Asai, Keita; Hashimoto, Motomu; Yamazaki, Toru; Ohmura, Koichiro; Yamaguchi, Akihiko; Takahashi, Katsu; Takei, Noriko; Ishii, Takanori; Kawaguchi, Takahisa; Tabara, Yasuharu; Takahashi, Meiko; Nakayama, Takeo; Kosugi, Shinji; Sekine, Akihiro; Fujii, Takao; Yamada, Ryo; Mimori, Tsuneyo; Matsuda, Fumihiko; Bessho, Kazuhisa

2015-05-01

Anti-citrullinated peptide antibody (ACPA) is a highly specific autoantibody to rheumatoid arthritis (RA). Recent studies have revealed that periodontal disease (PD) is closely associated with RA and production of ACPA in RA. Analyses of associations between PD and ACPA production in a healthy population may deepen our understanding. Here, we analyzed a total of 9554 adult healthy subjects. ACPA and IgM-rheumatoid factor (RF) were quantified, and PD status was evaluated using the number of missing teeth (MT), the Community Periodontal Index (CPI) and Loss of Attachment (LA) for these subjects. PD status was analyzed for its association with the positivity and categorical levels of ACPA and RF, conditioned on covariates which were shown to be associated with PD, ACPA or RF. As a result, all of MT, CPI and LA showed suggestive or significant associations with positivity (p = 0.024, 0.0042 and 0.037, respectively) and levels of ACPA (p ≤ 0.00031), but none of the PD parameters were associated with those of RF. These association patterns were also observed when we analyzed the 6206 non-smokers among the participants. The significant associations between PD parameters and positivity and levels of ACPA in a healthy population support the fundamental involvement of PD in ACPA production. Copyright © 2015 Elsevier Ltd. All rights reserved.
Visceral adiposity index is associated with significant fibrosis in patients with non-alcoholic fatty liver disease.

Science.gov (United States)

Petta, S; Amato, M C; Di Marco, V; Cammà, C; Pizzolanti, G; Barcellona, M R; Cabibi, D; Galluzzo, A; Sinagra, D; Giordano, C; Craxì, A

2012-01-01

Metabolic factors have been associated with liver damage in patients with non-alcoholic fatty liver disease (NAFLD). To test a new marker of adipose dysfunction, the visceral adiposity index (VAI), in NAFLD patients to assess whether or not it is associated with host factors, and to investigate a potential correlation with histological findings. One hundred and forty-two consecutive NAFLD patients were evaluated by liver biopsy, and clinical and metabolic measurements, including insulin resistance with the homeostasis model assessment (HOMA), and VAI by using waist circumference, body mass index, triglycerides and HDL. Serum levels of TNFα, IL-6, adiponectin and leptin were also assessed. All biopsies were scored for NAFLD activity score (NAS) and its components, and for staging (Kleiner). By multiple linear regression analysis, VAI was independently associated with higher HOMA (P = 0.04), and fibrosis (P = 0.04). In addition, an independent association was found between higher VAI and lower adiponectin levels (P = 0.002). Higher HOMA (OR 1.149, 95% CI 1.003-1.316, P = 0.04), higher VAI (OR 1.446, 95% CI 1.023-2.043, P = 0.03), lobular inflammation (OR 3.777, 95% CI 1.771-8.051, P = 0.001), and ballooning (OR 2.884, 95% CI 1.231-6.757, P = 0.01) were correlated with significant fibrosis (F2-F4) on multiple logistic regression analysis. In particular, the prevalence of significant fibrosis progressively increased from patients with a VAI ≤ 2.1 and HOMA ≤ 3.4 (26%) to those with a VAI > 2.1 and HOMA > 3.4 (83%). In NAFLD patients, visceral adiposity index is an expression of both qualitative and quantitative adipose tissue dysfunction and, together with insulin resistance, is independently correlated with significant fibrosis. © 2011 Blackwell Publishing Ltd.
[Statistics for statistics?--Thoughts about psychological tools].

Science.gov (United States)

Berger, Uwe; Stöbel-Richter, Yve

2007-12-01

Statistical methods take a prominent place in psychologists' educational programs. Known as difficult to understand and hard to learn, this content is feared by students. Those who do not aspire to a research career at the university will quickly forget what they were drilled in. Furthermore, because statistics does not, at first glance, apply to work with patients and other target groups, methodological education as a whole has often been questioned. For many psychological practitioners, statistical education only makes sense as a way of commanding respect from other professions, namely physicians. In their own practice, statistics is rarely taken seriously as a professional tool. The reason seems clear: statistics deals with numbers, while psychotherapy deals with subjects. So, is statistics an end in itself? With this article, we try to answer the question of whether and how statistical methods are represented within psychotherapeutic and psychological research. To this end, we analyzed 46 original articles from a complete volume of the journal Psychotherapy, Psychosomatics, Psychological Medicine (PPmP). Within the volume, 28 different analysis methods were applied, of which 89 per cent were directly based on statistics. Being able to write and critically read original articles, the backbone of research, presupposes a high degree of statistical education. To ignore statistics means to ignore research and, in the end, to leave one's own professional work to arbitrariness.
2014 ICSA/KISS Joint Applied Statistics Symposium

CERN Document Server

Liu, Mengling; Luo, Xiaolong

2016-01-01

The papers in this volume represent the most timely and advanced contributions to the 2014 Joint Applied Statistics Symposium of the International Chinese Statistical Association (ICSA) and the Korean International Statistical Society (KISS), held in Portland, Oregon. The contributions cover new developments in statistical modeling and clinical research: including model development, model checking, and innovative clinical trial design and analysis. Each paper was peer-reviewed by at least two referees and also by an editor. The conference was attended by over 400 participants from academia, industry, and government agencies around the world, including from North America, Asia, and Europe. It offered 3 keynote speeches, 7 short courses, 76 parallel scientific sessions, student paper sessions, and social events. The most timely and advanced contributions from the joint 2014 ICSA/KISS Applied Statistics Symposium All papers feature original, peer-reviewed content Coverage consists of new developments in statisti...
The earth is flat (p > 0.05): significance thresholds and the crisis of unreplicable research.

Science.gov (United States)

Amrhein, Valentin; Korner-Nievergelt, Fränzi; Roth, Tobias

2017-01-01

The widespread use of 'statistical significance' as a license for making a claim of a scientific finding leads to considerable distortion of the scientific process (according to the American Statistical Association). We review why degrading p-values into 'significant' and 'nonsignificant' contributes to making studies irreproducible, or to making them seem irreproducible. A major problem is that we tend to take small p-values at face value, but mistrust results with larger p-values. In either case, p-values tell little about reliability of research, because they are hardly replicable even if an alternative hypothesis is true. Also significance (p ≤ 0.05) is hardly replicable: at a good statistical power of 80%, two studies will be 'conflicting', meaning that one is significant and the other is not, in one third of the cases if there is a true effect. A replication can therefore not be interpreted as having failed only because it is nonsignificant. Many apparent replication failures may thus reflect faulty judgment based on significance thresholds rather than a crisis of unreplicable research. Reliable conclusions on replicability and practical importance of a finding can only be drawn using cumulative evidence from multiple independent studies. However, applying significance thresholds makes cumulative knowledge unreliable. One reason is that with anything but ideal statistical power, significant effect sizes will be biased upwards. Interpreting inflated significant results while ignoring nonsignificant results will thus lead to wrong conclusions. But current incentives to hunt for significance lead to selective reporting and to publication bias against nonsignificant findings. Data dredging, p-hacking, and publication bias should be addressed by removing fixed significance thresholds. Consistent with the recommendations of the late Ronald Fisher, p-values should be interpreted as graded measures of the strength of evidence against the null hypothesis
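The "one third of the cases" figure above can be checked with a short calculation. The sketch below assumes two independent studies of the same true effect, each with the same power; the function name is illustrative and not taken from the article. Exactly one of the two studies is significant with probability 2 × power × (1 − power), which at 80% power is 0.32, or roughly one third.

```python
def prob_conflicting(power):
    """Probability that exactly one of two independent studies is significant.

    Assumes a true effect exists and both studies have the same power.
    """
    # Study A significant and B not, or B significant and A not.
    return 2 * power * (1 - power)


for power in (0.5, 0.8, 0.95):
    print(f"power = {power:.2f}: P(conflicting) = {prob_conflicting(power):.3f}")
```

The probability peaks at 0.5 when power is 50% and falls toward zero only as power approaches 100%.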
Teaching statistics to nursing students: an expert panel consensus.

Science.gov (United States)

Hayat, Matthew J; Eckardt, Patricia; Higgins, Melinda; Kim, MyoungJin; Schmiege, Sarah J

2013-06-01

Statistics education is a necessary element of nursing education, and its inclusion is recommended in the American Association of Colleges of Nursing guidelines for nurse training at all levels. This article presents a cohesive summary of an expert panel discussion, "Teaching Statistics to Nursing Students," held at the 2012 Joint Statistical Meetings. All panelists were statistics experts, had extensive teaching and consulting experience, and held faculty appointments in a U.S.-based nursing college or school. The panel discussed degree-specific curriculum requirements, course content, how to ensure nursing students understand the relevance of statistics, approaches to integrating statistics consulting knowledge, experience with classroom instruction, use of knowledge from the statistics education research field to make improvements in statistics education for nursing students, and classroom pedagogy and instruction on the use of statistical software. Panelists also discussed the need for evidence to make data-informed decisions about statistics education and training for nurses. Copyright 2013, SLACK Incorporated.
The association between antihormonal treatment and cognitive complaints in breast cancer survivors with sleep problems

DEFF Research Database (Denmark)

Amidi, Ali; Damholdt, Malene; Dahlgaard, Jesper Ovesen

2016-01-01

. Statistically significant associations were observed between the CFQ and all measures of psychological distress (depression, fatigue, PTS, and perceived stress (r = 0.33–0.58, p's > 0.001)). Severity of sleep problems was also associated with the CFQ (r = 0.16, p = 0.01) There was no significant effect......, CFQtotal = 29.9(SD = 14.6); CFQ‐distractibility = 8.9(SD = 5.2) (p's = 0.06; 0.03). When adjusting for severity of sleep problems, symptoms of depression, PTS, fatigue, and perceived stress, these differences remained statistically significant (CFQ‐total: p = 0.047; CFQ‐distractibility: p = 0......Background: Cognitive complaints following chemotherapy are common and often associated with psychological distress. There is also a growing concern about cognitive problems among BC survivors receiving adjuvant antihormonal therapy. We, therefore, investigated the association between antihormonal...
On a curvature-statistics theorem

International Nuclear Information System (INIS)

Calixto, M; Aldaya, V

2008-01-01

The spin-statistics theorem in quantum field theory relates the spin of a particle to the statistics obeyed by that particle. Here we investigate an interesting correspondence or connection between curvature (κ = ±1) and quantum statistics (Fermi-Dirac and Bose-Einstein, respectively). The interrelation between both concepts is established through vacuum coherent configurations of zero modes in quantum field theory on the compact O(3) and noncompact O(2; 1) (spatial) isometry subgroups of de Sitter and Anti de Sitter spaces, respectively. The high frequency limit is retrieved as a (zero curvature) group contraction to the Newton-Hooke (harmonic oscillator) group. We also make some comments on the physical significance of the vacuum energy density and the cosmological constant problem.
On a curvature-statistics theorem

Energy Technology Data Exchange (ETDEWEB)

Calixto, M [Departamento de Matematica Aplicada y Estadistica, Universidad Politecnica de Cartagena, Paseo Alfonso XIII 56, 30203 Cartagena (Spain); Aldaya, V [Instituto de Astrofisica de Andalucia, Apartado Postal 3004, 18080 Granada (Spain)], E-mail: Manuel.Calixto@upct.es

2008-08-15

The spin-statistics theorem in quantum field theory relates the spin of a particle to the statistics obeyed by that particle. Here we investigate an interesting correspondence or connection between curvature (κ = ±1) and quantum statistics (Fermi-Dirac and Bose-Einstein, respectively). The interrelation between both concepts is established through vacuum coherent configurations of zero modes in quantum field theory on the compact O(3) and noncompact O(2; 1) (spatial) isometry subgroups of de Sitter and Anti de Sitter spaces, respectively. The high frequency limit is retrieved as a (zero curvature) group contraction to the Newton-Hooke (harmonic oscillator) group. We also make some comments on the physical significance of the vacuum energy density and the cosmological constant problem.
Testing statistical hypotheses of equivalence

CERN Document Server

Wellek, Stefan

2010-01-01

Equivalence testing has grown significantly in importance over the last two decades, especially as its relevance to a variety of applications has become understood. Yet published work on the general methodology remains scattered in specialists' journals, and for the most part, it focuses on the relatively narrow topic of bioequivalence assessment. With a far broader perspective, Testing Statistical Hypotheses of Equivalence provides the first comprehensive treatment of statistical equivalence testing. The author addresses a spectrum of specific, two-sided equivalence testing problems, from the
Carotid Artery Stenosis at MSCT: Is there a Threshold in Millimeters that Determines Clinical Significance?

International Nuclear Information System (INIS)

Saba, Luca; Sanfilippo, Roberto; Montisci, Roberto; Mallarini, Giorgio

2012-01-01

Purpose: The purpose of this work was to determine whether it is possible to identify a reliable carotid stenosis threshold, measured in millimeters (mm), that is associated with cerebrovascular symptoms. Methods: Written, informed consent was obtained for each patient; 149 consecutive patients (98 men; median age, 68 years) were studied for suspected pathology of the carotid arteries by using MDCTA. In each patient, carotid artery stenosis was quantified using the mm-method. Continuous data were described as the mean value ± standard deviation (SD), and they were compared by using the Student's t test. A ROC curve was calculated to test the study hypothesis and identify a specific mm-stenosis threshold. Logistic regression analysis was performed to include other MDCTA findings, such as plaque type and ulcerations. A P value < 0.05 was considered to indicate statistical significance. Results: Twenty-six patients were excluded. Of those remaining, 75 patients suffered cerebrovascular symptoms (61%). There was a statistically significant difference (P = 0.0046) in the mm-carotid stenosis between patients with symptoms (1.31 ± 0.64 mm SD) and without symptoms (1.68 ± 0.79 mm SD). Multiple logistic regression analysis confirmed that symptoms were associated with increased luminal stenosis (P = 0.013) and with the presence of fatty plaques (P = 0.0491). Moreover, the ROC curve (Az = 0.669; ±0.051 SD; P = 0.0009) indicated that a threshold of 1.6 mm stenosis was associated with a sensitivity to symptoms of 76%. Conclusions: The results of our study suggest an association between luminal stenosis (measured in mm) and the presence of cerebrovascular symptoms. Luminal stenosis of 1.6 mm is associated, with a sensitivity of 76%, with cerebrovascular symptoms.
9th Symposium on Computational Statistics

CERN Document Server

Mildner, Vesna

1990-01-01

Although no one is, probably, too enthused about the idea, it is a fact that the development of most empirical sciences depends to a great extent on the development of data analysis methods and techniques, which, because computers are necessary for that purpose, means that it depends in practice on the advancement and orientation of computational statistics. Every other year the International Association for Statistical Computing sponsors the organization of meetings of individuals professionally involved in computational statistics. Since these meetings attract professionals from all over the world, they are a good sample for the estimation of trends in this area, which some believe is statistics proper while others claim it is computer science. It seems, though, that an increasing number of colleagues treat it as an independent scientific or at least technical discipline. This volume contains six invited papers, 41 contributed papers and, finally, two papers which are, formally, softwa...
Calculating statistical distributions from operator relations: The statistical distributions of various intermediate statistics

International Nuclear Information System (INIS)

Dai, Wu-Sheng; Xie, Mi

2013-01-01

In this paper, we give a general discussion on the calculation of the statistical distribution from a given operator relation of creation, annihilation, and number operators. Our result shows that as long as the relation between the number operator and the creation and annihilation operators can be expressed as a†b = Λ(N) or N = Λ⁻¹(a†b), where N, a†, and b denote the number, creation, and annihilation operators, i.e., N is a function of quadratic product of the creation and annihilation operators, the corresponding statistical distribution is the Gentile distribution, a statistical distribution in which the maximum occupation number is an arbitrary integer. As examples, we discuss the statistical distributions corresponding to various operator relations. In particular, besides the Bose–Einstein and Fermi–Dirac cases, we discuss the statistical distributions for various schemes of intermediate statistics, especially various q-deformation schemes. Our result shows that the statistical distributions corresponding to various q-deformation schemes are various Gentile distributions with different maximum occupation numbers which are determined by the deformation parameter q. This result shows that the results given in much literature on the q-deformation distribution are inaccurate or incomplete. -- Highlights: ► A general discussion on calculating statistical distribution from relations of creation, annihilation, and number operators. ► A systemic study on the statistical distributions corresponding to various q-deformation schemes. ► Arguing that many results of q-deformation distributions in literature are inaccurate or incomplete
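To make the role of the maximum occupation number concrete, the following sketch evaluates the textbook form of the Gentile mean occupation number, which the abstract itself does not spell out. It is an assumption that this standard form is the one the paper uses. Here x stands for (ε − μ)/(k_B T) with x > 0, and all function names are illustrative. A maximum occupation of 1 reproduces Fermi-Dirac, and a large maximum approaches Bose-Einstein.

```python
import math


def gentile_occupation(x, n_max):
    """Mean occupation for Gentile statistics with maximum occupation n_max.

    x = (energy - chemical potential) / (k_B * T), assumed positive.
    Standard form: 1/(e^x - 1) - (n_max + 1)/(e^((n_max + 1) x) - 1).
    """
    return 1.0 / math.expm1(x) - (n_max + 1) / math.expm1((n_max + 1) * x)


def fermi_dirac(x):
    """Fermi-Dirac mean occupation number."""
    return 1.0 / (math.exp(x) + 1.0)


def bose_einstein(x):
    """Bose-Einstein mean occupation number (x > 0)."""
    return 1.0 / math.expm1(x)


x = 1.0
print("n_max = 1  :", gentile_occupation(x, 1), "vs Fermi-Dirac", fermi_dirac(x))
print("n_max = 200:", gentile_occupation(x, 200), "vs Bose-Einstein", bose_einstein(x))
```

Intermediate values of n_max interpolate between the two familiar limits.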
[Characteristic and clinical significance of DNA methyltransferase 3B overexpression in endometrial carcinoma].

Science.gov (United States)

Dong, Y; Zhou, M; Ba, X J; Si, J W; Li, W T; Wang, Y; Li, D; Li, T

2016-10-18

To determine the clinicopathological significance of the DNA methyltransferase 3B (DNMT3B) overexpression in endometrial carcinomas and to evaluate its correlation with hormone receptor status. Immunohistochemistry was performed to assess the expression of DNMT3B and hormone receptors in 104 endometrial carcinomas. DNMT3B overexpression occurred more frequently in endometrioid carcinoma (EC, 54.8%) than in nonendometrioid carcinoma (NEC, 30.0%), with statistical significance (P=0.028). Furthermore, there was a trend that EC with worse clinicopathological variables and shorter survival had a higher DNMT3B expression, and the correlation between DNMT3B and tumor grade reached statistical significance (P=0.019). A negative correlation between DNMT3B and estrogen receptor (ER) or progesterone receptor (PR) expression was found in EC. DNMT3B overexpression occurred more frequently in the ER or PR negative subgroups (78.9%, 86.7%) than in the positive subgroups (47.7%, 47.8%), with statistical significance (P=0.016, P=0.006). In addition, the DNMT3B overexpression increased in tumors with both ER and PR negative expression (92.9%, P=0.002). However, no such correlation was found in NEC (P>0.05). Sequence analyses demonstrated multiple ER and PR binding sites in the promoter regions of the DNMT3B gene. This study showed that the expression of DNMT3B in EC and NEC was different. DNMT3B overexpression in EC was associated with worse clinicopathological variables and might have predictive value. The methylation status of EC and NEC may be different. In addition, in EC, DNMT3B overexpression negatively correlated with ER or PR expression. In NEC, the correlation between DNMT3B and ER or PR status was not present.
Associations Between PET Textural Features and GLUT1 Expression, and the Prognostic Significance of Textural Features in Lung Adenocarcinoma.

Science.gov (United States)

Koh, Young Wha; Park, Seong Yong; Hyun, Seung Hyup; Lee, Su Jin

2018-02-01

We evaluated the association between positron emission tomography (PET) textural features and glucose transporter 1 (GLUT1) expression level and further investigated the prognostic significance of textural features in lung adenocarcinoma. We evaluated 105 adenocarcinoma patients. We extracted texture-based PET parameters of primary tumors. Conventional PET parameters were also measured. The relationships between PET parameters and GLUT1 expression levels were evaluated. The association between PET parameters and overall survival (OS) was assessed using Cox's proportional hazard regression models. In terms of PET textural features, tumors expressing high levels of GLUT1 exhibited significantly lower coarseness, contrast, complexity, and strength, but significantly higher busyness. On univariate analysis, the metabolic tumor volume, total lesion glycolysis, contrast, busyness, complexity, and strength were significant predictors of OS. Multivariate analysis showed that lower complexity (HR=2.017, 95%CI=1.032-3.942, p=0.040) was independently associated with poorer survival. PET textural features may aid risk stratification in lung adenocarcinoma patients. Copyright© 2018, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.
Location of skin lesions in Henoch-Schönlein purpura and its association with significant renal involvement.

Science.gov (United States)

St John, Jessica; Vedak, Priyanka; Garza-Mayers, Anna Cristina; Hoang, Mai P; Nigwekar, Sagar U; Kroshinsky, Daniela

2018-01-01

Henoch-Schönlein purpura (HSP) is a small vessel IgA-predominant vasculitis. To describe adult patients with HSP and determine if the distribution of skin lesions (ie, purpura above the waist or purpura below the waist only), is a predictor of significant renal involvement at the time of the skin biopsy and the months following. A retrospective study on renal function from 72 adult patients with skin-biopsy proven HSP. Longitudinal renal data were analyzed after HSP diagnosis by using baseline renal function for comparison. Statistical analysis adjusted for sex, age, and baseline creatinine revealed a trend between HSP lesions only on the upper and lower extremities and long-term renal involvement (4.767, P = .067). Moreover, in another analysis adjusted for age and baseline creatinine, lesions located only on the upper and lower extremities significantly increased the odds of having long-term significant renal involvement (6.55, P = .049) in men. This retrospective study used patient information that was subject to selection bias. In patients with HSP, skin lesion distribution on the extremities might be predictive of significant long-term renal involvement and might be critical for risk stratification and development of personalized diagnostics and therapeutics. Copyright © 2017 American Academy of Dermatology, Inc. Published by Elsevier Inc. All rights reserved.
Statistical and theoretical research

International Nuclear Information System (INIS)

Anon.

1983-01-01

Significant accomplishments include the creation of field designs to detect population impacts, new census procedures for small mammals, and methods for designing studies to determine where and how much of a contaminant is extant over certain landscapes. A book describing these statistical methods is currently being written and will apply to a variety of environmental contaminants, including radionuclides. PNL scientists also have devised an analytical method for predicting the success of field experiments on wild populations. Two highlights of current research are the discoveries that populations of free-roaming horse herds can double in four years and that grizzly bear populations may be substantially smaller than once thought. As stray horses become a public nuisance at DOE and other large Federal sites, it is important to determine their number. Similar statistical theory can be readily applied to other situations where wild animals are a problem of concern to other government agencies. Another book, on statistical aspects of radionuclide studies, is written specifically for researchers in radioecology
Significance analysis of lexical bias in microarray data

Directory of Open Access Journals (Sweden)

Falkow Stanley

2003-04-01

Abstract. Background: Genes that are determined to be significantly differentially regulated in microarray analyses often appear to have functional commonalities, such as being components of the same biochemical pathway. This results in certain words being under- or overrepresented in the list of genes. Distinguishing between biologically meaningful trends and artifacts of annotation and analysis procedures is of the utmost importance, as only true biological trends are of interest for further experimentation. A number of sophisticated methods for identification of significant lexical trends are currently available, but these methods are generally too cumbersome for practical use by most microarray users. Results: We have developed a tool, LACK, for calculating the statistical significance of apparent lexical bias in microarray datasets. The frequency of a user-specified list of search terms in a list of genes which are differentially regulated is assessed for statistical significance by comparison to randomly generated datasets. The simplicity of the input files and user interface targets the average microarray user who wishes to have a statistical measure of apparent lexical trends in analyzed datasets without the need for bioinformatics skills. The software is available as Perl source or a Windows executable. Conclusion: We have used LACK in our laboratory to generate biological hypotheses based on our microarray data. We demonstrate the program's utility using an example in which we confirm significant upregulation of SPI-2 pathogenicity island of Salmonella enterica serovar Typhimurium by the cation chelator dipyridyl.
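The comparison against randomly generated datasets described above can be illustrated with a small Monte Carlo sketch. This is not LACK's actual code or interface (the tool is distributed as Perl). The function name, the toy annotations, and the gene identifiers below are all hypothetical, and the sketch assumes random gene lists are drawn without replacement from the full annotated set.

```python
import random


def lexical_bias_p_value(annotations, selected_genes, term, n_random=2000, seed=0):
    """Monte Carlo p-value for over-representation of `term` among selected genes.

    annotations: dict mapping gene id -> free-text annotation (hypothetical data).
    Returns (observed count, estimated one-sided p-value).
    """
    rng = random.Random(seed)
    all_genes = list(annotations)
    needle = term.lower()

    def count(genes):
        # Number of genes whose annotation mentions the search term.
        return sum(needle in annotations[g].lower() for g in genes)

    observed = count(selected_genes)
    k = len(selected_genes)
    # How often does a random gene list of the same size do at least as well?
    hits = sum(count(rng.sample(all_genes, k)) >= observed for _ in range(n_random))
    # Add-one correction keeps the estimate away from an impossible p = 0.
    return observed, (hits + 1) / (n_random + 1)


# Toy data: 100 genes, 10 of which carry a "secretion" annotation.
toy_annotations = {
    f"gene{i}": ("type III secretion system" if i < 10 else "hypothetical protein")
    for i in range(100)
}
toy_selected = [f"gene{i}" for i in range(6)] + ["gene50", "gene60"]
observed, p_value = lexical_bias_p_value(toy_annotations, toy_selected, "secretion")
print(f"observed = {observed}, estimated p = {p_value:.4f}")
```

With a fixed seed the estimate is reproducible. The smallest p-value the procedure can report is 1/(n_random + 1), so more random datasets are needed to resolve very small p-values.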
Retinal vascular calibres are significantly associated with cardiovascular risk factors

DEFF Research Database (Denmark)

von Hanno, T.; Bertelsen, G.; Sjølie, Anne K.

2014-01-01

. Association between retinal vessel calibre and the cardiovascular risk factors was assessed by multivariable linear and logistic regression analyses. Results: Retinal arteriolar calibre was independently associated with age, blood pressure, HbA1c and smoking in women and men, and with HDL cholesterol in men......Purpose: To describe the association between retinal vascular calibres and cardiovascular risk factors. Methods: Population-based cross-sectional study including 6353 participants of the Tromsø Eye Study in Norway aged 38-87 years. Retinal arteriolar calibre (central retinal artery equivalent...... cardiovascular risk factors were independently associated with retinal vascular calibre, with stronger effect of HDL cholesterol and BMI in men than in women. Blood pressure and smoking contributed most to the explained variance....

Danish electricity supply. Statistics 2003

International Nuclear Information System (INIS)

2004-01-01

The Association of Danish Electric Utilities each year issues the statistical yearbook 'Danish electricity supply'. By means of brief text, figures, and tables a description is given of the electric supply sector. The report presents data for the year 2003 for consumption, prices of electric power, power generation and transmission, and trade. (ln)
Danish electricity supply. Statistics 2000

International Nuclear Information System (INIS)

2001-07-01

The Association of Danish Electric Utilities each year issues the statistical yearbook 'Danish electricity supply'. By means of brief text, figures, and tables a description is given of the electric supply sector. The report presents data for the year 2000 for consumption, prices of electric power; power generation and transmission, and trade. (ln)
Danish electricity supply. Statistics 2002

International Nuclear Information System (INIS)

2003-01-01

The Association of Danish Electric Utilities each year issues the statistical yearbook 'Danish electricity supply'. By means of brief text, figures, and tables a description is given of the electric supply sector. The report presents data for the year 2002 for consumption, prices of electric power; power generation and transmission, and trade. (ln)
ARL Supplementary Statistics, 2006-2007

Science.gov (United States)

Bland, Les, Comp.; Kyrillidou, Martha, Comp.

2009-01-01

This report presents statistics on how Association of Research Libraries (ARL) member libraries spend money on electronic resources. This report indicates that 108 ARL libraries purchased 25,006,758 electronic books. In 2006-2007, there was an ARL median of 243,725 acquisitions of electronic books (this includes one institution that purchased…
ARL Supplementary Statistics, 2007-2008

Science.gov (United States)

Bland, Les, Comp.; Kyrillidou, Martha, Comp.

2009-01-01

This report presents statistics on how Association of Research Libraries (ARL) member libraries spend money on electronic resources. This report indicates that 109 ARL libraries purchased 32,329,187 electronic books. In 2007-2008, there was a median of 28,319 acquisitions of electronic books by ARL libraries (this includes one institution that…
Species association in tropical montane rain forest at two successional stages in Diaoluo Mountain, Hainan

Institute of Scientific and Technical Information of China (English)

Fude LIU; Wenjin WANG; Ming ZHANG; Jianwei ZHENG; Zhongsheng WANG; Shiting ZHANG; Wenjie YANG; Shuqing AN

2008-01-01

Species association is one of the basic concepts in community succession. There are different viewpoints on how species interaction changes with the progress of succession. In order to assess these relationships, we examined species associations in the tropical montane rain forest at early and late successional stages in Diaoluo Mountain, Hainan Island. Based on data from a 2 × 2 contingency table of species presence or absence, statistical methods including analysis of species association and χ2 tests were applied. The results show that: 1) an overall positive association was present among tree species in the communities during the two successional stages and was statistically significant at the late stage. The number of species pairs with positive and negative associations decreased throughout the process of succession, while the number with null associations was greatly increased. The same trend existed among the dominant and companion species. The results indicate that the communities are developing towards a stable stage where the woody species coexist in harmony. 2) In the early-established and later invading species, all positive associations were not significant. Compared with positive and null associations, fewer negative associations were found. This implies that these species are inclined to coexist independently through portioning of resources. 3) Among the later invading species, positive associations were significant and no negative associations were found, which suggests that these species have similar adaptive ability in the habitat and occupy overlapping niches in the community.
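The pairwise test on a 2 × 2 presence/absence table mentioned above can be sketched as follows. The abstract does not say whether a continuity (Yates) correction was applied, so this sketch omits it. The function name and the plot counts are hypothetical, and 3.841 is the standard χ2 critical value for one degree of freedom at the 0.05 level.

```python
def species_association(a, b, c, d, critical=3.841):
    """Chi-square test of association for two species from plot counts.

    a: plots containing both species    b: plots with species A only
    c: plots with species B only        d: plots with neither species
    Assumes no marginal total is zero; no Yates correction is applied.
    """
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    if a * d > b * c:
        direction = "positive"
    elif a * d < b * c:
        direction = "negative"
    else:
        direction = "null"
    return chi2, direction, chi2 > critical


# Hypothetical counts over 50 sample plots.
chi2, direction, significant = species_association(20, 5, 5, 20)
print(f"chi2 = {chi2:.2f}, association = {direction}, significant = {significant}")
```

The sign of ad − bc gives the direction of the association, while the χ2 value only measures its strength against independence.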
Whither Statistics Education Research?

Science.gov (United States)

Watson, Jane

2016-01-01

This year marks the 25th anniversary of the publication of a "National Statement on Mathematics for Australian Schools", which was the first curriculum statement this country had including "Chance and Data" as a significant component. It is hence an opportune time to survey the history of the related statistics education…
Planck 2013 results. XXIII. Isotropy and Statistics of the CMB

CERN Document Server

Ade, P.A.R.; Armitage-Caplan, C.; Arnaud, M.; Ashdown, M.; Atrio-Barandela, F.; Aumont, J.; Baccigalupi, C.; Banday, A.J.; Barreiro, R.B.; Bartlett, J.G.; Bartolo, N.; Battaner, E.; Battye, R.; Benabed, K.; Benoit, A.; Benoit-Levy, A.; Bernard, J.P.; Bersanelli, M.; Bielewicz, P.; Bobin, J.; Bock, J.J.; Bonaldi, A.; Bonavera, L.; Bond, J.R.; Borrill, J.; Bouchet, F.R.; Bridges, M.; Bucher, M.; Burigana, C.; Butler, R.C.; Cardoso, J.F.; Catalano, A.; Challinor, A.; Chamballu, A.; Chary, R.R.; Chiang, L.Y.; Chiang, H.C.; Christensen, P.R.; Church, S.; Clements, D.L.; Colombi, S.; Colombo, L.P.L.; Couchot, F.; Coulais, A.; Crill, B.P.; Cruz, M.; Curto, A.; Cuttaia, F.; Danese, L.; Davies, R.D.; Davis, R.J.; de Bernardis, P.; de Rosa, A.; de Zotti, G.; Delabrouille, J.; Delouis, J.M.; Desert, F.X.; Diego, J.M.; Dole, H.; Donzelli, S.; Dore, O.; Douspis, M.; Ducout, A.; Dupac, X.; Efstathiou, G.; Elsner, F.; Ensslin, T.A.; Eriksen, H.K.; Fantaye, Y.; Fergusson, J.; Finelli, F.; Forni, O.; Frailis, M.; Franceschi, E.; Frommert, M.; Galeotta, S.; Ganga, K.; Giard, M.; Giardino, G.; Giraud-Heraud, Y.; Gonzalez-Nuevo, J.; Gorski, K.M.; Gratton, S.; Gregorio, A.; Gruppuso, A.; Hansen, M.; Hansen, F.K.; Hanson, D.; Harrison, D.; Helou, G.; Henrot-Versille, S.; Hernandez-Monteagudo, C.; Herranz, D.; Hildebrandt, S.R.; Hivon, E.; Hobson, M.; Holmes, W.A.; Hornstrup, A.; Hovest, W.; Huffenberger, K.M.; Jaffe, T.R.; Jaffe, A.H.; Jones, W.C.; Juvela, M.; Keihanen, E.; Keskitalo, R.; Kim, J.; Kisner, T.S.; Knoche, J.; Knox, L.; Kunz, M.; Kurki-Suonio, H.; Lagache, G.; Lahteenmaki, A.; Lamarre, J.M.; Lasenby, A.; Laureijs, R.J.; Lawrence, C.R.; Leahy, J.P.; Leonardi, R.; Leroy, C.; Lesgourgues, J.; Liguori, M.; Lilje, P.B.; Linden-Vornle, M.; Lopez-Caniego, M.; Lubin, P.M.; Macias-Perez, J.F.; Maffei, B.; Maino, D.; Mandolesi, N.; Mangilli, A.; Marinucci, D.; Maris, M.; Marshall, D.J.; Martin, P.G.; Martinez-Gonzalez, E.; Masi, S.; Matarrese, S.; Matthai, F.; Mazzotta, P.; McEwen, 
J.D.; Meinhold, P.R.; Melchiorri, A.; Mendes, L.; Mennella, A.; Migliaccio, M.; Mikkelsen, K.; Mitra, S.; Miville-Deschenes, M.A.; Molinari, D.; Moneti, A.; Montier, L.; Morgante, G.; Mortlock, D.; Moss, A.; Munshi, D.; Naselsky, P.; Nati, F.; Natoli, P.; Netterfield, C.B.; Norgaard-Nielsen, H.U.; Noviello, F.; Novikov, D.; Novikov, I.; Osborne, S.; Oxborrow, C.A.; Paci, F.; Pagano, L.; Pajot, F.; Paoletti, D.; Pasian, F.; Patanchon, G.; Peiris, H.V.; Perdereau, O.; Perotto, L.; Perrotta, F.; Piacentini, F.; Piat, M.; Pierpaoli, E.; Pietrobon, D.; Plaszczynski, S.; Pointecouteau, E.; Pogosyan, D.; Polenta, G.; Ponthieu, N.; Popa, L.; Poutanen, T.; Pratt, G.W.; Prezeau, G.; Prunet, S.; Puget, J.L.; Rachen, J.P.; Rath, C.; Rebolo, R.; Reinecke, M.; Remazeilles, M.; Renault, C.; Renzi, A.; Ricciardi, S.; Riller, T.; Ristorcelli, I.; Rocha, G.; Rosset, C.; Rotti, A.; Roudier, G.; Rubino-Martin, J.A.; Rusholme, B.; Sandri, M.; Santos, D.; Savini, G.; Scott, D.; Seiffert, M.D.; Shellard, E.P.S.; Souradeep, T.; Spencer, L.D.; Starck, J.L.; Stolyarov, V.; Stompor, R.; Sudiwala, R.; Sureau, F.; Sutter, P.; Sutton, D.; Suur-Uski, A.S.; Sygnet, J.F.; Tauber, J.A.; Tavagnacco, D.; Terenzi, L.; Toffolatti, L.; Tomasi, M.; Tristram, M.; Tucci, M.; Tuovinen, J.; Turler, M.; Valenziano, L.; Valiviita, J.; Van Tent, B.; Varis, J.; Vielva, P.; Villa, F.; Vittorio, N.; Wade, L.A.; Wandelt, B.D.; Wehus, I.K.; White, M.; Wilkinson, A.; Yvon, D.; Zacchei, A.; Zonca, A.

2014-01-01

The two fundamental assumptions of the standard cosmological model - that the initial fluctuations are statistically isotropic and Gaussian - are rigorously tested using maps of the cosmic microwave background (CMB) anisotropy from the Planck satellite. Deviations from isotropy have been found and demonstrated to be robust against component separation algorithm, mask choice and frequency dependence. Many of these anomalies were previously observed in the WMAP data, and are now confirmed at similar levels of significance (about 3 sigma). However, we find little evidence for non-Gaussianity, with the exception of a few statistical signatures that seem to be associated with specific anomalies. In particular, we find that the quadrupole-octopole alignment is also connected to a low observed variance of the CMB signal. A power asymmetry is now found to persist to scales corresponding to about l=600, and can be described in the low-l regime by a phenomenological dipole modulation model. However, any primordial powe...
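For readers more used to p-values than to "sigma" levels, the sketch below converts a significance expressed in standard deviations into a two-sided tail probability. It assumes a Gaussian test statistic, which may not match how each significance was actually derived in the paper (for example, from simulations); the function name is illustrative.

```python
import math


def two_sided_p_from_sigma(n_sigma):
    """Two-sided Gaussian tail probability for a deviation of n_sigma."""
    return math.erfc(n_sigma / math.sqrt(2))


for n_sigma in (1, 2, 3, 5):
    print(f"{n_sigma} sigma -> p = {two_sided_p_from_sigma(n_sigma):.3g}")
```

Under this assumption, a 3 sigma anomaly corresponds to a two-sided p-value of roughly 0.0027.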
Improving esthetic results in benign parotid surgery: statistical evaluation of facelift approach, sternocleidomastoid flap, and superficial musculoaponeurotic system flap application.

Science.gov (United States)

Bianchi, Bernardo; Ferri, Andrea; Ferrari, Silvano; Copelli, Chiara; Sesenna, Enrico

2011-04-01

The purpose of this article was to analyze the efficacy of facelift incision, sternocleidomastoid muscle flap, and superficial musculoaponeurotic system flap for improving the esthetic results in patients undergoing partial parotidectomy for benign parotid tumor resection. The usefulness of partial parotidectomy is discussed, and a statistical evaluation of the esthetic results was performed. From January 1, 1996, to January 1, 2007, 274 patients treated for benign parotid tumors were studied. Of these, 172 underwent partial parotidectomy. The 172 patients were divided into 4 groups: partial parotidectomy with classic or modified Blair incision without reconstruction (group 1), partial parotidectomy with facelift incision and without reconstruction (group 2), partial parotidectomy with facelift incision associated with sternocleidomastoid muscle flap (group 3), and partial parotidectomy with facelift incision associated with superficial musculoaponeurotic system flap (group 4). Patients were considered, after a follow-up of at least 18 months, for functional and esthetic evaluation. The functional outcome was assessed considering the facial nerve function, Frey syndrome, and recurrence. The esthetic evaluation was performed by inviting the patients and a blind panel of 1 surgeon and 2 secretaries of the department to give a score of 1 to 10 to assess the final cosmetic outcome. The statistical analysis was finally performed using the Mann-Whitney U test for nonparametric data to compare the different group results. P less than .05 was considered significant. No recurrence developed in any of the 4 groups or in any of the 274 patients during the follow-up period. The statistical analysis, comparing group 1 and the other groups, revealed a highly significant statistical difference (P esthetic results in benign parotid surgery. The evaluation of functional complications and the recurrence rate in this series of patients has confirmed that this technique can be safely
Statistics

CERN Document Server

Hayslett, H T

1991-01-01

Statistics covers the basic principles of Statistics. The book starts by tackling the importance and the two kinds of statistics; the presentation of sample data; the definition, illustration and explanation of several measures of location; and the measures of variation. The text then discusses elementary probability, the normal distribution and the normal approximation to the binomial. Testing of statistical hypotheses and tests of hypotheses about the theoretical proportion of successes in a binomial population and about the theoretical mean of a normal population are explained. The text the
High energy behaviour of particles and unified statistics

International Nuclear Information System (INIS)

Chang, Y.

1984-01-01

Theories and experiments suggest that particles at high energy appear to possess a new statistics unifying Bose-Einstein and Fermi-Dirac statistics via the GAMMA distribution. This hypothesis can be obtained from many models, and agrees quantitatively with scaling, the multiplicity, large transverse momentum, the mass spectrum, and other data. It may be applied to scatterings at high energy, and agrees with experiments and known QED results. The Veneziano model and other theories have implied new statistics, such as the B distribution and the Polya distribution. They revert to the GAMMA distribution at high energy. The possible inapplicability of Pauli's exclusion principle within the unified statistics is considered and related to the quark constituents.
A statistical study of ionopause perturbation and associated boundary wave formation at Venus.

Science.gov (United States)

Chong, G. S.; Pope, S. A.; Walker, S. N.; Zhang, T.; Balikhin, M. A.

2017-12-01

In contrast to Earth, Venus does not possess an intrinsic magnetic field. Hence the interaction between solar wind and Venus is significantly different when compared to Earth, even though these two planets were once considered similar. Within the induced magnetosphere and ionosphere of Venus, previous studies have shown the existence of ionospheric boundary waves. These structures may play an important role in the atmospheric evolution of Venus. By using Venus Express data, the crossings of the ionopause boundary are determined based on the observations of photoelectrons during 2011. Pulses of dropouts in the electron energy spectrometer were observed in 92 events, which suggests potential perturbations of the boundary. Minimum variance analysis of the 1Hz magnetic field data for the perturbations is conducted and used to confirm the occurrence of the boundary waves. Statistical analysis shows that they were propagating mainly in the ±VSO-Y direction in the polar north terminator region. The generation mechanisms of boundary waves and their evolution into the potential nonlinear regime are discussed and analysed.
Dynamic association rules for gene expression data analysis.

Science.gov (United States)

Chen, Shu-Chuan; Tsai, Tsung-Hsien; Chung, Cheng-Han; Li, Wen-Hsiung

2015-10-14

The purpose of gene expression analysis is to look for the association between regulation of gene expression levels and phenotypic variations. This association, based on gene expression profiles, has been used to determine whether the induction/repression of genes corresponds to phenotypic variations including cell regulations, clinical diagnoses and drug development. Statistical analyses of microarray data have been developed to address the gene selection issue. However, these methods do not inform us of causality between genes and phenotypes. In this paper, we propose the dynamic association rule algorithm (DAR algorithm), which helps one to efficiently select a subset of significant genes for subsequent analysis. The DAR algorithm is based on association rules from market basket analysis in marketing. We first propose a statistical way, based on constructing a one-sided confidence interval and hypothesis testing, to determine if an association rule is meaningful. Based on the proposed statistical method, we then developed the DAR algorithm for gene expression data analysis. The method was applied to analyze four microarray datasets and one Next Generation Sequencing (NGS) dataset: the Mice Apo A1 dataset, the whole genome expression dataset of mouse embryonic stem cells, expression profiling of the bone marrow of Leukemia patients, the Microarray Quality Control (MAQC) data set and the RNA-seq dataset of a mouse genomic imprinting study. A comparison of the proposed method with the t-test on the expression profiling of the bone marrow of Leukemia patients was conducted. We developed a statistical way, based on the concept of confidence interval, to determine the minimum support and minimum confidence for mining association relationships among items. With the minimum support and minimum confidence, one can find significant rules in one single step. The DAR algorithm was then developed for gene expression data analysis. Four gene expression datasets showed that the proposed
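To make the idea of judging an association rule with a one-sided confidence interval more concrete, here is a minimal sketch. It is not the DAR algorithm itself, whose details the abstract does not give. It computes the support and confidence of a rule A -> B over a toy set of transactions and a one-sided lower bound on the confidence. The function name `rule_stats`, the toy transactions, the 95% level and the normal approximation are all assumptions made for illustration.

```python
from math import sqrt
from statistics import NormalDist


def rule_stats(transactions, antecedent, consequent, level=0.95):
    """Return (support, confidence, one-sided lower bound on confidence) for A -> B.

    Hypothetical helper: uses a normal approximation to the binomial,
    which is only one of several ways such a bound could be built.
    """
    n = len(transactions)
    # Transactions containing every item of the antecedent A.
    n_a = sum(1 for t in transactions if antecedent <= t)
    # Transactions containing every item of both A and B.
    n_ab = sum(1 for t in transactions if (antecedent | consequent) <= t)
    support = n_ab / n
    confidence = n_ab / n_a if n_a else 0.0
    # One-sided critical value (about 1.645 for a 95% level).
    z = NormalDist().inv_cdf(level)
    se = sqrt(confidence * (1 - confidence) / n_a) if n_a else 0.0
    lower = max(0.0, confidence - z * se)
    return support, confidence, lower


# Toy "market basket": each set lists the items observed together (assumed data).
transactions = [
    {"g1", "g2", "up"},
    {"g1", "up"},
    {"g1", "g2"},
    {"g2", "up"},
    {"g1", "g2", "up"},
    {"g3"},
]
support, confidence, lower = rule_stats(transactions, {"g1"}, {"up"})
print(f"support={support:.2f} confidence={confidence:.2f} lower bound={lower:.2f}")
```

In a sketch like this, a rule would be kept only when the lower bound exceeds a chosen minimum confidence; how the paper actually sets that threshold is not stated in the abstract.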
Basic statistics with Microsoft Excel: a review.

Science.gov (United States)

Divisi, Duilio; Di Leonardo, Gabriella; Zaccagna, Gino; Crisci, Roberto

2017-06-01

The scientific world is enriched daily with new knowledge, due to new technologies and continuous discoveries. Mathematical functions explain statistical concepts, particularly the mean, median and mode, along with frequency and frequency distributions associated with histograms and graphical representations, and they determine the computations carried out by spreadsheet operations. The aim of the study is to highlight the mathematical basis of statistical models that regulate the operation of spreadsheets in Microsoft Excel.
Journey Through Statistical Mechanics

Science.gov (United States)

Yang, C. N.

2013-05-01

My first involvement with statistical mechanics and the many body problem was when I was a student at The National Southwest Associated University in Kunming during the war. At that time Professor Wang Zhu-Xi had just come back from Cambridge, England, where he was a student of Fowler, and his thesis was on phase transitions, a hot topic at that time, and still a very hot topic today...
Industrial commodity statistics yearbook 2001. Production statistics (1992-2001)

International Nuclear Information System (INIS)

2003-01-01

This is the thirty-fifth in a series of annual compilations of statistics on world industry designed to meet both the general demand for information of this kind and the special requirements of the United Nations and related international bodies. Beginning with the 1992 edition, the title of the publication was changed to industrial Commodity Statistics Yearbook as the result of a decision made by the United Nations Statistical Commission at its twenty-seventh session to discontinue, effective 1994, publication of the Industrial Statistics Yearbook, volume I, General Industrial Statistics by the Statistics Division of the United Nations. The United Nations Industrial Development Organization (UNIDO) has become responsible for the collection and dissemination of general industrial statistics while the Statistics Division of the United Nations continues to be responsible for industrial commodity production statistics. The previous title, Industrial Statistics Yearbook, volume II, Commodity Production Statistics, was introduced in the 1982 edition. The first seven editions in this series were published under the title The Growth of World industry and the next eight editions under the title Yearbook of Industrial Statistics. This edition of the Yearbook contains annual quantity data on production of industrial commodities by country, geographical region, economic grouping and for the world. A standard list of about 530 commodities (about 590 statistical series) has been adopted for the publication. The statistics refer to the ten-year period 1992-2001 for about 200 countries and areas
Industrial commodity statistics yearbook 2002. Production statistics (1993-2002)

International Nuclear Information System (INIS)

2004-01-01

This is the thirty-sixth in a series of annual compilations of statistics on world industry designed to meet both the general demand for information of this kind and the special requirements of the United Nations and related international bodies. Beginning with the 1992 edition, the title of the publication was changed to industrial Commodity Statistics Yearbook as the result of a decision made by the United Nations Statistical Commission at its twenty-seventh session to discontinue, effective 1994, publication of the Industrial Statistics Yearbook, volume I, General Industrial Statistics by the Statistics Division of the United Nations. The United Nations Industrial Development Organization (UNIDO) has become responsible for the collection and dissemination of general industrial statistics while the Statistics Division of the United Nations continues to be responsible for industrial commodity production statistics. The previous title, Industrial Statistics Yearbook, volume II, Commodity Production Statistics, was introduced in the 1982 edition. The first seven editions in this series were published under the title 'The Growth of World industry' and the next eight editions under the title 'Yearbook of Industrial Statistics'. This edition of the Yearbook contains annual quantity data on production of industrial commodities by country, geographical region, economic grouping and for the world. A standard list of about 530 commodities (about 590 statistical series) has been adopted for the publication. The statistics refer to the ten-year period 1993-2002 for about 200 countries and areas
Industrial commodity statistics yearbook 2000. Production statistics (1991-2000)

International Nuclear Information System (INIS)

2002-01-01

This is the thirty-third in a series of annual compilations of statistics on world industry designed to meet both the general demand for information of this kind and the special requirements of the United Nations and related international bodies. Beginning with the 1992 edition, the title of the publication was changed to industrial Commodity Statistics Yearbook as the result of a decision made by the United Nations Statistical Commission at its twenty-seventh session to discontinue, effective 1994, publication of the Industrial Statistics Yearbook, volume I, General Industrial Statistics by the Statistics Division of the United Nations. The United Nations Industrial Development Organization (UNIDO) has become responsible for the collection and dissemination of general industrial statistics while the Statistics Division of the United Nations continues to be responsible for industrial commodity production statistics. The previous title, Industrial Statistics Yearbook, volume II, Commodity Production Statistics, was introduced in the 1982 edition. The first seven editions in this series were published under the title The Growth of World industry and the next eight editions under the title Yearbook of Industrial Statistics. This edition of the Yearbook contains annual quantity data on production of industrial commodities by country, geographical region, economic grouping and for the world. A standard list of about 530 commodities (about 590 statistical series) has been adopted for the publication. Most of the statistics refer to the ten-year period 1991-2000 for about 200 countries and areas
Acute myocardial infarction without significant coronary stenoses associated with endogenous subclinical hyperthyroidism.

Science.gov (United States)

Patanè, Salvatore; Marte, Filippo; Sturiale, Mauro

2012-04-05

Subclinical hyperthyroidism is an increasingly recognized entity that is defined as normal serum free thyroxine and free triiodothyronine levels with a thyroid-stimulating hormone level suppressed below the normal range and usually undetectable. It has been reported that subclinical hyperthyroidism is not associated with coronary heart disease or mortality from cardiovascular causes, but it is sufficient to induce arrhythmias including atrial fibrillation and atrial flutter. There is growing interest in endogenous subclinical hyperthyroidism and the cardiovascular system. We present a case of acute myocardial infarction without significant coronary stenoses in a 75-year-old Italian woman with endogenous subclinical hyperthyroidism. This case also focuses attention on the importance of a correct evaluation of endogenous subclinical hyperthyroidism. Copyright © 2009 Elsevier Ireland Ltd. All rights reserved.
Macro-indicators of citation impacts of six prolific countries: InCites data and the statistical significance of trends.

Directory of Open Access Journals (Sweden)

Lutz Bornmann

Using the InCites tool of Thomson Reuters, this study compares normalized citation impact values calculated for China, Japan, France, Germany, United States, and the UK throughout the time period from 1981 to 2010. InCites offers a unique opportunity to study the normalized citation impacts of countries using (i) a long publication window (1981 to 2010), (ii) a differentiation in (broad or more narrow) subject areas, and (iii) allowing for the use of statistical procedures in order to obtain an insightful investigation of national citation trends across the years. Using four broad categories, our results show significantly increasing trends in citation impact values for France, the UK, and especially Germany across the last thirty years in all areas. The citation impact of papers from China is still at a relatively low level (mostly below the world average), but the country follows an increasing trend line. The USA exhibits a stable pattern of high citation impact values across the years. With small impact differences between the publication years, the US trend is increasing in engineering and technology but decreasing in medical and health sciences as well as in agricultural sciences. Similar to the USA, Japan follows increasing as well as decreasing trends in different subject areas, but the variability across the years is small. In most of the years, papers from Japan perform below or approximately at the world average in each subject area.

Clinically significant fatigue: prevalence and associated factors in an international sample of adults with multiple sclerosis recruited via the internet.

Directory of Open Access Journals (Sweden)

Tracey J Weiland

Fatigue contributes a significant burden of disease for people with multiple sclerosis (PwMS). Modifiable lifestyle factors have been recognized as having a role in a range of morbidity outcomes in PwMS. There is significant potential to prevent and treat fatigue in PwMS by addressing modifiable risk factors. To explore the associations between clinically significant fatigue and demographic factors, clinical factors (health-related quality of life, disability and relapse rate) and modifiable lifestyle, disease-modifying drugs (DMD) and supplement use in a large international sample of PwMS. PwMS were recruited to the study via Web 2.0 platforms and completed a comprehensive survey measuring demographic, lifestyle and clinical characteristics, including health-related quality of life, disability, and relapse rate. Of 2469 participants with confirmed MS, 2138 (86.6%) completed a validated measure of clinically significant fatigue, the Fatigue Severity Scale. Participants were predominantly female from English speaking countries, with relatively high levels of education, and due to recruitment methods may have been highly pro-active about engaging in lifestyle management and self-help. Approximately two thirds of our sample (1402/2138; 65.6% (95% CI 63.7-67.7)) screened positive for clinically significant fatigue. Bivariate associations were present between clinically significant fatigue and several demographic, clinical, lifestyle, and medication variables. After controlling for level of disability and a range of stable socio-demographic variables, we found increased odds of fatigue associated with obesity, DMD use, poor diet, and reduced odds of fatigue with exercise, fish consumption, moderate alcohol use, and supplementation with vitamin D and flaxseed oil. This study supports strong and significant associations between clinically significant fatigue and modifiable lifestyle factors. Longitudinal follow-up of this sample may help clarify the contribution
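The prevalence figure quoted above (1402 of 2138 participants) can be turned into a confidence interval with a few lines of arithmetic. The sketch below uses the simple Wald (normal-approximation) interval. The abstract does not say which interval method the authors used, so this is an assumption, and the bounds it prints may differ slightly in the last digit from the reported 63.7-67.7. The function name `wald_interval` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def wald_interval(successes, n, level=0.95):
    """Proportion and two-sided Wald confidence interval (normal approximation)."""
    p = successes / n
    # Two-sided critical value (about 1.96 for a 95% level).
    z = NormalDist().inv_cdf(0.5 + level / 2)
    half_width = z * sqrt(p * (1 - p) / n)
    return p, p - half_width, p + half_width


# Counts taken from the abstract: 1402 of 2138 screened positive for fatigue.
p, lo, hi = wald_interval(1402, 2138)
print(f"{100 * p:.1f}% (95% CI {100 * lo:.1f}-{100 * hi:.1f})")
```

Other interval constructions (Wilson, exact binomial) give very similar bounds at this sample size, which is why the choice of method matters little here.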
The association between previous and future severe exacerbations of chronic obstructive pulmonary disease: Updating the literature using robust statistical methodology.

Science.gov (United States)

Sadatsafavi, Mohsen; Xie, Hui; Etminan, Mahyar; Johnson, Kate; FitzGerald, J Mark

2018-01-01

There is minimal evidence on the extent to which the occurrence of a severe acute exacerbation of COPD that results in hospitalization affects the subsequent disease course. Previous studies on this topic did not generate causally-interpretable estimates. Our aim was to use corrected methodology to update previously reported estimates of the associations between previous and future exacerbations in these patients. Using administrative health data in British Columbia, Canada (1997-2012), we constructed a cohort of patients with at least one severe exacerbation, defined as an episode of inpatient care with the main diagnosis of COPD based on international classification of diseases (ICD) codes. We applied a random-effects 'joint frailty' survival model that is particularly developed for the analysis of recurrent events in the presence of competing risk of death and heterogeneity among individuals in their rate of events. Previous severe exacerbations entered the model as dummy-coded time-dependent covariates, and the model was adjusted for several observable patient and disease characteristics. 35,994 individuals (mean age at baseline 73.7, 49.8% female, average follow-up 3.21 years) contributed 34,271 severe exacerbations during follow-up. The first event was associated with a hazard ratio (HR) of 1.75 (95%CI 1.69-1.82) for the risk of future severe exacerbations. This risk decreased to HR = 1.36 (95%CI 1.30-1.42) for the second event and to 1.18 (95%CI 1.12-1.25) for the third event. The first two severe exacerbations that occurred during follow-up were also significantly associated with increased risk of all-cause mortality. There was substantial heterogeneity in the individual-specific rate of severe exacerbations. Even after adjusting for observable characteristics, individuals in the 97.5th percentile of exacerbation rate had 5.6 times higher rate of severe exacerbations than those in the 2.5th percentile. Using robust statistical methodology that controlled
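For readers who want to see how the reported hazard ratios relate to their confidence intervals, the sketch below backs out the standard error on the log scale and the corresponding z statistic. It assumes the intervals are Wald-type intervals that are symmetric on the log-hazard scale, which is common for survival models but is not stated in the abstract. The helper name `log_hr_summary` is hypothetical.

```python
from math import log
from statistics import NormalDist


def log_hr_summary(hr, lower, upper, level=0.95):
    """Back out the log-scale standard error, z statistic and two-sided p-value.

    Assumes the interval was built as exp(log(HR) +/- z_crit * SE).
    """
    z_crit = NormalDist().inv_cdf(0.5 + level / 2)
    se = (log(upper) - log(lower)) / (2 * z_crit)
    z = log(hr) / se
    p_two_sided = 2 * (1 - NormalDist().cdf(abs(z)))
    return se, z, p_two_sided


# Hazard ratios and 95% CIs as reported in the abstract.
reported = {
    "first event": (1.75, 1.69, 1.82),
    "second event": (1.36, 1.30, 1.42),
    "third event": (1.18, 1.12, 1.25),
}
for event, (hr, lo, hi) in reported.items():
    se, z, p = log_hr_summary(hr, lo, hi)
    print(f"{event}: log-HR SE={se:.4f}, z={z:.1f}, two-sided p={p:.2g}")
```

Because the published bounds are rounded to two decimals, the recovered standard errors are approximate.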
A functionally significant polymorphism in ID3 is associated with human coronary pathology.

Directory of Open Access Journals (Sweden)

Ani Manichaikul

We previously identified an association between the ID3 SNP rs11574 and carotid intima-media thickness in the Diabetes Heart Study, a predominantly White diabetic population. The nonsynonymous SNP rs11574 results in an amino acid substitution in the C-terminal region of ID3, attenuating the dominant negative function of ID3 as an inhibitor of basic HLH factor E12-mediated transcription. In the current investigation, we characterize the association between the functionally significant polymorphism in ID3, rs11574, with human coronary pathology. The Multi-Ethnic Study of Atherosclerosis (MESA) is a longitudinal study of subclinical cardiovascular disease, including non-Hispanic White (n = 2,588), African American (n = 2,560) and Hispanic (n = 2,130) participants with data on coronary artery calcium (CAC). The Coronary Assessment in Virginia cohort (CAVA) included 71 patients aged 30-80 years, undergoing a medically necessary cardiac catheterization and intravascular ultrasound (IVUS) at the University of Virginia. ID3 SNP rs11574 risk allele was associated with the presence of CAC in MESA Whites (P = 0.017). In addition, the risk allele was associated with greater atheroma burden and stenosis in the CAVA cohort (P = 0.003, P = 0.04 respectively). The risk allele remained predictive of atheroma burden in multivariate analysis (Model 1: covariates age, gender, and LDL, regression coefficient = 9.578, SE = 3.657, p = 0.0110; Model 2: covariates Model 1, presence of hypertension, presence of diabetes, regression coefficient = 8.389, SE = 4.788, p = 0.0163). We present additional cohorts that demonstrate association of ID3 SNP rs11574 directly with human coronary artery pathology as measured by CAC and IVUS: one a multiethnic, relatively healthy population with low levels of diabetes and the second a predominantly White population with a higher incidence of T2DM referred for cardiac catheterization.
Encounter Probability of Significant Wave Height

DEFF Research Database (Denmark)

Liu, Z.; Burcharth, H. F.

The determination of the design wave height (often given as the significant wave height) is usually based on statistical analysis of long-term extreme wave height measurement or hindcast. The result of such extreme wave height analysis is often given as the design wave height corresponding to a c...
Clinical significance of intramammary arterial calcifications in diabetic women

Directory of Open Access Journals (Sweden)

Milošević Zorica

2004-01-01

Background. It is well known that intramammary arterial calcifications diagnosed by mammography as a part of generalized diabetic macroangiopathy may be an indirect sign of diabetes mellitus. Hence, the aim of this study was to determine the incidence of intramammary arterial calcifications, the patient’s age when the calcifications occur, as well as to observe the influence of diabetic polyneuropathy, type, and the duration of diabetes on the onset of calcifications, in comparison with nondiabetic women. Methods. Mammographic findings of 113 diabetic female patients (21 with type 1 diabetes and 92 with type 2), as well as of 208 nondiabetic women (the control group), were analyzed in the prospective study. The data about the type of diabetes, its duration, and polyneuropathy were obtained using the questionnaire. Statistical differences were determined by the Mann-Whitney test. Results. Intramammary arterial calcifications were identified in 33.3% of the women with type 1 diabetes, in 40.2% with type 2, and in 8.2% of the women from the control group, respectively. The differences between the women with type 1, as well as type 2, diabetes and the controls were statistically significant (p=0.0001). Women with intramammary arterial calcifications and type 1 diabetes were younger than the control group (median age 52 years, compared with 67 years, p=0.001), while there was no statistically significant difference in age between the women with calcifications and type 2 diabetes (61 years of age) and the control group (p=0.176). The incidence of polyneuropathy in diabetic women was higher in the group with intramammary arterial calcifications (52.3%) in comparison to the group without calcifications (26.1%) (p=0.005). The association between intramammary arterial calcifications and the duration of diabetes was not found. Conclusion. The obtained results supported the theory that intramammary arterial calcifications, detected by
Clinically significant bleeding in incurable cancer patients: effectiveness of hemostatic radiotherapy

International Nuclear Information System (INIS)

Cihoric, Nikola; Crowe, Susanne; Eychmüller, Steffen; Aebersold, Daniel M; Ghadjar, Pirus

2012-01-01

This study was performed to evaluate the outcome after hemostatic radiotherapy (RT) of significant bleeding in incurable cancer patients. Patients treated by hemostatic RT between November 2006 and February 2010 were retrospectively analyzed. Bleeding was assessed according to the World Health Organization (WHO) scale (grade 0 = no bleeding, 1 = petechial bleeding, 2 = clinically significant bleeding, 3 = bleeding requiring transfusion, 4 = bleeding associated with fatality). The primary endpoint was bleeding at the end of RT. Key secondary endpoints included overall survival (OS) and acute toxicity. The bleeding score before and after RT were compared using the Wilcoxon signed rank test. Time to event endpoints were estimated using the Kaplan Meier method. Overall 62 patients were analyzed including 1 patient whose benign cause of bleeding was pseudomyxoma peritonei. Median age was 66 (range, 37–93) years. Before RT, bleeding was graded as 2 and 3 in 24 (39%) and 38 (61%) patients, respectively. A median dose of 20 (range, 5–45) Gy of hemostatic RT was applied to the bleeding site. At the end of RT, there was a statistically significant difference in bleeding (p < 0.001); it was graded as 0 (n = 39), 1 (n = 12), 2 (n = 6), 3 (n = 4) and 4 (n = 1). With a median follow-up of 19.3 (range, 0.3-19.3) months, the 6-month OS rate was 43%. Forty patients died (65%); 5 due to bleeding. No grade 3 or above acute toxicity was observed. Hemostatic RT seems to be a safe and effective treatment for clinically and statistically significantly reducing bleeding in incurable cancer patients
688,112 statistical results: Content mining psychology articles for statistical test results

OpenAIRE

Hartgerink, C.H.J.

2016-01-01

In this data deposit, I describe a dataset that is the result of content mining 167,318 published articles for statistical test results reported according to the standards prescribed by the American Psychological Association (APA). Articles published by the APA, Springer, Sage, and Taylor & Francis were included (mining from Wiley and Elsevier was actively blocked). As a result of this content mining, 688,112 results from 50,845 articles were extracted. In order to provide a comprehensive set...
[The significance of peripheral neutrophil CD55 and myeloperoxidase expression in patients with myeloperoxidase-specific anti-neutrophil cytoplasmic antibody associated vasculitis].

Science.gov (United States)

Zhou, X L; Zheng, M J; Shuai, Z W; Zhang, L; Zhang, M M; Chen, S Y

2017-06-01

Objective: To investigate the expression of CD55 and myeloperoxidase (MPO) on neutrophils in patients with MPO-specific anti-neutrophil cytoplasmic antibody associated vasculitis (MPO-AAV), and to analyze the relationship between the expression and clinical manifestations. Methods: Forty untreated patients with active MPO-AAV (patient group) and 30 healthy volunteers (control group) were enrolled in this study. CD55 on neutrophils and both membrane and cytoplasmic MPO were detected by flow cytometry. Serum Ba (a fragment of activated complement factor B) and MPO were measured by ELISA. The clinical activity of vasculitis was assessed by the Birmingham vasculitis activity score, version 3 (BVAS-V3). The significance of laboratory data was evaluated by Spearman correlation test and multivariate linear regression analysis. Results: (1) The mean fluorescence intensity (MFI) of CD55 expressed on neutrophils was significantly higher in the patient group than in the control group [4 068.6±2 306.0 vs 2 999.5±1 504.9, P = 0.033]. Serum MPO and Ba were likewise higher in the patient group than in controls [500.0 (381.0, 612.7) IU/L vs 286.9 (225.5, 329.1) IU/L, P < 0.001; 35.2 (25.2, 79.5) ng/L vs 18.0 (15.0, 28.0) ng/L, P < 0.001, respectively]. However, the MFI of cytoplasmic MPO in patients was significantly lower than that of the control group (1 577.1±1 175.9 vs 3 105.3±2 323.0, P = 0.003). (2) In the patient group, cytoplasmic intensity of MPO was negatively associated with the serum levels of MPO (r = -0.710, P < 0.001) and Ba (r = -0.589, P = 0.001). Moreover, serum MPO was positively associated with serum Ba (r = 0.691, P < 0.001). Membrane intensity of CD55 on neutrophils was positively correlated with patient age (r = 0.514, P = 0.001), C reactive protein (r = 0.376, P = 0.018), peripheral neutrophil count (r = 0.485, P = 0.001) and BVAS-V3 (r = 0.484, P = 0.002), whereas a negative correlation between membrane CD55 and disease duration was seen (r = -0.403, P = 0.01). (3) The result of multiple
Statistical analysis of brake squeal noise

Science.gov (United States)

Oberst, S.; Lai, J. C. S.

2011-06-01

Despite substantial research efforts applied to the prediction of brake squeal noise since the early 20th century, the mechanisms behind its generation are still not fully understood. Squealing brakes are of significant concern to the automobile industry, mainly because of the costs associated with warranty claims. In order to remedy the problems inherent in designing quieter brakes and, therefore, to understand the mechanisms, a design of experiments study, using a noise dynamometer, was performed by a brake system manufacturer to determine the influence of geometrical parameters (namely, the number and location of slots) of brake pads on brake squeal noise. The experimental results were evaluated with a noise index and ranked for warm and cold brake stops. These data are analysed here using statistical descriptors based on population distributions, and a correlation analysis, to gain greater insight into the functional dependency between the time-averaged friction coefficient as the input and the peak sound pressure level data as the output quantity. The correlation analysis between the time-averaged friction coefficient and peak sound pressure data is performed by applying a semblance analysis and a joint recurrence quantification analysis. Linear measures are compared with complexity measures (nonlinear) based on statistics from the underlying joint recurrence plots. Results show that linear measures cannot be used to rank the noise performance of the four test pad configurations. On the other hand, the ranking of the noise performance of the test pad configurations based on the noise index agrees with that based on nonlinear measures: the higher the nonlinearity between the time-averaged friction coefficient and peak sound pressure, the worse the squeal. These results highlight the nonlinear character of brake squeal and indicate the potential of using nonlinear statistical analysis tools to analyse disc brake squeal.
[Comment on] Statistical discrimination

Science.gov (United States)

Chinn, Douglas

In the December 8, 1981, issue of Eos, a news item reported the conclusion of a National Research Council study that sexual discrimination against women with Ph.D.'s exists in the field of geophysics. Basically, the item reported that even when allowances are made for motherhood the percentage of female Ph.D.'s holding high university and corporate positions is significantly lower than the percentage of male Ph.D.'s holding the same types of positions. The sexual discrimination conclusion, based only on these statistics, assumes that there are no basic psychological differences between men and women that might cause different populations in the employment group studied. Therefore, the reasoning goes, after taking into account possible effects from differences related to anatomy, such as women stopping their careers in order to bear and raise children, the statistical distributions of positions held by male and female Ph.D.'s ought to be very similar to one another. Any significant differences between the distributions must be caused primarily by sexual discrimination.
Emergence of quantum mechanics from classical statistics

International Nuclear Information System (INIS)

Wetterich, C

2009-01-01

The conceptual setting of quantum mechanics has been subject to an ongoing debate from its beginnings until now. The consequences of the apparent differences between quantum statistics and classical statistics range from philosophical interpretations to practical issues such as quantum computing. In this note we demonstrate how quantum mechanics can emerge from classical statistical systems. We discuss conditions and circumstances for this to happen. Quantum systems describe isolated subsystems of classical statistical systems with infinitely many states. While infinitely many classical observables 'measure' properties of the subsystem and its environment, the state of the subsystem can be characterized by the expectation values of only a few probabilistic observables. They define a density matrix, and all the usual laws of quantum mechanics follow. No concepts beyond classical statistics are needed for quantum physics - the differences are only apparent and result from the particularities of those classical statistical systems which admit a quantum mechanical description. In particular, we show how the non-commuting properties of quantum operators are associated with the use of conditional probabilities within the classical system, and how a unitary time evolution reflects the isolation of the subsystem.
Interaction between FOXO1A-209 Genotype and Tea Drinking is Significantly Associated with Reduced Mortality at Advanced Ages

DEFF Research Database (Denmark)

Zeng, Yi; Chen, Huashuai; Ni, Ting

2016-01-01

Based on the genotypic/phenotypic data from Chinese Longitudinal Healthy Longevity Survey (CLHLS) and Cox proportional hazard model, the present study demonstrates that interactions between carrying FOXO1A-209 genotypes and tea drinking are significantly associated with lower risk of mortality at advanced ages. Such significant association is replicated in two independent Han Chinese CLHLS cohorts (p = 0.028-0.048 in the discovery and replication cohorts, and p = 0.003-0.016 in the combined dataset). We found the associations between tea drinking and reduced mortality are much stronger among carriers of the FOXO1A-209 genotype compared to non-carriers, and drinking tea is associated with a reversal of the negative effects of carrying FOXO1A-209 minor alleles, that is, from a substantially increased mortality risk to substantially reduced mortality risk at advanced ages. The impacts are considerably...
Using statistics to understand the environment

CERN Document Server

Cook, Penny A

2000-01-01

Using Statistics to Understand the Environment covers all the basic tests required for environmental practicals and projects and points the way to the more advanced techniques that may be needed in more complex research designs. Following an introduction to project design, the book covers methods to describe data, to examine differences between samples, and to identify relationships and associations between variables. Featuring worked examples covering a wide range of environmental topics, drawings and icons, chapter summaries, a glossary of statistical terms and a further reading section, this book focuses on the needs of the researcher rather than on the mathematics behind the tests.
Nonextensive statistical mechanics of ionic solutions

International Nuclear Information System (INIS)

Varela, L.M.; Carrete, J.; Munoz-Sola, R.; Rodriguez, J.R.; Gallego, J.

2007-01-01

Classical mean-field Poisson-Boltzmann theory of ionic solutions is revisited in the theoretical framework of nonextensive Tsallis statistics. The nonextensive equivalent of the Poisson-Boltzmann equation is formulated by revisiting the statistical mechanics of liquids, and the Debye-Hueckel framework is shown to be valid for highly diluted solutions even under circumstances where nonextensive thermostatistics must be applied. The lowest order corrections associated with nonadditive effects are identified for both symmetric and asymmetric electrolytes, and the behavior of the average electrostatic potential in a homogeneous system is analytically and numerically analyzed for various values of the complexity measurement nonextensive parameter q.
GREY STATISTICS METHOD OF TECHNOLOGY SELECTION FOR ADVANCED PUBLIC TRANSPORTATION SYSTEMS

Directory of Open Access Journals (Sweden)

Chien Hung WEI

2003-01-01

Full Text Available Taiwan is involved in intelligent transportation systems planning, and is now selecting its priority focus areas for investment and development. The high social and economic impact associated with the choice of intelligent transportation systems technology explains the efforts of various electronics and transportation corporations to develop such technology and expand their business opportunities. However, there has been no detailed research conducted with regard to selecting technology for advanced public transportation systems in Taiwan. Thus, the present paper demonstrates a grey statistics method integrated with a scenario method for solving the problem of selecting advanced public transportation systems technology for Taiwan. A comprehensive questionnaire survey was conducted to demonstrate the effectiveness of the grey statistics method. The proposed approach indicated that contactless smart card technology is the appropriate technology for Taiwan to develop in the near future. Our research results imply that the grey statistics method is an effective method for selecting advanced public transportation systems technologies. We feel our information will be beneficial to the private sector for developing an appropriate intelligent transportation systems technology strategy.
Harmonic statistics

International Nuclear Information System (INIS)

Eliazar, Iddo

2017-01-01

The exponential, the normal, and the Poisson statistical laws are of major importance due to their universality. Harmonic statistics are as universal as the three aforementioned laws, but yet they fall short in their ‘public relations’ for the following reason: the full scope of harmonic statistics cannot be described in terms of a statistical law. In this paper we describe harmonic statistics, in their full scope, via an object termed harmonic Poisson process: a Poisson process, over the positive half-line, with a harmonic intensity. The paper reviews the harmonic Poisson process, investigates its properties, and presents the connections of this object to an assortment of topics: uniform statistics, scale invariance, random multiplicative perturbations, Pareto and inverse-Pareto statistics, exponential growth and exponential decay, power-law renormalization, convergence and domains of attraction, the Langevin equation, diffusions, Benford’s law, and 1/f noise. - Highlights: • Harmonic statistics are described and reviewed in detail. • Connections to various statistical laws are established. • Connections to perturbation, renormalization and dynamics are established.
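The harmonic Poisson process described above can be illustrated with a short simulation. This is a minimal sketch, not the paper's own construction: it assumes an intensity of the form c/x on a bounded window [a, b], and the function name and parameters are hypothetical. It relies on the fact that taking logarithms turns a Poisson process with intensity c/x into a homogeneous Poisson process with rate c, so the expected number of points in [a, b] is c ln(b/a).

```python
import math
import random


def simulate_harmonic_poisson(c, a, b, rng):
    """Sample the points of a Poisson process with intensity c/x on [a, b].

    Hypothetical helper: under t = ln(x) the process is homogeneous with
    rate c, so points are generated with exponential gaps in log space.
    """
    if not (0 < a < b) or c <= 0:
        raise ValueError("require 0 < a < b and c > 0")
    points = []
    t = math.log(a)
    log_b = math.log(b)
    while True:
        t += rng.expovariate(c)  # exponential gap with mean 1/c in log space
        if t > log_b:
            return points        # already sorted in increasing order
        points.append(math.exp(t))


if __name__ == "__main__":
    rng = random.Random(0)
    print(simulate_harmonic_poisson(2.0, 1.0, 1000.0, rng))
```

Because the construction depends only on the ratio b/a, rescaling the window (for example from [1, 10] to [100, 1000]) leaves the distribution of the point count unchanged, which is one way to see the scale invariance mentioned in the abstract.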
Willingness to share research data is related to the strength of the evidence and the quality of reporting of statistical results.

Directory of Open Access Journals (Sweden)

Jelte M Wicherts

Full Text Available BACKGROUND: The widespread reluctance to share published research data is often hypothesized to be due to the authors' fear that reanalysis may expose errors in their work or may produce conclusions that contradict their own. However, these hypotheses have not previously been studied systematically. METHODS AND FINDINGS: We related the reluctance to share research data for reanalysis to 1148 statistically significant results reported in 49 papers published in two major psychology journals. We found the reluctance to share data to be associated with weaker evidence (against the null hypothesis of no effect) and a higher prevalence of apparent errors in the reporting of statistical results. The unwillingness to share data was particularly clear when reporting errors had a bearing on statistical significance. CONCLUSIONS: Our findings on the basis of psychological papers suggest that statistical results are particularly hard to verify when reanalysis is more likely to lead to contrasting conclusions. This highlights the importance of establishing mandatory data archiving policies.
Inherited Disease Genetics Improves the Identification of Cancer-Associated Genes.

Directory of Open Access Journals (Sweden)

Boyang Zhao

2016-06-01

Full Text Available The identification of biologically significant variants in cancer genomes is critical to therapeutic discovery, but it is limited by the statistical power needed to discern driver from passenger. Independent biological data can be used to filter cancer exomes and increase statistical power. Large genetic databases for inherited diseases are uniquely suited to this task because they contain specific amino acid alterations with known pathogenicity and molecular mechanisms. However, no rigorous method to overlay this information onto the cancer exome exists. Here, we present a computational methodology that overlays any variant database onto the somatic mutations in all cancer exomes. We validate the computation experimentally and identify novel associations in a re-analysis of 7362 cancer exomes. This analysis identified activating SOS1 mutations associated with Noonan syndrome as significantly altered in melanoma and the first kinase-activating mutations in ACVR1 associated with adult tumors. Beyond a filter, significant variants found in both rare cancers and rare inherited diseases increase the unmet medical need for therapeutics that target these variants and may bootstrap drug discovery efforts in orphan indications.
Statistical mechanics for a class of quantum statistics

International Nuclear Information System (INIS)

Isakov, S.B.

1994-01-01

Generalized statistical distributions for identical particles are introduced for the case where filling a single-particle quantum state by particles depends on filling states of different momenta. The system of one-dimensional bosons with a two-body potential that can be solved by means of the thermodynamic Bethe ansatz is shown to be equivalent thermodynamically to a system of free particles obeying statistical distributions of the above class. The quantum statistics arising in this way are completely determined by the two-particle scattering phases of the corresponding interacting systems. An equation determining the statistical distributions for these statistics is derived

Data-driven inference for the spatial scan statistic.

Science.gov (United States)

Almeida, Alexandre C L; Duarte, Anderson R; Duczmal, Luiz H; Oliveira, Fernando L P; Takahashi, Ricardo H C

2011-08-02

Kulldorff's spatial scan statistic for aggregated area maps searches for clusters of cases without specifying their size (number of areas) or geographic location in advance. Their statistical significance is tested while adjusting for the multiple testing inherent in such a procedure. However, as is shown in this work, this adjustment is not done in an even manner for all possible cluster sizes. A modification is proposed to the usual inference test of the spatial scan statistic, incorporating additional information about the size of the most likely cluster found. A new interpretation of the results of the spatial scan statistic is done, posing a modified inference question: what is the probability that the null hypothesis is rejected for the original observed cases map with a most likely cluster of size k, taking into account only those most likely clusters of size k found under null hypothesis for comparison? This question is especially important when the p-value computed by the usual inference process is near the alpha significance level, regarding the correctness of the decision based in this inference. A practical procedure is provided to make more accurate inferences about the most likely cluster found by the spatial scan statistic.
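The modified inference question above can be sketched as two Monte Carlo p-values computed from the same set of null simulations. This is an illustrative sketch only: the function name, the (statistic, size) input format, and the rank-based estimate with +1 in numerator and denominator are assumptions, not details taken from the paper.

```python
def scan_p_values(observed_stat, observed_size, null_replicates):
    """Return (usual_p, size_conditioned_p) for a most likely cluster.

    null_replicates is a list of (statistic, size) pairs, one per map
    simulated under the null hypothesis, describing the most likely
    cluster found in that map. Hypothetical helper for illustration.
    """
    # Usual inference: compare against every null replicate.
    exceed_all = sum(1 for stat, _ in null_replicates if stat >= observed_stat)
    usual_p = (exceed_all + 1) / (len(null_replicates) + 1)

    # Modified inference: compare only against null replicates whose
    # most likely cluster has the same size k as the observed one.
    same_size = [stat for stat, size in null_replicates if size == observed_size]
    if not same_size:
        return usual_p, None  # no replicate of size k to compare with
    exceed_same = sum(1 for stat in same_size if stat >= observed_stat)
    conditioned_p = (exceed_same + 1) / (len(same_size) + 1)
    return usual_p, conditioned_p
```

When the two values fall on opposite sides of the alpha level, the decision depends on which reference set is used, which is the situation the abstract singles out as important.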
Statistics for Learning Genetics

Science.gov (United States)

Charles, Abigail Sheena

This study investigated the knowledge and skills that biology students may need to help them understand statistics/mathematics as it applies to genetics. The data are based on analyses of current representative genetics texts, practicing genetics professors' perspectives, and more directly, students' perceptions of, and performance in, doing statistically-based genetics problems. This issue is at the emerging edge of modern college-level genetics instruction, and this study attempts to identify key theoretical components for creating a specialized biological statistics curriculum. The goal of this curriculum will be to prepare biology students with the skills for assimilating quantitatively-based genetic processes, increasingly at the forefront of modern genetics. To fulfill this, two college level classes at two universities were surveyed. One university was located in the northeastern US and the other in the West Indies. There was a sample size of 42 students and a supplementary interview was administered to a select 9 students. Interviews were also administered to professors in the field in order to gain insight into the teaching of statistics in genetics. Key findings indicated that students had very little to no background in statistics (55%). Although students did perform well on exams with 60% of the population receiving an A or B grade, 77% of them did not offer good explanations on a probability question associated with the normal distribution provided in the survey. The scope and presentation of the applicable statistics/mathematics in some of the most used textbooks in genetics teaching, as well as genetics syllabi used by instructors, do not help the issue. It was found that the textbooks often either did not give effective explanations for students, or completely left out certain topics. The omission of certain statistically/mathematically oriented topics was also seen in the genetics syllabi reviewed for this study. Nonetheless
Medical Statistics – Mathematics or Oracle? Farewell Lecture

Directory of Open Access Journals (Sweden)

Gaus, Wilhelm

2005-06-01

Full Text Available Certainty is rare in medicine. This is a direct consequence of the individuality of each and every human being and the reason why we need medical statistics. However, statistics have their pitfalls, too. Fig. 1 shows that the suicide rate peaks in youth, while in Fig. 2 the rate is highest in midlife and Fig. 3 in old age. Which of these contradictory messages is right? After an introduction to the principles of statistical testing, this lecture examines the probability with which statistical test results are correct. For this purpose the level of significance and the power of the test are compared with the sensitivity and specificity of a diagnostic procedure. The probability of obtaining correct statistical test results is the same as that for the positive and negative correctness of a diagnostic procedure and therefore depends on prevalence. The focus then shifts to the problem of multiple statistical testing. The lecture demonstrates that for each data set of reasonable size at least one test result proves to be significant - even if the data set is produced by a random number generator. It is extremely important that a hypothesis is generated independently from the data used for its testing. These considerations enable us to understand the gradation of "lame excuses, lies and statistics" and the difference between pure truth and the full truth. Finally, two historical oracles are cited.
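Two of the lecture's points lend themselves to a short numerical sketch: the chance of at least one significant result among many tests on random data, and the dependence of a correct "significant" verdict on prevalence. The code below is a hedged illustration, assuming independent tests and treating the share of true hypotheses as the "prevalence"; the function names are hypothetical and the formulas are the standard family-wise error rate and positive predictive value.

```python
def familywise_error(alpha, n_tests):
    """Probability of at least one false positive in n independent tests."""
    return 1.0 - (1.0 - alpha) ** n_tests


def prob_significant_is_true(alpha, power, prevalence):
    """Probability that a significant result reflects a real effect.

    Mirrors the positive predictive value of a diagnostic test, with
    power in the role of sensitivity and alpha as 1 - specificity.
    """
    true_positives = power * prevalence
    false_positives = alpha * (1.0 - prevalence)
    return true_positives / (true_positives + false_positives)


if __name__ == "__main__":
    # 20 independent tests on pure noise at the 5% level
    print(round(familywise_error(0.05, 20), 4))
    # 80% power, 5% level, 10% of tested hypotheses actually true
    print(round(prob_significant_is_true(0.05, 0.8, 0.1), 2))
```

With many tests the first quantity approaches 1, which is the sense in which a data set of reasonable size almost always yields some significant result.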
Development of the Statistical Reasoning in Biology Concept Inventory (SRBCI)

Science.gov (United States)

Deane, Thomas; Nomme, Kathy; Jeffery, Erica; Pollock, Carol; Birol, Gülnur

2016-01-01

We followed established best practices in concept inventory design and developed a 12-item inventory to assess student ability in statistical reasoning in biology (Statistical Reasoning in Biology Concept Inventory [SRBCI]). It is important to assess student thinking in this conceptual area, because it is a fundamental requirement of being statistically literate and associated skills are needed in almost all walks of life. Despite this, previous work shows that non–expert-like thinking in statistical reasoning is common, even after instruction. As science educators, our goal should be to move students along a novice-to-expert spectrum, which could be achieved with growing experience in statistical reasoning. We used item response theory analyses (the one-parameter Rasch model and associated analyses) to assess responses gathered from biology students in two populations at a large research university in Canada in order to test SRBCI’s robustness and sensitivity in capturing useful data relating to the students’ conceptual ability in statistical reasoning. Our analyses indicated that SRBCI is a unidimensional construct, with items that vary widely in difficulty and provide useful information about such student ability. SRBCI should be useful as a diagnostic tool in a variety of biology settings and as a means of measuring the success of teaching interventions designed to improve statistical reasoning skills. PMID:26903497
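For readers unfamiliar with the one-parameter Rasch model mentioned above, its core is a single logistic formula linking student ability and item difficulty. The snippet is a generic sketch of that formula, not the authors' analysis code; the function and argument names are hypothetical.

```python
import math


def rasch_probability(ability, difficulty):
    """Probability of a correct response under the one-parameter Rasch model.

    Both arguments are on the same logit scale; the probability depends
    only on their difference.
    """
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


if __name__ == "__main__":
    # A student whose ability equals the item difficulty has a 50% chance.
    print(rasch_probability(0.0, 0.0))
    # The same student facing an easier and a harder item.
    print(rasch_probability(0.0, -1.0), rasch_probability(0.0, 1.0))
```

Items that "vary widely in difficulty" correspond to a wide spread of the difficulty parameter, so the inventory can separate students across a broad range of ability.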
Testing for significance of phase synchronisation dynamics in the EEG.

Science.gov (United States)

Daly, Ian; Sweeney-Reed, Catherine M; Nasuto, Slawomir J

2013-06-01

A number of tests exist to check for statistical significance of phase synchronisation within the Electroencephalogram (EEG); however, the majority suffer from a lack of generality and applicability. They may also fail to account for temporal dynamics in the phase synchronisation, regarding synchronisation as a constant state instead of a dynamical process. Therefore, a novel test is developed for identifying the statistical significance of phase synchronisation based upon a combination of work characterising temporal dynamics of multivariate time-series and Markov modelling. We show how this method is better able to assess the significance of phase synchronisation than a range of commonly used significance tests. We also show how the method may be applied to identify and classify significantly different phase synchronisation dynamics in both univariate and multivariate datasets.
Statistical learning and auditory processing in children with music training: An ERP study.

Science.gov (United States)

Mandikal Vasuki, Pragati Rao; Sharma, Mridula; Ibrahim, Ronny; Arciuli, Joanne

2017-07-01

The question of whether musical training is associated with enhanced auditory and cognitive abilities in children is of considerable interest. In the present study, we compared children with music training versus those without music training across a range of auditory and cognitive measures, including the ability to implicitly detect statistical regularities in input (statistical learning). Statistical learning of regularities embedded in auditory and visual stimuli was measured in musically trained and age-matched untrained children between the ages of 9-11 years. In addition to collecting behavioural measures, we recorded electrophysiological measures to obtain an online measure of segmentation during the statistical learning tasks. Musically trained children showed better performance on melody discrimination, rhythm discrimination, frequency discrimination, and auditory statistical learning. Furthermore, grand-averaged ERPs showed that triplet onset (initial stimulus) elicited larger responses in the musically trained children during both auditory and visual statistical learning tasks. In addition, children's music skills were associated with performance on auditory and visual behavioural statistical learning tasks. Our data suggest that individual differences in musical skills are associated with children's ability to detect regularities. The ERP data suggest that musical training is associated with better encoding of both auditory and visual stimuli. Although causality must be explored in further research, these results may have implications for developing music-based remediation strategies for children with learning impairments. Copyright © 2017 International Federation of Clinical Neurophysiology. Published by Elsevier B.V. All rights reserved.
Investigating spousal concordance of diabetes through statistical analysis and data mining.

Directory of Open Access Journals (Sweden)

Jong-Yi Wang

Full Text Available Spousal clustering of diabetes merits attention. Whether old-age vulnerability or a shared family environment determines the concordance of diabetes is also uncertain. This study investigated the spousal concordance of diabetes and compared the risk of diabetes concordance between couples and noncouples by using nationally representative data. A total of 22,572 individuals identified from the 2002-2013 National Health Insurance Research Database of Taiwan constituted 5,643 couples and 5,643 noncouples through 1:1 dual propensity score matching (PSM). Factors associated with concordance in both spouses with diabetes were analyzed at the individual level. The risk of diabetes concordance between couples and noncouples was compared at the couple level. Logistic regression was the main statistical method. Statistical data were analyzed using SAS 9.4. C&RT and Apriori of data mining conducted in IBM SPSS Modeler 13 served as a supplement to statistics. High odds of the spousal concordance of diabetes were associated with old age, middle levels of urbanization, and high comorbidities (all P < 0.05). The dual PSM analysis revealed that the risk of diabetes concordance was significantly higher in couples (5.19%) than in noncouples (0.09%; OR = 61.743, P < 0.0001). A high concordance rate of diabetes in couples may indicate the influences of assortative mating and shared environment. Diabetes in a spouse implicates its risk in the partner. Family-based diabetes care that emphasizes the screening of couples at risk of diabetes by using the identified risk factors is suggested in prospective clinical practice interventions.
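As a rough plausibility check on the reported odds ratio, a crude odds ratio can be computed directly from the two rounded concordance rates. This is only a back-of-the-envelope sketch with a hypothetical helper name; the study's OR of 61.743 comes from its own matched analysis, so the crude figure (roughly 61) is expected to be close but not identical.

```python
def crude_odds_ratio(rate_exposed, rate_reference):
    """Odds ratio from two proportions, ignoring matching and covariates."""
    odds_exposed = rate_exposed / (1.0 - rate_exposed)
    odds_reference = rate_reference / (1.0 - rate_reference)
    return odds_exposed / odds_reference


if __name__ == "__main__":
    # Rounded concordance rates quoted in the abstract:
    # 5.19% in couples versus 0.09% in noncouples.
    print(crude_odds_ratio(0.0519, 0.0009))
```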
Statistics with JMP graphs, descriptive statistics and probability

CERN Document Server

Goos, Peter

2015-01-01

Peter Goos, Department of Statistics, University of Leuven, Faculty of Bio-Science Engineering and University of Antwerp, Faculty of Applied Economics, Belgium. David Meintrup, Department of Mathematics and Statistics, University of Applied Sciences Ingolstadt, Faculty of Mechanical Engineering, Germany. Thorough presentation of introductory statistics and probability theory, with numerous examples and applications using JMP. Descriptive Statistics and Probability provides an accessible and thorough overview of the most important descriptive statistics for nominal, ordinal and quantitative data with partic
GWIS: Genome-Wide Inferred Statistics for Functions of Multiple Phenotypes

NARCIS (Netherlands)

Nieuwboer, H.A.; Pool, R.; Dolan, C.V.; Boomsma, D.I.; Nivard, M.G.

2016-01-01

Here we present a method of genome-wide inferred study (GWIS) that provides an approximation of genome-wide association study (GWAS) summary statistics for a variable that is a function of phenotypes for which GWAS summary statistics, phenotypic means, and covariances are available. A GWIS can be
Testing University Rankings Statistically: Why this Perhaps is not such a Good Idea after All. Some Reflections on Statistical Power, Effect Size, Random Sampling and Imaginary Populations

DEFF Research Database (Denmark)

Schneider, Jesper Wiborg

2012-01-01

In this paper we discuss and question the use of statistical significance tests in relation to university rankings as recently suggested. We outline the assumptions behind and interpretations of statistical significance tests and relate this to examples from the recent SCImago Institutions Rankin...
Performance modeling, loss networks, and statistical multiplexing

CERN Document Server

Mazumdar, Ravi

2009-01-01

This monograph presents a concise mathematical approach for modeling and analyzing the performance of communication networks with the aim of understanding the phenomenon of statistical multiplexing. The novelty of the monograph is the fresh approach and insights provided by a sample-path methodology for queueing models that highlights the important ideas of Palm distributions associated with traffic models and their role in performance measures. Also presented are recent ideas of large buffer, and many sources asymptotics that play an important role in understanding statistical multiplexing. I
The Statistics of a Function

Science.gov (United States)

Gordon, Sheldon P.; Gordon, Florence S.

2010-01-01

One of the most important applications of the definite integral in a modern calculus course is the mean value of a function. Thus, if a function "f" is defined on an interval ["a", "b"], then the mean, or average value, of "f" is given by [image omitted]. In this note, we will investigate the meaning of other statistics associated with a function…
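The formula replaced by "[image omitted]" is not recoverable from this record. For orientation only, the standard calculus definition of the mean value of a function, which is presumably what the omitted image showed, is

```latex
\[
  \bar{f} \;=\; \frac{1}{b-a}\int_{a}^{b} f(x)\,dx .
\]
% Worked example (illustrative choice of f, not from the article):
% for f(x) = x^2 on [0, 3],
\[
  \bar{f} \;=\; \frac{1}{3-0}\int_{0}^{3} x^{2}\,dx \;=\; \frac{1}{3}\cdot 9 \;=\; 3 .
\]
```

The symbol \bar{f} and the worked example are assumptions introduced here for illustration.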
Proper joint analysis of summary association statistics requires the adjustment of heterogeneity in SNP coverage pattern.

Science.gov (United States)

Zhang, Han; Wheeler, William; Song, Lei; Yu, Kai

2017-07-07

As meta-analysis results published by consortia of genome-wide association studies (GWASs) become increasingly available, many association summary statistics-based multi-locus tests have been developed to jointly evaluate multiple single-nucleotide polymorphisms (SNPs) to reveal novel genetic architectures of various complex traits. The validity of these approaches relies on the accurate estimate of z-score correlations at considered SNPs, which in turn requires knowledge on the set of SNPs assessed by each study participating in the meta-analysis. However, this exact SNP coverage information is usually unavailable from the meta-analysis results published by GWAS consortia. In the absence of the coverage information, researchers typically estimate the z-score correlations by making oversimplified coverage assumptions. We show through real studies that such a practice can generate highly inflated type I errors, and we demonstrate the proper way to incorporate correct coverage information into multi-locus analyses. We advocate that consortia should make SNP coverage information available when posting their meta-analysis results, and that investigators who develop analytic tools for joint analyses based on summary data should pay attention to the variation in SNP coverage and adjust for it appropriately. Published by Oxford University Press 2017. This work is written by US Government employees and is in the public domain in the US.
Radiation belt seed population and its association with the relativistic electron dynamics: A statistical study

International Nuclear Information System (INIS)

Tang, C. L.; Wang, Y. X.; Ni, B.; Zhang, J.-C.

2017-01-01

Using the Van Allen Probes data, we study the radiation belt seed population and its association with the relativistic electron dynamics during 74 geomagnetic storm events. Based on the flux changes of 1 MeV electrons before and after the storm peak, these storm events are divided into two groups of “non-preconditioned” and “preconditioned”. The statistical study shows that the storm intensity is of significant importance for the distribution of the seed population (336 keV electrons) in the outer radiation belt. However, substorm intensity can also be important to the evolution of the seed population for some geomagnetic storm events. For non-preconditioned storm events, the correlation between the peak fluxes and their L-shell locations of the seed population and relativistic electrons (592 keV, 1.0 MeV, 1.8 MeV, and 2.1 MeV) is consistent with the energy-dependent dynamic processes in the outer radiation belt. For preconditioned storm events, the correlation between the features of the seed population and relativistic electrons is not fully consistent with the energy-dependent processes. It is suggested that the good correlation between the radiation belt seed population and ≤1.0 MeV electrons contributes to the prediction of the evolution of ≤1.0 MeV electrons in the Earth’s outer radiation belt during periods of geomagnetic storms.
The presence and prognostic significance of human papillomavirus in squamous cell carcinoma of the larynx.

Science.gov (United States)

Erkul, Evren; Yilmaz, Ismail; Narli, Gizem; Babayigit, Mustafa Alparslan; Gungor, Atila; Demirel, Dilaver

2017-07-01

The aim of the present study was to evaluate the role of HPV in laryngeal squamous cell carcinoma and correlate it with patients' clinicopathological data. In total, 78 laryngeal squamous cell carcinoma patients were enrolled in this study. The presence of genotype-specific HPV DNA was evaluated using Genotyping Assay in formalin-fixed paraffin-embedded tissue from cases diagnosed between 2005 and 2015. All samples were also evaluated for p16 immunohistochemical staining. HPV DNA and p16 status were assessed retrospectively in terms of location, smoking, alcohol consumption, lymph node status, tumor stage, overall survival, disease-free survival, perineural invasion, and vascular invasion. Five test samples were excluded from the study due to inadequate deoxyribonucleic acid purity. HPV DNA was detected in 19 of 73 (26.02%) patients with laryngeal squamous cell carcinoma. Human papilloma virus genotyping revealed double human papilloma virus in one case (types 16 and 59) and HPV 16 in the remaining cases. Although HPV-positive cases showed slightly better 3-year survival than HPV-negative ones, this finding was not statistically significant (overall survival p = 0.417, HPV positive: 92.3%, HPV negative: 81.4%, and disease-free survival p = 0.526, HPV positive: 93.8%, HPV negative: 80.9%). The presence of HPV DNA was not significantly associated with any clinicopathological features (p > 0.05). Among 73 patients, only 4 had an immunohistochemical staining of p16 and these patients were also HPV DNA 16 positive. Although our study results revealed a slightly better survival in patients with HPV DNA positivity for HPV 16 compared to the negative ones, the difference was not statistically significant. However, an increasing rate in especially high-risk-type HPV-16 prevalence in laryngeal squamous cell carcinoma by RT-PCR method was observed compared to our previous study. Although the presence of HPV in laryngeal SCCs seems to be associated with slightly better
Uncertainty the soul of modeling, probability & statistics

CERN Document Server

Briggs, William

2016-01-01

This book presents a philosophical approach to probability and probabilistic thinking, considering the underpinnings of probabilistic reasoning and modeling, which effectively underlie everything in data science. The ultimate goal is to call into question many standard tenets and lay the philosophical and probabilistic groundwork and infrastructure for statistical modeling. It is the first book devoted to the philosophy of data aimed at working scientists and calls for a new consideration in the practice of probability and statistics to eliminate what has been referred to as the "Cult of Statistical Significance". The book explains the philosophy of these ideas and not the mathematics, though there are a handful of mathematical examples. The topics are logically laid out, starting with basic philosophy as related to probability, statistics, and science, and stepping through the key probabilistic ideas and concepts, and ending with statistical models. Its jargon-free approach asserts that standard methods, suc...
CNTNAP2 Is Significantly Associated With Speech Sound Disorder in the Chinese Han Population.

Science.gov (United States)

Zhao, Yun-Jing; Wang, Yue-Ping; Yang, Wen-Zhu; Sun, Hong-Wei; Ma, Hong-Wei; Zhao, Ya-Ru

2015-11-01

Speech sound disorder is the most common communication disorder. Some investigations support the possibility that the CNTNAP2 gene might be involved in the pathogenesis of speech-related diseases. To investigate single-nucleotide polymorphisms in the CNTNAP2 gene, 300 unrelated speech sound disorder patients and 200 normal controls were included in the study. Five single-nucleotide polymorphisms were amplified and directly sequenced. Significant differences were found in the genotype (P = .0003) and allele (P = .0056) frequencies of rs2538976 between patients and controls. The excess frequency of the A allele in the patient group remained significant after Bonferroni correction (P = .0280). A significant haplotype association with rs2710102T/+rs17236239A/+2538976A/+2710117A (P = 4.10e-006) was identified. A neighboring single-nucleotide polymorphism, rs10608123, was found in complete linkage disequilibrium with rs2538976, and the genotypes exactly corresponded to each other. The authors propose that these CNTNAP2 variants increase the susceptibility to speech sound disorder. The single-nucleotide polymorphisms rs10608123 and rs2538976 may merge into one single-nucleotide polymorphism. © The Author(s) 2015.
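As a rough check on the arithmetic in this abstract, a Bonferroni correction multiplies each raw P value by the number of tests. The abstract does not state how many tests were corrected for; the sketch below assumes five, one per sequenced single-nucleotide polymorphism, which reproduces the reported corrected value from the allele P value of .0056. The function name is illustrative.

```python
def bonferroni(p_value: float, n_tests: int) -> float:
    """Return the Bonferroni-adjusted P value, capped at 1."""
    return min(1.0, p_value * n_tests)

# Assumption: five tests, one per sequenced SNP (not stated in the abstract).
adjusted = bonferroni(0.0056, 5)
print(round(adjusted, 4))  # 0.028
```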
An exercise in model validation: Comparing univariate statistics and Monte Carlo-based multivariate statistics

International Nuclear Information System (INIS)

Weathers, J.B.; Luck, R.; Weathers, J.W.

2009-01-01

The complexity of mathematical models used by practicing engineers is increasing due to the growing availability of sophisticated mathematical modeling tools and ever-improving computational power. For this reason, the need to define a well-structured process for validating these models against experimental results has become a pressing issue in the engineering community. This validation process is partially characterized by the uncertainties associated with the modeling effort as well as the experimental results. The net impact of the uncertainties on the validation effort is assessed through the 'noise level of the validation procedure', which can be defined as an estimate of the 95% confidence uncertainty bounds for the comparison error between actual experimental results and model-based predictions of the same quantities of interest. Although general descriptions associated with the construction of the noise level using multivariate statistics exists in the literature, a detailed procedure outlining how to account for the systematic and random uncertainties is not available. In this paper, the methodology used to derive the covariance matrix associated with the multivariate normal pdf based on random and systematic uncertainties is examined, and a procedure used to estimate this covariance matrix using Monte Carlo analysis is presented. The covariance matrices are then used to construct approximate 95% confidence constant probability contours associated with comparison error results for a practical example. In addition, the example is used to show the drawbacks of using a first-order sensitivity analysis when nonlinear local sensitivity coefficients exist. Finally, the example is used to show the connection between the noise level of the validation exercise calculated using multivariate and univariate statistics.
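The Monte Carlo step described in this abstract can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes two comparison errors that share a single systematic error source and have independent random errors, all normally distributed, and the standard deviations are invented for illustration. Under those assumptions the shared systematic term is what produces the off-diagonal covariance.

```python
import random

def monte_carlo_covariance(n_samples, sys_sd, rand_sd, seed=0):
    """Estimate the 2x2 covariance matrix of two comparison errors that
    share one systematic error and have independent random errors."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n_samples):
        shared = rng.gauss(0.0, sys_sd)            # systematic error, common to both
        e1 = shared + rng.gauss(0.0, rand_sd[0])   # random error of quantity 1
        e2 = shared + rng.gauss(0.0, rand_sd[1])   # random error of quantity 2
        samples.append((e1, e2))
    means = [sum(s[i] for s in samples) / n_samples for i in range(2)]
    cov = [[0.0, 0.0], [0.0, 0.0]]
    for i in range(2):
        for j in range(2):
            # Unbiased sample covariance between components i and j
            cov[i][j] = sum(
                (s[i] - means[i]) * (s[j] - means[j]) for s in samples
            ) / (n_samples - 1)
    return cov

# Hypothetical uncertainties: systematic sd 1.0, random sds 0.5 and 2.0
cov = monte_carlo_covariance(50_000, sys_sd=1.0, rand_sd=(0.5, 2.0))
print(cov)
```

With these inputs the estimate should approach [[1.25, 1.0], [1.0, 5.0]]. For a bivariate normal with covariance matrix C, an approximate 95% constant-probability contour is the ellipse e^T C^-1 e = 5.99, the 0.95 quantile of a chi-square distribution with two degrees of freedom.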
An exercise in model validation: Comparing univariate statistics and Monte Carlo-based multivariate statistics

Energy Technology Data Exchange (ETDEWEB)

Weathers, J.B. [Shock, Noise, and Vibration Group, Northrop Grumman Shipbuilding, P.O. Box 149, Pascagoula, MS 39568 (United States)], E-mail: James.Weathers@ngc.com; Luck, R. [Department of Mechanical Engineering, Mississippi State University, 210 Carpenter Engineering Building, P.O. Box ME, Mississippi State, MS 39762-5925 (United States)], E-mail: Luck@me.msstate.edu; Weathers, J.W. [Structural Analysis Group, Northrop Grumman Shipbuilding, P.O. Box 149, Pascagoula, MS 39568 (United States)], E-mail: Jeffrey.Weathers@ngc.com

2009-11-15

The complexity of mathematical models used by practicing engineers is increasing due to the growing availability of sophisticated mathematical modeling tools and ever-improving computational power. For this reason, the need to define a well-structured process for validating these models against experimental results has become a pressing issue in the engineering community. This validation process is partially characterized by the uncertainties associated with the modeling effort as well as the experimental results. The net impact of the uncertainties on the validation effort is assessed through the 'noise level of the validation procedure', which can be defined as an estimate of the 95% confidence uncertainty bounds for the comparison error between actual experimental results and model-based predictions of the same quantities of interest. Although general descriptions associated with the construction of the noise level using multivariate statistics exists in the literature, a detailed procedure outlining how to account for the systematic and random uncertainties is not available. In this paper, the methodology used to derive the covariance matrix associated with the multivariate normal pdf based on random and systematic uncertainties is examined, and a procedure used to estimate this covariance matrix using Monte Carlo analysis is presented. The covariance matrices are then used to construct approximate 95% confidence constant probability contours associated with comparison error results for a practical example. In addition, the example is used to show the drawbacks of using a first-order sensitivity analysis when nonlinear local sensitivity coefficients exist. Finally, the example is used to show the connection between the noise level of the validation exercise calculated using multivariate and univariate statistics.
Common stressful life events and difficulties are associated with mental health symptoms and substance use in young adolescents

Directory of Open Access Journals (Sweden)

Low Nancy CP

2012-08-01

Full Text Available Abstract. Background: Stressful life events are associated with mood disorders in adults in clinical settings. Less described in the literature is the association between common life stressors and a wide range of psychopathology in young adolescents. This study uses a large non-clinical sample of young adolescents to describe the associations among worry or stress about common life events/difficulties, mental health and substance use. Methods: Data on lifetime stress or worry about common life events/difficulties (i.e., romantic breakups, family disruption, interpersonal difficulties, and personal stress (health, weight, school work)), symptoms of depression, conduct disorder symptoms, and substance use were collected from 1025 grade 7 students (mean age 12.9 years; 45% male). The association between each source of stress and each mental health and substance use indicator was modeled in separate logistic regression analyses. Results: The proportion of adolescents reporting worry or stress ranged from 7% for new family to 53% for schoolwork. Romantic breakup stress was statistically significantly associated with all the mental health and substance use indicators except illicit drug use. Family disruption was statistically significantly associated with depression symptoms, marijuana use, and cigarette use. Interpersonal difficulties stress was statistically significantly associated with depression symptoms. All sources of personal stress were statistically significantly related to depression symptoms. In addition, health-related stress was inversely related to binge drinking. Conclusion: Young adolescents may benefit from learning positive coping skills to manage worry or stress about common stressors and in particular, worry or stress related to romantic breakups. Appropriate management of mental health symptoms and substance use related to common stressful life events and difficulties may help reduce emerging psychopathology.

Infants generalize representations of statistically segmented words

Directory of Open Access Journals (Sweden)

Katharine Graf Estes

2012-10-01

Full Text Available The acoustic variation in language presents learners with a substantial challenge. To learn by tracking statistical regularities in speech, infants must recognize words across tokens that differ based on characteristics such as the speaker’s voice, affect, or the sentence context. Previous statistical learning studies have not investigated how these types of surface form variation affect learning. The present experiments used tasks tailored to two distinct developmental levels to investigate the robustness of statistical learning to variation. Experiment 1 examined statistical word segmentation in 11-month-olds and found that infants can recognize statistically segmented words across a change in the speaker’s voice from segmentation to testing. The direction of infants’ preferences suggests that recognizing words across a voice change is more difficult than recognizing them in a consistent voice. Experiment 2 tested whether 17-month-olds can generalize the output of statistical learning across variation to support word learning. The infants were successful in their generalization; they associated referents with statistically defined words despite a change in voice from segmentation to label learning. Infants’ learning patterns also indicate that they formed representations of across-word syllable sequences during segmentation. Thus, low probability sequences can act as object labels in some conditions. The findings of these experiments suggest that the units that emerge during statistical learning are not perceptually constrained, but rather are robust to naturalistic acoustic variation.
Genome-wide significant associations in schizophrenia to ITIH3/4, CACNA1C and SDCCAG8, and extensive replication of associations reported by the Schizophrenia PGC

DEFF Research Database (Denmark)

Hamshere, M L; Walters, J T R; Smith, R

2013-01-01

The Schizophrenia Psychiatric Genome-Wide Association Study Consortium (PGC) highlighted 81 single-nucleotide polymorphisms (SNPs) with moderate evidence for association to schizophrenia. After follow-up in independent samples, seven loci attained genome-wide significance (GWS), but multi-locus t...... interval (CI) 78-100%) of the original set of 78 SNPs represent true associations. We also provide strong evidence for overlap in genetic risk between schizophrenia and bipolar disorder. Molecular Psychiatry advance online publication, 22 May 2012; doi:10.1038/mp.2012.67....
Statistical and extra-statistical considerations in differential item functioning analyses

Directory of Open Access Journals (Sweden)

G. K. Huysamen

2004-10-01

Full Text Available This article briefly describes the main procedures for performing differential item functioning (DIF) analyses and points out some of the statistical and extra-statistical implications of these methods. Research findings on the sources of DIF, including those associated with translated tests, are reviewed. As DIF analyses are oblivious of correlations between a test and relevant criteria, the elimination of differentially functioning items does not necessarily improve predictive validity or reduce any predictive bias. The implications of the results of past DIF research for test development in the multilingual and multi-cultural South African society are considered.
Functional summary statistics for the Johnson-Mehl model

DEFF Research Database (Denmark)

Møller, Jesper; Ghorbani, Mohammad

The Johnson-Mehl germination-growth model is a spatio-temporal point process model which among other things has been used for the description of neurotransmitters datasets. However, for such datasets parametric Johnson-Mehl models fitted by maximum likelihood have not yet been evaluated by means...... of functional summary statistics. This paper therefore invents four functional summary statistics adapted to the Johnson-Mehl model, with two of them based on the second-order properties and the other two on the nuclei-boundary distances for the associated Johnson-Mehl tessellation. The functional summary...... statistics theoretical properties are investigated, non-parametric estimators are suggested, and their usefulness for model checking is examined in a simulation study. The functional summary statistics are also used for checking fitted parametric Johnson-Mehl models for a neurotransmitters dataset....
Correlation of RNA secondary structure statistics with thermodynamic stability and applications to folding.

Science.gov (United States)

Wu, Johnny C; Gardner, David P; Ozer, Stuart; Gutell, Robin R; Ren, Pengyu

2009-08-28

The accurate prediction of the secondary and tertiary structure of an RNA with different folding algorithms is dependent on several factors, including the energy functions. However, an RNA higher-order structure cannot be predicted accurately from its sequence based on a limited set of energy parameters. The inter- and intramolecular forces between this RNA and other small molecules and macromolecules, in addition to other factors in the cell such as pH, ionic strength, and temperature, influence the complex dynamics associated with transition of a single stranded RNA to its secondary and tertiary structure. Since all of the factors that affect the formation of an RNA's 3D structure cannot be determined experimentally, statistically derived potential energy has been used in the prediction of protein structure. In the current work, we evaluate the statistical free energy of various secondary structure motifs, including base-pair stacks, hairpin loops, and internal loops, using their statistical frequency obtained from the comparative analysis of more than 50,000 RNA sequences stored in the RNA Comparative Analysis Database (rCAD) at the Comparative RNA Web (CRW) Site. Statistical energy was computed from the structural statistics for several datasets. While the statistical energy for a base-pair stack correlates with experimentally derived free energy values, suggesting a Boltzmann-like distribution, variation is observed between different molecules and their location on the phylogenetic tree of life. Our statistical energy values calculated for several structural elements were utilized in the Mfold RNA-folding algorithm. The combined statistical energy values for base-pair stacks, hairpins and internal loop flanks result in a significant improvement in the accuracy of secondary structure prediction; the hairpin flanks contribute the most.
A statistical perspective on association studies of psychiatric disorders

DEFF Research Database (Denmark)

Foldager, Leslie

2014-01-01

Gene-gene (GxG) and gene-environment (GxE) interactions likely play an important role in the aetiology of complex diseases like psychiatric disorders. Thus, we aim at investigating methodological aspects of and apply methods from statistical genetics taking interactions into account. In addition we...... genes and maternal infection by virus. Paper 3 presents the initial steps (mainly data construction) of an ongoing simulation study aiming at guiding decisions by comparing methods for GxE interaction analysis including both traditional two-step logistic regression, exhaustive searches using efficient...... these markers. However, the validity of the identified haplotypes is also checked by inferring phased haplotypes from genotypes. Haplotype analysis is also used in paper 5 which is otherwise an example of a focused approach to narrow down a previously found signal to search for more precise positions of disease...
Statistical process control in nursing research.

Science.gov (United States)

Polit, Denise F; Chaboyer, Wendy

2012-02-01

In intervention studies in which randomization to groups is not possible, researchers typically use quasi-experimental designs. Time series designs are strong quasi-experimental designs but are seldom used, perhaps because of technical and analytic hurdles. Statistical process control (SPC) is an alternative analytic approach to testing hypotheses about intervention effects using data collected over time. SPC, like traditional statistical methods, is a tool for understanding variation and involves the construction of control charts that distinguish between normal, random fluctuations (common cause variation), and statistically significant special cause variation that can result from an innovation. The purpose of this article is to provide an overview of SPC and to illustrate its use in a study of a nursing practice improvement intervention. Copyright © 2011 Wiley Periodicals, Inc.
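To make the control-chart idea in this abstract concrete, here is a minimal sketch of one common chart type, the individuals (XmR) chart. The abstract does not say which chart the authors used, so the chart choice, the function names, and the data are all assumptions for illustration. The limits are the baseline mean plus or minus 2.66 times the average moving range, and points outside them are flagged as possible special cause variation.

```python
def individuals_chart_limits(values):
    """Return (lower, center, upper) control limits for an individuals chart."""
    center = sum(values) / len(values)
    moving_ranges = [abs(b - a) for a, b in zip(values, values[1:])]
    mr_bar = sum(moving_ranges) / len(moving_ranges)
    # 2.66 = 3 / d2, with d2 = 1.128 for moving ranges of two observations
    half_width = 2.66 * mr_bar
    return center - half_width, center, center + half_width

def special_cause_points(values, baseline):
    """Indices of values falling outside limits computed from the baseline."""
    lower, _, upper = individuals_chart_limits(baseline)
    return [i for i, v in enumerate(values) if v < lower or v > upper]

# Hypothetical monthly counts before and after an intervention
baseline = [12, 14, 11, 13, 15, 12, 14, 13]
follow_up = [13, 12, 5, 14]
print(special_cause_points(follow_up, baseline))  # [2]
```

Only the single rule "point beyond the limits" is applied here; run-based rules are often added in practice.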
Statistical black-hole thermodynamics

International Nuclear Information System (INIS)

Bekenstein, J.D.

1975-01-01

Traditional methods from statistical thermodynamics, with appropriate modifications, are used to study several problems in black-hole thermodynamics. Jaynes's maximum-uncertainty method for computing probabilities is used to show that the earlier-formulated generalized second law is respected in statistically averaged form in the process of spontaneous radiation by a Kerr black hole discovered by Hawking, and also in the case of a Schwarzschild hole immersed in a bath of black-body radiation, however cold. The generalized second law is used to motivate a maximum-entropy principle for determining the equilibrium probability distribution for a system containing a black hole. As an application we derive the distribution for the radiation in equilibrium with a Kerr hole (it is found to agree with what would be expected from Hawking's results) and the form of the associated distribution among Kerr black-hole solution states of definite mass. The same results are shown to follow from a statistical interpretation of the concept of black-hole entropy as the natural logarithm of the number of possible interior configurations that are compatible with the given exterior black-hole state. We also formulate a Jaynes-type maximum-uncertainty principle for black holes, and apply it to obtain the probability distribution among Kerr solution states for an isolated radiating Kerr hole
Statistics Anxiety and Business Statistics: The International Student

Science.gov (United States)

Bell, James A.

2008-01-01

Does the international student suffer from statistics anxiety? To investigate this, the Statistics Anxiety Rating Scale (STARS) was administered to sixty-six beginning statistics students, including twelve international students and fifty-four domestic students. Due to the small number of international students, nonparametric methods were used to…
Spectral statistics of chaotic many-body systems

International Nuclear Information System (INIS)

Dubertrand, Rémy; Müller, Sebastian

2016-01-01

We derive a trace formula that expresses the level density of chaotic many-body systems as a smooth term plus a sum over contributions associated to solutions of the nonlinear Schrödinger (or Gross–Pitaevski) equation. Our formula applies to bosonic systems with discretised positions, such as the Bose–Hubbard model, in the semiclassical limit as well as in the limit where the number of particles is taken to infinity. We use the trace formula to investigate the spectral statistics of these systems, by studying interference between solutions of the nonlinear Schrödinger equation. We show that in the limits taken the statistics of fully chaotic many-particle systems becomes universal and agrees with predictions from the Wigner–Dyson ensembles of random matrix theory. The conditions for Wigner–Dyson statistics involve a gap in the spectrum of the Frobenius–Perron operator, leaving the possibility of different statistics for systems with weaker chaotic properties. (paper)
Statistical Signal Process in R Language in the Pharmacovigilance Programme of India.

Science.gov (United States)

Kumar, Aman; Ahuja, Jitin; Shrivastava, Tarani Prakash; Kumar, Vipin; Kalaiselvan, Vivekanandan

2018-05-01

The Ministry of Health & Family Welfare, Government of India, initiated the Pharmacovigilance Programme of India (PvPI) in July 2010. The purpose of the PvPI is to collect data on adverse reactions due to medications, analyze it, and use the reference to recommend informed regulatory intervention, besides communicating the risk to health care professionals and the public. The goal of the present study was to apply statistical tools to find the relationship between drugs and ADRs for signal detection by R programming. Four statistical parameters were proposed for quantitative signal detection. These 4 parameters are IC025, PRR and PRRlb, chi-square, and N11; we calculated these 4 values using R programming. We analyzed 78,983 drug-ADR combinations, and the total count of drug-ADR combinations was 4,20,060. During the calculation of the statistical parameters, we use 3 variables: (1) N11 (number of counts), (2) N1. (drug margin), and (3) N.1 (ADR margin). The structure and calculation of these 4 statistical parameters in R language are easily understandable. On the basis of the IC value (IC value >0), out of the 78,983 drug-ADR combinations, we found 8,667 combinations to be significantly associated. The calculation of statistical parameters in R language is time saving and makes it easy to identify new signals in the Indian ICSR (Individual Case Safety Reports) database.
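The parameters named in this abstract can be sketched from a 2x2 contingency table. The code below is written in Python rather than the authors' R and is not their implementation. It uses the standard textbook formulas for the proportional reporting ratio (PRR), its lower 95% bound, and the Pearson chi-square without continuity correction, together with a crude point estimate of the information component (IC) as log2 of observed over expected counts. The IC025 credibility bound reported in the study needs a Bayesian model and is not computed. The total report count `n_total` and all example numbers are assumptions.

```python
import math

def disproportionality(n11, n1_dot, n_dot1, n_total):
    """Signal-detection measures for one drug-ADR pair.

    n11     reports with both the drug and the ADR
    n1_dot  reports with the drug (drug margin)
    n_dot1  reports with the ADR (ADR margin)
    n_total all reports in the database (assumed to be available)
    """
    a = n11                                   # drug and ADR
    b = n1_dot - n11                          # drug, other ADRs
    c = n_dot1 - n11                          # other drugs, this ADR
    d = n_total - n1_dot - n_dot1 + n11       # other drugs, other ADRs
    prr = (a / (a + b)) / (c / (c + d))
    se_log_prr = math.sqrt(1 / a - 1 / (a + b) + 1 / c - 1 / (c + d))
    prr_lb = math.exp(math.log(prr) - 1.96 * se_log_prr)
    expected = n1_dot * n_dot1 / n_total      # count expected under independence
    ic = math.log2(n11 / expected)            # crude IC, no shrinkage
    chi2 = n_total * (a * d - b * c) ** 2 / (
        (a + b) * (c + d) * (a + c) * (b + d)
    )
    return {"PRR": prr, "PRR_lb": prr_lb, "IC": ic, "chi2": chi2}

# Hypothetical counts, not taken from the study
result = disproportionality(n11=20, n1_dot=100, n_dot1=200, n_total=10_000)
print(result)
```

A positive IC means the pair is reported more often than independence would predict, which matches the IC > 0 criterion mentioned in the abstract.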
Methods for meta-analysis of multiple traits using GWAS summary statistics.

Science.gov (United States)

Ray, Debashree; Boehnke, Michael

2018-03-01

Genome-wide association studies (GWAS) for complex diseases have focused primarily on single-trait analyses for disease status and disease-related quantitative traits. For example, GWAS on risk factors for coronary artery disease analyze genetic associations of plasma lipids such as total cholesterol, LDL-cholesterol, HDL-cholesterol, and triglycerides (TGs) separately. However, traits are often correlated and a joint analysis may yield increased statistical power for association over multiple univariate analyses. Recently several multivariate methods have been proposed that require individual-level data. Here, we develop metaUSAT (where USAT is unified score-based association test), a novel unified association test of a single genetic variant with multiple traits that uses only summary statistics from existing GWAS. Although the existing methods either perform well when most correlated traits are affected by the genetic variant in the same direction or are powerful when only a few of the correlated traits are associated, metaUSAT is designed to be robust to the association structure of correlated traits. metaUSAT does not require individual-level data and can test genetic associations of categorical and/or continuous traits. One can also use metaUSAT to analyze a single trait over multiple studies, appropriately accounting for overlapping samples, if any. metaUSAT provides an approximate asymptotic P-value for association and is computationally efficient for implementation at a genome-wide level. Simulation experiments show that metaUSAT maintains proper type-I error at low error levels. It has similar and sometimes greater power to detect association across a wide array of scenarios compared to existing methods, which are usually powerful for some specific association scenarios only. When applied to plasma lipids summary data from the METSIM and the T2D-GENES studies, metaUSAT detected genome-wide significant loci beyond the ones identified by univariate analyses
The earth is flat (p > 0.05): significance thresholds and the crisis of unreplicable research

Directory of Open Access Journals (Sweden)

Valentin Amrhein

2017-07-01

Full Text Available The widespread use of ‘statistical significance’ as a license for making a claim of a scientific finding leads to considerable distortion of the scientific process (according to the American Statistical Association). We review why degrading p-values into ‘significant’ and ‘nonsignificant’ contributes to making studies irreproducible, or to making them seem irreproducible. A major problem is that we tend to take small p-values at face value, but mistrust results with larger p-values. In either case, p-values tell little about reliability of research, because they are hardly replicable even if an alternative hypothesis is true. Also significance (p ≤ 0.05) is hardly replicable: at a good statistical power of 80%, two studies will be ‘conflicting’, meaning that one is significant and the other is not, in one third of the cases if there is a true effect. A replication can therefore not be interpreted as having failed only because it is nonsignificant. Many apparent replication failures may thus reflect faulty judgment based on significance thresholds rather than a crisis of unreplicable research. Reliable conclusions on replicability and practical importance of a finding can only be drawn using cumulative evidence from multiple independent studies. However, applying significance thresholds makes cumulative knowledge unreliable. One reason is that with anything but ideal statistical power, significant effect sizes will be biased upwards. Interpreting inflated significant results while ignoring nonsignificant results will thus lead to wrong conclusions. But current incentives to hunt for significance lead to selective reporting and to publication bias against nonsignificant findings. Data dredging, p-hacking, and publication bias should be addressed by removing fixed significance thresholds. Consistent with the recommendations of the late Ronald Fisher, p-values should be interpreted as graded measures of the strength of evidence
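The "one third" figure in this abstract follows from a short calculation. Assuming two independent studies of a true effect, each with the same power, exactly one of them is significant with probability 2 × power × (1 − power). The function name below is illustrative.

```python
def conflict_probability(power: float) -> float:
    """Probability that exactly one of two independent studies is
    significant, given a true effect and equal power in both."""
    return 2 * power * (1 - power)

print(round(conflict_probability(0.8), 2))  # 0.32
```

At 80% power this gives 0.32, or roughly one third. The conflict rate peaks at 0.5 when power is 50%.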
Changing world extreme temperature statistics

Science.gov (United States)

Finkel, J. M.; Katz, J. I.

2018-04-01

We use the Global Historical Climatology Network--daily database to calculate a nonparametric statistic that describes the rate at which all-time daily high and low temperature records have been set in nine geographic regions (continents or major portions of continents) during periods mostly from the mid-20th Century to the present. This statistic was defined in our earlier work on temperature records in the 48 contiguous United States. In contrast to this earlier work, we find that in every region except North America all-time high records were set at a rate significantly (at least 3σ) higher than in the null hypothesis of a stationary climate. Except in Antarctica, all-time low records were set at a rate significantly lower than in the null hypothesis. In Europe, North Africa and North Asia the rate of setting new all-time highs increased suddenly in the 1990s, suggesting a change in regional climate regime; in most other regions there was a steadier increase.
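For context on the null hypothesis mentioned in this abstract: if yearly values for a given calendar day are independent draws from one continuous distribution (a stationary climate), the chance that year n sets an all-time record is 1/n, so the expected number of records in n years is the harmonic number H_n. The sketch below computes only that textbook baseline for a single hypothetical station. It is not the authors' statistic, which is defined in their earlier work and not reproduced here.

```python
def record_probability(year_index: int) -> float:
    """Chance that observation number `year_index` (1-based) sets a record
    under the stationary, independent null."""
    return 1.0 / year_index

def expected_records(n_years: int) -> float:
    """Expected count of all-time records in n_years under the same null."""
    return sum(record_probability(k) for k in range(1, n_years + 1))

# Hypothetical 70-year station series
print(expected_records(70))
```

Under this baseline new records become steadily rarer, so a record rate that stays well above 1/n late in a series is evidence against stationarity.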
Applying Statistical Mechanics to pixel detectors

International Nuclear Information System (INIS)

Pindo, Massimiliano

2002-01-01

Pixel detectors, being made of a large number of active cells of the same kind, can be considered as significant sets to which Statistical Mechanics variables and methods can be applied. By properly redefining well known statistical parameters in order to let them match the ones that actually characterize pixel detectors, an analysis of the way they work can be performed in a totally new perspective. A deeper understanding of pixel detectors is attained, helping in the evaluation and comparison of their intrinsic characteristics and performance
Introductory statistics for the behavioral sciences

CERN Document Server

Welkowitz, Joan; Cohen, Jacob

1971-01-01

Introductory Statistics for the Behavioral Sciences provides an introduction to statistical concepts and principles. This book emphasizes the robustness of parametric procedures wherein such significant tests as t and F yield accurate results even if such assumptions as equal population variances and normal population distributions are not well met. Organized into three parts encompassing 16 chapters, this book begins with an overview of the rationale upon which much of behavioral science research is based, namely, drawing inferences about a population based on data obtained from a samp
Association of ED with chronic periodontal disease.

Science.gov (United States)

Matsumoto, S; Matsuda, M; Takekawa, M; Okada, M; Hashizume, K; Wada, N; Hori, J; Tamaki, G; Kita, M; Iwata, T; Kakizaki, H

2014-01-01

To examine the relationship between chronic periodontal disease (CPD) and ED, the interview sheet including the CPD self-checklist (CPD score) and the five-item version of the International Index of Erectile Function (IIEF-5) was distributed to 300 adult men who received a comprehensive dental examination. Statistical analyses were performed by the Spearman's rank correlation coefficient and other methods. Statistical significance was accepted at the level of Pdysfunction and the systematic inflammatory changes associated with CPD. The present study also suggests that dental health is important as a preventive medicine for ED.
Neuropsychological significance of areas of high signal intensity on brain MRIs of children with neurofibromatosis.

Science.gov (United States)

Moore, B D; Slopis, J M; Schomer, D; Jackson, E F; Levy, B M

1996-06-01

Of children with neurofibromatosis (NF), 40% have a cognitive or learning impairment. Approximately 60% also have anomalous areas of high signal intensity on T2-weighted brain MRIs. The association of these hyperintensities and neuropsychological status is not fully understood. We administered a battery of neuropsychological tests and a standard clinical MRI to determine the impact of hyperintensity presence, number, and location on cognitive status in 84 children (8 to 16 years) with NF type 1. These children underwent standard clinical MRI using a GE 1.5-tesla scanner (except one child who was examined with a 1.0-tesla scanner). We conducted three types of analyses. Hyperintensity presence or absence: Scores of children with (55%) and without hyperintensities (45%) were compared using t tests. No statistically significant differences between groups in intellectual functioning or any neuropsychological variable were found. Number of hyperintensities: The number of hyperintensity locations per child ranged from one to five (mean = 2.22). Pearson correlations revealed no significant association between the number of hyperintensities and neuropsychological performance. Location of hyperintensities: In four of the five locations studied, no statistically significant differences were found between scores of children with a hyperintensity in an area and those with one elsewhere. However, mean scores for IQ, Memory, Motor, Distractibility, and Attention domains for children with hyperintensities in the thalamus were significantly lower than scores for those with hyperintensities elsewhere. These results suggest that the simple presence or absence of hyperintensities, or their total number, is not as important as their anatomic location for detecting their relationship with neuropsychological status. Taking location into account, hyperintensities in the cerebral hemispheres, basal ganglia, brainstem, or cerebellum seem to have no impact on neuropsychological functioning
A group contribution method for associating chain molecules based on the statistical associating fluid theory (SAFT-gamma).

Science.gov (United States)

Lymperiadis, Alexandros; Adjiman, Claire S; Galindo, Amparo; Jackson, George

2007-12-21

A predictive group-contribution statistical associating fluid theory (SAFT-gamma) is developed by extending the molecular-based SAFT-VR equation of state [A. Gil-Villegas et al. J. Chem. Phys. 106, 4168 (1997)] to treat heteronuclear molecules which are formed from fused segments of different types. Our models are thus a heteronuclear generalization of the standard models used within SAFT, comparable to the optimized potentials for the liquid state OPLS models commonly used in molecular simulation; an advantage of our SAFT-gamma over simulation is that an algebraic description for the thermodynamic properties of the model molecules can be developed. In our SAFT-gamma approach, each functional group in the molecule is modeled as a united-atom spherical (square-well) segment. The different groups are thus characterized by size (diameter), energy (well depth) and range parameters representing the dispersive interaction, and by shape factor parameters (which denote the extent to which each group contributes to the overall molecular properties). For associating groups a number of bonding sites are included on the segment: in this case the site types, the number of sites of each type, and the appropriate association energy and range parameters also have to be specified. A number of chemical families (n-alkanes, branched alkanes, n-alkylbenzenes, mono- and diunsaturated hydrocarbons, and n-alkan-1-ols) are treated in order to assess the quality of the SAFT-gamma description of the vapor-liquid equilibria and to estimate the parameters of various functional groups. The group parameters for the functional groups present in these compounds (CH(3), CH(2), CH(3)CH, ACH, ACCH(2), CH(2)=, CH=, and OH) together with the unlike energy parameters between groups of different types are obtained from an optimal description of the pure component phase equilibria. The approach is found to describe accurately the vapor-liquid equilibria with an overall %AAD of 3.60% for the vapor
Clinical significance of MCM-2 and MCM-5 expression in colon cancer: association with clinicopathological parameters and tumor proliferative capacity.

Science.gov (United States)

Giaginis, Constantinos; Georgiadou, Maria; Dimakopoulou, Konstantina; Tsourouflis, Gerasimos; Gatzidou, Elisavet; Kouraklis, Gregorios; Theocharis, Stamatios

2009-02-01

Minichromosome maintenance (MCM) proteins are essential components of DNA replication, being related to cell proliferation, and serve as useful markers for cancer screening, surveillance, and prognosis. Our aim was to examine the clinical significance of MCM-2 and MCM-5 protein expression in colon cancer and to evaluate the association with various clinicopathological characteristics and tumor proliferative capacity. Immunohistochemical expression of MCM-2 and MCM-5 was performed on paraffin-embedded malignant tissue sections obtained from 96 patients with colon cancer. MCM-2 and MCM-5 expression was correlated with different clinicopathological characteristics, proliferative capacity (Ki-67 labeling index), and p53 cell-cycle regulator expression. MCM-2 and Ki-67 expression was significantly associated with the tumors' histological grade (P = 0.003), existence of nodular metastases (N) (P = 0.003 and P = 0.030, respectively), malignancy on adenoma (P = 0.029 and P = 0.024, respectively), and vascular invasion (P = 0.010 and P = 0.011, respectively). MCM-2 expression was additionally associated with Dukes' stage (P = 0.005). Significant positive relationships were found between the expression of MCM-2 or MCM-5 proteins and that of Ki-67 protein (r = 0.963, P-value characteristics examined. The current data suggest that MCM-2 protein expression is significantly associated with important clinicopathological characteristics for patients' management, being correlated with the cell proliferation state in colon cancer.

An R2 statistic for fixed effects in the linear mixed model.

Science.gov (United States)

Edwards, Lloyd J; Muller, Keith E; Wolfinger, Russell D; Qaqish, Bahjat F; Schabenberger, Oliver

2008-12-20

Statisticians most often use the linear mixed model to analyze Gaussian longitudinal data. The value and familiarity of the R(2) statistic in the linear univariate model naturally creates great interest in extending it to the linear mixed model. We define and describe how to compute a model R(2) statistic for the linear mixed model by using only a single model. The proposed R(2) statistic measures multivariate association between the repeated outcomes and the fixed effects in the linear mixed model. The R(2) statistic arises as a 1-1 function of an appropriate F statistic for testing all fixed effects (except typically the intercept) in a full model. The statistic compares the full model with a null model with all fixed effects deleted (except typically the intercept) while retaining exactly the same covariance structure. Furthermore, the R(2) statistic leads immediately to a natural definition of a partial R(2) statistic. A mixed model in which ethnicity gives a very small p-value as a longitudinal predictor of blood pressure (BP) compellingly illustrates the value of the statistic. In sharp contrast to the extreme p-value, a very small R(2) , a measure of statistical and scientific importance, indicates that ethnicity has an almost negligible association with the repeated BP outcomes for the study.
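The one-to-one link between an F statistic and an R(2)-type measure described in this abstract can be sketched as follows. The expression R(2) = (ν1·F/ν2) / (1 + ν1·F/ν2) is the familiar identity from the linear univariate model and is used here as an assumption; the abstract does not print the formula or say how the denominator degrees of freedom are chosen in the mixed model. The function names are hypothetical.

```python
def r2_from_f(f_stat: float, num_df: float, den_df: float) -> float:
    """Map an F statistic to an R^2-type measure.

    Assumed form: R^2 = (num_df * F / den_df) / (1 + num_df * F / den_df),
    the usual univariate identity; not quoted from the abstract.
    """
    if f_stat < 0 or num_df <= 0 or den_df <= 0:
        raise ValueError("F must be non-negative and both df must be positive")
    ratio = num_df * f_stat / den_df
    return ratio / (1.0 + ratio)


def f_from_r2(r2: float, num_df: float, den_df: float) -> float:
    """Inverse map, showing that the relationship is one-to-one on [0, 1)."""
    if not 0.0 <= r2 < 1.0:
        raise ValueError("R^2 must lie in [0, 1)")
    return (r2 / (1.0 - r2)) * den_df / num_df
```

With many denominator degrees of freedom, a sizeable F (and hence a very small p-value) can still correspond to an R(2) close to zero, which is the contrast the abstract draws for the blood pressure example.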
Association mapping to discover significant marker-trait associations for resistance against fusarium wilt variant 2 in pigeonpea [Cajanus cajan (L.) Millspaugh] using SSR markers.

Science.gov (United States)

Patil, Prakash G; Dubey, Jyotirmay; Bohra, Abhishek; Mishra, R K; Saabale, P R; Das, Alok; Rathore, Meenal; Singh, N P

2017-08-01

Pigeonpea production is severely constrained by wilt disease caused by Fusarium udum. In the current study, we discover the putative genomic regions that control resistance response to variant 2 of fusarium wilt using an association mapping approach. The association panel comprised 89 diverse pigeonpea genotypes including seven varieties, three landraces and 79 germplasm lines. The panel was screened rigorously for three consecutive years (2013-14, 2014-15 and 2015-16) against variant 2 in a wilt-sick field. A total of 65 pigeonpea-specific hypervariable SSR markers (HASSRs) were screened representing seven linkage groups and 29 scaffolds of the pigeonpea genome. A total of 181 alleles were detected, with average values of gene diversity and polymorphism information content (PIC) of 0.55 and 0.47, respectively. Further analysis using model-based (STRUCTURE) and distance-based (clustering) approaches separated the entire pigeonpea collection into two distinct subgroups (K = 2). The marker-trait associations (MTAs) were established based on three-year wilt incidence data and the SSR dataset using a unified mixed linear model. Consequently, six SSR markers were identified, which were significantly associated with wilt resistance and explained up to 6% phenotypic variance (PV) across the years. Among these SSRs, HASSR18 was found to be the most stable and significant, accounting for 5-6% PV across the years. To the best of our knowledge, this is the first report of identification of favourable alleles for resistance to variant 2 of Fusarium udum in pigeonpea using association mapping. The SSR markers identified here will greatly facilitate marker-assisted resistance breeding against fusarium wilt in pigeonpea.
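Gene diversity and PIC, reported above as marker averages, are commonly computed per marker from allele frequencies. The sketch below assumes the widely used definitions (gene diversity as 1 − Σp², and PIC as gene diversity minus Σ 2p_i²p_j² over allele pairs); the abstract does not say which formulas or software the authors used, and the function names are hypothetical.

```python
from typing import Sequence


def _check_frequencies(freqs: Sequence[float]) -> None:
    # Allele frequencies at one marker must be non-negative and sum to 1.
    if any(p < 0 for p in freqs) or abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must be non-negative and sum to 1")


def gene_diversity(freqs: Sequence[float]) -> float:
    """Expected heterozygosity: 1 - sum(p_i^2) (assumed definition)."""
    _check_frequencies(freqs)
    return 1.0 - sum(p * p for p in freqs)


def pic(freqs: Sequence[float]) -> float:
    """PIC: gene diversity minus sum over i<j of 2 * p_i^2 * p_j^2 (assumed definition)."""
    cross = 0.0
    for i in range(len(freqs)):
        for j in range(i + 1, len(freqs)):
            cross += 2.0 * freqs[i] ** 2 * freqs[j] ** 2
    return gene_diversity(freqs) - cross
```

Under these definitions PIC never exceeds gene diversity for the same marker, which is consistent with the ordering of the two averages reported in the abstract.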
Association between dental erosion and possible risk factors: A hospital-based study in gastroesophageal reflux disease patients

Directory of Open Access Journals (Sweden)

Vamsi Krishna Reddy

2016-01-01

Introduction: Gastroesophageal reflux disease (GERD) is a condition with a prevalence of up to 10–20% in the general population. GERD may involve damage to the oral cavity, and dental erosion may occur with a higher frequency. Aim: To estimate the prevalence of dental erosion in GERD patients and to evaluate the association between dental erosion and possible risk factors. Materials and Methods: The study was conducted in the Sanjay Gandhi Post Graduate Institute of Medical Sciences, Lucknow among patients attending the outpatient department between June and August 2014. The study group comprised 91 subjects with GERD and 114 subjects without GERD. Information regarding symptoms of GERD, medicines, any chronic disease, and dietary habits was recorded. Dental examination was done to assess the presence or absence of dental erosions, and their severity was measured using the O'Sullivan Index (2000). Statistical analysis was done using the Mann–Whitney U-test and Kruskal–Wallis test. Results: Of 91 GERD patients, 87 (95.6%) had dental erosion. In both groups, the association between frequent intake of fruit juice, carbonated drinks, milk, yoghurt, fruits, and tea/coffee and the occurrence of dental erosion was statistically significant (P < 0.05). In GERD patients, the association between intake of milk and occurrence of dental erosion was statistically significant (P < 0.05). The association of medication with dental erosion was found to be statistically significant (P < 0.05). Chronic diseases like diabetes and asthma were also found to be significantly associated with dental erosion (P < 0.05). Conclusion: This study showed that GERD patients were at increased risk of developing dental erosion compared to controls.
Spreadsheets as tools for statistical computing and statistics education

OpenAIRE

Neuwirth, Erich

2000-01-01

Spreadsheets are a ubiquitous program category, and we will discuss their use in statistics and statistics education on various levels, ranging from very basic examples to extremely powerful methods. Since the spreadsheet paradigm is very familiar to many potential users, using it as the interface to statistical methods can make statistics more easily accessible.
Long-term use of amiodarone before heart transplantation significantly reduces early post-transplant atrial fibrillation and is not associated with increased mortality after heart transplantation

Directory of Open Access Journals (Sweden)

Rivinius R

2016-02-01

group (P=0.0123). There was no statistically significant difference between patients with and without long-term use of amiodarone prior to HTX in 1-year (P=0.8596), 2-year (P=0.8620), 5-year (P=0.2737), or overall follow-up mortality after HTX (P=0.1049). Moreover, Kaplan–Meier survival analysis showed no statistically significant difference in overall survival (P=0.1786). Conclusion: Long-term use of amiodarone in patients before HTX significantly reduces early post-transplant AF and is not associated with increased mortality after HTX. Keywords: amiodarone, atrial fibrillation, heart failure, heart transplantation, mortality
Dimensions of adult attachment are significantly associated with specific affective temperament constellations in a Hungarian university sample.

Science.gov (United States)

Lang, Andras; Papp, Barbara; Gonda, Xenia; Dome, Peter; Rihmer, Zoltan

2016-02-01

Related to emotion regulation and mental health, adult attachment and affective temperaments are relevant research topics of contemporary psychiatry and clinical psychology. However, to date, only one study investigated the relationship between these two constructs. Thus, we aimed to further reveal adult attachment's association with affective temperaments. Affective temperament and adult attachment dimensions of 1469 Hungarian university students were assessed with self-report measures (Temperament Evaluation of Memphis, Pisa and San Diego autoquestionnaire and Experiences in Close Relationships Scale, respectively). Age and measured variables were compared between genders with ANOVAs. Associations between attachment dimensions and affective temperaments were examined with Pearson's correlations and partial correlations; the moderation effect of age and gender on these relationships was tested with PROCESS macro. Using Fisher r-to-z transformation, we also compared our results with the findings of the previous study. Cohen's ds were used to report effect size and Cronbach's alphas were computed as indices of internal reliability. Significant correlations were found between attachment dimensions and affective temperaments. Correlations were especially robust between attachment anxiety and depressive, cyclothymic and anxious temperaments. Contrasted with the results of the previous study, hyperthymic temperament was negatively related to attachment avoidance and anxious temperament was significantly more strongly correlated with attachment anxiety in our study. We used a previous version of the adult attachment measure. Our sample differed from the target sample in several ways. Participants were not screened for mental disorders. Findings highlight that adult attachment dimensions are significantly associated with affective temperaments. Copyright © 2015 Elsevier B.V. All rights reserved.
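The Fisher r-to-z comparison of correlations from two independent samples, mentioned in this abstract, can be sketched as below. This is the textbook form of the test (z = atanh(r), standard error sqrt(1/(n1−3) + 1/(n2−3))); the abstract does not report the authors' exact computation, and the function name and any input values are hypothetical.

```python
import math
from typing import Tuple


def compare_correlations(r1: float, n1: int, r2: float, n2: int) -> Tuple[float, float]:
    """Compare two independent Pearson correlations via Fisher's r-to-z.

    Returns (z, two_sided_p). Assumes independent samples and |r| < 1.
    """
    if not (-1.0 < r1 < 1.0 and -1.0 < r2 < 1.0):
        raise ValueError("correlations must lie strictly between -1 and 1")
    if n1 <= 3 or n2 <= 3:
        raise ValueError("each sample needs more than 3 observations")
    z1, z2 = math.atanh(r1), math.atanh(r2)          # Fisher transformation
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))  # SE of z1 - z2
    z = (z1 - z2) / se
    p = math.erfc(abs(z) / math.sqrt(2.0))           # two-sided normal p-value
    return z, p
```

A call such as `compare_correlations(0.45, 1469, 0.30, 200)` would compare a correlation from a sample the size of the one in this study with a made-up correlation from a smaller, hypothetical earlier sample.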
Serious fighting-related injuries produce a significant reduction in intelligence.

Science.gov (United States)

Schwartz, Joseph A; Beaver, Kevin M

2013-10-01

Fighting-related injuries are common among adolescents within the United States, but how such injuries relate to subsequent cognitive functioning remains unclear. In particular, the long-term effect of fighting-related injuries suffered during important developmental periods, such as adolescence, on subsequent cognitive functioning has been overlooked by previous studies. The purpose of this study is to examine the association between sustaining serious fighting-related injuries and changes in verbal intelligence (IQ) over a 5- to 6-year time period. Longitudinal multivariate statistical models were used to analyze data from the National Longitudinal Study of Adolescent Health collected between 1994 and 2002 and analyzed in 2013. Even a single fighting-related injury resulted in a significant reduction in IQ over time even after controlling for age, race, sex, and changes in socioeconomic status (SES) over the study period. Additionally, females experienced a significantly greater reduction in IQ from each fighting-related injury than males. Fighting-related injuries have a significant impact on subsequent cognitive functioning and intelligence. The implications for future policies and research are discussed in more detail. Copyright © 2013 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.
Is everything we eat associated with cancer? A systematic cookbook review.

Science.gov (United States)

Schoenfeld, Jonathan D; Ioannidis, John P A

2013-01-01

Nutritional epidemiology is a highly prolific field. Debates on associations of nutrients with disease risk are common in the literature and attract attention in public media. We aimed to examine the conclusions, statistical significance, and reproducibility in the literature on associations between specific foods and cancer risk. We selected 50 common ingredients from random recipes in a cookbook. PubMed queries identified recent studies that evaluated the relation of each ingredient to cancer risk. Information regarding author conclusions and relevant effect estimates were extracted. When >10 articles were found, we focused on the 10 most recent articles. Forty ingredients (80%) had articles reporting on their cancer risk. Of 264 single-study assessments, 191 (72%) concluded that the tested food was associated with an increased (n = 103) or a decreased (n = 88) risk; 75% of the risk estimates had weak (0.05 > P ≥ 0.001) or no statistical (P > 0.05) significance. Statistically significant results were more likely than nonsignificant findings to be published in the study abstract than in only the full text (P < 0.0001). Meta-analyses (n = 36) presented more conservative results; only 13 (26%) reported an increased (n = 4) or a decreased (n = 9) risk (6 had more than weak statistical support). The median RRs (IQRs) for studies that concluded an increased or a decreased risk were 2.20 (1.60, 3.44) and 0.52 (0.39, 0.66), respectively. The RRs from the meta-analyses were on average null (median: 0.96; IQR: 0.85, 1.10). Associations with cancer risk or benefits have been claimed for most food ingredients. Many single studies highlight implausibly large effects, even though evidence is weak. Effect sizes shrink in meta-analyses.
Statistical Validation of Engineering and Scientific Models: Background

International Nuclear Information System (INIS)

Hills, Richard G.; Trucano, Timothy G.

1999-01-01

A tutorial is presented discussing the basic issues associated with propagation of uncertainty analysis and statistical validation of engineering and scientific models. The propagation of uncertainty tutorial illustrates the use of the sensitivity method and the Monte Carlo method to evaluate the uncertainty in predictions for linear and nonlinear models. Four example applications are presented: a linear model, a model for the behavior of a damped spring-mass system, a transient thermal conduction model, and a nonlinear transient convective-diffusive model based on Burgers' equation. Correlated and uncorrelated model input parameters are considered. The model validation tutorial builds on the material presented in the propagation of uncertainty tutorial and uses the damped spring-mass system as the example application. The validation tutorial illustrates several concepts associated with the application of statistical inference to test model predictions against experimental observations. Several validation methods are presented including error band based, multivariate, sum of squares of residuals, and optimization methods. After completion of the tutorial, a survey of statistical model validation literature is presented and recommendations for future work are made.
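The two propagation approaches named in this abstract, the sensitivity method and the Monte Carlo method, can be contrasted with a minimal sketch. It assumes uncorrelated, normally distributed inputs and a first-order (linearized) sensitivity estimate with finite-difference derivatives; the report's own formulation and example models are not reproduced, and `linear_model` and all function names are hypothetical.

```python
import math
import random
import statistics
from typing import Callable, Sequence

Model = Callable[[Sequence[float]], float]


def sensitivity_std(model: Model, means: Sequence[float], stds: Sequence[float],
                    rel_step: float = 1e-6) -> float:
    """First-order estimate: var(y) ~ sum((df/dx_i * sigma_i)^2), uncorrelated inputs."""
    variance = 0.0
    for i, (m, s) in enumerate(zip(means, stds)):
        h = rel_step * max(abs(m), 1.0)
        up, down = list(means), list(means)
        up[i], down[i] = m + h, m - h
        dfdx = (model(up) - model(down)) / (2.0 * h)  # central difference
        variance += (dfdx * s) ** 2
    return math.sqrt(variance)


def monte_carlo_std(model: Model, means: Sequence[float], stds: Sequence[float],
                    n_samples: int = 20000, seed: int = 0) -> float:
    """Sample independent normal inputs, run the model, return the output std."""
    rng = random.Random(seed)
    outputs = [model([rng.gauss(m, s) for m, s in zip(means, stds)])
               for _ in range(n_samples)]
    return statistics.stdev(outputs)


def linear_model(x: Sequence[float]) -> float:
    """Hypothetical linear model used only for illustration."""
    return 2.0 * x[0] + 3.0 * x[1]
```

For a linear model the two estimates should agree up to sampling noise; for a strongly nonlinear model the first-order estimate can drift away from the Monte Carlo result, which is one reason to run both.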
Ensuring Positiveness of the Scaled Difference Chi-square Test Statistic.

Science.gov (United States)

Satorra, Albert; Bentler, Peter M

2010-06-01

A scaled difference test statistic [Formula: see text] that can be computed from standard software of structural equation models (SEM) by hand calculations was proposed in Satorra and Bentler (2001). The statistic [Formula: see text] is asymptotically equivalent to the scaled difference test statistic T̄(d) introduced in Satorra (2000), which requires more involved computations beyond standard output of SEM software. The test statistic [Formula: see text] has been widely used in practice, but in some applications it is negative due to negativity of its associated scaling correction. Using the implicit function theorem, this note develops an improved scaling correction leading to a new scaled difference statistic T̄(d) that avoids negative chi-square values.
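The hand calculation referred to in this abstract, and the way a negative value can arise, can be illustrated with the commonly cited form of the Satorra and Bentler (2001) scaled difference test. The formula below is stated from general knowledge rather than from the abstract, whose own expressions were lost in extraction; the improved correction developed in the note is not reproduced here, and the function and argument names are hypothetical.

```python
from typing import Tuple


def scaled_difference(t0: float, df0: int, c0: float,
                      t1: float, df1: int, c1: float) -> Tuple[float, float]:
    """Scaled chi-square difference for nested SEM models (assumed 2001 form).

    t0, df0, c0: unscaled chi-square, degrees of freedom and scaling
                 correction of the more restricted (nested) model.
    t1, df1, c1: the same quantities for the less restricted model.
    Returns (scaled_difference_statistic, difference_scaling_correction).
    """
    d = df0 - df1
    if d <= 0:
        raise ValueError("the nested model must have more degrees of freedom")
    cd = (df0 * c0 - df1 * c1) / d  # scaling correction for the difference test
    if cd == 0:
        raise ZeroDivisionError("difference scaling correction is zero")
    return (t0 - t1) / cd, cd
```

Nothing in this form forces `cd` to be positive: with, for example, `df0=10, c0=1.2, df1=8, c1=1.6` the correction is negative and so is the resulting statistic, which is the problem the note sets out to remove.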
Data-driven inference for the spatial scan statistic

Directory of Open Access Journals (Sweden)

Duczmal Luiz H

2011-08-01

Abstract Background Kulldorff's spatial scan statistic for aggregated area maps searches for clusters of cases without specifying their size (number of areas) or geographic location in advance. Their statistical significance is tested while adjusting for the multiple testing inherent in such a procedure. However, as is shown in this work, this adjustment is not done in an even manner for all possible cluster sizes. Results A modification is proposed to the usual inference test of the spatial scan statistic, incorporating additional information about the size of the most likely cluster found. A new interpretation of the results of the spatial scan statistic is given, posing a modified inference question: what is the probability that the null hypothesis is rejected for the original observed cases map with a most likely cluster of size k, taking into account only those most likely clusters of size k found under the null hypothesis for comparison? This question is especially important when the p-value computed by the usual inference process is near the alpha significance level, regarding the correctness of the decision based on this inference. Conclusions A practical procedure is provided to make more accurate inferences about the most likely cluster found by the spatial scan statistic.
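The modified inference question can be made concrete with a small sketch that computes both the usual Monte Carlo p-value and one restricted to null replications whose most likely cluster has the same size k as the observed one. This is only an illustration of the idea as worded in the abstract, using the standard rank-based Monte Carlo p-value; the paper's actual procedure may differ, and the function name and input layout are assumptions.

```python
from typing import Optional, Sequence, Tuple


def scan_p_values(observed_stat: float, observed_size: int,
                  null_runs: Sequence[Tuple[float, int]]
                  ) -> Tuple[float, Optional[float]]:
    """Return (usual_p, size_conditional_p).

    null_runs holds one (statistic, cluster_size) pair per map simulated
    under the null hypothesis, each describing that map's most likely cluster.
    size_conditional_p is None when no null run has the observed size.
    """
    # Usual p-value: rank of the observed statistic among all null maxima.
    exceed_all = sum(1 for stat, _ in null_runs if stat >= observed_stat)
    usual_p = (1 + exceed_all) / (1 + len(null_runs))

    # Conditional p-value: compare only with null clusters of the same size k.
    same_size = [stat for stat, size in null_runs if size == observed_size]
    if not same_size:
        return usual_p, None
    exceed_same = sum(1 for stat in same_size if stat >= observed_stat)
    return usual_p, (1 + exceed_same) / (1 + len(same_size))
```

When the two values fall on opposite sides of the chosen alpha level, the decision depends on which question is being asked, which is the situation the abstract highlights.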
Register-based statistics statistical methods for administrative data

CERN Document Server

Wallgren, Anders

2014-01-01

This book provides a comprehensive and up to date treatment of theory and practical implementation in Register-based statistics. It begins by defining the area, before explaining how to structure such systems, as well as detailing alternative approaches. It explains how to create statistical registers, how to implement quality assurance, and the use of IT systems for register-based statistics. Further to this, clear details are given about the practicalities of implementing such statistical methods, such as protection of privacy and the coordination and coherence of such an undertaking. Thi
Cancer Statistics

Science.gov (United States)

... Cancer has a major impact on society in ... success of efforts to control and manage cancer. Statistics at a Glance: The Burden of Cancer in ...
Accelerator driven reactors, - the significance of the energy distribution of spallation neutrons on the neutron statistics

Energy Technology Data Exchange (ETDEWEB)

Fhager, V

2000-01-01

In order to make correct predictions of the second moment of statistical nuclear variables, such as the number of fissions and the number of thermalized neutrons, the dependence of the energy distribution of the source particles on their number should be considered. It has been pointed out recently that neglecting this number dependence in accelerator driven systems might result in bad estimates of the second moment, and this paper contains qualitative and quantitative estimates of the size of these effects. We work towards the requested results in two steps. First, models of the number dependent energy distributions of the neutrons that are ejected in the spallation reactions are constructed, both by simple assumptions and by extracting energy distributions of spallation neutrons from a high-energy particle transport code. Then, the second moment of nuclear variables in a sub-critical reactor, into which spallation neutrons are injected, is calculated. The results from second moment calculations using number dependent energy distributions for the source neutrons are compared to those where only the average energy distribution is used. Two physical models are employed to simulate the neutron transport in the reactor. One is analytical, treating only slowing down of neutrons by elastic scattering in the core material. For this model, equations are written down and solved for the second moment of thermalized neutrons that include the distribution of energy of the spallation neutrons. The other model utilizes Monte Carlo methods for tracking the source neutrons as they travel inside the reactor material. Fast and thermal fission reactions are considered, as well as neutron capture and elastic scattering, and the second moment of the number of fissions, the number of neutrons that leaked out of the system, etc. are calculated. Both models use a cylindrical core with a homogeneous mixture of core material. Our results indicate that the number dependence of the energy
Association between unemployment rates and prescription drug utilization in the United States, 2007–2010

Science.gov (United States)

2012-01-01

Background While extensive evidence suggests that the economic recession has had far reaching effects on many economic sectors, little is known regarding its impact on prescription drug utilization. The purpose of this study is to describe the association between state-level unemployment rates and retail sales of seven therapeutic classes (statins, antidepressants, antipsychotics, angiotensin-converting enzyme [ACE] inhibitors, opiates, phosphodiesterase [PDE] inhibitors and oral contraceptives) in the United States. Methods Using a retrospective mixed ecological design, we examined retail prescription sales using IMS Health Xponent™ from September 2007 through July 2010, and we used the Bureau of Labor Statistics to derive population-based rates and mixed-effects modeling with state-level controls to examine the association between unemployment and utilization. Our main outcome measure was state-level utilization per 100,000 people for each class. Results Monthly unemployment levels and rates of use of each class varied substantially across the states. There were no statistically significant associations between use of ACE inhibitors or SSRIs/SNRIs and average unemployment in analyses across states, while for opioids and PDE inhibitors there were small statistically significant direct associations, and for the remaining classes inverse associations. Analyses using each state as its own control collectively exhibited statistically significant positive associations between increases in unemployment and prescription drug utilization for five of seven areas examined. This relationship was greatest for statins (on average, a 4% increase in utilization per 1% increased unemployment) and PDE inhibitors (3% increase in utilization per 1% increased unemployment), and lower for oral contraceptives and atypical antipsychotics. Conclusion We found no evidence of an association between increasing unemployment and decreasing prescription utilization, suggesting that any
Factors Associated With Specific Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition Sexual Dysfunctions in Breast Cancer Survivors: A Study of Patients and Their Partners.

Science.gov (United States)

Hummel, Susanna B; Hahn, Daniela E E; van Lankveld, Jacques J D M; Oldenburg, Hester S A; Broomans, Eva; Aaronson, Neil K

2017-10-01

Many women develop sexual problems after breast cancer (BC) treatment. Little is known about BC survivors with a Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV) sexual dysfunction and their partners, and the factors associated with their sexual functioning. To evaluate (i) patient-related and clinical factors associated with (a) specific DSM-IV sexual dysfunctions and (b) level of sexual functioning and sexual distress as reported by BC survivors and (ii) the association between the sexual functioning of BC survivors and that of their partners. We analyzed baseline data from a study of the efficacy of online cognitive-behavioral therapy for sexual dysfunction in BC survivors. Women completed self-report questionnaires assessing sexual functioning, sexual distress, relationship intimacy, marital functioning, menopausal symptoms, body image, and psychological distress. Their partners completed questionnaires assessing sexual functioning. The study included 169 BC survivors and 67 partners. The most prevalent female sexual dysfunctions were hypoactive sexual desire disorder (HSDD; 83%), sexual arousal disorder (40%), and dyspareunia (33%). Endocrine therapy was associated with HSDD (P = .003), and immunotherapy was associated with dyspareunia (P = .009). Older age was associated with lower sexual distress (P disorder (P = .004). An indication for erectile disorder was present in two thirds of partners. Lower overall partner sexual satisfaction was associated with lower overall BC survivor sexual functioning (P = .001), lower female arousal (P = .002), and lower female sexual satisfaction (P = .001). Poorer male erectile function was related to higher female sexual pain (P = .006). Partners of women who underwent breast reconstruction reported marginally significantly better orgasmic functioning (P = .012) and overall sexual functioning (P = .015) than partners of women who had undergone breast-conserving treatment. BC survivors
Prevalence of Malaria, Dengue, and Chikungunya Significantly Associated with Mosquito Breeding Sites

Science.gov (United States)

Islam, Mohammad Nazrul; ZulKifle, Mohammad; Sherwani, Arish Mohammad Khan; Ghosh, Susanta Kumar; Tiwari, Satyanarayan

2011-01-01

Objectives: To observe the prevalence of malaria, dengue, and chikungunya and their association with mosquito breeding sites. Methods: The study was observational and analytical. A total of 162 houses and 670 subjects were observed during the study period. One hundred forty-two febrile patients were eligible for the study. After obtaining informed consent from all febrile patients, 140 blood samples were collected to diagnose malaria, dengue, and chikungunya. Larval samples were collected by the standard protocol that follows. Correlation of data was performed by Pearson correlation test. Results: Forty-seven blood samples were found positive: 33 for chikungunya, 3 for dengue, and 11 for malaria. Fifty-one out of 224 larval samples were found positive. Out of the 51 positive samples, 37 were positive for Aedes, 12 were positive for Anopheles, and two were positive for Culex larvae. Interpretation and Conclusion: Mosquito-borne fevers, especially malaria, dengue, and chikungunya, have shown a significant relationship with mosquito breeding sites. PMID:23610486
Software Used to Generate Cancer Statistics - SEER Cancer Statistics

Science.gov (United States)

Videos that highlight topics and trends in cancer statistics and definitions of statistical terms. Also software tools for analyzing and reporting cancer statistics, which are used to compile SEER's annual reports.
Investigation of Exomic Variants Associated with Overall Survival in Ovarian Cancer

DEFF Research Database (Denmark)

Winham, Stacey J; Pirie, Ailith; Chen, Yian Ann

2016-01-01

). Results: No individual variant reached genome-wide statistical significance. A SNP previously implicated to be associated with EOC risk and, to a lesser extent, survival, rs8170, showed the strongest evidence of association with survival and similar effect size estimates across sets (Pmeta=1.1E-6,HRSet1......=1.17,HRSet2= 1.14). Rare variants in ATG2B, an autophagy gene important for apoptosis, were significantly associated with survival after multiple testing correction (Pmeta = 1.1E-6; Pcorrected = 0.01). Conclusions: Common variant rs8170 and rare variants in ATG2B may be associated with EOC overall survival...

The antibiotic resistome of swine manure is significantly altered by association with the Musca domestica larvae gut microbiome.

Science.gov (United States)

Wang, Hang; Sangwan, Naseer; Li, Hong-Yi; Su, Jian-Qiang; Oyang, Wei-Yin; Zhang, Zhi-Jian; Gilbert, Jack A; Zhu, Yong-Guan; Ping, Fan; Zhang, Han-Luo

2017-01-01

The overuse of antibiotics as veterinary feed additives is potentially contributing to a significant reservoir of antibiotic resistance in agricultural farmlands via the application of antibiotic-contaminated manure. Vermicomposting of swine manure using housefly larvae is a promising biotechnology for waste reduction and control of antibiotic pollution. To determine how vermicomposting influences antibiotic resistance traits in swine manure, we explored the resistome and associated bacterial community dynamics during larvae gut transit over 6 days of treatment. In total, 94 out of 158 antibiotic resistance genes (ARGs) were significantly attenuated (by 85%), while 23 were significantly enriched (3.9-fold) following vermicomposting. The manure-borne bacterial community showed a decrease in the relative abundance of Bacteroidetes, and an increase in Proteobacteria, specifically Ignatzschineria, following gut transit. ARG attenuation was significantly correlated with changes in microbial community succession, especially reduction in Clostridiales and Bacteroidales. Six genomes were assembled from the manure, vermicompost (final product) and gut samples, including Pseudomonas, Providencia, Enterococcus, Bacteroides and Alcanivorax. Transposon-linked ARGs were more abundant in gut-associated bacteria compared with those from manure and vermicompost. Further, ARG-transposon gene cassettes had a high degree of synteny between metagenomic assemblies from gut and vermicompost samples, highlighting the significant contribution of gut microbiota through horizontal gene transfer to the resistome of vermicompost. In conclusion, the larvae gut microbiome significantly influences manure-borne community succession and the antibiotic resistome during animal manure processing.
Selective neurocognitive deficits and poor life functioning are associated with significant depressive symptoms in alcoholism-HIV infection comorbidity

OpenAIRE

Sassoon, Stephanie A.; Rosenbloom, Margaret J.; Fama, Rosemary; Sullivan, Edith V.; Pfefferbaum, Adolf

2012-01-01

Alcoholism, HIV, and depressive symptoms frequently co-occur and are associated with impairment in cognition and life function. We administered the Beck Depression Inventory-II (BDI-II), measures of life function, and neurocognitive tests to 67 alcoholics, 56 HIV+ patients, 63 HIV+ alcoholics, and 64 controls to examine whether current depressive symptom level (significant, BDI-II ≥ 14 vs. minimal, BDI-II < 14) was associated with poorer cognitive or psychosocial function in alcoholism-HIV co...
A Powerful Approach to Estimating Annotation-Stratified Genetic Covariance via GWAS Summary Statistics.

Science.gov (United States)

Lu, Qiongshi; Li, Boyang; Ou, Derek; Erlendsdottir, Margret; Powles, Ryan L; Jiang, Tony; Hu, Yiming; Chang, David; Jin, Chentian; Dai, Wei; He, Qidu; Liu, Zefeng; Mukherjee, Shubhabrata; Crane, Paul K; Zhao, Hongyu

2017-12-07

Despite the success of large-scale genome-wide association studies (GWASs) on complex traits, our understanding of their genetic architecture is far from complete. Jointly modeling multiple traits' genetic profiles has provided insights into the shared genetic basis of many complex traits. However, large-scale inference sets a high bar for both statistical power and biological interpretability. Here we introduce a principled framework to estimate annotation-stratified genetic covariance between traits using GWAS summary statistics. Through theoretical and numerical analyses, we demonstrate that our method provides accurate covariance estimates, thereby enabling researchers to dissect both the shared and distinct genetic architecture across traits to better understand their etiologies. Among 50 complex traits with publicly accessible GWAS summary statistics (N total ≈ 4.5 million), we identified more than 170 pairs with statistically significant genetic covariance. In particular, we found strong genetic covariance between late-onset Alzheimer disease (LOAD) and amyotrophic lateral sclerosis (ALS), two major neurodegenerative diseases, in single-nucleotide polymorphisms (SNPs) with high minor allele frequencies and in SNPs located in the predicted functional genome. Joint analysis of LOAD, ALS, and other traits highlights LOAD's correlation with cognitive traits and hints at an autoimmune component for ALS. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
High cumulants of conserved charges and their statistical uncertainties

Science.gov (United States)

Li-Zhu, Chen; Ye-Yin, Zhao; Xue, Pan; Zhi-Ming, Li; Yuan-Fang, Wu

2017-10-01

We study the influence of measured high cumulants of conserved charges on their associated statistical uncertainties in relativistic heavy-ion collisions. With a given number of events, the measured cumulants randomly fluctuate with an approximately normal distribution, while the estimated statistical uncertainties are found to be correlated with corresponding values of the obtained cumulants. Generally, with a given number of events, the larger the cumulants we measure, the larger the statistical uncertainties that are estimated. The error-weighted averaged cumulants are dependent on statistics. Despite this effect, however, it is found that the three sigma rule of thumb is still applicable when the statistics are above one million. Supported by NSFC (11405088, 11521064, 11647093), Major State Basic Research Development Program of China (2014CB845402) and Ministry of Science and Technology (MoST) (2016YFE0104800)
Understanding Statistics and Statistics Education: A Chinese Perspective

Science.gov (United States)

Shi, Ning-Zhong; He, Xuming; Tao, Jian

2009-01-01

In recent years, statistics education in China has made great strides. However, there still exists a fairly large gap with the advanced levels of statistics education in more developed countries. In this paper, we identify some existing problems in statistics education in Chinese schools and make some proposals as to how they may be overcome. We…
Diagonal earlobe crease: Prevalence and association with medical ailments

Directory of Open Access Journals (Sweden)

Yugantara Ramesh Kadam

2018-01-01

Context: It has been hypothesized that diagonal earlobe crease (DELC, “Frank's sign”) is indicative of coronary artery disease (CAD) and/or diabetes mellitus (DM). Several studies have confirmed an association between DELC and cardiac morbidity, mortality, and hypertension (HTN). However, some studies have not found any significant association. Aims: This study aims to find out the prevalence of DELC and its association with CAD, DM, and HTN. Settings and Design: Sangli-Miraj-Kupwad Corporation area. This was a cross-sectional analytical study. Subjects and Methods: Study participants: Adults from 18 to 60 years of age. Inclusion criteria: willingness to participate in the study. Exclusion criteria: wearing heavy earrings and excessive normal generalized wrinkling of the skin. Sample size: 6310, determined after a pilot study revealing DELC in 1.5%. Sampling technique: Two-stage cluster sampling. Duration of study: 6 months. Study tools: Predesigned, pilot-tested pro forma. Statistical Analysis: Statistical analysis was done using SPSS 22 software. Prevalence and percentages were calculated, and the Chi-square test was applied. Results: Out of 6638 participants, 179 had DELC. The prevalence of bilateral DELC was 2.7%. The prevalence was significantly high among males (4.13%) and in the 51–60 years age group (5.29%). The prevalence of Grade 3 DELC was high and 91% of young adults had Grade 3 DELC. There were 408 (6.15%) participants who gave a history of CAD, 827 (12.46%) of DM, and 670 (10.09%) of HTN. A significantly high association was observed between DELC and CAD, DM, and HTN. CAD, DM, and HTN were significantly associated with Grade 3. Conclusions: The prevalence of bilateral DELC was 2.7%, and it is significantly associated with CAD, DM, and HTN.
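The percentages reported in the Results can be checked directly from the stated counts. The sketch below is a plain arithmetic check in Python; the helper name `prevalence_percent` is an assumption for illustration, and only counts given in the abstract are used.

```python
def prevalence_percent(cases, total, digits=2):
    """Prevalence as a percentage of the total, rounded to `digits` decimals."""
    return round(100 * cases / total, digits)

total_participants = 6638  # participants reported in the Results

delc = prevalence_percent(179, total_participants)  # participants with DELC
cad = prevalence_percent(408, total_participants)   # history of CAD
dm = prevalence_percent(827, total_participants)    # history of DM
htn = prevalence_percent(670, total_participants)   # history of HTN
```

The computed values agree with the figures in the abstract (about 2.7%, 6.15%, 12.46%, and 10.09%). The Chi-square association itself cannot be recomputed here because the abstract does not give the cross-tabulated counts.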
Factors Associated With Specific Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition Sexual Dysfunctions in Breast Cancer Survivors : A Study of Patients and Their Partners

NARCIS (Netherlands)

Hummel, S.B.; Hahn, D.E.E.; van Lankveld, J.J.D.M.; Oldenburg, H.S.A.; Broomans, E.; Aaronson, N.K.

2017-01-01

BACKGROUND: Many women develop sexual problems after breast cancer (BC) treatment. Little is known about BC survivors with a Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV) sexual dysfunction and their partners, and the factors associated with their sexual functioning.
Fractional statistics and the butterfly effect

International Nuclear Information System (INIS)

Gu, Yingfei; Qi, Xiao-Liang

2016-01-01

Fractional statistics and quantum chaos are both phenomena associated with the non-local storage of quantum information. In this article, we point out a connection between the butterfly effect in (1+1)-dimensional rational conformal field theories and fractional statistics in (2+1)-dimensional topologically ordered states. This connection comes from the characterization of the butterfly effect by the out-of-time-order-correlator proposed recently. We show that the late-time behavior of such correlators is determined by universal properties of the rational conformal field theory such as the modular S-matrix and conformal spins. Using the bulk-boundary correspondence between rational conformal field theories and (2+1)-dimensional topologically ordered states, we show that the late time behavior of out-of-time-order-correlators is intrinsically connected with fractional statistics in the topological order. We also propose a quantitative measure of chaos in a rational conformal field theory, which turns out to be determined by the topological entanglement entropy of the corresponding topological order.
Fractional statistics and the butterfly effect

Energy Technology Data Exchange (ETDEWEB)

Gu, Yingfei; Qi, Xiao-Liang [Department of Physics, Stanford University,Stanford, CA 94305 (United States)

2016-08-23

Fractional statistics and quantum chaos are both phenomena associated with the non-local storage of quantum information. In this article, we point out a connection between the butterfly effect in (1+1)-dimensional rational conformal field theories and fractional statistics in (2+1)-dimensional topologically ordered states. This connection comes from the characterization of the butterfly effect by the out-of-time-order-correlator proposed recently. We show that the late-time behavior of such correlators is determined by universal properties of the rational conformal field theory such as the modular S-matrix and conformal spins. Using the bulk-boundary correspondence between rational conformal field theories and (2+1)-dimensional topologically ordered states, we show that the late time behavior of out-of-time-order-correlators is intrinsically connected with fractional statistics in the topological order. We also propose a quantitative measure of chaos in a rational conformal field theory, which turns out to be determined by the topological entanglement entropy of the corresponding topological order.
Mathematical statistics

CERN Document Server

Pestman, Wiebe R

2009-01-01

This textbook provides a broad and solid introduction to mathematical statistics, including the classical subjects hypothesis testing, normal regression analysis, and normal analysis of variance. In addition, non-parametric statistics and vectorial statistics are considered, as well as applications of stochastic analysis in modern statistics, e.g., Kolmogorov-Smirnov testing, smoothing techniques, robustness and density estimation. For students with some elementary mathematical background. With many exercises. Prerequisites from measure theory and linear algebra are presented.
Evaluating statistical and clinical significance of intervention effects in single-case experimental designs: An SPSS method to analyze univariate data

NARCIS (Netherlands)

Maric, M.; de Haan, M.; Hogendoorn, S.M.; Wolters, L.H.; Huizenga, H.M.

2015-01-01

Single-case experimental designs are useful methods in clinical research practice to investigate individual client progress. Their proliferation might have been hampered by methodological challenges such as the difficulty applying existing statistical procedures. In this article, we describe a
Evaluating statistical and clinical significance of intervention effects in single-case experimental designs: an SPSS method to analyze univariate data

NARCIS (Netherlands)

Maric, Marija; de Haan, Else; Hogendoorn, Sanne M.; Wolters, Lidewij H.; Huizenga, Hilde M.

2015-01-01

Single-case experimental designs are useful methods in clinical research practice to investigate individual client progress. Their proliferation might have been hampered by methodological challenges such as the difficulty applying existing statistical procedures. In this article, we describe a
A knowledge-based T2-statistic to perform pathway analysis for quantitative proteomic data.

Science.gov (United States)

Lai, En-Yu; Chen, Yi-Hau; Wu, Kun-Pin

2017-06-01

Approaches to identify significant pathways from high-throughput quantitative data have been developed in recent years. Still, the analysis of proteomic data remains difficult because of limited sample size. This limitation also leads to the common practice of using a competitive null, which fundamentally treats genes or proteins as independent units. The independence assumption ignores the associations among biomolecules with similar functions or cellular localization, as well as the interactions among them manifested as changes in expression ratios. Consequently, these methods often underestimate the associations among biomolecules and cause false positives in practice. Some studies incorporate the sample covariance matrix into the calculation to address this issue. However, the sample covariance may not be a precise estimate if the sample size is very limited, which is usually the case for data produced by mass spectrometry. In this study, we introduce a multivariate test under a self-contained null to perform pathway analysis for quantitative proteomic data. The covariance matrix used in the test statistic is constructed from the confidence scores retrieved from the STRING database or the HitPredict database. We also design an integrating procedure to retain pathways of sufficient evidence as a pathway group. The performance of the proposed T2-statistic is demonstrated using five published experimental datasets: the T-cell activation, the cAMP/PKA signaling, the myoblast differentiation, and the effect of dasatinib on the BCR-ABL pathway are proteomic datasets produced by mass spectrometry; and the protective effect of myocilin via the MAPK signaling pathway is a gene expression dataset of limited sample size. Compared with other popular statistics, the proposed T2-statistic yields more accurate descriptions in agreement with the discussion of the original publication. We implemented the T2-statistic into an R package T2GA, which is available at https
Statistical inference and visualization in scale-space for spatially dependent images

KAUST Repository

Vaughan, Amy

2012-03-01

SiZer (SIgnificant ZERo crossing of the derivatives) is a graphical scale-space visualization tool that allows for statistical inferences. In this paper we develop a spatial SiZer for finding significant features and conducting goodness-of-fit tests for spatially dependent images. The spatial SiZer utilizes a family of kernel estimates of the image and provides not only exploratory data analysis but also statistical inference with spatial correlation taken into account. It is also capable of comparing the observed image with a specific null model being tested by adjusting the statistical inference using an assumed covariance structure. Pixel locations having statistically significant differences between the image and a given null model are highlighted by arrows. The spatial SiZer is compared with the existing independent SiZer via the analysis of simulated data with and without signal on both planar and spherical domains. We apply the spatial SiZer method to the decadal temperature change over some regions of the Earth. © 2011 The Korean Statistical Society.
Long-term associations of morbidity with air pollution: A catalog and synthesis.

Science.gov (United States)

Lipfert, Frederick W

2018-01-01

I searched the National Institutes of Health MEDLINE database through January 2017 for long-term studies of morbidity and air pollution and cataloged them with respect to cardiovascular, respiratory, cancer, diabetes, hospitalization, neurological, and pregnancy-birth endpoints. The catalog is presented as an online appendix. Associations with PM 2.5 (particulate matter with an aerodynamic diameter pollutant significance (yes, no), duration of exposure, and publication date. I found statistically significant pollutant relationships (P pollutant effect estimates, 396 are statistically significant. Pollutant associations with cardiovascular indicators, lung function, respiratory symptoms, and low birth weight are more likely to be significant than with disease incidence, heart attacks, diabetes, or neurological endpoints. Elemental carbon (EC), traffic, and PM 2.5 are most likely to be significant for cardiovascular outcomes; TSP, EC, and ozone (O 3 ) for respiratory outcomes; NO 2 for neurological outcomes; and PM 10 for birth/pregnancy outcomes. Durations of exposure range from 60 days to 35 yr, but I found no consistent relationships with the likelihood of statistical significance. Respiratory studies began ca. 1975; studies of diabetes, cardiovascular, and neurological effects increased after about 2005. I found 72 studies of occupational air pollution exposures; 40 reported statistically significant adverse health effects, especially for respiratory conditions. I conclude that the aggregate of these studies supports the existence of nonlethal physiological effects of various pollutants, more so for non-life-threatening endpoints and for noncriteria pollutants (TSP, EC, PM 2.5 metals). However, most studies were cross-sectional analyses over limited time spans with no consideration of lag or disease latency. Further longitudinal studies are thus needed to investigate the progress of disease incidence in association with air pollution exposure. Relationships of
HOMA-IR is associated with significant angiographic coronary artery disease in non-diabetic, non-obese individuals: a cross-sectional study.

Science.gov (United States)

Mossmann, Márcio; Wainstein, Marco V; Gonçalves, Sandro C; Wainstein, Rodrigo V; Gravina, Gabriela L; Sangalli, Marlei; Veadrigo, Francine; Matte, Roselene; Reich, Rejane; Costa, Fernanda G; Bertoluci, Marcello C

2015-01-01

Insulin resistance is a major component of metabolic syndrome, type 2 Diabetes Mellitus (T2DM) and coronary artery disease (CAD). Although important in T2DM, its role as a predictor of CAD in non-diabetic patients is less studied. In the present study, we aimed to evaluate the association of HOMA-IR with significant CAD, determined by coronary angiography in non-obese, non-T2DM patients. We also evaluated the association between 3 oral glucose tolerance test (OGTT) based insulin sensitivity indexes (Matsuda, STUMVOLL-ISI and OGIS) and CAD. We conducted a cross-sectional study with 54 non-obese, non-diabetic individuals referred for coronary angiography due to suspected CAD. CAD was classified as the "anatomic burden score" corresponding to any stenosis equal to or larger than 50 % in diameter on the coronary distribution. Patients without lesions were included in the No-CAD group. Patients with at least 1 lesion were included in the CAD group. A 75 g OGTT with measurements of plasma glucose and serum insulin at 0, 30, 60, 90 and 120 min was obtained to calculate insulin sensitivity parameters. HOMA-IR results were ranked and patients were also categorized as insulin resistant (IR) or non-insulin resistant (NIR) if they were respectively above or below the 75th percentile (HOMA-IR > 4.21). The insulin sensitivity test results were also divided into IR and NIR, respectively below and above each 25th percentile. Chi square was used to study association. A Poisson regression model was used to compare prevalence ratios between categorized CAD and IR groups. Fifty-four patients were included in the study. There were 26 patients (48 %) with significant CAD. The presence of clinically significant CAD was significantly associated with HOMA-IR above p75 (Chi square 4.103, p = 0.0428) and 71 % of patients with HOMA-IR above p75 had significant CAD. Subjects with CAD had increased prevalence ratio of HOMA-IR above p75 compared to subjects without
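The chi-square test of association described above can be sketched for a 2x2 table as follows. The abstract does not report the cell counts, so the table used here is an assumption: it is one split that is consistent with the reported totals (54 patients, 26 with CAD, 71 % of those above the 75th percentile having CAD). The function name `chi_square_2x2` is likewise illustrative, and no continuity correction is applied.

```python
import math

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]].

    Returns (statistic, p_value) with 1 degree of freedom.
    """
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # For 1 degree of freedom the upper-tail probability is erfc(sqrt(x / 2))
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value

# ASSUMED cell counts, chosen only to match the totals in the abstract:
# rows = HOMA-IR above / below p75, columns = CAD / no CAD
chi2, p_value = chi_square_2x2(10, 4, 16, 24)
```

With these assumed counts the statistic and p-value come out close to the reported 4.103 and 0.0428, which suggests (but does not prove) that an uncorrected Pearson chi-square was used.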
On the analysis of line profile variations: A statistical approach

International Nuclear Information System (INIS)

McCandliss, S.R.

1988-01-01

This study is concerned with the empirical characterization of the line profile variations (LPV), which occur in many Of and Wolf-Rayet stars. The goal of the analysis is to gain insight into the physical mechanisms producing the variations. The analytic approach uses a statistical method to quantify the significance of the LPV and to identify those regions in the line profile which are undergoing statistically significant variations. Line positions and flux variations are then measured and subjected to temporal and correlative analysis. Previous studies of LPV have for the most part been restricted to observations of a single line. Important information concerning the range and amplitude of the physical mechanisms involved can be obtained by simultaneously observing spectral features formed over a range of depths in the extended mass-losing atmospheres of massive, luminous stars. Time series of a Wolf-Rayet star and two Of stars with nearly complete spectral coverage from 3940 angstrom to 6610 angstrom and with spectral resolution of R = 10,000 are analyzed here. These three stars exhibit a wide range of both spectral and temporal line profile variations. The HeII Pickering lines of HD 191765 show a monotonic increase in the peak rms variation amplitude with lines formed at progressively larger radii in the Wolf-Rayet star wind. Two time scales of variation have been identified in this star: a less than one day variation associated with small-scale flickering in the peaks of the line profiles and a greater than one day variation associated with large-scale asymmetric changes in the overall line profile shapes. However, no convincing periodic phenomena are evident at those periods which are well sampled in this time series.
Sampling, Probability Models and Statistical Reasoning Statistical

Indian Academy of Sciences (India)

Sampling, Probability Models and Statistical Reasoning: Statistical Inference. Mohan Delampady; V R Padmawar. General Article. Resonance – Journal of Science Education, Volume 1, Issue 5, May 1996, pp 49-58 ...
Statistical Symbolic Execution with Informed Sampling

Science.gov (United States)

Filieri, Antonio; Pasareanu, Corina S.; Visser, Willem; Geldenhuys, Jaco

2014-01-01

Symbolic execution techniques have been proposed recently for the probabilistic analysis of programs. These techniques seek to quantify the likelihood of reaching program events of interest, e.g., assert violations. They have many promising applications but have scalability issues due to high computational demand. To address this challenge, we propose a statistical symbolic execution technique that performs Monte Carlo sampling of the symbolic program paths and uses the obtained information for Bayesian estimation and hypothesis testing with respect to the probability of reaching the target events. To speed up the convergence of the statistical analysis, we propose Informed Sampling, an iterative symbolic execution that first explores the paths that have high statistical significance, prunes them from the state space and guides the execution towards less likely paths. The technique combines Bayesian estimation with a partial exact analysis for the pruned paths, leading to provably improved convergence of the statistical analysis. We have implemented statistical symbolic execution with informed sampling in the Symbolic PathFinder tool. We show experimentally that informed sampling obtains more precise results and converges faster than a purely statistical analysis and may also be more efficient than an exact symbolic analysis. When the latter does not terminate, symbolic execution with informed sampling can give meaningful results under the same time and memory limits.
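The combination of Monte Carlo sampling with Bayesian estimation described above can be illustrated with a toy sketch. This is not the Symbolic PathFinder implementation: it samples concrete inputs rather than symbolic paths, omits the informed-sampling pruning step, and uses a hypothetical target event (`violates_assertion`) and a uniform Beta(1, 1) prior chosen purely for illustration.

```python
import random

def violates_assertion(x):
    """Toy stand-in for a program event of interest (hypothetical)."""
    return x < 10

def estimate_reach_probability(n_samples, seed=0, prior_a=1.0, prior_b=1.0):
    """Beta-Bernoulli posterior for the probability of reaching the event.

    Inputs are drawn uniformly from 0..99; each sample is a Bernoulli trial.
    Returns the posterior mean and variance of the reach probability.
    """
    rng = random.Random(seed)
    hits = sum(violates_assertion(rng.randrange(100)) for _ in range(n_samples))
    post_a = prior_a + hits                  # successes update the first parameter
    post_b = prior_b + (n_samples - hits)    # failures update the second
    total = post_a + post_b
    mean = post_a / total
    variance = post_a * post_b / (total ** 2 * (total + 1))
    return mean, variance

posterior_mean, posterior_var = estimate_reach_probability(10_000)
```

In this toy setting the true probability is 0.1, so the posterior mean should settle near that value as the number of samples grows, with the posterior variance shrinking roughly in proportion to 1/n.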
RS-SNP: a random-set method for genome-wide association studies

Directory of Open Access Journals (Sweden)

Mukherjee Sayan

2011-03-01

Abstract Background The typical objective of genome-wide association (GWA) studies is to identify single-nucleotide polymorphisms (SNPs) and corresponding genes with the strongest evidence of association (the 'most-significant SNPs/genes' approach). Borrowing ideas from micro-array data analysis, we propose a new method, named RS-SNP, for detecting sets of genes enriched in SNPs moderately associated with the phenotype. RS-SNP assesses whether the number of significant SNPs, with p-value P ≤ α, belonging to a given SNP set is statistically significant. The rationale of the proposed method is that two kinds of null hypotheses are taken into account simultaneously. In the first null model the genotype and the phenotype are assumed to be independent random variables and the null distribution is the probability of the number of significant SNPs in the set being greater than observed by chance. The second null model assumes the number of significant SNPs in the set depends on the size of the set and not on the identity of the SNPs in it. Statistical significance is assessed using non-parametric permutation tests. Results We applied RS-SNP to the Crohn's disease (CD) data set collected by the Wellcome Trust Case Control Consortium (WTCCC) and compared the results with GENGEN, an approach recently proposed in the literature. The enrichment analysis using RS-SNP and the set of pathways contained in the MSigDB C2 CP pathway collection highlighted 86 pathways rich in SNPs weakly associated with CD. Of these, 47 were also indicated to be significant by GENGEN. Similar results were obtained using the MSigDB C5 pathway collection. Many of the pathways found to be enriched by RS-SNP have a well-known connection to CD and often with inflammatory diseases. Conclusions The proposed method is a valuable alternative to other techniques for enrichment analysis of SNP sets. It is well founded from a theoretical and statistical perspective. Moreover, the experimental comparison with GENGEN highlights that it is
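The second null model described above (the count of significant SNPs depends only on the size of the set) lends itself to a simple permutation sketch. The code below is not the RS-SNP software: the function name `set_enrichment_pvalue`, the toy p-values, and the choice of 2000 random sets are all assumptions made for illustration, and the first null model (permuting phenotype labels) is not covered.

```python
import random

def set_enrichment_pvalue(pvalues, snp_set, alpha=0.05, n_perm=2000, seed=0):
    """Permutation p-value for enrichment of significant SNPs in `snp_set`.

    Compares the observed count of SNPs with p <= alpha in the set against
    the counts in random SNP sets of the same size.
    """
    rng = random.Random(seed)
    significant = [p <= alpha for p in pvalues]
    observed = sum(significant[i] for i in snp_set)
    all_indices = range(len(pvalues))
    at_least_as_extreme = 0
    for _ in range(n_perm):
        random_set = rng.sample(all_indices, len(snp_set))
        if sum(significant[i] for i in random_set) >= observed:
            at_least_as_extreme += 1
    # Add-one correction keeps the estimate strictly positive
    return observed, (at_least_as_extreme + 1) / (n_perm + 1)

# Toy data (hypothetical): 200 SNPs, the first 10 are significant
toy_pvalues = [0.01] * 10 + [0.5] * 190
toy_set = list(range(20))  # a 20-SNP set containing all 10 significant SNPs
observed_count, enrichment_p = set_enrichment_pvalue(toy_pvalues, toy_set)
```

Because the toy set contains every significant SNP, random sets of the same size essentially never match it, so the permutation p-value sits at its minimum of 1/(n_perm + 1).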
