Significance levels for studies with correlated test statistics.
Shi, Jianxin; Levinson, Douglas F; Whittemore, Alice S
2008-07-01
When testing large numbers of null hypotheses, one needs to assess the evidence against the global null hypothesis that none of the hypotheses is false. Such evidence typically is based on the test statistic of the largest magnitude, whose statistical significance is evaluated by permuting the sample units to simulate its null distribution. Efron (2007) has noted that correlation among the test statistics can induce substantial interstudy variation in the shapes of their histograms, which may cause misleading tail counts. Here, we show that permutation-based estimates of the overall significance level also can be misleading when the test statistics are correlated. We propose that such estimates be conditioned on a simple measure of the spread of the observed histogram, and we provide a method for obtaining conditional significance levels. We justify this conditioning using the conditionality principle described by Cox and Hinkley (1974). Application of the method to gene expression data illustrates the circumstances when conditional significance levels are needed.
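The unconditional permutation procedure that this abstract takes as its starting point can be sketched as follows. This is a minimal illustration only, not the authors' conditional method: the two-group layout, the Welch-type statistic, and the function names `max_abs_t` and `permutation_p_value` are all assumptions made for the example.

```python
import random
import statistics


def max_abs_t(data, labels):
    """Largest absolute two-sample t-like statistic across all features.

    data: list of samples, each a list of feature values.
    labels: list of 0/1 group labels, one per sample.
    """
    best = 0.0
    for j in range(len(data[0])):
        g0 = [row[j] for row, lab in zip(data, labels) if lab == 0]
        g1 = [row[j] for row, lab in zip(data, labels) if lab == 1]
        # Welch-style standard error of the difference in group means.
        se = (statistics.variance(g0) / len(g0)
              + statistics.variance(g1) / len(g1)) ** 0.5
        if se > 0:
            best = max(best, abs(statistics.mean(g1) - statistics.mean(g0)) / se)
    return best


def permutation_p_value(data, labels, n_perm=500, seed=0):
    """Unconditional permutation p-value for the maximum statistic."""
    rng = random.Random(seed)
    observed = max_abs_t(data, labels)
    shuffled = list(labels)
    exceed = 0
    for _ in range(n_perm):
        # Permuting whole sample units keeps the correlation among features.
        rng.shuffle(shuffled)
        if max_abs_t(data, shuffled) >= observed:
            exceed += 1
    # Add-one correction keeps the estimate strictly positive.
    return (exceed + 1) / (n_perm + 1)
```

Conditioning on the spread of the observed histogram, as the abstract proposes, would require an additional step that is not reproduced here.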
Directory of Open Access Journals (Sweden)
Ibáñez Berta
2009-04-01
Background: The importance of Small Area Variation Analysis for policy-making contrasts with the scarcity of work on the validity of the statistics used in these studies. Our study aims at (1) determining whether variation in utilization rates between health areas is higher than would be expected by chance; (2) estimating the statistical power of the variation statistics; and (3) evaluating the ability of different statistics to compare the variability among different procedures regardless of their rates. Methods: Parametric bootstrap techniques were used to derive the empirical distribution for each statistic under the hypothesis of homogeneity across areas. Non-parametric procedures were used to analyze the empirical distribution for the observed statistics and compare the results in six situations (low/medium/high utilization rates and low/high variability). A small-scale simulation study was conducted to assess the capacity of each statistic to discriminate between scenarios with different degrees of variation. Results: Bootstrap techniques proved to be good at quantifying the difference between the null hypothesis and the variation observed in each situation, and at constructing reliable tests and confidence intervals for each of the variation statistics analyzed. Although the Systematic Component of Variation (SCV) performs well, the Empirical Bayes (EB) statistic shows better behaviour under the null hypothesis, is able to detect variability if present, is not influenced by the procedure rate, and is best able to discriminate between different degrees of heterogeneity. Conclusion: The EB statistic seems to be a good alternative to more conventional statistics used in small-area variation analysis in health service research because of its robustness.
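A parametric bootstrap under the homogeneity hypothesis can be sketched as below. The abstract does not give the authors' implementation, so several choices are assumptions made for illustration: area counts are simulated as Poisson with a common rate, a Pearson-type heterogeneity statistic stands in for the SCV and EB statistics, and the helper names are hypothetical.

```python
import math
import random


def poisson_draw(rng, lam):
    """Knuth's multiplication method; adequate for small expected counts."""
    limit = math.exp(-lam)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def heterogeneity(counts, pops):
    """Pearson-type statistic: sum of (observed - expected)^2 / expected."""
    rate = sum(counts) / sum(pops)
    if rate == 0:
        return 0.0  # no events at all, so no measurable variation
    return sum((c - p * rate) ** 2 / (p * rate) for c, p in zip(counts, pops))


def bootstrap_p_value(counts, pops, n_boot=1000, seed=0):
    """Share of homogeneous replicates at least as variable as the data."""
    rng = random.Random(seed)
    rate = sum(counts) / sum(pops)  # common rate under homogeneity
    observed = heterogeneity(counts, pops)
    exceed = 0
    for _ in range(n_boot):
        # Every area shares the same underlying rate in each replicate.
        sim = [poisson_draw(rng, p * rate) for p in pops]
        if heterogeneity(sim, pops) >= observed:
            exceed += 1
    return (exceed + 1) / (n_boot + 1)
```

Swapping in a different variation statistic only requires replacing `heterogeneity`; the resampling loop stays the same.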
International Nuclear Information System (INIS)
Garcia, Francisco; Palacio, Carlos; Garcia, Uriel
2012-01-01
Multivariate statistical techniques were used to investigate the temporal and spatial variations of water quality at the Santa Marta coastal area, where a submarine outfall that discharges 1 m3/s of domestic wastewater is located. Two-way analysis of variance (ANOVA), cluster and principal component analysis, and kriging interpolation were considered for this report. Temporal variation showed two heterogeneous periods: from December to April and in July, the concentrations of the water quality parameters were higher; during the rest of the year (May, June, August-November) they were significantly lower. The spatial variation revealed two areas where the water quality differs; this difference is related to proximity to the submarine outfall discharge.
On the analysis of line profile variations: A statistical approach
International Nuclear Information System (INIS)
McCandliss, S.R.
1988-01-01
This study is concerned with the empirical characterization of the line profile variations (LPV), which occur in many Of and Wolf-Rayet stars. The goal of the analysis is to gain insight into the physical mechanisms producing the variations. The analytic approach uses a statistical method to quantify the significance of the LPV and to identify those regions in the line profile which are undergoing statistically significant variations. Line positions and flux variations are then measured and subjected to temporal and correlative analysis. Previous studies of LPV have for the most part been restricted to observations of a single line. Important information concerning the range and amplitude of the physical mechanisms involved can be obtained by simultaneously observing spectral features formed over a range of depths in the extended mass-losing atmospheres of massive, luminous stars. Time series of a Wolf-Rayet and two Of stars with nearly complete spectral coverage from 3940 angstrom to 6610 angstrom and with spectral resolution of R = 10,000 are analyzed here. These three stars exhibit a wide range of both spectral and temporal line profile variations. The HeII Pickering lines of HD 191765 show a monotonic increase in the peak rms variation amplitude with lines formed at progressively larger radii in the Wolf-Rayet star wind. Two time scales of variation have been identified in this star: a less than one day variation associated with small-scale flickering in the peaks of the line profiles and a greater than one day variation associated with large-scale asymmetric changes in the overall line profile shapes. However, no convincing periodic phenomena are evident at those periods which are well sampled in this time series.
Identifying significant temporal variation in time course microarray data without replicates
Directory of Open Access Journals (Sweden)
Porter Weston
2009-03-01
Background: An important component of time course microarray studies is the identification of genes that demonstrate significant time-dependent variation in their expression levels. Until recently, available methods for performing such significance tests required replicates of individual time points. This paper describes a replicate-free method that was developed as part of a study of the estrous cycle in the rat mammary gland in which no replicate data were collected. Results: A temporal test statistic is proposed that is based on the degree to which data are smoothed when fit by a spline function. An algorithm is presented that uses this test statistic together with a false discovery rate method to identify genes whose expression profiles exhibit significant temporal variation. The algorithm is tested on simulated data, and is compared with another recently published replicate-free method. The simulated data consist of both genes with known temporal dependencies and genes from a null distribution. The proposed algorithm identifies a larger percentage of the time-dependent genes for a given false discovery rate. Use of the algorithm in a study of the estrous cycle in the rat mammary gland resulted in the identification of genes exhibiting distinct circadian variation. These results were confirmed in follow-up laboratory experiments. Conclusion: The proposed algorithm provides a new approach for identifying expression profiles with significant temporal variation without relying on replicates. When compared with a recently published algorithm on simulated data, the proposed algorithm appears to identify a larger percentage of time-dependent genes for a given false discovery rate. The development of the algorithm was instrumental in revealing the presence of circadian variation in the virgin rat mammary gland during the estrous cycle.
Statistical Significance for Hierarchical Clustering
Kimes, Patrick K.; Liu, Yufeng; Hayes, D. Neil; Marron, J. S.
2017-01-01
Summary Cluster analysis has proved to be an invaluable tool for the exploratory and unsupervised analysis of high dimensional datasets. Among methods for clustering, hierarchical approaches have enjoyed substantial popularity in genomics and other fields for their ability to simultaneously uncover multiple layers of clustering structure. A critical and challenging question in cluster analysis is whether the identified clusters represent important underlying structure or are artifacts of natural sampling variation. Few approaches have been proposed for addressing this problem in the context of hierarchical clustering, for which the problem is further complicated by the natural tree structure of the partition, and the multiplicity of tests required to parse the layers of nested clusters. In this paper, we propose a Monte Carlo based approach for testing statistical significance in hierarchical clustering which addresses these issues. The approach is implemented as a sequential testing procedure guaranteeing control of the family-wise error rate. Theoretical justification is provided for our approach, and its power to detect true clustering structure is illustrated through several simulation studies and applications to two cancer gene expression datasets. PMID:28099990
Statistical significance of cis-regulatory modules
Directory of Open Access Journals (Sweden)
Smith Andrew D
2007-01-01
Background: It is becoming increasingly important for researchers to be able to scan through large genomic regions for transcription factor binding sites or clusters of binding sites forming cis-regulatory modules. Correspondingly, there has been a push to develop algorithms for the rapid detection and assessment of cis-regulatory modules. While various algorithms for this purpose have been introduced, most are not well suited for rapid, genome-scale scanning. Results: We introduce methods designed for the detection and statistical evaluation of cis-regulatory modules, modeled as either clusters of individual binding sites or as combinations of sites with constrained organization. In order to determine the statistical significance of module sites, we first need a method to determine the statistical significance of single transcription factor binding site matches. We introduce a straightforward method of estimating the statistical significance of single site matches using a database of known promoters to produce data structures that can be used to estimate p-values for binding site matches. We next introduce a technique to calculate the statistical significance of the arrangement of binding sites within a module using a max-gap model. If the module scanned for has defined organizational parameters, the probability of the module is corrected to account for organizational constraints. The statistical significance of single site matches and the architecture of sites within the module can be combined to provide an overall estimation of statistical significance of cis-regulatory module sites. Conclusion: The methods introduced in this paper allow for the detection and statistical evaluation of single transcription factor binding sites and cis-regulatory modules. The features described are implemented in the Search Tool for Occurrences of Regulatory Motifs (STORM) and MODSTORM software.
Sibling Competition & Growth Tradeoffs. Biological vs. Statistical Significance.
Kramer, Karen L; Veile, Amanda; Otárola-Castillo, Erik
2016-01-01
Early childhood growth has many downstream effects on future health and reproduction and is an important measure of offspring quality. While a tradeoff between family size and child growth outcomes is theoretically predicted in high-fertility societies, empirical evidence is mixed. This is often attributed to phenotypic variation in parental condition. However, inconsistent study results may also arise because family size confounds the potentially differential effects that older and younger siblings can have on young children's growth. Additionally, inconsistent results might reflect that the biological significance associated with different growth trajectories is poorly understood. This paper addresses these concerns by tracking children's monthly gains in height and weight from weaning to age five in a high fertility Maya community. We predict that: 1) as an aggregate measure family size will not have a major impact on child growth during the post weaning period; 2) competition from young siblings will negatively impact child growth during the post weaning period; 3) however because of their economic value, older siblings will have a negligible effect on young children's growth. Accounting for parental condition, we use linear mixed models to evaluate the effects that family size, younger and older siblings have on children's growth. Congruent with our expectations, it is younger siblings who have the most detrimental effect on children's growth. While we find statistical evidence of a quantity/quality tradeoff effect, the biological significance of these results is negligible in early childhood. Our findings help to resolve why quantity/quality studies have had inconsistent results by showing that sibling competition varies with sibling age composition, not just family size, and that biological significance is distinct from statistical significance.
The thresholds for statistical and clinical significance
DEFF Research Database (Denmark)
Jakobsen, Janus Christian; Gluud, Christian; Winkel, Per
2014-01-01
BACKGROUND: Thresholds for statistical significance are insufficiently demonstrated by 95% confidence intervals or P-values when assessing results from randomised clinical trials. First, a P-value only shows the probability of getting a result assuming that the null hypothesis is true and does...... not reflect the probability of getting a result assuming an alternative hypothesis to the null hypothesis is true. Second, a confidence interval or a P-value showing significance may be caused by multiplicity. Third, statistical significance does not necessarily result in clinical significance. Therefore...... of the probability that a given trial result is compatible with a 'null' effect (corresponding to the P-value) divided by the probability that the trial result is compatible with the intervention effect hypothesised in the sample size calculation; (3) adjust the confidence intervals and the statistical significance...
The insignificance of statistical significance testing
Johnson, Douglas H.
1999-01-01
Despite their use in scientific journals such as The Journal of Wildlife Management, statistical hypothesis tests add very little value to the products of research. Indeed, they frequently confuse the interpretation of data. This paper describes how statistical hypothesis tests are often viewed, and then contrasts that interpretation with the correct one. I discuss the arbitrariness of P-values, conclusions that the null hypothesis is true, power analysis, and distinctions between statistical and biological significance. Statistical hypothesis testing, in which the null hypothesis about the properties of a population is almost always known a priori to be false, is contrasted with scientific hypothesis testing, which examines a credible null hypothesis about phenomena in nature. More meaningful alternatives are briefly outlined, including estimation and confidence intervals for determining the importance of factors, decision theory for guiding actions in the face of uncertainty, and Bayesian approaches to hypothesis testing and other statistical practices.
Variation in reaction norms: Statistical considerations and biological interpretation.
Morrissey, Michael B; Liefting, Maartje
2016-09-01
Analysis of reaction norms, the functions by which the phenotype produced by a given genotype depends on the environment, is critical to studying many aspects of phenotypic evolution. Different techniques are available for quantifying different aspects of reaction norm variation. We examine what biological inferences can be drawn from some of the more readily applicable analyses for studying reaction norms. We adopt a strongly biologically motivated view, but draw on statistical theory to highlight strengths and drawbacks of different techniques. In particular, consideration of some formal statistical theory leads to revision of some recently, and forcefully, advocated opinions on reaction norm analysis. We clarify what simple analysis of the slope between mean phenotype in two environments can tell us about reaction norms, explore the conditions under which polynomial regression can provide robust inferences about reaction norm shape, and explore how different existing approaches may be used to draw inferences about variation in reaction norm shape. We show how mixed model-based approaches can provide more robust inferences than more commonly used multistep statistical approaches, and derive new metrics of the relative importance of variation in reaction norm intercepts, slopes, and curvatures. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.
On Teaching about the Coefficient of Variation in Introductory Statistics Courses
Trafimow, David
2014-01-01
The standard deviation is related to the mean by virtue of the coefficient of variation. Teachers of statistics courses can make use of that fact to make the standard deviation more comprehensible for statistics students.
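The relationship the abstract refers to is CV = SD / mean, so the standard deviation can be read as the mean scaled by the coefficient of variation. A minimal sketch, assuming sample (n - 1) standard deviation and a hypothetical helper name:

```python
import statistics


def coefficient_of_variation(values):
    """Sample standard deviation expressed as a fraction of the mean."""
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("coefficient of variation is undefined for a zero mean")
    return statistics.stdev(values) / mean


scores = [2, 4, 4, 4, 5, 5, 7, 9]
cv = coefficient_of_variation(scores)
# Multiplying the CV by the mean recovers the standard deviation.
recovered_sd = cv * statistics.mean(scores)
```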
Caveats for using statistical significance tests in research assessments
DEFF Research Database (Denmark)
Schneider, Jesper Wiborg
2013-01-01
This article raises concerns about the advantages of using statistical significance tests in research assessments, as has recently been suggested in the debate about proper normalization procedures for citation indicators by Opthof and Leydesdorff (2010). Statistical significance tests are highly controversial and numerous criticisms have been leveled against their use. Based on examples from articles by proponents of the use of statistical significance tests in research assessments, we address some of the numerous problems with such tests. The issues specifically discussed are the ritual practice...... argue that applying statistical significance tests and mechanically adhering to their results are highly problematic and detrimental to critical thinking. We claim that the use of such tests does not provide any advantages in relation to deciding whether differences between citation indicators...
Statistically significant relational data mining :
Energy Technology Data Exchange (ETDEWEB)
Berry, Jonathan W.; Leung, Vitus Joseph; Phillips, Cynthia Ann; Pinar, Ali; Robinson, David Gerald; Berger-Wolf, Tanya; Bhowmick, Sanjukta; Casleton, Emily; Kaiser, Mark; Nordman, Daniel J.; Wilson, Alyson G.
2014-02-01
This report summarizes the work performed under the project "Statistically significant relational data mining." The goal of the project was to add more statistical rigor to the fairly ad hoc area of data mining on graphs. Our goal was to develop better algorithms and better ways to evaluate algorithm quality. We concentrated on algorithms for community detection, approximate pattern matching, and graph similarity measures. Approximate pattern matching involves finding an instance of a relatively small pattern, expressed with tolerance, in a large graph of data observed with uncertainty. This report gathers the abstracts and references for the eight refereed publications that have appeared as part of this work. We then archive three pieces of research that have not yet been published. The first is theoretical and experimental evidence that a popular statistical measure for comparison of community assignments favors over-resolved communities over approximations to a ground truth. The second comprises statistically motivated methods for measuring the quality of an approximate match of a small pattern in a large graph. The third is a new probabilistic random graph model. Statisticians favor these models for graph analysis. The new local structure graph model overcomes some of the issues with popular models such as exponential random graph models and latent variable models.
Farrell, Mary Beth
2018-06-01
This article is the second part of a continuing education series reviewing basic statistics that nuclear medicine and molecular imaging technologists should understand. In this article, the statistics for evaluating interpretation accuracy, significance, and variance are discussed. Throughout the article, actual statistics are pulled from the published literature. We begin by explaining 2 methods for quantifying interpretive accuracy: interreader and intrareader reliability. Agreement among readers can be expressed simply as a percentage. However, the Cohen κ-statistic is a more robust measure of agreement that accounts for chance. The higher the κ-statistic, the greater the agreement between readers. When 3 or more readers are being compared, the Fleiss κ-statistic is used. Significance testing determines whether the difference between 2 conditions or interventions is meaningful. Statistical significance is usually expressed using a number called a probability (P) value. Calculation of the P value is beyond the scope of this review. However, knowing how to interpret P values is important for understanding the scientific literature. Generally, a P value of less than 0.05 is considered significant and indicates that the results of the experiment are due to more than just chance. Variance, standard deviation (SD), confidence interval, and standard error (SE) explain the dispersion of data around a mean of a sample drawn from a population. SD is commonly reported in the literature. A small SD indicates that there is not much variation in the sample data. Many biologic measurements fall into what is referred to as a normal distribution, taking the shape of a bell curve. In a normal distribution, 68% of the data will fall within 1 SD, 95% will fall within 2 SDs, and 99.7% will fall within 3 SDs. Confidence interval defines the range of possible values within which the population parameter is likely to lie and gives an idea of the precision of the statistic being
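The chance correction that separates the Cohen κ-statistic from simple percentage agreement can be illustrated for two readers. The function name and the example readings below are hypothetical, and the sketch covers only the two-reader case, not the Fleiss κ-statistic.

```python
from collections import Counter


def cohen_kappa(reader_a, reader_b):
    """Cohen's kappa for two readers rating the same cases."""
    n = len(reader_a)
    # Observed agreement: share of cases given the same label by both readers.
    observed = sum(a == b for a, b in zip(reader_a, reader_b)) / n
    freq_a, freq_b = Counter(reader_a), Counter(reader_b)
    # Agreement expected by chance from each reader's label frequencies.
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)


# Hypothetical readings of 50 scans: 35 agreements and 15 disagreements.
a = ["pos"] * 20 + ["neg"] * 15 + ["pos"] * 5 + ["neg"] * 10
b = ["pos"] * 20 + ["neg"] * 15 + ["neg"] * 5 + ["pos"] * 10
kappa = cohen_kappa(a, b)
```

Here percentage agreement is 70%, chance agreement is 50%, and κ works out to 0.4, which shows how much of the raw agreement the correction removes.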
Wilkinson, D; Hiller, J; Moss, J; Ryan, P; Worsley, T
2000-06-01
To describe variation in all cause and selected cause-specific mortality rates across Australia. Mortality and population data for 1997 were obtained from the Australian Bureau of Statistics. All cause and selected cause-specific mortality rates were calculated and directly standardised to the 1997 Australian population in 5-year age groups. Selected major causes of death included cancer, coronary artery disease, cerebrovascular disease, diabetes, accidents and suicide. Rates are reported by statistical division, and State and Territory. All cause age-standardised mortality was 6.98 per 1000 in 1997 and this varied 2-fold from a low in the statistical division of Pilbara, Western Australia (5.78, 95% confidence interval 5.06-6.56), to a high in Northern Territory--excluding Darwin (11.30, 10.67-11.98). Similar mortality variation (all p killers. Larger variation (all p suicide (0.6-3.8 per 10,000). Less marked variation was observed when analysed by State and Territory, but Northern Territory consistently has the highest age-standardised mortality rates. Analysed by statistical division, substantial mortality gradients exist across Australia, suggesting an inequitable distribution of the determinants of health. Further research is required to better understand this heterogeneity.
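Direct standardisation, as used for the rates in this abstract, weights each age-specific rate by the share of that age group in a standard population. A minimal sketch with made-up numbers; the function name and the two age groups are assumptions for illustration, not the study's data.

```python
def directly_standardised_rate(deaths, person_years, standard_pop):
    """Weighted average of age-specific rates using standard-population weights."""
    total_std = sum(standard_pop)
    return sum(
        (d / n) * (s / total_std)
        for d, n, s in zip(deaths, person_years, standard_pop)
    )


# Two hypothetical age groups: young (low rate) and old (high rate).
rate = directly_standardised_rate(
    deaths=[10, 50],
    person_years=[10_000, 5_000],
    standard_pop=[8_000, 2_000],
)
```

With these inputs the age-specific rates are 1 and 10 per 1000, the weights are 0.8 and 0.2, and the standardised rate is 2.8 per 1000.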
Health significance and statistical uncertainty. The value of P-value.
Consonni, Dario; Bertazzi, Pier Alberto
2017-10-27
The P-value is widely used as a summary statistic of scientific results. Unfortunately, there is a widespread tendency to dichotomize its value into "P<0.05" ("statistically significant") and "P>0.05" ("statistically not significant"), with the former implying a "positive" result and the latter a "negative" one. Our aim is to show the unsuitability of such an approach when evaluating the effects of environmental and occupational risk factors. We provide examples of distorted use of the P-value and of the negative consequences for science and public health of such a black-and-white vision. The rigid interpretation of the P-value as a dichotomy favors the confusion between health relevance and statistical significance, discourages thoughtful thinking, and diverts attention from what really matters, the health significance. A much better way to express and communicate scientific results involves reporting effect estimates (e.g., risks, risk ratios or risk differences) and their confidence intervals (CI), which summarize and convey both health significance and statistical uncertainty. Unfortunately, many researchers do not usually consider the whole interval of the CI but only examine whether it includes the null value, thereby degrading this procedure to the same P-value dichotomy (statistical significance or not). In reporting statistical results of scientific research, present effect estimates with their confidence intervals and do not qualify the P-value as "significant" or "not significant".
Ranganathan, Priya; Pramesh, C. S.; Buyse, Marc
2015-01-01
In the second part of a series on pitfalls in statistical analysis, we look at various ways in which a statistically significant study result can be expressed. We debunk some of the myths regarding the ‘P’ value, explain the importance of ‘confidence intervals’ and clarify the importance of including both values in a paper. PMID:25878958
International Nuclear Information System (INIS)
Boning, Duane S.; Chung, James E.
1998-01-01
Advanced process technology will require more detailed understanding and tighter control of variation in devices and interconnects. The purpose of statistical metrology is to provide methods to measure and characterize variation, to model systematic and random components of that variation, and to understand the impact of variation on both yield and performance of advanced circuits. Of particular concern are spatial or pattern dependencies within individual chips; such systematic variation within the chip can have a much larger impact on performance than wafer-level random variation. Statistical metrology methods will play an important role in the creation of design rules for advanced technologies. For example, a key issue in multilayer interconnect is the uniformity of interlevel dielectric (ILD) thickness within the chip. For the case of ILD thickness, we describe phases of statistical metrology development and application to understanding and modeling thickness variation arising from chemical-mechanical polishing (CMP). These phases include screening experiments including design of test structures and test masks to gather electrical or optical data, techniques for statistical decomposition and analysis of the data, and approaches to calibrating empirical and physical variation models. These models can be integrated with circuit CAD tools to evaluate different process integration or design rule strategies. One focus for the generation of interconnect design rules is guidelines for the use of 'dummy fill' or 'metal fill' to improve the uniformity of underlying metal density and thus improve the uniformity of oxide thickness within the die. Trade-offs that can be evaluated via statistical metrology include the improvements to uniformity possible versus the effect of increased capacitance due to additional metal.
Advanced statistics for tokamak transport colinearity and tokamak to tokamak variation
International Nuclear Information System (INIS)
Riedel, K.S.
1989-03-01
This is a compendium of three separate articles on the statistical analysis of tokamak transport. The first article is an expository introduction to advanced statistics and scaling laws. The second analyzes two important problems of tokamak data---colinearity and tokamak to tokamak variation in detail. The third article generalizes the Swamy random coefficient model to the case of degenerate matrices. Three papers have been processed separately
Understanding the Sampling Distribution and Its Use in Testing Statistical Significance.
Breunig, Nancy A.
Despite the increasing criticism of statistical significance testing by researchers, particularly in the publication of the 1994 American Psychological Association's style manual, statistical significance test results are still popular in journal articles. For this reason, it remains important to understand the logic of inferential statistics. A…
Statistical mechanics of learning: A variational approach for real data
International Nuclear Information System (INIS)
Malzahn, Doerthe; Opper, Manfred
2002-01-01
Using a variational technique, we generalize the statistical physics approach of learning from random examples to make it applicable to real data. We demonstrate the validity and relevance of our method by computing approximate estimators for generalization errors that are based on training data alone
Swiss solar power statistics 2007 - Significant expansion
International Nuclear Information System (INIS)
Hostettler, T.
2008-01-01
This article presents and discusses the 2007 statistics for solar power in Switzerland. A significant number of new installations is noted, as are the high production figures from newer installations. The basics behind the compilation of the Swiss solar power statistics are briefly reviewed and an overview for the period 1989 to 2007 is presented which includes figures on the number of photovoltaic plants in service and installed peak power. Typical production figures in kilowatt-hours (kWh) per installed kilowatt-peak power (kWp) are presented and discussed for installations of various sizes. Increased production after inverter replacement in older installations is noted. Finally, the general political situation in Switzerland as far as solar power is concerned is briefly discussed, as are international developments.
Test for the statistical significance of differences between ROC curves
International Nuclear Information System (INIS)
Metz, C.E.; Kronman, H.B.
1979-01-01
A test for the statistical significance of observed differences between two measured Receiver Operating Characteristic (ROC) curves has been designed and evaluated. The set of observer response data for each ROC curve is assumed to be independent and to arise from a ROC curve having a form which, in the absence of statistical fluctuations in the response data, graphs as a straight line on double normal-deviate axes. To test the significance of an apparent difference between two measured ROC curves, maximum likelihood estimates of the two parameters of each curve and the associated parameter variances and covariance are calculated from the corresponding set of observer response data. An approximate Chi-square statistic with two degrees of freedom is then constructed from the differences between the parameters estimated for each ROC curve and from the variances and covariances of these estimates. This statistic is known to be truly Chi-square distributed only in the limit of large numbers of trials in the observer performance experiments. Performance of the statistic for data arising from a limited number of experimental trials was evaluated. Independent sets of rating scale data arising from the same underlying ROC curve were paired, and the fraction of differences found (falsely) significant was compared to the significance level, α, used with the test. Although test performance was found to be somewhat dependent on both the number of trials in the data and the position of the underlying ROC curve in the ROC space, the results for various significance levels showed the test to be reliable under practical experimental conditions
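The construction described in this abstract can be sketched as a quadratic form in the parameter differences. The following is a minimal illustration, not the authors' implementation: it assumes each curve is summarized by two estimated parameters (called a and b here, names chosen for illustration) with a 2x2 covariance matrix, that the two data sets are independent so the covariances add, and that the statistic is the usual d^T S^{-1} d form. For two degrees of freedom the chi-square upper tail probability is exactly exp(-x/2), so no statistics library is needed.

```python
import math


def roc_difference_test(params1, cov1, params2, cov2):
    """Approximate 2-df chi-square test for a difference between two ROC curves.

    params*: (a, b) parameter estimates for one curve (hypothetical names).
    cov*:    ((var_a, cov_ab), (cov_ab, var_b)) for the same curve.
    Returns (chi2, p_value).
    """
    da = params1[0] - params2[0]
    db = params1[1] - params2[1]
    # Independent data sets: the covariance of the difference is the sum.
    s_aa = cov1[0][0] + cov2[0][0]
    s_ab = cov1[0][1] + cov2[0][1]
    s_bb = cov1[1][1] + cov2[1][1]
    det = s_aa * s_bb - s_ab ** 2
    if det <= 0:
        raise ValueError("combined covariance matrix must be positive definite")
    # Quadratic form d^T S^{-1} d, using the closed-form inverse of a 2x2 matrix.
    chi2 = (s_bb * da * da - 2.0 * s_ab * da * db + s_aa * db * db) / det
    # Upper tail of a chi-square distribution with 2 degrees of freedom.
    p_value = math.exp(-chi2 / 2.0)
    return chi2, p_value


# Illustrative (made-up) estimates, not data from the paper.
cov = ((0.02, 0.0), (0.0, 0.01))
chi2, p = roc_difference_test((1.5, 0.9), cov, (1.0, 1.0), cov)
print(round(chi2, 2), round(p, 4))
```

As the abstract notes, the chi-square approximation holds only in the limit of many trials, so a small p-value from a short experiment should be read with that caveat.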
Directory of Open Access Journals (Sweden)
Rawid Banchuin
2014-01-01
In this research, statistical variations in the high-frequency characteristics of the subthreshold MOSFET, defined in terms of gate capacitance and transition frequency, are analyzed, and comprehensive analytical models of these variations in terms of their variances are proposed. Major imperfections in physical-level properties, including random dopant fluctuation and the effects of variations in the MOSFET manufacturing process, are taken into account in the analysis and modeling. An up-to-date comprehensive analytical model of statistical variation in MOSFET parameters is used as the basis of the analysis and modeling. The resulting models are both analytic and comprehensive, as they are precise mathematical expressions in terms of the physical-level variables of the MOSFET. Furthermore, they have been verified at the nanometer level using 65 nm BSIM4-based benchmarks and found to be very accurate, with average percentage errors smaller than 5%. Hence, the resulting models are a potential mathematical tool for statistical and variability-aware analysis and design of subthreshold MOSFET-based VHF circuits, systems and applications.
Reducing lumber thickness variation using real-time statistical process control
Thomas M. Young; Brian H. Bond; Jan Wiedenbeck
2002-01-01
A technology feasibility study for reducing lumber thickness variation was conducted from April 2001 until March 2002 at two sawmills located in the southern U.S. A real-time statistical process control (SPC) system was developed that featured Wonderware human machine interface technology (HMI) with distributed real-time control charts for all sawing centers and...
Characterization and potential functional significance of human-chimpanzee large INDEL variation
Directory of Open Access Journals (Sweden)
Polavarapu Nalini
2011-10-01
Background: Although humans and chimpanzees have accumulated significant differences in a number of phenotypic traits since diverging from a common ancestor about six million years ago, their genomes are more than 98.5% identical at protein-coding loci. This modest degree of nucleotide divergence is not sufficient to explain the extensive phenotypic differences between the two species. It has been hypothesized that the genetic basis of the phenotypic differences lies at the level of gene regulation and is associated with the extensive insertion and deletion (INDEL) variation between the two species. To test the hypothesis that large INDELs (80 to 12,000 bp) may have contributed significantly to differences in gene regulation between the two species, we categorized human-chimpanzee INDEL variation mapping in or around genes and determined whether this variation is significantly correlated with previously determined differences in gene expression. Results: Extensive, large INDEL variation exists between the human and chimpanzee genomes. This variation is primarily attributable to retrotransposon insertions within the human lineage. There is a significant correlation between differences in gene expression and large human-chimpanzee INDEL variation mapping in genes or in proximity to them. Conclusions: The results presented herein are consistent with the hypothesis that large INDELs, particularly those associated with retrotransposons, have played a significant role in human-chimpanzee regulatory evolution.
On detection and assessment of statistical significance of Genomic Islands
Directory of Open Access Journals (Sweden)
Chaudhuri Probal
2008-04-01
Background: Many of the available methods for detecting Genomic Islands (GIs) in prokaryotic genomes use markers such as transposons, proximal tRNAs, flanking repeats etc., or they use other supervised techniques requiring training datasets. Most of these methods are primarily based on the biases in GC content or codon and amino acid usage of the islands. However, these methods either do not use any formal statistical test of significance or use statistical tests for which the critical values and the P-values are not adequately justified. We propose a method which is unsupervised in nature and uses Monte-Carlo statistical tests based on randomly selected segments of a chromosome. Such tests are supported by precise statistical distribution theory, and consequently, the resulting P-values are quite reliable for making the decision. Results: Our algorithm (named Design-Island, an acronym for Detection of Statistically Significant Genomic Island) runs in two phases. Some 'putative GIs' are identified in the first phase, and those are refined into smaller segments containing horizontally acquired genes in the refinement phase. This method is applied to the Salmonella typhi CT18 genome, leading to the discovery of several new pathogenicity, antibiotic resistance and metabolic islands that were missed by earlier methods. Many of these islands contain mobile genetic elements like phage-mediated genes, transposons, integrase and IS elements, confirming their horizontal acquirement. Conclusion: The proposed method is based on statistical tests supported by precise distribution theory and reliable P-values, along with a technique for visualizing statistically significant islands. The performance of our method is better than that of many other well-known methods in terms of sensitivity and accuracy, and in terms of specificity it is comparable to other methods.
Increasing the statistical significance of entanglement detection in experiments.
Jungnitsch, Bastian; Niekamp, Sönke; Kleinmann, Matthias; Gühne, Otfried; Lu, He; Gao, Wei-Bo; Chen, Yu-Ao; Chen, Zeng-Bing; Pan, Jian-Wei
2010-05-28
Entanglement is often verified by a violation of an inequality like a Bell inequality or an entanglement witness. Considerable effort has been devoted to the optimization of such inequalities in order to obtain a high violation. We demonstrate theoretically and experimentally that such an optimization does not necessarily lead to a better entanglement test, if the statistical error is taken into account. Theoretically, we show for different error models that reducing the violation of an inequality can improve the significance. Experimentally, we observe this phenomenon in a four-photon experiment, testing the Mermin and Ardehali inequality for different levels of noise. Furthermore, we provide a way to develop entanglement tests with high statistical significance.
Testing the Difference of Correlated Agreement Coefficients for Statistical Significance
Gwet, Kilem L.
2016-01-01
This article addresses the problem of testing the difference between two correlated agreement coefficients for statistical significance. A number of authors have proposed methods for testing the difference between two correlated kappa coefficients, which require either the use of resampling methods or the use of advanced statistical modeling…
Anomalous variations of NmF2 over the Argentine Islands: a statistical study
Directory of Open Access Journals (Sweden)
A. V. Pavlov
2009-04-01
We present a statistical study of variations in the F2-layer peak electron density, NmF2, and altitude, hmF2, over the Argentine Islands ionosonde. The critical frequencies, foF2 and foE, of the F2 and E layers, and the propagation factor, M(3000)F2, measured by the ionosonde during the 1957–1959 and 1962–1995 time periods were used in the statistical analysis to determine the values of NmF2 and hmF2. The probabilities of observing maximum and minimum values of NmF2 and hmF2 in a diurnal variation of the electron density are calculated. Our study shows that most of the maximum diurnal values of NmF2 are observed in a time sector close to midnight in November, December, January, and February, exhibiting the anomalous diurnal variations of NmF2. Another anomalous feature of the diurnal variations of NmF2 is exhibited during November, December, and January, when the minimum diurnal value of NmF2 is mainly located close to the noon sector. These anomalous diurnal variations of NmF2 are found during both geomagnetically quiet and disturbed conditions. Anomalous features are not found in the diurnal variations of hmF2. A statistical study of the NmF2 winter anomaly phenomena over the Argentine Islands ionosonde was also carried out. The variations in a maximum daytime value, R, of a ratio of a geomagnetically quiet daytime winter NmF2 to a geomagnetically quiet daytime summer NmF2 taken at a given UT and for approximately the same level of solar activity were studied. The conditional probability of the occurrence of R in an interval of R, the most frequent value of R, the mean expected value of R, and the conditional probability of observing the F2-region winter anomaly during a daytime period were calculated for low, moderate, and high solar activity. The calculations show that the mean expected value of R and the occurrence frequency of the F2-region winter anomaly increase with increasing solar activity.
Directory of Open Access Journals (Sweden)
Anita Lindmark
When profiling hospital performance, quality indicators are commonly evaluated through hospital-specific adjusted means with confidence intervals. When identifying deviations from a norm, large hospitals can have statistically significant results even for clinically irrelevant deviations while important deviations in small hospitals can remain undiscovered. We have used data from the Swedish Stroke Register (Riksstroke) to illustrate the properties of a benchmarking method that integrates considerations of both clinical relevance and level of statistical significance. The performance measure used was case-mix adjusted risk of death or dependency in activities of daily living within 3 months after stroke. A hospital was labeled as having outlying performance if its case-mix adjusted risk exceeded a benchmark value with a specified statistical confidence level. The benchmark was expressed relative to the population risk and should reflect the clinically relevant deviation that is to be detected. A simulation study based on Riksstroke patient data from 2008-2009 was performed to investigate the effect of the choice of the statistical confidence level and benchmark value on the diagnostic properties of the method. Simulations were based on 18,309 patients in 76 hospitals. The widely used setting, comparing 95% confidence intervals to the national average, resulted in low sensitivity (0.252) and high specificity (0.991). There were large variations in sensitivity and specificity for different requirements of statistical confidence. Lowering statistical confidence improved sensitivity with a relatively smaller loss of specificity. Variations due to different benchmark values were smaller, especially for sensitivity. This allows the choice of a clinically relevant benchmark to be driven by clinical factors without major concerns about sufficiently reliable evidence. The study emphasizes the importance of combining clinical relevance and level of statistical
Lindmark, Anita; van Rompaye, Bart; Goetghebeur, Els; Glader, Eva-Lotta; Eriksson, Marie
2016-01-01
When profiling hospital performance, quality indicators are commonly evaluated through hospital-specific adjusted means with confidence intervals. When identifying deviations from a norm, large hospitals can have statistically significant results even for clinically irrelevant deviations while important deviations in small hospitals can remain undiscovered. We have used data from the Swedish Stroke Register (Riksstroke) to illustrate the properties of a benchmarking method that integrates considerations of both clinical relevance and level of statistical significance. The performance measure used was case-mix adjusted risk of death or dependency in activities of daily living within 3 months after stroke. A hospital was labeled as having outlying performance if its case-mix adjusted risk exceeded a benchmark value with a specified statistical confidence level. The benchmark was expressed relative to the population risk and should reflect the clinically relevant deviation that is to be detected. A simulation study based on Riksstroke patient data from 2008-2009 was performed to investigate the effect of the choice of the statistical confidence level and benchmark value on the diagnostic properties of the method. Simulations were based on 18,309 patients in 76 hospitals. The widely used setting, comparing 95% confidence intervals to the national average, resulted in low sensitivity (0.252) and high specificity (0.991). There were large variations in sensitivity and specificity for different requirements of statistical confidence. Lowering statistical confidence improved sensitivity with a relatively smaller loss of specificity. Variations due to different benchmark values were smaller, especially for sensitivity. This allows the choice of a clinically relevant benchmark to be driven by clinical factors without major concerns about sufficiently reliable evidence. The study emphasizes the importance of combining clinical relevance and level of statistical confidence when
A population based statistical model for daily geometric variations in the thorax
Szeto, Yenny Z.; Witte, Marnix G.; van Herk, Marcel; Sonke, Jan-Jakob
2017-01-01
To develop a population based statistical model of the systematic interfraction geometric variations between the planning CT and first treatment week of lung cancer patients for inclusion as uncertainty term in future probabilistic planning. Deformable image registrations between the planning CT and
Hacker, Joshua; Vandenberghe, Francois; Jung, Byoung-Jo; Snyder, Chris
2017-04-01
Effective assimilation of cloud-affected radiance observations from space-borne imagers, with the aim of improving cloud analysis and forecasting, has proven to be difficult. Large observation biases, nonlinear observation operators, and non-Gaussian innovation statistics present many challenges. Ensemble-variational data assimilation (EnVar) systems offer the benefits of flow-dependent background error statistics from an ensemble, and the ability of variational minimization to handle nonlinearity. The specific benefits of ensemble statistics, relative to static background errors more commonly used in variational systems, have not been quantified for the problem of assimilating cloudy radiances. A simple experiment framework is constructed with a regional NWP model and operational variational data assimilation system, to provide the basis for understanding the importance of ensemble statistics in cloudy radiance assimilation. Restricting the observations to those corresponding to clouds in the background forecast leads to innovations that are more Gaussian. The number of large innovations is reduced compared to the more general case of all observations, but not eliminated. The Huber norm is investigated to handle the fat tails of the distributions, and allow more observations to be assimilated without the need for strict background checks that eliminate them. Comparing assimilation using only ensemble background error statistics with assimilation using only static background error statistics elucidates the importance of the ensemble statistics. Although the cost functions in both experiments converge to similar values after sufficient outer-loop iterations, the resulting cloud water, ice, and snow content are greater in the ensemble-based analysis. The subsequent forecasts from the ensemble-based analysis also retain more condensed water species, indicating that the local environment is more supportive of clouds. In this presentation we provide details that explain the
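To make the role of the Huber norm concrete, here is a minimal sketch of the standard Huber cost: quadratic for small residuals and linear beyond a threshold, so that large innovations are down-weighted rather than rejected. The variable names, the use of a normalized innovation as the argument, and the threshold value are assumptions for illustration; the abstract does not give the configuration actually used.

```python
def huber_cost(residual, delta):
    """Standard Huber cost for one (normalized) innovation.

    Quadratic (0.5 * r^2) when |r| <= delta, linear beyond that, with the
    two pieces matching in value and slope at |r| = delta.
    """
    r = abs(residual)
    if r <= delta:
        return 0.5 * r * r
    return delta * (r - 0.5 * delta)


# Compare with the purely quadratic cost for a small and a large innovation.
for innovation in (1.0, 4.0):
    print(innovation, 0.5 * innovation ** 2, huber_cost(innovation, delta=2.0))
```

With a threshold of 2, an innovation of 4 costs 6 instead of 8, which is why fat-tailed innovations can be kept in the minimization instead of being removed by a strict background check.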
Statistical significance of trends in monthly heavy precipitation over the US
Mahajan, Salil
2011-05-11
Trends in monthly heavy precipitation, defined by a return period of one year, are assessed for statistical significance in observations and Global Climate Model (GCM) simulations over the contiguous United States using Monte Carlo non-parametric and parametric bootstrapping techniques. The results from the two Monte Carlo approaches are found to be similar to each other, and also to the traditional non-parametric Kendall's τ test, implying the robustness of the approach. Two different observational data-sets are employed to test for trends in monthly heavy precipitation and are found to exhibit consistent results. Both data-sets demonstrate upward trends, one of which is found to be statistically significant at the 95% confidence level. Upward trends similar to observations are observed in some climate model simulations of the twentieth century, but their statistical significance is marginal. For projections of the twenty-first century, a statistically significant upward trend is observed in most of the climate models analyzed. The change in the simulated precipitation variance appears to be more important in the twenty-first century projections than changes in the mean precipitation. Stochastic fluctuations of the climate system are found to dominate monthly heavy precipitation, as some GCM simulations show a downward trend even in the twenty-first century projections when the greenhouse gas forcings are strong. © 2011 Springer-Verlag.
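As a rough illustration of a Monte Carlo trend test in the spirit of this abstract, the sketch below computes Kendall's τ of a series against time and estimates a two-sided p-value by shuffling the series. This is a permutation test, swapped in for simplicity; it is not the bootstrap procedure of the paper, and the function names, the number of shuffles and the seed are assumptions. Ties contribute zero to the statistic (tau-a), which is adequate for a sketch.

```python
import random


def kendall_s(series):
    """Kendall's S: concordant minus discordant pairs against time order."""
    s = 0
    n = len(series)
    for i in range(n - 1):
        for j in range(i + 1, n):
            diff = series[j] - series[i]
            s += (diff > 0) - (diff < 0)
    return s


def kendall_tau(series):
    """Kendall's tau-a of the series against its time index."""
    n = len(series)
    return kendall_s(series) / (n * (n - 1) / 2)


def permutation_trend_pvalue(series, n_perm=2000, seed=0):
    """Two-sided Monte Carlo p-value for 'no trend' by shuffling the series."""
    rng = random.Random(seed)
    observed = abs(kendall_s(series))
    shuffled = list(series)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(kendall_s(shuffled)) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_perm + 1)


# Made-up monotone series, used only to exercise the functions.
series = list(range(12))
print(kendall_tau(series), permutation_trend_pvalue(series))
```

Shuffling destroys any serial correlation in the data, so for autocorrelated climate series a block-based resampling scheme would be the more defensible choice.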
Temporal aspects of surface water quality variation using robust statistical tools.
Mustapha, Adamu; Aris, Ahmad Zaharin; Ramli, Mohammad Firuz; Juahir, Hafizan
2012-01-01
Robust statistical tools were applied to the water quality datasets with the aim of determining the most significant parameters and their contribution towards temporal water quality variation. Surface water samples were collected from four different sampling points during dry and wet seasons and analyzed for their physicochemical constituents. Discriminant analysis (DA) provided better results with great discriminatory ability by using five parameters (P < 0.05) for the dry season, affording more than 96% correct assignation, and used five and six parameters for forward and backward stepwise modes in the wet season data (P < 0.05), affording 68.20% and 82%, respectively. Partial correlation results revealed that there are strong (r(p) = 0.829) and moderate (r(p) = 0.614) relationships between five-day biochemical oxygen demand (BOD(5)) and chemical oxygen demand (COD), total solids (TS) and dissolved solids (DS) controlling for the linear effect of nitrogen in the form of ammonia (NH(3)) and conductivity for dry and wet seasons, respectively. Multiple linear regression identified the contribution of each variable with significant values r = 0.988, R(2) = 0.976 and r = 0.970, R(2) = 0.942 (P < 0.05) for dry and wet seasons, respectively. A repeated-measures t-test confirmed that the surface water quality varies significantly between the seasons (P < 0.05).
Increasing the statistical significance of entanglement detection in experiments
Energy Technology Data Exchange (ETDEWEB)
Jungnitsch, Bastian; Niekamp, Soenke; Kleinmann, Matthias; Guehne, Otfried [Institut fuer Quantenoptik und Quanteninformation, Innsbruck (Austria); Lu, He; Gao, Wei-Bo; Chen, Zeng-Bing [Hefei National Laboratory for Physical Sciences at Microscale and Department of Modern Physics, University of Science and Technology of China, Hefei (China); Chen, Yu-Ao; Pan, Jian-Wei [Hefei National Laboratory for Physical Sciences at Microscale and Department of Modern Physics, University of Science and Technology of China, Hefei (China); Physikalisches Institut, Universitaet Heidelberg (Germany)
2010-07-01
Entanglement is often verified by a violation of an inequality like a Bell inequality or an entanglement witness. Considerable effort has been devoted to the optimization of such inequalities in order to obtain a high violation. We demonstrate theoretically and experimentally that such an optimization does not necessarily lead to a better entanglement test, if the statistical error is taken into account. Theoretically, we show for different error models that reducing the violation of an inequality can improve the significance. We show this to be the case for an error model in which the variance of an observable is interpreted as its error and for the standard error model in photonic experiments. Specifically, we demonstrate that the Mermin inequality yields a Bell test which is statistically more significant than the Ardehali inequality in the case of a photonic four-qubit state that is close to a GHZ state. Experimentally, we observe this phenomenon in a four-photon experiment, testing the above inequalities for different levels of noise.
Reporting effect sizes as a supplement to statistical significance ...
African Journals Online (AJOL)
The purpose of the article is to review the statistical significance reporting practices in reading instruction studies and to provide guidelines for when to calculate and report effect sizes in educational research. A review of six readily accessible (online) and accredited journals publishing research on reading instruction ...
Your Chi-Square Test Is Statistically Significant: Now What?
Sharpe, Donald
2015-01-01
Applied researchers have employed chi-square tests for more than one hundred years. This paper addresses the question of how one should follow a statistically significant chi-square test result in order to determine the source of that result. Four approaches were evaluated: calculating residuals, comparing cells, ransacking, and partitioning. Data…
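The first of the four approaches named above, calculating residuals, can be illustrated with adjusted standardized residuals for a contingency table. The sketch below uses the textbook formula (observed minus expected, divided by the square root of expected × (1 − row share) × (1 − column share)); the table is made up and the function name is hypothetical, so this is not taken from the paper itself.

```python
import math


def adjusted_residuals(table):
    """Adjusted standardized residuals for an r x c contingency table.

    Cells with a large absolute residual (roughly beyond 2) are the ones
    contributing most to a significant chi-square result.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    residuals = []
    for i, row in enumerate(table):
        out_row = []
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            variance = (expected
                        * (1 - row_totals[i] / n)
                        * (1 - col_totals[j] / n))
            out_row.append((observed - expected) / math.sqrt(variance))
        residuals.append(out_row)
    return residuals


# Made-up 2 x 2 table.
for row in adjusted_residuals([[10, 20], [20, 10]]):
    print([round(value, 3) for value in row])
```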
Directory of Open Access Journals (Sweden)
Melissa Coulson
2010-07-01
A statistically significant result and a non-significant result may differ little, although significance status may tempt an interpretation of difference. Two studies are reported that compared interpretation of such results presented using null hypothesis significance testing (NHST) or confidence intervals (CIs). Authors of articles published in psychology, behavioural neuroscience, and medical journals were asked, via email, to interpret two fictitious studies that found similar results, one statistically significant, and the other non-significant. Responses from 330 authors varied greatly, but interpretation was generally poor, whether results were presented as CIs or using NHST. However, when interpreting CIs, respondents who mentioned NHST were 60% likely to conclude, unjustifiably, that the two results conflicted, whereas those who interpreted CIs without reference to NHST were 95% likely to conclude, justifiably, that the two results were consistent. Findings were generally similar for all three disciplines. An email survey of academic psychologists confirmed that CIs elicit better interpretations if NHST is not invoked. Improved statistical inference can result from encouragement of meta-analytic thinking and use of CIs but, for full benefit, such highly desirable statistical reform also requires that researchers interpret CIs without recourse to NHST.
Testing statistical significance scores of sequence comparison methods with structure similarity
Directory of Open Access Journals (Sweden)
Leunissen Jack AM
2006-10-01
Background: In the past years the Smith-Waterman sequence comparison algorithm has gained popularity due to improved implementations and rapidly increasing computing power. However, the quality and sensitivity of a database search is determined not only by the algorithm but also by the statistical significance testing for an alignment. The e-value is the most commonly used statistical validation method for sequence database searching. The CluSTr database and the Protein World database have been created using an alternative statistical significance test: a Z-score based on Monte-Carlo statistics. Several papers have described the superiority of the Z-score as compared to the e-value, using simulated data. We were interested in whether this could be validated when applied to existing, evolutionarily related protein sequences. Results: All experiments are performed on the ASTRAL SCOP database. The Smith-Waterman sequence comparison algorithm with both e-value and Z-score statistics is evaluated, using ROC, CVE and AP measures. The BLAST and FASTA algorithms are used as reference. We find that two out of three Smith-Waterman implementations with e-value are better at predicting structural similarities between proteins than the Smith-Waterman implementation with Z-score. SSEARCH especially has very high scores. Conclusion: The compute-intensive Z-score does not have a clear advantage over the e-value. The Smith-Waterman implementations give generally better results than their heuristic counterparts. We recommend using the SSEARCH algorithm combined with e-values for pairwise sequence comparisons.
Kim, Sung-Min; Choi, Yosoon
2017-06-18
To develop appropriate measures to prevent soil contamination in abandoned mining areas, an understanding of the spatial variation of the potentially toxic trace elements (PTEs) in the soil is necessary. For the purpose of effective soil sampling, this study uses hot spot analysis, which calculates a z-score based on the Getis-Ord Gi* statistic to identify a statistically significant hot spot sample. To constitute a statistically significant hot spot, a feature with a high value should also be surrounded by other features with high values. Using relatively cost- and time-effective portable X-ray fluorescence (PXRF) analysis, sufficient input data are acquired from the Busan abandoned mine and used for hot spot analysis. To calibrate the PXRF data, which have a relatively low accuracy, the PXRF analysis data are transformed using the inductively coupled plasma atomic emission spectrometry (ICP-AES) data. The transformed PXRF data of the Busan abandoned mine are classified into four groups according to their normalized content and z-scores: high content with a high z-score (HH), high content with a low z-score (HL), low content with a high z-score (LH), and low content with a low z-score (LL). The HL and LH cases may be due to measurement errors. Additional or complementary surveys are required for the areas surrounding these suspect samples or for significant hot spot areas. The soil sampling is conducted according to a four-phase procedure in which the hot spot analysis and proposed group classification method are employed to support the development of a sampling plan for the following phase. Overall, 30, 50, 80, and 100 samples are investigated and analyzed in phases 1-4, respectively. The method implemented in this case study may be utilized in the field for the assessment of statistically significant soil contamination and the identification of areas for which an additional survey is required.
Directory of Open Access Journals (Sweden)
Sung-Min Kim
2017-06-01
To develop appropriate measures to prevent soil contamination in abandoned mining areas, an understanding of the spatial variation of the potentially toxic trace elements (PTEs) in the soil is necessary. For the purpose of effective soil sampling, this study uses hot spot analysis, which calculates a z-score based on the Getis-Ord Gi* statistic to identify a statistically significant hot spot sample. To constitute a statistically significant hot spot, a feature with a high value should also be surrounded by other features with high values. Using relatively cost- and time-effective portable X-ray fluorescence (PXRF) analysis, sufficient input data are acquired from the Busan abandoned mine and used for hot spot analysis. To calibrate the PXRF data, which have a relatively low accuracy, the PXRF analysis data are transformed using the inductively coupled plasma atomic emission spectrometry (ICP-AES) data. The transformed PXRF data of the Busan abandoned mine are classified into four groups according to their normalized content and z-scores: high content with a high z-score (HH), high content with a low z-score (HL), low content with a high z-score (LH), and low content with a low z-score (LL). The HL and LH cases may be due to measurement errors. Additional or complementary surveys are required for the areas surrounding these suspect samples or for significant hot spot areas. The soil sampling is conducted according to a four-phase procedure in which the hot spot analysis and proposed group classification method are employed to support the development of a sampling plan for the following phase. Overall, 30, 50, 80, and 100 samples are investigated and analyzed in phases 1–4, respectively. The method implemented in this case study may be utilized in the field for the assessment of statistically significant soil contamination and the identification of areas for which an additional survey is required.
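The hot spot z-score and the four-group labelling described above can be sketched as follows. This is an illustration under stated assumptions, not the study's workflow: it uses the standard Getis-Ord Gi* formula with binary weights (1 for every sample within a fixed radius, including the sample itself), and the radius, the content cut-off and the z-score cut-off of 1.96 are hypothetical choices that the abstract does not specify.

```python
import math


def getis_ord_gi_star(points, values, radius):
    """Gi* z-score for every sample, with binary distance-band weights."""
    n = len(values)
    mean = sum(values) / n
    s = math.sqrt(sum(v * v for v in values) / n - mean * mean)
    z_scores = []
    for xi, yi in points:
        # Weight 1 for samples within the radius (the sample itself included).
        w = [1.0 if math.hypot(xi - xj, yi - yj) <= radius else 0.0
             for xj, yj in points]
        sw = sum(w)
        sw2 = sum(wi * wi for wi in w)
        numerator = sum(wi * v for wi, v in zip(w, values)) - mean * sw
        denominator = s * math.sqrt((n * sw2 - sw * sw) / (n - 1))
        z_scores.append(numerator / denominator if denominator > 0 else 0.0)
    return z_scores


def classify(value, z, value_cut, z_cut=1.96):
    """Two-letter label: content (H/L) followed by z-score (H/L)."""
    return (("H" if value >= value_cut else "L")
            + ("H" if z >= z_cut else "L"))


# Made-up transect: three high readings next to each other, then background.
points = [(float(x), 0.0) for x in range(10)]
values = [10, 10, 10, 1, 1, 1, 1, 1, 1, 1]
z = getis_ord_gi_star(points, values, radius=1.0)
print([classify(v, zi, value_cut=5) for v, zi in zip(values, z)])
```

Samples labelled HL or LH under such a scheme are the ones the abstract flags as possible measurement errors that merit a follow-up survey.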
Camps; Prevot
1996-08-09
The statistical characteristics of the local magnetic field of Earth during paleosecular variation, excursions, and reversals are described on the basis of a database that gathers the cleaned mean direction and average remanent intensity of 2741 lava flows that have erupted over the last 20 million years. A model consisting of a normally distributed axial dipole component plus an independent isotropic set of vectors with a Maxwellian distribution that simulates secular variation fits the range of geomagnetic fluctuations, in terms of both direction and intensity. This result suggests that the magnitude of secular variation vectors is independent of the magnitude of Earth's axial dipole moment and that the amplitude of secular variation is unchanged during reversals.
Statistical significance versus clinical relevance.
van Rijn, Marieke H C; Bech, Anneke; Bouyer, Jean; van den Brand, Jan A J G
2017-04-01
In March this year, the American Statistical Association (ASA) posted a statement on the correct use of P-values, in response to a growing concern that the P-value is commonly misused and misinterpreted. We aim to translate these warnings given by the ASA into a language more easily understood by clinicians and researchers without a deep background in statistics. Moreover, we intend to illustrate the limitations of P-values, even when used and interpreted correctly, and bring more attention to the clinical relevance of study findings using two recently reported studies as examples. We argue that P-values are often misinterpreted. A common mistake is saying that P < 0.05 means that the null hypothesis is false, and P ≥0.05 means that the null hypothesis is true. The correct interpretation of a P-value of 0.05 is that if the null hypothesis were indeed true, a similar or more extreme result would occur 5% of the times upon repeating the study in a similar sample. In other words, the P-value informs about the likelihood of the data given the null hypothesis and not the other way around. A possible alternative related to the P-value is the confidence interval (CI). It provides more information on the magnitude of an effect and the imprecision with which that effect was estimated. However, there is no magic bullet to replace P-values and stop erroneous interpretation of scientific results. Scientists and readers alike should make themselves familiar with the correct, nuanced interpretation of statistical tests, P-values and CIs. © The Author 2017. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.
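The interpretation given above (under a true null hypothesis, a result at least as extreme as the P = 0.05 threshold occurs in about 5% of repeated studies) can be checked with a small simulation. The sketch below is an illustration with assumed settings: two groups drawn from the same standard normal distribution, a z-test with known unit variance, and arbitrary choices for the number of studies, the group size and the seed.

```python
import math
import random


def two_sided_p(z):
    """Two-sided P-value for a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2.0))


def false_positive_rate(n_studies=4000, n_per_group=30, alpha=0.05, seed=1):
    """Fraction of null studies (no true effect) that reach P < alpha."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_studies):
        # Both groups come from the same distribution: the null is true.
        a = [rng.gauss(0.0, 1.0) for _ in range(n_per_group)]
        b = [rng.gauss(0.0, 1.0) for _ in range(n_per_group)]
        diff = sum(a) / n_per_group - sum(b) / n_per_group
        z = diff / math.sqrt(2.0 / n_per_group)
        if two_sided_p(z) < alpha:
            rejections += 1
    return rejections / n_studies


print(false_positive_rate())
```

The printed fraction should land close to 0.05. It says how often such data arise when the null hypothesis is true, not how likely the null hypothesis is given the data, which is the distinction the article stresses.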
Statistical process control in nursing research.
Polit, Denise F; Chaboyer, Wendy
2012-02-01
In intervention studies in which randomization to groups is not possible, researchers typically use quasi-experimental designs. Time series designs are strong quasi-experimental designs but are seldom used, perhaps because of technical and analytic hurdles. Statistical process control (SPC) is an alternative analytic approach to testing hypotheses about intervention effects using data collected over time. SPC, like traditional statistical methods, is a tool for understanding variation and involves the construction of control charts that distinguish between normal, random fluctuations (common cause variation), and statistically significant special cause variation that can result from an innovation. The purpose of this article is to provide an overview of SPC and to illustrate its use in a study of a nursing practice improvement intervention. Copyright © 2011 Wiley Periodicals, Inc.
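The control-chart idea summarised in this abstract can be sketched as follows. This is a minimal illustration of an individuals (XmR) chart, not the procedure used in the article: the function names and the example data are hypothetical, and only the single "point outside the 3-sigma limits" rule is applied.

```python
def control_limits(baseline):
    """Lower limit, centre line and upper limit for an individuals chart.

    Sigma is estimated as the mean moving range divided by d2 = 1.128,
    the usual constant for moving ranges of two consecutive points.
    """
    if len(baseline) < 2:
        raise ValueError("need at least two baseline observations")
    centre = sum(baseline) / len(baseline)
    moving_ranges = [abs(b - a) for a, b in zip(baseline, baseline[1:])]
    sigma = (sum(moving_ranges) / len(moving_ranges)) / 1.128
    return centre - 3 * sigma, centre, centre + 3 * sigma


def special_cause_points(baseline, new_points):
    """Return (index, value) pairs of new points outside the baseline limits."""
    lower, _, upper = control_limits(baseline)
    return [(i, x) for i, x in enumerate(new_points) if x < lower or x > upper]


if __name__ == "__main__":
    # Hypothetical pre-intervention measurements and two later observations.
    baseline = [10, 11, 9, 10, 12, 10, 9, 11]
    print(control_limits(baseline))
    print(special_cause_points(baseline, [10, 18]))
```

Points inside the limits are treated as common cause variation; a point outside them is a candidate special cause signal worth investigating.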
Statistical significance of epidemiological data. Seminar: Evaluation of epidemiological studies
International Nuclear Information System (INIS)
Weber, K.H.
1993-01-01
In stochastic damages, the numbers of events, e.g. the persons who are affected by or have died of cancer, and thus the relative frequencies (incidence or mortality) are binomially distributed random variables. Their statistical fluctuations can be characterized by confidence intervals. For epidemiologic questions, especially for the analysis of stochastic damages in the low dose range, the following issues are interesting: - Is a sample (a group of persons) with a definite observed damage frequency part of the whole population? - Is an observed frequency difference between two groups of persons random or statistically significant? - Is an observed increase or decrease of the frequencies with increasing dose random or statistically significant and how large is the regression coefficient (= risk coefficient) in this case? These problems can be solved by statistical tests. So-called distribution-free tests and tests which are not bound to the supposition of normal distribution are of particular interest, such as: - χ² independence test (test in contingency tables); - Fisher-Yates-test; - trend test according to Cochran; - rank correlation test given by Spearman. These tests are explained in terms of selected epidemiologic data, e.g. of leukaemia clusters, of the cancer mortality of the Japanese A-bomb survivors especially in the low dose range as well as on the sample of the cancer mortality in the high background area in Yangjiang (China). (orig.) [de
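Two of the tools named in this abstract, a confidence interval for a binomial frequency and the Fisher-Yates (Fisher exact) test for a 2x2 table, can be sketched with the standard library alone. This is a minimal illustration under stated assumptions: the Wilson interval is one of several possible binomial intervals and is not necessarily the one used in the seminar, and the function names are hypothetical.

```python
from math import comb, sqrt


def wilson_interval(events, n, z=1.96):
    """Approximate 95% Wilson confidence interval for a binomial proportion."""
    p = events / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact P-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total = row1 + row2

    def prob(k):
        # Hypergeometric probability of k events in the first row, margins fixed.
        return comb(row1, k) * comb(row2, col1 - k) / comb(total, col1)

    p_obs = prob(a)
    k_min, k_max = max(0, col1 - row2), min(row1, col1)
    # Sum every table that is no more probable than the observed one.
    return sum(prob(k) for k in range(k_min, k_max + 1)
               if prob(k) <= p_obs * (1 + 1e-9))


if __name__ == "__main__":
    print(wilson_interval(5, 100))             # 5 cases among 100 persons
    print(fisher_exact_two_sided(3, 1, 1, 3))  # small two-group comparison
```

The exact test needs no normality assumption, which is what makes it attractive for the small counts typical of the low dose range.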
Huttary, Rudolf; Goubergrits, Leonid; Schütte, Christof; Bernhard, Stefan
2017-08-01
It has not yet been possible to obtain modeling approaches suitable for covering a wide range of real world scenarios in cardiovascular physiology because many of the system parameters are uncertain or even unknown. Natural variability and statistical variation of cardiovascular system parameters in healthy and diseased conditions are characteristic features for understanding cardiovascular diseases in more detail. This paper presents SISCA, a novel software framework for cardiovascular system modeling and its MATLAB implementation. The framework defines a multi-model statistical ensemble approach for dimension reduced, multi-compartment models and focuses on statistical variation, system identification and patient-specific simulation based on clinical data. We also discuss a data-driven modeling scenario as a use case example. The regarded dataset originated from routine clinical examinations and comprised typical pre and post surgery clinical data from a patient diagnosed with coarctation of aorta. We conducted patient and disease specific pre/post surgery modeling by adapting a validated nominal multi-compartment model with respect to structure and parametrization using metadata and MRI geometry. In both models, the simulation reproduced measured pressures and flows fairly well with respect to stenosis and stent treatment and by pre-treatment cross stenosis phase shift of the pulse wave. However, with post-treatment data showing unrealistic phase shifts and other more obvious inconsistencies within the dataset, the methods and results we present suggest that conditioning and uncertainty management of routine clinical data sets needs significantly more attention to obtain reasonable results in patient-specific cardiovascular modeling. Copyright © 2017 Elsevier Ltd. All rights reserved.
Evolutionary significance of epigenetic variation
Richards, C.L.; Verhoeven, K.J.F.; Bossdorf, O.; Wendel, J.F.; Greilhuber, J.; Dolezel, J.; Leitch, I.J.
2012-01-01
Several chapters in this volume demonstrate how epigenetic work at the molecular level over the last few decades has revolutionized our understanding of genome function and developmental biology. However, epigenetic processes not only further our understanding of variation and regulation at the
Statistical determination of significant curved I-girder bridge seismic response parameters
Seo, Junwon
2013-06-01
Curved steel bridges are commonly used at interchanges in transportation networks and more of these structures continue to be designed and built in the United States. Though the use of these bridges continues to increase in locations that experience high seismicity, the effects of curvature and other parameters on their seismic behaviors have been neglected in current risk assessment tools. These tools can evaluate the seismic vulnerability of a transportation network using fragility curves. One critical component of fragility curve development for curved steel bridges is the completion of sensitivity analyses that help identify influential parameters related to their seismic response. In this study, an accessible inventory of existing curved steel girder bridges located primarily in the Mid-Atlantic United States (MAUS) was used to establish statistical characteristics used as inputs for a seismic sensitivity study. Critical seismic response quantities were captured using 3D nonlinear finite element models. Influential parameters from these quantities were identified using statistical tools that incorporate experimental Plackett-Burman Design (PBD), which included Pareto optimal plots and prediction profiler techniques. The findings revealed that the potential variation in the influential parameters included number of spans, radius of curvature, maximum span length, girder spacing, and cross-frame spacing. These parameters showed varying levels of influence on the critical bridge response.
Statistical Significance and Effect Size: Two Sides of a Coin.
Fan, Xitao
This paper suggests that statistical significance testing and effect size are two sides of the same coin; they complement each other, but do not substitute for one another. Good research practice requires that both should be taken into consideration to make sound quantitative decisions. A Monte Carlo simulation experiment was conducted, and a…
Terán-Hernández, Mónica; Ramis-Prieto, Rebeca; Calderón-Hernández, Jaqueline; Garrocho-Rangel, Carlos Félix; Campos-Alanís, Juan; Ávalos-Lozano, José Antonio; Aguilar-Robledo, Miguel
2016-09-29
Worldwide, Cervical Cancer (CC) is the fourth most common type of cancer and cause of death in women. It is a significant public health problem, especially in low and middle-income/Gross Domestic Product (GDP) countries. In the past decade, several studies of CC have been published that identify the main modifiable and non-modifiable CC risk factors for Mexican women. However, there are no studies that attempt to explain the residual spatial variation in CC incidence in Mexico, i.e. spatial variation that cannot be ascribed to known, spatially varying risk factors. This paper uses a spatial statistical methodology that takes into account spatial variation in socio-economic factors and accessibility to health services, whilst allowing for residual, unexplained spatial variation in risk. To describe residual spatial variations in CC risk, we used generalised linear mixed models (GLMM) with both spatially structured and unstructured random effects, using a Bayesian approach to inference. The highest risk is concentrated in the southeast, where the Matlapa and Aquismón municipalities register excessive risk, with posterior probabilities greater than 0.8. The lack of coverage of Cervical Cancer-Screening Programme (CCSP) (RR 1.17, 95 % CI 1.12-1.22), Marginalisation Index (RR 1.05, 95 % CI 1.03-1.08), and lack of accessibility to health services (RR 1.01, 95 % CI 1.00-1.03) were significant covariates. There are substantial differences between municipalities, with high-risk areas mainly in low-resource areas lacking accessibility to health services for CC. Our results clearly indicate the presence of spatial patterns, and the relevance of the spatial analysis for public health intervention. Ignoring the spatial variability means continuing a public policy that does not tackle deficiencies in its national CCSP and continuing to disadvantage and disempower Mexican women in regard to their health care.
Papageorgiou, Spyridon N; Kloukos, Dimitrios; Petridis, Haralampos; Pandis, Nikolaos
2015-10-01
To assess the hypothesis that there is excessive reporting of statistically significant studies published in prosthodontic and implantology journals, which could indicate selective publication. The last 30 issues of 9 journals in prosthodontics and implant dentistry were hand-searched for articles with statistical analyses. The percentages of significant and non-significant results were tabulated by parameter of interest. Univariable/multivariable logistic regression analyses were applied to identify possible predictors of reporting statistically significant findings. The results of this study were compared with similar studies in dentistry with random-effects meta-analyses. Of the 2323 included studies, 71% reported statistically significant results, with the significant results ranging from 47% to 86%. Multivariable modeling identified that geographical area and involvement of a statistician were predictors of statistically significant results. Compared to interventional studies, the odds that in vitro and observational studies would report statistically significant results were increased by 1.20 times (OR: 2.20, 95% CI: 1.66-2.92) and 0.35 times (OR: 1.35, 95% CI: 1.05-1.73), respectively. The probability of statistically significant results from randomized controlled trials was significantly lower compared to various study designs (difference: 30%, 95% CI: 11-49%). Likewise the probability of statistically significant results in prosthodontics and implant dentistry was lower compared to other dental specialties, but this result did not reach statistical significance (P>0.05). The majority of studies identified in the fields of prosthodontics and implant dentistry presented statistically significant results. The same trend existed in publications of other specialties in dentistry. Copyright © 2015 Elsevier Ltd. All rights reserved.
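The odds ratios quoted in this abstract relate to the "increased by" figures through a simple subtraction: an OR of 2.20 means the odds are 1.20 times higher than in the reference group. The following minimal sketch shows that relation and a Wald confidence interval for an unadjusted odds ratio. The cell counts in the usage example are hypothetical, and the study itself used logistic regression, so this is only the simplest two-group analogue.

```python
from math import exp, log, sqrt


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald 95% CI for the 2x2 table [[a, b], [c, d]].

    Rows are the two study-design groups; columns are significant and
    non-significant results.
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for the Wald interval")
    or_ = (a * d) / (b * c)
    se = sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # standard error of log(OR)
    return or_, exp(log(or_) - z * se), exp(log(or_) + z * se)


def relative_increase(or_):
    """Increase in odds relative to the reference group (OR 2.20 -> 1.20)."""
    return or_ - 1.0


if __name__ == "__main__":
    # Hypothetical counts, not taken from the study.
    print(odds_ratio_ci(20, 10, 10, 20))
    print(relative_increase(2.20), relative_increase(1.35))
```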
Significant Statistics: Viewed with a Contextual Lens
Tait-McCutcheon, Sandi
2010-01-01
This paper examines the pedagogical and organisational changes three lead teachers made to their statistics teaching and learning programs. The lead teachers posed the research question: What would the effect of contextually integrating statistical investigations and literacies into other curriculum areas be on student achievement? By finding the…
Kellerer-Pirklbauer, Andreas
2016-04-01
Longer data series (e.g. >10 a) of ground temperatures in alpine regions are helpful to improve the understanding regarding the effects of present climate change on distribution and thermal characteristics of seasonal frost- and permafrost-affected areas. Beginning in 2004 - and more intensively since 2006 - a permafrost and seasonal frost monitoring network was established in Central and Eastern Austria by the University of Graz. This network consists of c.60 ground temperature (surface and near-surface) monitoring sites which are located at 1922-3002 m a.s.l., at latitude 46°55'-47°22'N and at longitude 12°44'-14°41'E. These data allow conclusions about general ground thermal conditions, potential permafrost occurrence, trend during the observation period, and regional pattern of changes. Calculations and analyses of several different temperature-related parameters were accomplished. At an annual scale a region-wide statistically significant warming during the observation period was revealed by e.g. an increase in mean annual temperature values (mean, maximum) or the significant lowering of the surface frost number (F+). At a seasonal scale no significant trend of any temperature-related parameter was in most cases revealed for spring (MAM) and autumn (SON). Winter (DJF) shows only a weak warming. In contrast, the summer (JJA) season reveals in general a significant warming as confirmed by several different temperature-related parameters such as e.g. mean seasonal temperature, number of thawing degree days, number of freezing degree days, or days without night frost. On a monthly basis August shows the statistically most robust and strongest warming of all months, although regional differences occur. Despite the fact that the general ground temperature warming during the last decade is confirmed by the field data in the study region, complications in trend analyses arise by temperature anomalies (e.g. warm winter 2006/07) or substantial variations in the winter
"What If" Analyses: Ways to Interpret Statistical Significance Test Results Using EXCEL or "R"
Ozturk, Elif
2012-01-01
The present paper aims to review two motivations to conduct "what if" analyses using Excel and "R" to understand the statistical significance tests through the sample size context. "What if" analyses can be used to teach students what statistical significance tests really do and in applied research either prospectively to estimate what sample size…
DEFF Research Database (Denmark)
Engsted, Tom
I comment on the controversy between McCloskey & Ziliak and Hoover & Siegler on statistical versus economic significance, in the March 2008 issue of the Journal of Economic Methodology. I argue that while McCloskey & Ziliak are right in emphasizing 'real error', i.e. non-sampling error that cannot...... be eliminated through specification testing, they fail to acknowledge those areas in economics, e.g. rational expectations macroeconomics and asset pricing, where researchers clearly distinguish between statistical and economic significance and where statistical testing plays a relatively minor role in model...
Seasonal variations of volcanic eruption frequencies
Stothers, Richard B.
1989-01-01
Do volcanic eruptions have a tendency to occur more frequently in the months of May and June? Some past evidence suggests that they do. The present study, based on the new eruption catalog of Simkin et al. (1981), investigates the monthly statistics of the largest eruptions, grouped according to explosive magnitude, geographical latitude, and year. At the 2-delta level, no month-to-month variations in eruption frequency are found to be statistically significant. Examination of previously published month-to-month variations suggests that they, too, are not statistically significant. It is concluded that volcanism, at least averaged over large portions of the globe, is probably not periodic on a seasonal or annual time scale.
Nam, Sungsik; Hasna, Mazen Omar; Alouini, Mohamed-Slim
2011-01-01
-interference on GSC RAKE receivers. The major difficulty in these problems is to derive some joint statistics of ordered exponential variates. With this motivation in mind, we capitalize in this paper on some new order statistics results to derive exact closed
Wilkinson, Michael
2014-03-01
Decisions about support for predictions of theories in light of data are made using statistical inference. The dominant approach in sport and exercise science is the Neyman-Pearson (N-P) significance-testing approach. When applied correctly it provides a reliable procedure for making dichotomous decisions for accepting or rejecting zero-effect null hypotheses with known and controlled long-run error rates. Type I and type II error rates must be specified in advance and the latter controlled by conducting an a priori sample size calculation. The N-P approach does not provide the probability of hypotheses or indicate the strength of support for hypotheses in light of data, yet many scientists believe it does. Outcomes of analyses allow conclusions only about the existence of non-zero effects, and provide no information about the likely size of true effects or their practical/clinical value. Bayesian inference can show how much support data provide for different hypotheses, and how personal convictions should be altered in light of data, but the approach is complicated by formulating probability distributions about prior subjective estimates of population effects. A pragmatic solution is magnitude-based inference, which allows scientists to estimate the true magnitude of population effects and how likely they are to exceed an effect magnitude of practical/clinical importance, thereby integrating elements of subjective Bayesian-style thinking. While this approach is gaining acceptance, progress might be hastened if scientists appreciate the shortcomings of traditional N-P null hypothesis significance testing.
Directory of Open Access Journals (Sweden)
Zhang Zhang
2012-03-01
Background: Genetic mutation, selective pressure for translational efficiency and accuracy, level of gene expression, and protein function through natural selection are all believed to lead to codon usage bias (CUB). Therefore, informative measurement of CUB is of fundamental importance to making inferences regarding gene function and genome evolution. However, extant measures of CUB have not fully accounted for the quantitative effect of background nucleotide composition and have not statistically evaluated the significance of CUB in sequence analysis. Results: Here we propose a novel measure--Codon Deviation Coefficient (CDC)--that provides an informative measurement of CUB and its statistical significance without requiring any prior knowledge. Unlike previous measures, CDC estimates CUB by accounting for background nucleotide compositions tailored to codon positions and adopts bootstrapping to assess the statistical significance of CUB for any given sequence. We evaluate CDC by examining its effectiveness on simulated sequences and empirical data and show that CDC outperforms extant measures by achieving a more informative estimation of CUB and its statistical significance. Conclusions: As validated by both simulated and empirical data, CDC provides a highly informative quantification of CUB and its statistical significance, useful for determining comparative magnitudes and patterns of biased codon usage for genes or genomes with diverse sequence compositions.
Laurinaviciene, Aida; Plancoulaine, Benoit; Baltrusaityte, Indra; Meskauskas, Raimundas; Besusparis, Justinas; Lesciute-Krilaviciene, Daiva; Raudeliunas, Darius; Iqbal, Yasir; Herlin, Paulette; Laurinavicius, Arvydas
2014-01-01
Digital immunohistochemistry (IHC) is one of the most promising applications brought by new generation image analysis (IA). While conventional IHC staining quality is monitored by semi-quantitative visual evaluation of tissue controls, IA may require more sensitive measurement. We designed an automated system to digitally monitor IHC multi-tissue controls, based on SQL-level integration of laboratory information system with image and statistical analysis tools. Consecutive sections of TMA containing 10 cores of breast cancer tissue were used as tissue controls in routine Ki67 IHC testing. Ventana slide label barcode ID was sent to the LIS to register the serial section sequence. The slides were stained and scanned (Aperio ScanScope XT), IA was performed by the Aperio/Leica Colocalization and Genie Classifier/Nuclear algorithms. SQL-based integration ensured automated statistical analysis of the IA data by the SAS Enterprise Guide project. Factor analysis and plot visualizations were performed to explore slide-to-slide variation of the Ki67 IHC staining results in the control tissue. Slide-to-slide intra-core IHC staining analysis revealed rather significant variation of the variables reflecting the sample size, while Brown and Blue Intensity were relatively stable. To further investigate this variation, the IA results from the 10 cores were aggregated to minimize tissue-related variance. Factor analysis revealed association between the variables reflecting the sample size detected by IA and Blue Intensity. Since the main feature to be extracted from the tissue controls was staining intensity, we further explored the variation of the intensity variables in the individual cores. MeanBrownBlue Intensity ((Brown+Blue)/2) and DiffBrownBlue Intensity (Brown-Blue) were introduced to better contrast the absolute intensity and the colour balance variation in each core; relevant factor scores were extracted. Finally, tissue-related factors of IHC staining variance were
Systematic reviews of anesthesiologic interventions reported as statistically significant
DEFF Research Database (Denmark)
Imberger, Georgina; Gluud, Christian; Boylan, John
2015-01-01
statistically significant meta-analyses of anesthesiologic interventions, we used TSA to estimate power and imprecision in the context of sparse data and repeated updates. METHODS: We conducted a search to identify all systematic reviews with meta-analyses that investigated an intervention that may......: From 11,870 titles, we found 682 systematic reviews that investigated anesthesiologic interventions. In the 50 sampled meta-analyses, the median number of trials included was 8 (interquartile range [IQR], 5-14), the median number of participants was 964 (IQR, 523-1736), and the median number...
Xu, Kuan-Man
2006-01-01
A new method is proposed to compare statistical differences between summary histograms, which are the histograms summed over a large ensemble of individual histograms. It consists of choosing a distance statistic for measuring the difference between summary histograms and using a bootstrap procedure to calculate the statistical significance level. Bootstrapping is an approach to statistical inference that makes few assumptions about the underlying probability distribution that describes the data. Three distance statistics are compared in this study. They are the Euclidean distance, the Jeffries-Matusita distance and the Kuiper distance. The data used in testing the bootstrap method are satellite measurements of cloud systems called cloud objects. Each cloud object is defined as a contiguous region/patch composed of individual footprints or fields of view. A histogram of measured values over footprints is generated for each parameter of each cloud object and then summary histograms are accumulated over all individual histograms in a given cloud-object size category. The results of statistical hypothesis tests using all three distances as test statistics are generally similar, indicating the validity of the proposed method. The Euclidean distance is determined to be most suitable after comparing the statistical tests of several parameters with distinct probability distributions among three cloud-object size categories. Impacts on the statistical significance levels resulting from differences in the total lengths of satellite footprint data between two size categories are also discussed.
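The procedure outlined in this abstract (a distance statistic between two summary histograms, with a bootstrap to obtain its significance level) can be sketched as follows. This is a minimal illustration under stated assumptions, not the author's implementation: the resampling scheme (pooling the individual histograms and redrawing both groups with replacement under the null hypothesis of no difference), the function names and the toy data are all hypothetical.

```python
import random
from math import sqrt


def summary_histogram(histograms):
    """Sum individual histograms bin by bin and normalise to unit total."""
    totals = [sum(bins) for bins in zip(*histograms)]
    grand = sum(totals)
    return [t / grand for t in totals]


def euclidean_distance(h1, h2):
    """Euclidean distance between two histograms with identical binning."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(h1, h2)))


def bootstrap_p_value(group_a, group_b, n_boot=999, seed=0):
    """Bootstrap significance level of the distance between summary histograms."""
    rng = random.Random(seed)
    observed = euclidean_distance(summary_histogram(group_a),
                                  summary_histogram(group_b))
    pooled = list(group_a) + list(group_b)
    exceed = 0
    for _ in range(n_boot):
        # Redraw both groups from the pooled ensemble (null: no difference).
        res_a = [rng.choice(pooled) for _ in group_a]
        res_b = [rng.choice(pooled) for _ in group_b]
        d = euclidean_distance(summary_histogram(res_a), summary_histogram(res_b))
        if d >= observed - 1e-12:
            exceed += 1
    return (exceed + 1) / (n_boot + 1)


if __name__ == "__main__":
    # Toy ensembles of three-bin histograms, purely for illustration.
    small = [[10, 0, 0]] * 5
    large = [[0, 0, 10]] * 5
    print(bootstrap_p_value(small, large))
```

Swapping `euclidean_distance` for another distance function is the only change needed to try a different test statistic within the same resampling loop.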
Black, Joshua A.; Knowles, Peter J.
2018-06-01
The performance of quasi-variational coupled-cluster (QV) theory applied to the calculation of activation and reaction energies has been investigated. A statistical analysis of results obtained for six different sets of reactions has been carried out, and the results have been compared to those from standard single-reference methods. In general, the QV methods lead to increased activation energies and larger absolute reaction energies compared to those obtained with traditional coupled-cluster theory.
P-Value, a true test of statistical significance? a cautionary note ...
African Journals Online (AJOL)
While it's not the intention of the founders of significance testing and hypothesis testing to have the two ideas intertwined as if they are complementary, the inconvenient marriage of the two practices into one coherent, convenient, incontrovertible and misinterpreted practice has dotted our standard statistics textbooks and ...
Zhang, Zhang
2012-03-22
Background: Genetic mutation, selective pressure for translational efficiency and accuracy, level of gene expression, and protein function through natural selection are all believed to lead to codon usage bias (CUB). Therefore, informative measurement of CUB is of fundamental importance to making inferences regarding gene function and genome evolution. However, extant measures of CUB have not fully accounted for the quantitative effect of background nucleotide composition and have not statistically evaluated the significance of CUB in sequence analysis. Results: Here we propose a novel measure--Codon Deviation Coefficient (CDC)--that provides an informative measurement of CUB and its statistical significance without requiring any prior knowledge. Unlike previous measures, CDC estimates CUB by accounting for background nucleotide compositions tailored to codon positions and adopts the bootstrapping to assess the statistical significance of CUB for any given sequence. We evaluate CDC by examining its effectiveness on simulated sequences and empirical data and show that CDC outperforms extant measures by achieving a more informative estimation of CUB and its statistical significance. Conclusions: As validated by both simulated and empirical data, CDC provides a highly informative quantification of CUB and its statistical significance, useful for determining comparative magnitudes and patterns of biased codon usage for genes or genomes with diverse sequence compositions. 2012 Zhang et al; licensee BioMed Central Ltd.
Interpreting Statistical Significance Test Results: A Proposed New "What If" Method.
Kieffer, Kevin M.; Thompson, Bruce
As the 1994 publication manual of the American Psychological Association emphasized, "p" values are affected by sample size. As a result, it can be helpful to interpret the results of statistical significance tests in a sample size context by conducting so-called "what if" analyses. However, these methods can be inaccurate…
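The dependence of "p" on sample size that motivates "what if" analyses can be illustrated with a short calculation. The following is a minimal sketch, assuming a one-sample two-sided z-test and a fixed standardised effect size; it is not the method proposed in the paper, and the function name and the chosen sample sizes are hypothetical.

```python
from statistics import NormalDist


def what_if_p_values(effect_size, sample_sizes):
    """Two-sided P-value of a one-sample z-test for a fixed standardised
    effect size, recomputed for each hypothetical sample size."""
    results = {}
    for n in sample_sizes:
        z = abs(effect_size) * n ** 0.5  # the test statistic grows with sqrt(n)
        results[n] = 2.0 * (1.0 - NormalDist().cdf(z))
    return results


if __name__ == "__main__":
    # Same effect (0.2 standard deviations), three hypothetical sample sizes.
    for n, p in what_if_p_values(0.2, [25, 100, 400]).items():
        print(n, round(p, 4))
```

With the effect held constant, the result moves from non-significant at n = 25 to significant at n = 100, which is the point of reading "p" in a sample size context.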
Brouwer, D.; Meijer, R.R.; Zevalkink, D.J.
2013-01-01
Several researchers have emphasized that item response theory (IRT)-based methods should be preferred over classical approaches in measuring change for individual patients. In the present study we discuss and evaluate the use of IRT-based statistics to measure statistically significant individual
Genetic variation and significance of hepatitis B surface antigen
Directory of Open Access Journals (Sweden)
ZHANG Zhenhua
2013-11-01
Hepatitis B virus (HBV) is prone to genetic variation because there is reverse transcription in the process of HBV replication. The gene mutation of hepatitis B surface antigen may affect clinical diagnosis of HBV infection, viral replication, and vaccine effect. The current research and existing problems are discussed from the following aspects: the mechanism and biological and clinical significance of S gene mutation. Most previous studies focused on S gene alone, so S gene should be considered as part of HBV DNA in the future research on S gene mutation.
Fidalgo, Angel M.; Alavi, Seyed Mohammad; Amirian, Seyed Mohammad Reza
2014-01-01
This study examines three controversial aspects in differential item functioning (DIF) detection by logistic regression (LR) models: first, the relative effectiveness of different analytical strategies for detecting DIF; second, the suitability of the Wald statistic for determining the statistical significance of the parameters of interest; and…
Causes and significance of variation in mammalian basal metabolism.
Raichlen, David A; Gordon, Adam D; Muchlinski, Magdalena N; Snodgrass, J Josh
2010-02-01
Mammalian basal metabolic rates (BMR) increase with body mass, which explains approximately 95% of the variation in BMR. However, at a given mass, there remains a large amount of variation in BMR. While many researchers suggest that the overall scaling of BMR with body mass is due to physiological constraints, variation at a given body mass may provide clues as to how selection acts on BMR. Here, we examine this variation in BMR in a broad sample of mammals and we test the hypothesis that, across mammals, body composition explains differences in BMR at a given body mass. Variation in BMR is strongly correlated with variation in muscle mass, and both of these variables are correlated with latitude and ambient temperature. These results suggest that selection alters BMR in response to thermoregulatory pressures, and that selection uses muscle mass as a means to generate this variation.
DEFF Research Database (Denmark)
Jakobsen, Janus Christian; Wetterslev, Jorn; Winkel, Per
2014-01-01
BACKGROUND: Thresholds for statistical significance when assessing meta-analysis results are being insufficiently demonstrated by traditional 95% confidence intervals and P-values. Assessment of intervention effects in systematic reviews with meta-analysis deserves greater rigour. METHODS......: Methodologies for assessing statistical and clinical significance of intervention effects in systematic reviews were considered. Balancing simplicity and comprehensiveness, an operational procedure was developed, based mainly on The Cochrane Collaboration methodology and the Grading of Recommendations...... Assessment, Development, and Evaluation (GRADE) guidelines. RESULTS: We propose an eight-step procedure for better validation of meta-analytic results in systematic reviews (1) Obtain the 95% confidence intervals and the P-values from both fixed-effect and random-effects meta-analyses and report the most...
Logsdon, Benjamin A.; Carty, Cara L.; Reiner, Alexander P.; Dai, James Y.; Kooperberg, Charles
2012-01-01
Motivation: For many complex traits, including height, the majority of variants identified by genome-wide association studies (GWAS) have small effects, leaving a significant proportion of the heritable variation unexplained. Although many penalized multiple regression methodologies have been proposed to increase the power to detect associations for complex genetic architectures, they generally lack mechanisms for false-positive control and diagnostics for model over-fitting. Our methodology is the first penalized multiple regression approach that explicitly controls Type I error rates and provides model over-fitting diagnostics through a novel normally distributed statistic defined for every marker within the GWAS, based on results from a variational Bayes spike regression algorithm. Results: We compare the performance of our method to the lasso and single marker analysis on simulated data and demonstrate that our approach has superior performance in terms of power and Type I error control. In addition, using the Women's Health Initiative (WHI) SNP Health Association Resource (SHARe) GWAS of African-Americans, we show that our method has power to detect additional novel associations with body height. These findings replicate by reaching a stringent cutoff of marginal association in a larger cohort. Availability: An R-package, including an implementation of our variational Bayes spike regression (vBsr) algorithm, is available at http://kooperberg.fhcrc.org/soft.html. Contact: blogsdon@fhcrc.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22563072
Directory of Open Access Journals (Sweden)
G. M. J. HASAN
2014-10-01
Full Text Available Climate, one of the major controlling factors for the well-being of the world's inhabitants, has been changing in accordance with natural forcing and man-made activities. Bangladesh, one of the most densely populated countries in the world, is under threat from climate change caused by excessive use or abuse of ecology and natural resources. This study examines rainfall patterns and their associated changes in the north-eastern part of Bangladesh, mainly Sylhet city, through statistical analysis of daily rainfall data for the period 1957 - 2006. A good correlation was observed between the monthly mean and daily maximum rainfall. A linear regression analysis of the data is found to be significant for all the months. Some key statistical parameters, such as the mean values of the Coefficient of Variability (CV), Relative Variability (RV) and Percentage Inter-annual Variability (PIV), have been studied and found to be at variance. Monthly, yearly and seasonal variation of rainy days was also analysed to check for any significant changes.
This paper describes variations in the estrogenic potency of effluent from a "model" wastewater treatment plant in Duluth, MN, and explores the significance of these variations relative to sampling approaches for monitoring effluents and their toxicity to fish.
Specific Gravity Variation in a Lower Mississippi Valley Cottonwood Population
R. E. Farmer; J. R. Wilcox
1966-01-01
Specific gravity varied from 0.32 to 0.46, averaging 0.38. Most of the variation was associated with individual trees; samples within locations accounted for a smaller, but statistically significant, portion of the variation. Variation between locations was not significant. It was concluded that individual high-density trees should be sought throughout the...
International Nuclear Information System (INIS)
Ghazimirsaied, Ahmad; Koch, Charles Robert
2012-01-01
Highlights: ► Misfire reduction in a combustion engine based on chaotic theory methods. ► Chaotic theory analysis of cyclic variation of a HCCI engine near misfire. ► Symbol sequence approach is used to predict ignition timing one cycle-ahead. ► Prediction is combined with feedback control to lower HCCI combustion variation. ► Feedback control extends the HCCI operating range into the misfire region. -- Abstract: Cyclic variation of a Homogeneous Charge Compression Ignition (HCCI) engine near misfire is analyzed using chaotic theory methods and feedback control is used to stabilize high cyclic variations. Variation of consecutive cycles of θ Pmax (the crank angle of maximum cylinder pressure over an engine cycle) for a Primary Reference Fuel engine is analyzed near misfire operation for five test points with similar conditions but different octane numbers. The return map of the time series of θ Pmax at each combustion cycle reveals the deterministic and random portions of the dynamics near misfire for this HCCI engine. A symbol-statistic approach is used to predict θ Pmax one cycle-ahead. Predicted θ Pmax has similar dynamical behavior to the experimental measurements. Based on this cycle ahead prediction, and using fuel octane as the input, feedback control is used to stabilize the instability of θ Pmax variations at this engine condition near misfire.
Handbook of Spatial Statistics
Gelfand, Alan E
2010-01-01
Offers an introduction detailing the evolution of the field of spatial statistics. This title focuses on the three main branches of spatial statistics: continuous spatial variation (point referenced data); discrete spatial variation, including lattice and areal unit data; and, spatial point patterns.
Diedrich, Alice; Schlegl, Sandra; Greetfeld, Martin; Fumi, Markus; Voderholzer, Ulrich
2018-03-01
This study examines the statistical and clinical significance of symptom changes during an intensive inpatient treatment program with a strong psychotherapeutic focus for individuals with severe bulimia nervosa. 295 consecutively admitted bulimic patients were administered the Structured Interview for Anorexic and Bulimic Syndromes-Self-Rating (SIAB-S), the Eating Disorder Inventory-2 (EDI-2), the Brief Symptom Inventory (BSI), and the Beck Depression Inventory-II (BDI-II) at treatment intake and discharge. Results indicated statistically significant symptom reductions with large effect sizes regarding severity of binge eating and compensatory behavior (SIAB-S), overall eating disorder symptom severity (EDI-2), overall psychopathology (BSI), and depressive symptom severity (BDI-II) even when controlling for antidepressant medication. The majority of patients showed either reliable (EDI-2: 33.7%, BSI: 34.8%, BDI-II: 18.1%) or even clinically significant symptom changes (EDI-2: 43.2%, BSI: 33.9%, BDI-II: 56.9%). Patients with clinically significant improvement were less distressed at intake and less likely to suffer from a comorbid borderline personality disorder when compared with those who did not improve to a clinically significant extent. Findings indicate that intensive psychotherapeutic inpatient treatment may be effective in about 75% of severely affected bulimic patients. For the remaining non-responding patients, inpatient treatment might be improved through an even stronger focus on the reduction of comorbid borderline personality traits.
Binny, Diana; Mezzenga, Emilio; Lancaster, Craig M; Trapp, Jamie V; Kairn, Tanya; Crowe, Scott B
2017-06-01
The aims of this study were to investigate machine beam parameters using the TomoTherapy quality assurance (TQA) tool, establish a correlation to patient delivery quality assurance results and to evaluate the relationship between energy variations detected using different TQA modules. TQA daily measurement results from two treatment machines for periods of up to 4 years were acquired. Analyses of beam quality, helical and static output variations were made. Variations from planned dose were also analysed using the Statistical Process Control (SPC) technique and their relationship to output trends was studied. Energy variations appeared to be one of the contributing factors to delivery output dose seen in the analysis. Ion chamber measurements were reliable indicators of energy and output variations and were linear with patient dose verifications. Crown Copyright © 2017. Published by Elsevier Ltd. All rights reserved.
Recent Literature on Whether Statistical Significance Tests Should or Should Not Be Banned.
Deegear, James
This paper summarizes the literature regarding statistical significance testing, with an emphasis on recent literature in various disciplines and literature exploring why researchers have demonstrably failed to be influenced by the American Psychological Association publication manual's encouragement to report effect sizes. Also considered are…
Leigh, S R; Jungers, W L
1994-12-01
A recent study suggests that differing populations of woolly spider monkeys exhibit a substantial degree of morphological, cytogenetic, and behavioral variation. We re-evaluate the differences between populations in the degree of canine tooth height sexual dimorphism and in the frequency of thumbs. Statistical analysis of variation in the degree of canine sexual dimorphism between these populations fails to provide strong evidence for subspecific variation: differences in the degree of canine dimorphism cannot be considered statistically significant. Differences between populations in the frequency of thumbs are, however, statistically significant. The lack of clear distinctions between populations in the degree of canine dimorphism complicates assessments of behavioral variation between these populations. We suggest that the level of geographic variation in woolly spider monkey canine dimorphism is not consistent with subspecific status.
Hayslett, H T
1991-01-01
Statistics covers the basic principles of Statistics. The book starts by tackling the importance and the two kinds of statistics; the presentation of sample data; the definition, illustration and explanation of several measures of location; and the measures of variation. The text then discusses elementary probability, the normal distribution and the normal approximation to the binomial. Testing of statistical hypotheses and tests of hypotheses about the theoretical proportion of successes in a binomial population and about the theoretical mean of a normal population are explained. The text the
Ji, Jun; Ling, Jeffrey; Jiang, Helen; Wen, Qiaojun; Whitin, John C; Tian, Lu; Cohen, Harvey J; Ling, Xuefeng B
2013-03-23
Mass spectrometry (MS) has evolved to become the primary high throughput tool for proteomics based biomarker discovery. To date, multiple challenges in protein MS data analysis remain: large-scale and complex data set management; MS peak identification and indexing; and high dimensional peak differential analysis with the concurrent statistical tests based false discovery rate (FDR). "Turnkey" solutions are needed for biomarker investigations to rapidly process MS data sets to identify statistically significant peaks for subsequent validation. Here we present an efficient and effective solution, which provides experimental biologists easy access to "cloud" computing capabilities to analyze MS data. The web portal can be accessed at http://transmed.stanford.edu/ssa/. The presented web application supports online uploading and analysis of large-scale MS data through a simple user interface. This bioinformatic tool will facilitate the discovery of potential protein biomarkers using MS.
Gaskin, Cadeyrn J; Happell, Brenda
2014-05-01
improvement. Most importantly, researchers should abandon the misleading practice of interpreting the results from inferential tests based solely on whether they are statistically significant (or not) and, instead, focus on reporting and interpreting effect sizes, confidence intervals, and significance levels. Nursing researchers also need to conduct and report a priori power analyses, and to address the issue of Type I experiment-wise error inflation in their studies. Crown Copyright © 2013. Published by Elsevier Ltd. All rights reserved.
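The experiment-wise Type I error inflation mentioned above can be illustrated with a short calculation. This is a minimal sketch that assumes the individual tests are independent; the function names are hypothetical and are not taken from the article.

```python
def familywise_error_rate(alpha, n_tests):
    """Chance of at least one false positive across n_tests independent tests,
    each run at per-test significance level alpha (independence is assumed)."""
    return 1.0 - (1.0 - alpha) ** n_tests


def bonferroni_alpha(alpha, n_tests):
    """Per-test level that keeps the experiment-wise rate at or below alpha."""
    return alpha / n_tests


if __name__ == "__main__":
    for k in (1, 5, 10, 20):
        print(k, round(familywise_error_rate(0.05, k), 3), bonferroni_alpha(0.05, k))
```

With correlated outcomes the inflation is typically smaller than this independent-test bound suggests, so the Bonferroni level is a conservative choice.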
International Nuclear Information System (INIS)
Smart, V.; Curwen, G.B.; Whitehouse, C.A.; Edwards, A.; Tawn, E.J.
2003-01-01
The G2 chromosomal radiosensitivity assay is a technically demanding assay. To ensure that it is reproducible in our laboratory, we have examined the effects of storage and culture conditions by applying the assay to a group of healthy controls and determined the extent of intra- and inter-individual variations. Nineteen different individuals provided one or more blood samples resulting in a total of 57 successful tests. Multiple cultures from a single blood sample showed no statistically significant difference in the number of chromatid type aberrations between cultures. A 24 h delay prior to culturing the lymphocytes did not significantly affect the induced G2 score. Intra-individual variation was not statistically significant in seven out of nine individuals. Inter-individual variation was highly statistically significant (P<0.001), indicating that there is a real difference between individuals in the response to radiation using this assay
Solar cycle variations in IMF intensity
International Nuclear Information System (INIS)
King, J.H.
1979-01-01
Annual averages of logarithms of hourly interplanetary magnetic field (IMF) intensities, obtained from geocentric spacecraft between November 1963 and December 1977, reveal the following solar cycle variation. For 2--3 years at each solar minimum period, the IMF intensity is depressed by 10--15% relative to its mean value realized during a broad 9-year period centered at solar maximum. No systematic variations occur during this 9-year period. The solar minimum decrease, although small in relation to variations in some other solar wind parameters, is both statistically and physically significant
Van Aert, R.C.M.; Van Assen, M.A.L.M.
2018-01-01
The unrealistically high rate of positive results within psychology has increased the attention to replication research. However, researchers who conduct a replication and want to statistically combine the results of their replication with a statistically significant original study encounter
A tutorial on hunting statistical significance by chasing N
Directory of Open Access Journals (Sweden)
Denes Szucs
2016-09-01
Full Text Available There is increasing concern about the replicability of studies in psychology and cognitive neuroscience. Hidden data dredging (also called p-hacking) is a major contributor to this crisis because it substantially increases Type I error, resulting in a much larger proportion of false positive findings than the usually expected 5%. In order to build better intuition to avoid, detect and criticise some typical problems, here I systematically illustrate the large impact of some easy-to-implement, and so perhaps frequent, data dredging techniques on boosting false positive findings. I illustrate several forms of two special cases of data dredging. First, researchers may violate the data collection stopping rules of null hypothesis significance testing by repeatedly checking for statistical significance with various numbers of participants. Second, researchers may group participants post-hoc along potential but unplanned independent grouping variables. The first approach 'hacks' the number of participants in studies, the second approach 'hacks' the number of variables in the analysis. I demonstrate the high amount of false positive findings generated by these techniques with data from true null distributions. I also illustrate that it is extremely easy to introduce strong bias into data by very mild selection and re-testing. Similar, usually undocumented data dredging steps can easily lead to having 20-50%, or more false positives.
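The first technique, repeatedly checking for significance as participants are added, can be sketched in a few lines. The simulation below is an illustrative assumption rather than the tutorial's own code: it draws data from a true null distribution (standard normal, known variance), applies a two-sided z-test, and compares a single test at the final sample size with a test at every fifth participant. The sample sizes, step, and function names are hypothetical choices.

```python
import math
import random


def two_sided_p(z):
    """Two-sided p-value for a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2.0))


def false_positive_rate(peek, n_start=10, n_max=100, step=5,
                        n_sims=4000, alpha=0.05, seed=1):
    """Fraction of null-data studies declared significant.

    peek=False: test once, at n_max participants.
    peek=True:  test at n_start, n_start+step, ..., n_max and stop at the
                first significant result (optional stopping).
    """
    rng = random.Random(seed)
    checkpoints = set(range(n_start, n_max + 1, step)) if peek else {n_max}
    hits = 0
    for _ in range(n_sims):
        total = 0.0
        for n in range(1, n_max + 1):
            total += rng.gauss(0.0, 1.0)   # true effect is zero
            if n in checkpoints:
                z = total / math.sqrt(n)   # z-test of mean = 0, sigma = 1
                if two_sided_p(z) < alpha:
                    hits += 1
                    break
    return hits / n_sims


if __name__ == "__main__":
    print("single test at n_max:", false_positive_rate(peek=False))
    print("test at every 5th participant:", false_positive_rate(peek=True))
```

The single test should land near the nominal 5%, while the peeking strategy yields a rate several times higher, in line with the inflation the abstract describes.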
Li, Changyang; Wang, Xiuying; Eberl, Stefan; Fulham, Michael; Yin, Yong; Dagan Feng, David
2015-01-01
Automated and general medical image segmentation can be challenging because the foreground and the background may have complicated and overlapping density distributions in medical imaging. Conventional region-based level set algorithms often assume piecewise constant or piecewise smooth for segments, which are implausible for general medical image segmentation. Furthermore, low contrast and noise make identification of the boundaries between foreground and background difficult for edge-based level set algorithms. Thus, to address these problems, we suggest a supervised variational level set segmentation model to harness the statistical region energy functional with a weighted probability approximation. Our approach models the region density distributions by using the mixture-of-mixtures Gaussian model to better approximate real intensity distributions and distinguish statistical intensity differences between foreground and background. The region-based statistical model in our algorithm can intuitively provide better performance on noisy images. We constructed a weighted probability map on graphs to incorporate spatial indications from user input with a contextual constraint based on the minimization of contextual graphs energy functional. We measured the performance of our approach on ten noisy synthetic images and 58 medical datasets with heterogeneous intensities and ill-defined boundaries and compared our technique to the Chan-Vese region-based level set model, the geodesic active contour model with distance regularization, and the random walker model. Our method consistently achieved the highest Dice similarity coefficient when compared to the other methods.
Directory of Open Access Journals (Sweden)
Aloisie Poulíčková
Full Text Available It is now clear that whole genome duplications have occurred in all eukaryotic evolutionary lineages, and that the vast majority of flowering plants have experienced polyploidisation in their evolutionary history. However, study of genome size variation in microalgae lags behind that of higher plants and seaweeds. In this study, we have addressed the question whether microalgal phylogeny is associated with DNA content variation in order to evaluate the evolutionary significance of polyploidy in the model genus Micrasterias. We applied flow-cytometric techniques of DNA quantification to microalgae and mapped the estimated DNA content along the phylogenetic tree. Correlations between DNA content and cell morphometric parameters were also tested using geometric morphometrics. In total, DNA content was successfully determined for 34 strains of the genus Micrasterias. The estimated absolute 2C nuclear DNA amount ranged from 2.1 to 64.7 pg; intraspecific variation being 17.4-30.7 pg in M. truncata and 32.0-64.7 pg in M. rotata. There were significant differences between DNA contents of related species. We found strong correlation between the absolute nuclear DNA content and chromosome numbers and significant positive correlation between the DNA content and both cell size and number of terminal lobes. Moreover, the results showed the importance of cell/life cycle studies for interpretation of DNA content measurements in microalgae.
Sibling Competition & Growth Tradeoffs. Biological vs. Statistical Significance
Kramer, Karen L.; Veile, Amanda; Otárola-Castillo, Erik
2016-01-01
Early childhood growth has many downstream effects on future health and reproduction and is an important measure of offspring quality. While a tradeoff between family size and child growth outcomes is theoretically predicted in high-fertility societies, empirical evidence is mixed. This is often attributed to phenotypic variation in parental condition. However, inconsistent study results may also arise because family size confounds the potentially differential effects that older and younger s...
International Nuclear Information System (INIS)
Huijsmans, D.P.
1982-01-01
The aim of this research was to distinguish as accurately as possible between two mechanisms behind a half-daily variation in detected numbers of neutrons and mesons in the secondary cosmic ray particles at sea level. These two mechanisms are due to air pressure variations at sea level and affect the number of primary particles with a certain arrival direction. The distribution among arrival directions in the ecliptic plane varies if a gradient exists in the guiding centre density of primaries in directions perpendicular to the neutral sheet. Chapter 2 is devoted to the calculation of a physically and statistically justifiable determination of the barometric coefficient for neutron measurements and air pressures. Chapter 3 deals with the estimation of atmospheric correction coefficients for the elimination of the influence of changing atmospheric conditions on the number of detected mesons. For mesons the variation of total mass, and also the variations in mass-distribution along the trajectory of the mesons are important. After correction for atmospheric variations using the resulting atmospheric correction coefficients from chapters 2 and 3, the influence of the structure of the interplanetary magnetic field near the earth is examined in chapter 4. Finally, in chapter 5, a power spectral analysis of variations in corrected intensities of neutrons and mesons is carried out. Such an analysis decomposes the variance of a time series into contributions within small frequency intervals. From the power spectra of variations on a yearly basis, a statistically well-founded judgement can be given as to the significance of the semi-diurnal variation during the different phases of the solar magnetic activity cycle. (Auth.)
DEFF Research Database (Denmark)
Jones, Allan; Sommerlund, Bo
2007-01-01
The uses of null hypothesis significance testing (NHST) and statistical power analysis within psychological research are critically discussed. The article looks at the problems of relying solely on NHST when dealing with small and large sample sizes. The use of power-analysis in estimating...... the potential error introduced by small and large samples is advocated. Power analysis is not recommended as a replacement to NHST but as an additional source of information about the phenomena under investigation. Moreover, the importance of conceptual analysis in relation to statistical analysis of hypothesis...
International Nuclear Information System (INIS)
Tang Jie; Nett, Brian E; Chen Guanghong
2009-01-01
Of all available reconstruction methods, statistical iterative reconstruction algorithms appear particularly promising since they enable accurate physical noise modeling. The newly developed compressive sampling/compressed sensing (CS) algorithm has shown the potential to accurately reconstruct images from highly undersampled data. The CS algorithm can be implemented in the statistical reconstruction framework as well. In this study, we compared the performance of two standard statistical reconstruction algorithms (penalized weighted least squares and q-GGMRF) to the CS algorithm. In assessing the image quality using these iterative reconstructions, it is critical to utilize realistic background anatomy as the reconstruction results are object dependent. A cadaver head was scanned on a Varian Trilogy system at different dose levels. Several figures of merit including the relative root mean square error and a quality factor which accounts for the noise performance and the spatial resolution were introduced to objectively evaluate reconstruction performance. A comparison is presented between the three algorithms for a constant undersampling factor comparing different algorithms at several dose levels. To facilitate this comparison, the original CS method was formulated in the framework of the statistical image reconstruction algorithms. Important conclusions of the measurements from our studies are that (1) for realistic neuro-anatomy, over 100 projections are required to avoid streak artifacts in the reconstructed images even with CS reconstruction, (2) regardless of the algorithm employed, it is beneficial to distribute the total dose to more views as long as each view remains quantum noise limited and (3) the total variation-based CS method is not appropriate for very low dose levels because while it can mitigate streaking artifacts, the images exhibit patchy behavior, which is potentially harmful for medical diagnosis.
Statistical issues in reporting quality data: small samples and casemix variation.
Zaslavsky, A M
2001-12-01
To present two key statistical issues that arise in analysis and reporting of quality data. Casemix variation is relevant to quality reporting when the units being measured have differing distributions of patient characteristics that also affect the quality outcome. When this is the case, adjustment using stratification or regression may be appropriate. Such adjustments may be controversial when the patient characteristic does not have an obvious relationship to the outcome. Stratified reporting poses problems for sample size and reporting format, but may be useful when casemix effects vary across units. Although there are no absolute standards of reliability, high reliabilities (interunit F ≥ 10 or reliability ≥ 0.9) are desirable for distinguishing above- and below-average units. When small or unequal sample sizes complicate reporting, precision may be improved using indirect estimation techniques that incorporate auxiliary information, and 'shrinkage' estimation can help to summarize the strength of evidence about units with small samples. With broader understanding of casemix adjustment and methods for analyzing small samples, quality data can be analysed and reported more accurately.
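The pairing of F ≥ 10 with reliability ≥ 0.9 follows from the standard one-way ANOVA relationship between the interunit F statistic and the reliability of a unit mean, reliability = 1 - 1/F. The sketch below assumes that relationship and equal sample sizes per unit; the function names and the simple shrinkage formula are illustrative assumptions, not the article's procedure.

```python
def reliability_from_f(f_stat):
    """Reliability of unit means implied by a one-way ANOVA F statistic.

    Uses reliability = (MS_between - MS_within) / MS_between = 1 - 1/F,
    floored at zero when F <= 1 (no detectable between-unit variation).
    """
    if f_stat <= 1.0:
        return 0.0
    return 1.0 - 1.0 / f_stat


def shrink(unit_mean, grand_mean, reliability):
    """Pull a unit's observed mean toward the grand mean.

    A unit with low reliability (for example, a small sample) is shrunk
    strongly; a unit with reliability near 1 is left almost unchanged.
    """
    return grand_mean + reliability * (unit_mean - grand_mean)


if __name__ == "__main__":
    print(reliability_from_f(10.0))   # threshold quoted in the abstract
    print(shrink(unit_mean=80.0, grand_mean=70.0, reliability=0.5))
```

With unequal sample sizes the reliability differs by unit, so in practice a separate value would be computed for each unit before shrinking.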
Daytime Variations of Tear Osmolarity Measurement in Dry Eye Patients
Directory of Open Access Journals (Sweden)
Ulviye Yiğit
2013-12-01
Full Text Available Purpose: Our primary aim was to show the variations of tear osmolarity during the daytime in subjects with and without dry eyes and, secondarily, to evaluate the relationship of these variations with Schirmer’s test and break-up time (BUT). Material and Method: Twenty newly diagnosed dry eye patients and 20 healthy volunteers of similar age and gender were included in this prospective study. In addition to the full ophthalmic examination, Schirmer’s test and the BUT test were applied to all participants. Tear osmolarity measurements were performed after the pre-examination but on a different day. The measurements were recorded with the TearLab Osmolarity System (TearLab Corporation, San Diego, CA, USA) every 3 hours between 8:00 AM and 5:00 PM. The results were evaluated statistically. Results: No statistically significant difference was found between the mean age and gender of the dry eye syndrome (DES) and control groups (p>0.05). The mean measurements of Schirmer’s test and BUT in the DES group were statistically significantly lower than those in the control group (p=0.0001). The mean measurements of tear osmolarity at 8:00 AM, 11:00 AM, 2:00 PM, and 5:00 PM in the DES group were statistically significantly higher than those in the control group (p=0.001, p=0.0001). No statistically significant difference in tear osmolarity at 8:00 AM, 11:00 AM, 2:00 PM, and 5:00 PM was found between the groups, and within DES and control groups (p>0.05). Discussion: We did not find a significant change in daytime variations of tear osmolarity in dry eye patients and healthy subjects. As a secondary result, we can conclude that there is no difference among tear osmolarity, Schirmer’s and BUT tests in the diagnosis of DES. (Turk J Ophthalmol 2013; 43: 437-41)
Infants generalize representations of statistically segmented words
Directory of Open Access Journals (Sweden)
Katharine eGraf Estes
2012-10-01
Full Text Available The acoustic variation in language presents learners with a substantial challenge. To learn by tracking statistical regularities in speech, infants must recognize words across tokens that differ based on characteristics such as the speaker’s voice, affect, or the sentence context. Previous statistical learning studies have not investigated how these types of surface form variation affect learning. The present experiments used tasks tailored to two distinct developmental levels to investigate the robustness of statistical learning to variation. Experiment 1 examined statistical word segmentation in 11-month-olds and found that infants can recognize statistically segmented words across a change in the speaker’s voice from segmentation to testing. The direction of infants’ preferences suggests that recognizing words across a voice change is more difficult than recognizing them in a consistent voice. Experiment 2 tested whether 17-month-olds can generalize the output of statistical learning across variation to support word learning. The infants were successful in their generalization; they associated referents with statistically defined words despite a change in voice from segmentation to label learning. Infants’ learning patterns also indicate that they formed representations of across-word syllable sequences during segmentation. Thus, low probability sequences can act as object labels in some conditions. The findings of these experiments suggest that the units that emerge during statistical learning are not perceptually constrained, but rather are robust to naturalistic acoustic variation.
Statistical significance estimation of a signal within the GooFit framework on GPUs
Directory of Open Access Journals (Sweden)
Cristella Leonardo
2017-01-01
Full Text Available In order to test the computing capabilities of GPUs with respect to traditional CPU cores a high-statistics toy Monte Carlo technique has been implemented both in ROOT/RooFit and GooFit frameworks with the purpose to estimate the statistical significance of the structure observed by CMS close to the kinematical boundary of the J/ψϕ invariant mass in the three-body decay B+ → J/ψϕK+. GooFit is a data analysis open tool under development that interfaces ROOT/RooFit to CUDA platform on nVidia GPU. The optimized GooFit application running on GPUs hosted by servers in the Bari Tier2 provides striking speed-up performances with respect to the RooFit application parallelised on multiple CPUs by means of PROOF-Lite tool. The considerable resulting speed-up, evident when comparing concurrent GooFit processes allowed by CUDA Multi Process Service and a RooFit/PROOF-Lite process with multiple CPU workers, is presented and discussed in detail. By means of GooFit it has also been possible to explore the behaviour of a likelihood ratio test statistic in different situations in which the Wilks Theorem may or may not apply because its regularity conditions are not satisfied.
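The toy Monte Carlo idea can be illustrated without ROOT/RooFit, GooFit, or a GPU. The sketch below is a deliberately simplified stand-in, not the analysis described above: it assumes a single-bin counting experiment with a known background expectation, uses the likelihood ratio statistic q0 = 2[n ln(n/b) - (n - b)] for n > b, generates background-only toys with a hand-written Poisson sampler, and converts the resulting p-value into a one-sided Gaussian significance. All names and numbers are hypothetical.

```python
import math
import random
from statistics import NormalDist


def poisson(rng, mean):
    """Draw a Poisson variate (Knuth's multiplication method, small means)."""
    limit = math.exp(-mean)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def q0(n, background):
    """Likelihood ratio statistic for an excess over a known background."""
    if n <= background:
        return 0.0
    return 2.0 * (n * math.log(n / background) - (n - background))


def toy_significance(n_obs, background, n_toys=20000, seed=7):
    """Return (toy p-value, toy significance Z, asymptotic Z = sqrt(q0))."""
    rng = random.Random(seed)
    q_obs = q0(n_obs, background)
    # Count background-only toys at least as signal-like as the observation.
    exceed = sum(q0(poisson(rng, background), background) >= q_obs
                 for _ in range(n_toys))
    p_value = (exceed + 1) / (n_toys + 1)   # never exactly zero
    # p_value == 1 means no excess at all; report zero significance.
    z_toys = NormalDist().inv_cdf(1.0 - p_value) if p_value < 1.0 else 0.0
    return p_value, z_toys, math.sqrt(q_obs)


if __name__ == "__main__":
    print(toy_significance(n_obs=20, background=10.0))
```

Comparing the toy-based significance with the asymptotic sqrt(q0) value gives a small-scale view of why toys are needed when the asymptotic (Wilks) approximation may not hold.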
Statistical intensity variation analysis for rapid volumetric imaging of capillary network flux.
Lee, Jonghwan; Jiang, James Y; Wu, Weicheng; Lesage, Frederic; Boas, David A
2014-04-01
We present a novel optical coherence tomography (OCT)-based technique for rapid volumetric imaging of red blood cell (RBC) flux in capillary networks. Previously we reported that OCT can capture individual RBC passage within a capillary, where the OCT intensity signal at a voxel fluctuates when an RBC passes the voxel. Based on this finding, we defined a metric of statistical intensity variation (SIV) and validated that the mean SIV is proportional to the RBC flux [RBC/s] through simulations and measurements. From rapidly scanned volume data, we used Hessian matrix analysis to vectorize a segment path of each capillary and estimate its flux from the mean of the SIVs gathered along the path. Repeating this process led to a 3D flux map of the capillary network. The present technique enabled us to trace the RBC flux changes over hundreds of capillaries with a temporal resolution of ~1 s during functional activation.
A statistical light use efficiency model explains 85% variations in global GPP
Jiang, C.; Ryu, Y.
2016-12-01
Photosynthesis is a complicated process whose modeling requires different levels of assumptions, simplification, and parameterization. Among models, the light use efficiency (LUE) model is highly compact but powerful in monitoring gross primary production (GPP) from satellite data. Most LUE models adopt a multiplicative form of maximum LUE, absorbed photosynthetically active radiation (APAR), and temperature and water stress functions. However, maximum LUE is a fitting parameter with large spatial variations, yet most studies use only several biome-dependent constants. In addition, stress functions in the literature are empirical and arbitrary. Moreover, the meteorological data used are usually coarse-resolution, e.g., 1°, which could cause large errors. Finally, sunlit and shade canopy have completely different light responses, but this is rarely considered. Targeting these issues, we derived a new statistical LUE model from a process-based and satellite-driven model, the Breathing Earth System Simulator (BESS). We have already derived a set of global radiation (5-km resolution), carbon and water fluxes (1-km resolution) products from 2000 to 2015 from BESS. By exploring these datasets, we found strong correlation between APAR and GPP for sunlit (R2=0.84) and shade (R2=0.96) canopy, respectively. A simple model, only driven by sunlit and shade APAR, was thus built based on linear relationships. The slopes of the linear function act as effective LUE of global ecosystem, with values of 0.0232 and 0.0128 umol C/umol quanta for sunlit and shade canopy, respectively. When compared with MPI-BGC GPP products, a global proxy of FLUXNET data, BESS-LUE achieved an overall accuracy of R2 = 0.85, whereas original BESS was R2 = 0.83 and MODIS GPP product was R2 = 0.76. We investigated spatiotemporal variations of the effective LUE. Spatially, the ratio of sunlit to shade values ranged from 0.1 (wet tropic) to 4.5 (dry inland). By using maps of sunlit and shade effective LUE the accuracy of
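The two-slope linear model described in this abstract can be written down directly. The sketch below uses the two global effective LUE values quoted above; the zero intercept, the function name, and the example APAR inputs are assumptions made for illustration, since the abstract does not state them.

```python
# Effective LUE slopes quoted in the abstract (umol C per umol quanta).
LUE_SUNLIT = 0.0232
LUE_SHADE = 0.0128


def gpp_two_leaf(apar_sunlit, apar_shade,
                 lue_sunlit=LUE_SUNLIT, lue_shade=LUE_SHADE):
    """Estimate GPP as a linear function of sunlit and shade APAR.

    Inputs are absorbed PAR in umol quanta (per unit area and time); the
    result is in umol C in the same area and time units. A zero intercept
    is assumed here.
    """
    return lue_sunlit * apar_sunlit + lue_shade * apar_shade


if __name__ == "__main__":
    # Hypothetical APAR values, chosen only to show the arithmetic.
    print(gpp_two_leaf(apar_sunlit=1000.0, apar_shade=500.0))
```

Passing per-pixel slopes instead of the global constants would correspond to the spatially varying effective LUE maps that the abstract goes on to discuss.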
Directory of Open Access Journals (Sweden)
Hamad Al-Khalid
2011-12-01
Hardness homogeneity of commonly used ferrous and nonferrous structural engineering materials is of vital importance at the design stage; therefore, information on the homogeneity of material properties should be validated and any deviation should be addressed. In the current study, the hardness variation over a wide spectrum of radial locations in some ferrous and nonferrous structural engineering materials was investigated. Measurements were performed over both faces (cross-sections) of each stock bar according to a pre-specified stratified design, ensuring coverage of the entire area in both the radial and circumferential directions. Additionally, the credibility of the apparatus and measuring procedures was examined through a statistically based calibration process using the hardness reference block. Statistical and response surface graphical analyses were used to examine the nature, adequacy and significance of the measured hardness values. Calibration of the apparatus reference block proved the reliability of the measuring system, where no strong evidence was found against the stochastic nature of hardness measures over the various stratified locations. Also, outlier elimination procedures proved to be beneficial only at a few measured points. Hardness measurements showed a dispersion domain that is within the acceptable confidence interval. For AISI 4140 and AISI 1020 steels, hardness was found to decrease slightly as the diameter is reduced, while the opposite behavior was observed for AA 6082 aluminum alloy. However, no definite significant behavior was noticed regarding the effect of the sector sequence (circumferential direction).
Sierevelt, Inger N.; van Oldenrijk, Jakob; Poolman, Rudolf W.
2007-01-01
In this paper we describe several issues that influence the reporting of statistical significance in relation to clinical importance, since misinterpretation of p values is a common issue in orthopaedic literature. Orthopaedic research is tormented by the risks of false-positive (type I error) and
Morphometric variations of the 7th cervical vertebrae of Zulu, White, and Colored South Africans.
Kibii, Job M; Pan, Rualing; Tobias, Phillip V
2010-05-01
The 7th cervical vertebrae of 240 cadavers of South African Zulu, White, and Colored population groups were examined to determine morphometric variation. White and Colored females had statistically significant narrower cervical anteroposterior diameters than their male counterparts, whereas no statistically significant difference between sexes of the Zulu population group was observed in this variable. In addition, although Zulu and Colored females had statistically significant narrower cervical transverse diameters than their male counterparts, there was no statistically significant variation between South African white males and females in this respect. The findings indicate that sexual dimorphism is more apparent in the vertebral centrum, across the three population groups, where males had significantly larger dimensions in centrum anteroposterior diameter, height, and width than their female counterparts. The study further reveals that sexual dimorphism is more apparent when one compares aspects of the 7th cervical vertebra between sexes within the same population group. Overall, the dimensions of the various variates of the vertebra are substantially smaller in women than in men. The smaller dimensions, particularly of the centrum, may be the result of lower skeletal mass in women and render them more vulnerable to fractures resulting from compression forces. 2010 Wiley-Liss, Inc.
International Nuclear Information System (INIS)
Oylumoglu, G.
2005-01-01
In this study, the variation of additional enthalpy with respect to pH has been investigated by statistical mechanical methods. To bring out the additional effect, the partition function of the proteins is calculated in the single protein molecule approximation. From the partition function, the free energies of the proteins are obtained, and the additional free energy is then used to calculate the terms in the thermodynamic quantity. The additional enthalpy H_D has been obtained by taking the effective electric field E and constant dipole moment M as thermodynamic variables and using Maxwell equations. In the presented semi-phenomenological theory, the necessary data are taken from the experimental study of P.L. Privalov. The variation in the additional enthalpy H_D has been investigated in the pH interval of 1-5, and the results of the calculations are discussed for lysozyme.
A variational approach to liver segmentation using statistics from multiple sources
Zheng, Shenhai; Fang, Bin; Li, Laquan; Gao, Mingqi; Wang, Yi
2018-01-01
Medical image segmentation plays an important role in digital medical research, and therapy planning and delivery. However, the presence of noise and low contrast renders automatic liver segmentation an extremely challenging task. In this study, we focus on a variational approach to liver segmentation in computed tomography scan volumes in a semiautomatic and slice-by-slice manner. In this method, one slice is selected and its connected component liver region is determined manually to initialize the subsequent automatic segmentation process. From this guiding slice, we execute the proposed method downward to the last one and upward to the first one, respectively. A segmentation energy function is proposed by combining the statistical shape prior, global Gaussian intensity analysis, and enforced local statistical feature under the level set framework. During segmentation, the liver shape is estimated by minimization of this function. The improved Chan-Vese model is used to refine the shape to capture the long and narrow regions of the liver. The proposed method was verified on two independent public databases, the 3D-IRCADb and the SLIVER07. Among all the tested methods, our method yielded the best volumetric overlap error (VOE) of 6.5 +/- 2.8%, the best root mean square symmetric surface distance (RMSD) of 2.1 +/- 0.8 mm, and the best maximum symmetric surface distance (MSD) of 18.9 +/- 8.3 mm in the 3D-IRCADb dataset, and the best average symmetric surface distance (ASD) of 0.8 +/- 0.5 mm and the best RMSD of 1.5 +/- 1.1 mm in the SLIVER07 dataset. The results of the quantitative comparison show that the proposed liver segmentation method achieves competitive segmentation performance with state-of-the-art techniques.
International Nuclear Information System (INIS)
DUDEK, J; SZPAK, B; FORNAL, B; PORQUET, M-G
2011-01-01
In this and the follow-up article we briefly discuss what we believe represents one of the most serious problems in contemporary nuclear structure: the question of statistical significance of parametrizations of nuclear microscopic Hamiltonians and the implied predictive power of the underlying theories. In the present Part I, we introduce the main lines of reasoning of the so-called Inverse Problem Theory, an important sub-field in the contemporary Applied Mathematics, here illustrated on the example of the Nuclear Mean-Field Approach.
Bossart, J L; Scriber, J M
1995-12-01
Differential selection in a heterogeneous environment is thought to promote the maintenance of ecologically significant genetic variation. Variation is maintained when selection is counterbalanced by the homogenizing effects of gene flow and random mating. In this study, we examine the relative importance of differential selection and gene flow in maintaining genetic variation in Papilio glaucus. Differential selection on traits contributing to successful use of host plants (oviposition preference and larval performance) was assessed by comparing the responses of southern Ohio, north central Georgia, and southern Florida populations of P. glaucus to three hosts: Liriodendron tulipifera, Magnolia virginiana, and Prunus serotina. Gene flow among populations was estimated using allozyme frequencies from nine polymorphic loci. Significant genetic differentiation was observed among populations for both oviposition preference and larval performance. This differentiation was interpreted to be the result of selection acting on Florida P. glaucus for enhanced use of Magnolia, the prevalent host in Florida. In contrast, no evidence of population differentiation was revealed by allozyme frequencies. F_ST values were very small and Nm, an estimate of the relative strengths of gene flow and genetic drift, was large, indicating that genetic exchange among P. glaucus populations is relatively unrestricted. The contrasting patterns of spatial differentiation for host-use traits and lack of differentiation for electrophoretically detectable variation imply that differential selection among populations will be counterbalanced by gene flow, thereby maintaining genetic variation for host-use traits. © 1995 The Society for the Study of Evolution.
Linting, Marielle; van Os, Bart Jan; Meulman, Jacqueline J.
2011-01-01
In this paper, the statistical significance of the contribution of variables to the principal components in principal components analysis (PCA) is assessed nonparametrically by the use of permutation tests. We compare a new strategy to a strategy used in previous research consisting of permuting the columns (variables) of a data matrix…
Directory of Open Access Journals (Sweden)
Vujović Svetlana R.
2013-01-01
This paper illustrates the utility of multivariate statistical techniques for the analysis and interpretation of water quality data sets and the identification of pollution sources/factors, with a view to obtaining better information about water quality and designing a monitoring network for effective management of water resources. Multivariate statistical techniques, such as factor analysis (FA)/principal component analysis (PCA) and cluster analysis (CA), were applied to evaluate variations in, and to interpret, a water quality data set for natural water bodies obtained during the 2010 monitoring year for 13 parameters at 33 different sites. FA/PCA attempts to explain the correlations between the observations in terms of underlying factors, which are not directly observable. Factor analysis is applied to physico-chemical parameters of natural water bodies with the aim of classification and data summarization, as well as segmentation of heterogeneous data sets into smaller homogeneous subsets. Factor loadings were categorized as strong and moderate, corresponding to absolute loading values of >0.75 and 0.75-0.50, respectively. Four principal factors were obtained with eigenvalues >1, together accounting for more than 78% of the total variance in the water data sets, which is adequate to give good prior information regarding data structure. Each factor that is significantly related to specific variables represents a different dimension of water quality. The first factor, F1, accounts for 28% of the total variance and represents the hydrochemical dimension of water quality. The second factor, F2, accounts for 18% of the total variance and may be taken as a factor of water eutrophication. The third factor, F3, accounts for 17% of the total variance and represents the influence of point sources of pollution on water quality. The fourth factor, F4, accounts for 13% of the total variance and may be taken as an ecological dimension of water quality. Cluster analysis (CA) is an
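The two screening rules quoted in this abstract (retain factors with eigenvalues above 1; label loadings as strong or moderate by absolute value) can be sketched as below. The thresholds come from the abstract; the function names, the label for smaller loadings, and the sample eigenvalues and loadings are hypothetical and are not the study's data.

```python
def retained_factors(eigenvalues, threshold=1.0):
    """Return indices of factors whose eigenvalue exceeds the threshold."""
    return [i for i, value in enumerate(eigenvalues) if value > threshold]


def classify_loading(loading):
    """Label a factor loading using the thresholds quoted in the abstract."""
    magnitude = abs(loading)
    if magnitude > 0.75:
        return "strong"
    if magnitude >= 0.50:
        return "moderate"
    return "below moderate"  # this label is our own, not the paper's


# Hypothetical eigenvalues and loadings, for illustration only.
sample_eigenvalues = [3.6, 2.3, 2.2, 1.7, 0.9, 0.6]
sample_loadings = {"conductivity": 0.88, "nitrate": -0.61, "pH": 0.22}

kept = retained_factors(sample_eigenvalues)
labels = {name: classify_loading(v) for name, v in sample_loadings.items()}
print(kept, labels)
```

Using the absolute value matters because a strongly negative loading ties a variable to a factor just as firmly as a strongly positive one.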
A Note on Comparing the Power of Test Statistics at Low Significance Levels.
Morris, Nathan; Elston, Robert
2011-01-01
It is an obvious fact that the power of a test statistic is dependent upon the significance (alpha) level at which the test is performed. It is perhaps a less obvious fact that the relative performance of two statistics in terms of power is also a function of the alpha level. Through numerous personal discussions, we have noted that even some competent statisticians have the mistaken intuition that relative power comparisons at traditional levels such as α = 0.05 will be roughly similar to relative power comparisons at very low levels, such as the level α = 5 × 10^-8, which is commonly used in genome-wide association studies. In this brief note, we demonstrate that this notion is in fact quite wrong, especially with respect to comparing tests with differing degrees of freedom. In fact, at very low alpha levels the cost of additional degrees of freedom is often comparatively low. Thus we recommend that statisticians exercise caution when interpreting the results of power comparison studies which use alpha levels that will not be used in practice.
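To explore the comparison this note describes, the power of chi-square tests with 1 and 2 degrees of freedom can be computed at any alpha level with closed-form expressions. The sketch below is not the authors' code; the function names and the noncentrality values are assumptions chosen for illustration, and the 2-df power uses the standard Poisson-mixture representation of the noncentral chi-square distribution.

```python
import math
from statistics import NormalDist

_STD_NORMAL = NormalDist()


def power_chi2_1df(ncp, alpha):
    """Power of a 1-df chi-square test with noncentrality parameter ncp."""
    z_crit = -_STD_NORMAL.inv_cdf(alpha / 2.0)
    shift = math.sqrt(ncp)
    # Reject when |Z + shift| > z_crit, where Z is standard normal.
    return _STD_NORMAL.cdf(shift - z_crit) + _STD_NORMAL.cdf(-shift - z_crit)


def power_chi2_2df(ncp, alpha, terms=200):
    """Power of a 2-df chi-square test via a Poisson mixture of central tails."""
    half_crit = -math.log(alpha)  # critical value / 2, since P(X > x) = exp(-x/2)
    half_ncp = ncp / 2.0
    poisson_weight = math.exp(-half_ncp)  # P(J = 0) for J ~ Poisson(ncp / 2)
    tail_term = 1.0  # (x/2)^i / i! at i = 0
    tail_sum = 1.0   # running sum over i = 0..j
    power = 0.0
    for j in range(terms):
        # Central chi-square with 2 + 2j df has tail exp(-x/2) * tail_sum.
        power += poisson_weight * math.exp(-half_crit) * tail_sum
        poisson_weight *= half_ncp / (j + 1)
        tail_term *= half_crit / (j + 1)
        tail_sum += tail_term
    return power


for alpha, ncp in [(0.05, 10.0), (5e-8, 30.0)]:
    print(alpha, ncp, power_chi2_1df(ncp, alpha), power_chi2_2df(ncp, alpha))
```

In practice a test with more degrees of freedom often captures a larger noncentrality, so a fair comparison would vary `ncp` between the two tests; holding it fixed, as here, isolates the pure cost of the extra degree of freedom at each alpha level.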
Morgenstern Horing, Norman J
2017-01-01
This book provides an introduction to the methods of coupled quantum statistical field theory and Green's functions. The methods of coupled quantum field theory have played a major role in the extensive development of nonrelativistic quantum many-particle theory and condensed matter physics. This introduction to the subject is intended to facilitate delivery of the material in an easily digestible form to advanced undergraduate physics majors at a relatively early stage of their scientific development. The main mechanism to accomplish this is the early introduction of variational calculus and the Schwinger Action Principle, accompanied by Green's functions. Important achievements of the theory in condensed matter and quantum statistical physics are reviewed in detail to help develop research capability. These include the derivation of coupled field Green's function equations-of-motion for a model electron-hole-phonon system, extensive discussions of retarded, thermodynamic and nonequilibrium Green's functions...
Local multiplicity adjustment for the spatial scan statistic using the Gumbel distribution.
Gangnon, Ronald E
2012-03-01
The spatial scan statistic is an important and widely used tool for cluster detection. It is based on the simultaneous evaluation of the statistical significance of the maximum likelihood ratio test statistic over a large collection of potential clusters. In most cluster detection problems, there is variation in the extent of local multiplicity across the study region. For example, using a fixed maximum geographic radius for clusters, urban areas typically have many overlapping potential clusters, whereas rural areas have relatively few. The spatial scan statistic does not account for local multiplicity variation. We describe a previously proposed local multiplicity adjustment based on a nested Bonferroni correction and propose a novel adjustment based on a Gumbel distribution approximation to the distribution of a local scan statistic. We compare the performance of all three statistics in terms of power and a novel unbiased cluster detection criterion. These methods are then applied to the well-known New York leukemia dataset and a Wisconsin breast cancer incidence dataset. © 2011, The International Biometric Society.
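A Gumbel approximation of the kind mentioned here can be sketched by fitting a Gumbel distribution to maxima simulated under the null hypothesis and reading a p-value off its upper tail. The abstract does not specify the fitting procedure; the method-of-moments fit, the function names, and the simulated maxima below are all assumptions made for illustration.

```python
import math
import statistics

EULER_GAMMA = 0.5772156649015329


def fit_gumbel_moments(maxima):
    """Method-of-moments fit of a Gumbel (maximum) distribution."""
    mean = statistics.fmean(maxima)
    sd = statistics.stdev(maxima)
    scale = sd * math.sqrt(6.0) / math.pi
    location = mean - EULER_GAMMA * scale
    return location, scale


def gumbel_p_value(observed, location, scale):
    """Upper-tail probability P(X >= observed) under the fitted Gumbel."""
    z = (observed - location) / scale
    return -math.expm1(-math.exp(-z))


# Hypothetical null maxima of a local scan statistic (e.g. from Monte Carlo).
null_maxima = [4.1, 5.3, 3.8, 6.0, 4.7, 5.1, 4.4, 5.8, 4.9, 5.5]
location, scale = fit_gumbel_moments(null_maxima)
print(gumbel_p_value(9.0, location, scale))
```

The appeal of a parametric tail is that it yields p-values smaller than one over the number of Monte Carlo replicates, which a purely empirical rank cannot provide.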
DEFF Research Database (Denmark)
Serviss, Jason T.; Gådin, Jesper R.; Eriksson, Per
2017-01-01
, e.g. genes in a specific pathway, alone can separate samples into these established classes. Despite this, the evaluation of class separations is often subjective and performed via visualization. Here we present the ClusterSignificance package; a set of tools designed to assess the statistical...... significance of class separations downstream of dimensionality reduction algorithms. In addition, we demonstrate the design and utility of the ClusterSignificance package and utilize it to determine the importance of long non-coding RNA expression in the identity of multiple hematological malignancies....
Clinical significance of vagus nerve variation in radiofrequency ablation of thyroid nodules
International Nuclear Information System (INIS)
Ha, Eun Ju; Baek, Jung Hwan; Lee, Jeong Hyun; Shong, Young Kee; Kim, Jae Kyun
2011-01-01
To evaluate the types and incidence of vagus nerve variations and to assess factors related to the vulnerability of vagus nerves during the radiofrequency (RF) ablation of thyroid nodules. Bilateral vagus nerves of 304 consecutive patients who underwent ultrasound of the neck were assessed. Two radiologists evaluated vagus nerve type (types 1-4; lateral/anterior/medial/posterior), the shortest distance between the thyroid gland and vagus nerve, and thyroid contour. Vagus nerve vulnerability was defined as a vagus nerve located within 2 mm of the thyroid gland through the ex vivo experiments, and factors associated with vulnerability were assessed. We were unable to find one vagus nerve. Of the 607 vagus nerves, 467 (76.9%) were type 1, 128 (21.1%) were type 2, 10 (1.6%) were type 3, and 2 (0.3%) were type 4, with 81 (13.3%) being vulnerable. Univariate analysis showed that sex, location, thyroid contour and type were significantly associated with vagus nerve vulnerability. Multivariate analysis showed that bulging contour caused by thyroid nodules (P = 0.001), vagus nerve types 2/4 (P < 0.001) and type 3 (P < 0.001) were independent predictors. The operator should pay attention to anatomical variations and the resulting vagus nerve injury during RF ablation of bulging thyroid nodules. (orig.)
van Tulder, M.W.; Malmivaara, A.; Hayden, J.; Koes, B.
2007-01-01
STUDY DESIGN. Critical appraisal of the literature. OBJECTIVES. The objective of this study was to assess if results of back pain trials are statistically significant and clinically important. SUMMARY OF BACKGROUND DATA. There seems to be a discrepancy between conclusions reported by authors and
Statistical investigation of expected wave energy and its reliability
International Nuclear Information System (INIS)
Ozger, M.; Altunkaynak, A.; Sen, Z.
2004-01-01
The statistical behavior of wave energy at a single site is derived by considering simultaneous variations in the period and wave height. In this paper, the general wave power formulation is derived by using perturbation theory. This method leads to a general formulation of the wave power expectation and other statistical parameter expressions, such as standard deviation and coefficient of variation. The statistical parameters, namely the mean value and variance of wave energy, are found in terms of the simple statistical parameters of period, significant wave height and zero up-crossing period. The elegance of these parameters is that they are distribution free. These parameters provide a means for defining the wave energy distribution function by employing Chebyshev's inequality. Subsequently, an approximate probability distribution function of the wave energy is also derived for assessment of risk and reliability associated with wave energy. Necessary simple charts are given for risk and reliability assessments. Two procedures are presented for such assessments in wave energy calculations and the applications of these procedures are provided for wave energy potential assessment in the regions of the Pacific Ocean off the west coast of U.S. (author)
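The distribution-free reasoning in this abstract can be illustrated with sample statistics and Chebyshev's inequality. The sketch assumes the standard deep-water expression for wave power per metre of crest, which may differ from the paper's own formulation; the sea water density, function names, and the height and period samples are hypothetical.

```python
import math
import statistics

RHO = 1025.0  # sea water density in kg/m^3 (assumed)
G = 9.81      # gravitational acceleration in m/s^2


def wave_power(height_m, period_s):
    """Deep-water wave power per metre of crest length, in W/m."""
    return RHO * G**2 * height_m**2 * period_s / (64.0 * math.pi)


def chebyshev_bound(deviation, std):
    """Upper bound on P(|X - mean| >= deviation), capped at 1."""
    if deviation <= 0:
        return 1.0
    return min(1.0, (std / deviation) ** 2)


# Hypothetical paired observations of wave height (m) and period (s).
heights = [1.2, 2.0, 1.6, 2.8, 1.9, 2.3]
periods = [6.0, 7.5, 6.8, 9.0, 7.2, 8.1]

powers = [wave_power(h, t) for h, t in zip(heights, periods)]
mean_power = statistics.fmean(powers)
std_power = statistics.stdev(powers)
coefficient_of_variation = std_power / mean_power

# Probability of straying two standard deviations or more from the mean.
risk_bound = chebyshev_bound(2.0 * std_power, std_power)
print(mean_power, coefficient_of_variation, risk_bound)
```

Because Chebyshev's inequality needs only the mean and variance, the resulting risk bound holds whatever the joint distribution of height and period, at the price of being conservative.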
Nick, Todd G
2007-01-01
Statistics is defined by the Medical Subject Headings (MeSH) thesaurus as the science and art of collecting, summarizing, and analyzing data that are subject to random variation. The two broad categories of summarizing and analyzing data are referred to as descriptive and inferential statistics. This chapter considers the science and art of summarizing data where descriptive statistics and graphics are used to display data. In this chapter, we discuss the fundamentals of descriptive statistics, including describing qualitative and quantitative variables. For describing quantitative variables, measures of location and spread, for example the standard deviation, are presented along with graphical presentations. We also discuss distributions of statistics, for example the variance, as well as the use of transformations. The concepts in this chapter are useful for uncovering patterns within the data and for effectively presenting the results of a project.
Peng, Chengtao; Qiu, Bensheng; Zhang, Cheng; Ma, Changyu; Yuan, Gang; Li, Ming
2017-07-01
Over the years, X-ray computed tomography (CT) has been successfully used in clinical diagnosis. However, when the body of the patient to be examined contains metal objects, the reconstructed image is polluted by severe metal artifacts, which affect the doctor's diagnosis of disease. In this work, we propose a dynamic re-weighted total variation (DRWTV) technique combined with the statistical iterative reconstruction (SIR) method to reduce the artifacts. The DRWTV method is based on the total variation (TV) and re-weighted total variation (RWTV) techniques, but it provides a sparser representation than TV and protects the tissue details better than RWTV. Besides, the DRWTV can suppress the artifacts and noise, and the SIR convergence speed is also accelerated. The performance of the algorithm is tested on both a simulated phantom dataset and a clinical dataset, which are a teeth phantom with two metal implants and a skull with three metal implants, respectively. The proposed algorithm (SIR-DRWTV) is compared with two traditional iterative algorithms, which are SIR and SIR constrained by RWTV regularization (SIR-RWTV). The results show that the proposed algorithm has the best performance in reducing metal artifacts and protecting tissue details.
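For readers unfamiliar with the two penalties that DRWTV builds on, the sketch below computes plain TV and a generic re-weighted TV for a one-dimensional signal. It does not implement the authors' dynamic scheme, whose details are not given in the abstract; the weighting rule (inverse gradient magnitude of a reference signal plus a small constant), the function names, and the sample signal are assumptions.

```python
def total_variation(signal):
    """Sum of absolute differences between neighbouring samples."""
    return sum(abs(b - a) for a, b in zip(signal, signal[1:]))


def reweighted_total_variation(signal, reference, eps=1e-3):
    """Weighted TV with weights 1 / (|gradient of reference| + eps).

    The reference is typically the previous iterate of a reconstruction.
    Large reference gradients (edges) get small weights, so they are
    penalized less than small fluctuations.
    """
    total = 0.0
    for i in range(len(signal) - 1):
        weight = 1.0 / (abs(reference[i + 1] - reference[i]) + eps)
        total += weight * abs(signal[i + 1] - signal[i])
    return total


# Hypothetical piecewise-constant signal with one edge and slight noise.
previous_iterate = [0.0, 0.0, 0.0, 1.0, 1.0, 1.0]
current_iterate = [0.0, 0.02, 0.0, 1.0, 0.98, 1.0]

print(total_variation(current_iterate))
print(reweighted_total_variation(current_iterate, previous_iterate))
```

In this toy case the re-weighted penalty is dominated by the small fluctuations away from the edge, which is the behaviour that makes re-weighting promote sparser gradients than plain TV.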
Statistical implications in Monte Carlo depletions - 051
International Nuclear Information System (INIS)
Zhiwen, Xu; Rhodes, J.; Smith, K.
2010-01-01
As a result of steady advances in computer power, continuous-energy Monte Carlo depletion analysis is attracting considerable attention for reactor burnup calculations. The typical Monte Carlo analysis is set up as a combination of a Monte Carlo neutron transport solver and a fuel burnup solver. Note that the burnup solver is a deterministic module. The statistical errors in Monte Carlo solutions are introduced into nuclide number densities and propagated along fuel burnup. This paper aims to understand the statistical implications in Monte Carlo depletions, including both statistical bias and statistical variations in depleted fuel number densities. The deterministic Studsvik lattice physics code, CASMO-5, is modified to model the Monte Carlo depletion. The statistical bias in depleted number densities is found to be negligible compared to its statistical variations, which, in turn, demonstrates the correctness of the Monte Carlo depletion method. Meanwhile, the statistical variation in number densities generally increases with burnup. Several possible ways of reducing the statistical errors are discussed: 1) to increase the number of individual Monte Carlo histories; 2) to increase the number of time steps; 3) to run additional independent Monte Carlo depletion cases. Finally, a new Monte Carlo depletion methodology, called the batch depletion method, is proposed, which consists of performing a set of independent Monte Carlo depletions and is thus capable of estimating the overall statistical errors including both the local statistical error and the propagated statistical error. (authors)
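The idea behind running independent depletion cases can be illustrated with a toy model in which a single nuclide is depleted with a noisy reaction rate at each time step, standing in for the statistical error of a Monte Carlo transport solve. Everything here (the one-nuclide model, the noise level, the function names, and the seeds) is a hypothetical simplification and not the CASMO-5 based procedure of the paper.

```python
import math
import random
import statistics


def deplete_once(rng, steps=20, dt=1.0, rate=0.05, rel_noise=0.02, n0=1.0):
    """Deplete one nuclide; each step uses a rate with random relative error."""
    density = n0
    for _ in range(steps):
        noisy_rate = rate * (1.0 + rng.gauss(0.0, rel_noise))
        density *= math.exp(-noisy_rate * dt)
    return density


def batch_depletion(n_batches=50, seed=12345):
    """Run independent depletions and summarize the final number density."""
    finals = [deplete_once(random.Random(seed + b)) for b in range(n_batches)]
    mean = statistics.fmean(finals)
    spread = statistics.stdev(finals)  # includes propagated step errors
    return mean, spread, spread / math.sqrt(n_batches)


mean_density, batch_std, std_error = batch_depletion()
print(mean_density, batch_std, std_error)
```

The spread across batches reflects errors accumulated over all time steps, which a single run cannot reveal from its own per-step uncertainty estimates.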
Reducing the variation in animal models by standardizing the gut microbiota
DEFF Research Database (Denmark)
Ellekilde, Merete; Hufeldt, Majbritt Ravn; Hansen, Camilla Hartmann Friis
2011-01-01
, a large proportion of laboratory animals are used to study such diseases, but inter-individual variation in these animal models leads to the need for larger group sizes to reach statistical significance and adequate power. By standardizing the microbial and immunological status of laboratory animals we...... mice changed the glucose tolerance without affecting weight or mucosal immunity. Further investigations concerning the mechanisms of how GM influences disease development is necessary, but based on these results it seems reasonable to assume that by manipulating the GM we may produce animal models...... may therefore be able to produce animals with a more standardized response and less variation. This would lead to more precise results and a reduced number of animals needed for statistical significance. Denaturing gradient gel electrophoresis (DGGE) - a culture independent approach separating PCR...
Statistical program for the data evaluation of a thermal ionization mass spectrometer
Energy Technology Data Exchange (ETDEWEB)
van Raaphorst, J. G.
1978-12-15
A computer program has been written to statistically analyze mass spectrometer measurements. The program tests whether the difference between signal and background intensities is statistically significant, corrects for signal drift in the measured values, and calculates ratios against the main isotope from the corrected intensities. Repeated ratio value measurements are screened for outliers using the Dixon statistical test. Means of ratios and the coefficient of variation are calculated and reported. The computer program is written in Basic and is available for anyone who is interested.
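Two of the steps listed in this abstract, outlier screening with the Dixon test and the coefficient of variation, can be sketched as below. The original program was written in Basic and its details are not given, so this is an independent illustration; the function names and the ratio values are hypothetical, and the critical value 0.710 is the commonly tabulated two-sided 95% value for five observations, which should be checked against a published table before use.

```python
import statistics


def dixon_q(values):
    """Return (Q statistic, suspect value) for the most extreme observation."""
    if len(values) < 3:
        raise ValueError("Dixon's test needs at least three values")
    ordered = sorted(values)
    data_range = ordered[-1] - ordered[0]
    if data_range == 0:
        return 0.0, ordered[0]
    q_low = (ordered[1] - ordered[0]) / data_range
    q_high = (ordered[-1] - ordered[-2]) / data_range
    if q_high >= q_low:
        return q_high, ordered[-1]
    return q_low, ordered[0]


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean."""
    return statistics.stdev(values) / statistics.fmean(values)


# Hypothetical repeated isotope ratio measurements.
ratios = [0.00725, 0.00727, 0.00726, 0.00724, 0.00760]
Q_CRIT_N5_95 = 0.710  # assumed tabulated value, n = 5, 95% confidence

q_stat, suspect = dixon_q(ratios)
if q_stat > Q_CRIT_N5_95:
    screened = [r for r in ratios if r != suspect]
else:
    screened = list(ratios)

print(q_stat, suspect, coefficient_of_variation(screened))
```

Screening before averaging matters here because a single aberrant ratio inflates both the mean and the coefficient of variation reported for the sample.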
Bastianelli, Carole; Ali, Adam A.; Beguin, Julien; Bergeron, Yves; Grondin, Pierre; Hély, Christelle; Paré, David
2017-07-01
At the northernmost extent of the managed forest in Quebec, Canada, the boreal forest is currently undergoing an ecological transition between two forest ecosystems. Open lichen woodlands (LW) are spreading southward at the expense of more productive closed-canopy black spruce-moss forests (MF). The objective of this study was to investigate whether soil properties could distinguish MF from LW in the transition zone where both ecosystem types coexist. This study brings out clear evidence that differences in vegetation cover can lead to significant variations in soil physical and geochemical properties. Here, we showed that soil carbon, exchangeable cations, and iron and aluminium crystallinity vary between boreal closed-canopy forests and open lichen woodlands, likely as a result of variations in soil microclimatic conditions. All the soils studied were typical podzolic soil profiles evolved from glacial till deposits that shared a similar texture of the C layer. However, soil humus and the B layer varied in thickness and chemistry between the two forest ecosystems at the pedon scale. Multivariate analyses of variance were used to evaluate how soil properties could help distinguish the two types at the site scale. MF humus (FH horizons composing the O layer) showed significantly higher concentrations of organic carbon and nitrogen and of the main exchangeable base cations (Ca, Mg) than LW soils. The B horizon of LW sites held higher concentrations of total Al and Fe oxides and particularly greater concentrations of inorganic amorphous Fe oxides than MF mineral soils, while showing a thinner B layer. Overall, our results show that MF store three times more organic carbon in their soils (B+FH horizons, roots apart) than LW. We suggest that variations in soil properties between MF and LW are linked to a cascade of events involving the impacts of natural disturbances such as wildfires on forest regeneration that determines the vegetation structure (stand density
International Nuclear Information System (INIS)
Hurkmans, Coen; Admiraal, Marjan; Sangen, Maurice van der; Dijkmans, Ingrid
2009-01-01
Background and purpose: Nowadays, many departments introduce CT images for breast irradiation techniques, aiming to obtain a better accuracy in the definition of the relevant target volumes. However, the definition of the breast boost volume based on CT images requires further investigation, because it may not only vary between observers, but it may also change during the course of treatment. This study aims to quantify the variability of the CT based visible boost volume (VBV) during the course of treatment in relation to the variability between observers. Materials and methods: Ten patients with stage T1-2 invasive breast cancer treated with breast-conserving surgery and post-surgical radiotherapy were included in this study. In addition to the regular planning CT, which is obtained several days prior to radiotherapy, three additional CT scans were acquired 3, 5 and 7 weeks after the planning CT scan. Four radiation oncologists delineated the VBV in all scans. Conformity of the delineations was analysed both between observers, and between scans taken at different periods of the radiotherapy treatment. Results: The VBV averaged over all patients decreased during the course of the treatment from an initial 40 cm³ to 28 cm³, 27 cm³ and 25 cm³ after 3, 5 and 7 weeks, respectively. Assuming the VBV to be spherical, this corresponds to a reduction in diameter of 5-6 mm. More detailed analysis revealed that this reduction was more pronounced when radiotherapy started within 30 days after surgery. These boost volume changes over time were found to be significant (p = 0.02) even in the presence of interobserver variations. Moreover, the conformity index (CI) for the volume changes was of the same magnitude as the conformity index for the interobserver variation (0.25 and 0.31, respectively). Conclusions: Breast boost volume variations during a course of radiotherapy are significant in relation to current clinical interobserver variations. This is an important
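The diameter reduction quoted in this abstract follows from the sphere volume formula V = (pi/6) d^3, which can be checked directly from the reported mean volumes. The function and variable names below are our own; the volumes are the values given in the abstract.

```python
import math


def sphere_diameter_cm(volume_cm3):
    """Diameter of a sphere with the given volume, from V = (pi / 6) * d**3."""
    return (6.0 * volume_cm3 / math.pi) ** (1.0 / 3.0)


# Mean visible boost volumes (cm^3) by weeks after the planning CT.
volumes_by_week = {0: 40.0, 3: 28.0, 5: 27.0, 7: 25.0}

baseline_cm = sphere_diameter_cm(volumes_by_week[0])
reduction_mm = {
    week: 10.0 * (baseline_cm - sphere_diameter_cm(volume))
    for week, volume in volumes_by_week.items()
    if week > 0
}
print(baseline_cm, reduction_mm)
```

Under the spherical assumption the computed reductions run from a little under 5 mm at week 3 to a little over 6 mm at week 7, in line with the 5-6 mm stated above.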
PPARGC1A sequence variation and cardiovascular risk-factor levels
DEFF Research Database (Denmark)
Brito, E C; Vimaleswaran, K S; Brage, S
2009-01-01
.005; rs13117172, p = 0.008) and fasting glucose concentrations (rs7657071, p = 0.002). None remained significant after correcting for the number of statistical comparisons. We proceeded by testing for gene x physical activity interactions for the polymorphisms that showed nominal evidence of association...... in the main effect models. None of these tests was statistically significant. CONCLUSIONS/INTERPRETATION: Variants at PPARGC1A may influence several metabolic traits in this European paediatric cohort. However, variation at PPARGC1A is unlikely to have a major impact on cardiovascular or metabolic health...
Seasonal variation of radon concentrations in UK homes
International Nuclear Information System (INIS)
Miles, J C H; Howarth, C B; Hunter, N
2012-01-01
The patterns of seasonal variation of radon concentrations were measured in 91 homes in five regions of the UK over a period of two years. The results showed that there was no significant difference between the regions in the pattern or magnitude of seasonal variation in radon concentrations. The arithmetic mean variation was found to be close to that found previously in the UK national survey. Differences in the pattern between the two years of the study were not significant. Two-thirds of homes in the study followed the expected pattern of high radon in the winter and low radon in the summer. Most of the rest showed little seasonal variation, and a few showed a reversed seasonal pattern. The study does not provide any clear evidence for the recorded house characteristics having an effect on the seasonal variation in radon concentrations in UK homes, though the statistical power for determining such effects is limited in this study. The magnitude of the seasonal variation varied widely between homes. Analysis of the individual results from the homes showed that because of the wide variation in the amount of seasonal variation, applying seasonal correction factors to the results of three-month measurements can yield only relatively small improvements in the accuracy of estimates of annual mean concentrations.
Indirectional statistics and the significance of an asymmetry discovered by Birch
International Nuclear Information System (INIS)
Kendall, D.G.; Young, G.A.
1984-01-01
Birch (1982, Nature, 298, 451) reported an apparent 'statistical asymmetry of the Universe'. The authors here develop 'indirectional analysis' as a technique for investigating statistical effects of this kind and conclude that the reported effect (whatever may be its origin) is strongly supported by the observations. The estimated pole of the asymmetry is at RA 13h 30m, Dec. -37deg. The angular error in its estimation is unlikely to exceed 20-30deg. (author)
International Nuclear Information System (INIS)
Daziano, C.
2010-01-01
Statistical analysis of trace elements in volcanics research allowed two independent populations with the same geochemical environment to be distinguished. For each component they have a variable index of homogeneity, resulting in dissimilar average values that reveal intratelluric geochemical phenomena. On the other hand, the inhomogeneities observed in these rocks, as reflected in their petrochemical characters, could be exacerbated, especially given the remote and dispersed location of their pitches and their relations with the enclosing rocks over the ranges of compositional variation, owing to differences in relative ages
Multivariate Statistical Process Control
DEFF Research Database (Denmark)
Kulahci, Murat
2013-01-01
As sensor and computer technology continues to improve, it has become normal to be confronted with high-dimensional data sets. As in many areas of industrial statistics, this brings forth various challenges in statistical process control (SPC) and monitoring for which the aim...... is to identify the “out-of-control” state of a process using control charts in order to reduce the excessive variation caused by so-called assignable causes. In practice, the most common method of monitoring multivariate data is through a statistic akin to Hotelling’s T². For high-dimensional data with excessive...... amount of cross correlation, practitioners are often recommended to use latent structure methods such as Principal Component Analysis to summarize the data in only a few linear combinations of the original variables that capture most of the variation in the data. Applications of these control charts...
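As a rough illustration of the Hotelling-type statistic mentioned above, the sketch below computes T² = dᵀS⁻¹d for a single new observation against in-control reference data. It is a minimal two-variable version with the 2×2 inverse written out by hand; the function name `hotelling_t2` and the reference data are hypothetical, and no control limit is derived.

```python
from statistics import mean

def hotelling_t2(reference, new_obs):
    """T^2 distance of one 2-variable observation from in-control reference data."""
    xs = [row[0] for row in reference]
    ys = [row[1] for row in reference]
    mx, my = mean(xs), mean(ys)
    n = len(reference)
    # Entries of the sample covariance matrix S (divisor n - 1).
    sxx = sum((x - mx) ** 2 for x in xs) / (n - 1)
    syy = sum((y - my) ** 2 for y in ys) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in reference) / (n - 1)
    det = sxx * syy - sxy ** 2
    dx, dy = new_obs[0] - mx, new_obs[1] - my
    # d^T S^{-1} d, using the closed-form inverse of a 2x2 matrix.
    return (syy * dx * dx - 2.0 * sxy * dx * dy + sxx * dy * dy) / det

# Hypothetical in-control data in which the two variables rise together.
reference = [(1, 2), (2, 3), (3, 5), (4, 4), (5, 6)]

t2_along = hotelling_t2(reference, (5, 6))    # consistent with the correlation
t2_against = hotelling_t2(reference, (5, 2))  # breaks the correlation
print(t2_along, t2_against)
```

Both test points lie equally far from the mean in each variable taken separately, yet the one that breaks the correlation pattern gets a much larger T². That is the property that makes such a statistic useful for cross-correlated data.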
Rococo, E; Mazouni, C; Or, Z; Mobillion, V; Koon Sun Pat, M; Bonastre, J
2016-01-01
Minimum volume thresholds were introduced in France in 2008 to improve the quality of cancer care. We investigated whether/how the quality of treatment decisions in breast cancer surgery had evolved before and after this policy was implemented. We used Hospital Episode Statistics for all women having undergone breast conserving surgery (BCS) or mastectomy in France in 2005 and 2012. Three surgical procedures considered as better treatment options were analyzed: BCS, immediate breast reconstruction (IBR) and sentinel lymph node biopsy (SLNB). We studied the mean rates and variation according to the hospital profile and volume. Between 2005 and 2012, the volume of breast cancer surgery increased by 11% whereas one third of the hospitals no longer performed this type of surgery. In 2012, the mean rate of BCS was 74% and similar in all hospitals whatever the volume. Conversely, IBR and SLNB rates were much higher in cancer centers (CC) and regional teaching hospitals (RTH) [IBR: 19% and 14% versus 8% on average; SLNB: 61% and 47% versus 39% on average]; the greater the hospital volume, the higher the IBR and SLNB rates (p < 0.0001). Overall, whatever the surgical procedure considered, inter-hospital variation in rates declined substantially in CC and RTH. We identified considerable variation in IBR and SLNB rates between French hospitals. Although more complex and less standardized than BCS, most clinical guidelines recommended these procedures. This apparent heterogeneity suggests unequal access to high-quality procedures for women with breast cancer. Copyright © 2015 Elsevier Ltd. All rights reserved.
Alves, Gelio; Wang, Guanghui; Ogurtsov, Aleksey Y; Drake, Steven K; Gucek, Marjan; Sacks, David B; Yu, Yi-Kuo
2018-06-05
Rapid and accurate identification and classification of microorganisms is of paramount importance to public health and safety. With the advance of mass spectrometry (MS) technology, the speed of identification can be greatly improved. However, the increasing number of microbes sequenced is complicating correct microbial identification even in a simple sample due to the large number of candidates present. To properly untwine candidate microbes in samples containing one or more microbes, one needs to go beyond apparent morphology or simple "fingerprinting"; to correctly prioritize the candidate microbes, one needs to have accurate statistical significance in microbial identification. We meet these challenges by using peptide-centric representations of microbes to better separate them and by augmenting our earlier analysis method that yields accurate statistical significance. Here, we present an updated analysis workflow that uses tandem MS (MS/MS) spectra for microbial identification or classification. We have demonstrated, using 226 MS/MS publicly available data files (each containing from 2500 to nearly 100,000 MS/MS spectra) and 4000 additional MS/MS data files, that the updated workflow can correctly identify multiple microbes at the genus and often the species level for samples containing more than one microbe. We have also shown that the proposed workflow computes accurate statistical significances, i.e., E values for identified peptides and unified E values for identified microbes. Our updated analysis workflow MiCId, a freely available software for Microorganism Classification and Identification, is available for download at https://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads.html.
Effect of posture on the diurnal variation in clinically significant diabetic macular edema.
Polito, Antonio; Polini, Giovanni; Chiodini, Raffaella Gortana; Isola, Miriam; Soldano, Franca; Bandello, Francesco
2007-07-01
To investigate the role of posture and other systemic factors in the diurnal variation of clinically significant diabetic macular edema (CSDME). Ten eyes of 10 diabetic subjects with CSDME underwent four OCT foveal thickness measurements with StratusOCT at 9 AM and 12, 3, and 6 PM consecutively on two different days, with the subject in an upright position on one and in a recumbent position on the other. For the "recumbent-position" measurements, the patients were admitted the night before and remained in bed during the entire day of testing. Clinical laboratory results at baseline included HbA1c, urinary albumin, and serum creatinine. Refraction and Early Treatment Diabetic Retinopathy Study (ETDRS) visual acuity were also measured before each OCT measurement was taken. Variations in blood pressure, body temperature, plasma glucose, renin, aldosterone, and cortisol levels were measured and then correlated with macular thickness. Foveal thickening decreased in all cases over the course of the day. The decrease, however, was significantly greater for the upright-position measurements (relative mean +/- SD decrease of 20.6% +/- 6.5% in the upright position and 6.2% +/- 4.6% in the recumbent position). Visual acuity improved by at least 1 ETDRS line in three eyes in the upright position as opposed to only one eye in the recumbent position. There seemed to be no association between any of the systemic factors studied and foveal thickening, with the exception of cortisol. The results support the hypothesis that posture and hydrostatic pressure play a major role in determining time-related shifts in CSDME and suggest that the forces of Starling's law can, in part, account for CSDME formation.
Anatomical variations in dorsal metatarsal arteries with surgical significance: A cadaveric study
Directory of Open Access Journals (Sweden)
Preeti Shivshankar Awari
2017-01-01
Introduction: Based on the angiosome concept, to revascularize a particular artery the microvascular and reconstructive surgeons must know the anatomy and variations in the arteries in that specific region of the body to achieve better results. Nowadays, dorsal metatarsal artery (DMTA) perforator flaps and toe grafts are becoming popular, which also demands adequate information about normal anatomy and variants in these arteries for fruitful results. Materials and Methods: The authors studied normal anatomy and variations in the origin of DMTAs in 50 lower extremities of 25 embalmed cadavers. Results: The authors found many variations, such as the absence of DMTAs and origin of the DMTA from the deep plantar arch. Wherever the arcuate artery was absent, the lateral tarsal artery gave rise to the dorsal metatarsal arteries. Conclusion: Being familiar with the incidence of anatomical variations in the origin of the DMTAs can increase vigilance in vascular and reconstructive surgeries, leading to better prognosis.
Automated robust generation of compact 3D statistical shape models
Vrtovec, Tomaz; Likar, Bostjan; Tomazevic, Dejan; Pernus, Franjo
2004-05-01
Ascertaining the detailed shape and spatial arrangement of anatomical structures is important not only within diagnostic settings but also in the areas of planning, simulation, intraoperative navigation, and tracking of pathology. Robust, accurate and efficient automated segmentation of anatomical structures is difficult because of their complexity and inter-patient variability. Furthermore, the position of the patient during image acquisition, the imaging device and protocol, image resolution, and other factors induce additional variations in shape and appearance. Statistical shape models (SSMs) have proven quite successful in capturing structural variability. A possible approach to obtain a 3D SSM is to extract reference voxels by precisely segmenting the structure in a single reference image. The corresponding voxels in other images are determined by registering the reference image to each of the other images. The SSM obtained in this way describes statistically plausible shape variations over the given population as well as variations due to imperfect registration. In this paper, we present a completely automated method that significantly reduces shape variations induced by imperfect registration, thus allowing a more accurate description of variations. At each iteration, the derived SSM is used for coarse registration, which is further improved by describing finer variations of the structure. The method was tested on 64 lumbar spinal column CT scans, from which 23, 38, 45, 46 and 42 volumes of interest containing vertebra L1, L2, L3, L4 and L5, respectively, were extracted. Separate SSMs were generated for each vertebra. The results show that the method is capable of reducing the variations induced by registration errors.
Pompei-Reynolds, Renée C; Kanavakis, Georgios
2014-08-01
The manufacturing process for copper-nickel-titanium archwires is technique sensitive. The primary aim of this investigation was to examine the interlot consistency of the mechanical properties of copper-nickel-titanium wires from 2 manufacturers. Wires of 2 sizes (0.016 and 0.016 × 0.022 in) and 3 advertised austenite finish temperatures (27°C, 35°C, and 40°C) from 2 manufacturers were tested for transition temperature ranges and force delivery using differential scanning calorimetry and the 3-point bend test, respectively. Variations of these properties were analyzed for statistical significance by calculating the F statistic for equality of variances for transition temperature and force delivery in each group of wires. All statistical analyses were performed at the 0.05 level of significance. Statistically significant interlot variations in austenite finish were found for the 0.016 in/27°C (P = 0.041) and 0.016 × 0.022 in/35°C (P = 0.048) wire categories, and in austenite start for the 0.016 × 0.022 in/35°C wire category (P = 0.01). In addition, significant variations in force delivery were found between the 2 manufacturers for the 0.016 in/27°C (P = 0.002), 0.016 in/35.0°C (P = 0.049), and 0.016 × 0.022 in/35°C (P = 0.031) wires. Orthodontic wires of the same material, dimension, and manufacturer but from different production lots do not always have similar mechanical properties. Clinicians should be aware that copper-nickel-titanium wires might not always deliver the expected force, even when they come from the same manufacturer, because of interlot variations in the performance of the material. Copyright © 2014 American Association of Orthodontists. Published by Mosby, Inc. All rights reserved.
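The F statistic for equality of variances named in the abstract above is the ratio of two sample variances. The sketch below is illustrative only: the temperature readings for the two lots are invented, the function name `variance_ratio_f` is hypothetical, and no p-value is computed (that would require the F distribution with the returned degrees of freedom, for instance from a statistics library).

```python
from statistics import variance

def variance_ratio_f(sample_a, sample_b):
    """Return (F, df_numerator, df_denominator) with the larger variance on top."""
    var_a, var_b = variance(sample_a), variance(sample_b)
    if var_a >= var_b:
        return var_a / var_b, len(sample_a) - 1, len(sample_b) - 1
    return var_b / var_a, len(sample_b) - 1, len(sample_a) - 1

# Invented austenite finish temperatures (deg C) for two production lots.
lot_1 = [26.8, 27.1, 27.0, 26.9, 27.2]  # tightly clustered around 27
lot_2 = [26.0, 28.0, 27.0, 26.5, 27.5]  # same mean, wider spread

f_stat, df_num, df_den = variance_ratio_f(lot_1, lot_2)
print(f"F = {f_stat:.2f} on ({df_num}, {df_den}) degrees of freedom")
```

Both invented lots have the same mean, so a comparison of means would see no difference. The variance ratio is what picks up the kind of interlot inconsistency the study is concerned with.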
Energy Technology Data Exchange (ETDEWEB)
Crow, C.J.
1985-01-01
Middle Ordovician age Chickamauga Group carbonates crop out along the Birmingham and Murphrees Valley anticlines in central Alabama. The macrofossil contents on exposed surfaces of seven bioherms have been counted to determine their various paleontologic characteristics. Twelve groups of organisms are present in these bioherms. Dominant organisms include bryozoans, algae, brachiopods, sponges, pelmatozoans, stromatoporoids and corals. Minor accessory fauna include predators, scavengers and grazers such as gastropods, ostracods, trilobites, cephalopods and pelecypods. Vertical and horizontal niche zonation has been detected for some of the bioherm dwelling fauna. No one bioherm of those studied exhibits all 12 groups of organisms; rather, individual bioherms display various subsets of the total diversity. Statistical treatment (G-test) of the diversity data indicates a lack of statistical homogeneity of the bioherms, both within and between localities. Between-locality population heterogeneity can be ascribed to differences in biologic responses to such gross environmental factors as water depth and clarity, and energy levels. At any one locality, gross aspects of the paleoenvironments are assumed to have been more uniform. Significant differences among bioherms at any one locality may have resulted from patchy distribution of species populations, differential preservation and other factors.
Detecting Statistically Significant Communities of Triangle Motifs in Undirected Networks
2016-04-26
Systems, Statistics & Management Science, University of Alabama, USA. DISTRIBUTION A: Distribution approved for public release. Contents: Summary; Application to Real Networks (2012 FBS Football Schedule Network).
Complication rates of ostomy surgery are high and vary significantly between hospitals.
Sheetz, Kyle H; Waits, Seth A; Krell, Robert W; Morris, Arden M; Englesbe, Michael J; Mullard, Andrew; Campbell, Darrell A; Hendren, Samantha
2014-05-01
Ostomy surgery is common and has traditionally been associated with high rates of morbidity and mortality, suggesting an important target for quality improvement. The purpose of this work was to evaluate the variation in outcomes after ostomy creation surgery within Michigan to identify targets for quality improvement. This was a retrospective cohort study. The study took place within the 34-hospital Michigan Surgical Quality Collaborative. Patients included were those undergoing ostomy creation surgery between 2006 and 2011. We evaluated hospital morbidity and mortality rates after risk adjustment (age, comorbidities, emergency vs elective, and procedure type). A total of 4250 patients underwent ostomy creation surgery; 3866 procedures (91.0%) were open and 384 (9.0%) were laparoscopic. Unadjusted morbidity and mortality rates were 43.9% and 10.7%. Unadjusted morbidity rates for specific procedures ranged from 32.7% for ostomy-creation-only procedures to 47.8% for Hartmann procedures. Risk-adjusted morbidity rates varied significantly between hospitals, ranging from 31.2% (95% CI, 18.4-43.9) to 60.8% (95% CI, 48.9-72.6). There were 5 statistically significant high-outlier hospitals and 3 statistically significant low-outlier hospitals for risk-adjusted morbidity. The pattern of complication types was similar between high- and low-outlier hospitals. Case volume, operative duration, and use of laparoscopic surgery did not explain the variation in morbidity rates across hospitals. This work was limited by its retrospective study design, by unmeasured variation in case severity, and by our inability to differentiate between colostomies and ileostomies because of the use of Current Procedural Terminology codes. Morbidity and mortality rates for modern ostomy surgery are high. Although this type of surgery has received little attention in healthcare policy, these data reveal that it is both common and uncommonly morbid. Variation in hospital performance provides an
[Hydrologic variability and sensitivity based on Hurst coefficient and Bartels statistic].
Lei, Xu; Xie, Ping; Wu, Zi Yi; Sang, Yan Fang; Zhao, Jiang Yan; Li, Bin Bin
2018-04-01
Owing to global climate change and frequent human activities in recent years, the purely stochastic component of a hydrological sequence is mixed with one or more variation components, including jump, trend, period and dependency. It is urgently necessary to clarify which indices should be used to quantify the degree of their variability. In this study, we defined hydrological variability based on the Hurst coefficient and the Bartels statistic, and used Monte Carlo statistical tests to analyze their sensitivity to different variation types. When the hydrological sequence had jump or trend variation, both the Hurst coefficient and the Bartels statistic could reflect the variation, with the Hurst coefficient being more sensitive to weak jump or trend variation. When the sequence had a period, only the Bartels statistic could detect the variation of the sequence. When the sequence had a dependency, both the Hurst coefficient and the Bartels statistic could reflect the variation, with the latter able to detect weaker dependent variations. For all four variation types, both the Hurst variability and the Bartels variability increased as the variation range increased. Thus, they could be used to measure the variation intensity of the hydrological sequence. We analyzed the temperature series of different weather stations in the Lancang River basin. Results showed that the temperature at all stations showed an upward trend or jump, indicating that the entire basin had experienced warming in recent years and that the temperature variability in the upper and lower reaches was much higher. This case study showed the practicability of the proposed method.
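For orientation, the Bartels statistic is commonly given as a rank version of von Neumann's ratio: the sum of squared differences between successive ranks divided by the sum of squared deviations of the ranks from their mean. The sketch below implements that textbook form only. It is not the variability index defined in the study, and the helper names `average_ranks` and `bartels_rvn` are hypothetical.

```python
def average_ranks(values):
    """Ranks starting at 1; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2.0 + 1.0
        for k in range(start, end + 1):
            ranks[order[k]] = shared_rank
        start = end + 1
    return ranks

def bartels_rvn(series):
    """Rank von Neumann ratio; assumes the series is not constant."""
    ranks = average_ranks(series)
    mean_rank = sum(ranks) / len(ranks)
    numerator = sum((ranks[i] - ranks[i + 1]) ** 2 for i in range(len(ranks) - 1))
    denominator = sum((r - mean_rank) ** 2 for r in ranks)
    return numerator / denominator

trending = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]      # steady upward trend
alternating = [1, 10, 2, 9, 3, 8, 4, 7, 5, 6]   # strong oscillation

rvn_trend = bartels_rvn(trending)
rvn_alternating = bartels_rvn(alternating)
print(rvn_trend, rvn_alternating)
```

A trending series gives a ratio well below 2 and a rapidly oscillating one gives a ratio well above 2, while a random sequence is expected to sit near 2. Departures from that value are what a randomness test built on this statistic looks for.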
International Nuclear Information System (INIS)
Tonchev, N.; Shumovskij, A.S.
1986-01-01
The history of investigations, conducted at the JINR in the field of statistical mechanics, beginning with the fundamental works by Bogolyubov N.N. on superconductivity microscopic theory is presented. Ideas, introduced in these works and methods developed in them, have largely determined the ways for developing statistical mechanics in the JINR and Hartree-Fock-Bogolyubov variational principle has become an important method of the modern nucleus theory. A brief review of the main achievements, connected with the development of statistical mechanics methods and their application in different fields of physical science is given
Statistical analysis of aerosol species, trace gasses, and meteorology in Chicago.
Binaku, Katrina; O'Brien, Timothy; Schmeling, Martina; Fosco, Tinamarie
2013-09-01
Both canonical correlation analysis (CCA) and principal component analysis (PCA) were applied to atmospheric aerosol and trace gas concentrations and meteorological data collected in Chicago during the summer months of 2002, 2003, and 2004. Concentrations of ammonium, calcium, nitrate, sulfate, and oxalate particulate matter, as well as, meteorological parameters temperature, wind speed, wind direction, and humidity were subjected to CCA and PCA. Ozone and nitrogen oxide mixing ratios were also included in the data set. The purpose of statistical analysis was to determine the extent of existing linear relationship(s), or lack thereof, between meteorological parameters and pollutant concentrations in addition to reducing dimensionality of the original data to determine sources of pollutants. In CCA, the first three canonical variate pairs derived were statistically significant at the 0.05 level. Canonical correlation between the first canonical variate pair was 0.821, while correlations of the second and third canonical variate pairs were 0.562 and 0.461, respectively. The first canonical variate pair indicated that increasing temperatures resulted in high ozone mixing ratios, while the second canonical variate pair showed wind speed and humidity's influence on local ammonium concentrations. No new information was uncovered in the third variate pair. Canonical loadings were also interpreted for information regarding relationships between data sets. Four principal components (PCs), expressing 77.0 % of original data variance, were derived in PCA. Interpretation of PCs suggested significant production and/or transport of secondary aerosols in the region (PC1). Furthermore, photochemical production of ozone and wind speed's influence on pollutants were expressed (PC2) along with overall measure of local meteorology (PC3). In summary, CCA and PCA results combined were successful in uncovering linear relationships between meteorology and air pollutants in Chicago and
Conducting tests for statistically significant differences using forest inventory data
James A. Westfall; Scott A. Pugh; John W. Coulston
2013-01-01
Many forest inventory and monitoring programs are based on a sample of ground plots from which estimates of forest resources are derived. In addition to evaluating metrics such as number of trees or amount of cubic wood volume, it is often desirable to make comparisons between resource attributes. To properly conduct statistical tests for differences, it is imperative...
Khankhet, Jordan; Vanderwolf, Karen J; McAlpine, Donald F; McBurney, Scott; Overy, David P; Slavic, Durda; Xu, Jianping
2014-01-01
Pseudogymnoascus destructans is the causative agent of an emerging infectious disease that threatens populations of several North American bat species. The fungal disease was first observed in 2006 and has since caused the death of nearly six million bats. The disease, commonly known as white-nose syndrome, is characterized by a cutaneous infection with P. destructans causing erosions and ulcers in the skin of nose, ears and/or wings of bats. Previous studies based on sequences from eight loci have found that isolates of P. destructans from bats in the US all belong to one multilocus genotype. Using the same multilocus sequence typing method, we found that isolates from eastern and central Canada also had the same genotype as those from the US, consistent with the clonal expansion of P. destructans into Canada. However, our PCR fingerprinting revealed that among the 112 North American isolates we analyzed, three, all from Canada, showed minor genetic variation. Furthermore, we found significant variations among isolates in mycelial growth rate; the production of mycelial exudates; and pigment production and diffusion into agar media. These phenotypic differences were influenced by culture medium and incubation temperature, indicating significant variation in environmental condition--dependent phenotypic expression among isolates of the clonal P. destructans genotype in North America.
Perneger, Thomas V; Combescure, Christophe
2017-07-01
Published P-values provide a window into the global enterprise of medical research. The aim of this study was to use the distribution of published P-values to estimate the relative frequencies of null and alternative hypotheses and to seek irregularities suggestive of publication bias. This cross-sectional study included P-values published in 120 medical research articles in 2016 (30 each from the BMJ, JAMA, Lancet, and New England Journal of Medicine). The observed distribution of P-values was compared with expected distributions under the null hypothesis (i.e., uniform between 0 and 1) and the alternative hypothesis (strictly decreasing from 0 to 1). P-values were categorized according to conventional levels of statistical significance and in one-percent intervals. Among 4,158 recorded P-values, 26.1% were highly significant (P values values equal to 1, and (3) about twice as many P-values less than 0.05 compared with those more than 0.05. The latter finding was seen in both randomized trials and observational studies, and in most types of analyses, excepting heterogeneity tests and interaction tests. Under plausible assumptions, we estimate that about half of the tested hypotheses were null and the other half were alternative. This analysis suggests that statistical tests published in medical journals are not a random sample of null and alternative hypotheses but that selective reporting is prevalent. In particular, significant results are about twice as likely to be reported as nonsignificant results. Copyright © 2017 Elsevier Inc. All rights reserved.
Alternative derivations of the statistical mechanical distribution laws.
Wall, F T
1971-08-01
A new approach is presented for the derivation of statistical mechanical distribution laws. The derivations are accomplished by minimizing the Helmholtz free energy under constant temperature and volume, instead of maximizing the entropy under constant energy and volume. An alternative method involves stipulating equality of chemical potential, or equality of activity, for particles in different energy levels. This approach leads to a general statement of distribution laws applicable to all systems for which thermodynamic probabilities can be written. The methods also avoid use of the calculus of variations, Lagrangian multipliers, and Stirling's approximation for the factorial. The results are applied specifically to Boltzmann, Fermi-Dirac, and Bose-Einstein statistics. The special significance of chemical potential and activity is discussed for microscopic systems.
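To make the chemical-potential route concrete, here is a textbook-style sketch rather than the paper's own derivation. The symbols are assumptions introduced for illustration: n_i particles occupy level i, which has energy ε_i and degeneracy g_i; μ is the common chemical potential, k is Boltzmann's constant and T the temperature. The per-level expressions for μ_i are quoted in their standard large-number forms, whereas the paper states that it avoids Stirling's approximation. Requiring μ_i = μ for every level then gives the three distribution laws.

```latex
\begin{align*}
\text{Boltzmann:} \quad
  \mu_i &= \varepsilon_i + kT \ln\frac{n_i}{g_i}
  &&\Longrightarrow\quad
  n_i = g_i \, e^{(\mu - \varepsilon_i)/kT} \\[4pt]
\text{Fermi-Dirac:} \quad
  \mu_i &= \varepsilon_i + kT \ln\frac{n_i}{g_i - n_i}
  &&\Longrightarrow\quad
  n_i = \frac{g_i}{e^{(\varepsilon_i - \mu)/kT} + 1} \\[4pt]
\text{Bose-Einstein:} \quad
  \mu_i &= \varepsilon_i + kT \ln\frac{n_i}{g_i + n_i}
  &&\Longrightarrow\quad
  n_i = \frac{g_i}{e^{(\varepsilon_i - \mu)/kT} - 1}
\end{align*}
```

In each case the only step is solving μ_i = μ for n_i, so no Lagrangian multiplier appears explicitly; μ plays the role that the multiplier would otherwise play.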
Advanced statistics for tokamak transport colinearity and tokamak to tokamak variation
International Nuclear Information System (INIS)
Riedel, K.S.
1989-01-01
This paper is an expository introduction to advanced statistics and scaling laws and their application to tokamak devices. Topics of discussion are as follows: implicit assumptions in the standard analysis; advanced regression techniques; specialized tools in statistics and their applications in fusion physics; and improved datasets for transport studies
Spatio-temporal statistical models with applications to atmospheric processes
International Nuclear Information System (INIS)
Wikle, C.K.
1996-01-01
This doctoral dissertation is presented as three self-contained papers. An introductory chapter considers traditional spatio-temporal statistical methods used in the atmospheric sciences from a statistical perspective. Although this section is primarily a review, many of the statistical issues considered have not been considered in the context of these methods and several open questions are posed. The first paper attempts to determine a means of characterizing the semiannual oscillation (SAO) spatial variation in the northern hemisphere extratropical height field. It was discovered that the midlatitude SAO in 500 hPa geopotential height could be explained almost entirely as a result of spatial and temporal asymmetries in the annual variation of stationary eddies. It was concluded that the mechanism for the SAO in the northern hemisphere is a result of land-sea contrasts. The second paper examines the seasonal variability of mixed Rossby-gravity waves (MRGW) in the lower stratosphere over the equatorial Pacific. Advanced cyclostationary time series techniques were used for analysis. It was found that there are significant twice-yearly peaks in MRGW activity. Analyses also suggested a convergence of horizontal momentum flux associated with these waves. In the third paper, a new spatio-temporal statistical model is proposed that attempts to consider the influence of both temporal and spatial variability. This method is mainly concerned with prediction in space and time, and provides a spatially descriptive and temporally dynamic model
Directory of Open Access Journals (Sweden)
Yuan, Ye; Ries, Ludwig; Petermeier, Hannes; Steinbacher, Martin; Gómez-Peláez, Angel J.; Leuenberger, Markus C.; Schumacher, Marcus; Trickl, Thomas; Couret, Cedric; Meinhardt, Frank; Menzel, Annette
2018-03-01
Critical data selection is essential for determining representative baseline levels of atmospheric trace gases even at remote measurement sites. Different data selection techniques have been used around the world, which could potentially lead to reduced compatibility when comparing data from different stations. This paper presents a novel statistical data selection method named adaptive diurnal minimum variation selection (ADVS) based on CO2 diurnal patterns typically occurring at elevated mountain stations. Its capability and applicability were studied on records of atmospheric CO2 observations at six Global Atmosphere Watch stations in Europe, namely, Zugspitze-Schneefernerhaus (Germany), Sonnblick (Austria), Jungfraujoch (Switzerland), Izaña (Spain), Schauinsland (Germany), and Hohenpeissenberg (Germany). Three other frequently applied statistical data selection methods were included for comparison. Among the studied methods, our ADVS method resulted in a lower fraction of data selected as a baseline with lower maxima during winter and higher minima during summer in the selected data. The measured time series were analyzed for long-term trends and seasonality by a seasonal-trend decomposition technique. In contrast to unselected data, mean annual growth rates of all selected datasets were not significantly different among the sites, except for the data recorded at Schauinsland. However, clear differences were found in the annual amplitudes as well as the seasonal time structure. Based on a pairwise analysis of correlations between stations on the seasonal-trend decomposed components by statistical data selection, we conclude that the baseline identified by the ADVS method is a better representation of lower free tropospheric (LFT) conditions than baselines identified by the other methods.
Methods and statistics for combining motif match scores.
Bailey, T L; Gribskov, M
1998-01-01
Position-specific scoring matrices are useful for representing and searching for protein sequence motifs. A sequence family can often be described by a group of one or more motifs, and an effective search must combine the scores for matching a sequence to each of the motifs in the group. We describe three methods for combining match scores and estimating the statistical significance of the combined scores and evaluate the search quality (classification accuracy) and the accuracy of the estimate of statistical significance of each. The three methods are: 1) sum of scores, 2) sum of reduced variates, 3) product of score p-values. We show that method 3) is superior to the other two methods in both regards, and that combining motif scores indeed gives better search accuracy. The MAST sequence homology search algorithm utilizing the product of p-values scoring method is available for interactive use and downloading at URL http://www.sdsc.edu/MEME.
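The third method, the product of score p-values, can be illustrated with a minimal sketch. It assumes the n p-values are independent and uniform under the null hypothesis, in which case the probability that their product is at most x is x multiplied by the sum of (-ln x)^i / i! for i = 0 to n-1. The function name `product_pvalue` is hypothetical, and the details of how MAST itself computes or corrects this quantity may differ.

```python
import math


def product_pvalue(pvalues):
    """Significance of the product of n independent p-values.

    Assumes each p-value is uniform on (0, 1] under the null hypothesis.
    Hypothetical helper for illustration; not taken from MAST.
    """
    n = len(pvalues)
    if n == 0:
        raise ValueError("need at least one p-value")
    if any(not (0.0 < p <= 1.0) for p in pvalues):
        raise ValueError("p-values must lie in (0, 1]")

    # Work with logs so that very small products do not underflow early.
    log_prod = sum(math.log(p) for p in pvalues)
    t = -log_prod  # t = -ln(product) >= 0

    # Accumulate sum_{i=0}^{n-1} t^i / i! term by term.
    term = 1.0
    total = 1.0
    for i in range(1, n):
        term *= t / i
        total += term

    return math.exp(log_prod) * total


if __name__ == "__main__":
    # Two motif matches, each individually at p = 0.05.
    print(product_pvalue([0.05, 0.05]))
```

With a single p-value the function returns that p-value unchanged, which is a quick sanity check on the formula.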
PINGU and the neutrino mass hierarchy: Statistical and systematical aspects
International Nuclear Information System (INIS)
Capozzi, F.; Marrone, A.; Lisi, E.
2016-01-01
The proposed PINGU project (Precision IceCube Next Generation Upgrade) is supposed to determine neutrino mass hierarchy through matter effects of atmospheric neutrinos crossing the Earth core and mantle, which leads to variations in the events spectrum in energy and zenith angle. The presence of non-negligible (and partly unknown) systematics on the spectral shape can make the statistical analysis particularly challenging in the limit of high statistics. Assuming plausible spectral shape uncertainties at the percent level (due to effective volume, cross section, resolution functions, oscillation parameters, etc.), we obtain a significant reduction in the sensitivity to the hierarchy. The obtained results show the importance of a dedicated research program aimed at a better characterization and reduction of the uncertainties in future high-statistics experiments with atmospheric neutrinos.
Atmospheric pressure variations and abdominal aortic aneurysm rupture.
LENUS (Irish Health Repository)
Killeen, S D
2012-02-03
BACKGROUND: Ruptured abdominal aortic aneurysm (RAAA) presents with increased frequency in the winter and spring months. Seasonal changes in atmospheric pressure mirror this pattern. AIM: To establish if there was a seasonal variation in the occurrence of RAAA and to determine if there was any association with atmospheric pressure changes. METHODS: A retrospective cohort-based study was performed. Daily atmospheric pressure readings for the region were obtained. RESULTS: There was a statistically significant monthly variation in RAAA presentation with 107 cases (52.5%) occurring from November to March. The monthly number of RAAA and the mean atmospheric pressure in the previous month were inversely related (r = -0.752, r² = 0.566, P = 0.03), and there was significantly greater daily atmospheric pressure variability on days when patients with RAAA were admitted. CONCLUSION: These findings suggest a relationship between atmospheric pressure and RAAA.
Directory of Open Access Journals (Sweden)
Helen R Griffin
Several previous studies have investigated the role of common promoter variants in the vascular endothelial growth factor (VEGF) gene in causing congenital cardiovascular malformation (CVM). However, results have been discrepant between studies and no study to date has comprehensively characterised variation throughout the gene. We genotyped 771 CVM cases, of whom 595 had the outflow tract malformation Tetralogy of Fallot (TOF), and carried out TDT and case-control analyses using haplotype-tagging SNPs in VEGF. We carried out a meta-analysis of previous case-control or family-based studies that had typed VEGF promoter SNPs, which included an additional 570 CVM cases. To identify rare variants potentially causative of CVM, we carried out mutation screening in all VEGF exons and splice sites in 93 TOF cases. There was no significant effect of any VEGF haplotype-tagging SNP on the risk of CVM in our analyses of 771 probands. When the results of this and all previous studies were combined, there was no significant effect of the VEGF promoter SNPs rs699947 (OR 1.05 [95% CI 0.95-1.17]); rs1570360 (OR 1.17 [95% CI 0.99-1.26]); and rs2010963 (OR 1.04 [95% CI 0.93-1.16]) on the risk of CVM in 1341 cases. Mutation screening of 93 TOF cases revealed no VEGF coding sequence variants and no changes at splice consensus sequences. Genetic variation in VEGF appears to play a small role, if any, in outflow tract CVM susceptibility.
Tiger hair morphology and its variations for wildlife forensic investigation
Directory of Open Access Journals (Sweden)
Thitika Kitpipit
2013-11-01
The tiger population has dramatically decreased due to illegal consumption and commercialisation of tiger body parts. Frequently, hair samples are the only evidence found at the crime scene. Thus, they play an important role in species identification for wildlife forensic investigation. In this study, we provide the first in-depth report on a variety of qualitative and quantitative characteristics of tiger guard hairs (24 hairs per individual from four individuals). The proposed method could reduce subjectivity of expert opinions on species identification based on hair morphology. Variations in 23 hair morphological characteristics were quantified at three levels: hair section, body region, and intra-species. The results indicate statistically significant variations in most morphological characteristics at all levels. Intra-species variations of four variables, namely hair length, hair index, scale separation and scale pattern, were low. Therefore, identification of tiger hairs using these multiple features in combination with other characteristics with high inter-species variations (e.g. medulla type) should bring about objective and accurate tiger hair identification. The method used should serve as a guideline and be further applied to other species to establish a wildlife hair morphology database. Statistical models could then be constructed to distinguish species and provide evidential values in terms of likelihood ratios.
Cultivar and year-to-year variation of phytosterol content in rye (Secale cereale L.)
DEFF Research Database (Denmark)
Zangenberg, M.; Hansen, H.B.; Jørgensen, J.R.
2004-01-01
on phytosterol content in the different cultivars. The studied cultivars had all the lowest phytosterol contents in the dry and warm harvest season of 1999. Although there were statistically significant cultivar and year-to-year variations in the sterol composition (p
Solar cycle variations of geocoronal balmer α emission
International Nuclear Information System (INIS)
Nossal, S.; Reynolds, R.J.; Roesler, F.L.; Scherb, F.
1993-01-01
Observations of the geocoronal Balmer α nightglow have been made from Wisconsin for more than a solar cycle with an internally consistent intensity reference to standard astronomical nebulae. These measurements were made with a double etalon, pressure-scanned, 15-cm aperture Fabry-Perot interferometer. The resulting long-term data set provides an opportunity to examine solar cycle influence on the mid-latitude exosphere and to address accompanying questions concerning the degree to which the exosphere is locally static or changing. The exospheric Balmer α absolute intensity measurements reported here show no statistically significant variations throughout the solar cycle when the variation with viewing geometry is removed by normalizing the data to reference exospheric model predictions by Anderson et al. However, the relative intensity dependence on solar depression angle does show a solar cycle variation. This variation suggests a possible related variation in the exospheric hydrogen density profile, although other interpretations are also possible. The results suggest that additional well-calibrated data taken over a longer time span could probe low-amplitude variations over the solar cycle and test predictions of a slow monotonic increase in exospheric hydrogen arising from greenhouse gases. 21 refs., 9 figs., 2 tabs
Sources of Variation in the Age Composition of Sandeel Landings
DEFF Research Database (Denmark)
Kvist, Trine; Gislason, Hannes; Thyregod, Poul
2001-01-01
in the samples is significantly lower at the start and end of the fishing season. This suggests that the older sandeel are available to the fishery for a shorter time period than the 1-group. Significant differences are found in the age composition between the four laboratories involved in the age determination......The variation of the age composition of the landings of lesser sandeel in the Danish industrial fishery in the North Sea over the period from 1984-1993 is analysed by continuation-ratio logits and generalised linear models. The analysis takes the multinomial characteristics of the age composition....... Although the variation between ICES statistical rectangles is substantial there is a significant difference between the age composition in the northern and southern part of the North Sea. However, only one of the three finer geographical stratifications proposed to improve the assessment results...
Hashim, Muhammad Jawad
2010-09-01
Post-hoc secondary data analysis with no prespecified hypotheses has been discouraged by textbook authors and journal editors alike. Unfortunately no single term describes this phenomenon succinctly. I would like to coin the term "sigsearch" to define this practice and bring it within the teaching lexicon of statistics courses. Sigsearch would include any unplanned, post-hoc search for statistical significance using multiple comparisons of subgroups. It would also include data analysis with outcomes other than the prespecified primary outcome measure of a study as well as secondary data analyses of earlier research.
Directory of Open Access Journals (Sweden)
Carmen Ródenas
2013-01-01
The Spanish Institute for National Statistics (INE) has decided to create new Migration Statistics (Estadística de Migraciones) based upon Residential Variation Statistics (Estadística de Variaciones Residenciales). This article presents arguments to support this decision, in view of the continued lack of consistency found among the sources of the Spanish statistics system for measuring population mobility. Specifically, an insight is provided into the problems of underestimation and internal inconsistency in the Spanish Labour Force Survey when measuring immigration rates, based upon discrepancies identified in the three international immigration flow series produced by this survey.
Bayesian statistics an introduction
Lee, Peter M
2012-01-01
Bayesian Statistics is the school of thought that combines prior beliefs with the likelihood of a hypothesis to arrive at posterior beliefs. The first edition of Peter Lee’s book appeared in 1989, but the subject has moved ever onwards, with increasing emphasis on Monte Carlo based techniques. This new fourth edition looks at recent techniques such as variational methods, Bayesian importance sampling, approximate Bayesian computation and Reversible Jump Markov Chain Monte Carlo (RJMCMC), providing a concise account of the way in which the Bayesian approach to statistics develops as wel
Statistical mechanics of superconductivity
Kita, Takafumi
2015-01-01
This book provides a theoretical, step-by-step comprehensive explanation of superconductivity for undergraduate and graduate students who have completed elementary courses on thermodynamics and quantum mechanics. To this end, it adopts the unique approach of starting with the statistical mechanics of quantum ideal gases and successively adding and clarifying elements and techniques indispensable for understanding it. They include the spin-statistics theorem, second quantization, density matrices, the Bloch–De Dominicis theorem, the variational principle in statistical mechanics, attractive interaction, and bound states. Ample examples of their usage are also provided in terms of topics from advanced statistical mechanics such as two-particle correlations of quantum ideal gases, derivation of the Hartree–Fock equations, and Landau’s Fermi-liquid theory, among others. With these preliminaries, the fundamental mean-field equations of superconductivity are derived with maximum mathematical clarity based on ...
Application of descriptive statistics in analysis of experimental data
Mirilović Milorad; Pejin Ivana
2008-01-01
Statistics today represent a group of scientific methods for the quantitative and qualitative investigation of variations in mass appearances. In fact, statistics present a group of methods that are used for the accumulation, analysis, presentation and interpretation of data necessary for reaching certain conclusions. Statistical analysis is divided into descriptive statistical analysis and inferential statistics. The values which represent the results of an experiment, and which are the subj...
Directory of Open Access Journals (Sweden)
E. A. Tatokchin
2017-01-01
The development of modern educational technologies, driven by the broad introduction of computer testing and distance learning, calls for a revision of the methods used to examine students. This work shows the need to move to mathematical criteria for assessing knowledge that are free of subjectivity. The article reviews the problems that arise in carrying out this task and proposes approaches to solving them. Most attention is given to the problem of objectively transforming the expert's rating estimates onto the student's scale estimates. The discussion concludes that the solution to this problem lies in creating specialized intelligent systems. The intelligent system is built on a mathematical model of a self-organizing, nonequilibrium dissipative system, which here is a group of students. The article assumes that dissipation is provided by the constant influx of new test items from the expert, and nonequilibrium by the individual psychological characteristics of the students in the group. As a result, the system should self-organize into stable patterns. Such patterns make it possible, given large amounts of data, to obtain a statistically significant assessment of a student. To justify the proposed approach, the work presents a statistical analysis of test results from a large sample of students (> 90). The conclusions of this analysis made it possible to develop an intelligent system for statistically significant assessment of student performance. It is based on a data clustering algorithm (k-means) applied to three key parameters. This approach is shown to make a dynamic and objective expert evaluation possible.
International Nuclear Information System (INIS)
Hightower, J.H. III
1994-01-01
Objectives of this field experiment were: (1) determine whether there was a statistically significant difference between the radon concentrations of samples collected by EPA's standard method, using a syringe, and an alternative, slow-flow method; (2) determine whether there was a statistically significant difference between the measured radon concentrations of samples mailed vs samples not mailed; and (3) determine whether there was a temporal variation of water radon concentration over a 7-month period. The field experiment was conducted at 9 sites, 5 private wells, and 4 public wells, at various locations in North Carolina. Results showed that a syringe is not necessary for sample collection, there was generally no significant radon loss due to mailing samples, and there was statistically significant evidence of temporal variations in water radon concentrations
Rules of parameter variation in homotype series of birdsong can indicate a 'sollwert' significance.
Hultsch, H; Todt, D
1996-11-01
Various bird species produce songs which include homotype pattern series, i.e. segments composed of a number of repeated vocal units. We compared such units and analyzed the variation of their parameters, especially in the time and the frequency domain. In addition, we examined whether and how serial changes of both the range and the trend of variation were related to song constituents following the repetitions. Data evaluation showed that variation of specific serial parameters (e.g., unit pitch or unit duration) occurring in the whistle song-types of nightingales (Luscinia megarhynchos) were converging towards a distinct terminal value. Although song-types differed in this terminal value, it was found to play the role of a key cue ('sollwert'). The continuation of a song depended on a preceding attainment of its specific 'sollwert'. Our results suggest that the study of signal parameters and rules of their variations make a useful tool for the behavioral access to the properties of the control systems mediating serial signal performances.
Nam, Sungsik
2011-08-01
Spread spectrum receivers with generalized selection combining (GSC) RAKE reception were proposed and have been studied as alternatives to the classical two fundamental schemes: maximal ratio combining and selection combining because the number of diversity paths increases with the transmission bandwidth. Previous work on performance analyses of GSC RAKE receivers based on the signal to noise ratio focused on the development of methodologies to derive exact closed-form expressions for various performance measures. However, some open problems related to the performance evaluation of GSC RAKE receivers still remain to be solved such as the exact performance analysis of the capture probability and an exact assessment of the impact of self-interference on GSC RAKE receivers. The major difficulty in these problems is to derive some joint statistics of ordered exponential variates. With this motivation in mind, we capitalize in this paper on some new order statistics results to derive exact closed-form expressions for the capture probability and outage probability of GSC RAKE receivers subject to self-interference over independent and identically distributed Rayleigh fading channels, and compare it to that of partial RAKE receivers. © 2011 IEEE.
Directory of Open Access Journals (Sweden)
Sadreyev Ruslan I
2004-08-01
Background: Profile-based analysis of multiple sequence alignments (MSA) allows for accurate comparison of protein families. Here, we address the problems of detecting statistically confident dissimilarities between (1) an MSA position and a set of predicted residue frequencies, and (2) two MSA positions. These problems are important for (i) evaluation and optimization of methods predicting residue occurrence at protein positions; (ii) detection of potentially misaligned regions in automatically produced alignments and their further refinement; and (iii) detection of sites that determine functional or structural specificity in two related families. Results: For problems (1) and (2), we propose analytical estimates of P-value and apply them to the detection of significant positional dissimilarities in various experimental situations. (a) We compare structure-based predictions of residue propensities at a protein position to the actual residue frequencies in the MSA of homologs. (b) We evaluate our method by the ability to detect erroneous position matches produced by an automatic sequence aligner. (c) We compare MSA positions that correspond to residues aligned by automatic structure aligners. (d) We compare MSA positions that are aligned by high-quality manual superposition of structures. Detected dissimilarities reveal shortcomings of the automatic methods for residue frequency prediction and alignment construction. For the high-quality structural alignments, the dissimilarities suggest sites of potential functional or structural importance. Conclusion: The proposed computational method is of significant potential value for the analysis of protein families.
Statistical inference for financial engineering
Taniguchi, Masanobu; Ogata, Hiroaki; Taniai, Hiroyuki
2014-01-01
This monograph provides the fundamentals of statistical inference for financial engineering and covers some selected methods suitable for analyzing financial time series data. In order to describe the actual financial data, various stochastic processes, e.g. non-Gaussian linear processes, non-linear processes, long-memory processes, locally stationary processes etc. are introduced and their optimal estimation is considered as well. This book also includes several statistical approaches, e.g., discriminant analysis, the empirical likelihood method, control variate method, quantile regression, realized volatility etc., which have been recently developed and are considered to be powerful tools for analyzing the financial data, establishing a new bridge between time series and financial engineering. This book is well suited as a professional reference book on finance, statistics and statistical financial engineering. Readers are expected to have an undergraduate-level knowledge of statistics.
Fang, Yongxiang; Wit, Ernst
2008-01-01
Fisher’s combined probability test is the most commonly used method to test the overall significance of a set of independent p-values. However, it is obvious that Fisher’s statistic is more sensitive to smaller p-values than to larger p-values and a small p-value may overrule the other p-values
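The sensitivity to a single small p-value can be seen in a short worked example. This is an illustrative calculation with made-up p-values, not data from the paper; it uses the standard form of Fisher's statistic and the closed-form chi-square tail for even degrees of freedom, and assumes the k p-values are independent.

```latex
% Fisher's statistic for k independent p-values:
X = -2 \sum_{i=1}^{k} \ln p_i \;\sim\; \chi^2_{2k} \quad \text{under the global null.}

% Tail probability for even degrees of freedom 2k:
\Pr\left(\chi^2_{2k} \ge x\right) = e^{-x/2} \sum_{i=0}^{k-1} \frac{(x/2)^i}{i!}

% Hypothetical example: p = (0.001,\ 0.5,\ 0.5,\ 0.5),\ k = 4
X/2 = -\left(\ln 0.001 + 3 \ln 0.5\right) \approx 6.908 + 2.079 = 8.987
\Pr\left(\chi^2_{8} \ge 17.974\right)
  \approx 1.25 \times 10^{-4} \left(1 + 8.987 + 40.385 + 120.982\right)
  \approx 0.021
```

Three of the four p-values are 0.5, yet the combined result falls below 0.05 because the single value 0.001 contributes about 77% of the statistic.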
Pitch, roll, and yaw variations in patient positioning
International Nuclear Information System (INIS)
Kaiser, Adeel; Schultheiss, Timothy E.; Wong, Jeffrey Y.C.; Smith, David D.; Han, Chunhui; Vora, Nayana L.; Pezner, Richard D.; Chen Yijen; Radany, Eric H.
2006-01-01
Purpose: To use pretreatment megavoltage-computed tomography (MVCT) scans to evaluate positioning variations in pitch, roll, and yaw for patients treated with helical tomotherapy. Methods and Materials: Twenty prostate and 15 head-and-neck cancer patients were selected. Pretreatment MVCT scans were performed before every treatment fraction and automatically registered to planning kilovoltage CT (KVCT) scans by bony landmarks. Image registration data were used to adjust patient setups before treatment. Corrections for pitch, roll, and yaw were recorded after bone registration, and data from fractions 1-5 and 16-20 were used to analyze mean rotational corrections. Results: For prostate patients, the means and standard deviations (in degrees) for pitch, roll, and yaw corrections were -0.60 ± 1.42, 0.66 ± 1.22, and -0.33 ± 0.83. In head-and-neck patients, the means and standard deviations (in degrees) were -0.24 ± 1.19, -0.12 ± 1.53, and 0.25 ± 1.42 for pitch, roll, and yaw, respectively. No significant difference in rotational variations was observed between Weeks 1 and 4 of treatment. Head-and-neck patients had significantly smaller pitch variation, but significantly larger yaw variation, than prostate patients. No difference was found in roll corrections between the two groups. Overall, 96.6% of the rotational corrections were less than 4 deg. Conclusions: The initial rotational setup errors for prostate and head-and-neck patients were all small in magnitude, statistically significant, but did not vary considerably during the course of radiotherapy. The data are relevant to couch hardware design for correcting rotational setup variations. There should be no theoretical difference between these data and data collected using cone beam KVCT on conventional linacs
Statistical analysis of geomagnetic field variations during solar eclipses
Kim, Jung-Hee; Chang, Heon-Young
2018-04-01
We investigate the geomagnetic field variations recorded by INTERMAGNET geomagnetic observatories while the Moon's umbra or penumbra passed over them during a solar eclipse event. Though it is generally considered that the geomagnetic field can be modulated during solar eclipses, the effect of the solar eclipse on the observed geomagnetic field has proved too subtle to detect reliably. Instead of exploring the geomagnetic field as a case study, we analyze 207 geomagnetic manifestations acquired by 100 geomagnetic observatories during 39 solar eclipses occurring from 1991 to 2016. As a result of examining a pattern of the geomagnetic field variation on average, we confirm that the effect can be seen over an interval of 180 min centered at the time of maximum eclipse on a site of a geomagnetic observatory. That is, we demonstrate an increase in the Y component of the geomagnetic field and decreases in the X component and the total strength of the geomagnetic field. We also find that the effect can be overwhelmed, depending more sensitively on the level of daily geomagnetic events than on the level of solar activity and/or the phase of solar cycle. We have demonstrated it by dividing the whole data set into subsets based on parameters of the geomagnetic field, solar activity, and solar eclipses. It is suggested, therefore, that evidence of the solar eclipse effect can be revealed even at the solar maximum, as long as the day of the solar eclipse is magnetically quiet.
Genome-wide associations of gene expression variation in humans.
Directory of Open Access Journals (Sweden)
Barbara E Stranger
2005-12-01
The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs) with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis-) to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I) HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.
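Two of the multiple test corrections named above, Bonferroni and false discovery rate, can be sketched in a few lines. This is a minimal illustration under stated assumptions: the false discovery rate step uses the Benjamini-Hochberg step-up procedure, which the abstract does not name explicitly, the function names and p-values are hypothetical, and the permutation approach is not covered.

```python
def bonferroni(pvalues, alpha=0.05):
    """Reject hypothesis i when p_i <= alpha / m (controls family-wise error)."""
    m = len(pvalues)
    return [p <= alpha / m for p in pvalues]


def benjamini_hochberg(pvalues, alpha=0.05):
    """Benjamini-Hochberg step-up procedure (controls false discovery rate).

    Sort the p-values, find the largest rank k with p_(k) <= alpha * k / m,
    and reject the k hypotheses with the smallest p-values.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])

    k = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= alpha * rank / m:
            k = rank  # keep the largest rank that passes its threshold

    rejected = [False] * m
    for idx in order[:k]:
        rejected[idx] = True
    return rejected


if __name__ == "__main__":
    # Made-up p-values for eight SNP-expression tests.
    pvals = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205]
    print(sum(bonferroni(pvals)), "rejected by Bonferroni")
    print(sum(benjamini_hochberg(pvals)), "rejected by Benjamini-Hochberg")
```

Bonferroni applies the same threshold to every test, while the step-up rule lets the threshold grow with rank, so it rejects at least as many hypotheses as Bonferroni at the same alpha.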
Body size and allometric variation in facial shape in children.
Larson, Jacinda R; Manyama, Mange F; Cole, Joanne B; Gonzalez, Paula N; Percival, Christopher J; Liberton, Denise K; Ferrara, Tracey M; Riccardi, Sheri L; Kimwaga, Emmanuel A; Mathayo, Joshua; Spitzmacher, Jared A; Rolian, Campbell; Jamniczky, Heather A; Weinberg, Seth M; Roseman, Charles C; Klein, Ophir; Lukowiak, Ken; Spritz, Richard A; Hallgrimsson, Benedikt
2018-02-01
Morphological integration, or the tendency for covariation, is commonly seen in complex traits such as the human face. The effects of growth on shape, or allometry, represent a ubiquitous but poorly understood axis of integration. We address the question of to what extent age and measures of size converge on a single pattern of allometry for human facial shape. Our study is based on two large cross-sectional cohorts of children, one from Tanzania and the other from the United States (N = 7,173). We employ 3D facial imaging and geometric morphometrics to relate facial shape to age and anthropometric measures. The two populations differ significantly in facial shape, but the magnitude of this difference is small relative to the variation within each group. Allometric variation for facial shape is similar in both populations, representing a small but significant proportion of total variation in facial shape. Different measures of size are associated with overlapping but statistically distinct aspects of shape variation. Only half of the size-related variation in facial shape can be explained by the first principal component of four size measures and age while the remainder associates distinctly with individual measures. Allometric variation in the human face is complex and should not be regarded as a singular effect. This finding has important implications for how size is treated in studies of human facial shape and for the developmental basis for allometric variation more generally. © 2017 Wiley Periodicals, Inc.
International Nuclear Information System (INIS)
Mendes, L.M.M.; Pereira, W.B.R.; Vieira, J.G.; Lamounier, C.S.; Gonçalves, D.A.; Carvalho, G.N.P.; Santana, P.C.; Oliveira, P.M.C.; Reis, L.P.
2017-01-01
Advances in the computed tomography equipment used in diagnostic practice have directly influenced the radiation levels delivered to the patient. It is essential to optimize the techniques employed in order to comply with the ALARA (As Low As Reasonably Achievable) principle of radiation protection. The relationship between ASIR (Adaptive Statistical Iterative Reconstruction) and image noise was studied. Central images of a homogeneous water phantom were obtained in a 20 mm scan using a 64-channel General Electric LightSpeed VCT scanner in helical acquisitions with a rotation time of 0.5 seconds, a pitch of 0.984:1, and a slice thickness of 0.625 mm. These parameters were held constant while the voltage was set to two distinct values, 120 and 140 kV, with the tube current set by automatic exposure control (AEC) and ranging from 50 to 675 mA (120 kV) and from 50 to 610 mA (140 kV), the minimum and maximum values allowed for each voltage. Image noise was determined using the free software ImageJ. The analysis compared the percentage variation in image noise relative to the ASIR value of 10% and concluded that there is a variation of approximately 50% when compared with ASIR 100% at both voltages. Dose evaluation is required in future studies to better exploit the relationship between dose and image quality.
Statistical Description of Segregation in a Powder Mixture
DEFF Research Database (Denmark)
Chapiro, Alexander; Stenby, Erling Halfdan
1996-01-01
In this paper we apply the statistical mechanics of powders to describe a segregated state in a mixture of grains of different sizes. Variation of the density of a packing with depth arising due to changes of particle configurations is studied. The statistical mechanics of powders is generalized...
Longitudinal Variations in the Variability of Spread F Occurrence
Groves, K. M.; Bridgwood, C.; Carrano, C. S.
2017-12-01
The complex dynamics of the equatorial ionosphere have attracted the interest and attention of researchers for many decades. The relatively local processes that give rise to large meridional gradients have been well documented and the associated terminology has entered the common lexicon of ionospheric research (e.g., fountain effect, equatorial anomaly, bubbles, Spread F). Zonal variations have also been noted, principally at the level of determining longitudinal differences in seasonal activity patterns. Due to a historical lack of high resolution ground-based observations at low latitudes, the primary source of data for such analyses has been space-based observations from satellites such as ROCSAT, DMSP and C/NOFS that measure in situ electron density variations. An important longitudinal variation in electron density structure associated with non-migrating diurnal tides was discovered by Immel et al. in 2006 using data from the FUV sensor aboard the NASA IMAGE satellite. These satellite observations have been very helpful in identifying the structural characteristics of the equatorial ionosphere and the occurrence of Spread F, but they provide little insight into variations in scintillation features and potential differences in bubble development characteristics. Moreover, space-based studies tend towards the statistics of occurrence frequency over periods of weeks to months. A recent analysis of daily Spread F occurrence as determined by low latitude VHF scintillation activity shows statistical results that are consistent with previous space-based observations, but the level of variability in the occurrence data shows marked variations with longitude. For example, the American sector shows very low in-season variability while the African and Asian sectors exhibit true day-to-day variability regardless of seasonal variations. The results have significant implications for space weather as they suggest that long-term forecasts of equatorial scintillation may be
Dai, Mingwei; Ming, Jingsi; Cai, Mingxuan; Liu, Jin; Yang, Can; Wan, Xiang; Xu, Zongben
2017-09-15
Results from genome-wide association studies (GWAS) suggest that a complex phenotype is often affected by many variants with small effects, known as 'polygenicity'. Tens of thousands of samples are often required to ensure statistical power of identifying these variants with small effects. However, it is often the case that a research group can only get approval for the access to individual-level genotype data with a limited sample size (e.g. a few hundreds or thousands). Meanwhile, summary statistics generated using single-variant-based analysis are becoming publicly available. The sample sizes associated with the summary statistics datasets are usually quite large. How to make the most efficient use of existing abundant data resources largely remains an open question. In this study, we propose a statistical approach, IGESS, to increase the statistical power of identifying risk variants and improve the accuracy of risk prediction by integrating individual-level genotype data and summary statistics. An efficient algorithm based on variational inference is developed to handle the genome-wide analysis. Through comprehensive simulation studies, we demonstrated the advantages of IGESS over the methods which take either individual-level data or summary statistics data as input. We applied IGESS to perform integrative analysis of Crohn's Disease from WTCCC and summary statistics from other studies. IGESS was able to significantly increase the statistical power of identifying risk variants and improve the risk prediction accuracy from 63.2% (±0.4%) to 69.4% (±0.1%) using about 240 000 variants. The IGESS software is available at https://github.com/daviddaigithub/IGESS. Contact: zbxu@xjtu.edu.cn or xwan@comp.hkbu.edu.hk or eeyang@hkbu.edu.hk. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press.
Bee, Mark A
2004-12-01
Acoustic signals provide a basis for social recognition in a wide range of animals. Few studies, however, have attempted to relate the patterns of individual variation in signals to behavioral discrimination thresholds used by receivers to discriminate among individuals. North American bullfrogs (Rana catesbeiana) discriminate among familiar and unfamiliar individuals based on individual variation in advertisement calls. The sources, patterns, and magnitudes of variation in eight acoustic properties of multiple-note advertisement calls were examined to understand how patterns of within-individual variation might either constrain, or provide additional cues for, vocal recognition. Six of eight acoustic properties exhibited significant note-to-note variation within multiple-note calls. Despite this source of within-individual variation, all call properties varied significantly among individuals, and multivariate analyses indicated that call notes were individually distinct. Fine-temporal and spectral call properties exhibited less within-individual variation compared to gross-temporal properties and contributed most toward statistically distinguishing among individuals. Among-individual differences in the patterns of within-individual variation in some properties suggest that within-individual variation could also function as a recognition cue. The distributions of among-individual and within-individual differences were used to generate hypotheses about the expected behavioral discrimination thresholds of receivers.
Statistical analysis of the W Cyg light curve
International Nuclear Information System (INIS)
Klyus, I.A.
1983-01-01
A statistical analysis of the light curve of W Cygni has been carried out. The process of brightness variations of the star is shown to be a stationary stochastic one. The hypothesis of stationarity of the process was checked at the significance level of α=0.05. Oscillations of the brightness with average durations of 131 and 250 days have been found. It is proved that the oscillations are narrow-band noise, i.e. cycles. Peaks in the power spectrum corresponding to these cycles exceed the 99% confidence interval. It has been stated that the oscillations are independent.
Variation in the hematocrit of the rat after gamma irradiation
International Nuclear Information System (INIS)
Marble, G.; Breuil, L.; Berthelet, J.
1966-01-01
Statistical analysis of hematocrit measurement results has been carried out for twenty batches of rats with a view to studying the variation in the hematocrit as a function of the irradiation dose during the first four days after this irradiation. A significant increase in the hematocrit has been observed on the third day for 750 and 1000 rads and on the fourth day for 1000 rads only. (authors)
Statistical methods in spatial genetics
DEFF Research Database (Denmark)
Guillot, Gilles; Leblois, Raphael; Coulon, Aurelie
2009-01-01
The joint analysis of spatial and genetic data is rapidly becoming the norm in population genetics. More and more studies explicitly describe and quantify the spatial organization of genetic variation and try to relate it to underlying ecological processes. As it has become increasingly difficult...... to keep abreast with the latest methodological developments, we review the statistical toolbox available to analyse population genetic data in a spatially explicit framework. We mostly focus on statistical concepts but also discuss practical aspects of the analytical methods, highlighting not only...
Directory of Open Access Journals (Sweden)
A. D. Love
2010-01-01
Raw materials used in cement manufacturing normally have varying chemical compositions and require regular analyses for plant control purposes. This is achieved by using several analytical instruments, such as XRF and ICP. The values obtained for the major elements Ca, Si, Fe and Al are used to calculate the plant control parameters Lime Saturation Factor (LSF), Silica Ratio (SR) and Alumina Modulus (AM). These plant control parameters are used to regulate the mixing and blending of various raw meal components and to operate the plant optimally. Any errors and large fluctuations in these plant parameters not only influence the quality of the cement produced, but also have a major effect on the cost of production of cement clinker through their influence on the energy consumption and residence time in the kiln. This paper looks at the role that statistical variances in the analytical measurements of the major elements Ca, Si, Fe and Al can have on the ultimate LSF, SR and AM values calculated from these measurements. The influence of too high and too low values of the LSF, SR and AM on clinker quality and energy consumption is discussed, and acceptable variances in these three parameters, based on plant experiences, are established. The effect of variances in the LSF, SR and AM parameters on the production costs is then analysed, and it is shown that variations of as large as 30% and as little as 5% can potentially occur. The LSF calculation incorporates most chemical elements and therefore is prone to the largest number of variations due to statistical variances in the analytical determinations of the chemical elements. Despite all these variations in LSF values, they actually produced the smallest influence on the production cost of the clinker. It is therefore concluded that the LSF value is the most practical parameter for plant control purposes.
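The abstract above does not state how LSF, SR and AM are computed. As a hedged sketch, the following uses the commonly quoted textbook definitions in terms of oxide weight percentages (an assumption, since the paper may use variants), and shows how a small analytical error in one oxide propagates to each parameter. The composition values are made up for illustration.

```python
def lime_saturation_factor(cao, sio2, al2o3, fe2o3):
    """LSF in percent, using the widely quoted 2.8 / 1.2 / 0.65 coefficients (assumed)."""
    return 100.0 * cao / (2.8 * sio2 + 1.2 * al2o3 + 0.65 * fe2o3)


def silica_ratio(sio2, al2o3, fe2o3):
    """SR = SiO2 / (Al2O3 + Fe2O3)."""
    return sio2 / (al2o3 + fe2o3)


def alumina_modulus(al2o3, fe2o3):
    """AM = Al2O3 / Fe2O3."""
    return al2o3 / fe2o3


# Hypothetical oxide analysis in weight percent (not taken from the paper).
base = {"cao": 65.0, "sio2": 21.0, "al2o3": 5.5, "fe2o3": 3.0}

# Same analysis with a +1% relative error on the SiO2 determination.
perturbed = dict(base, sio2=base["sio2"] * 1.01)

for label, comp in (("base", base), ("SiO2 +1%", perturbed)):
    lsf = lime_saturation_factor(**comp)
    sr = silica_ratio(comp["sio2"], comp["al2o3"], comp["fe2o3"])
    am = alumina_modulus(comp["al2o3"], comp["fe2o3"])
    print(f"{label}: LSF={lsf:.2f} SR={sr:.3f} AM={am:.3f}")
```

Because LSF combines all four oxides, an error in any one of them shifts it, whereas AM is unaffected by errors in CaO or SiO2; this is consistent with the abstract's remark that LSF is exposed to the largest number of variations.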
Higher-Order Moment Characterisation of Rogue Wave Statistics in Supercontinuum Generation
DEFF Research Database (Denmark)
Sørensen, Simon Toft; Bang, Ole; Wetzel, Benjamin
2012-01-01
The noise characteristics of supercontinuum generation are characterized using higher-order statistical moments. Measures of skew and kurtosis, and the coefficient of variation allow quantitative identification of spectral regions dominated by rogue-wave-like behaviour.
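To make the three measures named in the abstract concrete, here is a minimal sketch that computes the coefficient of variation, skewness and kurtosis of a list of samples. It assumes population (biased) central moments and non-excess kurtosis, so a Gaussian would give a kurtosis of 3; the paper may use other conventions, and the sample values are invented.

```python
import math


def moment_summary(samples):
    """Coefficient of variation, skewness and (non-excess) kurtosis of samples."""
    n = len(samples)
    mean = sum(samples) / n

    def central_moment(order):
        # Population central moment of the given order.
        return sum((x - mean) ** order for x in samples) / n

    m2 = central_moment(2)
    m3 = central_moment(3)
    m4 = central_moment(4)
    return {
        "cv": math.sqrt(m2) / mean,
        "skewness": m3 / m2 ** 1.5,
        "kurtosis": m4 / m2 ** 2,
    }


# Hypothetical shot-to-shot intensities at one wavelength: mostly small values
# with one rare, very large event, the kind of long tail described as rogue-wave-like.
intensities = [1.0, 1.1, 0.9, 1.0, 1.2, 0.8, 1.0, 9.0]
print(moment_summary(intensities))
```

A long-tailed sample such as this one yields a large positive skewness and a kurtosis well above that of a symmetric sample, which is the signature the abstract uses to flag spectral regions.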
Statistical considerations of graphite strength for assessing design allowable stresses
International Nuclear Information System (INIS)
Ishihara, M.; Mogi, H.; Ioka, I.; Arai, T.; Oku, T.
1987-01-01
Several aspects of statistics need to be considered to determine design allowable stresses for graphite structures. These include: 1) statistical variation of graphite material strength; 2) uncertainty of calculated stress; 3) reliability (survival probability) required from the operational and safety performance of graphite structures. This paper deals with some statistical considerations of structural graphite for assessing design allowable stress. Firstly, probability distribution functions of tensile and compressive strengths are investigated for experimental Very High Temperature candidate graphites. Normal, logarithmic normal and Weibull distribution functions are compared in terms of the coefficient of correlation to measured strength data. This leads to the adoption of the normal distribution function. Then, the relation between factor of safety and fracture probability is discussed with respect to the following items: 1) As graphite strength is more variable than the strength of metallic materials, the effect of strength variation on the fracture probability is evaluated. 2) Fracture probability depending on a survival probability of 99 to 99.9% with a confidence level of 90 to 95% is discussed. 3) As the material properties used in the design analysis are usually the mean values of their variation, the additional effect of these variations on the fracture probability is discussed. Finally, the way to assure the minimum ultimate strength with the required survival probability and confidence level is discussed in view of statistical treatment of the strength data from varying sample numbers in a material acceptance test. (author)
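The relation between factor of safety and fracture probability mentioned above can be illustrated under strong simplifying assumptions. The sketch below assumes normally distributed strength (the distribution the abstract says was adopted), a deterministic applied stress equal to the mean strength divided by the safety factor, and no allowance for stress uncertainty or sample-size confidence levels, all of which the paper itself treats. The function name and numbers are illustrative, not taken from the paper.

```python
from statistics import NormalDist


def fracture_probability(safety_factor, strength_cv):
    """P(strength < applied stress) for normally distributed strength.

    Assumptions (not from the source): applied stress is exactly
    mean_strength / safety_factor, and strength_cv is the coefficient of
    variation (standard deviation / mean) of the strength.
    """
    # Standardised distance of the applied stress below the mean strength.
    z = (1.0 / safety_factor - 1.0) / strength_cv
    return NormalDist().cdf(z)


# Compare a low-scatter material with a more variable one.
for cv in (0.05, 0.15):
    for sf in (1.5, 2.0, 3.0):
        print(f"CV={cv:.2f} SF={sf:.1f} P_f={fracture_probability(sf, cv):.3e}")
```

Under these assumptions, the same safety factor gives a much larger fracture probability for the more variable material, which is the qualitative point of item 1 in the abstract.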
Towards Structural Analysis of Audio Recordings in the Presence of Musical Variations
Directory of Open Access Journals (Sweden)
Müller Meinard
2007-01-01
One major goal of structural analysis of an audio recording is to automatically extract the repetitive structure or, more generally, the musical form of the underlying piece of music. Recent approaches to this problem work well for music where the repetitions largely agree with respect to instrumentation and tempo, as is typically the case for popular music. For other classes of music such as Western classical music, however, musically similar audio segments may exhibit significant variations in parameters such as dynamics, timbre, execution of note groups, modulation, articulation, and tempo progression. In this paper, we propose a robust and efficient algorithm for audio structure analysis, which makes it possible to identify musically similar segments even in the presence of large variations in these parameters. To account for such variations, our main idea is to incorporate invariance at various levels simultaneously: we design a new type of statistical features to absorb microvariations, introduce an enhanced local distance measure to account for local variations, and describe a new strategy for structure extraction that can cope with the global variations. Our experimental results with classical and popular music show that our algorithm performs successfully even in the presence of significant musical variations.
Induction of micronuclei in hemocytes of Mytilus edulis and statistical analysis
DEFF Research Database (Denmark)
Wrisberg, M. N.; Bilbo, Carl M.; Spliid, Henrik
1992-01-01
biological variation, emphasizing the importance of application of a correct statistical method. A systematic approach to the statistical evaluation of the mussel MN test is outlined. The statistical model includes three different situations: (a) estimation of parameters of a single sample, (b) estimation...
Siegal, B. S.; Short, N. M.
1977-01-01
The significance of operator variation and the angle of illumination in acquired imagery is analyzed for lineament analysis. Five operators analyzed a LANDSAT image and four photographs of a plastic relief map illuminated at a low angle from varying directions of the Prescott, Arizona region. Significant differences were found in both number and length of the lineaments recognized by the different investigators for the images. The actual coincidence of lineaments recognized by the investigators for the same image is exceptionally low. Even the directional data on lineament orientation is significantly different from operator to operator and from image to image. Cluster analysis of the orientation data displays a clustering by operators rather than by images. It is recommended that extreme caution be taken before attempting to compare different investigators' results in lineament analysis.
Directory of Open Access Journals (Sweden)
Abhishek Singh Nayyar
2013-07-01
Background: The aim of this study was to measure the concentrations (levels) of serum total proteins and advanced oxidation protein products as markers of oxidant-mediated protein damage in the sera of patients with oral cancers. Methods: The study consisted of the sera analyses of serum total protein and advanced oxidation protein products' levels in 30 age and sex matched controls, 60 patients with reported pre-cancerous lesions and/or conditions and 60 patients with histologically proven oral squamous cell carcinoma. One-way analyses of variance were used to test the difference between groups. To determine which of the two groups' means were significantly different, the post-hoc test of Bonferroni was used. The results were averaged as mean ± standard deviation. In the above test, P values less than 0.05 were taken to be statistically significant. The normality of data was checked before the statistical analysis was performed. Results: The study revealed statistically significant variations in serum levels of advanced oxidation protein products (P<0.001). Serum levels of total protein showed extensive variations; therefore the results were largely inconclusive and statistically insignificant. Conclusion: The results emphasize the need for more studies with larger sample sizes to be conducted before a conclusive role can be determined for sera levels of total protein and advanced oxidation protein products as markers both for diagnostic significance and the transition from the various oral pre-cancerous lesions and conditions into frank oral cancers.
Spatial and temporal snowpack variation in the crown of the continent ecosystem
Selkowitz, D.J.; Fagre, D.B.; Reardon, B.A.
2002-01-01
Snowpack-related ecosystem changes such as glacier recession and alpine treeline advance have been documented in the Crown of the Continent Ecosystem (CCE) over the course of the previous 150 years. Using data from the Natural Resource Conservation Service's SNOTEL sites and snow course surveys, we examined the spatial and temporal variation in snowpack in the region. SNOTEL data suggest CCE snowpacks are larger and more persistent than in most regions of the Western U.S., and that water year precipitation, rather than mean temperature, is the primary control on April 1 snow water equivalent (SWE). Snow course data indicate a statistically significant downward trend in mean April 1 SWE for the period 1950-2001 but no statistically significant trend in mean May 1 SWE for the longer period 1922-2001. Further analysis reveals that variations in both April 1 and May 1 mean SWE are closely tied to the Pacific Decadal Oscillation, an ENSO-like interdecadal pattern of Pacific Ocean climate variability. Despite no significant trend in mean May 1 SWE between 1922 and 2001, glaciers in Glacier National Park receded steadily during this period, implying changing climatic conditions crossed a threshold for glacier mass balance maintenance sometime between the Little Ice Age glacial maxima and 1922.
Are studies reporting significant results more likely to be published?
Koletsi, Despina; Karagianni, Anthi; Pandis, Nikolaos; Makou, Margarita; Polychronopoulou, Argy; Eliades, Theodore
2009-11-01
Our objective was to assess the hypothesis that there are variations of the proportion of articles reporting a significant effect, with a higher percentage of those articles published in journals with impact factors. The contents of 5 orthodontic journals (American Journal of Orthodontics and Dentofacial Orthopedics, Angle Orthodontist, European Journal of Orthodontics, Journal of Orthodontics, and Orthodontics and Craniofacial Research), published between 2004 and 2008, were hand-searched. Articles with statistical analysis of data were included in the study and classified into 4 categories: behavior and psychology, biomaterials and biomechanics, diagnostic procedures and treatment, and craniofacial growth, morphology, and genetics. In total, 2622 articles were examined, with 1785 included in the analysis. Univariate and multivariate logistic regression analyses were applied with statistical significance as the dependent variable, and whether the journal had an impact factor, the subject, and the year were the independent predictors. A higher percentage of articles showed significant results relative to those without significant associations (on average, 88% vs 12%) for those journals. Overall, these journals published significantly more studies with significant results, ranging from 75% to 90% (P = 0.02). Multivariate modeling showed that journals with impact factors had a 100% increased probability of publishing a statistically significant result compared with journals with no impact factor (odds ratio [OR], 1.99; 95% CI, 1.19-3.31). Compared with articles on biomaterials and biomechanics, all other subject categories showed lower probabilities of significant results. Nonsignificant findings in behavior and psychology and diagnosis and treatment were 1.8 (OR, 1.75; 95% CI, 1.51-2.67) and 3.5 (OR, 3.50; 95% CI, 2.27-5.37) times more likely to be published, respectively. Journals seem to prefer reporting significant results; this might be because of authors
Lack of seasonal variation in bone mass and biochemical estimates of bone turnover
International Nuclear Information System (INIS)
Overgaard, K.; Nilas, L.; Johansen, J.S.; Christiansen, C.
1988-01-01
Three previous studies have indicated a seasonal variation in bone mineral content, with values during the summer being 1.7% to 7.5% higher than during the winter. We have examined the seasonal influence on bone mass, biochemical estimates of bone turnover and vitamin D metabolites in 86 healthy women, aged 29-53 years. All participants were followed up for 2 years with examinations every 6 weeks or 3 months. Bone mineral content in the proximal and distal part of the forearm (single photon absorptiometry) did not reveal any significant seasonal variation, whereas bone mineral density of the lumbar spine (dual photon absorptiometry) indicated that the highest values occurred in winter. None of the biochemical parameters showed any statistically significant cyclical changes. Serum concentrations of 25-hydroxyvitamin D and 24,25-dihydroxyvitamin D3 showed a highly significant seasonal variation, whereas the serum 1,25-dihydroxyvitamin D concentration was virtually unchanged. We conclude that seasonal variation in bone mineral content and bone turnover should not be taken into account when interpreting data from longitudinal studies of healthy pre- and postmenopausal women on a sufficient vitamin D nutriture.
Directory of Open Access Journals (Sweden)
Leitner Dietmar
2005-04-01
Background: A reliable prediction of the Xaa-Pro peptide bond conformation would be a useful tool for many protein structure calculation methods. We have analyzed the Protein Data Bank and show that the combined use of sequential and structural information has a predictive value for the assessment of the cis versus trans peptide bond conformation of Xaa-Pro within proteins. For the analysis of the data sets, different statistical methods such as the calculation of the Chou-Fasman parameters and occurrence matrices were used. Furthermore, we analyzed the relationship between the relative solvent accessibility and the relative occurrence of prolines in the cis and in the trans conformation. Results: One of the main results of the statistical investigations is the ranking of the secondary structure and sequence information with respect to the prediction of the Xaa-Pro peptide bond conformation. We observed a significant impact of secondary structure information on the occurrence of the Xaa-Pro peptide bond conformation, while the sequence information of amino acids neighboring proline is of little predictive value for the conformation of this bond. Conclusion: In this work, we present an extensive analysis of the occurrence of the cis and trans proline conformation in proteins. Based on the data set, we derived patterns and rules for a possible prediction of the proline conformation. Upon adoption of the Chou-Fasman parameters, we are able to derive statistically relevant correlations between the secondary structure of amino acid fragments and the Xaa-Pro peptide bond conformation.
Exploring language variation across Europe
DEFF Research Database (Denmark)
Hovy, Dirk; Johannsen, Anders Trærup
2016-01-01
Language varies not only between countries, but also along regional and sociodemographic lines. This variation is one of the driving factors behind language change. However, investigating language variation is a complex undertaking: the more factors we want to consider, the more data we need. Tra... training in both variational linguistics and computational methods, a combination that is still not common. We take a first step here to alleviate the problem by providing an interface to explore large-scale language variation along several socio-demographic factors without programming knowledge. It makes use of large amounts of data and provides statistical analyses, maps, and interactive features that enable scholars to explore language variation in a data-driven way.
Statistical calculation of hot channel factors
International Nuclear Information System (INIS)
Farhadi, K.
2007-01-01
It is conventional practice in the design of nuclear reactors to introduce hot channel factors to allow for spatial variations of power generation and flow distribution. Consequently, it is not enough to be able to calculate the nominal temperature distributions of fuel element, cladding, coolant, and central fuel. Indeed, one must be able to calculate the probability that the imposed temperature or heat flux limits in the entire core are not exceeded. In this paper, statistical methods are used to calculate hot channel factors for the particular case of a heterogeneous Material Testing Reactor (MTR), and the results obtained from different statistical methods are compared. It is shown that among the statistical methods available, the semi-statistical method is the most reliable one.
Telesca, Luciano; Lovallo, Michele; Lopez, Carmen; Marti Molist, Joan
2016-03-01
A detailed statistical investigation of the seismicity that occurred at El Hierro volcano (Canary Islands) from 2011 to 2014 has been performed by analysing the time variation of four parameters: the Gutenberg-Richter b-value, the local coefficient of variation, the scaling exponent of the magnitude distribution and the main periodicity of the earthquake sequence calculated by using Schuster's test. These four parameters are good descriptors of the time and magnitude distributions of the seismic sequence, and their variation indicates dynamical changes in the volcanic system. These variations can be attributed to the causes and types of seismicity, thus making it possible to distinguish between different host-rock fracturing processes caused by intrusions of magma at different depths and overpressures. The statistical patterns observed among the studied unrest episodes and between them and the eruptive episode of 2011-2012 indicate that the response of the host rock to the deformation imposed by magma intrusion did not differ significantly from one episode to the other, thus suggesting that no significant local stress changes induced by magma intrusion occurred across the episodes. Therefore, although the studied unrest episodes were caused by intrusions of magma at different depths and locations below El Hierro island, the mechanical response of the lithosphere was similar in all cases. This suggests that the reason why the first unrest culminated in an eruption while the others did not may be related to the role of the regional/local tectonics acting at that moment, rather than to the force of the magma intrusion.
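Two of the four parameters named above have simple closed-form estimators, sketched here for illustration. The abstract does not say which estimators the authors used, so this assumes the Aki-Utsu maximum-likelihood b-value (with a half-bin correction) and the classical Schuster p-value exp(-R²/N); the catalogue values and function names are hypothetical.

```python
import math


def b_value(magnitudes, completeness_mag, bin_width=0.1):
    """Aki-Utsu maximum-likelihood estimate of the Gutenberg-Richter b-value.

    Assumption: magnitudes are binned with the given width, hence the
    half-bin correction applied to the completeness magnitude.
    """
    selected = [m for m in magnitudes if m >= completeness_mag]
    mean_mag = sum(selected) / len(selected)
    return math.log10(math.e) / (mean_mag - (completeness_mag - bin_width / 2.0))


def schuster_p_value(event_times, period):
    """Schuster test p-value for periodicity of event times at a trial period.

    Small values indicate that event phases cluster more than expected
    for events occurring uniformly at random in time.
    """
    phases = [2.0 * math.pi * t / period for t in event_times]
    cos_sum = sum(math.cos(p) for p in phases)
    sin_sum = sum(math.sin(p) for p in phases)
    return math.exp(-(cos_sum ** 2 + sin_sum ** 2) / len(event_times))


# Hypothetical mini-catalogue: magnitudes and occurrence times in days.
magnitudes = [1.0, 1.2, 1.4, 1.6, 1.8, 0.7, 0.9]
times = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0]

print("b-value:", b_value(magnitudes, completeness_mag=1.0))
print("Schuster p (period = 1 day):", schuster_p_value(times, period=1.0))
```

In practice both quantities would be tracked in sliding time windows to follow their variation through the unrest episodes, as the abstract describes; the windowing is left out of this sketch.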
CONFIDENCE LEVELS AND/VS. STATISTICAL HYPOTHESIS TESTING IN STATISTICAL ANALYSIS. CASE STUDY
Directory of Open Access Journals (Sweden)
ILEANA BRUDIU
2009-05-01
Parameter estimation with confidence intervals and statistical hypothesis testing are used in statistical analysis to draw conclusions from a sample extracted from the population. The case study presented in this paper aims to highlight the importance of the sample size and how it is reflected in the results obtained when using confidence intervals and testing for pregnant subjects. Whereas statistical hypothesis tests give only a "yes" or "no" answer to some questions, estimation with confidence intervals provides more information than a test statistic: it shows the high degree of uncertainty arising from small samples and from findings described as "marginally significant" or "almost significant" (p very close to 0.05).
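The contrast drawn above between a yes/no test and a confidence interval can be illustrated with a small sketch. It assumes a one-sample z-test with a normal approximation and a known standard deviation, which is a simplification chosen for brevity and not the paper's method; the effect size, standard deviation and sample sizes are invented.

```python
import math
from statistics import NormalDist


def z_interval_and_p(sample_mean, sd, n, confidence=0.95):
    """Two-sided confidence interval and p-value for H0: mean = 0.

    Assumption: normal approximation with known standard deviation sd.
    """
    std_error = sd / math.sqrt(n)
    z_crit = NormalDist().inv_cdf(0.5 + confidence / 2.0)
    interval = (sample_mean - z_crit * std_error, sample_mean + z_crit * std_error)
    p_value = 2.0 * (1.0 - NormalDist().cdf(abs(sample_mean / std_error)))
    return interval, p_value


# Same observed mean difference (0.5) and spread (1.0) at three sample sizes.
for n in (15, 16, 400):
    (low, high), p = z_interval_and_p(0.5, 1.0, n)
    verdict = "significant" if p < 0.05 else "not significant"
    print(f"n={n}: 95% CI=({low:.3f}, {high:.3f}) p={p:.4f} -> {verdict}")
```

With these illustrative numbers, adding a single observation flips the test verdict while the interval barely changes, and only the large sample yields a narrow interval. That is the extra information the abstract attributes to confidence intervals.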
Directory of Open Access Journals (Sweden)
Lisa L Ellis
2014-07-01
We determined female genome sizes using flow cytometry for 211 Drosophila melanogaster sequenced inbred strains from the Drosophila Genetic Reference Panel, and found significant conspecific and intrapopulation variation in genome size. We also compared several life history traits for 25 lines with large and 25 lines with small genomes in three thermal environments, and found that genome size as well as genome size by temperature interactions significantly correlated with survival to pupation and adulthood, time to pupation, female pupal mass, and female eclosion rates. Genome size accounted for up to 23% of the variation in developmental phenotypes, but the contribution of genome size to variation in life history traits was plastic and varied according to the thermal environment. Expression data implicate differences in metabolism that correspond to genome size variation. These results indicate that significant genome size variation exists within D. melanogaster and this variation may impact the evolutionary ecology of the species. Genome size variation accounts for a significant portion of life history variation in an environmentally dependent manner, suggesting that potential fitness effects associated with genome size variation also depend on environmental conditions.
Ellis, Lisa L.; Huang, Wen; Quinn, Andrew M.; Ahuja, Astha; Alfrejd, Ben; Gomez, Francisco E.; Hjelmen, Carl E.; Moore, Kristi L.; Mackay, Trudy F. C.; Johnston, J. Spencer; Tarone, Aaron M.
2014-01-01
PMID:25057905
Observer variation in skeletal radiology
Energy Technology Data Exchange (ETDEWEB)
Cockshott, W.P.; Park, W.M.
1983-08-01
The factors that affect observer variation in bone radiology are analysed from data in the literature and on the basis of studies carried out at McMaster University on the hands and sacroiliac joints. A plea is made for presenting results in terms of Kappa statistics so that agreement due purely to chance is eliminated. In the conclusions the main variables that affect concordance are listed so that strategies can be developed to reduce observer variation. This is important in serial studies to ensure that the observer variations are smaller than the effect one wishes to measure.
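The chance correction that the abstract asks for can be sketched for the simplest case of two observers with Cohen's kappa, kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e the agreement expected by chance. The readings below are invented for illustration, and the abstract does not specify which kappa variant the authors have in mind.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two observers rating the same cases."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two non-empty rating lists of equal length")
    n = len(ratings_a)
    # Observed proportion of cases on which the observers agree.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Agreement expected by chance from each observer's marginal frequencies.
    freq_a = Counter(ratings_a)
    freq_b = Counter(ratings_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


# Hypothetical readings of ten radiographs (1 = abnormal, 0 = normal).
observer_1 = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
observer_2 = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
kappa = cohen_kappa(observer_1, observer_2)
```

Here the raw agreement is 80%, but about half of that would be expected by chance alone, so kappa is noticeably lower than the raw figure.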
Variational Bayesian labeled multi-Bernoulli filter with unknown sensor noise statistics
Directory of Open Access Journals (Sweden)
Qiu Hao
2016-10-01
It is difficult to build an accurate model of the measurement noise covariance in complex backgrounds. For scenarios with unknown sensor noise variances, an adaptive multi-target tracking algorithm based on the labeled random finite set and the variational Bayesian (VB) approximation is proposed. The variational approximation technique is introduced into the labeled multi-Bernoulli (LMB) filter to jointly estimate the states of targets and the sensor noise variances. Simulation results show that the proposed method gives an unbiased estimate of cardinality and performs better than the VB probability hypothesis density (VB-PHD) filter and the VB cardinality balanced multi-target multi-Bernoulli (VB-CBMeMBer) filter in harsh situations. The simulations also confirm the robustness of the proposed method against time-varying noise variances. The computational complexity of the proposed method is higher than that of the VB-PHD and VB-CBMeMBer filters in extreme cases, while the mean execution times of the three methods are close when targets are well separated.
Energy Technology Data Exchange (ETDEWEB)
Arcega-Cabrera, F. [Unidad de Quimica en Sisal, Facultad de Quimica, UNAM, Sisal 97355 (Mexico)], E-mail: arcega@icmyl.unam.mx; Armienta, M.A. [Instituto de Geofisica, UNAM, Mexico 04510 (Mexico); Daessle, L.W. [Instituto de Investigaciones Oceanologicas, UABC, Ensenada 22870 (Mexico); Castillo-Blum, S.E. [Facultad de Quimica, UNAM, Mexico 04510 (Mexico); Talavera, O. [Escuela de Ciencias de la Tierra, UAG, Taxco Viejo 40201 (Mexico); Dotor, A. [Instituto de Geofisica, UNAM, Mexico 04510 (Mexico)
2009-01-15
The potential environmental threat from Pb in Mexican rivers impacted by historic mining activities was studied using geochemical, isotopic and statistical methods. Lead geochemical fractionation and factor analysis of fractionated and total Pb indicate that anthropogenic sources have contributed significantly to Pb concentrations, while natural sources have contributed only small amounts. The analyses also indicate that two main processes are controlling the total Pb variation throughout the year in both rivers: erosion with discharge processes, and proportional dilution related to differences in grain-size distribution processes. Bio-available Pb in riverbed sediments was greater than 50% in 80% of the sampling stations indicating a high potential environmental risk, according to the risk assessment criteria (RAC). Nevertheless, based on the environmental chemistry of Pb and on multivariate statistical analysis, these criteria did not apply in this particular case. Significant differences (p < 0.05) in total Pb concentrations (from 50 to 5820 mg kg⁻¹) and in the geochemical fractionation were observed as a function of seasonality and location along the river flow path. In the Cacalotenango and Taxco rivers, the highest concentrations of total Pb were found at stations close to tailings during the rainy and post-rainy seasons. The geochemistry of Pb was mainly controlled, during the dry and post-rainy seasons by the organic matter and carbonate content, and in the rainy season by hydrological conditions (e.g., the increase in river flux), hydrological basin erosion, and the suspended solids concentration. Isotopic analyses of the ²¹⁰Pb/²¹⁴Pb ratio showed three processes in the Cacalotenango and Taxco rivers. First, the accumulation of atmospheric excess ²¹⁰Pb, favoured during calmer hydrodynamic conditions in the river basin commonly during dry periods, is recorded by a ²¹⁰Pb/²¹⁴Pb ratio of >1. In the case of the Cacalotenango
Thomas, Sarah A.; Weeks, Justin W.; Dougherty, Lea R.; Lipton, Melanie F.; Daruwala, Samantha E.; Kline, Kathryn
2015-01-01
Social anxiety often develops in adolescence, and precedes the onset of depression and substance use disorders. The link between social anxiety and use of behaviors to minimize distress in social situations (i.e., safety behaviors) is strong and for some patients, this link poses difficulty for engaging in, and benefiting from, exposure-based treatment. Yet, little is known about whether individual differences may moderate links between social anxiety and safety behaviors, namely variations in genetic alleles germane to anxiety. We examined the relation between adolescent social anxiety and expressions of safety behaviors, and whether allelic variation for anxiety moderates this relation. Adolescents (n=75; ages 14–17) were recruited from two larger studies investigating measurement of family relationships or adolescent social anxiety. Adolescents completed self-report measures about social anxiety symptoms and use of safety behaviors. They also provided saliva samples to assess allelic variations for anxiety from two genetic polymorphisms (BDNF rs6265; TAQ1A rs1800497). Controlling for adolescent age and gender, we observed a significant interaction between social anxiety symptoms and allelic variation (β=0.37, t=2.41, p=.02). Specifically, adolescents carrying allelic variations for anxiety evidenced a statistically significant and relatively strong positive relation between social anxiety symptoms and safety behaviors (β=0.73), whereas adolescents not carrying allelic variation evidenced a statistically non-significant and relatively weak relation (β=0.22). These findings have important implications for treating adolescent social anxiety, in that we identified an individual difference variable that can be used to identify people who evidence a particularly strong link between use of safety behaviors and expressing social anxiety. PMID:26692635
Ugurel, M S; Battal, B; Bozlar, U; Nural, M S; Tasar, M; Ors, F; Saglam, M; Karademir, I
2010-08-01
The purpose of our investigation was to determine the anatomical variations in the coeliac trunk-hepatic arterial system and the renal arteries in patients who underwent multidetector CT (MDCT) angiography of the abdominal aorta for various reasons. A total of 100 patients were analysed retrospectively. The coeliac trunk, hepatic arterial system and renal arteries were analysed individually and anatomical variations were recorded. Statistical analysis of the relationship between hepatocoeliac variations and renal artery variations was performed using a chi-squared (χ²) test. There was a coeliac trunk trifurcation in 89% and bifurcation in 8% of the cases. The coeliac trunk was absent in 1%, a hepatosplenomesenteric trunk was seen in 1% and a splenomesenteric trunk was present in 1%. Hepatic artery variation was present in 48% of patients. Coeliac trunk and/or hepatic arterial variation was present in 23 (39.7%) of the 58 patients with normal renal arteries, and in 27 (64.3%) of the 42 patients with accessory renal arteries. There was a statistically significant correlation between renal artery variations and coeliac trunk-hepatic arterial system variations (p = 0.015). MDCT angiography permits a correct and detailed evaluation of hepatic and renal vascular anatomy. The prevalence of variations in the coeliac trunk and/or hepatic arteries is increased in people with accessory renal arteries. For that reason, when undertaking angiographic examinations directed towards any single organ, the possibility of variations in the vascular structure of other organs should be kept in mind.
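The reported association can be checked from the counts given in the abstract (23 of 58 versus 27 of 42). The sketch below runs a Pearson chi-squared test on the resulting 2x2 table using only the standard library. It assumes no continuity correction, which the abstract does not state; under that assumption the p-value comes out at about 0.015, in line with the reported figure.

```python
import math


def pearson_chi2_2x2(a, b, c, d):
    """Pearson chi-squared test (no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    observed = ((a, b), (c, d))
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed[i][j] - expected) ** 2 / expected
    # With 1 degree of freedom, P(X > chi2) = erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Rows: normal renal arteries, accessory renal arteries.
# Columns: coeliac/hepatic variation present, absent.
chi2_stat, p_val = pearson_chi2_2x2(23, 58 - 23, 27, 42 - 27)
```

With Yates' continuity correction the statistic would be smaller and the p-value larger, so the choice of correction matters for borderline results.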
Influence of Design Variations on Systems Performance
Tumer, Irem Y.; Stone, Robert B.; Huff, Edward M.; Norvig, Peter (Technical Monitor)
2000-01-01
High-risk aerospace components have to meet very stringent quality, performance, and safety requirements. Any source of variation is a concern, as it may result in scrap or rework, poor performance, and potentially unsafe flying conditions. The sources of variation during product development, including design, manufacturing, and assembly, and during operation are shown. Sources of static and dynamic variation during development need to be detected accurately in order to prevent failure when the components are placed in operation. The Systems' Health and Safety (SHAS) research at the NASA Ames Research Center addresses the problem of detecting and evaluating the statistical variation in helicopter transmissions. In this work, we focus on the variations caused by design, manufacturing, and assembly of these components, prior to being placed in operation (DMV). In particular, we aim to understand and represent the failure and variation information, and their correlation to performance and safety, and to feed this information back into the development cycle at an early stage. The feedback of such critical information will assure the development of more reliable components with less rework and scrap. Variations during design and manufacturing are a common source of concern in the development and production of such components. Accounting for these variations, especially those that have the potential to affect performance, is accomplished in a variety of ways, including Taguchi methods, FMEA, quality control, statistical process control, and variation risk management. In this work, we start with the assumption that any of these variations can be represented mathematically, and accounted for by using analytical tools incorporating these mathematical representations. In this paper, we concentrate on variations that are introduced during design. Variations introduced during manufacturing are investigated in parallel work.
Probability, statistics, and computational science.
Beerenwinkel, Niko; Siebourg, Juliane
2012-01-01
In this chapter, we review basic concepts from probability theory and computational statistics that are fundamental to evolutionary genomics. We provide a very basic introduction to statistical modeling and discuss general principles, including maximum likelihood and Bayesian inference. Markov chains, hidden Markov models, and Bayesian network models are introduced in more detail as they occur frequently and in many variations in genomics applications. In particular, we discuss efficient inference algorithms and methods for learning these models from partially observed data. Several simple examples are given throughout the text, some of which point to models that are discussed in more detail in subsequent chapters.
International Nuclear Information System (INIS)
Silva-Rodríguez, Jesús; Domínguez-Prado, Inés; Pardo-Montero, Juan; Ruibal, Álvaro
2017-01-01
Purpose: The aim of this work is to study the effect of physiological muscular uptake variations and statistical noise on tumor quantification in FDG-PET studies. Methods: We designed a realistic framework based on simulated FDG-PET acquisitions from an anthropomorphic phantom that included different muscular uptake levels and three spherical lung lesions with diameters of 31, 21 and 9 mm. A distribution of muscular uptake levels was obtained from 136 patients referred to our center for whole-body FDG-PET. Simulated FDG-PET acquisitions were obtained by using the Simulation System for Emission Tomography (SimSET) Monte Carlo package. Simulated data were reconstructed by using an iterative Ordered Subset Expectation Maximization (OSEM) algorithm implemented in the Software for Tomographic Image Reconstruction (STIR) library. Tumor quantification was carried out by using estimates of SUVmax, SUV50 and SUVmean from different noise realizations, lung lesions and multiple muscular uptakes. Results: Our analysis provided quantification variability values of 17–22% (SUVmax), 11–19% (SUV50) and 8–10% (SUVmean) when muscular uptake variations and statistical noise were included. Meanwhile, quantification variability due only to statistical noise was 7–8% (SUVmax), 3–7% (SUV50) and 1–2% (SUVmean) for large tumors (>20 mm) and 13% (SUVmax), 16% (SUV50) and 8% (SUVmean) for small tumors (<10 mm), thus showing that the variability in tumor quantification is mainly affected by muscular uptake variations when large enough tumors are considered. In addition, our results showed that quantification variability is strongly dominated by statistical noise when the injected dose decreases below 222 MBq. Conclusions: Our study revealed that muscular uptake variations between patients who are totally relaxed should be considered as an uncertainty source of tumor quantification values. - Highlights: • Distribution of muscular uptake from 136 PET
Chakhmouradian, Anton R.; Reguir, Ekaterina P.; Zaitsev, Anatoly N.; Couëslan, Christopher; Xu, Cheng; Kynický, Jindřich; Mumin, A. Hamid; Yang, Panseok
2017-03-01
-spectroscopy proved inconclusive for apatites with small P-site deficiencies and other substituent elements in this site. Indicator REE ratios sensitive to redox conditions (δCe, δEu) and hydrothermal overprint (δY) form a fairly tight cluster of values (0.8-1.3, 0.8-1.1 and 0.6-0.9, respectively) and may be used in combination with trace-element abundances for the development of geochemical exploration tools. Hydrothermal apatite forms in carbonatites as the product of replacement of primary apatite, or is deposited in fractures and interstices as euhedral crystals and aggregates associated with typical late-stage minerals (e.g., quartz and chlorite). Hydrothermal apatite is typically depleted in Sr, REE, Mn and Th, but enriched in F (up to 4.8 wt.%) relative to its igneous precursor, and also differs from the latter in at least some of key REE ratios [e.g., shows (La/Yb)cn ≤ 25, or a negative Ce anomaly]. The only significant exception is Sr(± REE,Na)-rich replacement zones and overgrowths on igneous apatite from some dolomite(-bearing) carbonatites. Their crystallization conditions and source fluid appear to be very different from the more common Sr-REE-depleted variety. Based on the new evidence presented in this work, trace-element partitioning between apatite and carbonatitic magmas, phosphate solubility in these magmas, and compositional variation of apatite-group minerals from spatially associated carbonatitic rocks are critically re-evaluated.
To P or Not to P: Backing Bayesian Statistics.
Buchinsky, Farrel J; Chadha, Neil K
2017-12-01
In biomedical research, it is imperative to differentiate chance variation from truth before we generalize what we see in a sample of subjects to the wider population. For decades, we have relied on null hypothesis significance testing, where we calculate P values for our data to decide whether to reject a null hypothesis. This methodology is subject to substantial misinterpretation and errant conclusions. Instead of working backward by calculating the probability of our data if the null hypothesis were true, Bayesian statistics allow us instead to work forward, calculating the probability of our hypothesis given the available data. This methodology gives us a mathematical means of incorporating our "prior probabilities" from previous study data (if any) to produce new "posterior probabilities." Bayesian statistics tell us how confidently we should believe what we believe. It is time to embrace and encourage their use in our otolaryngology research.
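The step from "prior probabilities" to "posterior probabilities" can be sketched with the simplest conjugate case, a Beta prior on a response rate updated with binomial data. This is only an illustration of the general idea, not the authors' analysis; the prior and the new counts are invented, and encoding an earlier study as Beta(successes, failures) is one of several reasonable conventions.

```python
def beta_binomial_update(prior_a, prior_b, successes, failures):
    """Posterior Beta(a, b) parameters after observing binomial data."""
    return prior_a + successes, prior_b + failures


def beta_mean(a, b):
    """Mean of a Beta(a, b) distribution."""
    return a / (a + b)


# Hypothetical earlier study: 8 responders out of 20, encoded as Beta(8, 12).
prior = (8, 12)
# Hypothetical new study: 14 responders and 6 non-responders.
posterior = beta_binomial_update(*prior, successes=14, failures=6)

prior_rate = beta_mean(*prior)
posterior_rate = beta_mean(*posterior)
```

The posterior mean (0.55) lies between the prior mean (0.40) and the new sample proportion (0.70), weighted by how much data stands behind each.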
Vindras, Philippe; Desmurget, Michel; Baraduc, Pierre
2012-01-01
In science, it is a common experience to discover that although the investigated effect is very clear in some individuals, statistical tests are not significant because the effect is null or even opposite in other individuals. Indeed, t-tests, Anovas and linear regressions compare the average effect with respect to its inter-individual variability, so that they can fail to evidence a factor that has a high effect in many individuals (with respect to the intra-individual variability). In such paradoxical situations, statistical tools are at odds with the researcher's aim to uncover any factor that affects individual behavior, and not only those with stereotypical effects. In order to go beyond the reductive and sometimes illusory description of the average behavior, we propose a simple statistical method: applying a Kolmogorov-Smirnov test to assess whether the distribution of p-values provided by individual tests is significantly biased towards zero. Using Monte-Carlo studies, we assess the power of this two-step procedure with respect to RM Anova and multilevel mixed-effect analyses, and probe its robustness when individual data violate the assumption of normality and homoscedasticity. We find that the method is powerful and robust even with small sample sizes for which multilevel methods reach their limits. In contrast to existing methods for combining p-values, the Kolmogorov-Smirnov test has unique resistance to outlier individuals: it cannot yield significance based on a high effect in one or two exceptional individuals, which allows drawing valid population inferences. The simplicity and ease of use of our method facilitates the identification of factors that would otherwise be overlooked because they affect individual behavior in significant but variable ways, and its power and reliability with small sample sizes (<30-50 individuals) suggest it as a tool of choice in exploratory studies.
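The second step of the procedure, testing whether individual p-values are biased towards zero, can be sketched as a one-sided Kolmogorov-Smirnov test against the uniform distribution on (0, 1). The code below is a minimal standard-library version that uses the large-sample approximation P(D+ > d) ≈ exp(-2 n d²). That approximation is an assumption made here for brevity and is rough for small samples, where an exact one-sided distribution (as provided by statistical packages) would be preferable. The p-values in the example are invented.

```python
import math


def ks_pvalues_toward_zero(p_values):
    """One-sided KS statistic D+ against Uniform(0, 1), with an approximate p-value.

    D+ is the largest amount by which the empirical CDF of the p-values
    exceeds the uniform CDF, so large values mean an excess of small p-values.
    """
    if not p_values:
        raise ValueError("need at least one p-value")
    n = len(p_values)
    ordered = sorted(p_values)
    d_plus = max((i + 1) / n - p for i, p in enumerate(ordered))
    d_plus = max(d_plus, 0.0)
    # Asymptotic one-sided approximation; not exact for small n.
    approx_p = math.exp(-2.0 * n * d_plus ** 2)
    return d_plus, approx_p


# Hypothetical p-values from six individual-level tests.
individual_p = [0.5, 0.01, 0.2, 0.02, 0.04, 0.03]
d_stat, global_p = ks_pvalues_toward_zero(individual_p)
```

Because the statistic depends on the whole ordered set of p-values, a single extremely small p-value cannot by itself push D+ above 1/n, which reflects the resistance to outlier individuals described in the abstract.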
Energy Technology Data Exchange (ETDEWEB)
In, Wang Ki; Uh, Keun Sun; Chul, Kim Heui [Korea Atomic Energy Research Institute, Taejon (Korea, Republic of)
1995-02-01
A technically more direct statistical combination of uncertainties methodology, the extended SCU (XSCU), was applied to statistically combine the uncertainties associated with the DNBR alarm setpoint and the DNBR trip setpoint of digital nuclear power plants. The modified SCU (MSCU) methodology is currently used as the USNRC-approved design methodology to perform the same function. In this report, the MSCU and XSCU methodologies were compared in terms of the total uncertainties and the net margins to the DNBR alarm and trip setpoints. The MSCU methodology resulted in small total penalties because of a quite large, significantly negative bias, whereas the XSCU methodology gave virtually unbiased total uncertainties. The net margins to the DNBR alarm and trip setpoints obtained with the MSCU methodology agree with those obtained with the XSCU methodology within statistical variations. (Author) 12 refs., 17 figs., 5 tabs.
Variations of some parameters of enzyme induction in chemical workers
Energy Technology Data Exchange (ETDEWEB)
Dolara, P. (Univ. of Florence, Italy); Lodovici, M.; Buffoni, F.; Buiatti, E.; Baccetti, S.; Ciofini, O.; Bavazzano, P.; Barchielli, S.; Vannucci, V.
1982-01-01
Several parameters related to mono-oxygenase activity were followed in a population of chemical workers and controls. Workers exposed to toluene and xylene had a significant increase in urinary glucaric acid, which was correlated with hippuric acid excretion. On the other hand, workers exposed to pigments showed a marked increase in antipyrine half-life. A dose-related decrease in liver N-demethylase was induced in rats by the administration of a mixture of three of the pigments in use in the plant. Serum gamma-glutamyltranspeptidase was decreased in the workers exposed to pigments, but this variation was not statistically significant. Exposure to different chemicals in the workplace seemed to induce a complicated variation of mono-oxygenase levels, with some enzymes being inhibited and others induced in the same group of workers. The sensitivity of these workers to the toxic effects of chemicals, carcinogenic compounds and drugs seems to differ markedly from that of the control population.
MSD Recombination Method in Statistical Machine Translation
Gros, Jerneja Žganec
2008-11-01
Freely available tools and language resources were used to build the VoiceTRAN statistical machine translation (SMT) system. Various configuration variations of the system are presented and evaluated. The VoiceTRAN SMT system outperformed the baseline conventional rule-based MT system in all English-Slovenian in-domain test setups. To further increase the generalization capability of the translation model for lower-coverage out-of-domain test sentences, an "MSD-recombination" approach was proposed. This approach not only allows a better exploitation of conventional translation models, but also performs well in the more demanding translation direction; that is, into a highly inflectional language. Using this approach in the out-of-domain setup of the English-Slovenian JRC-ACQUIS task, we have achieved significant improvements in translation quality.
A generalized regression model of arsenic variations in the shallow groundwater of Bangladesh
Taylor, Richard G.; Chandler, Richard E.
2015-01-01
Localized studies of arsenic (As) in Bangladesh have reached disparate conclusions regarding the impact of irrigation‐induced recharge on As concentrations in shallow (≤50 m below ground level) groundwater. We construct generalized regression models (GRMs) to describe observed spatial variations in As concentrations in shallow groundwater both (i) nationally, and (ii) regionally within Holocene deposits where As concentrations in groundwater are generally high (>10 μg L−1). At these scales, the GRMs reveal statistically significant inverse associations between observed As concentrations and two covariates: (1) hydraulic conductivity of the shallow aquifer and (2) net increase in mean recharge between predeveloped and developed groundwater‐fed irrigation periods. Further, the GRMs show that the spatial variation of groundwater As concentrations is well explained by not only surface geology but also statistical interactions (i.e., combined effects) between surface geology and mean groundwater recharge, thickness of surficial silt and clay, and well depth. Net increases in recharge result from intensive groundwater abstraction for irrigation, which induces additional recharge where it is enabled by a permeable surface geology. Collectively, these statistical associations indicate that irrigation‐induced recharge serves to flush mobile As from shallow groundwater. PMID:27524841
Singer, Meromit; Engström, Alexander; Schönhuth, Alexander; Pachter, Lior
2011-09-23
Recent experimental and computational work confirms that CpGs can be unmethylated inside coding exons, thereby showing that codons may be subjected to both genomic and epigenomic constraint. It is therefore of interest to identify coding CpG islands (CCGIs) that are regions inside exons enriched for CpGs. The difficulty in identifying such islands is that coding exons exhibit sequence biases determined by codon usage and constraints that must be taken into account. We present a method for finding CCGIs that showcases a novel approach we have developed for identifying regions of interest that are significant (with respect to a Markov chain) for the counts of any pattern. Our method begins with the exact computation of tail probabilities for the number of CpGs in all regions contained in coding exons, and then applies a greedy algorithm for selecting islands from among the regions. We show that the greedy algorithm provably optimizes a biologically motivated criterion for selecting islands while controlling the false discovery rate. We applied this approach to the human genome (hg18) and annotated CpG islands in coding exons. The statistical criterion we apply to evaluating islands reduces the number of false positives in existing annotations, while our approach to defining islands reveals significant numbers of undiscovered CCGIs in coding exons. Many of these appear to be examples of functional epigenetic specialization in coding exons.
Regional variations in the management of testicular or ovotesticular disorders of sex development.
Josso, N; Audi, L; Shaw, G
2011-01-01
Disorders of sex development arise in parts of the world with different socio-economic and cultural characteristics. We wished to determine the regional variations in the management of these conditions. A questionnaire was e-mailed to the 650 members of the European Society for Paediatric Endocrinology (ESPE), an international society with a mainly European membership but which also includes professionals from other continents. Results were subjected to statistical analysis. A total of 62 answers were received, a satisfactory rate given that not all members are involved in this issue. Results show statistically significant regional differences for available diagnostic resources, age of the patient at gender assignment, parameters considered important for gender assignment, and timing of discussion of various issues with parents and patient. The regional variations exist not only between different continents, as already demonstrated by others, but also between Northern, Latin and Eastern European countries. This suggests that 'one-fits-all' guidelines for management are not appropriate. Copyright © 2011 S. Karger AG, Basel.
Assessment of the beryllium lymphocyte proliferation test using statistical process control.
Cher, Daniel J; Deubner, David C; Kelsh, Michael A; Chapman, Pamela S; Ray, Rose M
2006-10-01
Despite more than 20 years of surveillance and epidemiologic studies using the beryllium blood lymphocyte proliferation test (BeBLPT) as a measure of beryllium sensitization (BeS) and as an aid for diagnosing subclinical chronic beryllium disease (CBD), improvements in specific understanding of the inhalation toxicology of CBD have been limited. Although epidemiologic data suggest that BeS and CBD risks vary by process/work activity, it has proven difficult to reach specific conclusions regarding the dose-response relationship between workplace beryllium exposure and BeS or subclinical CBD. One possible reason for this uncertainty could be misclassification of BeS resulting from variation in BeBLPT testing performance. The reliability of the BeBLPT, a biological assay that measures beryllium sensitization, is unknown. To assess the performance of four laboratories that conducted this test, we used data from a medical surveillance program that offered testing for beryllium sensitization with the BeBLPT. The study population was workers exposed to beryllium at various facilities over a 10-year period (1992-2001). Workers with abnormal results were offered diagnostic workups for CBD. Our analyses used a standard statistical technique, statistical process control (SPC), to evaluate test reliability. The study design involved a repeated measures analysis of BeBLPT results generated from the company-wide, longitudinal testing. Analytical methods included use of (1) statistical process control charts that examined temporal patterns of variation for the stimulation index, a measure of cell reactivity to beryllium; (2) correlation analysis that compared prior perceptions of BeBLPT instability to the statistical measures of test variation; and (3) assessment of the variation in the proportion of missing test results and how time periods with more missing data influenced SPC findings. During the period of this study, all laboratories displayed variation in test results that
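The kind of control chart mentioned in the abstract can be sketched with a Shewhart individuals chart, where the centre line is the baseline mean and the limits sit three estimated standard deviations away, with the standard deviation estimated from the average moving range. The abstract does not say which chart type the authors used, so the individuals chart is an assumption, and the stimulation-index values below are invented.

```python
import statistics

# Standard d2 constant for moving ranges of two consecutive observations.
D2_FOR_PAIRS = 1.128


def individuals_chart_limits(baseline):
    """Lower limit, centre line and upper limit of an individuals chart."""
    moving_ranges = [abs(b - a) for a, b in zip(baseline, baseline[1:])]
    sigma_hat = statistics.fmean(moving_ranges) / D2_FOR_PAIRS
    center = statistics.fmean(baseline)
    return center - 3.0 * sigma_hat, center, center + 3.0 * sigma_hat


def out_of_control(values, baseline):
    """Indices of values that fall outside the baseline control limits."""
    lower, _, upper = individuals_chart_limits(baseline)
    return [i for i, v in enumerate(values) if v < lower or v > upper]


# Hypothetical stimulation-index results from a stable reference period.
baseline_si = [1.0, 1.2, 0.9, 1.1, 1.0, 1.2, 0.8, 1.0]
# Hypothetical later results from the same laboratory.
later_si = [1.1, 0.9, 2.4, 1.0]

limits = individuals_chart_limits(baseline_si)
flagged = out_of_control(later_si, baseline_si)
```

Points outside the limits signal variation beyond what the baseline period would explain and are candidates for follow-up rather than proof of a laboratory problem.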
The influence of ENSO, PDO and PNA on secular rainfall variations in Hawai‘i
Abby G. Frazier; Oliver Elison Timm; Thomas W. Giambelluca; Henry F. Diaz
2017-01-01
Over the last century, significant declines in rainfall across the state of Hawai‘i have been observed, and it is unknown whether these declines are due to natural variations in climate, or manifestations of human-induced climate change. Here, a statistical analysis of the observed rainfall variability was applied as a first step towards better understanding causes for...
International Nuclear Information System (INIS)
Wang Xiaodong; Yang Renjie
2009-01-01
Objective: To investigate variations of the hepatic artery and its extrahepatic branches on hepatic arteriograms and to provide guidance for transhepatic arterial chemoembolization. Methods: The hepatic arteriograms of 200 patients with unresectable hepatic malignant tumors, obtained before interventional therapy, were analysed. Two interventional radiologists jointly reviewed the incidence of each type according to Michels' classification, the absence of the proper hepatic artery, and the variations of extrahepatic arteries originating from the hepatic artery. Results: The most common hepatic artery variation was Michels type III (n=17, 8.5%), followed by type II (n=10, 5.0%) and type V (n=9, 4.5%). Absence of the proper hepatic artery was found in 25 cases and appeared as 5 subtypes. Five kinds of extrahepatic arteries were found. The most common was the right gastric artery (n=156, 78.0%), followed by the cystic artery (n=126, 63.0%), accessory left gastric artery (n=19, 9.5%), hepatic falciform artery (n=5, 2.5%), and accessory left inferior phrenic artery (n=4, 2.0%). Conclusion: There are variations of the hepatic artery beyond Michels' classification, and many variations of extrahepatic arteries originating from the hepatic artery; recognizing them is important for ensuring the effect of interventional therapy for hepatic cancer and preventing complications. (authors)
Medial depression with bony dehiscence of lamina papyracea as an anatomic variation: CT evaluation
International Nuclear Information System (INIS)
Na, Sun Young; Lee, Young Uk; Youn, Eun Kyung; Suh, Sang Gyung; Kim, Dong Hyun
1994-01-01
To evaluate the incidence and CT findings of medial depression and bony dehiscence of the lamina papyracea as an anatomic variation, 1472 PNS CTs of patients with symptoms of chronic sinusitis were retrospectively evaluated. The total incidence of depressed lamina papyracea as an anatomic variation was 3.5% (52/1472) on PNS CT. There was a statistically significant correlation between increasing age and the incidence of depressed lamina papyracea. Depressions of the lamina papyracea anterior to the basal lamella were more common than posterior depressions. Associated findings were herniation of adjacent fatty tissue in all cases and medial bowing and a hypertrophied configuration of the medial rectus muscle without significant herniation in 19 cases (34%). Nontraumatic, asymptomatic depression with bony dehiscence of the lamina papyracea as an anatomic variation is not uncommon, with an incidence of 3.5%. Recognition of its existence and degree may be helpful in avoiding various ocular complications during ethmoid surgery.
Assessment of spatio-temporal variations in surface water quality of ...
African Journals Online (AJOL)
MANN
2012-10-02
Oct 2, 2012 ... 1School of Biological and Environmental Sciences, Shoolini University, Solan, H.P.- 173229, India. 2Department ... variation. Of late, multivariate statistical techniques such ..... Statistical Package for the Social Sciences 10.0.
Directory of Open Access Journals (Sweden)
Emma Lightfoot
Full Text Available Oxygen isotope analysis of archaeological skeletal remains is an increasingly popular tool to study past human migrations. It is based on the assumption that human body chemistry preserves the δ18O of precipitation in such a way as to be a useful technique for identifying migrants and, potentially, their homelands. In this study, the first such global survey, we draw on published human tooth enamel and bone bioapatite data to explore the validity of using oxygen isotope analyses to identify migrants in the archaeological record. We use human δ18O results to show that there are large variations in human oxygen isotope values within a population sample. This may relate to physiological factors influencing the preservation of the primary isotope signal, or to human activities (such as brewing, boiling, stewing, differential access to water sources and so on) causing variation in ingested water and food isotope values. We compare the number of outliers identified using various statistical methods. We determine that the most appropriate method for identifying migrants is dependent on the data but is likely to be the IQR or median absolute deviation from the median under most archaeological circumstances. Finally, through a spatial assessment of the dataset, we show that the degree of overlap in human isotope values from different locations across Europe is such that identifying individuals' homelands on the basis of oxygen isotope analysis alone is not possible for the regions analysed to date. Oxygen isotope analysis is a valid method for identifying first-generation migrants from an archaeological site when used appropriately; however, it is difficult to identify migrants using statistical methods for a sample size of less than c. 25 individuals. In the absence of local previous analyses, each sample should be treated as an individual dataset and statistical techniques can be used to identify migrants, but in most cases pinpointing a specific
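The two outlier rules named in this abstract (the IQR and the median absolute deviation from the median) can be sketched as follows. This is a minimal illustration, not the authors' procedure: the fence multiplier 1.5, the MAD cut-off of 3, the normal-consistency constant 1.4826, and the sample values are all assumptions chosen for the example.

```python
import statistics


def iqr_outliers(values, k=1.5):
    """Return values lying more than k * IQR outside the quartiles.

    k = 1.5 is the conventional Tukey fence, assumed here for illustration.
    """
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


def mad_outliers(values, threshold=3.0):
    """Return values whose scaled distance from the median exceeds threshold.

    The MAD is multiplied by 1.4826 so that it is comparable to a standard
    deviation for normally distributed data; threshold = 3 is an assumption.
    """
    med = statistics.median(values)
    mad = statistics.median([abs(v - med) for v in values])
    if mad == 0:
        return []  # no spread around the median, so nothing can be flagged
    scaled_mad = 1.4826 * mad
    return [v for v in values if abs(v - med) / scaled_mad > threshold]


# Hypothetical isotope values for one site; not data from the study.
sample = [26.1, 26.4, 26.0, 26.3, 26.2, 26.5, 25.9, 26.1, 26.3, 29.8]
print("IQR rule:", iqr_outliers(sample))
print("MAD rule:", mad_outliers(sample))
```

Both rules depend only on order statistics, which is why they remain usable when a few non-local individuals inflate the mean and standard deviation of a small sample.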
Freedman, Laurence S; Midthune, Douglas; Dodd, Kevin W; Carroll, Raymond J; Kipnis, Victor
2015-11-30
Most statistical methods that adjust analyses for measurement error assume that the target exposure T is a fixed quantity for each individual. However, in many applications, the value of T for an individual varies with time. We develop a model that accounts for such variation, describing the model within the framework of a meta-analysis of validation studies of dietary self-report instruments, where the reference instruments are biomarkers. We demonstrate that in this application, the estimates of the attenuation factor and correlation with true intake, key parameters quantifying the accuracy of the self-report instrument, are sometimes substantially modified under the time-varying exposure model compared with estimates obtained under a traditional fixed-exposure model. We conclude that accounting for the time element in measurement error problems is potentially important. Copyright © 2015 John Wiley & Sons, Ltd.
Short and long periodic atmospheric variations between 25 and 200 km
Justus, C. G.; Woodrum, A.
1973-01-01
Previously collected data on atmospheric pressure, density, temperature and winds between 25 and 200 km from sources including Meteorological Rocket Network data, ROBIN falling sphere data, grenade release and pitot tube data, meteor winds, chemical release winds, satellite data, and others were analyzed by a daily difference method and results on the distribution statistics, magnitude, and spatial structure of gravity wave and planetary wave atmospheric variations are presented. The time structure of the gravity wave variations was determined by the analysis of residuals from harmonic analysis of time series data. Planetary wave contributions in the 25-85 km range were discovered and found to have significant height and latitudinal variation. Long period planetary waves and seasonal variations were also computed by harmonic analysis. Revised height variations of the gravity wave contributions in the 25 to 85 km height range were computed. An engineering method and design values for gravity wave magnitudes and wave lengths are given to be used for such tasks as evaluating the effects on the dynamical heating, stability and control of spacecraft such as the space shuttle vehicle in launch or reentry trajectories.
Reese, Sarah E; Archer, Kellie J; Therneau, Terry M; Atkinson, Elizabeth J; Vachon, Celine M; de Andrade, Mariza; Kocher, Jean-Pierre A; Eckel-Passow, Jeanette E
2013-11-15
Batch effects are due to probe-specific systematic variation between groups of samples (batches) resulting from experimental features that are not of biological interest. Principal component analysis (PCA) is commonly used as a visual tool to determine whether batch effects exist after applying a global normalization method. However, PCA yields linear combinations of the variables that contribute maximum variance and thus will not necessarily detect batch effects if they are not the largest source of variability in the data. We present an extension of PCA to quantify the existence of batch effects, called guided PCA (gPCA). We describe a test statistic that uses gPCA to test whether a batch effect exists. We apply our proposed test statistic derived using gPCA to simulated data and to two copy number variation case studies: the first study consisted of 614 samples from a breast cancer family study using Illumina Human 660 bead-chip arrays, whereas the second case study consisted of 703 samples from a family blood pressure study that used Affymetrix SNP Array 6.0. We demonstrate that our statistic has good statistical properties and is able to identify significant batch effects in two copy number variation case studies. We developed a new statistic that uses gPCA to identify whether batch effects exist in high-throughput genomic data. Although our examples pertain to copy number data, gPCA is general and can be used on other data types as well. The gPCA R package (Available via CRAN) provides functionality and data to perform the methods in this article. reesese@vcu.edu
A robust statistical method for association-based eQTL analysis.
Directory of Open Access Journals (Sweden)
Ning Jiang
Full Text Available It is well established that the theoretical kernel of the recently surging genome-wide association study (GWAS) is statistical inference of linkage disequilibrium (LD) between a tested genetic marker and a putative locus affecting a disease trait. However, LD analysis is vulnerable to several confounding factors, of which population stratification is the most prominent. Whilst many methods have been proposed to correct for this influence, either by predicting the structure parameters or by correcting inflation in the test statistic due to the stratification, these may not be feasible or may impose further statistical problems in practical implementation. We propose here a novel statistical method to control spurious LD in GWAS arising from population structure by incorporating a control marker into testing for significance of genetic association of a polymorphic marker with phenotypic variation of a complex trait. The method avoids the need for structure prediction, which may be infeasible or inadequate in practice, and accounts properly for a varying effect of population stratification on different regions of the genome under study. Utility and statistical properties of the new method were tested through an intensive computer simulation study and an association-based genome-wide mapping of expression quantitative trait loci in genetically divergent human populations. The analyses show that the new method confers improved statistical power for detecting genuine genetic association in subpopulations and effective control of spurious associations stemming from population structure when compared with two other popularly implemented methods in the GWAS literature.
Quantifying variation in speciation and extinction rates with clade data.
Paradis, Emmanuel; Tedesco, Pablo A; Hugueny, Bernard
2013-12-01
High-level phylogenies are very common in evolutionary analyses, although they are often treated as incomplete data. Here, we provide statistical tools to analyze what we name "clade data," which are the ages of clades together with their numbers of species. We develop a general approach for the statistical modeling of variation in speciation and extinction rates, including temporal variation, unknown variation, and linear and nonlinear modeling. We show how this approach can be generalized to a wide range of situations, including testing the effects of life-history traits and environmental variables on diversification rates. We report the results of an extensive simulation study to assess the performance of some statistical tests presented here as well as of the estimators of speciation and extinction rates. These latter results suggest the possibility to estimate correctly extinction rate in the absence of fossils. An example with data on fish is presented. © 2013 The Author(s). Evolution © 2013 The Society for the Study of Evolution.
Yermolaev, Yu. I.; Lodkina, I. G.; Nikolaeva, N. S.; Yermolaev, M. Yu.
2011-02-01
We investigate the behavior of mean values of the solar wind’s and interplanetary magnetic field’s (IMF) parameters and their absolute and relative variations during the magnetic storms generated by various types of the solar wind. In this paper, which is a continuation of paper [1], we, on the basis of the OMNI data archive for the period of 1976-2000, have analyzed 798 geomagnetic storms with Dst ≤ -50 nT and their interplanetary sources: corotating interaction regions CIR, compression regions Sheath before the interplanetary CMEs; magnetic clouds MC; “Pistons” Ejecta, and an uncertain type of a source. For the analysis the double superposed epoch analysis method was used, in which the instants of the magnetic storm onset and the minimum of the Dst index were taken as reference times. It is shown that the set of interplanetary sources of magnetic storms can be sub-divided into two basic groups according to their slowly and fast varying characteristics: (1) ICME (MC and Ejecta) and (2) CIR and Sheath. The mean values, the absolute and relative variations in MC and Ejecta for all parameters appeared to be either mean or lower than the mean value (the mean values of the electric field Ey and of the Bz component of IMF are higher in absolute value), while in CIR and Sheath they are higher than the mean value. High values of the relative density variation sN/⟨N⟩ are observed in MC. At the same time, the high values for relative variations of the velocity, Bz component, and IMF magnitude are observed in Sheath and CIR. No noticeable distinctions in the relationships between considered parameters for moderate and strong magnetic storms were observed.
Statistical process control for serially correlated data
Wieringa, Jakob Edo
1999-01-01
Statistical Process Control (SPC) aims at quality improvement through reduction of variation. The best known tool of SPC is the control chart. Over the years, the control chart has proved to be a successful practical technique for monitoring process measurements. However, its usefulness in practice
Xia, Li C; Ai, Dongmei; Cram, Jacob A; Liang, Xiaoyi; Fuhrman, Jed A; Sun, Fengzhu
2015-09-21
Local trend (i.e. shape) analysis of time series data reveals co-changing patterns in dynamics of biological systems. However, slow permutation procedures to evaluate the statistical significance of local trend scores have limited its applications to high-throughput time series data analysis, e.g., data from the next generation sequencing technology based studies. By extending the theories for the tail probability of the range of sum of Markovian random variables, we propose formulae for approximating the statistical significance of local trend scores. Using simulations and real data, we show that the approximate p-value is close to that obtained using a large number of permutations (starting at time points >20 with no delay and >30 with delay of at most three time steps) in that the non-zero decimals of the p-values obtained by the approximation and the permutations are mostly the same when the approximate p-value is less than 0.05. In addition, the approximate p-value is slightly larger than that based on permutations, making hypothesis testing based on the approximate p-value conservative. The approximation enables efficient calculation of p-values for pairwise local trend analysis, making large scale all-versus-all comparisons possible. We also propose a hybrid approach by integrating the approximation and permutations to obtain accurate p-values for significantly associated pairs. We further demonstrate its use with the analysis of the Plymouth Marine Laboratory (PML) microbial community time series from high-throughput sequencing data and found interesting organism co-occurrence dynamic patterns. The software tool is integrated into the eLSA software package that now provides accelerated local trend and similarity analysis pipelines for time series data. The package is freely available from the eLSA website: http://bitbucket.org/charade/elsa.
Fang, Yongxiang; Wit, Ernst
2008-01-01
Fisher’s combined probability test is the most commonly used method to test the overall significance of a set of independent p-values. However, it is obvious that Fisher’s statistic is more sensitive to smaller p-values than to larger p-values, and a small p-value may overrule the other p-values and decide the test result. This is, in some cases, viewed as a flaw. In order to overcome this flaw and improve the power of the test, the joint tail probability of a set of p-values is proposed as a ...
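The sensitivity described in this abstract can be seen directly from the classical Fisher statistic, X = -2 Σ ln p_i, which follows a chi-square distribution with 2k degrees of freedom for k independent p-values under the global null. The sketch below covers only that classical test, not the joint tail probability the authors propose; the function name and the example p-values are illustrative assumptions.

```python
import math


def fisher_combined(p_values):
    """Fisher's combined probability test for independent p-values.

    Returns (statistic, combined_p). For even degrees of freedom 2k the
    chi-square survival function has the closed form
        exp(-x/2) * sum_{i=0}^{k-1} (x/2)**i / i!
    so no external statistics library is needed.
    """
    if not p_values or any(not (0.0 < p <= 1.0) for p in p_values):
        raise ValueError("p-values must be non-empty and lie in (0, 1]")
    k = len(p_values)
    statistic = -2.0 * sum(math.log(p) for p in p_values)
    half = statistic / 2.0
    term, total = 1.0, 1.0
    for i in range(1, k):
        term *= half / i  # running value of (x/2)**i / i!
        total += term
    return statistic, math.exp(-half) * total


# Illustrative inputs: one very small p-value among unremarkable ones,
# versus three moderately small p-values.
_, p_one_small = fisher_combined([0.001, 0.5, 0.5])
_, p_all_moderate = fisher_combined([0.1, 0.1, 0.1])
print(p_one_small, p_all_moderate)
```

In this example the set containing a single very small p-value yields the smaller combined p-value, even though its other two entries carry no evidence against the null, which is the behaviour the abstract describes as a possible flaw.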
Research variations the channel of gallbladder in relation to gender
Directory of Open Access Journals (Sweden)
Spasojević Goran
2014-01-01
Full Text Available The cystic duct (DC) shows various anatomical variations, knowledge of which is important in diagnosis and particularly in abdominal surgery. The aim of this study was to investigate possible gender differences in the morphology of the cystic duct. The total sample consisted of 50 anatomical liver specimens taken from autopsy cases of both sexes (32 men and 18 women). The liver samples were fixed in 4% formalin for 4 weeks and then examined by microdissection. We measured the length of the cystic duct and classified the type of its junction with the common hepatic duct (DHC). We classified the anatomical variations of the junction of the DC with the DHC into three types (angular, parallel and spiral) and then performed statistical analysis of the data with respect to gender. Results: The angular type of junction was found in approximately 2/3 (64%) of the samples (men 60%, women 72%), the parallel type in 22% of the samples (men 31%, women 6%) and the spiral type in 14% of the samples (men 9%, women 22%). The parallel and spiral types together make up about 1/3 of the cases. The average length of the DC was 3.1 cm, with a range of 1.7 to 6.2 cm. The type of junction between the DC and DHC (angular versus parallel and spiral) differed between the genders, but the difference was not statistically significant (p > 0.05).
A Variational Statistical-Field Theory for Polar Liquid Mixtures
Zhuang, Bilin; Wang, Zhen-Gang
Using a variational field-theoretic approach, we derive a molecularly-based theory for polar liquid mixtures. The resulting theory consists of simple algebraic expressions for the free energy of mixing and the dielectric constant as functions of mixture composition. Using only the dielectric constants and the molar volumes of the pure liquid constituents, the theory evaluates the mixture dielectric constants in good agreement with the experimental values for a wide range of liquid mixtures, without using adjustable parameters. In addition, the theory predicts that liquids with similar dielectric constants and molar volumes dissolve well in each other, while sufficient disparity in these parameters results in phase separation. The calculated miscibility map on the dielectric constant-molar volume axes agrees well with known experimental observations for a large number of liquid pairs. Thus the theory provides a quantification for the well-known empirical "like-dissolves-like" rule. BZ acknowledges the A-STAR fellowship for the financial support.
Simple statistical model for branched aggregates
DEFF Research Database (Denmark)
Lemarchand, Claire; Hansen, Jesper Schmidt
2015-01-01
We propose a statistical model that can reproduce the size distribution of any branched aggregate, including amylopectin, dendrimers, molecular clusters of monoalcohols, and asphaltene nanoaggregates. It is based on the conditional probability for one molecule to form a new bond with a molecule, given that it already has bonds with others. The model is applied here to asphaltene nanoaggregates observed in molecular dynamics simulations of Cooee bitumen. The variation with temperature of the probabilities deduced from this model is discussed in terms of statistical mechanics arguments. The relevance of the statistical model in the case of asphaltene nanoaggregates is checked by comparing the predicted value of the probability for one molecule to have exactly i bonds with the same probability directly measured in the molecular dynamics simulations. The agreement is satisfactory.
Regional variation in rates of pediatric perforated appendicitis.
Sarda, Samir; Short, Heather L; Hockenberry, Jason M; McCarthy, Ian; Raval, Mehul V
2017-09-01
While trends in perforated appendicitis (PA) rates have been studied, regional variability in pediatric admissions for PA remains unknown. A retrospective, cross-sectional analysis of the 2006-2012 Kids' Inpatient Database was conducted to examine variation in PA admission rates by region of the United States and insurance status. PA rates were calculated and reported as per 1000 admissions in accordance with national quality measure specifications. National PA rates per 1000 admissions for 2006, 2009, and 2012 were 313.9, 279.2, and 309.1, respectively. Similarly, all regions demonstrated a statistically significant decrease in PA rates between 2006 and 2009 (pappendicitis, geographic region and insurance status appear to be associated with perforation upon presentation. Understanding regional variation in pediatric PA rates may inform health policymakers in the constantly evolving insurance coverage landscape. Level III Treatment Study - Retrospective comparative study of appendicitis presentation in children by region of the country. Copyright © 2017 Elsevier Inc. All rights reserved.
Directory of Open Access Journals (Sweden)
Xiuchen Wu
2017-10-01
Full Text Available Rapid climate warming, with much higher warming rates in winter and spring, could affect the vernalization fulfillment, a critical process for induction of crop reproductive growth and consequent grain filling in temperate winter crops. However, regional observational evidence of the effects of historical warming-mediated vernalization variations on temperate winter crop yields is lacking. Here, we statistically quantified the interannual sensitivity of winter wheat yields to vernalization degree days (VDD during 1975–2009 and its spatial relationship with multi-year mean VDD over temperate Europe (TE, using EUROSTAT crop yield statistics, observed and simulated crop phenology data and gridded daily climate data. Our results revealed a pervasively positive interannual sensitivity of winter wheat yields to variations in VDD (γVDD over TE, with a mean γVDD of 2.8 ± 1.5 kg ha−1 VDD−1. We revealed a significant (p < 0.05 negative exponential relationship between γVDD and multi-year mean VDD for winter wheat across TE, with higher γVDD in winter wheat planting areas with lower multi-year mean VDD. Our findings shed light on potential vulnerability of winter wheat yields to warming-mediated vernalization variations over TE, particularly considering a likely future warmer climate.
Directory of Open Access Journals (Sweden)
Kouki Fujioka
2013-11-01
Full Text Available Electronic noses have the benefit of obtaining smell information in a simple and objective manner; therefore, many applications have been developed for broad analysis areas such as food, drinks, cosmetics, medicine, and agriculture. However, measurement values from electronic noses have a tendency to vary under humidity or alcohol exposure conditions, since several types of sensors in the devices are affected by such variables. Consequently, we show three techniques for reducing the variation of sensor values: (1) using a trapping system to reduce the interfering components; (2) performing statistical standardization (calculation of z-score); and (3) selecting suitable sensors. With these techniques, we discriminated the volatiles of four types of fresh mushrooms: golden needle (Flammulina velutipes), white mushroom (Agaricus bisporus), shiitake (Lentinus edodes), and eryngii (Pleurotus eryngii) among six fresh mushrooms (hen of the woods (Grifola frondosa) and shimeji (Hypsizygus marmoreus) plus the above mushrooms). Additionally, we succeeded in discriminating white mushroom from artificial mushroom flavors alone, such as champignon flavor and truffle flavor. In conclusion, our techniques will expand the options to reduce variations in sensor values.
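The statistical standardization mentioned in this abstract (calculation of z-scores) can be sketched as below. This is an assumption-laden illustration: it standardizes each sensor's readings across samples using the sample standard deviation, and the sensor names and values are hypothetical. The abstract does not state along which axis the authors standardize.

```python
import statistics


def z_scores(readings):
    """Standardize one sensor's readings to zero mean and unit sample SD."""
    mean = statistics.fmean(readings)
    sd = statistics.stdev(readings)
    if sd == 0:
        raise ValueError("constant readings cannot be standardized")
    return [(x - mean) / sd for x in readings]


def standardize_sensors(readings_by_sensor):
    """Apply z-scoring independently to every sensor in a dict."""
    return {name: z_scores(vals) for name, vals in readings_by_sensor.items()}


# Hypothetical raw responses of two sensors across four samples.
raw = {
    "sensor_a": [0.82, 0.91, 0.78, 0.88],
    "sensor_b": [120.0, 180.0, 95.0, 160.0],
}
for name, values in standardize_sensors(raw).items():
    print(name, [round(v, 3) for v in values])
```

After standardization the two sensors are on a common, unitless scale, so a sensor with a large raw range no longer dominates a downstream distance-based or multivariate comparison.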
Variations in testosterone pathway genes and susceptibility to testicular cancer in Norwegian men.
Kristiansen, W; Aschim, E L; Andersen, J M; Witczak, O; Fosså, S D; Haugen, T B
2012-12-01
Imbalance between the oestrogen and androgen levels in utero is hypothesized to influence testicular cancer (TC) risk. Thus, variation in genes involved in the action of sex hormones may contribute to variability of an individual's susceptibility to TC. Mutations in testosterone pathway genes may alter the level of testosterone in vivo and hypothetically the risk of developing TC. Luteinizing hormone receptor (LHR), 5α-reductase II (SRD5A2) and androgen receptor (AR) are key elements in androgen action. A case-control study comprising 651 TC cases and 313 controls in a Norwegian population was conducted to investigate polymorphisms in the LHR, SRD5A2 and AR genes and their possible association with TC. A statistically significant difference was observed in patients heterozygous for the LHR Asn312Ser polymorphism when comparing genotypes between all TC cases and controls (OR = 0.66, 95% CI = 0.48-0.89, p(adj) = 0.049). No statistically significant difference between the histological subtypes seminoma and non-seminoma was observed. Our results may suggest a possible association between genetic variation in the LHR gene and the risk of developing TC. © 2012 The Authors. International Journal of Andrology © 2012 European Academy of Andrology.
A scan statistic to extract causal gene clusters from case-control genome-wide rare CNV data
Directory of Open Access Journals (Sweden)
Scherer Stephen W
2011-05-01
Full Text Available Abstract Background Several statistical tests have been developed for analyzing genome-wide association data by incorporating gene pathway information in terms of gene sets. Using these methods, hundreds of gene sets are typically tested, and the tested gene sets often overlap. This overlapping greatly increases the probability of generating false positives, and the results obtained are difficult to interpret, particularly when many gene sets show statistical significance. Results We propose a flexible statistical framework to circumvent these problems. Inspired by spatial scan statistics for detecting clustering of disease occurrence in the field of epidemiology, we developed a scan statistic to extract disease-associated gene clusters from a whole gene pathway. Extracting one or a few significant gene clusters from a global pathway limits the overall false positive probability, which results in increased statistical power, and facilitates the interpretation of test results. In the present study, we applied our method to genome-wide association data for rare copy-number variations, which have been strongly implicated in common diseases. Application of our method to a simulated dataset demonstrated the high accuracy of this method in detecting disease-associated gene clusters in a whole gene pathway. Conclusions The scan statistic approach proposed here shows a high level of accuracy in detecting gene clusters in a whole gene pathway. This study has provided a sound statistical framework for analyzing genome-wide rare CNV data by incorporating topological information on the gene pathway.
Blood lipid measurements. Variations and practical utility.
Cooper, G R; Myers, G L; Smith, S J; Schlant, R C
1992-03-25
To describe the magnitude and impact of the major biological and analytical sources of variation in serum lipid and lipoprotein levels on risk of coronary heart disease; to present a way to qualitatively estimate the total intraindividual variation; and to demonstrate how to determine the number of specimens required to estimate, with 95% confidence, the "true" underlying total cholesterol value in the serum of a patient. Representative references on each source of variation were selected from more than 300 reviewed publications, most published within the past 5 years, to document current findings and concepts. Most articles reviewed were in English. Studies on biological sources of variation were selected using the following criteria: representative of published findings, clear statement of either significant or insignificant results, and acquisition of clinical and laboratory data under standardized conditions. Representative results for special populations such as women and children are reported when results differ from those of adult men. References were selected based on acceptable experimental design and use of standardized laboratory lipid measurements. The lipid levels considered representative for a selected source of variation arose from quantitative measurements by a suitably standardized laboratory. Statistical analysis of data was examined to assure reliability. The proposed method of estimating the biological coefficient of variation must be considered to give qualitative results, because only two or three serial specimens are collected in most cases for the estimation. Concern has arisen about the magnitude, impact, and interpretation of preanalytical as well as analytical sources of variation on reported results of lipid measurements of an individual. Preanalytical sources of variation from behavioral, clinical, and sampling sources constitute about 60% of the total variation in a reported lipid measurement of an individual. A technique is presented
Chen, Zuying; Godfrey-Bailey, Linda; Schiff, Isaac; Hauser, Russ
2004-01-01
Background To investigate the relationship of human semen parameters with season, age and smoking status. Methods The present study used data from subjects recruited into an ongoing cross-sectional study on the relationship between environmental agents and semen characteristics. Our population consisted of 306 patients who presented to the Vincent Memorial Andrology Laboratory of Massachusetts General Hospital for semen evaluation. Sperm concentration and motility were measured with computer aided sperm analysis (CASA). Sperm morphology was scored using Tygerberg Kruger strict criteria. Regression analyses were used to investigate the relationships between semen parameters and season, age and smoking status, adjusting for abstinence interval. Results Sperm concentration in the spring was significantly higher than in winter, fall and summer (p seasons. There were no statistically significant relationships between semen parameters and smoking status, though current smokers tended to have lower sperm concentration. We also did not find a statistically significant relationship between age and semen parameters. Conclusions We found seasonal variations in sperm concentration and suggestive evidence of seasonal variation in sperm motility and percent sperm with normal morphology. Although smoking status was not a significant predictor of semen parameters, this may have been due to the small number of current smokers in the study. PMID:15507127
Detecting hippocampal shape changes in Alzheimer's disease using statistical shape models
Shen, Kaikai; Bourgeat, Pierrick; Fripp, Jurgen; Meriaudeau, Fabrice; Salvado, Olivier
2011-03-01
The hippocampus is affected at an early stage in the development of Alzheimer's disease (AD). Using brain Magnetic Resonance (MR) images, we can investigate the effect of AD on the morphology of the hippocampus. Statistical shape models (SSM) are usually used to describe and model the hippocampal shape variations among the population. We use the shape variation from SSM as features to classify AD from normal control cases (NC). Conventional SSM uses principal component analysis (PCA) to compute the modes of variations among the population. Although these modes are representative of variations within the training data, they are not necessarily discriminant on labelled data. In this study, a Hotelling's T² test is used to qualify the landmarks which can be used for PCA. The resulting variation modes are used as predictors of AD from NC. The discrimination ability of these predictors is evaluated in terms of their classification performances using support vector machines (SVM). Using only landmarks statistically discriminant between AD and NC in SSM showed a better separation between AD and NC. These predictors also showed better correlation to the cognitive scores such as mini-mental state examination (MMSE) and Alzheimer's disease assessment scale (ADAS).
Gooya, Ali; Lekadir, Karim; Alba, Xenia; Swift, Andrew J; Wild, Jim M; Frangi, Alejandro F
2015-01-01
Construction of Statistical Shape Models (SSMs) from arbitrary point sets is a challenging problem due to significant shape variation and lack of explicit point correspondence across the training data set. In medical imaging, point sets can generally represent different shape classes that span healthy and pathological exemplars. In such cases, the constructed SSM may not generalize well, largely because the probability density function (pdf) of the point sets deviates from the underlying assumption of Gaussian statistics. To this end, we propose a generative model for unsupervised learning of the pdf of point sets as a mixture of distinctive classes. A Variational Bayesian (VB) method is proposed for making joint inferences on the labels of point sets, and the principal modes of variations in each cluster. The method provides a flexible framework to handle point sets with no explicit point-to-point correspondences. We also show that by maximizing the marginalized likelihood of the model, the optimal number of clusters of point sets can be determined. We illustrate this work in the context of understanding the anatomical phenotype of the left and right ventricles in heart. To this end, we use a database containing hearts of healthy subjects, patients with Pulmonary Hypertension (PH), and patients with Hypertrophic Cardiomyopathy (HCM). We demonstrate that our method can outperform traditional PCA in both generalization and specificity measures.
Alisa P. Ramakrishnan; Susan Meyer; Daniel J. Fairbanks; Craig E. Coleman
2006-01-01
Bromus tectorum (cheatgrass or downy brome) is an exotic annual weed that is abundant in western USA. We examined variation in six microsatellite loci for 17 populations representing a range of habitats in Utah, Idaho, Nevada and Colorado (USA) and then intensively sampled four representative populations, for a total sample size of approximately 1000 individuals. All...
Directory of Open Access Journals (Sweden)
Zhifeng Zhang
2017-09-01
Objective The vertebral number is associated with body length and carcass traits, which represents an economically important trait in farm animals. The variation of vertebral number has been observed in a few mammalian species. However, the variation of vertebral number and quantitative trait loci in sheep breeds have not been well addressed. Methods In our investigation, the information including gender, age, carcass weight, carcass length and the number of thoracic and lumbar vertebrae from 624 China Kazakh sheep was collected. The effect of vertebral number variation on carcass weight and carcass length was estimated by general linear model. Further, the polymorphic sites of Vertnin (VRTN) gene were identified by sequencing, and the association of the genotype and vertebral number variation was analyzed by the one-way analysis of variance model. Results The variation of thoracolumbar vertebrae number in Kazakh sheep (18 to 20) was smaller than that in Texel sheep (17 to 21). The individuals with 19 thoracolumbar vertebrae (T13L6) were dominant in Kazakh sheep (79.2%). The association study showed that the numbers of thoracolumbar vertebrae were positively correlated with the carcass length and carcass weight, statistically significant with carcass length. To investigate the association of thoracolumbar vertebrae number with VRTN gene, we genotyped the VRTN gene. A total of 9 polymorphic sites were detected and only a single nucleotide polymorphism (SNP) (rs426367238) was suggested to associate with thoracic vertebral number statistically. Conclusion The variation of thoracolumbar vertebrae number was positively associated with the carcass length and carcass weight, especially with the carcass length. VRTN gene polymorphism of the SNP (rs426367238) with significant effect on thoracic vertebral number could serve as a candidate marker to further evaluate its role in influencing thoracolumbar vertebral number.
Zhang, Zhifeng; Sun, Yawei; Du, Wei; He, Sangang; Liu, Mingjun; Tian, Changyan
2017-09-01
The vertebral number is associated with body length and carcass traits, which represents an economically important trait in farm animals. The variation of vertebral number has been observed in a few mammalian species. However, the variation of vertebral number and quantitative trait loci in sheep breeds have not been well addressed. In our investigation, the information including gender, age, carcass weight, carcass length and the number of thoracic and lumbar vertebrae from 624 China Kazakh sheep was collected. The effect of vertebral number variation on carcass weight and carcass length was estimated by general linear model. Further, the polymorphic sites of Vertnin (VRTN) gene were identified by sequencing, and the association of the genotype and vertebral number variation was analyzed by the one-way analysis of variance model. The variation of thoracolumbar vertebrae number in Kazakh sheep (18 to 20) was smaller than that in Texel sheep (17 to 21). The individuals with 19 thoracolumbar vertebrae (T13L6) were dominant in Kazakh sheep (79.2%). The association study showed that the numbers of thoracolumbar vertebrae were positively correlated with the carcass length and carcass weight, statistically significant with carcass length. To investigate the association of thoracolumbar vertebrae number with VRTN gene, we genotyped the VRTN gene. A total of 9 polymorphic sites were detected and only a single nucleotide polymorphism (SNP) (rs426367238) was suggested to associate with thoracic vertebral number statistically. The variation of thoracolumbar vertebrae number was positively associated with the carcass length and carcass weight, especially with the carcass length. VRTN gene polymorphism of the SNP (rs426367238) with significant effect on thoracic vertebral number could serve as a candidate marker to further evaluate its role in influencing thoracolumbar vertebral number.
Statistical Analysis of Data for Timber Strengths
DEFF Research Database (Denmark)
Sørensen, John Dalsgaard
2003-01-01
Statistical analyses are performed for material strength parameters from a large number of specimens of structural timber. Non-parametric statistical analysis and fits have been investigated for the following distribution types: Normal, Lognormal, 2 parameter Weibull and 3-parameter Weibull...... fits to the data available, especially if tail fits are used whereas the Log Normal distribution generally gives a poor fit and larger coefficients of variation, especially if tail fits are used. The implications on the reliability level of typical structural elements and on partial safety factors...... for timber are investigated....
Net analyte signal based statistical quality control
Skibsted, E.T.S.; Boelens, H.F.M.; Westerhuis, J.A.; Smilde, A.K.; Broad, N.W.; Rees, D.R.; Witte, D.T.
2005-01-01
Net analyte signal statistical quality control (NAS-SQC) is a new methodology to perform multivariate product quality monitoring based on the net analyte signal approach. The main advantage of NAS-SQC is that the systematic variation in the product due to the analyte (or property) of interest is
Vascular Variations Associated with Intracranial Aneurysms.
Orakdogen, Metin; Emon, Selin Tural; Somay, Hakan; Engin, Taner; Is, Merih; Hakan, Tayfun
2017-01-01
To investigate the vascular variations in patients with intracranial aneurysm in the circle of Willis. We used the data on 128 consecutive intracranial aneurysm cases. Cerebral angiography images were analyzed retrospectively. Arteries were grouped as anterior cerebral arterial system (ACS), posterior cerebral arterial system (PCS) and middle cerebral arterial system (MCS) for grouping vascular variations. Lateralization, single versus multiple occurrence, and gender were inspected, as was any connection with the accompanying aneurysms' number, localization, dimension, and bleeding or incidental status. Variations were demonstrated in 57.8% of the cases. The most common variation was A1 variation (34.4%). The rate of variations was 36.7%, 24.2% and 10.2% respectively in ACS, PCS and MCS. MCS variations were significantly higher in males. Anterior communicating artery (ACoA) aneurysm observance rates were significantly higher and posterior communicating artery (PCoA) aneurysm and middle cerebral artery (MCA) aneurysm observance rates were significantly lower when compared to "no ACS variation detected" cases. In "PCS variation detected" cases, PCoA aneurysm observance rates and coexistence of multiple variations were significantly higher. The rate of vascular variations in patients with aneurysms was 57.8%. Arterial hypoplasia and aplasia were the most common variations. ACS was the most common region in which variations were located; they were mostly detected on the right side. Coexistence of ACoA aneurysm was higher than PCoA and MCA aneurysms. In the PCS variations group, PCoA aneurysms were the most common aneurysms accompanying the variation, and multiple variations were more common than in the other two groups. The variations in MCS were most common in males.
Genetic variation of inbreeding depression among floral and fitness traits in Silene nutans
DEFF Research Database (Denmark)
Thiele, Jan; Hansen, Thomas Møller; Siegismund, Hans Redlef
2010-01-01
The magnitude and variation of inbreeding depression (ID) within populations is important for the evolution and maintenance of mixed mating systems. We studied ID and its genetic variation in a range of floral and fitness traits in a small and large population of the perennial herb Silene nutans......, using controlled pollinations in a fully factorial North Carolina II design. Floral traits and early fitness traits, that is seed mass and germination rate, were not much affected by inbreeding (delta0.4). Lack of genetic correlations indicated that ID in floral, early and late traits is genetically...... was statistically significant in most floral and all seed traits, but not in late fitness traits. However, some paternal families had delta...
Annual and semiannual variations of the cosmic radiation
International Nuclear Information System (INIS)
Khor, H.P.; Kwok, W.K.; Owens, A.J.
1979-01-01
We determine the annual and semiannual harmonics in the Deep River Neutron Monitor counting rate for the years 1960-1975. A new Fourier analysis technique is used to eliminate solar cycle variations, and we discuss the statistical errors in the determination of the harmonics. The annual and semiannual waves changed markedly from year to year. The yearly harmonic has an average amplitude of approximately 0.6% with a maximum in early March, corresponding to a southward anisotropy of approximately 5%/AU perpendicular to the solar equatorial plane. The semiannual harmonic shows no phase coherence and its average amplitude is only marginally significant, ≲0.2%.
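As a rough illustration of what extracting an annual or semiannual harmonic involves, the sketch below projects an evenly sampled series onto sine and cosine terms. It is not the authors' Fourier technique (which additionally removes solar cycle variations); the function name `harmonic` and its arguments are hypothetical, and it assumes evenly spaced samples covering a whole number of years.

```python
import math

def harmonic(values, samples_per_year, k):
    """Amplitude and time of maximum (as a fraction of a year) of the k-th
    yearly harmonic (k=1 annual, k=2 semiannual).

    Assumption: evenly spaced samples spanning a whole number of years,
    so that the sine and cosine terms are orthogonal over the record.
    """
    n = len(values)
    mean = sum(values) / n
    a = b = 0.0
    for i, v in enumerate(values):
        angle = 2.0 * math.pi * k * i / samples_per_year
        a += (v - mean) * math.cos(angle)
        b += (v - mean) * math.sin(angle)
    a *= 2.0 / n
    b *= 2.0 / n
    amplitude = math.hypot(a, b)
    # a*cos(wt) + b*sin(wt) peaks where wt equals atan2(b, a)
    t_max = (math.atan2(b, a) / (2.0 * math.pi * k)) % (1.0 / k)
    return amplitude, t_max
```

Dividing the returned amplitude by the mean counting rate gives a percentage amplitude comparable in form to the values quoted in the abstract.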
Studies of ecomorphological variations of the European hare (Lepus europaeus) in Turkey
Directory of Open Access Journals (Sweden)
Demirbaş Y.
2013-01-01
Hares (Lepus spp.) are widely distributed across the globe and are adapted to diverse climatic conditions. In order to study the ecomorphological variations of hares from Turkey, the body and cranial measurements and body weight, as well as coat color types, of 138 hares collected from all over Turkey between 2006 and 2012, were examined. Statistically significant differences between regional samples (p < 0.05, ANOVA) were found only in terms of body weight and hindfoot length; however, there were a good number of external phenotypes, particularly in terms of coat color variants of the hare specimens. Furthermore, populations had similar variations in terms of morphometric measurement, body weight and coat coloration between different geographical regions. Turkish hares did not exhibit clinal variations from south to north in body and cranial measurements depending on the mean annual temperatures and precipitation. Therefore, it was assumed that all of these variations might be a polymorphism related to the local adaptations and high level of admixture of gene pools in Anatolia.
After statistics reform : Should we still teach significance testing?
A. Hak (Tony)
2014-01-01
In the longer term null hypothesis significance testing (NHST) will disappear because p-values are not informative and not replicable. Should we continue to teach in the future the procedures of then abolished routines (i.e., NHST)? Three arguments are discussed for not teaching NHST in
Updated constraints on spatial variations of the fine-structure constant
Directory of Open Access Journals (Sweden)
A.M.M. Pinho
2016-05-01
Recent work by Webb et al. has provided indications of spatial variations of the fine-structure constant, α, at a level of a few parts per million. Using a dataset of 293 archival measurements, they further show that a dipole provides a statistically good fit to the data, a result subsequently confirmed by other authors. Here we show that a more recent dataset of dedicated measurements further constrains these variations: although there are only 10 such measurements, their uncertainties are considerably smaller. We find that a dipolar variation is still a good fit to the combined dataset, but the amplitude of such a dipole must be somewhat smaller: 8.1±1.7 ppm for the full dataset, versus 9.4±2.2 ppm for the Webb et al. data alone, both at the 68.3% confidence level. Constraints on the direction on the sky of such a dipole are also significantly improved. On the other hand the data can't yet discriminate between a pure spatial dipole and one with an additional redshift dependence.
Statistical science: a grammar for research.
Cox, David R
2017-06-01
I greatly appreciate the invitation to give this lecture with its century long history. The title is a warning that the lecture is rather discursive and not highly focused and technical. The theme is simple. That statistical thinking provides a unifying set of general ideas and specific methods relevant whenever appreciable natural variation is present. To be most fruitful these ideas should merge seamlessly with subject-matter considerations. By contrast, there is sometimes a temptation to regard formal statistical analysis as a ritual to be added after the serious work has been done, a ritual to satisfy convention, referees, and regulatory agencies. I want implicitly to refute that idea.
International Nuclear Information System (INIS)
Gilbert, R.O.; Bernhardt, D.E.; Hahn, P.B.
1983-01-01
A summary of a field soil sampling study conducted around the Rocky Flats Colorado plant in May 1977 is presented. Several different soil sampling techniques that had been used in the area were applied at four different sites. One objective was to compare the average 239-240Pu concentration values obtained by the various soil sampling techniques used. There was also interest in determining whether there are differences in the reproducibility of the various techniques and how the techniques compared with the proposed EPA technique of sampling to 1 cm depth. Statistically significant differences in average concentrations between the techniques were found. The differences could be largely related to the differences in sampling depth, the primary physical variable between the techniques. The reproducibility of the techniques was evaluated by comparing coefficients of variation. Differences between coefficients of variation were not statistically significant. Average (median) coefficients ranged from 21 to 42 percent for the five sampling techniques. A laboratory study indicated that various sample treatment and particle sizing techniques could increase the concentration of plutonium in the less than 10 micrometer size fraction by up to a factor of about 4 compared to the 2 mm size fraction.
International Nuclear Information System (INIS)
Bock, B.
1986-01-01
An analysis of hormone measurements in sera from healthy volunteers and patients that was carried out on the basis of different criteria yielded the following results: 1) The testosterone levels determined in the patients' sera were significantly lower than those of the healthy individuals and the daily rhythmic variations seen here did not attain statistical significance. 2) There were no statistically relevant differences in the serum concentrations of cortisol between healthy individuals and patients, nor was the amplitude of the daily variations observed to be changed in a consistent way. 3) In the patients, as compared to the healthy individuals, the prolactin level was considerably increased, as was the amplitude of the daily rhythmic variations. 4) The values determined for the human growth hormone (HCG) varied considerably between the individuals of either group. Since this held true for both the fluctuations with time and the height of the serum concentrations, a statistical analysis of the results appeared pointless. The results confirm that central and autonomous components have an important role in ectopic eczemae. (TRV) [de
Chounlamany, Vanseng; Tanchuling, Maria Antonia; Inoue, Takanobu
2017-09-01
Payatas landfill in Quezon City, Philippines, releases leachate to the Marikina River through a creek. Multivariate statistical techniques were applied to study temporal and spatial variations in water quality of a segment of the Marikina River. The data set included 12 physico-chemical parameters for five monitoring stations over a year. Cluster analysis grouped the monitoring stations into four clusters and identified January-May as dry season and June-September as wet season. Principal components analysis showed that three latent factors are responsible for the data set explaining 83% of its total variance. The chemical oxygen demand, biochemical oxygen demand, total dissolved solids, Cl⁻ and PO₄³⁻ are influenced by anthropogenic impact/eutrophication pollution from point sources. Total suspended solids, turbidity and SO₄²⁻ are influenced by rain and soil erosion. The highest state of pollution is at the Payatas creek outfall from March to May, whereas at downstream stations it is in May. The current study indicates that the river monitoring requires only four stations, nine water quality parameters and testing over three specific months of the year. The findings of this study imply that Payatas landfill requires a proper leachate collection and treatment system to reduce its impact on the Marikina River.
Inferring Demographic History Using Two-Locus Statistics.
Ragsdale, Aaron P; Gutenkunst, Ryan N
2017-06-01
Population demographic history may be learned from contemporary genetic variation data. Methods based on aggregating the statistics of many single loci into an allele frequency spectrum (AFS) have proven powerful, but such methods ignore potentially informative patterns of linkage disequilibrium (LD) between neighboring loci. To leverage such patterns, we developed a composite-likelihood framework for inferring demographic history from aggregated statistics of pairs of loci. Using this framework, we show that two-locus statistics are more sensitive to demographic history than single-locus statistics such as the AFS. In particular, two-locus statistics escape the notorious confounding of depth and duration of a bottleneck, and they provide a means to estimate effective population size based on the recombination rather than mutation rate. We applied our approach to a Zambian population of Drosophila melanogaster. Notably, using both single- and two-locus statistics, we inferred a substantially lower ancestral effective population size than previous works and did not infer a bottleneck history. Together, our results demonstrate the broad potential for two-locus statistics to enable powerful population genetic inference. Copyright © 2017 by the Genetics Society of America.
Worry, Intolerance of Uncertainty, and Statistics Anxiety
Williams, Amanda S.
2013-01-01
Statistics anxiety is a problem for most graduate students. This study investigates the relationship between intolerance of uncertainty, worry, and statistics anxiety. Intolerance of uncertainty was significantly related to worry, and worry was significantly related to three types of statistics anxiety. Six types of statistics anxiety were…
Chiou, Chei-Chang; Wang, Yu-Min; Lee, Li-Tze
2014-08-01
Statistical knowledge is widely used in academia; however, statistics teachers struggle with the issue of how to reduce students' statistics anxiety and enhance students' statistics learning. This study assesses the effectiveness of a "one-minute paper strategy" in reducing students' statistics-related anxiety and in improving students' statistics-related achievement. Participants were 77 undergraduates from two classes enrolled in applied statistics courses. An experiment was implemented according to a pretest/posttest comparison group design. The quasi-experimental design showed that the one-minute paper strategy significantly reduced students' statistics anxiety and improved students' statistics learning achievement. The strategy was a better instructional tool than the textbook exercise for reducing students' statistics anxiety and improving students' statistics achievement.
Kossobokov, V.G.; Romashkova, L.L.; Keilis-Borok, V. I.; Healy, J.H.
1999-01-01
Algorithms M8 and MSc (i.e., the Mendocino Scenario) were used in a real-time intermediate-term research prediction of the strongest earthquakes in the Circum-Pacific seismic belt. Predictions are made by M8 first. Then, the areas of alarm are reduced by MSc at the cost that some earthquakes are missed in the second approximation of prediction. In 1992-1997, five earthquakes of magnitude 8 and above occurred in the test area: all of them were predicted by M8, and MSc correctly identified the locations of four of them. The space-time volume of the alarms is 36% and 18%, respectively, when estimated with a normalized product measure of empirical distribution of epicenters and uniform time. The statistical significance of the achieved results is beyond 99% both for M8 and MSc. For magnitude 7.5+, 10 out of 19 earthquakes were predicted by M8 in 40% and five were predicted by M8-MSc in 13% of the total volume considered. This implies a significance level of 81% for M8 and 92% for M8-MSc. The lower significance levels might result from a global change in seismic regime in 1993-1996, when the rate of the largest events doubled and all of them became exclusively normal or reversed faults. The predictions are fully reproducible; the algorithms M8 and MSc in complete formal definitions were published before we started our experiment [Keilis-Borok, V.I., Kossobokov, V.G., 1990. Premonitory activation of seismic flow: Algorithm M8, Phys. Earth and Planet. Inter. 61, 73-83; Kossobokov, V.G., Keilis-Borok, V.I., Smith, S.W., 1990. Localization of intermediate-term earthquake prediction, J. Geophys. Res., 95, 19763-19772; Healy, J.H., Kossobokov, V.G., Dewey, J.W., 1992. A test to evaluate the earthquake prediction algorithm, M8. U.S. Geol. Surv. OFR 92-401]. M8 is available from the IASPEI Software Library [Healy, J.H., Keilis-Borok, V.I., Lee, W.H.K. (Eds.), 1997. Algorithms for Earthquake Statistics and Prediction, Vol. 6. IASPEI Software Library]. © 1999 Elsevier
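The significance levels quoted above are consistent with a simple binomial argument: if alarms cover a fraction p of the space-time volume, a random guess captures each earthquake with probability p. The abstract does not state which test the authors used, so the sketch below is an assumption; the function names are hypothetical.

```python
from math import comb

def chance_probability(n_events, n_predicted, alarm_fraction):
    """Probability that at least n_predicted of n_events fall inside the
    alarms purely by chance, treating events as independent trials with
    success probability equal to the alarm fraction (an assumption)."""
    p = alarm_fraction
    return sum(comb(n_events, k) * p**k * (1.0 - p)**(n_events - k)
               for k in range(n_predicted, n_events + 1))

def significance(n_events, n_predicted, alarm_fraction):
    """One minus the chance probability."""
    return 1.0 - chance_probability(n_events, n_predicted, alarm_fraction)

# Inputs taken from the abstract: (events, predicted, alarm volume)
for label, args in [("M8, M8.0+", (5, 5, 0.36)),
                    ("MSc, M8.0+", (5, 4, 0.18)),
                    ("M8, M7.5+", (19, 10, 0.40)),
                    ("M8-MSc, M7.5+", (19, 5, 0.13))]:
    print(label, round(100 * significance(*args), 1))
```

Under this assumption the two magnitude 8 cases come out above 99%, and the magnitude 7.5+ cases come out near 81% and 91%, close to the reported 81% and 92%; the small difference may reflect rounding of the alarm volumes.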
Effect of variations in rainfall intensity on slope stability in Singapore
Directory of Open Access Journals (Sweden)
Christofer Kristo
2017-12-01
A large body of scientific evidence has given credence to the existence and deleterious impacts of climate change. One aspect of climate change is the variations in rainfall patterns, which affect the flux boundary condition across ground surface. A possible disastrous consequence of this change is the occurrence of rainfall-induced slope failures. This paper aims to investigate the variations in rainfall patterns in Singapore and its effect on slope stability. Singapore's historical rainfall data from Seletar and Paya Lebar weather stations for the period of 1985-2009 were obtained and analysed by duration using linear regression. A general increasing trend was observed in both weather stations, with a possible shift to longer duration rainfall events, despite being statistically insignificant according to the Mann-Kendall test. Using the derived trends, projected rainfall intensities in 2050 and 2100 were used in the seepage and slope stability analyses performed on a typical residual soil slope in Singapore. A significant reduction in factor of safety was observed in the next 50 years, with only a marginal decrease in factor of safety in the subsequent 50 years. This indicates a possible detrimental effect of variations in rainfall patterns on slope stability in Singapore, especially in the next 50 years. The statistical analyses on rainfall data from Seletar and Paya Lebar weather stations for the period of 1985-2009 indicated that rainfall intensity tends to increase over the years, with a possible shift to longer duration rainfall events in the future. The stability analyses showed a significant decrease in factor of safety from 2003 to 2050 due to increase in rainfall intensity, suggesting that a climate change might have existed beyond 2009 with possibly detrimental effects to slope stability. Keywords: Climate change, Rainfall, Seepage, Slope stability
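For readers unfamiliar with the Mann-Kendall trend test mentioned above, a minimal sketch follows. It uses the normal approximation with a continuity correction and, as a simplifying assumption, omits the correction for tied values; the function name is hypothetical and the paper's exact implementation is not given in the abstract.

```python
import math

def mann_kendall(series):
    """Mann-Kendall trend test (normal approximation, no tie correction).

    Returns (S, z, two-sided p-value). Positive S suggests an upward trend.
    """
    n = len(series)
    s = 0
    for i in range(n - 1):
        for j in range(i + 1, n):
            # sign of the later value minus the earlier value
            s += (series[j] > series[i]) - (series[j] < series[i])
    var_s = n * (n - 1) * (2 * n + 5) / 18.0
    if s > 0:
        z = (s - 1) / math.sqrt(var_s)
    elif s < 0:
        z = (s + 1) / math.sqrt(var_s)
    else:
        z = 0.0
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return s, z, p_value
```

A trend is usually called statistically insignificant, as in the abstract, when the p-value exceeds the chosen level (commonly 0.05).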
International Nuclear Information System (INIS)
Martin, Robert P.; Nutt, William T.
2011-01-01
Research highlights: → Historical recitation on application of order-statistics models to nuclear power plant thermal-hydraulics safety analysis. → Interpretation of regulatory language regarding 10 CFR 50.46 reference to a 'high level of probability'. → Derivation and explanation of order-statistics-based evaluation methodologies considering multi-variate acceptance criteria. → Summary of order-statistics models and recommendations to the nuclear power plant thermal-hydraulics safety analysis community. - Abstract: The application of order-statistics in best-estimate plus uncertainty nuclear safety analysis has received a considerable amount of attention from methodology practitioners, regulators, and academia. At the root of the debate are two questions: (1) what is an appropriate quantitative interpretation of 'high level of probability' in regulatory language appearing in the LOCA rule, 10 CFR 50.46 and (2) how best to mathematically characterize the multi-variate case. An original derivation is offered to provide a quantitative basis for 'high level of probability.' At root of the second question is whether one should recognize a probability statement based on the tolerance region method of Wald and Guba, et al., for multi-variate problems, one explicitly based on the regulatory limits, best articulated in the Wallis-Nutt 'Testing Method', or something else entirely. This paper reviews the origins of the different positions, key assumptions, limitations, and relationship to addressing acceptance criteria. It presents a mathematical interpretation of the regulatory language, including a complete derivation of uni-variate order-statistics (as credited in AREVA's Realistic Large Break LOCA methodology) and extension to multi-variate situations. Lastly, it provides recommendations for LOCA applications, endorsing the 'Testing Method' and addressing acceptance methods allowing for limited sample failures.
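The univariate order-statistics argument can be illustrated with the standard one-sided, nonparametric tolerance-limit formula (often attributed to Wilks). The sketch below assumes that formula and a 95% coverage / 95% confidence reading of 'high level of probability'; neither the function names nor these defaults come from the abstract, and the multi-variate methods it discusses are not covered.

```python
from math import comb

def confidence(n, coverage=0.95, allowed_failures=0):
    """Confidence that the (allowed_failures + 1)-th largest of n random
    samples bounds the `coverage` quantile of the output distribution."""
    miss = sum(comb(n, k) * (1.0 - coverage)**k * coverage**(n - k)
               for k in range(allowed_failures + 1))
    return 1.0 - miss

def min_samples(coverage=0.95, conf=0.95, allowed_failures=0):
    """Smallest number of code runs that reaches the requested confidence."""
    n = allowed_failures + 1
    while confidence(n, coverage, allowed_failures) < conf:
        n += 1
    return n

print(min_samples())                    # first-order 95/95 case
print(min_samples(allowed_failures=1))  # one sample allowed above the bound
```

Allowing a limited number of samples to exceed the bound raises the required number of runs, which is the trade-off behind acceptance methods that tolerate limited sample failures.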
Huffman and linear scanning methods with statistical language models.
Roark, Brian; Fried-Oken, Melanie; Gibbons, Chris
2015-03-01
Current scanning access methods for text generation in AAC devices are limited to relatively few options, most notably row/column variations within a matrix. We present Huffman scanning, a new method for applying statistical language models to binary-switch, static-grid typing AAC interfaces, and compare it to other scanning options under a variety of conditions. We present results for 16 adults without disabilities and one 36-year-old man with locked-in syndrome who presents with complex communication needs and uses AAC scanning devices for writing. Huffman scanning with a statistical language model yielded significant typing speedups for the 16 participants without disabilities versus any of the other methods tested, including two row/column scanning methods. A similar pattern of results was found with the individual with locked-in syndrome. Interestingly, faster typing speeds were obtained with Huffman scanning using a more leisurely scan rate than relatively fast individually calibrated scan rates. Overall, the results reported here demonstrate great promise for the usability of Huffman scanning as a faster alternative to row/column scanning.
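The idea behind Huffman scanning can be sketched as follows: symbol probabilities from a language model are turned into a binary Huffman code, and each bit corresponds to one yes/no switch decision, so likely symbols need fewer scan steps. The probabilities below are hypothetical placeholders rather than output of the authors' language model, and the function name is an assumption.

```python
import heapq
from itertools import count

def huffman_codes(probabilities):
    """Build a binary Huffman code from a {symbol: probability} mapping.

    Returns {symbol: bit string}. A counter breaks ties so that heap
    entries never compare the dictionaries directly.
    """
    tie = count()
    heap = [(p, next(tie), {sym: ""}) for sym, p in probabilities.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        p0, _, codes0 = heapq.heappop(heap)   # least probable subtree
        p1, _, codes1 = heapq.heappop(heap)   # next least probable subtree
        merged = {s: "0" + c for s, c in codes0.items()}
        merged.update({s: "1" + c for s, c in codes1.items()})
        heapq.heappush(heap, (p0 + p1, next(tie), merged))
    return heap[0][2]

# Hypothetical next-letter probabilities for illustration only
example = {"e": 0.5, "t": 0.25, "a": 0.15, "q": 0.10}
for symbol, code in sorted(huffman_codes(example).items()):
    print(symbol, code)
```

In a real interface the probabilities would be recomputed from the language model after every selection, so the code changes with context.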
A look at the links between drainage density and flood statistics
Directory of Open Access Journals (Sweden)
A. Montanari
2009-07-01
We investigate the links between the drainage density of a river basin and selected flood statistics, namely, mean, standard deviation, coefficient of variation and coefficient of skewness of annual maximum series of peak flows. The investigation is carried out through a three-stage analysis. First, a numerical simulation is performed by using a spatially distributed hydrological model in order to highlight how flood statistics change with varying drainage density. Second, a conceptual hydrological model is used in order to analytically derive the dependence of flood statistics on drainage density. Third, real world data from 44 watersheds located in northern Italy were analysed. The three-level analysis seems to suggest that a critical value of the drainage density exists for which a minimum is attained in both the coefficient of variation and the absolute value of the skewness coefficient. Such minima in the flood statistics correspond to a minimum of the flood quantile for a given exceedance probability (i.e., recurrence interval). Therefore, the results of this study may provide useful indications for flood risk assessment in ungauged basins.
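The four flood statistics named above can be computed from an annual maximum series as in the sketch below. The estimator choices (sample standard deviation with n-1, moment-based skewness without bias correction) are assumptions, since the abstract does not specify them, and the function name is hypothetical.

```python
import math

def flood_statistics(annual_maxima):
    """Mean, standard deviation, coefficient of variation and skewness
    of a series of annual maximum peak flows."""
    n = len(annual_maxima)
    mean = sum(annual_maxima) / n
    deviations = [x - mean for x in annual_maxima]
    sd = math.sqrt(sum(d * d for d in deviations) / (n - 1))  # sample SD
    cv = sd / mean
    m2 = sum(d * d for d in deviations) / n
    m3 = sum(d ** 3 for d in deviations) / n
    skew = m3 / m2 ** 1.5   # moment estimator, no small-sample correction
    return mean, sd, cv, skew
```

Repeating this calculation across basins with different drainage densities would give the kind of comparison the study describes.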
Directory of Open Access Journals (Sweden)
Zheng-Yun Wu
2016-01-01
The use of multiple fermentations is one of the most specific characteristics of Maotai-flavoured liquor production. In this research, the variation of volatile composition of Maotai-flavoured liquor during its multiple fermentations is investigated using statistical approaches. Cluster analysis shows that the obtained samples are grouped mainly according to the fermentation steps rather than the distillery they originate from, and the samples from the first two fermentation steps show the greatest difference, suggesting that multiple fermentation and distillation steps result in the end in similar volatile composition of the liquor. Back-propagation neural network (BNN) models were developed that satisfactorily predict the number of fermentation steps and the organoleptic evaluation scores of liquor samples from their volatile compositions. Mean impact value (MIV) analysis shows that ethyl lactate, furfural and some high-boiling-point acids play important roles, while pyrazine contributes much less to the improvement of the flavour and taste of Maotai-flavoured liquor during its production. This study contributes to further understanding of the mechanisms of Maotai-flavoured liquor production.
Statistical Compression for Climate Model Output
Hammerling, D.; Guinness, J.; Soh, Y. J.
2017-12-01
Numerical climate model simulations run at high spatial and temporal resolutions generate massive quantities of data. As our computing capabilities continue to increase, storing all of the data is not sustainable, and thus it is important to develop methods for representing the full datasets by smaller compressed versions. We propose a statistical compression and decompression algorithm based on storing a set of summary statistics as well as a statistical model describing the conditional distribution of the full dataset given the summary statistics. We decompress the data by computing conditional expectations and conditional simulations from the model given the summary statistics. Conditional expectations represent our best estimate of the original data but are subject to oversmoothing in space and time. Conditional simulations introduce realistic small-scale noise so that the decompressed fields are neither too smooth nor too rough compared with the original data. Considerable attention is paid to accurately modeling the original dataset (one year of daily mean temperature data), particularly with regard to the inherent spatial nonstationarity in global fields, and to determining the statistics to be stored, so that the variation in the original data can be closely captured, while allowing for fast decompression and conditional emulation on modest computers.
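The difference between decompressing by conditional expectation and by conditional simulation can be illustrated with a deliberately simplified toy. The sketch below assumes independent Gaussian values within each block and stores only block means plus one pooled residual standard deviation; this is an assumption made for illustration and is far simpler than the nonstationary spatial model described in the abstract. All function names are hypothetical.

```python
import random
import statistics

def compress(series, block):
    """Keep one mean per block plus a pooled residual standard deviation."""
    blocks = [series[i:i + block] for i in range(0, len(series), block)]
    means = [statistics.fmean(b) for b in blocks]
    resid = [x - m for b, m in zip(blocks, means) for x in b]
    return {"means": means,
            "sizes": [len(b) for b in blocks],
            "sigma": statistics.pstdev(resid)}

def conditional_expectation(summary):
    """Best estimate under the toy model: every value equals its block mean."""
    out = []
    for m, k in zip(summary["means"], summary["sizes"]):
        out.extend([m] * k)
    return out

def conditional_simulation(summary, seed=0):
    """Add Gaussian noise, centred per block so the stored means are honoured."""
    rng = random.Random(seed)
    out = []
    for m, k in zip(summary["means"], summary["sizes"]):
        z = [rng.gauss(0.0, summary["sigma"]) for _ in range(k)]
        zbar = statistics.fmean(z)
        out.extend(m + zi - zbar for zi in z)
    return out

# Hypothetical temperature-like series, for illustration only
series = [10.0, 11.0, 9.0, 10.0, 20.0, 22.0, 18.0, 20.0]
summary = compress(series, block=4)
smooth = conditional_expectation(summary)      # oversmoothed reconstruction
noisy = conditional_simulation(summary, seed=1)  # same block means, realistic roughness
```

The expectation is piecewise constant, which is the oversmoothing the abstract mentions, while the simulation reproduces the stored block means exactly and restores small-scale variability.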
Liu, Wei; Ding, Jinhui
2018-04-01
The application of the principle of the intention-to-treat (ITT) to the analysis of clinical trials is challenged in the presence of missing outcome data. The consequences of stopping an assigned treatment in a withdrawn subject are unknown. It is difficult to make a single assumption about missing mechanisms for all clinical trials because there are complicated reactions in the human body to drugs due to the presence of complex biological networks, leading to data missing randomly or non-randomly. Currently there is no statistical method that can tell whether a difference between two treatments in the ITT population of a randomized clinical trial with missing data is significant at a pre-specified level. Making no assumptions about the missing mechanisms, we propose a generalized complete-case (GCC) analysis based on the data of completers. An evaluation of the impact of missing data on the ITT analysis reveals that a statistically significant GCC result implies a significant treatment effect in the ITT population at a pre-specified significance level unless, relative to the comparator, the test drug is poisonous to the non-completers as documented in their medical records. Applications of the GCC analysis are illustrated using literature data, and its properties and limits are discussed.
Response of noctilucent cloud brightness to daily solar variations
Dalin, P.; Pertsev, N.; Perminov, V.; Dubietis, A.; Zadorozhny, A.; Zalcik, M.; McEachran, I.; McEwan, T.; Černis, K.; Grønne, J.; Taustrup, T.; Hansen, O.; Andersen, H.; Melnikov, D.; Manevich, A.; Romejko, V.; Lifatova, D.
2018-04-01
For the first time, long-term data sets of ground-based observations of noctilucent clouds (NLC) around the globe have been analyzed in order to investigate a response of NLC to solar UV irradiance variability on a day-to-day scale. NLC brightness has been considered versus variations of solar Lyman-alpha flux. We have found that day-to-day solar variability, whose effect is generally masked in the natural NLC variability, has a statistically significant effect when considering large statistics for more than ten years. Average increase in day-to-day solar Lyman-α flux results in average decrease in day-to-day NLC brightness that can be explained by robust physical mechanisms taking place in the summer mesosphere. Average time lags between variations of Lyman-α flux and NLC brightness are short (0-3 days), suggesting a dominant role of direct solar heating and of the dynamical mechanism compared to photodissociation of water vapor by solar Lyman-α flux. All found regularities are consistent between various ground-based NLC data sets collected at different locations around the globe and for various time intervals. Signatures of a 27-day periodicity seem to be present in the NLC brightness for individual summertime intervals; however, this oscillation cannot be unambiguously retrieved due to inevitable periods of tropospheric cloudiness.
Green mathematics: Benefits of including biological variation in your data analysis
Tijskens, L.M.M.; Schouten, R.E.; Unuk, T.; Simcic, M.
2015-01-01
Biological variation is omnipresent in nature. It contains useful information that is neglected by the usually applied statistical procedures. To extract this information special procedures have to be applied. Biological variation is seen in properties (e.g. size, colour, firmness), but the
Human Genetic Variation and Yellow Fever Mortality during 19th Century U.S. Epidemics
2014-01-01
ABSTRACT We calculated the incidence, mortality, and case fatality rates for Caucasians and non-Caucasians during 19th century yellow fever (YF) epidemics in the United States and determined statistical significance for differences in the rates in different populations. We evaluated nongenetic host factors, including socioeconomic, environmental, cultural, demographic, and acquired immunity status that could have influenced these differences. While differences in incidence rates were not significant between Caucasians and non-Caucasians, differences in mortality and case fatality rates were statistically significant for all epidemics tested (P < 0.01). Caucasians diagnosed with YF were 6.8 times more likely to succumb than non-Caucasians with the disease. No other major causes of death during the 19th century demonstrated a similar mortality skew toward Caucasians. Nongenetic host factors were examined and could not explain these large differences. We propose that the remarkably lower case mortality rates for individuals of non-Caucasian ancestry is the result of human genetic variation in loci encoding innate immune mediators. PMID:24895309
Statistical analysis of laser-interferometric detector Dylkin-1 data and data on seismic activity
International Nuclear Information System (INIS)
Kirillov, R S; Bochkarev, V V; Skochilov, A F (Scientific Center of Gravitational-Wave Research Dulkyn, Academy of Sciences of the Republic of Tatarstan (Russian Federation))
2014-01-01
This work presents a statistical analysis of data collected from the laser-interferometric detector "Dylkin-1" and nearby seismic stations. The final goal of the Dylkin project is to create a detector of the theoretically predicted gravitational waves produced by binary relativistic astrophysical objects. Work is currently underway to improve the sensitivity of the detector by 2-3 orders of magnitude. The goals of this research were to test the isolation of the detector from noise caused by seismic waves and to find out whether it is sensitive to variations in the gradient of the gravitational potential (acceleration of free fall) caused by free Earth oscillations. Noise isolation has been tested by comparing the energy of signals during significant seismic events. Sensitivity to variations in the acceleration of free fall has been tested by means of cross-spectral analysis.
Tools to analyse and display variations in anatomical delineation
International Nuclear Information System (INIS)
Ebert, Martin A.; McDermott, L.N.; Haworth, A; Van der Wath, E.; Hooton, B.
2012-01-01
Variations in anatomical delineation, principally due to a combination of inter-observer contributions and image-specificity, remain one of the most significant impediments to geometrically-accurate radiotherapy. Quantification of spatial variability of the delineated contours comprising a structure can be made with a variety of metrics, and the availability of software tools to apply such metrics to data collected during inter-observer or repeat-imaging studies would allow their validation. A suite of such tools have been developed which use an Extensible Markup Language format for the exchange of delineated 3D structures with radiotherapy planning or review systems. These tools provide basic operations for manipulating and operating on individual structures and related structure sets, and for deriving statistics on spatial variations of contours that can be mapped onto the surface of a reference structure. Use of these tools on a sample dataset is demonstrated together with import and display of results in the SWAN treatment plan review system.
Geographic Variation in Oxaliplatin Chemotherapy and Survival in Patients With Colon Cancer.
Panchal, Janki M; Lairson, David R; Chan, Wenyaw; Du, Xianglin L
2016-01-01
Geographic disparity in colon cancer survival has received less attention, despite the fact that health care delivery varied across regions. To examine geographic variation in colon cancer survival and explore factors affecting this variation, including the use of oxaliplatin chemotherapy, we studied cases with resected stage-III colon cancer in 2004-2009, identified from the Surveillance, Epidemiology and End Results-Medicare linked database. A Cox proportional hazards model was used to estimate the effect of oxaliplatin-containing chemotherapy on survival across regions. Propensity score adjustments were made to control for potential selection bias and confounding. Rural regions showed the lowest 3-year survival, whereas big metro regions showed a better 3-year survival rate than any other region (67.3% in rural regions vs. 69.5% in big metro regions). The hazard ratio for patients residing in metro regions was comparable with that for patients residing in big metro regions (1.27, 95% confidence interval: 0.90-1.80). However, patients residing in urban areas exhibited lower mortality than those in other regions, although the difference was not statistically significant. Patients who received oxaliplatin chemotherapy were 23% less likely to die of cancer than those who received 5-fluorouracil-only chemotherapy, a statistically significant difference (adjusted hazard ratio = 0.77, 95% confidence interval: 0.63-0.95). In conclusion, there were some differences in survival across geographic regions, which were not statistically significant after adjusting for sociodemographic, tumor, chemotherapy, and other treatment characteristics. Oxaliplatin chemotherapy was associated with improved survival outcomes compared with 5-fluorouracil-only chemotherapy across regions. Further studies may evaluate the effects of other factors and newer chemotherapy regimens on mortality/survival of older patients.
Modeling stimulus variation in three common implicit attitude tasks.
Wolsiefer, Katie; Westfall, Jacob; Judd, Charles M
2017-08-01
We explored the consequences of ignoring the sampling variation due to stimuli in the domain of implicit attitudes. A large literature in psycholinguistics has examined the statistical treatment of random stimulus materials, but the recommendations from this literature have not been applied to the social psychological literature on implicit attitudes. This is partly because of inherent complications in applying crossed random-effect models to some of the most common implicit attitude tasks, and partly because no work to date has demonstrated that random stimulus variation is in fact consequential in implicit attitude measurement. We addressed this problem by laying out statistically appropriate and practically feasible crossed random-effect models for three of the most commonly used implicit attitude measures (the Implicit Association Test, affect misattribution procedure, and evaluative priming task) and then applying these models to large datasets (average N = 3,206) that assess participants' implicit attitudes toward race, politics, and self-esteem. We showed that the test statistics from the traditional analyses are substantially (about 60%) inflated relative to the more-appropriate analyses that incorporate stimulus variation. Because all three tasks used the same stimulus words and faces, we could also meaningfully compare the relative contributions of stimulus variation across the tasks. In an appendix, we give syntax in R, SAS, and SPSS for fitting the recommended crossed random-effects models to data from all three tasks, as well as instructions on how to structure the data file.
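For orientation, a generic crossed random-effects model with random intercepts for both participants and stimuli can be written as below. This is the textbook form only, given as an assumption for illustration; the task-specific models in the article (which may include random slopes and other terms) are not reproduced here, and the symbols are not taken from the source.

```latex
% Response of participant i to stimulus j under condition x_{ij}
y_{ij} = \beta_0 + \beta_1 x_{ij} + u_i + w_j + \varepsilon_{ij},
\qquad
u_i \sim \mathcal{N}(0, \sigma_u^2), \quad
w_j \sim \mathcal{N}(0, \sigma_w^2), \quad
\varepsilon_{ij} \sim \mathcal{N}(0, \sigma^2)
```

Dropping the stimulus term $w_j$ treats the stimuli as fixed, so stimulus-to-stimulus variation is absorbed into the apparent effect, which is the kind of inflation in test statistics the abstract describes.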
Genetic variation in the endangered Southwestern Willow Flycatcher
Busch, Joseph; Miller, Mark P.; Paxton, E.H.; Sogge, M.K.; Keim, Paul
2000-01-01
The Southwestern Willow Flycatcher (Empidonax traillii extimus) is an endangered Neotropical migrant that breeds in isolated remnants of dense riparian habitat in the southwestern United States. We estimated genetic variation at 20 breeding sites of the Southwestern Willow Flycatcher (290 individuals) using 38 amplified fragment length polymorphisms (AFLPs). Our results suggest that considerable genetic diversity exists within the subspecies and within local breeding sites. Statistical analyses of genetic variation revealed only slight, although significant, differentiation among breeding sites (Mantel's r = 0.0705, P UPGMA cluster analysis of the AFLP markers indicates that extensive gene flow has occurred among breeding sites. No one site stood out as being genetically unique or isolated. Therefore, the small level of genetic structure that we detected may not be biologically significant. Ongoing field studies are consistent with this conclusion. Of the banded birds that were resighted or recaptured in Arizona during the 1996 to 1998 breeding seasons, one-third moved between breeding sites and two-thirds were philopatric. Low differentiation may be the result of historically high rangewide diversity followed by recent geographic isolation of breeding sites, although observational data indicate that gene flow is a current phenomenon. Our data suggest that breeding groups of E. t. extimus act as a metapopulation.
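The Mantel statistic reported in this abstract measures the correlation between two distance matrices, with significance assessed by permuting the labels of one matrix. The following is a minimal standard-library sketch of that general procedure, not the authors' software; the function names and the toy distance matrix are hypothetical.

```python
import random
from math import sqrt

def _upper(mat, order):
    """Upper-triangle entries of mat after relabelling rows/columns by order."""
    n = len(order)
    return [mat[order[i]][order[j]] for i in range(n) for j in range(i + 1, n)]

def _pearson(x, y):
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def mantel(d1, d2, n_perm=999, seed=0):
    """Mantel r and one-sided permutation p-value for two distance matrices."""
    n = len(d1)
    identity = list(range(n))
    x = _upper(d1, identity)
    r_obs = _pearson(x, _upper(d2, identity))
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_perm):
        perm = identity[:]
        rng.shuffle(perm)                      # relabel the sites of d2
        if _pearson(x, _upper(d2, perm)) >= r_obs:
            hits += 1
    return r_obs, (hits + 1) / (n_perm + 1)

# Hypothetical example: five sites on a line; the second matrix equals the first
coords = [0.0, 1.0, 3.0, 6.0, 10.0]
geo = [[abs(a - b) for b in coords] for a in coords]
r, p = mantel(geo, geo)
```

With real data the two matrices would typically hold genetic and geographic distances between breeding sites; a small r with a small p, as reported above, indicates weak but detectable structure.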
POWERNEXT Carbon statistics September 30, 2006
International Nuclear Information System (INIS)
2006-01-01
This short document summarizes the statistics of Powernext Carbon, the European CO2 trading market, for the July-September 2006 period: total market volume, daily average, highest, number and average size of trades, number of members, average closing price, variation, low and high traded. The monthly volumes and closing prices for the September 2005 - September 2006 period are summarized in a graph. (J.S.)
POWERNEXT Carbon statistics November 30, 2006
International Nuclear Information System (INIS)
2006-01-01
This short document summarizes the statistics of Powernext Carbon, the European CO2 trading market, for the September-November 2006 period: total market volume, daily average, highest, number and average size of trades, number of members, average closing price, variation, low and high traded. The monthly volumes and closing prices for the November 2005 - November 2006 period are summarized in a graph. (J.S.)
Powernext Carbon statistics - June 30, 2006
International Nuclear Information System (INIS)
2006-01-01
This short document summarizes the statistics of Powernext Carbon, the European CO2 quotas trading market, for the second quarter of 2006: total market volume, daily average, highest, number and average size of trades, number of members, average closing price, variation, low and high traded. The monthly volumes and closing prices from June 2005 to June 2006 are summarized in a graph. (J.S.)
POWERNEXT Carbon statistics October 31, 2006
International Nuclear Information System (INIS)
2006-01-01
This short document summarizes the statistics of Powernext Carbon, the European CO2 trading market, for the August-October 2006 period: total market volume, daily average, highest, number and average size of trades, number of members, average closing price, variation, low and high traded. The monthly volumes and closing prices for the October 2005 - October 2006 period are summarized in a graph. (J.S.)
Powernext Carbon statistics - May 31, 2006
International Nuclear Information System (INIS)
2006-01-01
This short document summarizes the statistics of Powernext Carbon, the European CO2 quotas trading market, for March, April and May 2006: total market volume, daily average, highest, number and average size of trades, number of members, average closing price, variation, low and high traded. The daily volume and closing price from June 2005 to May 2006 are summarized in a graph. (J.S.)
[Suicide in Luxembourg: a statistical study].
1983-01-01
A review of the situation concerning suicide in Luxembourg is presented. The existing laws are first described, and some methodological questions are summarized. A statistical analysis of suicide in the country is then presented. Data are included on trends over time, 1881-1982; and on variations in suicide by sex, age, marital status, religion, nationality, and occupation and standard of living. A bibliography is also provided.
Ferragut, G.; Liu, T.; Klemperer, S. L.
2017-12-01
In recent years Virtual Deep Seismic Sounding (VDSS) emerged as a novel method to image the Moho, which uses the post-critical reflection P waves at the Moho generated by teleseismic S waves at the free surface near the receivers (SsPmp). However, observed SsPmp sometimes have significantly lower amplitude than predicted, raising doubts among the seismic community on the theoretical basis of the method. With over two decades of continuous digital broadband records and major subduction zones in the range of 30-50 degrees, the Yellowknife Array in northern Canada provides a rich opportunity for observation of post-critical SsPmp. We analyze S wave coda of events with epicenter distances of 30-50°, and pay special attention to earthquakes in a narrow azimuth range that encompasses the Kamchatka Peninsula. Among 21 events with strong direct S energy on the radial components, we observe significant variation of SsPmp energy. After associating the SsPmp energy with the virtual source location of each event, we observe a general trend of decreasing SsPmp energy from NE to SW. As the trend coincides with the transition from exposed basement of the Slave Craton to Paleozoic platform covered by Phanerozoic sediment, we interpret the decreasing SsPmp energy as a result of lower S velocity at the virtual sources, which reduces S-to-P reflection coefficients. We plan to include more events from the Aleutian Islands, the virtual sources of which are primarily located in the Paleozoic platform. This will allow us to further investigate the relationship between SsPmp amplitude and near-surface velocity.
A statistical skull geometry model for children 0-3 years old.
Li, Zhigang; Park, Byoung-Keon; Liu, Weiguo; Zhang, Jinhuan; Reed, Matthew P; Rupp, Jonathan D; Hoff, Carrie N; Hu, Jingwen
2015-01-01
Head injury is the leading cause of fatality and long-term disability for children. Pediatric heads change rapidly in both size and shape during growth, especially for children under 3 years old (YO). To accurately assess the head injury risks for children, it is necessary to understand the geometry of the pediatric head and how morphologic features influence injury causation within the 0-3 YO population. In this study, head CT scans from fifty-six 0-3 YO children were used to develop a statistical model of pediatric skull geometry. Geometric features important for injury prediction, including skull size and shape, skull thickness and suture width, along with their variations among the sample population, were quantified through a series of image and statistical analyses. The size and shape of the pediatric skull change significantly with age and head circumference. The skull thickness and suture width vary with age, head circumference and location, which will have important effects on skull stiffness and injury prediction. The statistical geometry model developed in this study can provide a geometrical basis for future development of child anthropomorphic test devices and pediatric head finite element models.
Morphological variation of 508 hatchling alligators from three lakes in north central Florida (Lakes Woodruff, Apopka, and Orange) was analyzed using multivariate statistics. Morphological variation was found among clutches as well as among lakes. Principal components analysis wa...
Point defect characterization in HAADF-STEM images using multivariate statistical analysis
International Nuclear Information System (INIS)
Sarahan, Michael C.; Chi, Miaofang; Masiel, Daniel J.; Browning, Nigel D.
2011-01-01
Quantitative analysis of point defects is demonstrated through the use of multivariate statistical analysis. This analysis consists of principal component analysis for dimensional estimation and reduction, followed by independent component analysis to obtain physically meaningful, statistically independent factor images. Results from these analyses are presented in the form of factor images and scores. Factor images show characteristic intensity variations corresponding to physical structure changes, while scores relate how much those variations are present in the original data. The application of this technique is demonstrated on a set of experimental images of dislocation cores along a low-angle tilt grain boundary in strontium titanate. A relationship between chemical composition and lattice strain is highlighted in the analysis results, with picometer-scale shifts in several columns measurable from compositional changes in a separate column. Research Highlights: → Multivariate analysis of HAADF-STEM images. → Distinct structural variations among SrTiO3 dislocation cores. → Picometer atomic column shifts correlated with atomic column population changes.
A Statistical Mechanics Approach to Approximate Analytical Bootstrap Averages
DEFF Research Database (Denmark)
Malzahn, Dorthe; Opper, Manfred
2003-01-01
We apply the replica method of Statistical Physics combined with a variational method to the approximate analytical computation of bootstrap averages for estimating the generalization error. We demonstrate our approach on regression with Gaussian processes and compare our results with averages...
Anatomical variations of the circle of Willis and cerebrovascular accidents in transitional Albania
Directory of Open Access Journals (Sweden)
Edlira Harizi (Shemsi)
2015-12-01
Aim: The purpose of this study was twofold: (i) in a case-control design, to determine the relationship between anatomical variations of the circle of Willis and cerebrovascular accidents; (ii) to assess the association between anatomical variations of the circle of Willis and aneurisms among patients with subarachnoid hemorrhage. Methods: A case-control study was conducted in Albania in 2013-2014, including 100 patients with subarachnoid hemorrhage and 100 controls (individuals without cerebrovascular accidents). Patients with subarachnoid hemorrhage underwent a CT angiography procedure, whereas individuals in the control group underwent a magnetic resonance angiography procedure. Binary logistic regression was used to assess the association between cerebrovascular accidents and the anatomical variations of the circle of Willis. Conversely, Fisher’s exact test was used to compare the prevalence of aneurisms between subarachnoid hemorrhage patients with and without anatomical variations of the circle of Willis. Results: Among patients, there were 22 (22%) cases with anatomical variations of the circle of Willis compared with 10 (10%) individuals in the control group (P=0.033). There was no evidence of a statistically significant difference in the types of the anatomical variations of the circle of Willis between patients and controls (P=0.402). In age- and sex-adjusted logistic regression models, there was evidence of a significant positive association between cerebrovascular accidents and the anatomical variations of the circle of Willis (OR=1.87, 95%CI=1.03-4.68, P=0.048). Within the patients’ group, of the 52 cases with aneurisms, there were 22 (42.3%) individuals with anatomical variations of the circle of Willis compared with no individuals with anatomical variations among the 48 patients without aneurisms (P<0.001). Conclusion: This study provides useful evidence on the association between anatomical variations of the circle of Willis and
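The Fisher's exact comparison reported in the Results (22 of 52 patients with aneurisms versus 0 of 48 without) can be reproduced in outline with the standard library. The sketch below is an illustrative two-sided implementation using the common "sum of tables no more probable than the observed one" convention, which is an assumption; the authors' software and exact convention are not stated in the abstract.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]]."""
    r1, r2, c1 = a + b, c + d, a + c
    denom = comb(r1 + r2, c1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed
        return comb(r1, x) * comb(r2, c1 - x) / denom

    lo, hi = max(0, c1 - r2), min(r1, c1)
    p_obs = prob(a)
    # Sum all tables at least as unlikely as the observed one (small tolerance)
    return sum(p for p in (prob(x) for x in range(lo, hi + 1))
               if p <= p_obs * (1 + 1e-9))

# Rows: aneurism present / absent; columns: variation present / absent
p_value = fisher_exact_two_sided(22, 30, 0, 48)
print(p_value < 0.001)
```

Because no patient without an aneurism had an anatomical variation, the observed table is the most extreme one possible on that side, which is why the p-value falls well below the reported threshold.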
Memory-type control charts in statistical process control
Abbas, N.
2012-01-01
Control chart is the most important statistical tool to manage the business processes. It is a graph of measurements on a quality characteristic of the process on the vertical axis plotted against time on the horizontal axis. The graph is completed with control limits that cause variation mark. Once
WE-FG-BRB-01: Clinical Significance of RBE Variations in Proton Therapy
Energy Technology Data Exchange (ETDEWEB)
Paganetti, H. [Massachusetts General Hospital (United States)
2016-06-15
The physical pattern of energy deposition and the enhanced relative biological effectiveness (RBE) of protons and carbon ions compared to photons offer unique and not fully understood or exploited opportunities to improve the efficacy of radiation therapy. Variations in RBE within a pristine or spread out Bragg peak and between particle types may be exploited to enhance cell killing in target regions without a corresponding increase in damage to normal tissue structures. In addition, the decreased sensitivity of hypoxic tumors to photon-based therapies may be partially overcome through the use of more densely ionizing radiations. These and other differences between particle and photon beams may be used to generate biologically optimized treatments that reduce normal tissue complications. In this symposium, speakers will examine the impact of the RBE of charged particles on measurable biological endpoints, treatment plan optimization, and the prediction or retrospective assessment of treatment outcomes. In particular, an AAPM task group was formed to critically examine the evidence for a spatially-variant RBE in proton therapy. Current knowledge of proton RBE variation with respect to dose, biological endpoint, and physics parameters will be reviewed. Further, the clinical relevance of these variations will be discussed. Recent work focused on improving simulations of radiation physics and biological response in proton and carbon ion therapy will also be presented. Finally, relevant biology research and areas of research needs will be highlighted, including the dependence of RBE on genetic factors including status of DNA repair pathways, the sensitivity of cancer stem-like cells to charged particles, the role of charged particles in hypoxic tumors, and the importance of fractionation effects. In addition to the physical advantages of protons and more massive ions over photons, the future application of biologically optimized treatment plans and their potential to
Wen, Yi Feng; Wong, Hai Ming; Lin, Ruitao; Yin, Guosheng; McGrath, Colman
2015-01-01
Background Numerous facial photogrammetric studies have been published around the world. We aimed to critically review these studies so as to establish population norms for various angular and linear facial measurements; and to determine inter-ethnic/racial facial variations. Methods and Findings A comprehensive and systematic search of PubMed, ISI Web of Science, Embase, and Scopus was conducted to identify facial photogrammetric studies published before December, 2014. Subjects of eligible studies were either Africans, Asians or Caucasians. A Bayesian hierarchical random effects model was developed to estimate posterior means and 95% credible intervals (CrI) for each measurement by ethnicity/race. Linear contrasts were constructed to explore inter-ethnic/racial facial variations. We identified 38 eligible studies reporting 11 angular and 18 linear facial measurements. Risk of bias of the studies ranged from 0.06 to 0.66. At the significance level of 0.05, African males were found to have smaller nasofrontal angle (posterior mean difference: 8.1°, 95% CrI: 2.2°–13.5°) compared to Caucasian males and larger nasofacial angle (7.4°, 0.1°–13.2°) compared to Asian males. Nasolabial angle was more obtuse in Caucasian females than in African (17.4°, 0.2°–35.3°) and Asian (9.1°, 0.4°–17.3°) females. Additional inter-ethnic/racial variations were revealed when the level of statistical significance was set at 0.10. Conclusions A comprehensive database for angular and linear facial measurements was established from existing studies using the statistical model and inter-ethnic/racial variations of facial features were observed. The results have implications for clinical practice and highlight the need and value for high quality photogrammetric studies. PMID:26247212
Directory of Open Access Journals (Sweden)
Yi Feng Wen
Numerous facial photogrammetric studies have been published around the world. We aimed to critically review these studies so as to establish population norms for various angular and linear facial measurements; and to determine inter-ethnic/racial facial variations. A comprehensive and systematic search of PubMed, ISI Web of Science, Embase, and Scopus was conducted to identify facial photogrammetric studies published before December, 2014. Subjects of eligible studies were either Africans, Asians or Caucasians. A Bayesian hierarchical random effects model was developed to estimate posterior means and 95% credible intervals (CrI) for each measurement by ethnicity/race. Linear contrasts were constructed to explore inter-ethnic/racial facial variations. We identified 38 eligible studies reporting 11 angular and 18 linear facial measurements. Risk of bias of the studies ranged from 0.06 to 0.66. At the significance level of 0.05, African males were found to have smaller nasofrontal angle (posterior mean difference: 8.1°, 95% CrI: 2.2°-13.5°) compared to Caucasian males and larger nasofacial angle (7.4°, 0.1°-13.2°) compared to Asian males. Nasolabial angle was more obtuse in Caucasian females than in African (17.4°, 0.2°-35.3°) and Asian (9.1°, 0.4°-17.3°) females. Additional inter-ethnic/racial variations were revealed when the level of statistical significance was set at 0.10. A comprehensive database for angular and linear facial measurements was established from existing studies using the statistical model and inter-ethnic/racial variations of facial features were observed. The results have implications for clinical practice and highlight the need and value for high quality photogrammetric studies.
VarB Plus: An Integrated Tool for Visualization of Genome Variation Datasets
Hidayah, Lailatul
2012-07-01
Research on genomic sequences has been improving significantly as more advanced technology for sequencing has been developed. This opens enormous opportunities for sequence analysis. Various analytical tools have been built for purposes such as sequence assembly, read alignments, genome browsing, comparative genomics, and visualization. From the visualization perspective, there is an increasing trend towards use of large-scale computation. However, more than power is required to produce an informative image. This is a challenge that we address by providing several ways of representing biological data in order to advance the inference endeavors of biologists. This thesis focuses on visualization of variations found in genomic sequences. We develop several visualization functions and embed them in an existing variation visualization tool as extensions. The tool we improved is named VarB, hence the nomenclature for our enhancement is VarB Plus. To the best of our knowledge, besides VarB, there is no tool that provides the capability of dynamic visualization of genome variation datasets as well as statistical analysis. Dynamic visualization allows users to toggle different parameters on and off and see the results on the fly. The statistical analysis includes Fixation Index, Relative Variant Density, and Tajima’s D. Hence we focused our efforts on this tool. The scope of our work includes plots of per-base genome coverage, Principal Coordinate Analysis (PCoA), integration with a read alignment viewer named LookSeq, and visualization of geo-biological data. In addition to description of embedded functionalities, significance, and limitations, future improvements are discussed. The result is four extensions embedded successfully in the original tool, which is built on the Qt framework in C++. Hence it is portable to numerous platforms. Our extensions have shown acceptable execution time in a beta testing with various high-volume published datasets, as well as positive
Gaussian statistics for palaeomagnetic vectors
Love, J.J.; Constable, C.G.
2003-01-01
formulate the inverse problem, and how to estimate the mean and variance of the magnetic vector field, even when the data consist of mixed combinations of directions and intensities. We examine palaeomagnetic secular-variation data from Hawaii and Réunion, and although these two sites are on almost opposite latitudes, we find significant differences in the mean vector and differences in the local vectorial variances, with the Hawaiian data being particularly anisotropic. These observations are inconsistent with a description of the mean field as being a simple geocentric axial dipole and with secular variation being statistically symmetrical with respect to reflection through the equatorial plane. Finally, our analysis of palaeomagnetic acquisition data from the 1960 Kilauea flow in Hawaii and the Holocene Xitle flow in Mexico, is consistent with the widely held suspicion that directional data are more accurate than intensity data.
Gaussian statistics for palaeomagnetic vectors
Love, J. J.; Constable, C. G.
2003-03-01
formulate the inverse problem, and how to estimate the mean and variance of the magnetic vector field, even when the data consist of mixed combinations of directions and intensities. We examine palaeomagnetic secular-variation data from Hawaii and Réunion, and although these two sites are on almost opposite latitudes, we find significant differences in the mean vector and differences in the local vectorial variances, with the Hawaiian data being particularly anisotropic. These observations are inconsistent with a description of the mean field as being a simple geocentric axial dipole and with secular variation being statistically symmetrical with respect to reflection through the equatorial plane. Finally, our analysis of palaeomagnetic acquisition data from the 1960 Kilauea flow in Hawaii and the Holocene Xitle flow in Mexico, is consistent with the widely held suspicion that directional data are more accurate than intensity data.
Powernext Carbon statistics - August 31, 2005
International Nuclear Information System (INIS)
2005-01-01
This short document summarizes the statistics of Powernext Carbon, the European CO2 quotas trading market, for June, July and August 2005: total market volume, daily average, highest, number and average size of trades, number of members, average closing price, variation, low and high traded. The daily volume and closing price from June 2005 to August 2005 are summarized in a graph and a members list is supplied. (J.S.)
Powernext Carbon statistics - March 31, 2006
International Nuclear Information System (INIS)
2006-01-01
This short document summarizes the statistics of Powernext Carbon, the European CO2 quotas trading market, for the first quarter of 2006: total market volume, daily average, highest, number and average size of trades, number of members, average closing price, variation, low and high traded. The daily volume and closing price from June 2005 to March 2006 are summarized in a graph and a members list is supplied. (J.S.)
Powernext Carbon statistics - November 30, 2005
International Nuclear Information System (INIS)
2005-01-01
This short document summarizes the statistics of Powernext Carbon, the European CO2 quotas trading market, for September, October and November 2005: total market volume, daily average, highest, number and average size of trades, number of members, average closing price, variation, low and high traded. The daily volume and closing price from June 2005 to November 2005 are summarized in a graph and a members list is supplied. (J.S.)
Powernext Carbon statistics - February 28, 2006
International Nuclear Information System (INIS)
2006-01-01
This short document summarizes the statistics of Powernext Carbon, the European CO2 quotas trading market, for December 2005 and January-February 2006: total market volume, daily average, highest, number and average size of trades, number of members, average closing price, variation, low and high traded. The daily volume and closing price from June 2005 to February 2006 are summarized in a graph and a members list is supplied. (J.S.)
Powernext Carbon statistics - October 31, 2005
International Nuclear Information System (INIS)
2005-01-01
This short document summarizes the statistics of Powernext Carbon, the European CO2 quotas trading market, for August, September and October 2005: total market volume, daily average, highest, number and average size of trades, number of members, average closing price, variation, low and high traded. The daily volume and closing price from June 2005 to November 2005 are summarized in a graph and a members list is supplied. (J.S.)
Variational Approach in the Theory of Liquid-Crystal State
Gevorkyan, E. V.
2018-03-01
The variational calculus of Leonhard Euler is the basis for modern mathematics and theoretical physics. The efficiency of the variational approach in the statistical theory of the liquid-crystal state, and more generally in condensed-state theory, is shown. In particular, the developed approach allows us to introduce effective pair interactions correctly and to optimize simple models of liquid crystals with the help of realistic intermolecular potentials.
Maric, Marija; de Haan, Else; Hogendoorn, Sanne M; Wolters, Lidewij H; Huizenga, Hilde M
2015-03-01
Single-case experimental designs are useful methods in clinical research practice to investigate individual client progress. Their proliferation might have been hampered by methodological challenges such as the difficulty applying existing statistical procedures. In this article, we describe a data-analytic method to analyze univariate (i.e., one symptom) single-case data using the common package SPSS. This method can help the clinical researcher to investigate whether an intervention works as compared with a baseline period or another intervention type, and to determine whether symptom improvement is clinically significant. First, we describe the statistical method in a conceptual way and show how it can be implemented in SPSS. Simulation studies were performed to determine the number of observation points required per intervention phase. Second, to illustrate this method and its implications, we present a case study of an adolescent with anxiety disorders treated with cognitive-behavioral therapy techniques in an outpatient psychotherapy clinic, whose symptoms were regularly assessed before each session. We provide a description of the data analyses and results of this case study. Finally, we discuss the advantages and shortcomings of the proposed method. Copyright © 2014. Published by Elsevier Ltd.
Di Florio, Adriano
2017-10-01
In order to test the computing capabilities of GPUs with respect to traditional CPU cores, a high-statistics toy Monte Carlo technique has been implemented in both the ROOT/RooFit and GooFit frameworks to estimate the statistical significance of the structure observed by CMS close to the kinematical boundary of the J/ψϕ invariant mass in the three-body decay B+ → J/ψϕK+. GooFit is an open data analysis tool under development that interfaces ROOT/RooFit to the CUDA platform on NVIDIA GPUs. The optimized GooFit application running on GPUs hosted by servers in the Bari Tier2 provides striking speed-up performances with respect to the RooFit application parallelised on multiple CPUs by means of the PROOF-Lite tool. The considerable resulting speed-up, evident when comparing concurrent GooFit processes allowed by the CUDA Multi Process Service and a RooFit/PROOF-Lite process with multiple CPU workers, is presented and discussed in detail. By means of GooFit it has also been possible to explore the behaviour of a likelihood ratio test statistic in different situations in which the Wilks Theorem may or may not apply, depending on whether its regularity conditions are satisfied.
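The toy Monte Carlo idea in this abstract can be illustrated with a much smaller problem. The sketch below is not the CMS, RooFit or GooFit analysis: it assumes a single Poisson counting window with a known background `b`, and the function names `q0` and `toy_p_value` and all numbers are hypothetical. It generates background-only toys, evaluates a one-sided likelihood-ratio statistic on each, and reads the p-value off the empirical distribution rather than relying on Wilks' theorem.

```python
import numpy as np
from statistics import NormalDist


def q0(n, b):
    """One-sided likelihood-ratio statistic for a Poisson count n and known background b."""
    n = np.asarray(n, dtype=float)
    # Only upward fluctuations count as evidence; n <= b is mapped to q0 = 0.
    safe_n = np.where(n > b, n, b)
    return 2.0 * (safe_n * np.log(safe_n / b) - (safe_n - b))


def toy_p_value(n_obs, b, n_toys=200_000, seed=0):
    """Fraction of background-only toys whose statistic is at least the observed one."""
    rng = np.random.default_rng(seed)
    q_toys = q0(rng.poisson(b, size=n_toys), b)
    q_obs = q0(n_obs, b)
    # The +1 terms keep the estimate away from exactly zero when no toy exceeds q_obs.
    return (np.count_nonzero(q_toys >= q_obs) + 1) / (n_toys + 1)


# Hypothetical numbers: 25 events observed on an expected background of 10.
p = toy_p_value(n_obs=25, b=10.0)
z = NormalDist().inv_cdf(1.0 - p)  # equivalent one-sided Gaussian significance
print(f"p = {p:.2e}, Z = {z:.2f}")
```

The number of toys bounds the smallest p-value that can be resolved, which is one reason high-statistics toy studies benefit from GPU acceleration.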
Processed dairy beverages pH evaluation: consequences of temperature variation.
Ferreira, Fabiana Vargas; Pozzobon, Roselaine Terezinha
2009-01-01
This study assessed the pH of processed dairy beverages as well as the consequences of different ingestion temperatures. Fifty adults who accompanied children attending the Dentistry School were randomly selected and answered a questionnaire on beverages. The beverages were divided into 4 groups: yogurt (GI), fermented milk (GII), chocolate-based products (GIII) and fermented dairy beverages (GIV). Respondents were asked about beverage type, flavor and temperature. The most popular beverages were selected, and these made up the sample. A Quimis 400A pH meter was used to measure pH. The average pH of each beverage was calculated and submitted to statistical analysis (analysis of variance and Tukey test with a 5% significance level). For group I, II and III beverages, the type x temperature interaction was significant, showing that the pH averages were influenced by temperature variation. At iced temperatures, they presented lower pH values, which were statistically significant when compared to the values found for the same beverages at room temperature. All dairy beverages, with the exception of the chocolate-based type, had pH below the critical level for enamel and showed corrosive potential; as to ingestion temperature, iced temperature reduced pH values in vitro.
Changing statistics of storms in the North Atlantic?
International Nuclear Information System (INIS)
Storch, H. von; Guddal, J.; Iden, K.A.; Jonsson, T.; Perlwitz, J.; Reistad, M.; Ronde, J. de; Schmidt, H.; Zorita, E.
1993-01-01
Problems in the present discussion about increasing storminess in the North Atlantic area are discussed. Observational data so far available do not indicate a change in the storm statistics. Output from climate models points to an intensified storm track in the North Atlantic, but because of the limited skill of present-day climate models in simulating high-frequency variability and regional details any such 'forecast' has to be considered with caution. A downscaling procedure which relates large-scale time-mean aspects of the state of the atmosphere and ocean to the local statistics of storms is proposed to reconstruct past variations of high-frequency variability in the atmosphere (storminess) and in the sea state (wave statistics). First results are presented. (orig.)
He, Ping
2012-01-01
The long-standing puzzle surrounding the statistical mechanics of self-gravitating systems has not yet been solved successfully. We formulate a systematic theoretical framework of entropy-based statistical mechanics for spherically symmetric collisionless self-gravitating systems. We use an approach that is very different from that of the conventional statistical mechanics of short-range interaction systems. We demonstrate that the equilibrium states of self-gravitating systems consist of both mechanical and statistical equilibria, with the former characterized by a series of velocity-moment equations and the latter by statistical equilibrium equations, which should be derived from the entropy principle. The velocity-moment equations of all orders are derived from the steady-state collisionless Boltzmann equation. We point out that the ergodicity is invalid for the whole self-gravitating system, but it can be re-established locally. Based on the local ergodicity, using Fermi-Dirac-like statistics, with the non-degenerate condition and the spatial independence of the local microstates, we rederive the Boltzmann-Gibbs entropy. This is consistent with the validity of the collisionless Boltzmann equation, and should be the correct entropy form for collisionless self-gravitating systems. Apart from the usual constraints of mass and energy conservation, we demonstrate that the series of moment or virialization equations must be included as additional constraints on the entropy functional when performing the variational calculus; this is an extension to the original prescription by White & Narayan. Any possible velocity distribution can be produced by the statistical-mechanical approach that we have developed with the extended Boltzmann-Gibbs/White-Narayan statistics. Finally, we discuss the questions of negative specific heat and ensemble inequivalence for self-gravitating systems.
Hersoug, Lars-Georg; Brasch-Andersen, Charlotte; Husemoen, Lise Lotte Nystrup; Sigsgaard, Torben; Linneberg, Allan
2012-07-01
Exposure to particulate matter (PM) may induce inflammation and oxidative stress in the airways. Carriers of null polymorphisms of glutathione S-transferases (GSTs), which detoxify reactive oxygen species, may be particularly susceptible to the effects of PM. To investigate whether deletions of GSTM1 and GSTT1 modify the potential effects of exposure to indoor sources of PM on symptoms and objective markers of respiratory disease. We conducted a population-based, cross-sectional study of 3471 persons aged 18-69 years. Information about exposure to indoor sources of PM and respiratory symptoms was obtained by a self-administered questionnaire. In addition, measurements of lung function (spirometry) and fractional exhaled nitric oxide were performed. Copy number variation of GSTM1 and GSTT1 was determined by polymerase chain reaction-based assays. We found that none of the symptoms and objective markers of respiratory disease were significantly associated with the GST null polymorphisms. An increasing number of positive alleles of the GSTM1 polymorphism tended to be associated with lower prevalence of wheeze, cough, and high forced expiratory volume in 1 s (FEV1), but these trends were not statistically significant. Furthermore, we did not observe any statistically significant interactions between GST copy number variation and exposure to indoor sources of PM in relation to respiratory symptoms and markers. In this adult population, GST copy number variations were not significantly associated with respiratory outcomes and did not modify the effects of self-reported exposure to indoor sources of PM on respiratory outcomes. © 2011 Blackwell Publishing Ltd.
Duncan, Fiona; Haigh, Carol
2013-10-01
To explore and improve the quality of continuous epidural analgesia for pain relief using Statistical Process Control tools. Measuring the quality of pain management interventions is complex. Intermittent audits do not accurately capture the results of quality improvement initiatives. The failure rate for one intervention, epidural analgesia, is approximately 30% in everyday practice, so it is an important area for improvement. Continuous measurement and analysis are required to understand the multiple factors involved in providing effective pain relief. Process control and quality improvement. Routine prospectively acquired data collection started in 2006. Patients were asked about their pain and side effects of treatment. Statistical Process Control methods were applied for continuous data analysis. A multidisciplinary group worked together to identify reasons for variation in the data and instigated ideas for improvement. The key measure for improvement was a reduction in the percentage of patients with an epidural in severe pain. The baseline control charts illustrated the recorded variation in the rate of several processes and outcomes for 293 surgical patients. The mean visual analogue pain score (VNRS) was four. There was no special cause variation when data were stratified by surgeons, clinical area or patients who had experienced pain before surgery. Fifty-seven per cent of patients were hypotensive on the first day after surgery. We were able to demonstrate a significant improvement in the failure rate of epidurals as the project continued with quality improvement interventions. Statistical Process Control is a useful tool for measuring and improving the quality of pain management. The applications of Statistical Process Control methods offer the potential to learn more about the process of change and outcomes in an Acute Pain Service both locally and nationally. We have been able to develop measures for improvement and benchmarking in routine care that
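As an illustration of how a control chart separates common-cause from special-cause variation, the sketch below computes three-sigma limits for a proportion chart (p-chart). It is a generic textbook construction, not the authors' procedure; the function name `p_chart_limits` and the monthly counts are invented for the example.

```python
import numpy as np


def p_chart_limits(events, sample_sizes):
    """Centre line and three-sigma limits of a p-chart with varying subgroup sizes."""
    events = np.asarray(events, dtype=float)
    n = np.asarray(sample_sizes, dtype=float)
    p_bar = events.sum() / n.sum()  # pooled proportion (centre line)
    half_width = 3.0 * np.sqrt(p_bar * (1.0 - p_bar) / n)
    lower = np.clip(p_bar - half_width, 0.0, 1.0)
    upper = np.clip(p_bar + half_width, 0.0, 1.0)
    return p_bar, lower, upper


# Hypothetical monthly data: patients in severe pain out of patients with an epidural.
severe_pain = [6, 8, 5, 7, 9, 4, 6, 15]
patients = [30, 32, 28, 31, 33, 29, 30, 30]

p_bar, lcl, ucl = p_chart_limits(severe_pain, patients)
rates = np.array(severe_pain) / np.array(patients)
signals = (rates > ucl) | (rates < lcl)  # points outside the limits suggest special causes
print(f"centre line = {p_bar:.3f}; months flagged: {np.flatnonzero(signals).tolist()}")
```

Points inside the limits are treated as routine variation; only flagged points would prompt a search for an assignable cause.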
Metabolomics reveals variation and correlation among different tissues of olive (Olea europaea L.)
Directory of Open Access Journals (Sweden)
Rao Guodong
2017-09-01
Metabolites in olives are associated with nutritional value and physiological properties. However, comprehensive information regarding the olive metabolome is limited. In this study, we identified 226 metabolites from three different tissues of olive using a non-targeted metabolomic profiling approach, of which 76 named metabolites were confirmed. Further statistical analysis revealed that these 76 metabolites covered different types of primary metabolism and some of the secondary metabolism pathways. A one-way analysis of variance (ANOVA) was performed to assess the variation within the detected metabolites, and levels of 65 metabolites were differentially expressed in different samples. Hierarchical cluster analysis (HCA) dendrograms showed variations among different tissues that were similar to the metabolite profiles observed in new leaves and fruit. Additionally, 5776 metabolite-metabolite correlations were detected by a Pearson correlation coefficient approach. Screening of the calculated correlations identified 3136, 3025, and 5184 significant metabolite correlations in three different combinations, respectively. This work provides the first comprehensive metabolomic profile of olive, which will provide new insights into understanding olive metabolism, and potentially help advance studies in olive metabolic engineering.
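A pairwise Pearson screen of the kind mentioned here can be sketched in a few lines. The data below are synthetic, and the number of samples and the 0.9 cutoff are assumptions made only for illustration; the study's actual data and thresholds are not given in the abstract. A 76 x 76 matrix has 5776 entries, which is consistent with the count quoted above if self-pairs and both orderings of each pair are included.

```python
import numpy as np

rng = np.random.default_rng(1)
n_samples, n_metabolites = 9, 76  # 76 named metabolites; 9 samples is an assumption

# Synthetic placeholder abundances (rows: samples, columns: metabolites).
levels = rng.lognormal(size=(n_samples, n_metabolites))

# Pearson correlation between every pair of metabolite columns.
r = np.corrcoef(levels, rowvar=False)

# Hypothetical screening rule: keep strongly correlated, distinct pairs only.
upper = np.triu_indices(n_metabolites, k=1)
strong_pairs = np.count_nonzero(np.abs(r[upper]) >= 0.9)
print(f"{r.size} coefficients in total, {strong_pairs} distinct pairs with |r| >= 0.9")
```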
Testing the significance of canonical axes in redundancy analysis
Legendre, P.; Oksanen, J.; Braak, ter C.J.F.
2011-01-01
1. Tests of significance of the individual canonical axes in redundancy analysis allow researchers to determine which of the axes represent variation that can be distinguished from random. Variation along the significant axes can be mapped, used to draw biplots or interpreted through subsequent
SU-F-I-10: Spatially Local Statistics for Adaptive Image Filtering
International Nuclear Information System (INIS)
Iliopoulos, AS; Sun, X; Floros, D; Zhang, Y; Yin, FF; Ren, L; Pitsianis, N
2016-01-01
Purpose: To facilitate adaptive image filtering operations, addressing spatial variations in both noise and signal. Such issues are prevalent in cone-beam projections, where physical effects such as X-ray scattering result in spatially variant noise, violating common assumptions of homogeneous noise and challenging conventional filtering approaches to signal extraction and noise suppression. Methods: We present a computational mechanism for probing into and quantifying the spatial variance of noise throughout an image. The mechanism builds a pyramid of local statistics at multiple spatial scales; local statistical information at each scale includes (weighted) mean, median, standard deviation, median absolute deviation, as well as histogram or dynamic range after local mean/median shifting. Based on inter-scale differences of local statistics, the spatial scope of distinguishable noise variation is detected in a semi- or un-supervised manner. Additionally, we propose and demonstrate the incorporation of such information in globally parametrized (i.e., non-adaptive) filters, effectively transforming the latter into spatially adaptive filters. The multi-scale mechanism is materialized by efficient algorithms and implemented in parallel CPU/GPU architectures. Results: We demonstrate the impact of local statistics for adaptive image processing and analysis using cone-beam projections of a Catphan phantom, fitted within an annulus to increase X-ray scattering. The effective spatial scope of local statistics calculations is shown to vary throughout the image domain, necessitating multi-scale noise and signal structure analysis. Filtering results with and without spatial filter adaptation are compared visually, illustrating improvements in imaging signal extraction and noise suppression, and in preserving information in low-contrast regions. Conclusion: Local image statistics can be incorporated in filtering operations to equip them with spatial adaptivity to spatial
SU-F-I-10: Spatially Local Statistics for Adaptive Image Filtering
Energy Technology Data Exchange (ETDEWEB)
Iliopoulos, AS; Sun, X [Duke University, Durham, NC (United States); Floros, D [Aristotle University of Thessaloniki (Greece); Zhang, Y; Yin, FF; Ren, L [Duke University Medical Center, Durham, NC (United States); Pitsianis, N [Aristotle University of Thessaloniki (Greece); Duke University, Durham, NC (United States)
2016-06-15
Purpose: To facilitate adaptive image filtering operations, addressing spatial variations in both noise and signal. Such issues are prevalent in cone-beam projections, where physical effects such as X-ray scattering result in spatially variant noise, violating common assumptions of homogeneous noise and challenging conventional filtering approaches to signal extraction and noise suppression. Methods: We present a computational mechanism for probing into and quantifying the spatial variance of noise throughout an image. The mechanism builds a pyramid of local statistics at multiple spatial scales; local statistical information at each scale includes (weighted) mean, median, standard deviation, median absolute deviation, as well as histogram or dynamic range after local mean/median shifting. Based on inter-scale differences of local statistics, the spatial scope of distinguishable noise variation is detected in a semi- or un-supervised manner. Additionally, we propose and demonstrate the incorporation of such information in globally parametrized (i.e., non-adaptive) filters, effectively transforming the latter into spatially adaptive filters. The multi-scale mechanism is materialized by efficient algorithms and implemented in parallel CPU/GPU architectures. Results: We demonstrate the impact of local statistics for adaptive image processing and analysis using cone-beam projections of a Catphan phantom, fitted within an annulus to increase X-ray scattering. The effective spatial scope of local statistics calculations is shown to vary throughout the image domain, necessitating multi-scale noise and signal structure analysis. Filtering results with and without spatial filter adaptation are compared visually, illustrating improvements in imaging signal extraction and noise suppression, and in preserving information in low-contrast regions. Conclusion: Local image statistics can be incorporated in filtering operations to equip them with spatial adaptivity to spatial
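A minimal sketch of multi-scale local statistics follows, assuming plain square windows and only the mean and standard deviation; the weighting, median-based statistics, and CPU/GPU algorithms described in the abstract are not reproduced. The names `local_stats` and `local_stats_pyramid`, the window sizes, and the synthetic image are hypothetical.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def local_stats(image, window):
    """Mean and standard deviation over every fully contained window x window patch."""
    patches = sliding_window_view(image, (window, window))
    return patches.mean(axis=(-2, -1)), patches.std(axis=(-2, -1))


def local_stats_pyramid(image, windows=(3, 7, 15)):
    """Local statistics at several spatial scales, keyed by window size."""
    return {w: local_stats(image, w) for w in windows}


# Synthetic image whose noise level grows from left to right.
rng = np.random.default_rng(0)
sigma = np.linspace(0.5, 3.0, 64)
image = rng.normal(0.0, 1.0, size=(64, 64)) * sigma

pyramid = local_stats_pyramid(image)
_, std_15 = pyramid[15]
print(f"mean local std, left edge: {std_15[:, 0].mean():.2f}, right edge: {std_15[:, -1].mean():.2f}")
```

Comparing the standard-deviation maps across window sizes gives a rough indication of the scale at which the noise level stops looking homogeneous, which is the kind of inter-scale difference the abstract refers to.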
DEFF Research Database (Denmark)
Madsen, Tobias
2017-01-01
In the present thesis I develop, implement and apply statistical methods for detecting genomic elements implicated in cancer development and progression. This is done in two separate bodies of work. The first uses the somatic mutation burden to distinguish cancer driver mutations from passenger m...
Morris, Brian J; Carnes, Bruce A; Chen, Randi; Donlon, Timothy A; He, Qimei; Grove, John S; Masaki, Kamal H; Elliott, Ayako; Willcox, Donald C; Allsopp, Richard; Willcox, Bradley J
2015-04-01
The mechanistic target of rapamycin (mTOR) pathway is pivotal for cell growth. Regulatory associated protein of mTOR complex I (Raptor) is a unique component of this pro-growth complex. The present study tested whether variation across the raptor gene (RPTOR) is associated with overweight and hypertension. We tested 61 common (allele frequency ≥ 0.1) tagging single nucleotide polymorphisms (SNPs) that captured most of the genetic variation across RPTOR in 374 subjects of normal lifespan and 439 subjects with a lifespan exceeding 95 years for association with overweight/obesity, essential hypertension, and isolated systolic hypertension. Subjects were drawn from the Honolulu Heart Program, a homogeneous population of American men of Japanese ancestry, well characterized for phenotypes relevant to conditions of aging. Hypertension status was ascertained when subjects were 45-68 years old. Statistical evaluation involved contingency table analysis, logistic regression, and the powerful method of recursive partitioning. After analysis of RPTOR genotypes by each statistical approach, we found no significant association between genetic variation in RPTOR and either essential hypertension or isolated systolic hypertension. Models generated by recursive partitioning analysis showed that RPTOR SNPs significantly enhanced the ability of the model to accurately assign individuals to either the overweight/obese or the non-overweight/obese groups (P = 0.008 by 1-tailed Z test). Common genetic variation in RPTOR is associated with overweight/obesity but does not discernibly contribute to either essential hypertension or isolated systolic hypertension in the population studied. © American Journal of Hypertension, Ltd 2014. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Directory of Open Access Journals (Sweden)
I-Wen Liu
A vertebral artery (VA) terminating in a posterior inferior cerebellar artery (PICA) is often considered to be a normal variation associated with VA hypoplasia. We aimed to investigate the clinical significance of this cerebrovascular variant. A total of 80 patients with clinically evident cerebrovascular events in posterior circulation were examined by duplex sonography and magnetic resonance angiography (MRA). Eighty healthy subjects who had MRA check-up were recruited as controls. PICA termination of the VA (PICA-VA) was identified as the VA not communicating with the basilar artery (BA) but ending into a PICA. We compared the prevalence of PICA-VA and associated hemodynamic parameters between the patients with and without PICA-VA, and investigated their relationships with VA hypoplasia. The prevalence of PICA-VA was higher in the patient group than in the controls (18.7% vs. 6.3%, p = 0.015). Most measurements (73.3%) of PICA-VA did not fit the criteria of VA hypoplasia. In comparison with the non-PICA-terminating group, the PICA-VA has a smaller diameter (3.7 ± 0.7 mm vs. 3.0 ± 0.5 mm, p < 0.001), lower mean velocity (241 ± 100 mm/sec vs. 164 ± 88 mm/sec, p < 0.01), and higher pulsatility index (1.3 ± 0.5 vs. 1.9 ± 0.6, p < 0.001). Moreover, a smaller diameter of the BA (3.2 ± 0.5 mm vs. 2.5 ± 0.9 mm, p = 0.004) and the posterior cerebral artery (PCA) (2.0 ± 0.1 mm vs. 1.6 ± 0.1 mm, p = 0.006) were also noted in the PICA-VA group. The higher prevalence of PICA-VA in the patient group with smaller diameter of VA, BA and PCA reflected its clinical significance, suggesting that PICA-VA may have a detrimental impact on cerebral hemodynamics. However, the sample is small, and further studies are needed with larger sample size for confirmation.
Directory of Open Access Journals (Sweden)
Alberquilla Angel
2008-02-01
Abstract Background The study of hospitalizations for ambulatory care sensitive conditions (ACSH) has been proposed as an indirect measure of access to and receipt of care by older persons at the entryway to the Spanish public health system. The aim of this work is to identify the rates of ACSH in persons 65 years or older living in different small areas of the Community of Madrid (CM) and to detect possible differences in ACSH. Methods Cross-sectional, ecologic study, which covered all 34 health districts of the CM. The study population consisted of all individuals aged 65 years or older residing in the CM between 2001 and 2003, inclusive. Using hospital discharge data, avoidable ACSH were selected from the list of conditions validated for Spain. Age- and sex-adjusted ACSH rates were calculated for the population of each health district, along with statistics describing the data variability. Point graphs and maps were designed to represent the ACSH rates in the different health districts. Results Of all the hospitalizations, 16.5% (64,409) were ACSH. Globally, the rate was higher among men: 33.15 per 1,000 population vs. 22.10 in women and these differences were statistically significant (p Conclusion A significant variation is demonstrated in "preventable" hospitalizations between the different districts. In all districts, men had significantly higher rates than women. Important variations in access to primary care are observed despite the existence of universal health coverage.
Nuclear modification factor using Tsallis non-extensive statistics
Energy Technology Data Exchange (ETDEWEB)
Tripathy, Sushanta; Garg, Prakhar; Kumar, Prateek; Sahoo, Raghunath [Indian Institute of Technology Indore, Discipline of Physics, School of Basic Sciences, Simrol (India); Bhattacharyya, Trambak; Cleymans, Jean [University of Cape Town, UCT-CERN Research Centre and Department of Physics, Rondebosch (South Africa)
2016-09-15
The nuclear modification factor is derived using Tsallis non-extensive statistics in relaxation time approximation. The variation of the nuclear modification factor with transverse momentum for different values of the non-extensive parameter, q, is also observed. The experimental data from RHIC and LHC are analysed in the framework of Tsallis non-extensive statistics in a relaxation time approximation. It is shown that the proposed approach explains the R_AA of all particles over a wide range of transverse momentum but does not seem to describe the rise in R_AA at very high transverse momenta. (orig.)
Significance evaluation in factor graphs
DEFF Research Database (Denmark)
Madsen, Tobias; Hobolth, Asger; Jensen, Jens Ledet
2017-01-01
in genomics and the multiple-testing issues accompanying them, accurate significance evaluation is of great importance. We here address the problem of evaluating statistical significance of observations from factor graph models. Results Two novel numerical approximations for evaluation of statistical...... significance are presented. First a method using importance sampling. Second a saddlepoint approximation based method. We develop algorithms to efficiently compute the approximations and compare them to naive sampling and the normal approximation. The individual merits of the methods are analysed both from....... Conclusions The applicability of saddlepoint approximation and importance sampling is demonstrated on known models in the factor graph framework. Using the two methods we can substantially improve computational cost without compromising accuracy. This contribution allows analyses of large datasets...
Buyuk, C; Gunduz, K; Avsever, H
2018-01-01
The aim of this investigation was to evaluate the length, thickness, sagittal and transverse angulations and the morphological variations of the stylohyoid complex (SHC), to assess their probable associations with age and gender, and to investigate its prevalence in a wide range of a Turkish sub-population by using cone beam computed tomography (CBCT). The CBCT images of 1000 patients were evaluated retrospectively. The length, thickness, sagittal and transverse angulations, morphological variations and ossification degrees of SHC were evaluated on multiplanar reconstructions (MPR) and three-dimensional (3D) volume rendering (3DVR) images. The data were analysed statistically by using nonparametric tests, Pearson's correlation coefficient, Student's t test, χ² test and one-way ANOVA. Statistical significance was considered at p 35 mm). The mean sagittal angle value was measured to be 72.24° and the mean transverse angle value was 70.81°. Scalariform shape, elongated type and nodular calcification pattern have the highest mean age values between the morphological groups, respectively. Calcified outline was the most prevalent calcification pattern in males. There was no correlation between length and the calcification pattern groups while scalariform shape and pseudoarticular type were the longest variations. We observed that as the anterior sagittal angle gets wider, SHC tends to get longer. The most observed morphological variations were linear shape, elongated type and calcified outline pattern. Detailed studies on the classification will contribute to the literature. (Folia Morphol 2018; 77, 1: 79-89).
Energy Technology Data Exchange (ETDEWEB)
Chertkov, Michael [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Ahn, Sungsoo [Korea Advanced Inst. Science and Technology (KAIST), Daejeon (Korea, Republic of); Shin, Jinwoo [Korea Advanced Inst. Science and Technology (KAIST), Daejeon (Korea, Republic of)
2017-05-25
Computing the partition function is the most important statistical inference task arising in applications of Graphical Models (GM). Since it is computationally intractable, approximate methods have been used to resolve the issue in practice, where mean-field (MF) and belief propagation (BP) are arguably the most popular and successful approaches of a variational type. In this paper, we propose two new variational schemes, coined Gauged-MF (G-MF) and Gauged-BP (G-BP), improving MF and BP, respectively. Both provide lower bounds for the partition function by utilizing the so-called gauge transformation, which modifies factors of the GM while keeping the partition function invariant. Moreover, we prove that both G-MF and G-BP are exact for GMs with a single loop of a special structure, even though the bare MF and BP perform badly in this case. Our extensive experiments, on complete GMs of relatively small size and on large GMs (up to 300 variables), confirm that the newly proposed algorithms outperform and generalize MF and BP.
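To make the idea of a variational lower bound on the partition function concrete, here is a minimal sketch of the bare mean-field bound only; it does not implement the gauged schemes of the record above. It assumes a small pairwise model over spins in {-1, +1} with log-weight sum_i h[i]*x[i] + sum_{i<j} J[i][j]*x[i]*x[j]; the function names and the example couplings are hypothetical choices made for illustration.

```python
import itertools
import math


def log_partition_exact(J, h):
    """Brute-force log Z by enumerating all 2^n spin configurations."""
    n = len(h)
    total = 0.0
    for x in itertools.product((-1, 1), repeat=n):
        e = sum(h[i] * x[i] for i in range(n))
        e += sum(J[i][j] * x[i] * x[j]
                 for i in range(n) for j in range(i + 1, n))
        total += math.exp(e)
    return math.log(total)


def spin_entropy(m):
    """Entropy of a single +/-1 spin with mean m."""
    ent = 0.0
    for p in ((1 + m) / 2, (1 - m) / 2):
        if p > 0:
            ent -= p * math.log(p)
    return ent


def mean_field_bound(J, h, sweeps=200):
    """Bare mean-field lower bound on log Z for a fully factorized q."""
    n = len(h)
    m = [0.0] * n  # magnetizations E_q[x_i]
    for _ in range(sweeps):
        for i in range(n):
            field = h[i] + sum(J[i][j] * m[j] for j in range(n) if j != i)
            m[i] = math.tanh(field)  # coordinate-wise fixed-point update
    # Bound = expected log-weight under q plus entropy of q
    energy = sum(h[i] * m[i] for i in range(n))
    energy += sum(J[i][j] * m[i] * m[j]
                  for i in range(n) for j in range(i + 1, n))
    return energy + sum(spin_entropy(mi) for mi in m)


# Hypothetical 3-spin model on a complete graph (symmetric J, zero diagonal)
J = [[0.0, 0.5, 0.5], [0.5, 0.0, 0.5], [0.5, 0.5, 0.0]]
h = [0.1, -0.2, 0.3]
print(mean_field_bound(J, h), "<=", log_partition_exact(J, h))
```

Any factorized distribution yields a valid lower bound, so the inequality holds even if the fixed-point iteration has not fully converged; with all couplings set to zero the bound is tight.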
Some calculations of the failure statistics of coated fuel particles
International Nuclear Information System (INIS)
Martin, D.G.; Hobbs, J.E.
1977-03-01
Statistical variations of coated fuel particle parameters were considered in stress model calculations and the resulting particle failure fraction versus burn-up evaluated. Variations in the following parameters were considered simultaneously: kernel diameter and porosity, thickness of the buffer, seal, silicon carbide and inner and outer pyrocarbon layers, which were all assumed to be normally distributed, and the silicon carbide fracture stress which was assumed to follow a Weibull distribution. Two methods, based respectively on random sampling and convolution of the variations were employed and applied to particles manufactured by Dragon Project and RFL Springfields. Convolution calculations proved the more satisfactory. In the present calculations variations in the silicon carbide fracture stress caused the greatest spread in burn-up for a given change in failure fraction; kernel porosity is the next most important parameter. (author)
Model output statistics applied to wind power prediction
Energy Technology Data Exchange (ETDEWEB)
Joensen, A; Giebel, G; Landberg, L [Risoe National Lab., Roskilde (Denmark); Madsen, H; Nielsen, H A [The Technical Univ. of Denmark, Dept. of Mathematical Modelling, Lyngby (Denmark)
1999-03-01
Being able to predict the output of a wind farm online for a day or two in advance has significant advantages for utilities, such as a better possibility to schedule fossil-fuelled power plants and a better position on electricity spot markets. In this paper prediction methods based on Numerical Weather Prediction (NWP) models are considered. The spatial resolution used in NWP models implies that these predictions are not valid locally at a specific wind farm. Furthermore, due to the non-stationary nature and complexity of the processes in the atmosphere, and occasional changes of NWP models, the deviation between the predicted and the measured wind will be time dependent. If observational data is available, and if the deviation between the predictions and the observations exhibits systematic behavior, this should be corrected for; if statistical methods are used, this approach is usually referred to as MOS (Model Output Statistics). The influence of atmospheric turbulence intensity, topography, prediction horizon length and auto-correlation of wind speed and power is considered, and to take the time-variations into account, adaptive estimation methods are applied. Three estimation techniques are considered and compared: Extended Kalman Filtering, recursive least squares and a new modified recursive least squares algorithm. (au) EU-JOULE-3. 11 refs.
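As an illustration of the adaptive estimation mentioned in the record above, the following is a minimal sketch of standard recursive least squares with exponential forgetting, fitting a linear correction y ≈ a + b·x (for example, measured wind against NWP-predicted wind). It is not the modified algorithm proposed by the authors; the function name, the forgetting factor and the initial covariance scale are assumptions.

```python
def rls_fit(xs, ys, forgetting=0.99, delta=1000.0):
    """Recursive least squares for y ~ a + b*x with exponential forgetting.

    forgetting < 1 down-weights old samples so the fit can track slow drift.
    delta sets the (large) initial covariance, i.e. a weak prior on [a, b].
    """
    theta = [0.0, 0.0]                    # current estimate [a, b]
    P = [[delta, 0.0], [0.0, delta]]      # covariance-like matrix
    for x, y in zip(xs, ys):
        phi = [1.0, x]                    # regressor vector
        P_phi = [P[0][0] * phi[0] + P[0][1] * phi[1],
                 P[1][0] * phi[0] + P[1][1] * phi[1]]
        denom = forgetting + phi[0] * P_phi[0] + phi[1] * P_phi[1]
        gain = [P_phi[0] / denom, P_phi[1] / denom]
        err = y - (theta[0] * phi[0] + theta[1] * phi[1])  # prediction error
        theta = [theta[0] + gain[0] * err, theta[1] + gain[1] * err]
        phi_P = [phi[0] * P[0][0] + phi[1] * P[1][0],
                 phi[0] * P[0][1] + phi[1] * P[1][1]]
        # P <- (P - gain * phi^T P) / forgetting
        P = [[(P[i][j] - gain[i] * phi_P[j]) / forgetting for j in range(2)]
             for i in range(2)]
    return theta


# Synthetic, noise-free data generated from a = 2.0, b = 0.5
xs = [float(i % 10) for i in range(200)]
ys = [2.0 + 0.5 * x for x in xs]
print(rls_fit(xs, ys))
```

A forgetting factor closer to 1 gives smoother but slower-adapting estimates; smaller values track changes in the NWP model faster at the cost of noisier coefficients.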
Wang, Fang; Dang, Cong; Chang, Xuefei; Tian, Junce; Lu, Zengbin; Chen, Yang; Ye, Gongyin
2017-02-01
The current difficulty facing risk evaluations of Bacillus thuringiensis (Bt) crops on nontarget arthropods (NTAs) is the lack of criteria for determining what represents unacceptable risk. In this study, we investigated the biological parameters in the laboratory and field population abundance of Nilaparvata lugens (Hemiptera: Delphacidae) on two Bt rice lines and the non-Bt parent, together with 14 other conventional rice cultivars. Significant differences were found in nymphal duration and fecundity of N. lugens fed on Bt rice KMD2, as well as field population density on 12 October, compared with the non-Bt parent. However, compared with the variation among conventional rice cultivars, the variation of each parameter between Bt rice and the non-Bt parent was much smaller, which can be easily seen from low-high bar graphs and also the coefficient of variation (C.V.) value. The variation among conventional cultivars is proposed to be used as a criterion for the safety assessment of Bt rice on NTAs, particularly when statistically significant differences in several parameters are found between Bt rice and its non-Bt parent. Coefficient of variation is suggested as a promising parameter for ecological risk judgement of IRGM rice on NTAs.
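The comparison described in the record above rests on the coefficient of variation, the sample standard deviation divided by the mean. A minimal sketch follows; the function name is hypothetical and the numbers are invented for illustration only, not taken from the study.

```python
import statistics


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean (dimensionless)."""
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("CV is undefined for a zero mean")
    return statistics.stdev(values) / mean


# Invented values for one parameter, for illustration only
bt_and_parent = [10.2, 10.6]                       # Bt line and non-Bt parent
conventional = [8.1, 9.4, 10.3, 11.8, 12.5, 9.9]   # conventional cultivars

cv_pair = coefficient_of_variation(bt_and_parent)
cv_conv = coefficient_of_variation(conventional)
print(cv_pair < cv_conv)
```

Under the criterion proposed in the abstract, a Bt versus parent CV well below the CV among conventional cultivars would indicate that the difference, even if statistically significant, lies within the normal varietal range.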
Statistical analysis of next generation sequencing data
Nettleton, Dan
2014-01-01
Next Generation Sequencing (NGS) is the latest high throughput technology to revolutionize genomic research. NGS generates massive genomic datasets that play a key role in the big data phenomenon that surrounds us today. To extract signals from high-dimensional NGS data and make valid statistical inferences and predictions, novel data analytic and statistical techniques are needed. This book contains 20 chapters written by prominent statisticians working with NGS data. The topics range from basic preprocessing and analysis with NGS data to more complex genomic applications such as copy number variation and isoform expression detection. Research statisticians who want to learn about this growing and exciting area will find this book useful. In addition, many chapters from this book could be included in graduate-level classes in statistical bioinformatics for training future biostatisticians who will be expected to deal with genomic data in basic biomedical research, genomic clinical trials and personalized med...
4P: fast computing of population genetics statistics from large DNA polymorphism panels.
Benazzo, Andrea; Panziera, Alex; Bertorelle, Giorgio
2015-01-01
Massive DNA sequencing has significantly increased the amount of data available for population genetics and molecular ecology studies. However, the parallel computation of simple statistics within and between populations from large panels of polymorphic sites is not yet available, making the exploratory analyses of a set or subset of data a very laborious task. Here, we present 4P (parallel processing of polymorphism panels), a stand-alone software program for the rapid computation of genetic variation statistics (including the joint frequency spectrum) from millions of DNA variants in multiple individuals and multiple populations. It handles a standard input file format commonly used to store DNA variation from empirical or simulation experiments. The computational performance of 4P was evaluated using large SNP (single nucleotide polymorphism) datasets from human genomes or obtained by simulations. 4P was faster or much faster than other comparable programs, and the impact of parallel computing using multicore computers or servers was evident. 4P is a useful tool for biologists who need a simple and rapid computer program to run exploratory population genetics analyses in large panels of genomic data. It is also particularly suitable to analyze multiple data sets produced in simulation studies. Unix, Windows, and MacOs versions are provided, as well as the source code for easier pipeline implementations.
Integrated Data Collection Analysis (IDCA) Program - Statistical Analysis of RDX Standard Data Sets
Energy Technology Data Exchange (ETDEWEB)
Sandstrom, Mary M. [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Brown, Geoffrey W. [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Preston, Daniel N. [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Pollard, Colin J. [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Warner, Kirstin F. [Naval Surface Warfare Center (NSWC), Indian Head, MD (United States). Indian Head Division; Sorensen, Daniel N. [Naval Surface Warfare Center (NSWC), Indian Head, MD (United States). Indian Head Division; Remmers, Daniel L. [Naval Surface Warfare Center (NSWC), Indian Head, MD (United States). Indian Head Division; Phillips, Jason J. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Shelley, Timothy J. [Air Force Research Lab. (AFRL), Tyndall AFB, FL (United States); Reyes, Jose A. [Applied Research Associates, Tyndall AFB, FL (United States); Hsu, Peter C. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Reynolds, John G. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
2015-10-30
The Integrated Data Collection Analysis (IDCA) program is conducting a Proficiency Test for Small-Scale Safety and Thermal (SSST) testing of homemade explosives (HMEs). Described here are statistical analyses of the results for impact, friction, electrostatic discharge, and differential scanning calorimetry analysis of the RDX Type II Class 5 standard. The material was tested as a well-characterized standard several times during the proficiency study to assess differences among participants and the range of results that may arise for well-behaved explosive materials. The analyses show that there are detectable differences among the results from IDCA participants. While these differences are statistically significant, most of them can be disregarded for comparison purposes to assess potential variability when laboratories attempt to measure identical samples using methods assumed to be nominally the same. The results presented in this report include the average sensitivity results for the IDCA participants and the ranges of values obtained. The ranges represent variation about the mean values of the tests of between 26% and 42%. The magnitude of this variation is attributed to differences in operator, method, and environment as well as the use of different instruments that are also of varying age. The results appear to be a good representation of the broader safety testing community based on the range of methods, instruments, and environments included in the IDCA Proficiency Test.
Giraldo, Mario A.; Bosch, David; Madden, Marguerite; Usery, Lynn; Kvien, Craig
2008-08-01
Summary: This research addressed the temporal and spatial variation of soil moisture (SM) in a heterogeneous landscape. The research objective was to investigate soil moisture variation in eight homogeneous 30 by 30 m plots, similar to the pixel size of a Landsat Thematic Mapper (TM) or Enhanced Thematic Mapper plus (ETM+) image. The plots were adjacent to eight stations of an in situ soil moisture network operated by the United States Department of Agriculture-Agriculture Research Service (USDA-ARS) in Tifton, GA. We also studied five adjacent agricultural fields to examine the effect of different landuses/land covers (LULC) (grass, orchard, peanuts, cotton and bare soil) on the temporal and spatial variation of soil moisture. Soil moisture field data were collected on eight occasions throughout 2005 and January 2006 to establish comparisons within and among eight homogeneous plots. Consistently throughout time, analysis of variance (ANOVA) showed high variation in the soil moisture behavior among the plots and high homogeneity in the soil moisture behavior within them. A precipitation analysis for the eight sampling dates throughout the year 2005 showed similar rainfall conditions for the eight study plots. Therefore, soil moisture variation among locations was explained by in situ local conditions. Temporal stability geostatistical analysis showed that soil moisture has high temporal stability within the small plots and that a single point reading can be used to monitor soil moisture status for the plot within a maximum 3% volume/volume (v/v) soil moisture variation. Similarly, t-statistic analysis showed that soil moisture status in the upper soil layer changes within 24 h. We found statistical differences in the soil moisture between the different LULC in the agricultural fields as well as statistical differences between these fields and the adjacent 30 by 30 m plots. From this analysis, it was demonstrated that spatial proximity is not enough to produce similar
Statistical timing for parametric yield prediction of digital integrated circuits
Jess, J.A.G.; Kalafala, K.; Naidu, S.R.; Otten, R.H.J.M.; Visweswariah, C.
2006-01-01
Uncertainty in circuit performance due to manufacturing and environmental variations is increasing with each new generation of technology. It is therefore important to predict the performance of a chip as a probabilistic quantity. This paper proposes three novel path-based algorithms for statistical
Visualization of the variability of 3D statistical shape models by animation.
Lamecker, Hans; Seebass, Martin; Lange, Thomas; Hege, Hans-Christian; Deuflhard, Peter
2004-01-01
Models of the 3D shape of anatomical objects and the knowledge about their statistical variability are of great benefit in many computer-assisted medical applications like image analysis, therapy or surgery planning. Statistical models of shape have successfully been applied to automate the task of image segmentation. The generation of 3D statistical shape models requires the identification of corresponding points on two shapes. This remains a difficult problem, especially for shapes of complicated topology. In order to interpret and validate variations encoded in a statistical shape model, visual inspection is of great importance. This work describes the generation and interpretation of statistical shape models of the liver and the pelvic bone.
International Nuclear Information System (INIS)
Aminu Ibrahim; Hafizan Juahir; Mohd Ekhwan Toriman; Mustapha, A.; Azman Azid; Isiyaka, H.A.
2015-01-01
Multivariate statistical techniques including cluster analysis, discriminant analysis, and principal component analysis/factor analysis were applied to investigate the spatial variation and pollution sources in the Terengganu river basin during 5 years of monitoring 13 water quality parameters at thirteen different stations. Cluster analysis (CA) classified the 13 stations into 2 clusters, low polluted (LP) and moderate polluted (MP), based on similar water quality characteristics. Discriminant analysis (DA) rendered significant data reduction with 4 parameters (pH, NH3-NL, PO4 and EC) and correct assignation of 95.80%. The PCA/FA applied to the data sets yielded five latent factors accounting for 72.42% of the total variance in the water quality data. The obtained varifactors indicate that the parameters responsible for water quality variations are mainly related to domestic waste, industrial, runoff and agricultural sources (anthropogenic activities). Therefore, multivariate techniques are important in environmental management. (author)
Statistical lamb wave localization based on extreme value theory
Harley, Joel B.
2018-04-01
Guided wave localization methods based on delay-and-sum imaging, matched field processing, and other techniques have been designed and researched to create images that locate and describe structural damage. The maximum value of these images typically represents an estimated damage location. Yet, it is often unclear if this maximum value, or any other value in the image, is a statistically significant indicator of damage. Furthermore, there are currently few, if any, approaches to assess the statistical significance of guided wave localization images. As a result, we present statistical delay-and-sum and statistical matched field processing localization methods to create statistically significant images of damage. Our framework uses constant rate of false alarm statistics and extreme value theory to detect damage with little prior information. We demonstrate our methods with in situ guided wave data from an aluminum plate to detect two 0.75 cm diameter holes. Our results show an expected improvement in statistical significance as the number of sensors increases. With seventeen sensors, both methods successfully detect damage with statistical significance.
Pye, K.; Blott, S. J.
2008-12-01
Monitoring of frontal dune erosion and accretion on the Sefton coast in northwest England over the past 50 years has revealed significant spatial and temporal variations. Previous work has shown that the spatial variations primarily reflect longshore differences in beach and nearshore morphology, energy regime and sediment budget, but the causes of temporal variations have not previously been studied in detail. This paper presents the results of work carried out to test the hypothesis that a major cause of temporal variation is changes in the frequency and magnitude of storms, surges and resulting high tides. Dune toe erosion/accretion records dating from 1958 have been compared with tide gauge records at Liverpool and Heysham. Relatively high dune erosion rates at Formby Point 1958-1968 were associated with a relatively large number of storm tides. Slower erosion at Formby, and relatively rapid accretion in areas to the north and south, occurred during the 1970's and 1980's when there were relatively few major storm tides. After 1990 rates of dune erosion at Formby increased again, and dunes to the north and south experienced slower accretion. During this period high storm tides have been more frequent, and the annual number of hours with water levels above the critical level for dune erosion has increased significantly. An increase in the rate of mean sea-level rise at both Liverpool and Heysham is evident since 1990, but we conclude that this factor is of less importance than the occurrence of extreme high tides and wave action associated with storms. The incidence of extreme high tides shows an identifiable relationship with the lunar nodal tidal cycle, but the evidence indicates that meteorological forcing has also had a significant effect. Storms and surges in the eastern Irish Sea are associated with Atlantic depressions whose direction and rate of movement have a strong influence on wind speeds, wave energy and the height of surge tides. However
Yang, Sinil; Oh, Jaiho
2018-02-01
Seasonal extreme wave statistics were reproduced by using the 25-km-grid global wave model of WAVEWATCH-III. The results showed that the simulated wave dataset for the present climate (1979-2009) was similar to Climate Forecast System Reanalysis (CFSR) wave data. Statistics such as the root mean squared error (RMSE) and correlation coefficient (CC) over the western North Pacific (WNP) basin were 0.5 m and 0.69 over the analysis domain. The largest trends and standard deviation were around the southern coast of Japan and western edge of the WNP. Linear regression analysis was employed to identify the relationship between the leading principal components (PCs) of significant wave heights (SWHs) in the peak season of July to September and sea surface temperature (SST) anomalies in the equatorial Pacific. The results indicated that the inter-annual variability of SWH can be associated with the El Niño-Southern Oscillation in the peak season. The CC between the first PC of the SWH and anomalies in the Nino 3.4 SST index was also significant at a 99% confidence level. Significant variations in the SWH are affected by tropical cyclones (TCs) caused by increased SST anomalies. The genesis and development of simulated TCs can be important to the variation in SWHs for the WNP in the peak season. Therefore, we can project the variability of SWHs through TC activity based on changes in SST conditions for the equatorial Pacific in the future.
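The two agreement statistics quoted in the record above, root mean squared error and the correlation coefficient, can be sketched as follows. This is a generic implementation with hypothetical function names and made-up numbers, not the authors' processing chain.

```python
import math


def rmse(simulated, reference):
    """Root mean squared error between paired series."""
    n = len(simulated)
    return math.sqrt(sum((s - r) ** 2 for s, r in zip(simulated, reference)) / n)


def pearson_cc(simulated, reference):
    """Pearson correlation coefficient between paired series."""
    n = len(simulated)
    mean_s = sum(simulated) / n
    mean_r = sum(reference) / n
    cov = sum((s - mean_s) * (r - mean_r) for s, r in zip(simulated, reference))
    var_s = sum((s - mean_s) ** 2 for s in simulated)
    var_r = sum((r - mean_r) ** 2 for r in reference)
    return cov / math.sqrt(var_s * var_r)


# Made-up significant wave heights in metres, for illustration only
model_swh = [1.0, 2.0, 3.0, 4.0]
reanalysis_swh = [1.5, 2.5, 3.5, 4.5]
print(rmse(model_swh, reanalysis_swh), pearson_cc(model_swh, reanalysis_swh))
```

The example shows why both numbers are reported together: a constant offset leaves the correlation perfect while still producing a nonzero RMSE.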
Wu, Johnny C; Gardner, David P; Ozer, Stuart; Gutell, Robin R; Ren, Pengyu
2009-08-28
The accurate prediction of the secondary and tertiary structure of an RNA with different folding algorithms is dependent on several factors, including the energy functions. However, an RNA higher-order structure cannot be predicted accurately from its sequence based on a limited set of energy parameters. The inter- and intramolecular forces between this RNA and other small molecules and macromolecules, in addition to other factors in the cell such as pH, ionic strength, and temperature, influence the complex dynamics associated with transition of a single stranded RNA to its secondary and tertiary structure. Since all of the factors that affect the formation of an RNA's 3D structure cannot be determined experimentally, statistically derived potential energy has been used in the prediction of protein structure. In the current work, we evaluate the statistical free energy of various secondary structure motifs, including base-pair stacks, hairpin loops, and internal loops, using their statistical frequency obtained from the comparative analysis of more than 50,000 RNA sequences stored in the RNA Comparative Analysis Database (rCAD) at the Comparative RNA Web (CRW) Site. Statistical energy was computed from the structural statistics for several datasets. While the statistical energy for a base-pair stack correlates with experimentally derived free energy values, suggesting a Boltzmann-like distribution, variation is observed between different molecules and their location on the phylogenetic tree of life. Our statistical energy values calculated for several structural elements were utilized in the Mfold RNA-folding algorithm. The combined statistical energy values for base-pair stacks, hairpins and internal loop flanks result in a significant improvement in the accuracy of secondary structure prediction; the hairpin flanks contribute the most.
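A common way to turn observed motif frequencies into a statistical energy is the inverse Boltzmann relation, E = -RT ln(f_observed / f_reference). The sketch below assumes that form; the abstract above does not state the exact formula or reference state the authors used, so the function name, the reference counts and the temperature are all assumptions, and the counts are invented.

```python
import math

GAS_CONSTANT = 0.0019872  # kcal / (mol K)


def statistical_energies(observed, reference, temperature=310.15):
    """Inverse-Boltzmann energy per motif: -RT * ln(f_obs / f_ref).

    observed and reference map motif name -> count; motifs with a zero
    count in either table are skipped because the logarithm is undefined.
    """
    obs_total = sum(observed.values())
    ref_total = sum(reference.values())
    energies = {}
    for motif, count in observed.items():
        ref_count = reference.get(motif, 0)
        if count == 0 or ref_count == 0:
            continue
        ratio = (count / obs_total) / (ref_count / ref_total)
        energies[motif] = -GAS_CONSTANT * temperature * math.log(ratio)
    return energies


# Invented counts for two base-pair stacks, for illustration only
observed = {"GC/CG": 80, "AU/UA": 20}
reference = {"GC/CG": 50, "AU/UA": 50}
print(statistical_energies(observed, reference))
```

Motifs seen more often than the reference expects receive negative (favourable) energies, and under-represented motifs receive positive ones, which is the sense in which frequency statistics can stand in for measured free energies.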
Lee, Sang-Hee
2005-07-01
This study uses data resampling to test the null hypothesis that the degree of variation in the cranial capacity of the Dmanisi hominid sample is within the range of variation of a single species. The statistical significance of the variation in the Dmanisi sample is examined using simulated distributions based on comparative samples of modern humans, chimpanzees, and gorillas. Results show that it is unlikely to find the maximum difference observed in the Dmanisi sample in distributions of female-female pairs from comparative single-species samples. Given that two sexes are represented, the difference in the Dmanisi sample is not enough to reject the null hypothesis of a single species. Results of this study suggest no compelling reason to invoke multiple taxa to explain variation in the cranial capacity of the Dmanisi hominids. (c) 2004 Wiley-Liss, Inc.
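The resampling logic described in the record above can be sketched as follows: draw many small samples from a single-species reference collection and count how often their maximum difference is at least as large as the one observed. The sampling scheme (with replacement), the function name and all numbers are assumptions for illustration; the study's actual procedure and data are not reproduced here.

```python
import random


def range_resampling_p(reference, observed_range, sample_size,
                       n_draws=10000, seed=0):
    """Fraction of resampled sets whose max-min spread >= observed_range."""
    rng = random.Random(seed)  # fixed seed so the result is repeatable
    hits = 0
    for _ in range(n_draws):
        draw = rng.choices(reference, k=sample_size)  # with replacement
        if max(draw) - min(draw) >= observed_range:
            hits += 1
    return hits / n_draws


# Invented cranial capacities (cc) for a single-species reference sample
reference_cc = [float(v) for v in range(1000, 1500, 10)]
p_value = range_resampling_p(reference_cc, observed_range=175.0, sample_size=3)
print(p_value)
```

A small returned fraction means the observed spread is rarely matched within one species; a large one means a single-species explanation cannot be rejected on this statistic alone.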
Geometric morphometrics in primatology: craniofacial variation in Homo sapiens and Pan troglodytes.
Lynch, J M; Wood, C G; Luboga, S A
1996-01-01
Traditionally, morphometric studies have relied on statistical analysis of distances, angles or ratios to investigate morphometric variation among taxa. Recently, geometric techniques have been developed for the direct analysis of landmark data. In this paper, we offer a summary (with examples) of three of these newer techniques, namely shape coordinate, thin-plate spline and relative warp analyses. Shape coordinate analysis detected significant craniofacial variation between 4 modern human populations, with African and Australian Aboriginal specimens being relatively prognathous compared with their Eurasian counterparts. In addition, the Australian specimens exhibited greater basicranial flexion than all other samples. The observed relationships between size and craniofacial shape were weak. The decomposition of shape variation into affine and non-affine components is illustrated via a thin-plate spline analysis of Homo and Pan cranial landmarks. We note differences between Homo and Pan in the degree of prognathism and basicranial flexion and the position and orientation of the foramen magnum. We compare these results with previous studies of these features in higher primates and discuss the utility of geometric morphometrics as a tool in primatology and physical anthropology. We conclude that many studies of morphological variation, both within and between taxa, would benefit from the graphical nature of these techniques.
Hohn, M. Ed; Nuhfer, E.B.; Vinopal, R.J.; Klanderman, D.S.
1980-01-01
Classifying very fine-grained rocks through fabric elements provides information about depositional environments, but is subject to the biases of visual taxonomy. To evaluate the statistical significance of an empirical classification of very fine-grained rocks, samples from Devonian shales in four cored wells in West Virginia and Virginia were measured for 15 variables: quartz, illite, pyrite and expandable clays determined by X-ray diffraction; total sulfur, organic content, inorganic carbon, matrix density, bulk density, porosity, silt, as well as density, sonic travel time, resistivity, and γ-ray response measured from well logs. The four lithologic types comprised: (1) sharply banded shale, (2) thinly laminated shale, (3) lenticularly laminated shale, and (4) nonbanded shale. Univariate and multivariate analyses of variance showed that the lithologic classification reflects significant differences for the variables measured, differences that can be detected independently of stratigraphic effects. Little-known statistical methods found useful in this work included: the multivariate analysis of variance with more than one effect, simultaneous plotting of samples and variables on canonical variates, and the use of parametric ANOVA and MANOVA on ranked data. © 1980 Plenum Publishing Corporation.
Geographic variation in expenditures for workers' compensation physician claims.
Miller, T R; Levy, D T
1997-07-01
We examine interstate variations in the cost of claims for physician care using injury claims from Workers' Compensation, and consider some of the factors that may explain cost differences. Multivariate regression analysis is used to isolate state variations, while controlling for personal and injury characteristics, and state characteristics. Statistical analyses reveal considerable variation in expenditures for physician care of injuries across states, even after controlling for case mix and state characteristics. We also find that the presence of HMOs and the share of general practitioners among physicians are associated with lower claims, and that the percent of the state that is urban is associated with higher claims. The large variation in costs suggests a potential to affect the costs of physician care for work-related injuries.
Medrano, Mónica; Herrera, Carlos M; Bazaga, Pilar
2014-10-01
The ecological significance of epigenetic variation has been generally inferred from studies on model plants under artificial conditions, but the importance of epigenetic differences between individuals as a source of intraspecific diversity in natural plant populations remains essentially unknown. This study investigates the relationship between epigenetic variation and functional plant diversity by conducting epigenetic (methylation-sensitive amplified fragment length polymorphisms, MSAP) and genetic (amplified fragment length polymorphisms, AFLP) marker-trait association analyses for 20 whole-plant, leaf and regenerative functional traits in a large sample of wild-growing plants of the perennial herb Helleborus foetidus from ten sampling sites in south-eastern Spain. Plants differed widely in functional characteristics, and exhibited greater epigenetic than genetic diversity, as shown by per cent polymorphism of MSAP fragments (92%) or markers (69%) greatly exceeding that for AFLP ones (41%). After controlling for genetic structuring and possible cryptic relatedness, every functional trait considered exhibited a significant association with at least one AFLP or MSAP marker. A total of 27 MSAP (13.0% of total) and 12 AFLP (4.4%) markers were involved in significant associations, which explained on average 8.2% and 8.0% of trait variance, respectively. Individual MSAP markers were more likely to be associated with functional traits than AFLP markers. Between-site differences in multivariate functional diversity were directly related to variation in multilocus epigenetic diversity after multilocus genetic diversity was statistically accounted for. Results suggest that epigenetic variation can be an important source of intraspecific functional diversity in H. foetidus, possibly endowing this species with the capacity to exploit a broad range of ecological conditions despite its modest genetic diversity. © 2014 John Wiley & Sons Ltd.
Exhaled nitric oxide - circadian variations in healthy subjects
Directory of Open Access Journals (Sweden)
Antosova M
2009-12-01
Abstract Objective Exhaled nitric oxide (eNO) has been suggested as a marker of airway inflammatory diseases. The level of eNO is influenced by many factors, including age, sex, menstrual cycle, exercise, food, drugs, etc. The aim of our study was to investigate a potential influence of circadian variation on eNO level in healthy subjects. Methods Measurements were performed in 44 women and 10 men, non-smokers, without respiratory tract infection in the last 2 weeks. The eNO was detected at 4-hour intervals from 6 a.m. to 10 p.m. using an NIOX analyzer. We followed the ATS/ERS guidelines for eNO measurement and analysis. Results Peak eNO levels were observed at 10 a.m. (11.1 ± 7.2 ppb); the lowest value was detected at 10 p.m. (10.0 ± 5.8 ppb). The difference was statistically significant (paired t-test, P Conclusions The daily variations in eNO, with the peak in the morning hours, could be of importance in clinical practice regarding the choice of optimal time for monitoring eNO in patients with respiratory disease.
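The paired t-test named in the abstract compares two measurements taken on the same subjects by testing whether the mean of the within-subject differences is zero. The following is a minimal sketch of the test statistic only; the function name and the readings are hypothetical and are not data from the study.

```python
import math
import statistics


def paired_t_statistic(first, second):
    """Return (t, degrees_of_freedom) for a paired t-test on two equal-length samples."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two samples of equal length with at least 2 pairs")
    # Work on within-subject differences, which is what makes the test "paired".
    diffs = [a - b for a, b in zip(first, second)]
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample SD (n - 1 denominator)
    if sd_diff == 0:
        raise ValueError("all differences are identical; t is undefined")
    n = len(diffs)
    t = mean_diff / (sd_diff / math.sqrt(n))
    return t, n - 1


# Hypothetical eNO readings (ppb) for five subjects; illustration only.
morning = [12.0, 9.5, 14.0, 8.0, 11.5]
evening = [11.0, 9.0, 12.5, 7.5, 10.0]
t_stat, dof = paired_t_statistic(morning, evening)
```

The resulting t value would then be compared against a t distribution with `dof` degrees of freedom to obtain a P value.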
International Nuclear Information System (INIS)
Zhao, J; Tang, J; Wang, K W
2008-01-01
The frequency-shift-based damage detection method offers advantages such as global detection capability and easy implementation, but also suffers from drawbacks that include low detection accuracy and sensitivity and the difficulty of identifying damage using a small number of measurable frequencies. Moreover, the damage detection/identification performance is inevitably affected by the uncertainty/variations in the baseline model. In this research, we investigate an enhanced statistical damage identification method using tunable piezoelectric transducer circuitry. The tunable piezoelectric transducer circuitry can provide much richer information on the frequency shift (before and after damage occurrence). The circuitry elements, meanwhile, can be directly and accurately measured and thus can be considered uncertainty-free. A statistical damage identification algorithm is formulated which can identify both the mean and variance of the elemental property change. Our analysis indicates that the integration of the tunable piezoelectric transducer circuitry can significantly enhance the robustness of the frequency-shift-based damage identification approach under uncertainty and noise
Development of a statistically-based lower bound fracture toughness curve (K_IR curve)
International Nuclear Information System (INIS)
Wullaert, R.A.; Server, W.L.; Oldfield, W.; Stahlkopf, K.E.
1977-01-01
A program of initiation fracture toughness measurements on fifty heats of nuclear pressure vessel production materials (including weldments) was used to develop a methodology for establishing a revised reference toughness curve. The new methodology was statistically developed and provides a predefined confidence limit (or tolerance limit) for fracture toughness based upon many heats of a particular type of material. Overall reference curves were developed for seven specific materials using large specimen static and dynamic fracture toughness results. The heat-to-heat variation was removed by normalizing both the fracture toughness and temperature data with the precracked Charpy tanh curve coefficients for each particular heat. The variance and distribution about the curve were determined, and lower bounds of predetermined statistical significance were drawn based upon a Pearson distribution in the lower shelf region (since the data were skewed to high values) and a t-distribution in the transition temperature region (since the data were normally distributed)
Wargo, Andrew R.; Kell, Alison M.; Scott, Robert J.; Thorgaard, Gary H.; Kurath, Gael
2012-01-01
Little is known about the factors that drive the high levels of between-host variation in pathogen burden that are frequently observed in viral infections. Here, two factors thought to impact viral load variability, host genetic diversity and stochastic processes linked with viral entry into the host, were examined. This work was conducted with the aquatic vertebrate virus, Infectious hematopoietic necrosis virus (IHNV), in its natural host, rainbow trout. It was found that in controlled in vivo infections of IHNV, a suggestive trend of reduced between-fish viral load variation was observed in a clonal population of isogenic trout compared to a genetically diverse population of out-bred trout. However, this trend was not statistically significant for any of the four viral genotypes examined, and high levels of fish-to-fish variation persisted even in the isogenic trout population. A decrease in fish-to-fish viral load variation was also observed in virus injection challenges that bypassed the host entry step, compared to fish exposed to the virus through the natural water-borne immersion route of infection. This trend was significant for three of the four virus genotypes examined and suggests host entry may play a role in viral load variability. However, high levels of viral load variation also remained in the injection challenges. Together, these results indicate that although host genetic diversity and viral entry may play some role in between-fish viral load variation, they are not major factors. Other biological and non-biological parameters that may influence viral load variation are discussed.
Gourgoulis, Vassilios; Koulexidis, Stylianos; Gketzenis, Panagiotis; Tzouras, Grigoris
2018-03-01
Gourgoulis, V, Koulexidis, S, Gketzenis, P, and Tzouras, G. Intra-cyclic velocity variation of the center of mass and hip in breaststroke swimming with maximal intensity. J Strength Cond Res 32(3): 830-840, 2018. The aim of the study was to compare the center of mass (CM) and hip (HIP) intracyclic velocity variation in breaststroke swimming using 3-dimensional kinematic analysis. Nine male breaststroke swimmers, of moderate performance level, swam 25-m breaststroke with maximal intensity, and their movements were recorded, both under and above the water surface, using 8 digital cameras. Their CM and HIP velocities and their intracyclic variations were estimated after manual digitization of 28 selected points on the body in a complete arm and leg breaststroke cycle. Paired sample t-tests, or Wilcoxon tests when the assumption of normality was violated, were used for statistical analyses. In both CM and HIP velocity-time curves, the results revealed a similar pattern of 2 clear peaks associated with the leg and arm propulsive phases and 2 minimal velocities that corresponded to the arm and leg recovery phase and the lag time between the leg and arm propulsive phases, respectively. However, despite this similar general pattern, the HIP minimum resultant velocity was significantly lower, whereas its maximal value was significantly greater, than the corresponding CM values. Consequently, the HIP intracyclic swimming velocity fluctuation significantly overestimates the actual variation of the swimmer's velocity in breaststroke swimming.
Morphological variation in leaf dissection of Rheum palmatum complex (Polygonaceae).
Directory of Open Access Journals (Sweden)
Xu-Mei Wang
AIMS: Rheum palmatum complex comprises all taxa within section Palmata in the genus Rheum, including R. officinale, R. palmatum, R. tanguticum, R. tanguticum var. liupanshanense and R. laciniatum. The identification of the taxa in section Palmata is based primarily on the degree of leaf blade dissection and the shape of the lobes; however, difficulties in species identification may arise from their significant variation. The aim of this study is to analyze the patterns of variation in leaf blade characteristics within and among populations through population-based sampling covering the entire distribution range of R. palmatum complex. METHODS: Samples were taken from 2340 leaves from 780 individuals and 44 populations representing the four species, and the degree of leaf blade dissection and the shape of the lobe were measured to yield a set of quantitative data, which were then statistically analyzed. IMPORTANT FINDINGS: The statistical analysis showed that the degree of leaf blade dissection is continuous from lobed to parted, and the shape of the lobe is also continuous from broadly triangular to lanceolate both within and between populations. We suggested that taxa in section Palmata should be considered as one species. Based on the research on the R. palmatum complex, we considered that the quantitative characteristics were greatly influenced by the environment. Therefore, it is not reliable to delimit the species according to continuous quantitative vegetative characteristics.
Statistical analysis of random pulse trains
International Nuclear Information System (INIS)
Da Costa, G.
1977-02-01
Some experimental and theoretical results concerning the statistical properties of optical beams formed by a finite number of independent pulses are presented. The considered waves (corresponding to each pulse) present important spatial variations of the illumination distribution in a cross-section of the beam, due to the time-varying random refractive index distribution in the active medium. Some examples of this kind of emission are: (a) Free-running ruby laser emission; (b) Mode-locked pulse trains; (c) Randomly excited nonlinear media
Kaygusuz, Ahmet; Haksever, Mehmet; Akduman, Davut; Aslan, Sündüs; Sayar, Zeynep
2014-09-01
The anatomy of the sinonasal area has a very wide range of anatomical variations. The significance of these anatomical variations in the pathogenesis of rhinosinusitis, which is the commonest disease in the region, is still unclear. The aim of the study was to compare the rate of sinonasal anatomical variations with the development and severity of chronic rhinosinusitis. CT images of the paranasal sinuses of 99 individuals were retrospectively reviewed. 65 cases of chronic rhinosinusitis (study group) who had undergone endoscopic sinus surgery were compared with 34 cases without chronic rhinosinusitis (control group). In the study group, the Lund-Mackay score of the sinus disease was also calculated and compared to the rate of related anatomical variations. There were 74 (74.7 %) males and 25 (25.2 %) females with ages ranging from 13 to 70 years (mean 32.2 years). The anatomical variations recorded were: Septal deviation 47 (72.3 %) in study and 25 (73.5 %) in control group, concha bullosa 27 (41.5 %) in study and 18 (52.9 %) in control group, overpneumatized ethmoid bulla 17 (26.1 %) in study and 14 (41.1 %) in control group, pneumatized uncinate 3 (4.6 %) in study and 3 (8.8 %) in control group, agger nasi 42 (64.6 %) in study and 19 (55.8 %) in control group, paradoxical middle turbinates 9 (13.8 %) in study and 4 (11.7 %) in control group, Onodi cell 6 (9.2 %) in study and 2 (5.8 %) in control group, Haller's cells (infraorbital ethmoid cell) 9 (13.8 %) in study and 7 (20.5 %) in control group. None of these differences between study and control group were statistically significant (p > 0.05). Lund-Mackay scores (which were assumed to show the severity of the disease) of the maxillary, ethmoid and frontal sinuses were calculated and compared to the rate of septal deviation, concha bullosa and agger nasi cells. No significant correlation was found (p > 0.05). The results of study showed no statistically significant correlation between sinonasal anatomical
Khana, Diba; Rossen, Lauren M; Hedegaard, Holly; Warner, Margaret
2018-01-01
Hierarchical Bayes models have been used in disease mapping to examine small-scale geographic variation. State-level geographic variation for less common causes of mortality has been reported; however, county-level variation is rarely examined. Due to concerns about statistical reliability and confidentiality, county-level mortality rates based on fewer than 20 deaths are suppressed based on Division of Vital Statistics, National Center for Health Statistics (NCHS) statistical reliability criteria, precluding an examination of spatio-temporal variation in less common causes of mortality such as suicide rates (SRs) at the county level using direct estimates. Existing Bayesian spatio-temporal modeling strategies can be applied via Integrated Nested Laplace Approximation (INLA) in R to a large number of rare causes of mortality to enable examination of spatio-temporal variations on smaller geographic scales such as counties. This method allows examination of spatio-temporal variation across the entire U.S., even where the data are sparse. We used mortality data from 2005-2015 to explore spatio-temporal variation in SRs, as one particular application of the Bayesian spatio-temporal modeling strategy in R-INLA to predict year- and county-specific SRs. Specifically, hierarchical Bayesian spatio-temporal models were implemented with spatially structured and unstructured random effects, correlated time effects, time-varying confounders and space-time interaction terms in the software R-INLA, borrowing strength across both counties and years to produce smoothed county-level SRs. Model-based estimates of SRs were mapped to explore geographic variation.
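The suppression rule mentioned in the abstract (direct county-level rates based on fewer than 20 deaths are withheld) is what leaves gaps that the model-based estimates are meant to fill. A minimal sketch of that rule follows; the function name, the per-100,000 scaling and the example counts are illustrative assumptions, not part of the study or of any NCHS software.

```python
def crude_rate_per_100k(deaths, population, min_deaths=20):
    """Return deaths per 100,000 population, or None when the count is below the threshold."""
    if population <= 0:
        raise ValueError("population must be positive")
    if deaths < min_deaths:
        # Suppressed: too few deaths for a reliable direct estimate.
        return None
    return deaths / population * 100_000


# Hypothetical counties: (deaths, population).
counties = {"A": (25, 50_000), "B": (12, 50_000)}
rates = {name: crude_rate_per_100k(d, p) for name, (d, p) in counties.items()}
```

In this sketch county B has no direct estimate at all, which illustrates why sparse counties cannot be mapped from direct rates alone.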
Miller, John
1994-01-01
Presents an approach to document numbering, document titling, and process measurement which, when used with fundamental techniques of statistical process control, reveals meaningful process-element variation as well as nominal productivity models. (SR)
Identifying clusters of active transportation using spatial scan statistics.
Huang, Lan; Stinchcomb, David G; Pickle, Linda W; Dill, Jennifer; Berrigan, David
2009-08-01
There is an intense interest in the possibility that neighborhood characteristics influence active transportation such as walking or biking. The purpose of this paper is to illustrate how a spatial cluster identification method can evaluate the geographic variation of active transportation and identify neighborhoods with unusually high/low levels of active transportation. Self-reported walking/biking prevalence, demographic characteristics, street connectivity variables, and neighborhood socioeconomic data were collected from respondents to the 2001 California Health Interview Survey (CHIS; N=10,688) in Los Angeles County (LAC) and San Diego County (SDC). Spatial scan statistics were used to identify clusters of high or low prevalence (with and without age-adjustment) and the quantity of time spent walking and biking. The data, a subset from the 2001 CHIS, were analyzed in 2007-2008. Geographic clusters of significantly high or low prevalence of walking and biking were detected in LAC and SDC. Structural variables such as street connectivity and shorter block lengths are consistently associated with higher levels of active transportation, but associations between active transportation and socioeconomic variables at the individual and neighborhood levels are mixed. Only one cluster with less time spent walking and biking among walkers/bikers was detected in LAC, and this was of borderline significance. Age-adjustment affects the clustering pattern of walking/biking prevalence in LAC, but not in SDC. The use of spatial scan statistics to identify significant clustering of health behaviors such as active transportation adds to the more traditional regression analysis that examines associations between behavior and environmental factors by identifying specific geographic areas with unusual levels of the behavior independent of predefined administrative units.
Mirosław Mrozkowiak; Hanna Żukowska
2015-01-01
Mrozkowiak Mirosław, Żukowska Hanna. Znaczenie Dobrego Krzesła, jako elementu szkolnego i domowego środowiska ucznia, w profilaktyce zaburzeń statyki postawy ciała = The significance of Good Chair as part of children’s school and home environment in the preventive treatment of body statistics distortions. Journal of Education, Health and Sport. 2015;5(7):179-215. ISSN 2391-8306. DOI 10.5281/zenodo.19832 http://ojs.ukw.edu.pl/index.php/johs/article/view/2015%3B5%287%29%3A179-215 https:...
Can a significance test be genuinely Bayesian?
Pereira, Carlos A. de B.; Stern, Julio Michael; Wechsler, Sergio
2008-01-01
The Full Bayesian Significance Test, FBST, is extensively reviewed. Its test statistic, a genuine Bayesian measure of evidence, is discussed in detail. Its behavior in some problems of statistical inference like testing for independence in contingency tables is discussed.
Gupta, Munish; Kaplan, Heather C
2017-09-01
Quality improvement (QI) is based on measuring performance over time, and variation in data measured over time must be understood to guide change and make optimal improvements. Common cause variation is natural variation owing to factors inherent to any process; special cause variation is unnatural variation owing to external factors. Statistical process control methods, and particularly control charts, are robust tools for understanding data over time and identifying common and special cause variation. This review provides a practical introduction to the use of control charts in health care QI, with a focus on neonatology. Copyright © 2017 Elsevier Inc. All rights reserved.
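As an illustration of how a control chart separates common cause from special cause variation, the sketch below computes limits for an individuals (XmR) chart, one common chart type chosen here as an assumption; the review itself may emphasize other charts. The limits use the conventional constant 2.66 applied to the mean moving range, and only the simplest rule (a point outside the limits) is checked. The data are hypothetical.

```python
import statistics


def individuals_chart_limits(values):
    """Return (lower limit, center line, upper limit) for an individuals (XmR) chart."""
    if len(values) < 2:
        raise ValueError("need at least two observations")
    center = statistics.mean(values)
    # Moving ranges: absolute differences between consecutive observations.
    moving_ranges = [abs(b - a) for a, b in zip(values, values[1:])]
    half_width = 2.66 * statistics.mean(moving_ranges)
    return center - half_width, center, center + half_width


def special_cause_points(values):
    """Return indices of observations falling outside the control limits."""
    lower, _, upper = individuals_chart_limits(values)
    return [i for i, v in enumerate(values) if v < lower or v > upper]


# Hypothetical weekly measurements; the seventh value is an obvious excursion.
weekly = [10, 12, 11, 13, 12, 11, 30, 12, 11, 10]
flagged = special_cause_points(weekly)
```

Points inside the limits are treated as common cause variation in this sketch; practical use typically adds further run rules.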
Statistics as Unbiased Estimators: Exploring the Teaching of Standard Deviation
Wasserman, Nicholas H.; Casey, Stephanie; Champion, Joe; Huey, Maryann
2017-01-01
This manuscript presents findings from a study about the knowledge for and planned teaching of standard deviation. We investigate how understanding variance as an unbiased (inferential) estimator--not just a descriptive statistic for the variation (spread) in data--is related to teachers' instruction regarding standard deviation, particularly…
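The distinction between variance as a descriptive statistic and as an unbiased estimator can be checked directly on a tiny population. The sketch below, with a hypothetical population and helper name, enumerates every ordered sample of size 2 drawn with replacement and averages two estimators: dividing by n - 1 recovers the population variance on average, while dividing by n underestimates it.

```python
from itertools import product


def variance(sample, ddof):
    """Sum of squared deviations from the sample mean, divided by (n - ddof)."""
    n = len(sample)
    mean = sum(sample) / n
    return sum((x - mean) ** 2 for x in sample) / (n - ddof)


population = [1, 2, 3, 4]  # hypothetical population
population_variance = variance(population, ddof=0)

# All 16 ordered samples of size 2, drawn with replacement.
samples = list(product(population, repeat=2))
mean_unbiased = sum(variance(s, ddof=1) for s in samples) / len(samples)
mean_biased = sum(variance(s, ddof=0) for s in samples) / len(samples)
```

Here `mean_unbiased` matches `population_variance`, while `mean_biased` is half of it, since for n = 2 the factor (n - 1)/n is 1/2.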
CORINE land cover and floristic variation in a Mediterranean wetland.
Giallonardo, Tommaso; Landi, Marco; Frignani, Flavio; Geri, Francesco; Lastrucci, Lorenzo; Angiolini, Claudia
2011-11-01
The aims of the present study were to: (1) investigate whether CORINE land cover classes reflect significant differences in floristic composition, using a very detailed CORINE land cover map (scale 1:5000); (2) decompose the relationships between floristic assemblages and three groups of explanatory variables (CORINE land cover classes, environmental characteristics and spatial structure) into unique and interactive components. Stratified sampling was used to select a set of 100-m² plots in each land cover class identified in the semi-natural wetland surrounding a lake in central Italy. The following six classes were considered: stable meadows, deciduous oak dominated woods, hygrophilous broadleaf dominated woods, heaths and shrublands, inland swamps, canals or watercourses. The relationship between land cover classes and floristic composition was tested using several statistical techniques in order to determine whether the results remained consistent with different procedures. The variation partitioning approach was applied to identify the relative importance of three groups of explanatory variables in relation to floristic variation. The most important predictor was land cover, which explained 20.7% of the variation in plant distribution, although the hypothesis that each land cover class could be associated with a particular floristic pattern was not verified. Multi Response Permutation Analysis did not indicate a strong floristic separability between land cover classes and only 9.5% of species showed a significant indicator value for a specific land cover class. We suggest that land cover classes linked with hygrophilous and herbaceous communities in a wetland may have floristic patterns that vary with fine scale and are not compatible with a land cover map.
On passive scalar derivative statistics in grid turbulence
International Nuclear Information System (INIS)
Tong, C.; Warhaft, Z.
1994-01-01
The probability density function, and related statistics, of scalar (temperature) derivative fluctuations in decaying grid turbulence with an imposed cross-stream, passive linear temperature profile, is studied for a turbulence Reynolds number range, Re_l, varying from 50 to 1200, (corresponding to a Taylor Reynolds number range 30 λ θy has a value of 1.8±0.2 (twice the value observed in shear flows), and has no significant variation with Reynolds number. The ratio of the temperature derivative standard deviation along the gradient to that normal to it is approximately 1.2±0.1, also with no variation with Re. The kurtosis of the derivatives increases approximately as Re_l^0.2. The results show that the rare, intense temperature deviations that produce the skewed scalar derivative increase in frequency, but their area fraction (of the total field) becomes smaller as the Reynolds number increases. Thus, since S_θy remains constant, they become sharper and more intense, occurring deeper in the tails of the probability density function. Measurements in a thermal mixing layer, which has a nonlinear mean temperature profile, are also presented, and these show a similar value of S_θy to the linear profile case. The experiments broadly confirm the two-dimensional numerical simulations of Holzer and Siggia [Phys. Fluids (in press)], as well as other recent simulations, although there are some differences
Phonetic diversity, statistical learning, and acquisition of phonology.
Pierrehumbert, Janet B
2003-01-01
In learning to perceive and produce speech, children master complex language-specific patterns. Daunting language-specific variation is found both in the segmental domain and in the domain of prosody and intonation. This article reviews the challenges posed by results in phonetic typology and sociolinguistics for the theory of language acquisition. It argues that categories are initiated bottom-up from statistical modes in use of the phonetic space, and sketches how exemplar theory can be used to model the updating of categories once they are initiated. It also argues that bottom-up initiation of categories is successful thanks to the perception-production loop operating in the speech community. The behavior of this loop means that the superficial statistical properties of speech available to the infant indirectly reflect the contrastiveness and discriminability of categories in the adult grammar. The article also argues that the developing system is refined using internal feedback from type statistics over the lexicon, once the lexicon is well-developed. The application of type statistics to a system initiated with surface statistics does not cause a fundamental reorganization of the system. Instead, it exploits confluences across levels of representation which characterize human language and make bootstrapping possible.
To Identify the Important Soil Properties Affecting Dinoseb Adsorption with Statistical Analysis
Directory of Open Access Journals (Sweden)
Yiqing Guan
2013-01-01
Investigating the influences of soil characteristic factors on the dinoseb adsorption parameter with different statistical methods would be valuable for quantifying the extent of these influences. The correlation coefficients and the direct and indirect effects of soil characteristic factors on the dinoseb adsorption parameter were analyzed through bivariate correlation analysis and path analysis. With stepwise regression analysis, the factors which had little influence on the adsorption parameter were excluded. Results indicate that pH and CEC had a moderate relationship with, and a lower direct effect on, the dinoseb adsorption parameter due to multicollinearity with other soil factors, and organic carbon and clay contents were found to be the most significant soil factors affecting the dinoseb adsorption process. A regression was therefore set up to explore the relationship between the dinoseb adsorption parameter and these two soil factors: the soil organic carbon and clay contents. About 92% of the variation of the dinoseb sorption coefficient could be attributed to the variation of the soil organic carbon and clay contents.
Kravchuk, Olena; Elliott, Antony; Bhandari, Bhesh
2005-01-01
A simple laboratory experiment, based on the Maillard reaction, served as a project in Introductory Statistics for undergraduates in Food Science and Technology. By using the principles of randomization and replication and reflecting on the sources of variation in the experimental data, students reinforced the statistical concepts and techniques…
Massei, Nicolas; Dieppois, Bastien; Hannah, David; Lavers, David; Fossa, Manuel; Laignel, Benoit; Debret, Maxime
2017-04-01
Geophysical signals oscillate over several time-scales that explain different amounts of their overall variability and may be related to different physical processes. Characterizing and understanding such variabilities in hydrological variations and investigating their determinism is one important issue in a context of climate change, as these variabilities can occasionally be superimposed on a long-term trend possibly due to climate change. It is also important to refine our understanding of time-scale dependent linkages between large-scale climatic variations and hydrological responses on the regional or local scale. Here we investigate such links by conducting a wavelet multiresolution statistical downscaling approach of precipitation in northwestern France (Seine river catchment) over 1950-2016 using sea level pressure (SLP) and sea surface temperature (SST) as indicators of atmospheric and oceanic circulations, respectively. Previous results demonstrated that including multiresolution decomposition in a statistical downscaling model (within a so-called multiresolution ESD model) using SLP as large-scale predictor greatly improved simulation of low-frequency, i.e. interannual to interdecadal, fluctuations observed in precipitation. Building on these results, continuous wavelet transform of simulated precipitation using multiresolution ESD confirmed the good performance of the model to better explain variability at all time-scales. The sensitivity of the model to the choice of the scale and wavelet function used was also tested. It appeared that whatever the wavelet used, the model performed similarly. The spatial patterns of SLP found as the best predictors for all time-scales, which resulted from the wavelet decomposition, revealed different structures according to time-scale, showing possible different determinisms. More particularly, some low-frequency components (3.2-yr and 19.3-yr) showed a much more widespread spatial extension across the Atlantic
Variation of reference evapotranspiration in the central region of Argentina between 1941 and 2010
Directory of Open Access Journals (Sweden)
A.C. de la Casa
2016-03-01
Study region: Changes in reference evapotranspiration (ETo may have important consequences for agricultural suitability in the central region of Argentina. Annual ETo variation was assessed, in terms of both territory and time, for the 7 decades between 1941 and 2010, analyzing the behavior of the 4 atmospheric variables which determine it: temperature, vapor pressure, wind speed and cloud cover. Study focus: The influence of each variable on ETo was evaluated from a multiple regression model and a simple correlation analysis, using climate data from the observation network, and repeating this analysis using interpolated variables. In this grid scheme, linear relationships were determined between ETo and the different key atmospheric variables, plus precipitation (PP, and the t test was applied to establish the statistically significant sectors (P 91% presents a non-significant variation of ETo over time, with a mostly non-significant change of each driving variable, regarding both its relationship with ETo and its own trend of change. The beneficial change in agricultural suitability reported for this water-limited region was found to be produced almost exclusively by increasing PP. Keywords: Reference evapotranspiration, Climate change, Climate variables, Precipitation
Ogunsua, B. O.; Laoye, J. A.
2018-05-01
In this paper, the Tsallis non-extensive q-statistics in ionospheric dynamics was investigated using the total electron content (TEC) obtained from two Global Positioning System (GPS) receiver stations. This investigation was carried out considering the geomagnetically quiet and storm periods. The micro density variation of the ionospheric total electron content was extracted from the TEC data by detrending. The detrended total electron content, which represents the variation in the internal dynamics of the system, was further analyzed within non-extensive statistical mechanics using q-Gaussian methods. Our results reveal that for all the analyzed data sets the Tsallis Gaussian probability distribution (q-Gaussian) with value q > 1 was obtained. It was observed that there is no distinct difference in pattern between the values of q_quiet and q_storm. However, the values of q vary with geophysical conditions and possibly with local dynamics for the two stations. Also observed are the asymmetric pattern of the q-Gaussian and a highly significant level of correlation for the q-index values obtained for the storm periods compared to the quiet periods between the two GPS receiver stations where the TEC was measured. The factors responsible for this variation can be mostly attributed to the varying mechanisms resulting in the self-reorganization of the system dynamics during the storm periods. The results show the existence of long-range correlation for both quiet and storm periods for the two stations.
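For orientation, the q-Gaussian referred to above has the unnormalized form [1 - (1 - q) β x²]^(1/(1 - q)), which reduces to the ordinary Gaussian exp(-β x²) as q approaches 1 and has heavier tails when q > 1. The sketch below evaluates that form only; the function name, the parameter β and the omission of the normalizing constant are choices made for illustration, and nothing here reproduces the authors' fitting procedure.

```python
import math


def q_gaussian_unnormalized(x, q, beta=1.0):
    """Evaluate the unnormalized q-Gaussian [1 - (1 - q) * beta * x^2]^(1 / (1 - q))."""
    if q == 1.0:
        # Limiting case: the ordinary Gaussian kernel.
        return math.exp(-beta * x * x)
    base = 1.0 - (1.0 - q) * beta * x * x
    if base <= 0.0:
        # Only reachable for q < 1, where the distribution has compact support.
        return 0.0
    return base ** (1.0 / (1.0 - q))


# For q > 1 the tails decay as a power law, far more slowly than a Gaussian.
heavy_tail = q_gaussian_unnormalized(3.0, q=1.5)
gaussian_tail = q_gaussian_unnormalized(3.0, q=1.0)
```

Comparing `heavy_tail` with `gaussian_tail` shows why fitted values q > 1 are read as evidence of non-Gaussian, long-range correlated dynamics.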
Directory of Open Access Journals (Sweden)
Guozhu Zhang
Zebrafish have become an important alternative model for characterizing chemical bioactivity, partly due to the efficiency with which systematic, high-dimensional data can be generated. However, these new data present analytical challenges associated with scale and diversity. We developed a novel, robust statistical approach to characterize chemical-elicited effects in behavioral data from high-throughput screening (HTS) of all 1,060 Toxicity Forecaster (ToxCast™) chemicals across 5 concentrations at 120 hours post-fertilization (hpf). Taking advantage of the immense scale of data for a global view, we show that this new approach reduces bias introduced by extreme values yet allows for diverse response patterns that confound the application of traditional statistics. We have also shown that, as a summary measure of response for local tests of chemical-associated behavioral effects, it achieves a significant reduction in coefficient of variation compared to many traditional statistical modeling methods. This effective increase in signal-to-noise ratio augments statistical power and is observed across experimental periods (light/dark conditions) that display varied distributional response patterns. Finally, we integrated results with data from concomitant developmental endpoint measurements to show that appropriate statistical handling of HTS behavioral data can add important biological context that informs mechanistic hypotheses.
Variation in Quality of Urgent Health Care Provided During Commercial Virtual Visits.
Schoenfeld, Adam J; Davies, Jason M; Marafino, Ben J; Dean, Mitzi; DeJong, Colette; Bardach, Naomi S; Kazi, Dhruv S; Boscardin, W John; Lin, Grace A; Duseja, Reena; Mei, Y John; Mehrotra, Ateev; Dudley, R Adams
2016-05-01
.4%). No statistically significant variation in guideline adherence by mode of communication (videoconference vs telephone vs webchat) was found. Significant variation in quality was found among companies providing virtual visits for management of common acute illnesses. More variation was found in performance for some conditions than for others, but no variation by mode of communication.
Directory of Open Access Journals (Sweden)
Qing Gu
2016-03-01
Qiandao Lake (Xin’an Jiang reservoir) plays a significant role in drinking water supply for eastern China, and it is an attractive tourist destination. Three multivariate statistical methods were comprehensively applied to assess the spatial and temporal variations in water quality as well as potential pollution sources in Qiandao Lake. Data sets of nine parameters from 12 monitoring sites during 2010–2013 were obtained for analysis. Cluster analysis (CA) was applied to classify the 12 sampling sites into three groups (Groups A, B and C) and the 12 monitoring months into two clusters (April-July, and the remaining months). Discriminant analysis (DA) identified Secchi disc depth, dissolved oxygen, permanganate index and total phosphorus as the significant variables for distinguishing variations of different years, with 79.9% correct assignments. Dissolved oxygen, pH and chlorophyll-a were determined to discriminate between the two sampling periods classified by CA, with 87.8% correct assignments. For spatial variation, DA identified Secchi disc depth and ammonia nitrogen as the significant discriminating parameters, with 81.6% correct assignments. Principal component analysis (PCA) identified organic pollution, nutrient pollution, domestic sewage, and agricultural and surface runoff as the primary pollution sources, explaining 84.58%, 81.61% and 78.68% of the total variance in Groups A, B and C, respectively. These results demonstrate the effectiveness of integrated use of CA, DA and PCA for reservoir water quality evaluation and could assist managers in improving water resources management.
International Nuclear Information System (INIS)
Zhang Yingjie; Li Jianbin; Tian Shiyu; Li Fengxiang; Fan Tingyong; Shao Qian; Xu Min; Lu Jie
2011-01-01
Objective: To investigate the correlation of the positional movement of the primary tumor with organs of interest and skin markers, and the correlation of the volume variation of primary tumors and lungs across respiratory phases, in patients with lung cancer scanned under free-breathing conditions by four-dimensional CT (4DCT) simulation. Methods: 16 patients with lung cancer were scanned under free-breathing conditions by 4DCT simulation connected to a respiration-monitoring system. A coordinate system was created based on the image of the T5 phase, and the gross tumor volume (GTV) and normal tissue structures were contoured on all 10 phases. Their three-dimensional position variations were measured and the correlations analyzed, as were the volume variations of the GTV and lungs across the 10 respiratory phases. Results: The movement range of lung tumors differed distinctly between lobes: 0.8 - 5.0 mm in the upper lobe, 5.7 - 5.9 mm in the middle lobe and 10.2 - 13.7 mm in the lower lobe. The movement range also differed between the three directions: z-axis 4.3 mm ± 4.3 mm > y-axis 2.2 mm ± 1.0 mm > x-axis 1.7 mm ± 1.5 mm (χ² = 16.22, P = 0.000). There was no statistically significant correlation between the movement vectors of the GTV and the structures of interest (r = -0.50 to -0.01, P = 0.058 to 0.961), nor between the volume variations of the tumor and lung (r = 0.23, P = 0.520). Conclusions: Based on 4DCT, statistically significant differences in GTV centroid movement are observed between pulmonary lobes and between the three spatial directions. Individual 4DCT measurement is therefore necessary to define the internal target volume margin for lung cancer. (authors)
Tan, Mei-xiu; Wang, Jing; Yu, Wei-dong; He, Di; Wang, Na; Dai, Tong; Sun, Yan; Tang, Jian-zhao; Chang, Qing
2015-12-01
Sowing date is one of the vital factors determining crop yield. In this study, the temporal and spatial variation of the optimal sowing date of summer maize in Henan Province, China, was analyzed with a statistical model and the APSIM-Maize model. The results showed that average optimal sowing dates for summer maize ranged from May 30 to June 13 across Henan Province, with earlier sowing (before June 8) in the southern part and later sowing (June 4 to June 13) in the northern part. The optimal sowing date in the mountain area of western Henan Province should be around May 30. The late-maturing variety Nongda 108 should be planted at least two days earlier than the middle-maturing variety Danyu 13. Under climate warming, maize sowing should be postponed by at least 3 days if the maize harvesting date can be delayed by a week. Compared with a normal year, sowing should be delayed by about a week in a low-precipitation year and advanced by about a week in a high-precipitation year. Across Henan Province, the optimal sowing dates of summer maize showed no significant trend during 1971-2010, while the potential sowing period was extended in some regions, such as south from Zhumadian, Yichuan, Nei-xiang and Nanyang in the middle part of Henan, Linzhou in northern Henan and Sanmenxia in western Henan, as a result of the earlier maturity of winter wheat due to increasing temperature and changes in winter wheat cultivars. At 76.7% of the study stations, the optimal sowing dates from the two methods showed no significant difference. It was recommended that northern Henan sow maize immediately after any rainfall and replant afterward, while southern Henan should not sow maize until there has been valid precipitation (3.9 mm and 8.3 mm for upper south and south parts, respectively) during the sowing period; both required enough precipitation during key water requirement period and optimal temperature during grain
Directory of Open Access Journals (Sweden)
Katy Denise Heath
2014-04-01
Predicting how species interactions evolve requires that we understand the mechanistic basis of coevolution, and thus the functional genotype-by-genotype interactions (G × G) that drive reciprocal natural selection. Theory on host-parasite coevolution provides testable hypotheses for empiricists, but depends upon models of functional G × G that remain loosely tethered to the molecular details of any particular system. In practice, reciprocal cross-infection studies are often used to partition the variation in infection or fitness in a population that is attributable to G × G (statistical G × G). Here we use simulations to demonstrate that within-population statistical G × G likely tells us little about the existence of coevolution, its strength, or the genetic basis of functional G × G. Combined with studies of multiple populations or points in time, mapping and molecular techniques can bridge the gap between natural variation and mechanistic models of coevolution, while model-based statistics can formally confront coevolutionary models with cross-infection data. Together these approaches provide a robust framework for inferring the infection genetics underlying statistical G × G, helping unravel the genetic basis of coevolution.
Statistics for experimentalists
Cooper, B E
2014-01-01
Statistics for Experimentalists aims to provide experimental scientists with a working knowledge of statistical methods and search approaches to the analysis of data. The book first elaborates on probability and continuous probability distributions. Discussions focus on properties of continuous random variables and normal variables, independence of two random variables, central moments of a continuous distribution, prediction from a normal distribution, binomial probabilities, and multiplication of probabilities and independence. The text then examines estimation and tests of significance. Topics include estimators and estimates, expected values, minimum variance linear unbiased estimators, sufficient estimators, methods of maximum likelihood and least squares, and the test of significance method. The manuscript ponders on distribution-free tests, Poisson process and counting problems, correlation and function fitting, balanced incomplete randomized block designs and the analysis of covariance, and experiment...
Energy Technology Data Exchange (ETDEWEB)
Tsili, A.C., E-mail: a_tsili@yahoo.gr [Department of Clinical Radiology, University Hospital of Ioannina (Greece); Argyropoulou, M.I., E-mail: margyrop@cc.uoi.gr [Department of Clinical Radiology, University Hospital of Ioannina (Greece); Tzarouchi, L., E-mail: ltzar@cc.uoi.gr [Department of Clinical Radiology, University Hospital of Ioannina (Greece); Dalkalitsis, N., E-mail: ndalkal@cc.uoi.gr [Department of Obstetrics and Gynaecology, University Hospital of Ioannina (Greece); Koliopoulos, G., E-mail: georgekoliopoulos@yahoo.com [Department of Obstetrics and Gynaecology, University Hospital of Ioannina (Greece); Paraskevaidis, E., E-mail: eparaske@cc.uoi.gr [Department of Obstetrics and Gynaecology, University Hospital of Ioannina (Greece); Tsampoulas, K., E-mail: ctsampou@uoi.gr [Department of Clinical Radiology, University Hospital of Ioannina (Greece)
2012-08-15
Objectives: To assess the apparent diffusion coefficient (ADC) changes of the normal uterine zones among reproductive women during the menstrual cycle. Methods: The study included 101 women of reproductive age, each with regular cycle and normal endometrium/myometrium, as proved on histopathology or MR imaging examination. Diffusion-weighted (DW) imaging was performed along the axial plane, using a single shot, multi-slice spin-echo planar diffusion pulse sequence and b-values of 0 and 800 s/mm². The mean and standard deviation of the ADC values of normal endometrium/myometrium were calculated for menstrual, proliferative and secretory phase. Analysis of variance followed by the least significant difference test was used for statistical analysis. Results: The ADC values of the endometrium were different in the three phases of the menstrual cycle (menstrual phase: 1.25 ± 0.27; proliferative phase: 1.39 ± 0.20; secretory phase: 1.50 ± 0.18) (F: 9.64, p: 0.00). A statistically significant difference was observed among all groups (p < 0.05). The ADC values of the normal myometrium were different in the three phases of the menstrual cycle (menstrual phase: 1.91 ± 0.35; proliferative phase: 1.72 ± 0.27; secretory phase: 1.87 ± 0.28) (F: 3.60, p: 0.03). A statistically significant difference was observed between menstrual and proliferative phase and between proliferative and secretory phase (p < 0.05). No significant difference was noted between menstrual and secretory phase (p > 0.05). Conclusions: A wide variation of ADC values of normal endometrium and myometrium is observed during different phases of the menstrual cycle.
DEFF Research Database (Denmark)
Mmbando, Bruno P; Kamugisha, Mathias L; Lusingu, John P
2011-01-01
system (GPS) unit. The effects of risk factors were determined using generalized estimating equation and spatial risk of P. falciparum infection was modelled using a kernel (non-parametric) method. RESULTS: There was a significant spatial variation of P. falciparum infection, and urban areas were......ABSTRACT: BACKGROUND: Malaria due to Plasmodium falciparum is the leading cause of morbidity and mortality in Tanzania. According to health statistics, malaria accounts for about 30% and 15% of hospital admissions and deaths, respectively. The risk of P. falciparum infection varies across...... the country. This study describes the spatial variation and socio-economic determinants of P. falciparum infection in northeastern Tanzania. METHODS: The study was conducted in 14 villages located in highland, lowland and urban areas of Korogwe district. Four cross-sectional malaria surveys involving...
de Freitas, Patricia Moreira; Menezes, Andressa Nery; da Mota, Ana Carolina Costa; Simões, Alyne; Mendes, Fausto Medeiros; Lago, Andrea Dias Neves; Ferreira, Leila Soares; Ramos-Oliveira, Thayanne Monteiro
2016-01-01
The present study investigated how a hybrid light source (LED/laser) influences temperature variation on the enamel surfaces during 35% hydrogen peroxide (HP) bleaching. Effects on the whitening effectiveness and tooth sensitivity were analyzed. Twenty-two volunteers were randomly assigned to two different treatments in a split-mouth experimental model: group 1 (control), 35% HP; group 2 (experimental), 35% HP + LED/laser. Color evaluation was performed before treatment, and 7 and 14 days after completion of bleaching, using a color shade scale. Tooth sensitivity was assessed using a visual analog scale (VAS; before, immediately, and 24 hours after bleaching). During the bleaching treatment, thermocouple channels positioned on the tooth surfaces recorded the temperature. Data on color and temperature changes were subjected to statistical analysis (α = 5%). Tooth sensitivity data were evaluated descriptively. Groups 1 and 2 showed mean temperatures (± standard deviation) of 30.7 ± 1.2 °C and 34.1 ± 1.3 °C, respectively. It was found that there were statistically significant differences between the groups, with group 2 showing higher mean variation (P enamel surface. The color change results showed no differences in bleaching between the two treatment groups (P = .177). The variation of the average temperature during the treatments was not statistically associated with color variation (P = .079). Immediately after bleaching, it was found that 36.4% of the subjects in group 2 had mild to moderate sensitivity. In group 1, 45.5% showed moderate sensitivity. In both groups, the sensitivity ceased within 24 hours. The hybrid light source (LED/laser) influences temperature variation on the enamel surface during 35% HP bleaching and is not related to greater tooth sensitivity.
Directory of Open Access Journals (Sweden)
Jaijesh P
2006-01-01
Anomalies of the calf muscles are rare. One such anomalous muscle, known as the flexor accessorius longus muscle (also named accessorius ad accessorium, accessorius secundus, accessory flexor digitorum longus or pronator pedis), is of morphological significance. When present, it originates in the deep fascia of the tibia or fibula and inserts in the foot either into the flexor digitorum accessorius or into the tendons of the flexor digitorum longus. In this case report we describe an anomalous calf muscle which extends from the popliteal region, runs along the posterior compartment of the leg, reaches the sole and is inserted into the flexor digitorum longus muscle. This kind of muscle variation is considered to be a higher origin of the flexor digitorum accessorius muscle of the sole. We discuss the morphological significance and phylogenetic history of this muscle, as the variant is present in some primitive mammals, absent in apes, and in this particular case appeared as one of the muscles of the flexor compartment of the leg.
Ren, W. X.; Lin, Y. Q.; Fang, S. E.
2011-11-01
One of the key issues in vibration-based structural health monitoring is to extract the damage-sensitive but environment-insensitive features from sampled dynamic response measurements and to carry out the statistical analysis of these features for structural damage detection. A new damage feature is proposed in this paper by using the system matrices of the forward innovation model based on the covariance-driven stochastic subspace identification of a vibrating system. To overcome the variations of the system matrices, a non-singularity transposition matrix is introduced so that the system matrices are normalized to their standard forms. For reducing the effects of modeling errors, noise and environmental variations on measured structural responses, a statistical pattern recognition paradigm is incorporated into the proposed method. The Mahalanobis and Euclidean distance decision functions of the damage feature vector are adopted by defining a statistics-based damage index. The proposed structural damage detection method is verified against one numerical signal and two numerical beams. It is demonstrated that the proposed statistics-based damage index is sensitive to damage and shows some robustness to the noise and false estimation of the system ranks. The method is capable of locating damage of the beam structures under different types of excitations. The robustness of the proposed damage detection method to the variations in environmental temperature is further validated in a companion paper by a reinforced concrete beam tested in the laboratory and a full-scale arch bridge tested in the field.
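The Mahalanobis-distance decision function mentioned in the abstract above can be illustrated with a minimal sketch. It assumes a hypothetical two-component feature vector and made-up baseline values; the paper's actual feature is built from stochastic subspace system matrices, which are not reproduced here. The idea shown is only the statistical step: score a new feature vector by its distance from the healthy-state distribution, scaled by that distribution's covariance.

```python
from statistics import mean

def fit_baseline(features):
    """Estimate the mean vector and inverse covariance of 2-D baseline features."""
    xs = [f[0] for f in features]
    ys = [f[1] for f in features]
    n = len(features)
    mx, my = mean(xs), mean(ys)
    # Sample (n - 1) variances and covariance of the two feature components.
    sxx = sum((x - mx) ** 2 for x in xs) / (n - 1)
    syy = sum((y - my) ** 2 for y in ys) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in features) / (n - 1)
    det = sxx * syy - sxy ** 2
    if det <= 0:
        raise ValueError("covariance matrix is singular")
    # Closed-form inverse of the 2x2 covariance matrix.
    inv = ((syy / det, -sxy / det), (-sxy / det, sxx / det))
    return (mx, my), inv

def mahalanobis(feature, center, inv):
    """Mahalanobis distance of one feature vector from the baseline."""
    dx, dy = feature[0] - center[0], feature[1] - center[1]
    d2 = (dx * (inv[0][0] * dx + inv[0][1] * dy)
          + dy * (inv[1][0] * dx + inv[1][1] * dy))
    return d2 ** 0.5

def euclidean(feature, center):
    """Plain Euclidean distance, for comparison."""
    return ((feature[0] - center[0]) ** 2 + (feature[1] - center[1]) ** 2) ** 0.5

# Hypothetical healthy-state features (illustrative numbers only).
baseline = [(1.0, 2.0), (1.2, 2.1), (0.9, 1.8), (1.1, 2.2), (0.8, 1.9)]
center, inv = fit_baseline(baseline)

# A hypothetical new measurement that departs against the baseline correlation.
candidate = (1.5, 1.5)
print("Mahalanobis:", mahalanobis(candidate, center, inv))
print("Euclidean:  ", euclidean(candidate, center))
```

Because the two baseline components are positively correlated, a candidate that moves in opposite directions on the two components receives a much larger Mahalanobis score than its Euclidean distance alone would suggest. Any alarm threshold on the score would have to be chosen from the baseline data.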
Mavukkandy, Musthafa Odayooth; Karmakar, Subhankar; Harikumar, P S
2014-09-01
The establishment of an efficient surface water quality monitoring (WQM) network is a critical component in the assessment, restoration and protection of river water quality. A periodic evaluation of monitoring network is mandatory to ensure effective data collection and possible redesigning of existing network in a river catchment. In this study, the efficacy and appropriateness of existing water quality monitoring network in the Kabbini River basin of Kerala, India is presented. Significant multivariate statistical techniques like principal component analysis (PCA) and principal factor analysis (PFA) have been employed to evaluate the efficiency of the surface water quality monitoring network with monitoring stations as the evaluated variables for the interpretation of complex data matrix of the river basin. The main objective is to identify significant monitoring stations that must essentially be included in assessing annual and seasonal variations of river water quality. Moreover, the significance of seasonal redesign of the monitoring network was also investigated to capture valuable information on water quality from the network. Results identified few monitoring stations as insignificant in explaining the annual variance of the dataset. Moreover, the seasonal redesign of the monitoring network through a multivariate statistical framework was found to capture valuable information from the system, thus making the network more efficient. Cluster analysis (CA) classified the sampling sites into different groups based on similarity in water quality characteristics. The PCA/PFA identified significant latent factors standing for different pollution sources such as organic pollution, industrial pollution, diffuse pollution and faecal contamination. Thus, the present study illustrates that various multivariate statistical techniques can be effectively employed in sustainable management of water resources. The effectiveness of existing river water quality monitoring
Energy Technology Data Exchange (ETDEWEB)
Huenicke, B. [GKSS-Forschungszentrum Geesthacht GmbH (Germany). Inst. fuer Kuestenforschung
2008-11-06
This study aims at the estimation of the impact of different atmospheric factors on the past sea-level variations (up to 200 years) in the Baltic Sea by statistically analysing the relationship between Baltic Sea level records and observational and proxy-based reconstructed climatic data sets. The focus lies on the identification and possible quantification of the contribution of sea-level pressure (wind), air-temperature and precipitation to the low-frequency (decadal and multi-decadal) variability of Baltic Sea level. It is known that the wind forcing is the main factor explaining average Baltic Sea level variability at inter-annual to decadal timescales, especially in wintertime. In this thesis it is statistically estimated to what extent other regional climate factors contribute to the spatially heterogeneous Baltic Sea level variations around the isostatic trend at multi-decadal timescales. Although the statistical analysis cannot be completely conclusive, as the potential climate drivers are all statistically interrelated to some degree, the results indicate that precipitation should be taken into account as an explanatory variable for sea-level variations. On the one hand it has been detected that the amplitude of the annual cycle of Baltic Sea level has increased throughout the 20th century and precipitation seems to be the only factor among those analysed (wind through SLP field, barometric effect, temperature and precipitation) that can account for this evolution. On the other hand, precipitation increases the ability to hindcast inter-annual variations of sea level in some regions and seasons, especially in the Southern Baltic in summertime. The mechanism by which precipitation exerts its influence on Baltic Sea level is not ascertained in this statistical analysis due to the lack of long salinity time series. This result, however, represents a working hypothesis that can be confirmed or disproved by long simulations of the Baltic Sea system - ocean
The large deviation approach to statistical mechanics
International Nuclear Information System (INIS)
Touchette, Hugo
2009-01-01
The theory of large deviations is concerned with the exponential decay of probabilities of large fluctuations in random systems. These probabilities are important in many fields of study, including statistics, finance, and engineering, as they often yield valuable information about the large fluctuations of a random system around its most probable state or trajectory. In the context of equilibrium statistical mechanics, the theory of large deviations provides exponential-order estimates of probabilities that refine and generalize Einstein's theory of fluctuations. This review explores this and other connections between large deviation theory and statistical mechanics, in an effort to show that the mathematical language of statistical mechanics is the language of large deviation theory. The first part of the review presents the basics of large deviation theory, and works out many of its classical applications related to sums of random variables and Markov processes. The second part goes through many problems and results of statistical mechanics, and shows how these can be formulated and derived within the context of large deviation theory. The problems and results treated cover a wide range of physical systems, including equilibrium many-particle systems, noise-perturbed dynamics, nonequilibrium systems, as well as multifractals, disordered systems, and chaotic systems. This review also covers many fundamental aspects of statistical mechanics, such as the derivation of variational principles characterizing equilibrium and nonequilibrium states, the breaking of the Legendre transform for nonconcave entropies, and the characterization of nonequilibrium fluctuations through fluctuation relations.
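As a brief illustration of the "exponential decay of probabilities" and the "sums of random variables" mentioned in the abstract above, the standard large deviation statement for a sample mean can be written as follows. The notation ($S_n$, $I$, $\lambda$, $k$) is chosen here for illustration and is an assumption, not a quotation from the review.

```latex
% Sample mean of n i.i.d. random variables X_1, ..., X_n
S_n = \frac{1}{n} \sum_{i=1}^{n} X_i

% Large deviation form: probabilities decay exponentially in n,
% at a rate given by the rate function I(s) >= 0
P\bigl(S_n \in [s, s + ds]\bigr) \approx e^{-n I(s)} \, ds , \qquad n \to \infty

% Cramer's theorem: I is the Legendre-Fenchel transform of the
% scaled cumulant generating function \lambda
\lambda(k) = \ln \mathbb{E}\bigl[ e^{k X} \bigr] , \qquad
I(s) = \sup_{k \in \mathbb{R}} \bigl\{ k s - \lambda(k) \bigr\}

% Worked example: Gaussian X with mean \mu and variance \sigma^2
\lambda(k) = \mu k + \tfrac{1}{2} \sigma^2 k^2
\quad \Longrightarrow \quad
I(s) = \frac{(s - \mu)^2}{2 \sigma^2}
```

In the Gaussian example the rate function vanishes only at $s = \mu$, the most probable value, and grows quadratically away from it, which is the Einstein-type fluctuation picture that the abstract says the theory refines and generalizes.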
The large deviation approach to statistical mechanics
Touchette, Hugo
2009-07-01
The theory of large deviations is concerned with the exponential decay of probabilities of large fluctuations in random systems. These probabilities are important in many fields of study, including statistics, finance, and engineering, as they often yield valuable information about the large fluctuations of a random system around its most probable state or trajectory. In the context of equilibrium statistical mechanics, the theory of large deviations provides exponential-order estimates of probabilities that refine and generalize Einstein’s theory of fluctuations. This review explores this and other connections between large deviation theory and statistical mechanics, in an effort to show that the mathematical language of statistical mechanics is the language of large deviation theory. The first part of the review presents the basics of large deviation theory, and works out many of its classical applications related to sums of random variables and Markov processes. The second part goes through many problems and results of statistical mechanics, and shows how these can be formulated and derived within the context of large deviation theory. The problems and results treated cover a wide range of physical systems, including equilibrium many-particle systems, noise-perturbed dynamics, nonequilibrium systems, as well as multifractals, disordered systems, and chaotic systems. This review also covers many fundamental aspects of statistical mechanics, such as the derivation of variational principles characterizing equilibrium and nonequilibrium states, the breaking of the Legendre transform for nonconcave entropies, and the characterization of nonequilibrium fluctuations through fluctuation relations.
Statistical competencies for medical research learners: What is fundamental?
Enders, Felicity T; Lindsell, Christopher J; Welty, Leah J; Benn, Emma K T; Perkins, Susan M; Mayo, Matthew S; Rahbar, Mohammad H; Kidwell, Kelley M; Thurston, Sally W; Spratt, Heidi; Grambow, Steven C; Larson, Joseph; Carter, Rickey E; Pollock, Brad H; Oster, Robert A
2017-06-01
It is increasingly essential for medical researchers to be literate in statistics, but the requisite degree of literacy is not the same for every statistical competency in translational research. Statistical competency can range from 'fundamental' (necessary for all) to 'specialized' (necessary for only some). In this study, we determine the degree to which each competency is fundamental or specialized. We surveyed members of 4 professional organizations, targeting doctorally trained biostatisticians and epidemiologists who taught statistics to medical research learners in the past 5 years. Respondents rated 24 educational competencies on a 5-point Likert scale anchored by 'fundamental' and 'specialized.' There were 112 responses. Nineteen of 24 competencies were fundamental. The competencies considered most fundamental were assessing sources of bias and variation (95%), recognizing one's own limits with regard to statistics (93%), identifying the strengths, and limitations of study designs (93%). The least endorsed items were meta-analysis (34%) and stopping rules (18%). We have identified the statistical competencies needed by all medical researchers. These competencies should be considered when designing statistical curricula for medical researchers and should inform which topics are taught in graduate programs and evidence-based medicine courses where learners need to read and understand the medical research literature.
2014-01-01
Background Thoroughbred racehorses are subject to non-traumatic distal limb bone fractures that occur during racing and exercise. Susceptibility to fracture may be due to underlying disturbances in bone metabolism which have a genetic cause. Fracture risk has been shown to be heritable in several species but this study is the first genetic analysis of fracture risk in the horse. Results Fracture cases (n = 269) were horses that sustained catastrophic distal limb fractures while racing on UK racecourses, necessitating euthanasia. Control horses (n = 253) were over 4 years of age, were racing during the same time period as the cases, and had no history of fracture at the time the study was carried out. The horses sampled were bred for both flat and National Hunt (NH) jump racing. 43,417 SNPs were employed to perform a genome-wide association analysis and to estimate the proportion of genetic variance attributable to the SNPs on each chromosome using restricted maximum likelihood (REML). Significant genetic variation associated with fracture risk was found on chromosomes 9, 18, 22 and 31. Three SNPs on chromosome 18 (62.05 Mb – 62.15 Mb) and one SNP on chromosome 1 (14.17 Mb) reached genome-wide significance (p fracture than cases, p = 1 × 10-4), while a second haplotype increases fracture risk (cases at 3.39 times higher risk of fracture than controls, p = 0.042). Conclusions Fracture risk in the Thoroughbred horse is a complex condition with an underlying genetic basis. Multiple genomic regions contribute to susceptibility to fracture risk. This suggests there is the potential to develop SNP-based estimators for genetic risk of fracture in the Thoroughbred racehorse, using methods pioneered in livestock genetics such as genomic selection. This information would be useful to racehorse breeders and owners, enabling them to reduce the risk of injury in their horses. PMID:24559379
On two methods of statistical image analysis
Missimer, J; Knorr, U; Maguire, RP; Herzog, H; Seitz, RJ; Tellman, L; Leenders, K.L.
1999-01-01
The computerized brain atlas (CBA) and statistical parametric mapping (SPM) are two procedures for voxel-based statistical evaluation of PET activation studies. Each includes spatial standardization of image volumes, computation of a statistic, and evaluation of its significance. In addition,
International Nuclear Information System (INIS)
Szeto, Samuel S. W.; Reinke, Stacey N.; Lemire, Bernard D.
2011-01-01
The application of metabolomics to human and animal model systems is poised to provide great insight into our understanding of disease etiology and the metabolic changes that are associated with these conditions. However, metabolomic studies have also revealed that there is significant, inherent biological variation in human samples and even in samples from animal model systems where the animals are housed under carefully controlled conditions. This inherent biological variability is an important consideration for all metabolomics analyses. In this study, we examined the biological variation in ¹H NMR-based metabolic profiling of two model systems, the yeast Saccharomyces cerevisiae and the nematode Caenorhabditis elegans. Using relative standard deviations (RSD) as a measure of variability, our results reveal that both model systems have significant amounts of biological variation. The C. elegans metabolome possesses greater metabolic variance with average RSD values of 29 and 39%, depending on the food source that was used. The S. cerevisiae exometabolome RSD values ranged from 8% to 12% for the four strains examined. We also determined whether biological variation occurs between pairs of phenotypically identical yeast strains. Multivariate statistical analysis allowed us to discriminate between pair members based on their metabolic phenotypes. Our results highlight the variability of the metabolome that exists even for less complex model systems cultured under defined conditions. We also highlight the efficacy of metabolic profiling for defining these subtle metabolic alterations.
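The relative standard deviation used as the variability measure above is the sample standard deviation divided by the mean, usually quoted as a percentage. The following is a minimal sketch with hypothetical replicate intensities for a single metabolite, not data from the study.

```python
from statistics import mean, stdev

def relative_standard_deviation(values):
    """Return the RSD (sample standard deviation / |mean|) as a percentage."""
    m = mean(values)
    if m == 0:
        raise ValueError("RSD is undefined when the mean is zero")
    return 100.0 * stdev(values) / abs(m)

# Hypothetical peak intensities for one metabolite in five replicate cultures.
replicates = [10.0, 12.0, 9.0, 11.0, 8.0]
rsd = relative_standard_deviation(replicates)
print(f"RSD = {rsd:.1f}%")
```

An average RSD over a whole profile, as reported in the abstract, would be obtained by applying this function per metabolite and averaging the results.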
DEFF Research Database (Denmark)
Korneliussen, Thorfinn Sand; Moltke, Ida; Albrechtsen, Anders
2013-01-01
A number of different statistics are used for detecting natural selection using DNA sequencing data, including statistics that are summaries of the frequency spectrum, such as Tajima's D. These statistics are now often being applied in the analysis of Next Generation Sequencing (NGS) data. Howeve......, estimates of frequency spectra from NGS data are strongly affected by low sequencing coverage; the inherent technology dependent variation in sequencing depth causes systematic differences in the value of the statistic among genomic regions....
Statistical process control: separating signal from noise in emergency department operations.
Pimentel, Laura; Barrueto, Fermin
2015-05-01
Statistical process control (SPC) is a visually appealing and statistically rigorous methodology very suitable to the analysis of emergency department (ED) operations. We demonstrate that the control chart is the primary tool of SPC; it is constructed by plotting data measuring the key quality indicators of operational processes in rationally ordered subgroups such as units of time. Control limits are calculated using formulas reflecting the variation in the data points from one another and from the mean. SPC allows managers to determine whether operational processes are controlled and predictable. We review why the moving range chart is most appropriate for use in the complex ED milieu, how to apply SPC to ED operations, and how to determine when performance improvement is needed. SPC is an excellent tool for operational analysis and quality improvement for these reasons: 1) control charts make large data sets intuitively coherent by integrating statistical and visual descriptions; 2) SPC provides analysis of process stability and capability rather than simple comparison with a benchmark; 3) SPC allows distinction between special cause variation (signal), indicating an unstable process requiring action, and common cause variation (noise), reflecting a stable process; and 4) SPC keeps the focus of quality improvement on process rather than individual performance. Because data have no meaning apart from their context, and every process generates information that can be used to improve it, we contend that SPC should be seriously considered for driving quality improvement in emergency medicine.
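The control-limit calculation described above can be sketched for the individuals and moving range (XmR) chart, one common form of moving range chart. The constants 2.66 and 3.267 are the standard XmR factors for moving ranges of two consecutive points. The daily values and the function names are hypothetical, and the abstract does not say which chart variant or detection rules the authors use.

```python
from statistics import mean

def xmr_limits(values):
    """Return (center, lower, upper, mr_upper) for an XmR control chart."""
    if len(values) < 2:
        raise ValueError("need at least two observations")
    # Moving range: absolute difference between consecutive observations.
    moving_ranges = [abs(b - a) for a, b in zip(values, values[1:])]
    center = mean(values)
    mr_bar = mean(moving_ranges)
    # 2.66 = 3 / d2 with d2 = 1.128 for subgroups of size 2;
    # 3.267 is the D4 factor for the moving range chart itself.
    lower = center - 2.66 * mr_bar
    upper = center + 2.66 * mr_bar
    return center, lower, upper, 3.267 * mr_bar

def special_cause_points(values):
    """Indices of observations outside the individuals-chart limits."""
    _, lower, upper, _ = xmr_limits(values)
    return [i for i, v in enumerate(values) if v < lower or v > upper]

# Hypothetical daily median door-to-provider times, in minutes.
daily_minutes = [30, 32, 29, 31, 30, 33, 28, 31, 60, 30]
center, lower, upper, mr_upper = xmr_limits(daily_minutes)
print(f"center={center:.1f}  limits=({lower:.1f}, {upper:.1f})")
print("special-cause days:", special_cause_points(daily_minutes))
```

This sketch applies only the simplest signal rule, a point beyond the limits. Practical SPC work usually adds run rules, and often recomputes the limits from a period known to be stable so that an outlier does not inflate them.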
A Guideline to Univariate Statistical Analysis for LC/MS-Based Untargeted Metabolomics-Derived Data
Directory of Open Access Journals (Sweden)
Maria Vinaixa
2012-10-01
Several metabolomic software programs provide methods for peak picking, retention time alignment and quantification of metabolite features in LC/MS-based metabolomics. Statistical analysis, however, is needed in order to discover those features significantly altered between samples. By comparing the retention time and MS/MS data of a model compound to that from the altered feature of interest in the research sample, metabolites can then be unequivocally identified. This paper reports a comprehensive overview of a workflow for statistical analysis to rank relevant metabolite features that will be selected for further MS/MS experiments. We focus on univariate data analysis applied in parallel on all detected features. Characteristics and challenges of this analysis are discussed and illustrated using four different real LC/MS untargeted metabolomic datasets. We demonstrate the influence of considering or violating the mathematical assumptions on which univariate statistical tests rely, using high-dimensional LC/MS datasets. Issues in data analysis such as determination of sample size, analytical variation, assumption of normality and homoscedasticity, or correction for multiple testing are discussed and illustrated in the context of our four untargeted LC/MS working examples.
Statistics Using Just One Formula
Rosenthal, Jeffrey S.
2018-01-01
This article advocates that introductory statistics be taught by basing all calculations on a single simple margin-of-error formula and deriving all of the standard introductory statistical concepts (confidence intervals, significance tests, comparisons of means and proportions, etc) from that one formula. It is argued that this approach will…
Statistics Anxiety among Postgraduate Students
Koh, Denise; Zawi, Mohd Khairi
2014-01-01
Most postgraduate programmes that have research components require students to take at least one course in research statistics. Not all postgraduate programmes are science based; a significant number of postgraduate students from the social sciences will be taking statistics courses as they try to complete their…
Detecting Variation Trends of Temperature and Precipitation for the Dadu River Basin, China
Directory of Open Access Journals (Sweden)
Ying Wu
2016-01-01
This study analyzes the variation trends of temperature and precipitation in the Dadu River Basin of China based on observed records from fourteen meteorological stations. The magnitude of the trends was estimated using Sen’s linear method, while their statistical significance was evaluated using the Mann-Kendall test. Spatially, the results show that annual temperature and precipitation increase from northwest to southeast. Temporally, annual temperature showed a significant increasing trend and annual precipitation showed an increasing trend. For extreme indices, the temperature trends are more consistent across the region than the precipitation trends. This paper has practical implications for effective management of climate risk and provides a foundation for further study of the hydrological situation in this river basin.
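A minimal sketch of the two techniques named in this abstract, assuming an evenly spaced annual series with no tied values and using the normal approximation for the Mann-Kendall statistic (no tie correction, no autocorrelation adjustment). The function names and the sample series are hypothetical.

```python
import math
from itertools import combinations
from statistics import median


def sens_slope(y):
    """Sen's slope: median of all pairwise slopes (y[j] - y[i]) / (j - i), i < j."""
    return median((y[j] - y[i]) / (j - i) for i, j in combinations(range(len(y)), 2))


def mann_kendall(y):
    """Return (S, Z, two-sided p) using the normal approximation without tie correction."""
    n = len(y)
    # S counts concordant minus discordant pairs.
    s = sum((y[j] > y[i]) - (y[j] < y[i]) for i, j in combinations(range(n), 2))
    var_s = n * (n - 1) * (2 * n + 5) / 18
    if s > 0:
        z = (s - 1) / math.sqrt(var_s)  # continuity correction
    elif s < 0:
        z = (s + 1) / math.sqrt(var_s)
    else:
        z = 0.0
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value from the standard normal
    return s, z, p


# Hypothetical annual mean temperatures (degrees C) for one station.
temps = [7.1, 7.3, 7.2, 7.6, 7.5, 7.9, 8.0, 8.2, 8.1, 8.4]
print(sens_slope(temps))
print(mann_kendall(temps))
```

Sen's slope gives the magnitude of the trend per time step, while the Mann-Kendall p-value indicates whether a monotonic trend is statistically detectable.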
International Nuclear Information System (INIS)
Zhu, Ting Ting; Ai, Tao; Zhang, Wei; Li, Tao; Li, Xiao Ming
2015-01-01
To investigate the changes in water content in the lumbar intervertebral discs by quantitative T2 MR imaging in the morning after bed rest and evening after a diurnal load. Twenty healthy volunteers were separately examined in the morning after bed rest and in the evening after finishing daily work. T2-mapping images were obtained and analyzed. An equally-sized rectangular region of interest (ROI) was manually placed in both, the anterior and the posterior annulus fibrosus (AF), in the outermost 20% of the disc. Three ROIs were placed in the space defined as the nucleus pulposus (NP). Repeated-measures analysis of variance and paired 2-tailed t tests were used for statistical analysis, with p < 0.05 as significantly different. T2 values significantly decreased from morning to evening, in the NP (anterior NP = -13.9 ms; central NP = -17.0 ms; posterior NP = -13.3 ms; all p < 0.001). Meanwhile T2 values significantly increased in the anterior AF (+2.9 ms; p = 0.025) and the posterior AF (+5.9 ms; p < 0.001). T2 values in the posterior AF showed the largest degree of variation among the 5 ROIs, but there was no statistical significance (p = 0.414). Discs with initially low T2 values in the center NP showed a smaller degree of variation in the anterior NP and in the central NP, than in discs with initially high T2 values in the center NP (10.0% vs. 16.1%, p = 0.037; 6.4% vs. 16.1%, p = 0.006, respectively). Segmental quantitative T2 MRI provides valuable insights into physiological aspects of normal discs.
Anatomy and variations of palmaris longus in fetuses.
Albay, S; Kastamoni, Yadigar; Sakalli, Büşra; Tunali, S
2013-01-01
The aim of this study was to assess the absence of the palmaris longus, the proportion of the lengths of tendon and muscle belly, and the development of the tendon and the belly during the fetal period, and to look for any difference between sides and genders. Fifty-eight spontaneously aborted human fetuses (26 female, 32 male, 116 upper extremities) were studied. The presence or absence of the palmaris longus was determined. The lengths of the belly and tendon were measured, and the belly/tendon length ratio was calculated. Correlations with gestational age, body side and gender were studied. The muscle was absent in 44 forearms (37.93%; 20 on the right side, 34.48%; 24 on the left side, 41.38%); the absence was bilateral in 19 of 58 fetuses (32.76%) and unilateral in six (10.34%). The unilateral absence rate was higher on the left side, with a statistically significant difference. The absence of palmaris longus was more common in females, and the difference was statistically significant. The belly/tendon length ratio was 1.04 ± 0.35 on the right side and 1.09 ± 0.3 on the left. It did not differ according to fetal age. A sound knowledge of the anatomy and variations of palmaris longus is of great importance during surgical interventions because, by virtue of its structure and function, it is the first choice for tendon grafts. Thus, this study is of academic interest for anatomists and hand surgeons alike.
Hyvärinen, A
1985-01-01
The main purpose of the present study was to describe the statistical behaviour of daily analytical errors in the dimensions of place and time, providing a statistical basis for realistic estimates of the analytical error, and hence allowing the importance of the error and the relative contributions of its different sources to be re-evaluated. The observation material consists of creatinine and glucose results for control sera measured in daily routine quality control in five laboratories for a period of one year. The observation data were processed and computed by means of an automated data processing system. Graphic representations of time series of daily observations, as well as their means and dispersion limits when grouped over various time intervals, were investigated. For partition of the total variation several two-way analyses of variance were done with laboratory and various time classifications as factors. Pooled sets of observations were tested for normality of distribution and for consistency of variances, and the distribution characteristics of error variation in different categories of place and time were compared. Errors were found from the time series to vary typically between days. Due to irregular fluctuations in general and particular seasonal effects in creatinine, stable estimates of means or of dispersions for errors in individual laboratories could not be easily obtained over short periods of time but only from data sets pooled over long intervals (preferably at least one year). Pooled estimates of proportions of intralaboratory variation were relatively low (less than 33%) when the variation was pooled within days. However, when the variation was pooled over longer intervals this proportion increased considerably, even to a maximum of 89-98% (95-98% in each method category) when an outlying laboratory in glucose was omitted, with a concomitant decrease in the interaction component (representing laboratory-dependent variation with time
Martin, Jordan S; Suarez, Scott A
2017-08-01
Interest in quantifying consistent among-individual variation in primate behavior, also known as personality, has grown rapidly in recent decades. Although behavioral coding is the most frequently utilized method for assessing primate personality, limitations in current statistical practice prevent researchers from utilizing the full potential of their coding datasets. These limitations include the use of extensive data aggregation, not modeling biologically relevant sources of individual variance during repeatability estimation, not partitioning between-individual (co)variance prior to modeling personality structure, the misuse of principal component analysis, and an over-reliance upon exploratory statistical techniques to compare personality models across populations, species, and data collection methods. In this paper, we propose a statistical framework for primate personality research designed to address these limitations. Our framework synthesizes recently developed mixed-effects modeling approaches for quantifying behavioral variation with an information-theoretic model selection paradigm for confirmatory personality research. After detailing a multi-step analytic procedure for personality assessment and model comparison, we employ this framework to evaluate seven models of personality structure in zoo-housed bonobos (Pan paniscus). We find that differences between sexes, ages, zoos, time of observation, and social group composition contributed to significant behavioral variance. Independently of these factors, however, personality nonetheless accounted for a moderate to high proportion of variance in average behavior across observational periods. A personality structure derived from past rating research receives the strongest support relative to our model set. This model suggests that personality variation across the measured behavioral traits is best described by two correlated but distinct dimensions reflecting individual differences in affiliation and
Geographic variation in expenditures for Workers' Compensation hospitalized claims.
Miller, T R; Levy, D T
1999-02-01
Past literature finds considerable variation in the cost of physician care and in the utilization of medical procedures. Variation in the cost of hospitalized care has received little attention. We examine injury costs of hospitalized claims across states. Multivariate regression analysis is used to isolate state variations, while controlling for personal and injury characteristics, and state characteristics. Injuries to workers filing Workers' Compensation lost workday claims. About 35,000 randomly sampled Workers' Compensation claims from 17 states filed between 1979 and 1988. Medical payments per episode of three injury groups: upper and lower extremity fractures and dislocations, other upper extremity injuries, and back strains and sprains. Statistical analyses reveal considerable variation in expenditures for hospitalized injuries across states, even after controlling for case mix and state characteristics. A substantial portion of the variation is explained by state rate regulations; regulated states have lower costs. The large variation in costs suggests a potential to affect the costs of hospitalized care. Efforts should be directed at those areas that have higher costs without sufficient input price, quality, or case mix justification.
Statistical modelling of traffic safety development
DEFF Research Database (Denmark)
Christens, Peter
2004-01-01
there were 6861 injury traffic accidents reported by the police, resulting in 4519 minor injuries, 3946 serious injuries, and 431 fatalities. The general purpose of the research was to improve the insight into aggregated road safety methodology in Denmark. The aim was to analyse advanced statistical methods......, that were designed to study developments over time, including effects of interventions. This aim has been achieved by investigating variations in aggregated Danish traffic accident series and by applying state of the art methodologies to specific case studies. The thesis comprises an introduction...
Wave scattering from statistically rough surfaces
Bass, F G; ter Haar, D
2013-01-01
Wave Scattering from Statistically Rough Surfaces discusses the complications in radio physics and hydro-acoustics in relation to wave transmission under settings seen in nature. Some of the topics that are covered include radar and sonar, the effect of variations in topographic relief or ocean waves on the transmission of radio and sound waves, the reproduction of radio waves from the lower layers of the ionosphere, and the oscillations of signals within the earth-ionosphere waveguide. The book begins with some fundamental idea of wave transmission theory and the theory of random processes a
Seasonal and Long-term Variations in 137Cs Among Adults from Swedish Hunter Families
International Nuclear Information System (INIS)
Agren, G.
2001-01-01
To study seasonal variations in 137Cs, whole-body content measurements of adults from Swedish hunter families were performed in autumn 1997 and spring 1998. Measurements were performed in three locations, By, Harbo and Gavle, which are geographically close (within 100 km of each other) but have large differences in ground deposition levels. The hunter families at these three locations had previously been measured in 1994. The measured persons were also asked about their frequency of intake of moose, roe-deer, freshwater fish, mushrooms and berries. A statistically significant lower frequency of intake of mushrooms and berries in By, of moose, roe-deer and mushrooms in Harbo, and of moose in Gavle was found in springtime compared to autumn. In one of the locations, there was a statistically significant lower average 137Cs whole-body content in spring 1998 than in autumn 1997, while in the other two locations no such effect could be seen. The 137Cs whole-body content decreased by 37% from 1994 to 1998 (including physical decay), corresponding to an effective ecological half-time of 6 years. (author)
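The relation between the reported 37% decrease over four years and the quoted half-time of about 6 years can be checked with a short calculation. The sketch assumes a single-exponential decline, which the abstract does not state explicitly; the function name is hypothetical.

```python
import math


def effective_half_time(fraction_remaining, elapsed_years):
    """Half-time T such that fraction_remaining = 0.5 ** (elapsed_years / T).

    Assumes a single-exponential decline (an assumption, not stated in the abstract).
    """
    if not 0.0 < fraction_remaining < 1.0:
        raise ValueError("fraction_remaining must be strictly between 0 and 1")
    return elapsed_years * math.log(2) / -math.log(fraction_remaining)


# A 37% decrease from 1994 to 1998 leaves 63% of the initial whole-body content.
print(round(effective_half_time(1.0 - 0.37, 1998 - 1994), 1))
```

Under that assumption the calculation reproduces a half-time close to the 6 years given in the abstract.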
Morphometric variation in the Tunisian green frog, Rana saharica ...
African Journals Online (AJOL)
Rana saharica is the most widely distributed anuran in Tunisia. We examined morphological variation in 124 specimens as a function of their geographical origin, using univariate and multivariate statistics with traditional morphometrics. Our results supported the existence of three morphotypes of this species, correctly ...
Sociolinguistic and Contact-induced Variation in Hungarian Language Use in Subcarpathia, Ukraine
Directory of Open Access Journals (Sweden)
István Csernicskó
2012-01-01
In addition to showing regional and social variation, the language use of the minority Hungarians of Subcarpathia, Ukraine, also presents a reflection of the region’s complex linguistic history and its effects from contact with Russian and Ukrainian. On the basis of quantitative empirical findings, this study shows Subcarpathian Hungarians to be a sociolinguistically stratified group of speakers whose Hungarian language use varies in a systematic manner according to sex, age, level of education, and place of residence. The paper also outlines some of the main differences in the language use of Hungarians in Subcarpathia and Hungary which are manifested in statistically significant ways.
Li, Tianxin; Zhou, Xing Chen; Ikhumhen, Harrison Odion; Difei, An
2018-05-01
In recent years, with the significant increase in urban development, it has become necessary to optimize the current air monitoring stations so that they reflect the quality of air in the environment. To highlight the spatial representativeness of some air monitoring stations, data from Beijing's regional air monitoring stations from 2012 to 2014 were used to calculate the monthly mean particulate matter (PM10) concentration in the region, and the spatial distribution of PM10 concentration across the whole region was derived using the IDW interpolation method and a spatial grid statistical method in GIS. The spatial variation across districts in Beijing was analyzed using the gridding model, and a 3-year spatial analysis of the PM10 concentration data, including their variation and a spatial overlay (1.5 km × 1.5 km cell resolution grid), showed that the total PM10 concentration frequency variation exceeded the standard. It is very important to optimize the layout of the existing air monitoring stations by combining the concentration distribution of air pollutants with the spatial region using GIS.
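For readers unfamiliar with IDW (inverse distance weighting), the sketch below shows the basic estimator on planar coordinates. The power of 2, the coordinate units and the station values are assumptions chosen for illustration; the abstract does not report the settings the authors used.

```python
import math


def idw(stations, x, y, power=2.0):
    """Inverse distance weighted estimate at (x, y).

    stations: iterable of (station_x, station_y, value) tuples in planar coordinates.
    power: distance exponent; 2 is a common default (an assumption here).
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for sx, sy, value in stations:
        distance = math.hypot(x - sx, y - sy)
        if distance == 0.0:
            return value  # the query point coincides with a station
        weight = 1.0 / distance ** power
        weighted_sum += weight * value
        weight_total += weight
    return weighted_sum / weight_total


# Hypothetical stations: (x km, y km, monthly mean PM10 in micrograms per cubic metre).
stations = [(0.0, 0.0, 100.0), (3.0, 0.0, 160.0), (0.0, 3.0, 130.0)]
# Estimate at the centre of one 1.5 km x 1.5 km grid cell.
print(idw(stations, 0.75, 0.75))
```

Evaluating the function at the centre of every grid cell would yield a gridded concentration surface of the kind the abstract describes.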
Language learning, language use and the evolution of linguistic variation
Perfors, Amy; Fehér, Olga; Samara, Anna; Swoboda, Kate; Wonnacott, Elizabeth
2017-01-01
Linguistic universals arise from the interaction between the processes of language learning and language use. A test case for the relationship between these factors is linguistic variation, which tends to be conditioned on linguistic or sociolinguistic criteria. How can we explain the scarcity of unpredictable variation in natural language, and to what extent is this property of language a straightforward reflection of biases in statistical learning? We review three strands of experimental work exploring these questions, and introduce a Bayesian model of the learning and transmission of linguistic variation along with a closely matched artificial language learning experiment with adult participants. Our results show that while the biases of language learners can potentially play a role in shaping linguistic systems, the relationship between biases of learners and the structure of languages is not straightforward. Weak biases can have strong effects on language structure as they accumulate over repeated transmission. But the opposite can also be true: strong biases can have weak or no effects. Furthermore, the use of language during interaction can reshape linguistic systems. Combining data and insights from studies of learning, transmission and use is therefore essential if we are to understand how biases in statistical learning interact with language transmission and language use to shape the structural properties of language. This article is part of the themed issue ‘New frontiers for statistical learning in the cognitive sciences’. PMID:27872370
Language learning, language use and the evolution of linguistic variation.
Smith, Kenny; Perfors, Amy; Fehér, Olga; Samara, Anna; Swoboda, Kate; Wonnacott, Elizabeth
2017-01-05
Linguistic universals arise from the interaction between the processes of language learning and language use. A test case for the relationship between these factors is linguistic variation, which tends to be conditioned on linguistic or sociolinguistic criteria. How can we explain the scarcity of unpredictable variation in natural language, and to what extent is this property of language a straightforward reflection of biases in statistical learning? We review three strands of experimental work exploring these questions, and introduce a Bayesian model of the learning and transmission of linguistic variation along with a closely matched artificial language learning experiment with adult participants. Our results show that while the biases of language learners can potentially play a role in shaping linguistic systems, the relationship between biases of learners and the structure of languages is not straightforward. Weak biases can have strong effects on language structure as they accumulate over repeated transmission. But the opposite can also be true: strong biases can have weak or no effects. Furthermore, the use of language during interaction can reshape linguistic systems. Combining data and insights from studies of learning, transmission and use is therefore essential if we are to understand how biases in statistical learning interact with language transmission and language use to shape the structural properties of language. This article is part of the themed issue 'New frontiers for statistical learning in the cognitive sciences'. © 2016 The Authors.
Statistical Reporting Errors and Collaboration on Statistical Analyses in Psychological Science.
Veldkamp, Coosje L S; Nuijten, Michèle B; Dominguez-Alvarez, Linda; van Assen, Marcel A L M; Wicherts, Jelte M
2014-01-01
Statistical analysis is error prone. A best practice for researchers using statistics would therefore be to share data among co-authors, allowing double-checking of executed tasks just as co-pilots do in aviation. To document the extent to which this 'co-piloting' currently occurs in psychology, we surveyed the authors of 697 articles published in six top psychology journals and asked them whether they had collaborated on four aspects of analyzing data and reporting results, and whether the described data had been shared between the authors. We acquired responses for 49.6% of the articles and found that co-piloting on statistical analysis and reporting results is quite uncommon among psychologists, while data sharing among co-authors seems reasonably but not completely standard. We then used an automated procedure to study the prevalence of statistical reporting errors in the articles in our sample and examined the relationship between reporting errors and co-piloting. Overall, 63% of the articles contained at least one p-value that was inconsistent with the reported test statistic and the accompanying degrees of freedom, and 20% of the articles contained at least one p-value that was inconsistent to such a degree that it may have affected decisions about statistical significance. Overall, the probability that a given p-value was inconsistent was over 10%. Co-piloting was not found to be associated with reporting errors.
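The idea of checking a reported p-value against its test statistic can be sketched as follows. This is a simplified illustration restricted to two-sided z statistics so that it needs only the standard library; the automated procedure in the article also handles statistics with degrees of freedom, and its exact rounding rules are not reproduced here. All names are hypothetical.

```python
import math


def two_sided_p_from_z(z):
    """Two-sided p-value for a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2))


def check_reported_p(z, reported_p, decimals=2, alpha=0.05):
    """Return (recomputed_p, inconsistent, decision_error).

    inconsistent: recomputed and reported p differ after rounding to `decimals`.
    decision_error: the inconsistency also changes the verdict at level `alpha`.
    """
    recomputed = two_sided_p_from_z(z)
    inconsistent = round(recomputed, decimals) != round(reported_p, decimals)
    decision_error = inconsistent and ((recomputed < alpha) != (reported_p < alpha))
    return recomputed, inconsistent, decision_error


print(check_reported_p(1.96, 0.05))  # consistent after rounding
print(check_reported_p(2.50, 0.02))  # inconsistent, but the verdict is unchanged
print(check_reported_p(1.50, 0.04))  # inconsistent, and the verdict flips
```

The third case corresponds to what the abstract calls an inconsistency large enough that it may have affected decisions about statistical significance.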
A variational Bayes discrete mixture test for rare variant association.
Logsdon, Benjamin A; Dai, James Y; Auer, Paul L; Johnsen, Jill M; Ganesh, Santhi K; Smith, Nicholas L; Wilson, James G; Tracy, Russell P; Lange, Leslie A; Jiao, Shuo; Rich, Stephen S; Lettre, Guillaume; Carlson, Christopher S; Jackson, Rebecca D; O'Donnell, Christopher J; Wurfel, Mark M; Nickerson, Deborah A; Tang, Hua; Reiner, Alexander P; Kooperberg, Charles
2014-01-01
Recently, many statistical methods have been proposed to test for associations between rare genetic variants and complex traits. Most of these methods test for association by aggregating genetic variations within a predefined region, such as a gene. Although there is evidence that "aggregate" tests are more powerful than the single marker test, these tests generally ignore neutral variants and therefore are unable to identify specific variants driving the association with phenotype. We propose a novel aggregate rare-variant test that explicitly models a fraction of variants as neutral, tests associations at the gene-level, and infers the rare-variants driving the association. Simulations show that in the practical scenario where there are many variants within a given region of the genome with only a fraction causal our approach has greater power compared to other popular tests such as the Sequence Kernel Association Test (SKAT), the Weighted Sum Statistic (WSS), and the collapsing method of Morris and Zeggini (MZ). Our algorithm leverages a fast variational Bayes approximate inference methodology to scale to exome-wide analyses, a significant computational advantage over exact inference model selection methodologies. To demonstrate the efficacy of our methodology we test for associations between von Willebrand Factor (VWF) levels and VWF missense rare-variants imputed from the National Heart, Lung, and Blood Institute's Exome Sequencing project into 2,487 African Americans within the VWF gene. Our method suggests that a relatively small fraction (~10%) of the imputed rare missense variants within VWF are strongly associated with lower VWF levels in African Americans.
Directory of Open Access Journals (Sweden)
Jinliang Huang
Surface water samples of baseflow were collected from 20 headwater sub-watersheds, which were classified into three types of watersheds (natural, urban and agricultural), in the flood, dry and transition seasons during three consecutive years (2010-2012) within a coastal watershed of Southeast China. Integrating spatial statistics with multivariate statistical techniques, river water quality variations and their interactions with natural and anthropogenic controls were examined to identify the causal factors and underlying mechanisms governing spatiotemporal patterns of water quality. Anthropogenic input related to industrial effluents and domestic wastewater, agricultural activities associated with precipitation-induced surface runoff, and natural weathering processes were identified as the potentially important factors driving the seasonal variations in stream water quality for the transition, flood and dry seasons, respectively. All water quality indicators except SRP had the highest mean concentrations in the dry and transition seasons. Anthropogenic activities and watershed characteristics led to the spatial variations in stream water quality in the three types of watersheds. Concentrations of NH4+-N, SRP, K+, CODMn, and Cl- were generally highest in urban watersheds. NO3--N concentration was generally highest in agricultural watersheds. Mg2+ concentration in natural watersheds was significantly higher than that in agricultural watersheds. Spatial autocorrelation analysis showed that neighboring sub-watersheds exhibited similar levels of water pollution in the dry and transition seasons, while non-point source pollution contributed to significant variations in water quality between neighboring sub-watersheds. Spatial regression analysis showed that anthropogenic controls played critical roles in variations of water quality in the JRW. Management implications were further discussed for water resource management. This research
Directory of Open Access Journals (Sweden)
Dominic Beaulieu-Prévost
2006-03-01
For the last 50 years of research in quantitative social sciences, the empirical evaluation of scientific hypotheses has been based on the rejection or not of the null hypothesis. However, more than 300 articles demonstrated that this method was problematic. In summary, null hypothesis testing (NHT) is unfalsifiable, its results depend directly on sample size and the null hypothesis is both improbable and not plausible. Consequently, alternatives to NHT such as confidence intervals (CI) and measures of effect size are starting to be used in scientific publications. The purpose of this article is, first, to provide the conceptual tools necessary to implement an approach based on confidence intervals, and second, to briefly demonstrate why such an approach is an interesting alternative to an approach based on NHT. As demonstrated in the article, the proposed CI approach avoids most problems related to a NHT approach and can often improve the scientific and contextual relevance of the statistical interpretations by testing range hypotheses instead of a point hypothesis and by defining the minimal value of a substantial effect. The main advantage of such a CI approach is that it replaces the notion of statistical power by an easily interpretable three-value logic (probable presence of a substantial effect, probable absence of a substantial effect and probabilistic undetermination). The demonstration includes a complete example.
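One possible reading of the three-value logic is sketched below: a confidence interval for an effect is compared with a minimal substantial effect size. The decision rule (symmetric thresholds around zero, whole interval inside or outside the band) is an assumption made for illustration and may differ from the rule in the article; the function name is hypothetical.

```python
def classify_effect(ci_low, ci_high, minimal_effect):
    """Classify a confidence interval against a minimal substantial effect.

    Assumed rule (not taken from the article): the effect is judged on both sides
    of zero, with +/- minimal_effect as the bounds of the "not substantial" band.
    """
    if ci_low > ci_high:
        raise ValueError("ci_low must not exceed ci_high")
    if minimal_effect <= 0:
        raise ValueError("minimal_effect must be positive")
    # Whole interval lies at or beyond the threshold on one side.
    if ci_low >= minimal_effect or ci_high <= -minimal_effect:
        return "probable presence of a substantial effect"
    # Whole interval lies strictly inside the band of negligible effects.
    if -minimal_effect < ci_low and ci_high < minimal_effect:
        return "probable absence of a substantial effect"
    # Interval straddles a threshold: the data do not settle the question.
    return "probabilistic undetermination"


print(classify_effect(0.5, 0.9, 0.3))
print(classify_effect(-0.1, 0.1, 0.3))
print(classify_effect(0.1, 0.6, 0.3))
```

The third outcome plays the role that low statistical power plays in an NHT framing: the interval is too wide, relative to the threshold, to support either conclusion.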
Regional compensation for statistical maximum likelihood reconstruction error of PET image pixels
International Nuclear Information System (INIS)
Forma, J; Ruotsalainen, U; Niemi, J A
2013-01-01
In positron emission tomography (PET), there is an increasing interest in studying not only the regional mean tracer concentration, but its variation arising from local differences in physiology, the tissue heterogeneity. However, in reconstructed images this physiological variation is shadowed by a large reconstruction error, which is caused by noisy data and the inversion of tomographic problem. We present a new procedure which can quantify the error variation in regional reconstructed values for given PET measurement, and reveal the remaining tissue heterogeneity. The error quantification is made by creating and reconstructing the noise realizations of virtual sinograms, which are statistically similar with the measured sinogram. Tests with physical phantom data show that the characterization of error variation and the true heterogeneity are possible, despite the existing model error when real measurement is considered. (paper)
The System of Indicators for the Statistical Evaluation of Market Conjuncture
Directory of Open Access Journals (Sweden)
Chernenko Daryna I.
2017-04-01
The article aims to systematize and improve the system of statistical indicators for the market of laboratory health services (LHS) and to develop methods for their calculation. In forming the system of statistical indicators for the LHS market, nine blocks are proposed: market size; proportionality of the market; market demand; market proposal; level and dynamics of prices; variation of the LHS; dynamics, development trends, and cycles of the market; market structure; and level of competition and monopolization. The proposed system of statistical indicators, together with the methods for their calculation, should support the study of trends and regularities in the formation of the market for laboratory health services in Ukraine.
Statistics 101 for Radiologists.
Anvari, Arash; Halpern, Elkan F; Samir, Anthony E
2015-10-01
Diagnostic tests have wide clinical applications, including screening, diagnosis, measuring treatment effect, and determining prognosis. Interpreting diagnostic test results requires an understanding of key statistical concepts used to evaluate test efficacy. This review explains descriptive statistics and discusses probability, including mutually exclusive and independent events and conditional probability. In the inferential statistics section, a statistical perspective on study design is provided, together with an explanation of how to select appropriate statistical tests. Key concepts in recruiting study samples are discussed, including representativeness and random sampling. Variable types are defined, including predictor, outcome, and covariate variables, and the relationship of these variables to one another. In the hypothesis testing section, we explain how to determine if observed differences between groups are likely to be due to chance. We explain type I and II errors, statistical significance, and study power, followed by an explanation of effect sizes and how confidence intervals can be used to generalize observed effect sizes to the larger population. Statistical tests are explained in four categories: t tests and analysis of variance, proportion analysis tests, nonparametric tests, and regression techniques. We discuss sensitivity, specificity, accuracy, receiver operating characteristic analysis, and likelihood ratios. Measures of reliability and agreement, including κ statistics, intraclass correlation coefficients, and Bland-Altman graphs and analysis, are introduced. © RSNA, 2015.
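The diagnostic accuracy measures listed in this abstract follow directly from a 2 × 2 table of test results against a reference standard. The sketch below uses the standard definitions; the function name and the counts are hypothetical.

```python
import math


def diagnostic_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity, accuracy and likelihood ratios from a 2x2 table."""
    sensitivity = tp / (tp + fn)   # true positive rate among diseased
    specificity = tn / (tn + fp)   # true negative rate among non-diseased
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    # Likelihood ratios are unbounded when a denominator is zero.
    lr_positive = sensitivity / (1 - specificity) if specificity < 1 else math.inf
    lr_negative = (1 - sensitivity) / specificity if specificity > 0 else math.inf
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "lr_positive": lr_positive,
        "lr_negative": lr_negative,
    }


# Hypothetical study: 100 diseased and 100 non-diseased patients.
print(diagnostic_metrics(tp=90, fp=20, fn=10, tn=80))
```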
Preventing statistical errors in scientific journals.
Nuijten, M.B.
2016-01-01
There is evidence for a high prevalence of statistical reporting errors in psychology and other scientific fields. These errors display a systematic preference for statistically significant results, distorting the scientific literature. There are several possible causes for this systematic error
Variation of elemental concentration in hair of the Japanese in terms of age, sex and hair treatment
International Nuclear Information System (INIS)
Takeuchi, T.; Hayashi, T.; Takada, J.; Hayashi, Y.; Koyama, M.; Shinogi, M.; Aoki, A.; Tomiyama, T.; Katayama, K.
1982-01-01
Instrumental neutron activation analysis has been performed on hair from normal Japanese individuals to define the baseline levels of trace elements. A statistical analysis that is not influenced by detection limits has been carried out to elucidate the variations of elemental concentrations in terms of age, sex and hair treatment. Correlation coefficients have been calculated between the logarithmic concentrations of the elements determined in the groups classified according to sex, age and hair treatment, and their significance levels have been evaluated. (author)
Spatiotemporal Variations and Driving Factors of Air Pollution in China.
Zhan, Dongsheng; Kwan, Mei-Po; Zhang, Wenzhong; Wang, Shaojian; Yu, Jianhui
2017-12-08
In recent years, severe and persistent air pollution episodes in China have drawn wide public concern. Based on ground monitoring air quality data collected in 2015 in Chinese cities above the prefectural level, this study identifies the spatiotemporal variations of air pollution and its associated driving factors in China using descriptive statistics and geographical detector methods. The results show that the average air pollution ratio and continuous air pollution ratio across Chinese cities in 2015 were 23.1 ± 16.9% and 16.2 ± 14.8%. The highest levels of air pollution ratio and continuous air pollution ratio were observed in northern China, especially in the Bohai Rim region and Xinjiang province, and the lowest levels were found in southern China. The average and maximum levels of continuous air pollution show distinct spatial variations when compared with those of the continuous air pollution ratio. Monthly changes in both air pollution ratio and continuous air pollution ratio have a U-shaped variation, indicating that the highest levels of air pollution occurred in winter and the lowest levels happened in summer. The results of the geographical detector model further reveal that the effect intensity of natural factors on the spatial disparity of the air pollution ratio is greater than that of human-related factors. Specifically, among natural factors, the annual average temperature, land relief, and relative humidity have the greatest and most significant negative effects on the air pollution ratio, whereas human factors such as population density, the number of vehicles, and Gross Domestic Product (GDP) witness the strongest and most significant positive effects on air pollution ratio.
Spatiotemporal Variations and Driving Factors of Air Pollution in China
Directory of Open Access Journals (Sweden)
Dongsheng Zhan
2017-12-01
In recent years, severe and persistent air pollution episodes in China have drawn wide public concern. Based on ground monitoring air quality data collected in 2015 in Chinese cities above the prefectural level, this study identifies the spatiotemporal variations of air pollution and its associated driving factors in China using descriptive statistics and geographical detector methods. The results show that the average air pollution ratio and continuous air pollution ratio across Chinese cities in 2015 were 23.1 ± 16.9% and 16.2 ± 14.8%. The highest levels of air pollution ratio and continuous air pollution ratio were observed in northern China, especially in the Bohai Rim region and Xinjiang province, and the lowest levels were found in southern China. The average and maximum levels of continuous air pollution show distinct spatial variations when compared with those of the continuous air pollution ratio. Monthly changes in both air pollution ratio and continuous air pollution ratio have a U-shaped variation, indicating that the highest levels of air pollution occurred in winter and the lowest levels happened in summer. The results of the geographical detector model further reveal that the effect intensity of natural factors on the spatial disparity of the air pollution ratio is greater than that of human-related factors. Specifically, among natural factors, the annual average temperature, land relief, and relative humidity have the greatest and most significant negative effects on the air pollution ratio, whereas human factors such as population density, the number of vehicles, and Gross Domestic Product (GDP) witness the strongest and most significant positive effects on air pollution ratio.
Bonnet, R.; Boé, J.; Dayon, G.; Martin, E.
2017-10-01
Characterizing and understanding the multidecadal variations of the continental hydrological cycle is a challenging issue given the limitation of observed data sets. In this paper, a new approach to derive twentieth century hydrological reconstructions over France with a hydrological model is presented. The method combines the results of long-term atmospheric reanalyses downscaled with a stochastic statistical method and homogenized station observations to derive the meteorological forcing needed for hydrological modeling. Different methodological choices are tested and evaluated. We show that using homogenized observations to constrain the results of statistical downscaling helps to improve the reproduction of precipitation, temperature, and river flow variability. In particular, it corrects some unrealistic long-term trends associated with the atmospheric reanalyses. Observationally constrained reconstructions therefore constitute a valuable data set to study the multidecadal hydrological variations over France. Thanks to these reconstructions, we confirm that the multidecadal variations previously noted in French river flows have mainly a climatic origin. Moreover, we show that multidecadal variations exist in other hydrological variables (evapotranspiration, snow cover, and soil moisture). Depending on the region, the persistence from spring to summer of soil moisture or snow anomalies generated during spring by temperature and precipitation variations may explain river flow variations in summer, when no concomitant climate variations exist.
Müller-Kirsten, Harald J W
2013-01-01
Statistics links microscopic and macroscopic phenomena, and requires for this reason a large number of microscopic elements like atoms. The results are values of maximum probability or of averaging. This introduction to statistical physics concentrates on the basic principles, and attempts to explain these in simple terms supplemented by numerous examples. These basic principles include the difference between classical and quantum statistics, a priori probabilities as related to degeneracies, the vital aspect of indistinguishability as compared with distinguishability in classical physics, the differences between conserved and non-conserved elements, the different ways of counting arrangements in the three statistics (Maxwell-Boltzmann, Fermi-Dirac, Bose-Einstein), the difference between maximization of the number of arrangements of elements, and averaging in the Darwin-Fowler method. Significant applications to solids, radiation and electrons in metals are treated in separate chapters, as well as Bose-Eins...
Renyi statistics in equilibrium statistical mechanics
International Nuclear Information System (INIS)
Parvan, A.S.; Biro, T.S.
2010-01-01
The Renyi statistics in the canonical and microcanonical ensembles is examined both in general and in particular for the ideal gas. In the microcanonical ensemble the Renyi statistics is equivalent to the Boltzmann-Gibbs statistics. By the exact analytical results for the ideal gas, it is shown that in the canonical ensemble, taking the thermodynamic limit, the Renyi statistics is also equivalent to the Boltzmann-Gibbs statistics. Furthermore it satisfies the requirements of the equilibrium thermodynamics, i.e. the thermodynamical potential of the statistical ensemble is a homogeneous function of first degree of its extensive variables of state. We conclude that the Renyi statistics arrives at the same thermodynamical relations, as those stemming from the Boltzmann-Gibbs statistics in this limit.
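For orientation, the standard definition of the Rényi entropy and its $q \to 1$ limit can be written out as below. The notation ($q$ for the Rényi index, $p_i$ for the microstate probabilities, $k_B = 1$) is an assumption made here and is not taken from the abstract. The limit shown is the textbook reduction to the Boltzmann-Gibbs entropy and is separate from the thermodynamic-limit equivalence that the abstract reports.

```latex
% Renyi entropy of index q (assumed notation, k_B = 1)
S_q^{R} = \frac{1}{1-q}\,\ln \sum_i p_i^{\,q}, \qquad \sum_i p_i = 1 .

% Limit q -> 1 by L'Hopital's rule: differentiate numerator and denominator in q
\lim_{q \to 1} S_q^{R}
  = \lim_{q \to 1} \frac{\dfrac{\sum_i p_i^{\,q} \ln p_i}{\sum_i p_i^{\,q}}}{-1}
  = -\sum_i p_i \ln p_i = S^{BG} .
```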
Noro, Takahiko; Nakamoto, Kenji; Sato, Makoto; Yasuda, Noriko; Ito, Yoshinori; Ogawa, Shumpei; Nakano, Tadashi; Tsuneoka, Hiroshi
2014-10-01
We retrospectively examined intraocular pressure variations after visual field examination in primary open angle glaucoma (POAG), together with their influencing factors and their association with 24-hour intraocular pressure variations. Subjects were 94 eyes (52 POAG patients) subjected to measurements of 24-hour intraocular pressure and of changes in intraocular pressure after visual field examination using a Humphrey Visual Field Analyzer. Subjects were classified into three groups according to the magnitude of variation (large, intermediate and small), and 24-hour intraocular pressure variations were compared among the three groups. Factors influencing intraocular pressure variations after visual field examination and those associated with the large variation group were investigated. Average intraocular pressure variation after visual field examination was -0.28 ± 1.90 (range -6.0 to +5.0) mmHg. No significant influencing factors were identified. The intraocular pressure at 3 a.m. was significantly higher in the large variation group than in the other two groups (p field examination. Increases in intraocular pressure during the night might be associated with large intraocular pressure variations after visual field examination.
Statistical constraints on binary black hole inspiral dynamics
Energy Technology Data Exchange (ETDEWEB)
Galley, Chad R; Herrmann, Frank; Silberholz, John; Tiglio, Manuel [Department of Physics, Center for Fundamental Physics, Center for Scientific Computation and Mathematical Modeling, Joint Space Institute, University of Maryland, College Park, MD 20742 (United States); Guerberoff, Gustavo, E-mail: tiglio@umd.ed [Facultad de Ingeniería, Instituto de Matemática y Estadística 'Prof. Ing. Rafael Laguardia', Universidad de la República, Montevideo (Uruguay)
2010-12-21
We perform a statistical analysis of binary black holes in the post-Newtonian approximation by systematically sampling and evolving the parameter space of initial configurations for quasi-circular inspirals. Through a principal component analysis of spin and orbital angular momentum variables, we systematically look for uncorrelated quantities and find three of them which are highly conserved in a statistical sense, both as functions of time and with respect to variations in initial spin orientations. For example, we find a combination of spin scalar products, $2\,\hat{S}_1 \cdot \hat{S}_2 + (\hat{S}_1 \cdot \hat{L})(\hat{S}_2 \cdot \hat{L})$, that is exactly conserved in time at the considered post-Newtonian order (including spin-spin and radiative effects) for binaries with equal masses and spin magnitudes evolving in a quasi-circular inspiral. We also look for and find the variables that account for the largest variations in the problem. We present binary black hole simulations of the full Einstein equations analyzing to what extent these results might carry over to the full theory in the inspiral and merger regimes. Among other applications these results should be useful both in semi-analytical and numerical building of templates of gravitational waves for gravitational wave detectors.
Bladder filling variation during conformal radiotherapy for rectal cancer
Sithamparam, S.; Ahmad, R.; Sabarudin, A.; Othman, Z.; Ismail, M.
2017-05-01
Conformal radiotherapy for rectal cancer is associated with small bowel toxicity, mainly diarrhea. Treating patients with a full bladder is one practical way to reduce small bowel toxicity. Previous studies on prostate and cervical cancer patients revealed that maintaining a consistent bladder volume throughout radiotherapy treatment is challenging. The aim of this study was to measure bladder volume variation throughout radiotherapy treatment. This study also measured the association between bladder volume changes and diarrhea. Twenty-two rectal cancer patients were recruited prospectively. Patients were planned for treatment with a full bladder following the departmental bladder filling protocol, and the planning bladder volume was measured during CT simulation. During radiotherapy, the bladder volume was measured weekly using cone-beam computed tomography (CBCT) and compared to the planning bladder volume. Incidence and severity of diarrhea were recorded during the weekly patient review. There was a negative time trend in bladder volume over the five weeks of treatment. The mean bladder volume decreased by 18% from 123 mL (SD 54 mL) during CT simulation to 101 mL (SD 71 mL) in the 5th week of radiotherapy, but the decrease was not statistically significant. However, there was large variation in bladder volume within each patient during treatment. This study showed an association between changes in bladder volume and diarrhea (P = 0.045). In conclusion, bladder volume decreased over the course of conformal radiotherapy for rectal cancer, and there was large variation in bladder volume within patients.
Bladder filling variation during conformal radiotherapy for rectal cancer
International Nuclear Information System (INIS)
Sithamparam, S; Ahmad, R; Sabarudin, A; Othman, Z; Ismail, M
2017-01-01
Conformal radiotherapy for rectal cancer is associated with small bowel toxicity, mainly diarrhea. Treating patients with a full bladder is one practical way to reduce small bowel toxicity. Previous studies on prostate and cervical cancer patients revealed that maintaining a consistent bladder volume throughout radiotherapy treatment is challenging. The aim of this study was to measure bladder volume variation throughout radiotherapy treatment. This study also measured the association between bladder volume changes and diarrhea. Twenty-two rectal cancer patients were recruited prospectively. Patients were planned for treatment with a full bladder following the departmental bladder filling protocol, and the planning bladder volume was measured during CT simulation. During radiotherapy, the bladder volume was measured weekly using cone-beam computed tomography (CBCT) and compared to the planning bladder volume. Incidence and severity of diarrhea were recorded during the weekly patient review. There was a negative time trend in bladder volume over the five weeks of treatment. The mean bladder volume decreased by 18% from 123 mL (SD 54 mL) during CT simulation to 101 mL (SD 71 mL) in the 5th week of radiotherapy, but the decrease was not statistically significant. However, there was large variation in bladder volume within each patient during treatment. This study showed an association between changes in bladder volume and diarrhea (P = 0.045). In conclusion, bladder volume decreased over the course of conformal radiotherapy for rectal cancer, and there was large variation in bladder volume within patients. (paper)
Directory of Open Access Journals (Sweden)
C.C. Cabuga
2017-09-01
Pomacea caniculata, the Golden Apple Snail (GAS), is a rice pest in the Philippines and elsewhere in Asia. Geographic location also contributes to its increasing populations, making it invasive in freshwater habitats and rice field areas. This study was conducted to describe shell shape variations and sexual dimorphism among populations of P. caniculata. A total of 180 individuals were randomly collected from three lakes in Esperanza, Agusan del Sur (Lake Dakong Napo, Lake Oro, and Lake Cebulan), with 60 samples from each lake (30 males and 30 females). To determine the variations and sexual dimorphism in the shell shape of the golden apple snail, coordinates were subjected to relative warp analysis and the resulting data were subjected to Multivariate Analysis of Variance (MANOVA), Principal Component Analysis (PCA) and Canonical Variate Analysis (CVA). The results show statistically significant differences (P<0.05) for the male and female dorsal and ventral/apertural portions. Male and female spire height, body size, and shell opening shape also show significant variations. These phenotypic distinctions could be associated with geographic isolation, predation and the nutrient component of the gastropods. This highlights the importance of geometric morphometric methods in describing sexual dimorphism in the shell shape of P. caniculata.
International Nuclear Information System (INIS)
Robinson, M.T.
1993-01-01
The MARLOWE program was used to study the statistics of sputtering, using the example of 1- to 100-keV Au atoms normally incident on static (001) and (111) Au crystals. The yield of sputtered atoms was examined as a function of the impact point of the incident particles ("ions") on the target surfaces. There were variations on two scales. The effects of the axial and planar channeling of the ions could be traced, the details depending on the orientation of the target and the energies of the ions. Locally, the sputtering yield was very sensitive to the impact point, small changes in position often producing large changes in yield. Results indicate strongly that the sputtering yield is a random ("chaotic") function of the impact point.
Identification of basin characteristics influencing spatial variation of river flows
Mazvimavi, D.; Burgers, S.L.G.E.; Stein, A.
2006-01-01
The selection of basin characteristics that explain spatial variation of river flows is important for hydrological regionalization as this enables estimation of flow statistics of ungauged basins. A direct gradient analysis method, redundancy analysis, is used to identify basin characteristics,
International Nuclear Information System (INIS)
De Coninck, Dieter I.M.; Janssen, Colin R.; De Schamphelaere, Karel A.C.
2013-01-01
Highlights: •Interaction of a metal and cyanobacterium in 20 genetically distinct waterflea clones. •All observed effects were non-interactive. •This contrasted expectations based on shared modes of toxic action. -- Abstract: Interactive effects between chemical and natural stressors as well as genetically determined variation in stress tolerance among individuals may complicate risk assessment and management of chemical pollutants in natural ecosystems. Although genetic variation in tolerance to single stressors has been described extensively, genetic variation in interactive effects between two stressors has only rarely been investigated. Here, we examined the interactive effects between a chemical stressor (Cd) and a natural stressor (the cyanobacteria Microcystis aeruginosa) on the reproduction of Daphnia magna in 20 genetically different clones using a full-factorial experimental design and with the independent action model of joint stressor action as the reference theoretical framework. Across all clones, the reduction of 21-day reproduction compared to the control treatment (no Cd, no M. aeruginosa) ranged from −10% to 98% following Cd exposure alone, from 44% to 89% for Microcystis exposure alone, and from 61% to 98% after exposure to Cd + Microcystis combined. Three-way ANOVA on log-transformed reproduction data of all clones together did not detect a statistically significant Cd × Microcystis interaction term (F-test, p = 0.11), meaning that on average both stressors do not interact in inhibiting reproductive performance of D. magna. This finding contrasted expectations based on some known shared mechanisms of toxicity of Cd and Microcystis and therefore cautions against making predictions of interactive chemical + natural stressor effects from incomplete knowledge on affected biological processes and pathways. Further, still based on three-way ANOVA, we did not find statistically significant clone × Cd × Microcystis interaction when data for
Energy Technology Data Exchange (ETDEWEB)
De Coninck, Dieter I.M., E-mail: Dieter.DeConinck@UGent.be; Janssen, Colin R.; De Schamphelaere, Karel A.C.
2013-09-15
Highlights: •Interaction of a metal and cyanobacterium in 20 genetically distinct waterflea clones. •All observed effects were non-interactive. •This contrasted expectations based on shared modes of toxic action. -- Abstract: Interactive effects between chemical and natural stressors as well as genetically determined variation in stress tolerance among individuals may complicate risk assessment and management of chemical pollutants in natural ecosystems. Although genetic variation in tolerance to single stressors has been described extensively, genetic variation in interactive effects between two stressors has only rarely been investigated. Here, we examined the interactive effects between a chemical stressor (Cd) and a natural stressor (the cyanobacteria Microcystis aeruginosa) on the reproduction of Daphnia magna in 20 genetically different clones using a full-factorial experimental design and with the independent action model of joint stressor action as the reference theoretical framework. Across all clones, the reduction of 21-day reproduction compared to the control treatment (no Cd, no M. aeruginosa) ranged from −10% to 98% following Cd exposure alone, from 44% to 89% for Microcystis exposure alone, and from 61% to 98% after exposure to Cd + Microcystis combined. Three-way ANOVA on log-transformed reproduction data of all clones together did not detect a statistically significant Cd × Microcystis interaction term (F-test, p = 0.11), meaning that on average both stressors do not interact in inhibiting reproductive performance of D. magna. This finding contrasted expectations based on some known shared mechanisms of toxicity of Cd and Microcystis and therefore cautions against making predictions of interactive chemical + natural stressor effects from incomplete knowledge on affected biological processes and pathways. Further, still based on three-way ANOVA, we did not find statistically significant clone × Cd × Microcystis interaction when data for
Joseph, Bindu; Corwin, Jason A.; Kliebenstein, Daniel J.
2015-01-01
Recent studies are starting to show that genetic control over stochastic variation is a key evolutionary solution of single celled organisms in the face of unpredictable environments. This has been expanded to show that genetic variation can alter stochastic variation in transcriptional processes within multi-cellular eukaryotes. However, little is known about how genetic diversity can control stochastic variation within more non-cell autonomous phenotypes. Using an Arabidopsis reciprocal RIL population, we showed that there is significant genetic diversity influencing stochastic variation in the plant metabolome, defense chemistry, and growth. This genetic diversity included loci specific for the stochastic variation of each phenotypic class that did not affect the other phenotypic classes or the average phenotype. This suggests that the organism's networks are established so that noise can exist in one phenotypic level like metabolism and not permeate up or down to different phenotypic levels. Further, the genomic variation within the plastid and mitochondria also had significant effects on the stochastic variation of all phenotypic classes. The genetic influence over stochastic variation within the metabolome was highly metabolite specific, with neighboring metabolites in the same metabolic pathway frequently showing different levels of noise. As expected from bet-hedging theory, there was more genetic diversity and a wider range of stochastic variation for defense chemistry than found for primary metabolism. Thus, it is possible to begin dissecting the stochastic variation of whole organismal phenotypes in multi-cellular organisms. Further, there are loci that modulate stochastic variation at different phenotypic levels. Finding the identity of these genes will be key to developing complete models linking genotype to phenotype. PMID:25569687
Energy Technology Data Exchange (ETDEWEB)
Wallace, Jack, E-mail: jack.wallace@ce.queensu.ca [Department of Civil Engineering, Queen’s University, Ellis Hall, 58 University Avenue, Kingston, Ontario K7L 3N6 (Canada); Champagne, Pascale, E-mail: champagne@civil.queensu.ca [Department of Civil Engineering, Queen’s University, Ellis Hall, 58 University Avenue, Kingston, Ontario K7L 3N6 (Canada); Monnier, Anne-Charlotte, E-mail: anne-charlotte.monnier@insa-lyon.fr [National Institute for Applied Sciences – Lyon, 20 Avenue Albert Einstein, 69621 Villeurbanne Cedex (France)
2015-01-15
Highlights: • Performance of a hybrid passive landfill leachate treatment system was evaluated. • 33 Water chemistry parameters were sampled for 21 months and statistically analyzed. • Parameters were strongly linked and explained most (>40%) of the variation in data. • Alkalinity, ammonia, COD, heavy metals, and iron were criteria for performance. • Eight other parameters were key in modeling system dynamics and criteria. - Abstract: A pilot-scale hybrid-passive treatment system operated at the Merrick Landfill in North Bay, Ontario, Canada, treats municipal landfill leachate and provides for subsequent natural attenuation. Collected leachate is directed to a hybrid-passive treatment system, followed by controlled release to a natural attenuation zone before entering the nearby Little Sturgeon River. The study presents a comprehensive evaluation of the performance of the system using multivariate statistical techniques to determine the interactions between parameters, major pollutants in the leachate, and the biological and chemical processes occurring in the system. Five parameters (ammonia, alkalinity, chemical oxygen demand (COD), “heavy” metals of interest, with atomic weights above calcium, and iron) were set as criteria for the evaluation of system performance based on their toxicity to aquatic ecosystems and importance in treatment with respect to discharge regulations. System data for a full range of water quality parameters over a 21-month period were analyzed using principal components analysis (PCA), as well as principal components (PC) and partial least squares (PLS) regressions. PCA indicated a high degree of association for most parameters with the first PC, which explained a high percentage (>40%) of the variation in the data, suggesting strong statistical relationships among most of the parameters in the system. Regression analyses identified 8 parameters (set as independent variables) that were most frequently retained for modeling
Modelling multiple hospital outcomes: the impact of small area and primary care practice variation
Directory of Open Access Journals (Sweden)
Congdon Peter
2006-11-01
Background: Appropriate management of care (for example, avoiding unnecessary attendances at, or admissions to, hospital emergency units when they could be handled in primary care) is an important part of health strategy. However, some variations in these outcomes could be due to genuine variations in health need. This paper proposes a new method of explaining variations in hospital utilisation across small areas and the general practices (GPs) responsible for patient primary care. By controlling for the influence of true need on such variations, one may identify remaining sources of excess emergency attendances and admissions, both at area and practice level, that may be related to the quality, resourcing or organisation of care. The present paper accordingly develops a methodology that recognises the interplay between population mix factors (health need) and primary care factors (e.g. referral thresholds), that allows for unobserved influences on hospitalisation usage, and that also reflects interdependence between hospital outcomes. A case study considers relativities in attendance and admission rates at a North London hospital involving 149 small areas and 53 GP practices. Results: A fixed effects model shows variations in attendances and admissions are significantly related (positively) to area and practice need, and nursing home patients, and related (negatively) to primary care access and distance of patient homes from the hospital. Modelling the impact of known factors alone is not sufficient to produce a satisfactory fit to the observations, and random effects at area and practice level are needed to improve fit and account for overdispersion. Conclusion: The case study finds variation in attendance and admission rates across areas and practices after controlling for need, and remaining differences between practices may be attributable to referral behaviour unrelated to need, or to staffing, resourcing, and access issues. In
Lack of Day/Night variation in fibroblast growth factor 21 levels in young healthy men.
Foo, J-P; Aronis, K N; Chamberland, J P; Mantzoros, C S
2015-06-01
Fibroblast growth factor (FGF) 21 is an endocrine factor with an emerging role as a metabolic regulator. We previously reported the presence of a significant day/night variation of FGF-21 in energy-replete, healthy female subjects. However, the day/night patterns of secretion in male subjects remain to be fully elucidated. This study aimed to elucidate the day/night pattern of FGF-21 levels in male subjects in the energy-replete state and its relationship to free fatty acids (FFA), and to investigate whether a sexual dimorphism exists in FGF-21 physiology. Eight healthy lean male subjects were studied for up to 5 days while on an isocaloric diet. Blood samples were obtained hourly for measurement of FGF-21 and FFA from 08:00 on day 4 until 08:00 on day 5. Circulating FGF-21 levels did not exhibit any statistically significant day/night variation during the isocaloric fed state in male subjects. FGF-21 levels in male subjects were closely cross-correlated with FFA levels, similar to female subjects. A sexual dimorphism exists in FGF-21 physiology: in contrast to female subjects, male subjects in the energy-replete state show no significant day/night variation in FGF-21 rhythm. Because the circulating pattern of FGF-21 was highly cross-correlated with FFA levels in male subjects, as in female subjects, the sexual dimorphism in FGF-21 physiology may be related to the differing lipid metabolism of the two sexes.
Feature discrimination/identification based upon SAR return variations
Rasco, W. A., Sr.; Pietsch, R.
1978-01-01
The look-to-look variation statistics in the returns recorded in flight by a digital, real-time SAR system are analyzed. The determination that the variations in the look-to-look returns from different classes do carry information content unique to the classes was illustrated by a model based on four variants derived from the four-look in-flight SAR data under study. The model was limited to four classes of returns: mowed grass on an athletic field, rough unmowed grass and weeds on a large vacant field, young fruit trees in a large orchard, and metal mobile homes and storage buildings in a large mobile home park. The data population, in excess of 1000 returns, represented over 250 individual pixels from the four classes. The multivariant discriminant model operated on the set of returns for each pixel and assigned that pixel to one of the four classes, based on the target variants and the probability distribution function of the four variants for each class.
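The assignment step described in this abstract, in which each pixel goes to the class whose variant distributions make its observed values most probable, can be sketched as a maximum-likelihood classifier. Everything specific below is an assumption made for illustration: the abstract does not give the form of the probability distribution functions, so independent Gaussians are swapped in, and the class labels, means, and standard deviations are hypothetical placeholders rather than values from the study.

```python
import math


def gaussian_log_pdf(x, mean, std):
    """Log-density of a one-dimensional Gaussian (assumed distribution form)."""
    return -0.5 * math.log(2 * math.pi * std ** 2) - (x - mean) ** 2 / (2 * std ** 2)


def classify_pixel(variants, class_models):
    """Assign a pixel to the class with the highest total log-likelihood.

    variants: the four variation measures computed for one pixel.
    class_models: maps a class label to a list of (mean, std) pairs,
    one pair per variant. Variants are treated as independent (assumption).
    """
    best_label, best_score = None, -math.inf
    for label, params in class_models.items():
        score = sum(
            gaussian_log_pdf(x, mean, std)
            for x, (mean, std) in zip(variants, params)
        )
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical per-class (mean, std) values for each of the four variants.
CLASS_MODELS = {
    "mowed_grass": [(1.0, 0.5)] * 4,
    "rough_grass": [(2.0, 0.8)] * 4,
    "orchard": [(3.5, 1.0)] * 4,
    "metal_buildings": [(6.0, 2.0)] * 4,
}

print(classify_pixel([1.1, 0.9, 1.0, 1.2], CLASS_MODELS))
print(classify_pixel([5.5, 6.5, 6.0, 7.0], CLASS_MODELS))
```

Summing log-densities rather than multiplying densities avoids numerical underflow when many variants are combined; a model with correlated variants would need a multivariate density instead.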
Directory of Open Access Journals (Sweden)
Yan-Lin Zheng
The budding yeast Saccharomyces cerevisiae is a platform organism for bioethanol production from various feedstocks and robust strains are desirable for efficient fermentation because yeast cells inevitably encounter stressors during the process. Recently, diverse S. cerevisiae lineages were identified, which provided novel resources for understanding stress tolerance variations and related shaping factors in the yeast. This study characterized the tolerance of diverse S. cerevisiae strains to the stressors of high ethanol concentrations, temperature shocks, and osmotic stress. The results showed that the isolates from human-associated environments overall presented a higher level of stress tolerance compared with those from forests spared anthropogenic influences. Statistical analyses indicated that the variations of stress tolerance were significantly correlated with both ecological sources and geographical locations of the strains. This study provides guidelines for selection of robust S. cerevisiae strains for bioethanol production from nature.
Prediction of ppm level electrical failure by using physical variation analysis
Hou, Hsin-Ming; Kung, Ji-Fu; Hsu, Y.-B.; Yamazaki, Y.; Maruyama, Kotaro; Toyoshima, Yuya; Chen, Chu-en
2016-03-01
The quality of patterns printed on a wafer may be attributed to factors such as process window control, pattern fidelity, overlay performance, and metrology. Each of these factors plays an important role in making the process more effective by ensuring that certain design- and process-specific parameters are kept within acceptable variation. As chip size and pattern density increase, in-line, real-time capture of in-chip weak patterns/defects per million opportunities (WP-DPMO) plays an increasingly significant role in product yield for high-density memory. However, current in-line inspection tools focus on single-layer defect inspection and cannot effectively and efficiently catch multi-layer weak patterns/defects, even through voltage contrast and/or special test structure design [1]-[2]. In general, multi-layer weak patterns/defects easily escape in-line inspection, so product dysfunction goes unnoticed until time-consuming off-line final PFA/EFA is performed. To monitor potential multi-layer weak patterns in-line and in real time, we quantify the bridge electrical metric between contact and gate electrodes as a CD physical metric via big data from the larger field of view (FOV: 8k x 16k with 3 nm pixel, equivalent to an image main field size of 34 um x 34 um @ 3 nm pixel) e-beam quality image contour compared to the layout GDS database (D2DB), as shown in Fig. 1. Hadoop-based distributed parallel computing is implemented to improve the performance of big data architectures (Fig. 2). Therefore, state-of-the-art in-line, real-time capture of in-chip potential multi-layer weak patterns can be demonstrated through several case studies [3]. Manufacturing sources of variation can then be partitioned into systematic and random variations by applying statistical techniques based on the big data infrastructure. After big data handling, the in-chip CD and AA variations are distinguished by
International Nuclear Information System (INIS)
Densmore, Jeffery D.; Larsen, Edward W.
2003-01-01
The Variational Variance Reduction (VVR) method is an effective technique for increasing the efficiency of Monte Carlo simulations [Ann. Nucl. Energy 28 (2001) 457; Nucl. Sci. Eng., in press]. This method uses a variational functional, which employs first-order estimates of forward and adjoint fluxes, to yield a second-order estimate of a desired system characteristic - which, in this paper, is the criticality eigenvalue k. If Monte Carlo estimates of the forward and adjoint fluxes are used, each having global 'first-order' errors of O(1/√N), where N is the number of histories used in the Monte Carlo simulation, then the statistical error in the VVR estimation of k will in principle be O(1/N). In this paper, we develop this theoretical possibility and demonstrate with numerical examples that implementations of the VVR method for criticality problems can approximate O(1/N) convergence for significantly large values of N
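The step from first-order flux errors to a second-order eigenvalue error can be made explicit. The following is a generic sketch of the argument, not the specific functional used by the authors: the symbol F and the error terms δφ and δφ* are notation assumed here purely for illustration.

```latex
% Assumed notation: F is a variational functional for k that is stationary
% at the exact forward and adjoint fluxes (\phi, \phi^*), with F[\phi, \phi^*] = k.
\begin{align}
F[\phi + \delta\phi,\; \phi^* + \delta\phi^*]
  &= k + \underbrace{\delta F}_{=\,0 \text{ (stationarity)}}
       + O\!\left(\|\delta\phi\|^2 + \|\delta\phi\|\,\|\delta\phi^*\| + \|\delta\phi^*\|^2\right), \\
\|\delta\phi\|,\ \|\delta\phi^*\| = O\!\left(N^{-1/2}\right)
  &\;\Longrightarrow\;
  F[\phi + \delta\phi,\; \phi^* + \delta\phi^*] - k = O\!\left(N^{-1}\right).
\end{align}
```

Because the first-order variation vanishes, only products of the two flux errors survive, which is why the abstract describes the O(1/N) rate as attainable "in principle".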
Hanihara, T; Ishida, H
2001-06-01
Four supernumerary ossicle variations (the ossicle at the lambda, the parietal notch bone, the asterionic bone, and the occipitomastoid bone) were examined for laterality differences, intertrait correlations, sex differences, and between-group variations in samples from around the world. Significant laterality differences were not detected in almost all samples. In some pairs of traits, significant associations of occurrence were found. Several geographic samples were sexually dimorphic with respect to the asterionic bone and, to a lesser extent, the parietal notch bone. East/Northeast Asians, including the Arctic populations, in general had lower frequencies of the 4 accessory ossicles. Australians, Melanesians and the majority of the New World peoples, on the other hand, generally had high frequencies. In the western hemisphere of the Old World, Subsaharan Africans had relatively high frequencies. Except for the ossicle at the lambda, the distribution pattern in incidence showed clinal variation from south to north. Any identifiable adaptive value related to environmental or subsistence factors may be expressed in such clinal variation. This may allow us to hypothesise that not only mechanical factors but a founder effect, genetic drift, and population structure could have been the underlying causes for interregional variation and possible clines in the incidences of the accessory ossicles.
Nowok, B.
2010-01-01
In today's globalized world, there is increasing demand for reliable and comparable statistics on international migration. This book contributes to a more profound understanding of the effect of definitional variations on the figures that are reported. The framework developed here for the
Directory of Open Access Journals (Sweden)
Sunando Roy
2009-10-01
Feline immunodeficiency virus (FIV) and human immunodeficiency virus (HIV) are recently identified lentiviruses that cause progressive immune decline and ultimately death in infected cats and humans. It is of great interest to understand how to prevent immune system collapse caused by these lentiviruses. We recently described that disease caused by a virulent FIV strain in cats can be attenuated if animals are first infected with a feline immunodeficiency virus derived from a wild cougar. The detailed temporal tracking of cat immunological parameters in response to two viral infections resulted in high-dimensional datasets containing variables that exhibit strong co-variation. Initial analyses of these complex data using univariate statistical techniques did not account for interactions among immunological response variables and therefore potentially obscured significant effects between infection state and immunological parameters. Here, we apply a suite of multivariate statistical tools, including Principal Component Analysis, MANOVA and Linear Discriminant Analysis, to temporal immunological data resulting from FIV superinfection in domestic cats. We investigated the co-variation among immunological responses, the differences in immune parameters among four groups of five cats each (uninfected, single and dual infected animals), and the "immune profiles" that discriminate among them over the first four weeks following superinfection. Dual infected cats mount an immune response by 24 days post superinfection that is characterized by elevated levels of CD8 and CD25 cells and increased expression of IL4 and IFNgamma, and FAS. This profile discriminates dual infected cats from cats infected with FIV alone, which show high IL-10 and lower numbers of CD8 and CD25 cells. Multivariate statistical analyses demonstrate both the dynamic nature of the immune response to FIV single and dual infection and the development of a unique immunological profile in dual
Genetic variation in genes of the fatty acid synthesis pathway and breast cancer risk
DEFF Research Database (Denmark)
Campa, Daniele; McKay, James; Sinilnikova, Olga
2009-01-01
and FASN) is related to breast cancer risk and body-mass index (BMI) by studying 1,294 breast cancer cases and 2,452 controls from the European Prospective Investigation on Cancer (EPIC). We resequenced the FAS gene and combined information of SNPs found by resequencing and SNPs from public databases....... Using a tagging approach and selecting 20 SNPs, we covered all the common genetic variation of these genes. In this study we were not able to find any statistically significant association between the SNPs in the FAS, ChREBP and SREBP-1 genes and an increased risk of breast cancer overall...
Ely, Craig R.; Fox, A.D.; Alisauskas, R.T.; Andreev, A.; Bromley, R.G.; Degtyarev, Andrei G.; Ebbinge, B.; Gurtovaya, E.N.; Kerbes, R.; Kondratyev, Alexander V.; Kostin, I.; Krechmar, A.V.; Litvin, K.E.; Miyabayashi, Y.; Moou, J.H.; Oates, R.M.; Orthmeyer, D.L.; Sabano, Yutaka; Simpson, S.G.; Solovieva, D.V.; Spindler, Michael A.; Syroechkovsky, Y.V.; Takekawa, John Y.; Walsh, A.
2005-01-01
Capsule: Greater White-fronted Geese show significant variation in body size from sampling locations throughout their circumpolar breeding range. Aims: To determine the degree of geographical variation in body size of Greater White-fronted Geese and identify factors contributing to any apparent patterns in variation. Methods: Structural measures of >3000 geese from 16 breeding areas throughout the Holarctic breeding range of the species were compared statistically. Results: Palearctic forms varied clinally, and increased in size from the smallest forms on the Kanin and Taimyr peninsulas in western Eurasia to the largest forms breeding in the Anadyr Lowlands of eastern Chukotka. Clinal variation was less apparent in the Nearctic, as both the smallest form in the Nearctic and the largest form overall (the Tule Goose) were from different breeding areas in Alaska. The Tule Goose was 25% larger than the smallest form. Birds from Greenland (A. a. flavirostris) were the second largest, although only slightly larger than geese from several North American populations. Body size was not correlated with breeding latitude but was positively correlated with temperature on the breeding grounds, breeding habitat, and migration distance. Body mass of Greater White-fronted Geese from all populations remained relatively constant during the period of wing moult. Morphological distinctness of eastern and western Palearctic forms concurs with earlier findings of complete range disjunction. Conclusions: Patterns of morphological variation in Greater White-fronted Geese across the Holarctic can be generally attributed to adaptation to variable breeding environments, migration requirements, and phylo-geographical histories.
Statistical shape and appearance models of bones.
Sarkalkan, Nazli; Weinans, Harrie; Zadpoor, Amir A
2014-03-01
When applied to bones, statistical shape models (SSM) and statistical appearance models (SAM) respectively describe the mean shape and mean density distribution of bones within a certain population as well as the main modes of variations of shape and density distribution from their mean values. The availability of this quantitative information regarding the detailed anatomy of bones provides new opportunities for diagnosis, evaluation, and treatment of skeletal diseases. The potential of SSM and SAM has been recently recognized within the bone research community. For example, these models have been applied for studying the effects of bone shape on the etiology of osteoarthritis, improving the accuracy of clinical osteoporotic fracture prediction techniques, design of orthopedic implants, and surgery planning. This paper reviews the main concepts, methods, and applications of SSM and SAM as applied to bone. Copyright © 2013 Elsevier Inc. All rights reserved.
Security of statistical data bases: invasion of privacy through attribute correlational modeling
Energy Technology Data Exchange (ETDEWEB)
Palley, M.A.
1985-01-01
This study develops, defines, and applies a statistical technique for the compromise of confidential information in a statistical data base. Attribute Correlational Modeling (ACM) recognizes that the information contained in a statistical data base represents real world statistical phenomena. As such, ACM assumes correlational behavior among the database attributes. ACM proceeds to compromise confidential information through creation of a regression model, where the confidential attribute is treated as the dependent variable. The typical statistical data base may preclude the direct application of regression. In this scenario, the research introduces the notion of a synthetic data base, created through legitimate queries of the actual data base, and through proportional random variation of responses to these queries. The synthetic data base is constructed to resemble the actual data base as closely as possible in a statistical sense. ACM then applies regression analysis to the synthetic data base, and utilizes the derived model to estimate confidential information in the actual database.
Occupational Variation in End-of-Life Care Intensity.
Hyder, Joseph A; Haring, R Sterling; Sturgeon, Daniel; Gazarian, Priscilla K; Jiang, Wei; Cooper, Zara; Lipsitz, Stuart R; Prigerson, Holly G; Weissman, Joel S
2018-03-01
End-of-life (EOL) care intensity is known to vary by secular and geographic patterns. US physicians receive less aggressive EOL care than the general population, presumably the result of preferences shaped by workplace experience with EOL care. We investigated occupation as a source of variation in EOL care intensity. Across 4 states, we identified 660 599 non-health maintenance organization Medicare beneficiaries aged ≥66 years who died between 2004 and 2011. Linking death certificates, we identified beneficiaries with prespecified occupations: nurses, farmers, clergy, mortuary workers, homemakers, first-responders, veterinary workers, teachers, accountants, and the general population. End-of-life care intensity over the last 6 months of life was assessed using 5 validated measures: (1) Medicare expenditures, and rates of (2) hospice, (3) surgery, (4) intensive care, and (5) in-hospital death. Occupation was a source of large variation in EOL care intensity across all measures, before and after adjustment for sex, education, age-adjusted Charlson Comorbidity Index, race/ethnicity, and hospital referral region. For example, absolute and relative adjusted differences in expenditures were US$9991 and 42% of population mean expenditure ( P EOL care intensity measures, teachers (5 of 5), homemakers (4 of 5), farmers (4 of 5), and clergy (3 of 5) demonstrated significantly less aggressive care. Mortuary workers had lower EOL care intensity (4 of 5) but small numbers limited statistical significance. Occupations with likely exposure to child development, death/bereavement, and naturalistic influences demonstrated lower EOL care intensity. These findings may inform patients and clinicians navigating choices around individual EOL care preferences.
Detecting microsatellites within genomes: significant variation among algorithms
Directory of Open Access Journals (Sweden)
Rivals Eric
2007-04-01
Background: Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we compared the detection efficiency of five of them (TRF, Mreps, Sputnik, STAR, and RepeatMasker). Results: Our analysis was first conducted on the human X chromosome, and microsatellite distributions were characterized by microsatellite number, length, and divergence from a pure motif. The algorithms work with user-defined parameters, and we demonstrate that the parameter values chosen can strongly influence microsatellite distributions. The five algorithms were then compared by fixing parameter settings, and the analysis was extended to three other genomes (Saccharomyces cerevisiae, Neurospora crassa and Drosophila melanogaster) spanning a wide range of size and structure. Significant differences for all characteristics of microsatellites were observed among algorithms, but not among genomes, for both perfect and imperfect microsatellites. Striking differences were detected for short microsatellites (below 20 bp), regardless of motif. Conclusion: Since the algorithm used strongly influences empirical distributions, studies analyzing microsatellite evolution based on a comparison between empirical and theoretical size distributions should be considered with caution. We also discuss why a typological definition of microsatellites limits our capacity to capture their genomic distributions.
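The dependence of detected microsatellites on user-chosen parameters, noted in the abstract above, can be illustrated with a minimal sketch. This is not any of the five algorithms compared in the study: the function name, the `max_motif` and `min_length` parameters, and the toy sequence are all assumptions introduced for illustration, and only perfect repeats are handled.

```python
import re

def find_perfect_microsatellites(seq, max_motif=6, min_length=12):
    """Return (start, motif, total_length) for perfect tandem repeats.

    Hypothetical helper: the lazy quantifier makes the regex try the
    shortest motif first, and the backreference requires at least two copies.
    """
    pattern = re.compile(r"([ACGT]{1,%d}?)\1+" % max_motif)
    hits = []
    for match in pattern.finditer(seq):
        if len(match.group(0)) >= min_length:
            hits.append((match.start(), match.group(1), len(match.group(0))))
    return hits

# Toy sequence (not real genomic data): a 12 bp (AC)6 repeat and an 8 bp (T)8 run.
seq = "TTG" + "AC" * 6 + "GGA" + "T" * 8 + "C"

strict = find_perfect_microsatellites(seq, min_length=12)
relaxed = find_perfect_microsatellites(seq, min_length=8)
print(len(strict), len(relaxed))
```

Lowering the minimum length changes which short repeats are counted, which is the kind of parameter sensitivity the authors report for microsatellites below 20 bp.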
International Nuclear Information System (INIS)
Tsili, A.C.; Argyropoulou, M.I.; Tzarouchi, L.; Dalkalitsis, N.; Koliopoulos, G.; Paraskevaidis, E.; Tsampoulas, K.
2012-01-01
Objectives: To assess the apparent diffusion coefficient (ADC) changes of the normal uterine zones among reproductive women during the menstrual cycle. Methods: The study included 101 women of reproductive age, each with a regular cycle and normal endometrium/myometrium, as proved on histopathology or MR imaging examination. Diffusion-weighted (DW) imaging was performed along the axial plane, using a single-shot, multi-slice spin-echo planar diffusion pulse sequence and b-values of 0 and 800 s/mm². The mean and standard deviation of the ADC values of normal endometrium/myometrium were calculated for the menstrual, proliferative and secretory phases. Analysis of variance followed by the least significant difference test was used for statistical analysis. Results: The ADC values of the endometrium were different in the three phases of the menstrual cycle (menstrual phase: 1.25 ± 0.27; proliferative phase: 1.39 ± 0.20; secretory phase: 1.50 ± 0.18) (F: 9.64, p: 0.00). A statistically significant difference was observed among all groups (p 0.05). Conclusions: A wide variation of ADC values of normal endometrium and myometrium is observed during different phases of the menstrual cycle.
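The analysis of variance mentioned in the abstract above compares between-group and within-group variability. A minimal sketch of the one-way F statistic follows; the function name and the ADC-like values are hypothetical (they are not the study's data), and the follow-up least significant difference test is not implemented here.

```python
def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    means = [sum(g) / len(g) for g in groups]
    grand_mean = sum(sum(g) for g in groups) / n_total
    # Variability of the group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Variability of observations around their own group mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    df_between, df_within = k - 1, n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Hypothetical ADC-like values per cycle phase, for illustration only.
adc_by_phase = {
    "menstrual": [1.0, 1.2, 1.4],
    "proliferative": [1.2, 1.4, 1.6],
    "secretory": [1.4, 1.5, 1.7],
}
f_stat, df_b, df_w = one_way_anova_f(list(adc_by_phase.values()))
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")
```

A large F relative to the F distribution with the returned degrees of freedom suggests that at least one phase mean differs; pairwise comparisons would then be needed to say which.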
Richardson, A. D.; Reichstein, M.; Piao, S.; Ciais, P.; Luyssaert, S.; Stockli, R.; Friedl, M.; Gobron, N.; Fluxnet Site Pis, 21
2009-04-01
generally explaining more, and remote sensing-derived metrics generally explaining less, of the variation in flux anomalies. We found that GPP (gross primary productivity) was consistently more sensitive (both in terms of magnitude and statistical significance; ≈3 g C m-2 d-1 for DBF and ≈2 g C m-2 d-1 for ENF) to phenology than was Reco (ecosystem respiration), which meant that NEP (net ecosystem productivity) tended to be increased both by earlier springs and later autumns. Without exception, when the difference between DBF and ENF in the sensitivity to phenological anomalies was statistically significant, DBF sensitivity was always larger in absolute magnitude than ENF sensitivity. Phenology explained a much larger fraction of the variation in fluxes across sites compared to within sites. Across sites, the rate of increase in GPP with an "extra" day in spring (≈10 g C m-2 d-1) was much larger than in autumn (≈3 g C m-2 d-1). Furthermore, a one-day increase in growing season length across sites increased annual NEP by just ≈2 g C m-2 d-1; this resulted from an increase in GPP of ≈6 g C m-2 d-1 being offset by an increase in RE of ≈4 g C m-2 d-1. In general, there was no statistically significant difference between DBF and ENF in the sensitivity to spatial variation in phenology for either NEP or the component fluxes GPP and Reco. In relation to both within- and across-site variation in phenology and fluxes, the results obtained tended to depend on the phenological metric used, i.e. definition of "start" and "end" of growing season, emphasizing the need for improved understanding of the relationships between these different metrics and ecosystem processes. Furthermore, the differences in flux-phenology relationships in the context of spatial and temporal variation in phenology raise questions about using results from either short-term or space-for-time studies to anticipate responses to future climate change.
Statistics: a Bayesian perspective
National Research Council Canada - National Science Library
Berry, Donald A
1996-01-01
...: it is the only introductory textbook based on Bayesian ideas, it combines concepts and methods, it presents statistics as a means of integrating data into the significant process, it develops ideas...
Directory of Open Access Journals (Sweden)
Tebaldi Toma
2012-06-01
Background: The classical view on eukaryotic gene expression proposes the scheme of a forward flow for which fluctuations in mRNA levels upon a stimulus contribute to determine variations in mRNA availability for translation. Here we address this issue by simultaneously profiling with microarrays the total mRNAs (the transcriptome) and the polysome-associated mRNAs (the translatome) after EGF treatment of human cells, and extending the analysis to 19 other transcriptome/translatome comparisons in mammalian cells following different stimuli or undergoing cell programs. Results: Triggering of the EGF pathway results in an early induction of transcriptome and translatome changes, but 90% of the significant variation is limited to the translatome and the degree of concordant changes is less than 5%. The survey of the 19 other transcriptome/translatome comparisons shows that extensive uncoupling is a general rule, in terms of both RNA movements and inferred cell activities, with a strong tendency of translation-related genes to be controlled purely at the translational level. By different statistical approaches, we finally provide evidence of the lack of dependence between changes at the transcriptome and translatome levels. Conclusions: We propose a model of diffused independency between variation in transcript abundances and variation in their engagement on polysomes, which implies the existence of specific mechanisms to couple these two ways of regulating gene expression.
Directory of Open Access Journals (Sweden)
Fernando Carvalho
2014-09-01
This study aimed to analyze the occurrence of seasonal variations in the number of captures of Artibeus lituratus and Sturnira lilium in the upper strata of an Atlantic Forest remnant in southern Brazil. It was conducted in the town of Pedras Grandes, in the southern end of Santa Catarina. The chiropterans were captured with mist nets installed in the canopy and subcanopy. To check whether there were differences in the number of captures between seasons, we used the chi-square test (χ2), with a significance level of 0.05, and, whenever needed, partial χ2 tests. Artibeus lituratus showed significant differences between seasons, with the largest number of captures occurring in autumn. For S. lilium we did not observe statistically significant differences. The seasonal variation found for A. lituratus may be related to its diet, which is based on fruits whose availability varies seasonally. For S. lilium, besides the diet, mainly based on plants whose fruit availability does not vary seasonally, the altitude of the study area and its variations in temperature also seem to explain the absence of seasonal variation.
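The chi-square comparison of captures between seasons described above can be sketched as a goodness-of-fit test against equal expected counts. The capture counts below are hypothetical (not taken from the study), the function name is an assumption, and the exact design of the authors' test, including their partial χ2 tests, may differ.

```python
# Upper 5% point of the chi-square distribution with 3 degrees of freedom
# (four seasons minus one), matching the 0.05 significance level.
CHI2_CRIT_DF3_ALPHA05 = 7.815

def chi_square_uniform(counts):
    """Pearson chi-square statistic against equal expected counts."""
    total = sum(counts)
    expected = total / len(counts)
    return sum((c - expected) ** 2 / expected for c in counts)

# Hypothetical captures per season, for illustration only.
captures = {"summer": 12, "autumn": 30, "winter": 10, "spring": 8}
stat = chi_square_uniform(list(captures.values()))
significant = stat > CHI2_CRIT_DF3_ALPHA05
print(f"chi2 = {stat:.2f}, significant at 0.05: {significant}")
```

With counts concentrated in one season the statistic exceeds the critical value, whereas evenly spread counts give a statistic near zero.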
Evaluation of the temporal variation of air quality in Rome, Italy from 1999 to 2008
Directory of Open Access Journals (Sweden)
Giorgio Cattani
2010-01-01
The main objective of this study was to assess the temporal variation (1999 through 2008) of air quality in Rome, focusing on airborne concentrations of selected pollutants (PM10 and PM2.5 mass concentration and particle number concentration, PNC; carbon monoxide, CO; nitrogen oxides, NO and NO2) used for health effects assessment in epidemiological analyses. Time series analysis using the Seasonal Kendall test was applied. A statistically significant decreasing trend was found for primary gaseous pollutants and total particle number concentrations. Moreover, a decreasing trend was found for PM10, PM2.5 and NO2 measured at traffic-oriented sites, although the estimated reduction was lower than for NO, CO and PNC. The urban background PM10 and NO2 concentrations seem to be practically unchanged since 1999, as no statistically significant trends were found. All the pollutants show a steeper estimated trend line at traffic-oriented sites than at the urban background. Thus the intra-city concentration variability decreased over the years.
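The Seasonal Kendall test named in the abstract above sums Mann-Kendall statistics computed separately within each season, so that seasonal cycles do not mask a long-term trend. The sketch below is a simplified version under stated assumptions: no correction for tied values or serial correlation, hypothetical function names, and made-up concentration series rather than the Rome data.

```python
import math

def sign(value):
    """Return -1, 0 or 1 according to the sign of value."""
    return (value > 0) - (value < 0)

def mann_kendall_s(series):
    """Mann-Kendall S: concordant minus discordant pairs in time order."""
    n = len(series)
    return sum(sign(series[k] - series[j])
               for j in range(n - 1) for k in range(j + 1, n))

def seasonal_kendall(data):
    """data maps each season to its yearly values in chronological order.

    Returns (S, Z), where Z uses the usual continuity correction and the
    no-ties variance n(n-1)(2n+5)/18 summed over seasons.
    """
    s_total = 0
    var_total = 0.0
    for values in data.values():
        n = len(values)
        s_total += mann_kendall_s(values)
        var_total += n * (n - 1) * (2 * n + 5) / 18.0
    if s_total == 0:
        return s_total, 0.0
    return s_total, (s_total - sign(s_total)) / math.sqrt(var_total)

# Hypothetical yearly means for two seasons over ten years (illustration only).
concentrations = {
    "winter": [50 - 2 * year for year in range(10)],
    "summer": [30 - year for year in range(10)],
}
s_stat, z_score = seasonal_kendall(concentrations)
print(s_stat, round(z_score, 2))
```

A strongly negative Z, compared against standard normal quantiles, indicates a decreasing trend; a production analysis would also need tie and autocorrelation corrections.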
Directory of Open Access Journals (Sweden)
Romeo B. Lee
2016-02-01
The study seeks to estimate gender variations in the direct effects of (a) number of organizational memberships, (b) number of social networking sites (SNS), and (c) grade-point average (GPA) on global social responsibility (GSR); and in the indirect effects of (a) and of (b) through (c) on GSR. Cross-sectional survey data were drawn from questionnaire interviews involving 3,173 Filipino university students. Based on a path model, the three factors were tested to determine their inter-relationships and their relationships with GSR. The direct and total effects of the exogenous factors on the dependent variable are statistically significantly robust. The indirect effects of organizational memberships on GSR through GPA are also statistically significant, but the indirect effects of SNS on GSR through GPA are marginal. Men and women significantly differ only in terms of the total effects of their organizational memberships on GSR. The lack of broad gender variations in the effects of SNS, organizational memberships and GPA on GSR may be linked to the relatively homogenous characteristics and experiences of the university students interviewed. There is a need for more path models to better understand the predictors of GSR in local students.
Lee, Romeo B.; Baring, Rito V.; Sta. Maria, Madelene A.
2016-01-01
The study seeks to estimate gender variations in the direct effects of (a) number of organizational memberships, (b) number of social networking sites (SNS), and (c) grade-point average (GPA) on global social responsibility (GSR); and in the indirect effects of (a) and of (b) through (c) on GSR. Cross-sectional survey data were drawn from questionnaire interviews involving 3,173 Filipino university students. Based on a path model, the three factors were tested to determine their inter-relationships and their relationships with GSR. The direct and total effects of the exogenous factors on the dependent variable are statistically significantly robust. The indirect effects of organizational memberships on GSR through GPA are also statistically significant, but the indirect effects of SNS on GSR through GPA are marginal. Men and women significantly differ only in terms of the total effects of their organizational memberships on GSR. The lack of broad gender variations in the effects of SNS, organizational memberships and GPA on GSR may be linked to the relatively homogenous characteristics and experiences of the university students interviewed. There is a need for more path models to better understand the predictors of GSR in local students. PMID:27247700
Statistical Analysis and Evaluation of the Depth of the Ruts on Lithuanian State Significance Roads
Directory of Open Access Journals (Sweden)
Erinijus Getautis
2011-04-01
The aim of this work is to gather information about rut depth on national flexible-pavement roads, to determine its statistical dispersion index, and to determine whether it meets the applicable requirements. An analysis of scientific work on the appearance of ruts in asphalt and their influence on driving is presented. Dynamic models of ruts in asphalt are presented as well. Experimental data on rut depth dispersion on the Lithuanian national highway Vilnius – Kaunas are provided. Conclusions are formulated and presented. Article in Lithuanian
A survey of variational principles
International Nuclear Information System (INIS)
Lewins, J.D.
1993-01-01
This article surveys variational principles. Variational principles play a significant role in mathematical theory, with emphasis here on the physical aspects. They serve two principal purposes: to represent the equations of a system succinctly, and to enable a particular computation in the system to be carried out with greater accuracy. The survey ranges widely, from its starting point in the Lagrange multiplier to optimisation principles. In an age of digital computation, these classic methods can be adapted to improve such calculations. We emphasize particularly the advantage of basing finite element methods on variational principles. (A.B.)
The effect of cast-to-cast variations on the quality of thin section nickel alloy welded joints
International Nuclear Information System (INIS)
Lambert, J.A.
1989-02-01
The welding behaviour of 26 commercial casts of Alloy 800 has been quantified for mechanised, autogenous, full penetration, bead-on-strip tungsten inert gas welding tests. Weld front and back widths have been measured and correlated with minor element variations. Casts with similar welding responses have been sorted into groups. The behaviour of the weld pool, surface slags and arc have been compared and a convection controlled model has been used to account for differences between the groups of casts. The main factors governing laboratory process control variability have been identified and a statistical method has been used to identify all the components of weld variance. An optimum size of welding test matrix has been proposed to determine typical cast-to-cast variations at high significance levels. (author)
Falgreen, Steffen; Laursen, Maria Bach; Bødker, Julie Støve; Kjeldsen, Malene Krag; Schmitz, Alexander; Nyegaard, Mette; Johnsen, Hans Erik; Dybkær, Karen; Bøgsted, Martin
2014-06-05
In vitro generated dose-response curves of human cancer cell lines are widely used to develop new therapeutics. The curves are summarised by simplified statistics that ignore the conventionally used dose-response curves' dependency on drug exposure time and growth kinetics. This may lead to suboptimal exploitation of data and biased conclusions on the potential of the drug in question. Therefore we set out to improve the dose-response assessments by eliminating the impact of time dependency. First, a mathematical model for drug induced cell growth inhibition was formulated and used to derive novel dose-response curves and improved summary statistics that are independent of time under the proposed model. Next, a statistical analysis workflow for estimating the improved statistics was suggested, consisting of 1) nonlinear regression models for estimation of cell counts and doubling times, 2) isotonic regression for modelling the suggested dose-response curves, and 3) a resampling-based method for assessing variation of the novel summary statistics. We document that conventionally used summary statistics for dose-response experiments depend on time so that fast growing cell lines compared to slowly growing ones are considered overly sensitive. The adequacy of the mathematical model is tested for doxorubicin and found to fit real data to an acceptable degree. Dose-response data from the NCI60 drug screen were used to illustrate the time dependency and demonstrate an adjustment correcting for it. The applicability of the workflow was illustrated by simulation and application on a doxorubicin growth inhibition screen. The simulations show that under the proposed mathematical model the suggested statistical workflow results in unbiased estimates of the time independent summary statistics. Variance estimates of the novel summary statistics are used to conclude that the doxorubicin screen covers a significantly diverse range of responses ensuring it is useful for biological
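Step 2 of the workflow above uses isotonic regression to fit dose-response curves. A minimal sketch of the pool-adjacent-violators algorithm follows, assuming equal weights and a response that should not increase with dose; the function name and the response values are hypothetical and do not reproduce the authors' implementation.

```python
def isotonic_decreasing(responses):
    """Least-squares non-increasing fit via pool-adjacent-violators.

    responses: values ordered by increasing dose (equal weights assumed).
    """
    blocks = []  # each block holds [sum of values, number of values]
    for value in responses:
        blocks.append([value, 1])
        # Merge while an earlier block's mean is below the later block's mean,
        # because that would violate the non-increasing constraint.
        while (len(blocks) > 1
               and blocks[-2][0] / blocks[-2][1] < blocks[-1][0] / blocks[-1][1]):
            block_sum, block_count = blocks.pop()
            blocks[-1][0] += block_sum
            blocks[-1][1] += block_count
    fitted = []
    for block_sum, block_count in blocks:
        fitted.extend([block_sum / block_count] * block_count)
    return fitted

# Hypothetical growth responses at six increasing doses (illustration only).
observed = [10, 8, 9, 5, 6, 2]
fitted = isotonic_decreasing(observed)
print(fitted)
```

Noisy reversals between neighbouring doses are replaced by their pooled mean, which yields a monotone curve without assuming a parametric shape.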
International Nuclear Information System (INIS)
Wada, Y.
1981-01-01
The surface-ionization type mass spectrometer is widely used as an apparatus for quality assurance, accountability and safeguarding of nuclear materials, and for this analysis it has become important to statistically evaluate the analytical error, which consists of a random error and a systematic error. The major factor of this systematic error was the mass-discrimination effect. In this paper, various assays for evaluating the factors of variation in the mass-discrimination effect were studied and the data obtained were statistically evaluated. These analyses showed that the variation in the mass-discrimination effect was not attributable to the acid concentration of the sample, the sample size on the filament or the voltage supplied to the multiplier, but mainly to the filament temperature during the mass-spectrometric analysis. The mass-discrimination effect values β, usually calculated from the measured data of uranium, plutonium or boron isotopic standard samples, did not depend significantly on the U-235, Pu-239 or B-10 isotopic abundance. Furthermore, in the case of U and Pu, the measurement conditions and the mass range of these isotopes were almost similar, and the values of β did not differ significantly between U and Pu. On the other hand, the value of β for boron was about a third of the value of β for U or Pu, but when compared in terms of the coefficient of correction of the mass-discrimination effect for the difference of mass number, ΔM, the coefficient values were almost the same among U, Pu, and B. As for the isotopic analysis error of U, Pu, Nd and B, it was shown that the isotopic abundance of these elements and the isotopic analysis error were related by quadratic curves on a log-log scale.
Causes of seasonal variations of Cs-134/137 activity concentrations in surface air
International Nuclear Information System (INIS)
Hoetzl, H.; Winkler, R.
1993-01-01
In winter months maxima of Cs-134/137 activity concentrations in air are observed at several locations in Europe. To clarify this phenomenon, from October 1991 to November 1992 we carried out an aerosol collection program on a short-term scale, with collecting intervals of 48-72 hours. The local meteorological parameters were determined simultaneously. Statistical analysis of these observations revealed a highly significant positive correlation between Cs-137 activity concentration and the so-called 'Stagnationsindex'. Based on this relationship, the seasonal variations of Cs-134/137 concentrations in ground-level air can be explained by atmospheric inversion conditions that occur frequently during the fall and winter months.
Directory of Open Access Journals (Sweden)
Laura Badenes-Ribera
2018-06-01
Introduction: Publications arguing against the null hypothesis significance testing (NHST) procedure and in favor of good statistical practices have increased. The most frequently mentioned alternatives to NHST are effect size statistics (ES), confidence intervals (CIs), and meta-analyses. A recent survey conducted in Spain found that academic psychologists have poor knowledge about effect size statistics, confidence intervals, and graphic displays for meta-analyses, which might lead to a misinterpretation of the results. In addition, it also found that, although the use of ES is becoming generalized, the same thing is not true for CIs. Finally, academics with greater knowledge about ES statistics presented a profile closer to good statistical practice and research design. Our main purpose was to analyze the extension of these results to a different geographical area through a replication study. Methods: For this purpose, we developed an online survey that included the same items as the original research, and we asked academic psychologists to indicate their level of knowledge about ES, their CIs, and meta-analyses, and how they use them. The sample consisted of 159 Italian academic psychologists (54.09% women, mean age of 47.65 years). The mean number of years in the position of professor was 12.90 (SD = 10.21). Results: As in the original research, the results showed that, although the use of effect size estimates is becoming generalized, an under-reporting of CIs for ES persists. The most frequent ES statistics mentioned were Cohen's d and R2/η2, which can have outliers or show non-normality or violate statistical assumptions. In addition, academics showed poor knowledge about meta-analytic displays (e.g., forest plot and funnel plot) and quality checklists for studies. Finally, academics with higher-level knowledge about ES statistics seem to have a profile closer to good statistical practices. Conclusions: Changing statistical practice is not
Understanding Statistics - Cancer Statistics
Annual reports of U.S. cancer statistics including new cases, deaths, trends, survival, prevalence, lifetime risk, and progress toward Healthy People targets, plus statistical summaries for a number of common cancer types.
International Nuclear Information System (INIS)
Jussila, Vilho; Li, Yue; Fülöp, Ludovic
2016-01-01
Highlights: • Floor flexibility plays a non-negligible role in amplifying horizontal vibrations. • COV of in-floor horizontal and vertical acceleration are 0.15–0.25 and 0.25–0.55. • In-floor variation of vibrations is higher in lower floors. • Floor spectra from limited nodes underestimates vibrations by a factor of 1.5–1.75. - Abstract: Floor vibration of a reactor building subjected to seismic loads was investigated, with the aim of quantifying the variability of vibrations on each floor. A detailed 3D building model founded on the bedrock was excited simultaneously in three directions by artificial accelerograms compatible with Finnish ground response spectra. Dynamic simulation for 21 s was carried out using explicit time integration. The extracted results of the simulation were acceleration in several floor locations, transformed to pseudo-acceleration (PSA) spectra in the next stage. At first, the monitored locations on the floors were estimated by engineering judgement in order to arrive at a feasible number of floor nodes for post processing of the data. It became apparent that engineering judgment was insufficient to depict the key locations with high floor vibrations, which resulted in un-conservative vibration estimates. For this reason, a more systematic approach was later considered, in which nodes of the floors were selected with a more refined grid of 2 m. With this method, in addition to the highest PSA peaks in all directions, the full vibration distribution in each floor can be determined. A statistical evaluation of the floor responses was also carried out in order to define floor accelerations and PSAs with high confidence of non-exceedance. The conclusion was that in-floor variability can be as high as 50–60% and models with sufficiently dense node grids should be used in order to achieve a realistic estimate of floor vibration under seismic action. The effects of the shape of the input spectra, damping, and flexibility of the
Energy Technology Data Exchange (ETDEWEB)
Jussila, Vilho [VTT Technical Research Centre of Finland Ltd, Kemistintie 3, 02230 Espoo (Finland); Li, Yue [Dept. of Civil Engineering, Case Western Reserve University, Cleveland, OH 44106 (United States); Fülöp, Ludovic, E-mail: ludovic.fulop@vtt.fi [VTT Technical Research Centre of Finland Ltd, Kemistintie 3, 02230 Espoo (Finland)
2016-12-01
Directory of Open Access Journals (Sweden)
Sreeram V Ramagopalan
2015-04-01
Background: We and others have shown that a significant proportion of interventional trials registered on ClinicalTrials.gov have their primary outcomes altered after the listed study start and completion dates. The objective of this study was to investigate whether changes made to primary outcomes are associated with the likelihood of reporting a statistically significant primary outcome on ClinicalTrials.gov. Methods: A cross-sectional analysis of all interventional clinical trials registered on ClinicalTrials.gov as of 20 November 2014 was performed. The main outcome was any change made to the initially listed primary outcome and the time of the change in relation to the trial start and end date. Findings: 13,238 completed interventional trials were registered with ClinicalTrials.gov that also had study results posted on the website. 2555 (19.3%) had one or more statistically significant primary outcomes. Statistical analysis showed that registration year, funding source and primary outcome change after trial completion were associated with reporting a statistically significant primary outcome. Conclusions: Funding source and primary outcome change after trial completion are associated with a statistically significant primary outcome report on ClinicalTrials.gov.
Seasonal variation of air pollution in Warsaw conurbation
Directory of Open Access Journals (Sweden)
Katarzyna Rozbicka
2014-09-01
Long-term research shows that many substances occur in the atmosphere at concentrations dangerous to human health and welfare, and even to human life. This work presents the temporal and spatial variation of tropospheric ozone and nitrogen dioxide concentrations. The analysis was based on hourly values of the concentrations of these pollutants (O3 and NO2). The data come from atmospheric monitoring stations situated in various parts of Warsaw and cover the period 2008–2011. The influence of meteorological elements on the concentrations of the analyzed pollutants was assessed using correlation and multiple regression analysis for monthly and seasonal periods. The statistical analysis shows a strong correlation between tropospheric ozone and nitrogen dioxide concentrations and meteorological elements. For both ozone and nitrogen dioxide, the relationships with air temperature, relative humidity and solar radiation are the most significant.
Statistical analysis and data management
International Nuclear Information System (INIS)
Anon.
1981-01-01
This report provides an overview of the history of the WIPP Biology Program. The recommendations of the American Institute of Biological Sciences (AIBS) for the WIPP biology program are summarized. The data sets available for statistical analyses and problems associated with these data sets are also summarized. Biological studies base maps are presented. A statistical model is presented to evaluate any correlation between climatological data and small mammal captures. No statistically significant relationship was found between variance in small mammal captures on Dr. Gennaro's 90 m x 90 m grid and precipitation records from the Duval Potash Mine.
Stinchcombe, John R; Weinig, Cynthia; Heath, Katy D; Brock, Marcus T; Schmitt, Johanna
2009-07-01
The importance of genes of major effect for evolutionary trajectories within and among natural populations has long been the subject of intense debate. For example, if allelic variation at a major-effect locus fundamentally alters the structure of quantitative trait variation, then fixation of a single locus can have rapid and profound effects on the rate or direction of subsequent evolutionary change. Using an Arabidopsis thaliana RIL mapping population, we compare G-matrix structure between lines possessing different alleles at ERECTA, a locus known to affect ecologically relevant variation in plant architecture. We find that the allele present at ERECTA significantly alters G-matrix structure, in particular the genetic correlations between branch number and flowering time traits, and may also modulate the strength of natural selection on these traits. Despite these differences, however, when we extend our analysis to determine how evolution might differ depending on the ERECTA allele, we find that predicted responses to selection are similar. To compare responses to selection between allele classes, we developed a resampling strategy that incorporates uncertainty in estimates of selection that can also be used for statistical comparisons of G matrices.
Statistical comparison of two or more SAGE libraries: one tag at a time
Schaaf, Gerben J.; van Ruissen, Fred; van Kampen, Antoine; Kool, Marcel; Ruijter, Jan M.
2008-01-01
Several statistical tests have been introduced for the comparison of serial analysis of gene expression (SAGE) libraries to quantitatively analyze the differential expression of genes. As each SAGE library is only one measurement, the necessary information on biological variation or experimental
Describing temporal variation in reticuloruminal pH using continuous monitoring data.
Denwood, M J; Kleen, J L; Jensen, D B; Jonsson, N N
2018-01-01
Reticuloruminal pH has been linked to subclinical disease in dairy cattle, leading to considerable interest in identifying pH observations below a given threshold. The relatively recent availability of continuously monitored data from pH boluses gives new opportunities for characterizing the normal patterns of pH over time and distinguishing these from abnormal patterns using more sensitive and specific methods than simple thresholds. We fitted a series of statistical models to continuously monitored data from 93 animals on 13 farms to characterize normal variation within and between animals. We used a subset of the data to relate deviations from the normal pattern to the productivity of 24 dairy cows from a single herd. Our findings show substantial variation in pH characteristics between animals, although animals within the same farm tended to show more consistent patterns. There was strong evidence for a predictable diurnal variation in all animals, and up to 70% of the observed variation in pH could be explained using a simple statistical model. For the 24 animals with available production information, there was also a strong association between productivity (as measured by both milk yield and dry matter intake) and deviations from the expected diurnal pattern of pH 2 d before the productivity observation. In contrast, there was no association between productivity and the occurrence of observations below a threshold pH. We conclude that statistical models can be used to account for a substantial proportion of the observed variability in pH and that future work with continuously monitored pH data should focus on deviations from a predictable pattern rather than the frequency of observations below an arbitrary pH threshold.
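The abstract above does not specify the "simple statistical model" for diurnal variation. As a minimal sketch of what such a model could look like, the code below assumes a single 24 h sinusoid fitted by ordinary least squares and reports the proportion of variance explained (R²); the model form, the function name fit_diurnal and the synthetic data are assumptions for illustration only.

```python
import math


def fit_diurnal(hours, ph):
    """Least-squares fit of ph ~ a + b*cos(w*t) + c*sin(w*t) with a 24 h period.

    Returns ([a, b, c], r_squared).
    """
    w = 2.0 * math.pi / 24.0
    X = [[1.0, math.cos(w * t), math.sin(w * t)] for t in hours]
    # Normal equations (X^T X) beta = X^T y, stored as an augmented 3x4 matrix.
    A = [[sum(r[i] * r[j] for r in X) for j in range(3)]
         + [sum(r[i] * y for r, y in zip(X, ph))] for i in range(3)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(3):
            if r != col:
                f = A[r][col] / A[col][col]
                A[r] = [x - f * z for x, z in zip(A[r], A[col])]
    beta = [A[i][3] / A[i][i] for i in range(3)]
    fitted = [sum(b * x for b, x in zip(beta, row)) for row in X]
    mean_y = sum(ph) / len(ph)
    ss_res = sum((y - f) ** 2 for y, f in zip(ph, fitted))
    ss_tot = sum((y - mean_y) ** 2 for y in ph)
    return beta, 1.0 - ss_res / ss_tot


# Synthetic hourly readings over two days (illustrative, not real bolus data):
# a 24 h cycle plus a small deterministic perturbation standing in for noise.
hours = list(range(48))
ph = [6.3 + 0.25 * math.cos(2 * math.pi * (t - 14) / 24) + 0.02 * ((t * 7) % 5 - 2)
      for t in hours]
coef, r2 = fit_diurnal(hours, ph)
```

Residuals from such a fit (observed minus fitted pH) are one way to express "deviations from the expected diurnal pattern", as opposed to counting readings below a fixed threshold.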
Statistical study of the non-linear propagation of a partially coherent laser beam
International Nuclear Information System (INIS)
Ayanides, J.P.
2001-01-01
This research thesis is related to the LMJ (Laser MegaJoule) project and thus to the study and development of thermonuclear fusion. It reports the study of the propagation of a partially coherent laser beam using statistical modelling in order to obtain mean values for the field, thus bypassing a complex and costly calculation of deterministic quantities. Random fluctuations of the propagated field are assumed to obey Gaussian statistics; the laser central wavelength is assumed to be small with respect to the fluctuation magnitude; a scale factor is introduced to clearly distinguish the scale of the fast random variations of the field fluctuations from the scale of the slow deterministic variations of the field envelopes. The author reports the study of propagation first through a purely linear medium and through a non-dispersive medium, then through slow non-dispersive and non-linear media (in which the reaction time is large with respect to the grain correlation duration, but small with respect to the variation scale of the macroscopic field envelope), and thirdly through instantaneous dispersive and non-linear media (which react instantaneously to the field).
Directory of Open Access Journals (Sweden)
Mashhood Ahmed Sheikh
2017-08-01
mediate the association between childhood adversity and ADS in adulthood. However, when education was excluded as a mediator-response confounding variable, the indirect effect of childhood adversity on ADS in adulthood was statistically significant (p < 0.05). This study shows that a careful inclusion of potential confounding variables is important when assessing mediation.
Kyle J. Haynes; Andrew M. Liebhold; Ottar N. Bjørnstad; Andrew J. Allstadt; Randall S. Morin
2018-01-01
Evaluating the causes of spatial synchrony in population dynamics in nature is notoriously difficult due to a lack of data and appropriate statistical methods. Here, we use a recently developed method, a multivariate extension of the local indicators of spatial autocorrelation statistic, to map geographic variation in the synchrony of gypsy moth outbreaks. Regression...
Development of 3D statistical mandible models for cephalometric measurements
International Nuclear Information System (INIS)
Kim, Sung Goo; Yi, Won Jin; Hwang, Soon Jung; Choi, Soon Chul; Lee, Sam Sun; Heo, Min Suk; Huh, Kyung Hoe; Kim, Tae Il; Hong, Helen; Yoo, Ji Hyun
2012-01-01
The aim of this study was to provide sex-matched three-dimensional (3D) statistical shape models of the mandible, which would provide cephalometric parameters for 3D treatment planning and cephalometric measurements in orthognathic surgery. The subjects used to create the 3D shape models of the mandible included 23 males and 23 females. The mandibles were segmented semi-automatically from 3D facial CT images. Each individual mandible shape was reconstructed as a 3D surface model, which was parameterized to establish correspondence between different individual surfaces. The principal component analysis (PCA) applied to all mandible shapes produced a mean model and characteristic models of variation. The cephalometric parameters were measured directly from the mean models to evaluate the 3D shape models. The means of the measured parameters were compared with those from other conventional studies. The male and female 3D statistical mean models were developed from 23 individual mandibles, respectively. The male and female characteristic shapes of variation produced by PCA showed a large variability included in the individual mandibles. The cephalometric measurements from the developed models were very close to those from some conventional studies. We described the construction of 3D mandibular shape models and presented the application of the 3D mandibular template in cephalometric measurements. Optimal reference models determined from variations produced by PCA could be used for craniofacial patients with various types of skeletal shape.
Development of 3D statistical mandible models for cephalometric measurements
Energy Technology Data Exchange (ETDEWEB)
Kim, Sung Goo; Yi, Won Jin; Hwang, Soon Jung; Choi, Soon Chul; Lee, Sam Sun; Heo, Min Suk; Huh, Kyung Hoe; Kim, Tae Il [School of Dentistry, Seoul National University, Seoul (Korea, Republic of)]; Hong, Helen; Yoo, Ji Hyun [Division of Multimedia Engineering, Seoul Women's University, Seoul (Korea, Republic of)]
2012-09-15
Voet, van der H.; Perry, J.N.; Amzal, B.; Paoletti, C.
2011-01-01
Background - Safety assessment of genetically modified organisms is currently often performed by comparative evaluation. However, natural variation of plant characteristics between commercial varieties is usually not considered explicitly in the statistical computations underlying the assessment.
Generalized quasi variational inequalities
Energy Technology Data Exchange (ETDEWEB)
Noor, M.A. [King Saud Univ., Riyadh (Saudi Arabia)
1996-12-31
In this paper, we establish the equivalence between generalized quasi variational inequalities and generalized implicit Wiener-Hopf equations, essentially by using the projection technique. This equivalence is used to suggest and analyze a number of new iterative algorithms for solving generalized quasi variational inequalities and the related complementarity problems. Convergence criteria are also considered. The results proved in this paper represent a significant improvement and refinement of previously known results.
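The projection technique mentioned in the abstract above can be illustrated on the simplest case. The sketch below is not one of the paper's algorithms for generalized quasi variational inequalities; it assumes a classical variational inequality (find u in K with ⟨T(u), v − u⟩ ≥ 0 for all v in K) on a box K, and uses the standard fact that u solves it exactly when u = P_K(u − ρT(u)) for ρ > 0. The function names, the box constraint and the example operator are illustrative assumptions.

```python
def project_box(u, lo, hi):
    """Euclidean projection of the vector u onto the box [lo, hi]^n."""
    return [min(max(x, lo), hi) for x in u]


def solve_vi(T, u0, rho=0.5, lo=0.0, hi=1.0, tol=1e-10, max_iter=1000):
    """Fixed-point iteration u <- P_K(u - rho * T(u)) for a box K."""
    u = list(u0)
    for _ in range(max_iter):
        step = [x - rho * t for x, t in zip(u, T(u))]
        nxt = project_box(step, lo, hi)
        # Stop once successive iterates agree to within the tolerance.
        if max(abs(a - b) for a, b in zip(nxt, u)) < tol:
            return nxt
        u = nxt
    return u


# Example operator T(u) = u - c with c = (1.5, -0.3); for this T the solution
# of the variational inequality is the projection of c onto the box [0, 1]^2.
solution = solve_vi(lambda u: [u[0] - 1.5, u[1] + 0.3], [0.5, 0.5])
```

The step size ρ matters for convergence: for a strongly monotone, Lipschitz-continuous operator the iteration is a contraction only when ρ is small enough, which is the kind of condition the convergence analysis of such schemes addresses.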
A parametric method for assessing diversification-rate variation in phylogenetic trees.
Shah, Premal; Fitzpatrick, Benjamin M; Fordyce, James A
2013-02-01
Phylogenetic hypotheses are frequently used to examine variation in rates of diversification across the history of a group. Patterns of diversification-rate variation can be used to infer underlying ecological and evolutionary processes responsible for patterns of cladogenesis. Most existing methods examine rate variation through time. Methods for examining differences in diversification among groups are more limited. Here, we present a new method, parametric rate comparison (PRC), that explicitly compares diversification rates among lineages in a tree using a variety of standard statistical distributions. PRC can identify subclades of the tree where diversification rates are at variance with the remainder of the tree. A randomization test can be used to evaluate how often such variance would appear by chance alone. The method also allows for comparison of diversification rate among a priori defined groups. Further, the application of the PRC method is not restricted to monophyletic groups. We examined the performance of PRC using simulated data, which showed that PRC has acceptable false-positive rates and statistical power to detect rate variation. We apply the PRC method to the well-studied radiation of North American Plethodon salamanders, and support the inference that the large-bodied Plethodon glutinosus clade has a higher historical rate of diversification compared to other Plethodon salamanders.
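The general idea behind a randomization test, as mentioned in the abstract above, can be sketched as follows. This is not the PRC procedure itself, which works on phylogenetic trees; it is a generic two-group permutation test on a difference in means, and the function name and the example values are assumptions made for illustration.

```python
import random


def permutation_p_value(group_a, group_b, n_perm=10000, seed=0):
    """Two-sided permutation p-value for the absolute difference in group means."""
    rng = random.Random(seed)
    n_a = len(group_a)
    observed = abs(sum(group_a) / n_a - sum(group_b) / len(group_b))
    pooled = list(group_a) + list(group_b)
    count = 0
    for _ in range(n_perm):
        # Reassign group labels at random by shuffling the pooled values.
        rng.shuffle(pooled)
        a, b = pooled[:n_a], pooled[n_a:]
        stat = abs(sum(a) / len(a) - sum(b) / len(b))
        # Small tolerance guards against floating-point ties.
        if stat >= observed - 1e-12:
            count += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return (count + 1) / (n_perm + 1)


# Hypothetical rate estimates for a focal group and the remaining lineages.
focal = [5.1, 5.2, 5.3, 5.4, 5.5, 5.6]
others = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6]
p = permutation_p_value(focal, others)
```

The p-value estimates how often a difference at least as large as the observed one would arise if group membership were unrelated to the measured quantity.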
Estimating Predictive Variance for Statistical Gas Distribution Modelling
International Nuclear Information System (INIS)
Lilienthal, Achim J.; Asadi, Sahar; Reggente, Matteo
2009-01-01
Recent publications in statistical gas distribution modelling have proposed algorithms that model both the mean and the variance of a distribution. This paper argues that estimating the predictive concentration variance is not merely a gradual improvement but a significant step towards advancing the field. First, such models fit the particular structure of gas distributions much better, since these exhibit strong fluctuations with considerable spatial variations as a result of the intermittent character of gas dispersal. Second, estimating the predictive variance makes it possible to evaluate the model quality in terms of the data likelihood. This offers a solution to the problem of ground truth evaluation, which has always been a critical issue for gas distribution modelling. It also enables solid comparisons of different modelling approaches, and provides the means to learn meta parameters of the model, to determine when the model should be updated or re-initialised, or to suggest new measurement locations based on the current model. We also point out directions of related ongoing or potential future research work.
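Evaluating model quality "in terms of the data likelihood" can be made concrete with a small sketch. It assumes that the model's prediction at each measurement location is summarised by a Gaussian with a predictive mean and variance; the Gaussian form, the function name and the numbers are assumptions for illustration and are not taken from the paper.

```python
import math


def mean_neg_log_likelihood(observed, pred_mean, pred_var):
    """Average Gaussian negative log-likelihood of held-out observations."""
    total = 0.0
    for x, m, v in zip(observed, pred_mean, pred_var):
        total += 0.5 * (math.log(2.0 * math.pi * v) + (x - m) ** 2 / v)
    return total / len(observed)


# Hypothetical held-out concentration readings and two candidate models that
# predict the same mean but different variances.
readings = [1.0, 3.0]
means = [2.0, 2.0]
calibrated = mean_neg_log_likelihood(readings, means, [1.0, 1.0])
overconfident = mean_neg_log_likelihood(readings, means, [0.01, 0.01])
```

A lower value indicates a better fit. The overconfident model is penalised heavily because the observed deviations are very unlikely under its small predictive variance, which is why a variance estimate allows models to be compared without ground truth for the underlying distribution.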
Statistical Characterization of Dispersed Single-Wall Carbon Nanotube Quantum Dots
International Nuclear Information System (INIS)
Shimizu, M; Moriyama, S; Suzuki, M; Fuse, T; Homma, Y; Ishibashi, K
2006-01-01
Quantum dots have been fabricated in single-wall carbon nanotubes (SWCNTs) simply by depositing metallic contacts on top of them. The fabricated quantum dots show different characteristics from sample to sample, even among samples fabricated on the same chip. In this report, we study the statistical variations of the quantum dots fabricated with our method, and suggest their possible origin.
Cisneiros, Roberta Araújo; de Almeida, Argus Vasconcelos; de Melo, Gabriel Rivas; da Câmara, Cláudio Augusto Gomes
2012-01-01
The present study describes morphometric variations in the grasshopper, Chromacris speciosa (Thunberg, 1824) (Orthoptera: Acridoidea: Romaleidae) from two locations in the state of Pernambuco, Brazil. The distance between the sites chosen for collections (Recife and São Lourenço da Mata) is approximately 16 km. The investigation was based on a comparative study of external morphological characteristics of the grasshoppers. Morphometric measurements took into account the different body parts and appendages. Statistical analysis of the measurements revealed significant differences in the size of the specimens between the two locations. Homogeneity tests of the covariance and equality matrices between mean vectors of the results revealed that the grasshopper populations in Recife and São Lourenço da Mata are distinctly different. These findings provide morphological evidence for intraspecific variation in morphological characteristics of the C. speciosa populations from the two locations. PMID:23421530
[Big data in official statistics].
Zwick, Markus
2015-08-01
The concept of "big data" stands to change the face of official statistics over the coming years, having an impact on almost all aspects of data production. The tasks of future statisticians will not necessarily be to produce new data, but rather to identify and make use of existing data to adequately describe social and economic phenomena. Until big data can be used correctly in official statistics, a lot of questions need to be answered and problems solved: the quality of data, data protection, privacy, and the sustainable availability are some of the more pressing issues to be addressed. The essential skills of official statisticians will undoubtedly change, and this implies a number of challenges to be faced by statistical education systems, in universities, and inside the statistical offices. The national statistical offices of the European Union have concluded a concrete strategy for exploring the possibilities of big data for official statistics.