WorldWideScience

Sample records for genomes extending ensembl

  1. Extended Ensemble Monte Carlo

    OpenAIRE

    Iba, Yukito

    2000-01-01

    ``Extended Ensemble Monte Carlo''is a generic term that indicates a set of algorithms which are now popular in a variety of fields in physics and statistical information processing. Exchange Monte Carlo (Metropolis-Coupled Chain, Parallel Tempering), Simulated Tempering (Expanded Ensemble Monte Carlo), and Multicanonical Monte Carlo (Adaptive Umbrella Sampling) are typical members of this family. Here we give a cross-disciplinary survey of these algorithms with special emphasis on the great f...

  2. Ensembl Genomes 2013

    DEFF Research Database (Denmark)

    Kersey, Paul Julian; Allen, James E; Christensen, Mikkel

    2014-01-01

    genomes, and now includes the genomes of over 9000 bacteria. Specific extensions to the web and programmatic interfaces have been developed to support users in navigating these large data sets. Looking forward, analytic tools to allow targeted selection of data for visualization and download are likely...

  3. Ensembl Genomes 2013: scaling up access to genome-wide data

    Science.gov (United States)

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species. The project exploits and extends technologies for genome annotation, analysis and dissemination, developed in the context of the vertebrate-focused Ensembl project, and provi...

  4. Understanding the structural ensembles of a highly extended disordered protein.

    Science.gov (United States)

    Daughdrill, Gary W; Kashtanov, Stepan; Stancik, Amber; Hill, Shannon E; Helms, Gregory; Muschol, Martin; Receveur-Bréchot, Véronique; Ytreberg, F Marty

    2012-01-01

    Developing a comprehensive description of the equilibrium structural ensembles for intrinsically disordered proteins (IDPs) is essential to understanding their function. The p53 transactivation domain (p53TAD) is an IDP that interacts with multiple protein partners and contains numerous phosphorylation sites. Multiple techniques were used to investigate the equilibrium structural ensemble of p53TAD in its native and chemically unfolded states. The results from these experiments show that the native state of p53TAD has dimensions similar to a classical random coil while the chemically unfolded state is more extended. To investigate the molecular properties responsible for this behavior, a novel algorithm that generates diverse and unbiased structural ensembles of IDPs was developed. This algorithm was used to generate a large pool of plausible p53TAD structures that were reweighted to identify a subset of structures with the best fit to small angle X-ray scattering data. High weight structures in the native state ensemble show features that are localized to protein binding sites and regions with high proline content. The features localized to the protein binding sites are mostly eliminated in the chemically unfolded ensemble; while, the regions with high proline content remain relatively unaffected. Data from NMR experiments support these results, showing that residues from the protein binding sites experience larger environmental changes upon unfolding by urea than regions with high proline content. This behavior is consistent with the urea-induced exposure of nonpolar and aromatic side-chains in the protein binding sites that are partially excluded from solvent in the native state ensemble.

  5. Extended ensemble theory, spontaneous symmetry breaking, and phase transitions

    Science.gov (United States)

    Xiao, Ming-wen

    2006-09-01

    In this paper, as a personal review, we suppose a possible extension of Gibbs ensemble theory so that it can provide a reasonable description of phase transitions and spontaneous symmetry breaking. The extension is founded on three hypotheses, and can be regarded as a microscopic edition of the Landau phenomenological theory of phase transitions. Within its framework, the stable state of a system is determined by the evolution of order parameter with temperature according to such a principle that the entropy of the system will reach its minimum in this state. The evolution of order parameter can cause a change in representation of the system Hamiltonian; different phases will realize different representations, respectively; a phase transition amounts to a representation transformation. Physically, it turns out that phase transitions originate from the automatic interference among matter waves as the temperature is cooled down. Typical quantum many-body systems are studied with this extended ensemble theory. We regain the Bardeen Cooper Schrieffer solution for the weak-coupling superconductivity, and prove that it is stable. We find that negative-temperature and laser phases arise from the same mechanism as phase transitions, and that they are unstable. For the ideal Bose gas, we demonstrate that it will produce Bose Einstein condensation (BEC) in the thermodynamic limit, which confirms exactly Einstein's deep physical insight. In contrast, there is no BEC either within the phonon gas in a black body or within the ideal photon gas in a solid body. We prove that it is not admissible to quantize the Dirac field by using Bose Einstein statistics. We show that a structural phase transition belongs physically to the BEC happening in configuration space, and that a double-well anharmonic system will undergo a structural phase transition at a finite temperature. For the O(N)-symmetric vector model, we demonstrate that it will yield spontaneous symmetry breaking and produce

  6. Ensemble analysis of adaptive compressed genome sequencing strategies

    Science.gov (United States)

    2014-01-01

    Background Acquiring genomes at single-cell resolution has many applications such as in the study of microbiota. However, deep sequencing and assembly of all of millions of cells in a sample is prohibitively costly. A property that can come to rescue is that deep sequencing of every cell should not be necessary to capture all distinct genomes, as the majority of cells are biological replicates. Biologically important samples are often sparse in that sense. In this paper, we propose an adaptive compressed method, also known as distilled sensing, to capture all distinct genomes in a sparse microbial community with reduced sequencing effort. As opposed to group testing in which the number of distinct events is often constant and sparsity is equivalent to rarity of an event, sparsity in our case means scarcity of distinct events in comparison to the data size. Previously, we introduced the problem and proposed a distilled sensing solution based on the breadth first search strategy. We simulated the whole process which constrained our ability to study the behavior of the algorithm for the entire ensemble due to its computational intensity. Results In this paper, we modify our previous breadth first search strategy and introduce the depth first search strategy. Instead of simulating the entire process, which is intractable for a large number of experiments, we provide a dynamic programming algorithm to analyze the behavior of the method for the entire ensemble. The ensemble analysis algorithm recursively calculates the probability of capturing every distinct genome and also the expected total sequenced nucleotides for a given population profile. Our results suggest that the expected total sequenced nucleotides grows proportional to log of the number of cells and proportional linearly with the number of distinct genomes. The probability of missing a genome depends on its abundance and the ratio of its size over the maximum genome size in the sample. The modified resource

  7. Comparison of extended medium-range forecast skill between KMA ensemble, ocean coupled ensemble, and GloSea5

    Science.gov (United States)

    Park, Sangwook; Kim, Dong-Joon; Lee, Seung-Woo; Lee, Kie-Woung; Kim, Jongkhun; Song, Eun-Ji; Seo, Kyong-Hwan

    2017-08-01

    This article describes a three way inter-comparison of forecast skill on an extended medium-range time scale using the Korea Meteorological Administration (KMA) operational ensemble numerical weather prediction (NWP) systems (i.e., atmosphere-only global ensemble prediction system (EPSG) and ocean-atmosphere coupledEPSG) and KMA operational seasonal prediction system, the Global Seasonal forecast system version 5 (GloSea5). The main motivation is to investigate whether the ensemble NWP system can provide advantage over the existing seasonal prediction system for the extended medium-range forecast (30 days) even with putting extra resources in extended integration or coupling with ocean with NWP system. Two types of evaluation statistics are examined: the basic verification statistics - the anomaly correlation and RMSE of 500-hPa geopotential height and 1.5-meter surface temperature for the global and East Asia area, and the other is the Real-time Multivariate Madden and Julian Oscillation (MJO) indices (RMM1 and RMM2) - which is used to examine the MJO prediction skill. The MJO is regarded as a main source of forecast skill in the tropics linked to the mid-latitude weather on monthly time scale. Under limited number of experiment cases, the coupled NWP extends the forecast skill of the NWP by a few more days, and thereafter such forecast skill is overtaken by that of the seasonal prediction system. At present stage, it seems there is little gain from the coupled NWP even though more resources are put into it. Considering this, the best combination of numerical product guidance for operational forecasters for an extended medium-range is extension of the forecast lead time of the current ensemble NWP (EPSG) up to 20 days and use of the seasonal prediction system (GloSea5) forecast thereafter, though there exists a matter of consistency between the two systems.

  8. Extended-Range Ensemble Predictions of Convection in the North Australian Monsoon Region

    Science.gov (United States)

    Drosdowsky, Wasyl; Wheeler, Matthew C.

    2017-04-01

    Extended-range (Bureau of Meteorology’s Predictive Ocean-Atmosphere Model for Australia (POAMA). Hindcasts from 1980-2011 are used, initialised on the 1st, 11th and 21st of each month, with a 33-member ensemble. The measure of convection is outgoing longwave radiation (OLR) averaged over the box 120ºE-150ºE, 5ºS-17.5ºS. This averaging serves to focus on the intraseasonal and longer time scales, and is an area of interest to users. The raw hindcasts of daily OLR show a strong systematic adjustment away from their initial values during the first week, and then converge to a mean seasonal cycle of similar amplitude and phase to observations. Hence, forecast OLR anomalies are formed by removing the model’s own seasonal cycle of OLR, which is a function of start time and lead time, a usual practice for dynamical seasonal prediction. Over all hindcasts, the model forecast root-mean-square (RMS) error is smaller than the RMS error of persistence and climatological reference forecasts for leads 3-35 days. Ensemble spread is less than the forecast RMS error (i.e. under-spread) for days 1-12, but slightly greater than the RMS error for longer leads. Binning the individual forecasts based on ensemble spread shows a generally positive relationship between spread and error. Therefore, greater certainty can be given for forecasts with smaller spread.

  9. Ensembl Plants: Integrating Tools for Visualizing, Mining, and Analyzing Plant Genomics Data.

    Science.gov (United States)

    Bolser, Dan; Staines, Daniel M; Pritchard, Emily; Kersey, Paul

    2016-01-01

    Ensembl Plants ( http://plants.ensembl.org ) is an integrative resource presenting genome-scale information for a growing number of sequenced plant species (currently 33). Data provided includes genome sequence, gene models, functional annotation, and polymorphic loci. Various additional information are provided for variation data, including population structure, individual genotypes, linkage, and phenotype data. In each release, comparative analyses are performed on whole genome and protein sequences, and genome alignments and gene trees are made available that show the implied evolutionary history of each gene family. Access to the data is provided through a genome browser incorporating many specialist interfaces for different data types, and through a variety of additional methods for programmatic access and data mining. These access routes are consistent with those offered through the Ensembl interface for the genomes of non-plant species, including those of plant pathogens, pests, and pollinators.Ensembl Plants is updated 4-5 times a year and is developed in collaboration with our international partners in the Gramene ( http://www.gramene.org ) and transPLANT projects ( http://www.transplantdb.org ).

  10. An Extended-range Hydrometeorological Ensemble Prediction System for Alpine Catchments in Switzerland

    Science.gov (United States)

    Monhart, Samuel; Bogner, Konrad; Spirig, Christoph; Bhend, Jonas; Liniger, Mark A.; Zappa, Massimiliano; Schär, Christoph

    2017-04-01

    In recent years meteorological ensemble prediction systems have increasingly be used to feed hydrological models in order to provide probabilistic streamflow forecasts. Such hydrological ensemble prediction systems (HEPS) have been analyzed for different lead times from short-term to seasonal predictions and are used for different applications. Especially at longer lead times both such forecasts exhibit systematic biases which can be removed by applying bias correction techniques to both the meteorological and/or the hydrological output. However, it is still an open question if pre- or post-processing techniques or both should be applied. We will present first results of the analysis of pre- and post-processed extended-range hydrometeorological forecasts. In a first step the performance of bias corrected and downscaled (using quantile mapping) extended-range meteorological forecasts provided by the ECMWF is assessed for approximately 1000 ground observation sites across Europe. Generally, bias corrected meteorological forecasts show positive skill in terms of CRPSS up to three (two) weeks for weekly mean temperature (precipitation) compared to climatological forecasts. For the Alpine region the absolute skill is generally lower but the relative gain in skill resulting from the bias correction is larger. These pre-processed meteorological forecasts of one year of ECMWF extended-range forecasts and corresponding hindcasts are used to feed a hydrological model for a selected catchment in the Alpine area in Switzerland. Furthermore, different post-processing techniques are tested to correct the resulting streamflow forecasts. This will allow to determine the relative effect of pre- and post-processing of extended-range hydrometeorological predictions in Alpine catchments. Future work will include the combination of these corrected streamflow forecasts with electricity price forecasts to optimize the operations and revenues of hydropower systems in the Alps.

  11. Comparing the ensemble and extended Kalman filters for in situ soil moisture assimilation with contrasting conditions

    Science.gov (United States)

    Fairbairn, D.; Barbu, A. L.; Mahfouf, J.-F.; Calvet, J.-C.; Gelati, E.

    2015-12-01

    Two data assimilation (DA) methods are compared for their ability to produce an accurate soil moisture analysis using the Météo-France land surface model: (i) SEKF, a simplified extended Kalman filter, which uses a climatological background-error covariance, and (ii) EnSRF, the ensemble square root filter, which uses an ensemble background-error covariance and approximates random rainfall errors stochastically. In situ soil moisture observations at 5 cm depth are assimilated into the surface layer and 30 cm deep observations are used to evaluate the root-zone analysis on 12 sites in south-western France (SMOSMANIA network). These sites differ in terms of climate and soil texture. The two methods perform similarly and improve on the open loop. Both methods suffer from incorrect linear assumptions which are particularly degrading to the analysis during water-stressed conditions: the EnSRF by a dry bias and the SEKF by an over-sensitivity of the model Jacobian between the surface and the root-zone layers. These problems are less severe for the sites with wetter climates. A simple bias correction technique is tested on the EnSRF. Although this reduces the bias, it modifies the soil moisture fluxes and suppresses the ensemble spread, which degrades the analysis performance. However, the EnSRF flow-dependent background-error covariance evidently captures seasonal variability in the soil moisture errors and should exploit planned improvements in the model physics. Synthetic twin experiments demonstrate that when there is only a random component in the precipitation forcing errors, the correct stochastic representation of these errors enables the EnSRF to perform better than the SEKF. It might therefore be possible for the EnSRF to perform better than the SEKF with real data, if the rainfall uncertainty was accurately captured. However, the simple rainfall error model is not advantageous in our real experiments. More realistic rainfall error models are suggested.

  12. Comparing the ensemble and extended Kalman filters for in situ soil moisture assimilation with contrasting conditions

    Directory of Open Access Journals (Sweden)

    D. Fairbairn

    2015-12-01

    Full Text Available Two data assimilation (DA methods are compared for their ability to produce an accurate soil moisture analysis using the Météo-France land surface model: (i SEKF, a simplified extended Kalman filter, which uses a climatological background-error covariance, and (ii EnSRF, the ensemble square root filter, which uses an ensemble background-error covariance and approximates random rainfall errors stochastically. In situ soil moisture observations at 5 cm depth are assimilated into the surface layer and 30 cm deep observations are used to evaluate the root-zone analysis on 12 sites in south-western France (SMOSMANIA network. These sites differ in terms of climate and soil texture. The two methods perform similarly and improve on the open loop. Both methods suffer from incorrect linear assumptions which are particularly degrading to the analysis during water-stressed conditions: the EnSRF by a dry bias and the SEKF by an over-sensitivity of the model Jacobian between the surface and the root-zone layers. These problems are less severe for the sites with wetter climates. A simple bias correction technique is tested on the EnSRF. Although this reduces the bias, it modifies the soil moisture fluxes and suppresses the ensemble spread, which degrades the analysis performance. However, the EnSRF flow-dependent background-error covariance evidently captures seasonal variability in the soil moisture errors and should exploit planned improvements in the model physics. Synthetic twin experiments demonstrate that when there is only a random component in the precipitation forcing errors, the correct stochastic representation of these errors enables the EnSRF to perform better than the SEKF. It might therefore be possible for the EnSRF to perform better than the SEKF with real data, if the rainfall uncertainty was accurately captured. However, the simple rainfall error model is not advantageous in our real experiments. More realistic rainfall error models are

  13. HEPS4Power - Extended-range Hydrometeorological Ensemble Predictions for Improved Hydropower Operations and Revenues

    Science.gov (United States)

    Bogner, Konrad; Monhart, Samuel; Liniger, Mark; Spririg, Christoph; Jordan, Fred; Zappa, Massimiliano

    2015-04-01

    In recent years large progresses have been achieved in the operational prediction of floods and hydrological drought with up to ten days lead time. Both the public and the private sectors are currently using probabilistic runoff forecast in order to monitoring water resources and take actions when critical conditions are to be expected. The use of extended-range predictions with lead times exceeding 10 days is not yet established. The hydropower sector in particular might have large benefits from using hydro meteorological forecasts for the next 15 to 60 days in order to optimize the operations and the revenues from their watersheds, dams, captions, turbines and pumps. The new Swiss Competence Centers in Energy Research (SCCER) targets at boosting research related to energy issues in Switzerland. The objective of HEPS4POWER is to demonstrate that operational extended-range hydro meteorological forecasts have the potential to become very valuable tools for fine tuning the production of energy from hydropower systems. The project team covers a specific system-oriented value chain starting from the collection and forecast of meteorological data (MeteoSwiss), leading to the operational application of state-of-the-art hydrological models (WSL) and terminating with the experience in data presentation and power production forecasts for end-users (e-dric.ch). The first task of the HEPS4POWER will be the downscaling and post-processing of ensemble extended-range meteorological forecasts (EPS). The goal is to provide well-tailored forecasts of probabilistic nature that should be reliable in statistical and localized at catchment or even station level. The hydrology related task will consist in feeding the post-processed meteorological forecasts into a HEPS using a multi-model approach by implementing models with different complexity. Also in the case of the hydrological ensemble predictions, post-processing techniques need to be tested in order to improve the quality of the

  14. Extended regional climate model projections for Europe until the mid-twentyfirst century: combining ENSEMBLES and CMIP3

    Science.gov (United States)

    Heinrich, Georg; Gobiet, Andreas; Mendlik, Thomas

    2014-01-01

    This study aims at sharpening the existing knowledge of expected seasonal mean climate change and its uncertainty over Europe for the two key climate variables air temperature and precipitation amount until the mid-twentyfirst century. For this purpose, we assess and compensate the global climate model (GCM) sampling bias of the ENSEMBLES regional climate model (RCM) projections by combining them with the full set of the CMIP3 GCM ensemble. We first apply a cross-validation in order to assess the skill of different statistical data reconstruction methods in reproducing ensemble mean and standard deviation. We then select the most appropriate reconstruction method in order to fill the missing values of the ENSEMBLES simulation matrix and further extend the matrix by all available CMIP3 GCM simulations forced by the A1B emission scenario. Cross-validation identifies a randomized scaling approach as superior in reconstructing the ensemble spread. Errors in ensemble mean and standard deviation are mostly less than 0.1 K and 1.0 % for air temperature and precipitation amount, respectively. Reconstruction of the missing values reveals that expected seasonal mean climate change of the ENSEMBLES RCM projections is not significantly biased and that the associated uncertainty is not underestimated due to sampling of only a few driving GCMs. In contrast, the spread of the extended simulation matrix is partly significantly lower, sharpening our knowledge about future climate change over Europe by reducing uncertainty in some regions. Furthermore, this study gives substantial weight to recent climate change impact studies based on the ENSEMBLES projections, since it confirms the robustness of the climate forcing of these studies concerning GCM sampling.

  15. Extending the square root method to account for additive forecast noise in ensemble methods

    CERN Document Server

    Raanes, Patrick N; Bertino, Laurent

    2015-01-01

    A square root approach is considered for the problem of accounting for model noise in the forecast step of the ensemble Kalman filter (EnKF) and related algorithms. The primary aim is to replace the method of simulated, pseudo-random, additive noise so as to eliminate the associated sampling errors. The core method is based on the analysis step of ensemble square root filters, and consists in the deterministic computation of a transform matrix. The theoretical advantages regarding dynamical consistency are surveyed, applying equally well to the square root method in the analysis step. A fundamental problem due to the limited size of the ensemble subspace is discussed, and novel solutions that complement the core method are suggested and studied. Benchmarks from twin experiments with simple, low-order dynamics indicate improved performance over standard approaches such as additive, simulated noise and multiplicative inflation.

  16. Finishing genomes with limited resources: lessons from an ensemble of microbial genomes

    Directory of Open Access Journals (Sweden)

    Bishop-Lilly Kimberly A

    2010-04-01

    Full Text Available Abstract While new sequencing technologies have ushered in an era where microbial genomes can be easily sequenced, the goal of routinely producing high-quality draft and finished genomes in a cost-effective fashion has still remained elusive. Due to shorter read lengths and limitations in library construction protocols, shotgun sequencing and assembly based on these technologies often results in fragmented assemblies. Correspondingly, while draft assemblies can be obtained in days, finishing can take many months and hence the time and effort can only be justified for high-priority genomes and in large sequencing centers. In this work, we revisit this issue in light of our own experience in producing finished and nearly-finished genomes for a range of microbial species in a small-lab setting. These genomes were finished with surprisingly little investments in terms of time, computational effort and lab work, suggesting that the increased access to sequencing might also eventually lead to a greater proportion of finished genomes from small labs and genomics cores.

  17. Breaking-Cas-interactive design of guide RNAs for CRISPR-Cas experiments for ENSEMBL genomes.

    Science.gov (United States)

    Oliveros, Juan C; Franch, Mònica; Tabas-Madrid, Daniel; San-León, David; Montoliu, Lluis; Cubas, Pilar; Pazos, Florencio

    2016-07-08

    The CRISPR/Cas technology is enabling targeted genome editing in multiple organisms with unprecedented accuracy and specificity by using RNA-guided nucleases. A critical point when planning a CRISPR/Cas experiment is the design of the guide RNA (gRNA), which directs the nuclease and associated machinery to the desired genomic location. This gRNA has to fulfil the requirements of the nuclease and lack homology with other genome sites that could lead to off-target effects. Here we introduce the Breaking-Cas system for the design of gRNAs for CRISPR/Cas experiments, including those based in the Cas9 nuclease as well as others recently introduced. The server has unique features not available in other tools, including the possibility of using all eukaryotic genomes available in ENSEMBL (currently around 700), placing variable PAM sequences at 5' or 3' and setting the guide RNA length and the scores per nucleotides. It can be freely accessed at: http://bioinfogp.cnb.csic.es/tools/breakingcas, and the code is available upon request.

  18. Finishing Genomes with Limited Resources: Lessons from an Ensemble of Microbial Genomes

    Science.gov (United States)

    2010-01-01

    Vibrio cholerae 0395 Salmonella enterica SL476 4.68 5.4 4.1 4.99 TIGR 12/12/07 5.642 JGI 02/06/06 2,417 TIGR 05/08/07 4,521 JCVI 07/24/08...Sanger 10/01/08 2,033 Data was collected from NCBI’s Trace Archive and Genomes Database. The V. cholerae genome was sequenced using a 454/Sanger

  19. VIGOR extended to annotate genomes for additional 12 different viruses.

    Science.gov (United States)

    Wang, Shiliang; Sundaram, Jaideep P; Stockwell, Timothy B

    2012-07-01

    A gene prediction program, VIGOR (Viral Genome ORF Reader), was developed at J. Craig Venter Institute in 2010 and has been successfully performing gene calling in coronavirus, influenza, rhinovirus and rotavirus for projects at the Genome Sequencing Center for Infectious Diseases. VIGOR uses sequence similarity search against custom protein databases to identify protein coding regions, start and stop codons and other gene features. Ribonucleicacid editing and other features are accurately identified based on sequence similarity and signature residues. VIGOR produces four output files: a gene prediction file, a complementary DNA file, an alignment file, and a gene feature table file. The gene feature table can be used to create GenBank submission. VIGOR takes a single input: viral genomic sequences in FASTA format. VIGOR has been extended to predict genes for 12 viruses: measles virus, mumps virus, rubella virus, respiratory syncytial virus, alphavirus and Venezuelan equine encephalitis virus, norovirus, metapneumovirus, yellow fever virus, Japanese encephalitis virus, parainfluenza virus and Sendai virus. VIGOR accurately detects the complex gene features like ribonucleicacid editing, stop codon leakage and ribosomal shunting. Precisely identifying the mat_peptide cleavage for some viruses is a built-in feature of VIGOR. The gene predictions for these viruses have been evaluated by testing from 27 to 240 genomes from GenBank.

  20. Extending Climate Analytics as a Service to the Earth System Grid Federation Progress Report on the Reanalysis Ensemble Service

    Science.gov (United States)

    Tamkin, G.; Schnase, J. L.; Duffy, D.; Li, J.; Strong, S.; Thompson, J. H.

    2016-12-01

    We are extending climate analytics-as-a-service, including: (1) A high-performance Virtual Real-Time Analytics Testbed supporting six major reanalysis data sets using advanced technologies like the Cloudera Impala-based SQL and Hadoop-based MapReduce analytics over native NetCDF files. (2) A Reanalysis Ensemble Service (RES) that offers a basic set of commonly used operations over the reanalysis collections that are accessible through NASA's climate data analytics Web services and our client-side Climate Data Services Python library, CDSlib. (3) An Open Geospatial Consortium (OGC) WPS-compliant Web service interface to CDSLib to accommodate ESGF's Web service endpoints. This presentation will report on the overall progress of this effort, with special attention to recent enhancements that have been made to the Reanalysis Ensemble Service, including the following: - An CDSlib Python library that supports full temporal, spatial, and grid-based resolution services - A new reanalysis collections reference model to enable operator design and implementation - An enhanced library of sample queries to demonstrate and develop use case scenarios - Extended operators that enable single- and multiple reanalysis area average, vertical average, re-gridding, and trend, climatology, and anomaly computations - Full support for the MERRA-2 reanalysis and the initial integration of two additional reanalyses - A prototype Jupyter notebook-based distribution mechanism that combines CDSlib documentation with interactive use case scenarios and personalized project management - Prototyped uncertainty quantification services that combine ensemble products with comparative observational products - Convenient, one-stop shopping for commonly used data products from multiple reanalyses, including basic subsetting and arithmetic operations over the data and extractions of trends, climatologies, and anomalies - The ability to compute and visualize multiple reanalysis intercomparisons

  1. Statistical Viewer: a tool to upload and integrate linkage and association data as plots displayed within the Ensembl genome browser

    Directory of Open Access Journals (Sweden)

    Hauser Elizabeth R

    2005-04-01

    Full Text Available Abstract Background To facilitate efficient selection and the prioritization of candidate complex disease susceptibility genes for association analysis, increasingly comprehensive annotation tools are essential to integrate, visualize and analyze vast quantities of disparate data generated by genomic screens, public human genome sequence annotation and ancillary biological databases. We have developed a plug-in package for Ensembl called "Statistical Viewer" that facilitates the analysis of genomic features and annotation in the regions of interest defined by linkage analysis. Results Statistical Viewer is an add-on package to the open-source Ensembl Genome Browser and Annotation System that displays disease study-specific linkage and/or association data as 2 dimensional plots in new panels in the context of Ensembl's Contig View and Cyto View pages. An enhanced upload server facilitates the upload of statistical data, as well as additional feature annotation to be displayed in DAS tracts, in the form of Excel Files. The Statistical View panel, drawn directly under the ideogram, illustrates lod score values for markers from a study of interest that are plotted against their position in base pairs. A module called "Get Map" easily converts the genetic locations of markers to genomic coordinates. The graph is placed under the corresponding ideogram features a synchronized vertical sliding selection box that is seamlessly integrated into Ensembl's Contig- and Cyto- View pages to choose the region to be displayed in Ensembl's "Overview" and "Detailed View" panels. To resolve Association and Fine mapping data plots, a "Detailed Statistic View" plot corresponding to the "Detailed View" may be displayed underneath. Conclusion Features mapping to regions of linkage are accentuated when Statistic View is used in conjunction with the Distributed Annotation System (DAS to display supplemental laboratory information such as differentially expressed disease

  2. Ensembl 2017

    Science.gov (United States)

    Aken, Bronwen L.; Achuthan, Premanand; Akanni, Wasiu; Amode, M. Ridwan; Bernsdorff, Friederike; Bhai, Jyothish; Billis, Konstantinos; Carvalho-Silva, Denise; Cummins, Carla; Clapham, Peter; Gil, Laurent; Girón, Carlos García; Gordon, Leo; Hourlier, Thibaut; Hunt, Sarah E.; Janacek, Sophie H.; Juettemann, Thomas; Keenan, Stephen; Laird, Matthew R.; Lavidas, Ilias; Maurel, Thomas; McLaren, William; Moore, Benjamin; Murphy, Daniel N.; Nag, Rishi; Newman, Victoria; Nuhn, Michael; Ong, Chuang Kee; Parker, Anne; Patricio, Mateus; Riat, Harpreet Singh; Sheppard, Daniel; Sparrow, Helen; Taylor, Kieron; Thormann, Anja; Vullo, Alessandro; Walts, Brandon; Wilder, Steven P.; Zadissa, Amonida; Kostadima, Myrto; Martin, Fergal J.; Muffato, Matthieu; Perry, Emily; Ruffier, Magali; Staines, Daniel M.; Trevanion, Stephen J.; Cunningham, Fiona; Yates, Andrew; Zerbino, Daniel R.; Flicek, Paul

    2017-01-01

    Ensembl (www.ensembl.org) is a database and genome browser for enabling research on vertebrate genomes. We import, analyse, curate and integrate a diverse collection of large-scale reference data to create a more comprehensive view of genome biology than would be possible from any individual dataset. Our extensive data resources include evidence-based gene and regulatory region annotation, genome variation and gene trees. An accompanying suite of tools, infrastructure and programmatic access methods ensure uniform data analysis and distribution for all supported species. Together, these provide a comprehensive solution for large-scale and targeted genomics applications alike. Among many other developments over the past year, we have improved our resources for gene regulation and comparative genomics, and added CRISPR/Cas9 target sites. We released new browser functionality and tools, including improved filtering and prioritization of genome variation, Manhattan plot visualization for linkage disequilibrium and eQTL data, and an ontology search for phenotypes, traits and disease. We have also enhanced data discovery and access with a track hub registry and a selection of new REST end points. All Ensembl data are freely released to the scientific community and our source code is available via the open source Apache 2.0 license. PMID:27899575

  3. Ensembl variation resources

    Directory of Open Access Journals (Sweden)

    Marin-Garcia Pablo

    2010-05-01

    Full Text Available Abstract Background The maturing field of genomics is rapidly increasing the number of sequenced genomes and producing more information from those previously sequenced. Much of this additional information is variation data derived from sampling multiple individuals of a given species with the goal of discovering new variants and characterising the population frequencies of the variants that are already known. These data have immense value for many studies, including those designed to understand evolution and connect genotype to phenotype. Maximising the utility of the data requires that it be stored in an accessible manner that facilitates the integration of variation data with other genome resources such as gene annotation and comparative genomics. Description The Ensembl project provides comprehensive and integrated variation resources for a wide variety of chordate genomes. This paper provides a detailed description of the sources of data and the methods for creating the Ensembl variation databases. It also explores the utility of the information by explaining the range of query options available, from using interactive web displays, to online data mining tools and connecting directly to the data servers programmatically. It gives a good overview of the variation resources and future plans for expanding the variation data within Ensembl. Conclusions Variation data is an important key to understanding the functional and phenotypic differences between individuals. The development of new sequencing and genotyping technologies is greatly increasing the amount of variation data known for almost all genomes. The Ensembl variation resources are integrated into the Ensembl genome browser and provide a comprehensive way to access this data in the context of a widely used genome bioinformatics system. All Ensembl data is freely available at http://www.ensembl.org and from the public MySQL database server at ensembldb.ensembl.org.

  4. Improved method for measuring the ensemble average of strand breaks in genomic DNA.

    Science.gov (United States)

    Bespalov, V A; Conconi, A; Zhang, X; Fahy, D; Smerdon, M J

    2001-01-01

    The cis-syn cyclobutane pyrimidine dimer (CPD) is the major photoproduct induced in DNA by low wavelength ultraviolet radiation. An improved method was developed to detect CPD formation and removal in genomic DNA that avoids the problems encountered with the standard method of endonuclease detection of these photoproducts. Since CPD-specific endonucleases make single-strand cuts at CPD sites, quantification of the frequency of CPDs in DNA is usually done by denaturing gel electrophoresis. The standard method of ethidium bromide staining and gel photography requires more than 10 microg of DNA per gel lane, and correction of the photographic signal for the nonlinear film response. To simplify this procedure, a standard Southern blot protocol, coupled with phosphorimage analysis, was developed. This method uses random hybridization probes to detect genomic sequences with minimal sequence bias. Because of the vast linearity range of phosphorimage detection, scans of the signal profiles for the heterogeneous population of DNA fragments can be integrated directly to determine the number-average size of the population.

  5. The Extended Nutrigenomics - Understanding the Interplay between the Genomes of Food, Gut Microbes, and Human Host.

    Science.gov (United States)

    Kussmann, Martin; Van Bladeren, Peter J

    2011-01-01

    Comprehensive investigation of nutritional health effects at the molecular level requires the understanding of the interplay between three genomes, the food, the gut microbial, and the human host genome. Food genomes are researched for discovery and exploitation of macro- and micronutrients as well as specific bioactives, with those genes coding for bioactive proteins and peptides being of central interest. The human gut microbiota encompasses a complex ecosystem in the intestine with profound impact on host metabolism. It is being studied at genomic and, more recently, also at proteomic and metabonomic level. Humans are being characterized at the level of genetic pre-disposition and inter-individual variability in terms of (i) response to nutritional interventions and direction of health trajectories; (ii) epigenetic, metabolic programming at certain life stages with health consequences later in life and even for subsequent generations; and (iii) acute genomic expression as a holistic response to diet, monitored at gene transcript, protein and metabolite level. Modern nutrition science explores health-related aspects of bioactive food components, thereby promoting health, preventing, or delaying the onset of disease, optimizing performance and assessing benefits and risks in individuals and subpopulations. Personalized nutrition means adapting food to individual needs, depending on the human host's life stage, -style, and -situation. Traditionally, nutrigenomics and nutri(epi)genetics are seen as the key sciences to understand human variability in preferences and requirements for diet as well as responses to nutrition. This article puts the three nutrition and health-relevant genomes into perspective, namely the food, the gut microbial and the human host's genome, and calls for an "extended nutrigenomics" approach in order to build the future tools for personalized nutrition, health maintenance, and disease prevention. We discuss examples of these genomes

  6. Extending pathways and processes using molecular interaction networks to analyse cancer genome data

    Directory of Open Access Journals (Sweden)

    Krasnogor Natalio

    2010-12-01

    Full Text Available Abstract Background Cellular processes and pathways, whose deregulation may contribute to the development of cancers, are often represented as cascades of proteins transmitting a signal from the cell surface to the nucleus. However, recent functional genomic experiments have identified thousands of interactions for the signalling canonical proteins, challenging the traditional view of pathways as independent functional entities. Combining information from pathway databases and interaction networks obtained from functional genomic experiments is therefore a promising strategy to obtain more robust pathway and process representations, facilitating the study of cancer-related pathways. Results We present a methodology for extending pre-defined protein sets representing cellular pathways and processes by mapping them onto a protein-protein interaction network, and extending them to include densely interconnected interaction partners. The added proteins display distinctive network topological features and molecular function annotations, and can be proposed as putative new components, and/or as regulators of the communication between the different cellular processes. Finally, these extended pathways and processes are used to analyse their enrichment in pancreatic mutated genes. Significant associations between mutated genes and certain processes are identified, enabling an analysis of the influence of previously non-annotated cancer mutated genes. Conclusions The proposed method for extending cellular pathways helps to explain the functions of cancer mutated genes by exploiting the synergies of canonical knowledge and large-scale interaction data.

  7. TCGA2BED: extracting, extending, integrating, and querying The Cancer Genome Atlas.

    Science.gov (United States)

    Cumbo, Fabio; Fiscon, Giulia; Ceri, Stefano; Masseroli, Marco; Weitschek, Emanuel

    2017-01-03

    Data extraction and integration methods are becoming essential to effectively access and take advantage of the huge amounts of heterogeneous genomics and clinical data increasingly available. In this work, we focus on The Cancer Genome Atlas, a comprehensive archive of tumoral data containing the results of high-throughout experiments, mainly Next Generation Sequencing, for more than 30 cancer types. We propose TCGA2BED a software tool to search and retrieve TCGA data, and convert them in the structured BED format for their seamless use and integration. Additionally, it supports the conversion in CSV, GTF, JSON, and XML standard formats. Furthermore, TCGA2BED extends TCGA data with information extracted from other genomic databases (i.e., NCBI Entrez Gene, HGNC, UCSC, and miRBase). We also provide and maintain an automatically updated data repository with publicly available Copy Number Variation, DNA-methylation, DNA-seq, miRNA-seq, and RNA-seq (V1,V2) experimental data of TCGA converted into the BED format, and their associated clinical and biospecimen meta data in attribute-value text format. The availability of the valuable TCGA data in BED format reduces the time spent in taking advantage of them: it is possible to efficiently and effectively deal with huge amounts of cancer genomic data integratively, and to search, retrieve and extend them with additional information. The BED format facilitates the investigators allowing several knowledge discovery analyses on all tumor types in TCGA with the final aim of understanding pathological mechanisms and aiding cancer treatments.

  8. Congruence as a measurement of extended haplotype structure across the genome

    Directory of Open Access Journals (Sweden)

    Baschal Erin E

    2012-02-01

    Full Text Available Abstract Background Historically, extended haplotypes have been defined using only a few data points, such as alleles for several HLA genes in the MHC. High-density SNP data, and the increasing affordability of whole genome SNP typing, creates the opportunity to define higher resolution extended haplotypes. This drives the need for new tools that support quantification and visualization of extended haplotypes as defined by as many as 2000 SNPs. Confronted with high-density SNP data across the major histocompatibility complex (MHC for 2,300 complete families, compiled by the Type 1 Diabetes Genetics Consortium (T1DGC, we developed software for studying extended haplotypes. Methods The software, called ExHap (Extended Haplotype, uses a similarity measurement we term congruence to identify and quantify long-range allele identity. Using ExHap, we analyzed congruence in both the T1DGC data and family-phased data from the International HapMap Project. Results Congruent chromosomes from the T1DGC data have between 96.5% and 99.9% allele identity over 1,818 SNPs spanning 2.64 megabases of the MHC (HLA-DRB1 to HLA-A. Thirty-three of 132 DQ-DR-B-A defined haplotype groups have > 50% congruent chromosomes in this region. For example, 92% of chromosomes within the DR3-B8-A1 haplotype are congruent from HLA-DRB1 to HLA-A (99.8% allele identity. We also applied ExHap to all 22 autosomes for both CEU and YRI cohorts from the International HapMap Project, identifying multiple candidate extended haplotypes. Conclusions Long-range congruence is not unique to the MHC region. Patterns of allele identity on phased chromosomes provide a simple, straightforward approach to visually and quantitatively inspect complex long-range structural patterns in the genome. Such patterns aid the biologist in appreciating genetic similarities and differences across cohorts, and can lead to hypothesis generation for subsequent studies.

  9. Ensemble Methods

    Science.gov (United States)

    Re, Matteo; Valentini, Giorgio

    2012-03-01

    Ensemble methods are statistical and computational learning procedures reminiscent of the human social learning behavior of seeking several opinions before making any crucial decision. The idea of combining the opinions of different "experts" to obtain an overall “ensemble” decision is rooted in our culture at least from the classical age of ancient Greece, and it has been formalized during the Enlightenment with the Condorcet Jury Theorem[45]), which proved that the judgment of a committee is superior to those of individuals, provided the individuals have reasonable competence. Ensembles are sets of learning machines that combine in some way their decisions, or their learning algorithms, or different views of data, or other specific characteristics to obtain more reliable and more accurate predictions in supervised and unsupervised learning problems [48,116]. A simple example is represented by the majority vote ensemble, by which the decisions of different learning machines are combined, and the class that receives the majority of “votes” (i.e., the class predicted by the majority of the learning machines) is the class predicted by the overall ensemble [158]. In the literature, a plethora of terms other than ensembles has been used, such as fusion, combination, aggregation, and committee, to indicate sets of learning machines that work together to solve a machine learning problem [19,40,56,66,99,108,123], but in this chapter we maintain the term ensemble in its widest meaning, in order to include the whole range of combination methods. Nowadays, ensemble methods represent one of the main current research lines in machine learning [48,116], and the interest of the research community on ensemble methods is witnessed by conferences and workshops specifically devoted to ensembles, first of all the multiple classifier systems (MCS) conference organized by Roli, Kittler, Windeatt, and other researchers of this area [14,62,85,149,173]. Several theories have been

  10. ESPC Coupled Global Ensemble Design

    Science.gov (United States)

    2014-09-30

    coupled system infrastructure and forecasting capabilities. Initial operational capability is targeted for 2018. APPROACH 1. It is recognized...provided will be the probability distribution function (PDF) of environmental conditions. It is expected that this distribution will have skill. To...system would be the initial capability for ensemble forecasts . Extensions to fully coupled ensembles would be the next step. 2. Develop an extended

  11. NYYD Ensemble

    Index Scriptorium Estoniae

    2002-01-01

    NYYD Ensemble'i duost Traksmann - Lukk E.-S. Tüüri teosega "Symbiosis", mis on salvestatud ka hiljuti ilmunud NYYD Ensemble'i CDle. 2. märtsil Rakvere Teatri väikeses saalis ja 3. märtsil Rotermanni Soolalaos, kavas Tüür, Kaumann, Berio, Reich, Yun, Hauta-aho, Buckinx

  12. Effective Visualization of Temporal Ensembles.

    Science.gov (United States)

    Hao, Lihua; Healey, Christopher G; Bass, Steffen A

    2016-01-01

    An ensemble is a collection of related datasets, called members, built from a series of runs of a simulation or an experiment. Ensembles are large, temporal, multidimensional, and multivariate, making them difficult to analyze. Another important challenge is visualizing ensembles that vary both in space and time. Initial visualization techniques displayed ensembles with a small number of members, or presented an overview of an entire ensemble, but without potentially important details. Recently, researchers have suggested combining these two directions, allowing users to choose subsets of members to visualization. This manual selection process places the burden on the user to identify which members to explore. We first introduce a static ensemble visualization system that automatically helps users locate interesting subsets of members to visualize. We next extend the system to support analysis and visualization of temporal ensembles. We employ 3D shape comparison, cluster tree visualization, and glyph based visualization to represent different levels of detail within an ensemble. This strategy is used to provide two approaches for temporal ensemble analysis: (1) segment based ensemble analysis, to capture important shape transition time-steps, clusters groups of similar members, and identify common shape changes over time across multiple members; and (2) time-step based ensemble analysis, which assumes ensemble members are aligned in time by combining similar shapes at common time-steps. Both approaches enable users to interactively visualize and analyze a temporal ensemble from different perspectives at different levels of detail. We demonstrate our techniques on an ensemble studying matter transition from hadronic gas to quark-gluon plasma during gold-on-gold particle collisions.

  13. The extended nutrigenomics - understanding the interplay between the genomes of food, gut microbes, and human host

    NARCIS (Netherlands)

    Kussmann, M.; Bladeren, van P.J.

    2011-01-01

    Comprehensive investigation of nutritional health effects at the molecular level requires the understanding of the interplay between three genomes, the food, the gut microbial, and the human host genome. Food genomes are researched for discovery and exploitation of macro- and micronutrients as well

  14. The extended nutrigenomics - understanding the interplay between the genomes of food, gut microbes, and human host

    NARCIS (Netherlands)

    Kussmann, M.; Bladeren, van P.J.

    2011-01-01

    Comprehensive investigation of nutritional health effects at the molecular level requires the understanding of the interplay between three genomes, the food, the gut microbial, and the human host genome. Food genomes are researched for discovery and exploitation of macro- and micronutrients as well

  15. The Extended Nutrigenomics – Understanding the Interplay between the Genomes of Food, Gut Microbes and Human Host

    Directory of Open Access Journals (Sweden)

    Martin eKussmann

    2011-05-01

    Full Text Available Comprehensive investigation of nutritional health effects at molecular level requires understanding the interplay between three genomes, the food, the gut microbial and the human host genome. Food genomes are researched for exploitation of macro- and micronutrients as well as bioactives, with the genes coding for bioactive proteins and peptides being of central interest. The human gut microbiota encompasses a complex intestinal ecosystem with profound impact on host metabolism. It is studied at genomic, proteomic and metabolomic level. Humans are characterized at the level of: genetic predisposition and variability in terms of dietary response and direction of health trajectories; epigenetic, metabolic programming at certain life stages with health consequences later in life and for subsequent generations; and acute genomic expression as a holistic response to diet, monitored at gene transcript, protein and metabolite level.Modern nutrition science explores health aspects of bioactive food components, thereby promoting health, preventing or delaying the onset of disease, optimizing performance and assessing benefits and risks. Personalized nutrition means adapting food to individual needs, depending on the human host’s life stage, -style and -situation. Traditionally, nutrigenomics and nutri(epigenetics have been seen as the key sciences to understand human variability in preferences and requirements for diet as well as responses to nutrition. This article puts the three nutrition and health-relevant genomes into perspective, i.e. the food, the gut microbial and the human host’s genome, and calls for an extended nutrigenomics approach to build the future tools for personalized nutrition, health maintenance and disease prevention. We discuss examples of these genomes, proteomes, transcriptomes and metabolomes under the overarching term genomics that covers all Omics rather than the sole study of DNA and RNA.

  16. Draft Genome Sequence of Extended-Spectrum-β-Lactamase-Producing Escherichia coli Strain CCUG 62462, Isolated from a Urine Sample

    Science.gov (United States)

    Johnning, Anna; Jakobsson, Hedvig E.; Boulund, Fredrik; Salvà-Serra, Francisco; Åhrén, Christina; Kristiansson, Erik

    2016-01-01

    The draft genome sequence has been determined for an extended-spectrum-β-lactamase (ESBL)-producing (blaCTX-M-15) Escherichia coli strain (CCUG 62462), composed of 119 contigs and a total size of 5.27 Mb. This E. coli is serotype O25b and sequence type 131, a pandemic clonal group, causing worldwide antimicrobial-resistant infections. PMID:27979938

  17. Semiconservative replication, genetic repair, and many-gened genomes: Extending the quasispecies paradigm to living systems

    Science.gov (United States)

    Tannenbaum, Emmanuel; Shakhnovich, Eugene I.

    2005-12-01

    Quasispecies theory has emerged as an important tool for modeling the evolutionary dynamics of biological systems. We review recent advances in the field, with an emphasis on the quasispecies dynamics of semiconservatively replicating genomes. Applications to cancer and adult stem cell growth are discussed. Additional topics, such as genetic repair and many-gene genomes, are covered as well.

  18. Extending the Bacillus cereus group genomics to putative food-borne pathogens of different toxicity

    Energy Technology Data Exchange (ETDEWEB)

    Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Goltsman, Eugene [U.S. Department of Energy, Joint Genome Institute; Auger, Sandrine [Genetique Microbienne; Galleron, Nathalie [Genetique Microbienne; Segurens, Beatrice [Center National Sequencage, F-91057 Evry, France; Simon, Jorg [Johann Wolfgang Goethe University, Frankfurt am Main, Germany; Dossat, Carole [Genoscope/Centre National de la Recherche Scientifique-Unite Mixte de Recherche; Land, Miriam L [ORNL; Broussolle, Veronique [Securite et Qualite des Produits d' Origine Vegetale; Brillard, Julien [Securite et Qualite des Produits d' Origine Vegetale; Guinebretiere, Marie-Helene [Securite et Qualite des Produits d' Origine Vegetale; Sanchis, Vincent [Genetique Microbienne; Nguen-the, Christophe [Securite et Qualite des Produits d' Origine Vegetale; Lereclus, Didier [Genetique Microbienne; Richardson, P M [U.S. Department of Energy, Joint Genome Institute; Wincker, Patrick [Genoscope/Centre National de la Recherche Scientifique-Unite Mixte de Recherche; Weissenbach, Jean [Genoscope/Centre National de la Recherche Scientifique-Unite Mixte de Recherche; Ehrlich, Dusko [Genetique Microbienne; Sorokin, Alexei [Genetique Microbienne

    2008-01-01

    The Bacillus cereus group represents sporulating soil bacteria containing pathogenic strains which may cause diarrheic or emetic food poisoning outbreaks. Multiple locus sequence typing revealed a presence in natural samples of these bacteria of about 30 clonal complexes. Application of genomic methods to this group was however biased due to the major interest for representatives closely related to Bacillus anthracis. Albeit the most important food-borne pathogens were not yet defined, existing data indicate that they are scattered all over the phylogenetic tree. The preliminary analysis of the sequences of three genomes discussed in this paper narrows down the gaps in our knowledge of the B. cereus group. The strain NVH391-98 is a rare but particularly severe food-borne pathogen. Sequencing revealed that the strain should be a representative of a novel bacterial species, for which the name Bacillus cytotoxis or Bacillus cytotoxicus is proposed. This strain has a reduced genome size compared to other B. cereus group strains. Genome analysis revealed absence of sigma B factor and the presence of genes encoding diarrheic Nhe toxin, not detected earlier. The strain B. cereus F837/76 represents a clonal complex close to that of B. anthracis. Including F837/76, three such B. cereus strains had been sequenced. Alignment of genomes suggests that B. anthracis is their common ancestor. Since such strains often emerge from clinical cases, they merit a special attention. The third strain, KBAB4, is a typical facultative psychrophile generally found in soil. Phylogenic studies show that in nature it is the most active group in terms of gene exchange. Genomic sequence revealed high presence of extra-chromosomal genetic material (about 530 kb) that may account for this phenomenon. Genes coding Nhe-like toxin were found on a big plasmid in this strain. This may indicate a potential mechanism of toxicity spread from the psychrophile strain community. The results of this genomic

  19. The complete genome sequence of the dominant Sinorhizobium meliloti field isolate SM11 extends the S. meliloti pan-genome.

    Science.gov (United States)

    Schneiker-Bekel, Susanne; Wibberg, Daniel; Bekel, Thomas; Blom, Jochen; Linke, Burkhard; Neuweger, Heiko; Stiens, Michael; Vorhölter, Frank-Jörg; Weidner, Stefan; Goesmann, Alexander; Pühler, Alfred; Schlüter, Andreas

    2011-08-20

    Isolates of the symbiotic nitrogen-fixing species Sinorhizobium meliloti usually contain a chromosome and two large megaplasmids encoding functions that are absolutely required for the specific interaction of the microsymbiont with corresponding host plants leading to an effective symbiosis. The complete genome sequence, including the megaplasmids pSmeSM11c (related to pSymA) and pSmeSM11d (related to pSymB), was established for the dominant, indigenous S. meliloti strain SM11 that had been isolated during a long-term field release experiment with genetically modified S. meliloti strains. The chromosome, the largest replicon of S. meliloti SM11, is 3,908,022bp in size and codes for 3785 predicted protein coding sequences. The size of megaplasmid pSmeSM11c is 1,633,319bp and it contains 1760 predicted protein coding sequences whereas megaplasmid pSmeSM11d is 1,632,395bp in size and comprises 1548 predicted coding sequences. The gene content of the SM11 chromosome is quite similar to that of the reference strain S. meliloti Rm1021. Comparison of pSmeSM11c to pSymA of the reference strain revealed that many gene regions of these replicons are variable, supporting the assessment that pSymA is a major hot-spot for intra-specific differentiation. Plasmids pSymA and pSmeSM11c both encode unique genes. Large gene regions of pSmeSM11c are closely related to corresponding parts of Sinorhizobium medicae WSM419 plasmids. Moreover, pSmeSM11c encodes further novel gene regions, e.g. additional plasmid survival genes (partition, mobilisation and conjugative transfer genes), acdS encoding 1-aminocyclopropane-1-carboxylate deaminase involved in modulation of the phytohormone ethylene level and genes having predicted functions in degradative capabilities, stress response, amino acid metabolism and associated pathways. In contrast to Rm1021 pSymA and pSmeSM11c, megaplasmid pSymB of strain Rm1021 and pSmeSM11d are highly conserved showing extensive synteny with only few rearrangements

  20. Genomic attributes of extended-spectrum β-lactamase-producing Escherichia coli isolated from patients in Lebanon.

    Science.gov (United States)

    Tokajian, Sima; Salloum, Tamara; Eisen, Jonathan A; Jospin, Guillaume; Farra, Anna; Mokhbat, Jacques E; Coil, David A

    2017-03-01

    Extended-spectrum β-lactamase-producing (ESBL) Escherichia coli are a public threat worldwide. This study aimed at analyzing the genomic and functional attributes of nine ESBLs taken from rectal swabs. Samples were isolated from patients admitted for gastrointestinal and urological procedures at the University Medical Center-Rizk Hospital (UMCRH) in Lebanon. Illumina paired-end libraries were prepared and sequenced. The isolates were distributed into five lineages: ST131, ST648, ST405, ST73 and ST38, and harbored bla OXA-1, bla TEM-1B, bla TEM-1C and aac(6')Ib-cr. ST131 isolates were carriers of stx2 converting I phage. This is the first comprehensive genomic analysis performed on ESBLs in Lebanon.

  1. Optimizing de novo transcriptome assembly and extending genomic resources for striped catfish (Pangasianodon hypophthalmus).

    Science.gov (United States)

    Thanh, Nguyen Minh; Jung, Hyungtaek; Lyons, Russell E; Njaci, Isaac; Yoon, Byoung-Ha; Chand, Vincent; Tuan, Nguyen Viet; Thu, Vo Thi Minh; Mather, Peter

    2015-10-01

    Striped catfish (Pangasianodon hypophthalmus) is a commercially important freshwater fish used in inland aquaculture in the Mekong Delta, Vietnam. The culture industry is facing a significant challenge however from saltwater intrusion into many low topographical coastal provinces across the Mekong Delta as a result of predicted climate change impacts. Developing genomic resources for this species can facilitate the production of improved culture lines that can withstand raised salinity conditions, and so we have applied high-throughput Ion Torrent sequencing of transcriptome libraries from six target osmoregulatory organs from striped catfish as a genomic resource for use in future selection strategies. We obtained 12,177,770 reads after trimming and processing with an average length of 97bp. De novo assemblies were generated using CLC Genomic Workbench, Trinity and Velvet/Oases with the best overall contig performance resulting from the CLC assembly. De novo assembly using CLC yielded 66,451 contigs with an average length of 478bp and N50 length of 506bp. A total of 37,969 contigs (57%) possessed significant similarity with proteins in the non-redundant database. Comparative analyses revealed that a significant number of contigs matched sequences reported in other teleost fishes, ranging in similarity from 45.2% with Atlantic cod to 52% with zebrafish. In addition, 28,879 simple sequence repeats (SSRs) and 55,721 single nucleotide polymorphisms (SNPs) were detected in the striped catfish transcriptome. The sequence collection generated in the current study represents the most comprehensive genomic resource for P. hypophthalmus available to date. Our results illustrate the utility of next-generation sequencing as an efficient tool for constructing a large genomic database for marker development in non-model species.

  2. An extended anchored linkage map and virtual mapping for the american mink genome based on homology to human and dog

    DEFF Research Database (Denmark)

    Anistoroaei, Razvan Marian; Ansari, S.; Farid, A.;

    2009-01-01

    In this report we present an extended linkage map of the American mink (Neovison vison) consisting of 157 microsatellite markers and comprising at least one linkage group for each of the autosomes. Each linkage group has been assigned to a chromosome and oriented by fluorescence in situ hybridiza......In this report we present an extended linkage map of the American mink (Neovison vison) consisting of 157 microsatellite markers and comprising at least one linkage group for each of the autosomes. Each linkage group has been assigned to a chromosome and oriented by fluorescence in situ...... comparative human/dog/mink data, these assignments represent useful virtual maps for the American mink genome. Comparison of the current human/dog assembled sequential map with the existing Zoo-FISH-based human/dog/mink maps helped to refine the human/dog/mink comparative map. Furthermore, comparison...

  3. Genome-Wide Linkage Analysis of Hemodynamic Parameters Under Mental and Physical Stress in Extended Omani Arab Pedigrees : The Oman Family Study

    NARCIS (Netherlands)

    Hassan, Mohammed O.; Jaju, Deepali; Voruganti, V. Saroja; Bayoumi, Riad A.; Albarwani, Sulayma; Al-Yahyaee, Saeed; Aslani, Afshin; Snieder, Harold; Lopez-Alvarenga, Juan C.; Al-Anqoudi, Zahir M.; Alizadeh, Behrooz Z.; Comuzzie, Anthony G.

    2011-01-01

    Background: We performed a genome-wide scan in a homogeneous Arab population to identify genomic regions linked to blood pressure (BP) and its intermediate phenotypes during mental and physical stress tests. Methods: The Oman Family Study subjects (N = 1277) were recruited from five extended familie

  4. Genome-wide linkage analysis of hemodynamic parameters under mental and physical stress in extended Omani Arab pedigrees : the Oman Family Study

    NARCIS (Netherlands)

    Hassan, Mohammed O; Jaju, Deepali; Voruganti, V Saroja; Bayoumi, Riad A; Albarwani, Sulayma; Al-Yahyaee, Saeed; Aslani, Afshin; Snieder, Harold; Lopez-Alvarenga, Juan C; Al-Anqoudi, Zahir M; Alizadeh, Behrooz Z; Comuzzie, Anthony G

    2011-01-01

    BACKGROUND: We performed a genome-wide scan in a homogeneous Arab population to identify genomic regions linked to blood pressure (BP) and its intermediate phenotypes during mental and physical stress tests. METHODS: The Oman Family Study subjects (N = 1277) were recruited from five extended familie

  5. Complete genome sequences of Incl1 Plasmids carrying extended-spectrum B-Lactamase genes

    NARCIS (Netherlands)

    Brouwer, M.S.M.; Bossers, A.; Harders, F.; Essen-Zandbergen, van A.; Mevius, D.J.; Smith, H.E.

    2014-01-01

    Extended spectrum beta-lactamases (ESBLs) confer resistance to clinically relevant antibiotics. Often, the resistance genes are carried by conjugative plasmids which are responsible for dissemination. Five IncI1 plasmids carrying ESBLs from commensal and clinical Escherichia coli isolates were compl

  6. Extending the cereus group genomics to putative food-bornepathogens of different toxicity

    Energy Technology Data Exchange (ETDEWEB)

    Lapidus, Alla; Goltsman, Eugene; Auger, Sandrine; Galleron,Nathalie; Segurens, Beatrice; Dossat, Carole; Land, Miriam L.; Broussole,Veronique; Brillard, Julien; Guinebretiere, Marie-Helene; Sanchis,Vincent; Nguen-the, Christophe; Lereclus, Didier; Richardson, Paul; Winker, Patrick; Weissenbach, Jean; Ehrlich, S.Dusko; Sorokin, Alexei

    2006-08-24

    The cereus group represents sporulating soil bacteriacontaining pathogenic strains which may cause diarrheic or emetic foodpoisoning outbreaks. Multiple locus sequence typing revealed a presencein natural samples of these bacteria of about thirty clonal complexes.Application of genomic methods to this group was however biased due tothe major interest for representatives closely related to B. anthracis.Albeit the most important food-borne pathogens were not yet defined,existing dataindicate that they are scattered all over the phylogenetictree. The preliminary analysis of the sequences of three genomesdiscussed in this paper narrows down the gaps in our knowledge of thecereus group. The strain NVH391-98 is a rare but particularly severefood-borne pathogen. Sequencing revealed that the strain must be arepresentative of a novel bacterial species, for which the name Bacilluscytotoxis is proposed. This strain has a reduced genome size compared toother cereus group strains. Genome analysis revealed absence of sigma Bfactor and the presence of genes encoding diarrheic Nhe toxin, notdetected earlier. The strain B. cereus F837/76 represents a clonalcomplex close to that of B. anthracis. Including F837/76, three such B.cereus strains had been sequenced. Alignment of genomes suggests that B.anthracis is their common ancestor. Since such strains often emerge fromclinical cases, they merit a special attention. The third strain, KBAB4,is a typical psychrotrophe characteristic to unbiased soil communities.Phylogenic studies show that in nature it is the most active group interms of gene exchange. Genomic sequence revealed high presence ofextra-chromosomal genetic material (about 530 kb) that may account forthis phenomenon. Genes coding Nhe-like toxin were found on a big plasmidin this strain. This may indicate a potential mechanism of toxicityspread from the psychrotrophic strain community. The results of thisgenomic work and ecological compartments of different strains incite

  7. A genome-wide association study for venous thromboembolism: the extended cohorts for heart and aging research in genomic epidemiology (CHARGE) consortium.

    Science.gov (United States)

    Tang, Weihong; Teichert, Martina; Chasman, Daniel I; Heit, John A; Morange, Pierre-Emmanuel; Li, Guo; Pankratz, Nathan; Leebeek, Frank W; Paré, Guillaume; de Andrade, Mariza; Tzourio, Christophe; Psaty, Bruce M; Basu, Saonli; Ruiter, Rikje; Rose, Lynda; Armasu, Sebastian M; Lumley, Thomas; Heckbert, Susan R; Uitterlinden, André G; Lathrop, Mark; Rice, Kenneth M; Cushman, Mary; Hofman, Albert; Lambert, Jean-Charles; Glazer, Nicole L; Pankow, James S; Witteman, Jacqueline C; Amouyel, Philippe; Bis, Joshua C; Bovill, Edwin G; Kong, Xiaoxiao; Tracy, Russell P; Boerwinkle, Eric; Rotter, Jerome I; Trégouët, David-Alexandre; Loth, Daan W; Stricker, Bruno H Ch; Ridker, Paul M; Folsom, Aaron R; Smith, Nicholas L

    2013-07-01

    Venous thromboembolism (VTE) is a common, heritable disease resulting in high rates of hospitalization and mortality. Yet few associations between VTE and genetic variants, all in the coagulation pathway, have been established. To identify additional genetic determinants of VTE, we conducted a two-stage genome-wide association study (GWAS) among individuals of European ancestry in the extended cohorts for heart and aging research in genomic epidemiology (CHARGE) VTE consortium. The discovery GWAS comprised 1,618 incident VTE cases out of 44,499 participants from six community-based studies. Genotypes for genome-wide single-nucleotide polymorphisms (SNPs) were imputed to approximately 2.5 million SNPs in HapMap and association with VTE assessed using study-design appropriate regression methods. Meta-analysis of these results identified two known loci, in F5 and ABO. Top 1,047 tag SNPs (P ≤ 0.0016) from the discovery GWAS were tested for association in an additional 3,231 cases and 3,536 controls from three case-control studies. In the combined data from these two stages, additional genome-wide significant associations were observed on 4q35 at F11 (top SNP rs4253399, intronic to F11) and on 4q28 at FGG (rs6536024, 9.7 kb from FGG; P FGG locus were not completely explained by previously reported variants. Loci at or near SUSD1 and OTUD7A showed borderline yet novel associations (P FGG loci for VTE. Future studies are warranted to better characterize the associations with F11 and FGG and to replicate the new candidate associations.

  8. Extending CRISPR-Cas9 Technology from Genome Editing to Transcriptional Engineering in the Genus Clostridium

    Science.gov (United States)

    Bruder, Mark R.; Pyne, Michael E.; Moo-Young, Murray

    2016-01-01

    ABSTRACT The discovery and exploitation of the prokaryotic adaptive immunity system based on clustered regularly interspaced short palindromic repeats (CRISPRs) and CRISPR-associated (Cas) proteins have revolutionized genetic engineering. CRISPR-Cas tools have enabled extensive genome editing as well as efficient modulation of the transcriptional program in a multitude of organisms. Progress in the development of genetic engineering tools for the genus Clostridium has lagged behind that of many other prokaryotes, presenting the CRISPR-Cas technology an opportunity to resolve a long-existing issue. Here, we applied the Streptococcus pyogenes type II CRISPR-Cas9 (SpCRISPR-Cas9) system for genome editing in Clostridium acetobutylicum DSM792. We further explored the utility of the SpCRISPR-Cas9 machinery for gene-specific transcriptional repression. For proof-of-concept demonstration, a plasmid-encoded fluorescent protein gene was used for transcriptional repression in C. acetobutylicum. Subsequently, we targeted the carbon catabolite repression (CCR) system of C. acetobutylicum through transcriptional repression of the hprK gene encoding HPr kinase/phosphorylase, leading to the coutilization of glucose and xylose, which are two abundant carbon sources from lignocellulosic feedstocks. Similar approaches based on SpCRISPR-Cas9 for genome editing and transcriptional repression were also demonstrated in Clostridium pasteurianum ATCC 6013. As such, this work lays a foundation for the derivation of clostridial strains for industrial purposes. IMPORTANCE After recognizing the industrial potential of Clostridium for decades, methods for the genetic manipulation of these anaerobic bacteria are still underdeveloped. This study reports the implementation of CRISPR-Cas technology for genome editing and transcriptional regulation in Clostridium acetobutylicum, which is arguably the most common industrial clostridial strain. The developed genetic tools enable simpler, more reliable

  9. Composed ensembles of random unitary ensembles

    CERN Document Server

    Pozniak, M; Kus, M; Pozniak, Marcin; Zyczkowski, Karol; Kus, Marek

    1997-01-01

    Composed ensembles of random unitary matrices are defined via products of matrices, each pertaining to a given canonical circular ensemble of Dyson. We investigate statistical properties of spectra of some composed ensembles and demonstrate their physical relevance. We discuss also the methods of generating random matrices distributed according to invariant Haar measure on the orthogonal and unitary group.

  10. Estimating preselected and postselected ensembles

    Energy Technology Data Exchange (ETDEWEB)

    Massar, Serge [Laboratoire d' Information Quantique, C.P. 225, Universite libre de Bruxelles (U.L.B.), Av. F. D. Rooselvelt 50, B-1050 Bruxelles (Belgium); Popescu, Sandu [H. H. Wills Physics Laboratory, University of Bristol, Tyndall Avenue, Bristol BS8 1TL (United Kingdom); Hewlett-Packard Laboratories, Stoke Gifford, Bristol BS12 6QZ (United Kingdom)

    2011-11-15

    In analogy with the usual quantum state-estimation problem, we introduce the problem of state estimation for a pre- and postselected ensemble. The problem has fundamental physical significance since, as argued by Y. Aharonov and collaborators, pre- and postselected ensembles are the most basic quantum ensembles. Two new features are shown to appear: (1) information is flowing to the measuring device both from the past and from the future; (2) because of the postselection, certain measurement outcomes can be forced never to occur. Due to these features, state estimation in such ensembles is dramatically different from the case of ordinary, preselected-only ensembles. We develop a general theoretical framework for studying this problem and illustrate it through several examples. We also prove general theorems establishing that information flowing from the future is closely related to, and in some cases equivalent to, the complex conjugate information flowing from the past. Finally, we illustrate our approach on examples involving covariant measurements on spin-1/2 particles. We emphasize that all state-estimation problems can be extended to the pre- and postselected situation. The present work thus lays the foundations of a much more general theory of quantum state estimation.

  11. Novel Rheumatoid Arthritis Susceptibility Locus at 22q12 Identified in an Extended UK Genome-Wide Association Study

    Science.gov (United States)

    Orozco, Gisela; Viatte, Sebastien; Bowes, John; Martin, Paul; Wilson, Anthony G; Morgan, Ann W; Steer, Sophia; Wordsworth, Paul; Hocking, Lynne J; Barton, Anne; Worthington, Jane; Eyre, Stephen

    2014-01-01

    Objective The number of confirmed rheumatoid arthritis (RA) loci currently stands at 32, but many lines of evidence indicate that expansion of existing genome-wide association studies (GWAS) enhances the power to detect additional loci. This study was undertaken to extend our previous RA GWAS in a UK cohort, adding more independent RA cases and healthy controls, with the aim of detecting novel association signals for susceptibility to RA in a homogeneous UK cohort. Methods A total of 3,223 UK RA cases and 5,272 UK controls were available for association analyses, with the extension adding 1,361 cases and 2,334 controls to the original GWAS data set. The genotype data for all RA cases were imputed using the Impute program version 2. After stringent quality control thresholds were applied, 3,034 cases and 5,271 controls (1,831,729 single-nucleotide polymorphisms [SNPs]) were available for analysis. Association testing was performed using Plink software. Results The analyses indicated a suggestive association with susceptibility to RA (P < 0.0001) for 6 novel RA loci that have been previously found to be associated with other autoimmune diseases; these 6 SNPs were genotyped in independent samples. Two of the associated loci were validated, one of which was associated with RA at genome-wide levels of significance in the combined analysis, identifying a novel RA locus at 22q12 (P = 6.9 × 10−9). In addition, most of the previously known RA susceptibility loci were confirmed to be associated with RA, and for 16 of the loci, the strength of the association was increased. Conclusion This study identified a new RA locus mapping to 22q12. These results support the notion that increasing the power of GWAS enhances novel gene discovery. PMID:24449572

  12. A genome wide association study of alcohol dependence symptom counts in extended pedigrees identifies C15orf53

    Science.gov (United States)

    Wang, Jen-Chyong; Foroud, Tatiana; Hinrichs, Anthony L; Le, Nhung XH; Bertelsen, Sarah; Budde, John P; Harari, Oscar; Koller, Daniel L; Wetherill, Leah; Agrawal, Arpana; Almasy, Laura; Brooks, Andrew I; Bucholz, Kathleen; Dick, Danielle; Hesselbrock, Victor; Johnson, Eric O; Kang, Sun; Kapoor, Manav; Kramer, John; Kuperman, Samuel; Madden, Pamela AF; Manz, Niklas; Martin, Nicholas G; McClintick, Jeanette N; Montgomery, Grant W; Nurnberger, John I; Rangaswamy, Madhavi; Rice, John; Schuckit, Marc; Tischfield, Jay A; Whitfield, John B; Xuei, Xiaoling; Porjesz, Bernice; Heath, Andrew C; Edenberg, Howard J; Bierut, Laura J; Goate, Alison M

    2013-01-01

    Several studies have identified genes associated with alcohol use disorders, but the variation in each of these genes explains only a small portion of the genetic vulnerability. The goal of the present study was to perform a genome-wide association study (GWAS) in extended families from the Collaborative Study on the Genetics of Alcoholism (COGA) to identify novel genes affecting risk for alcohol dependence. To maximize the power of the extended family design we used a quantitative endophenotype, measured in all individuals: number of alcohol dependence symptoms endorsed (symptom count). Secondary analyses were performed to determine if the single nucleotide polymorphisms (SNPs) associated with symptom count were also associated with the dichotomous phenotype, DSM-IV alcohol dependence. This family-based GWAS identified SNPs in C15orf53 that are strongly associated with DSM-IV alcohol (p=4.5×10−8, inflation corrected p=9.4×10−7). Results with DSM-IV alcohol dependence in the regions of interest support our findings with symptom count, though the associations were less significant. Attempted replications of the most promising association results were conducted in two independent samples: non-overlapping subjects from the Study of Addiction: Genes and Environment (SAGE) and the Australian twin-family study of alcohol use disorders (OZALC). Nominal association of C15orf53 with symptom count was observed in SAGE. The variant that showed strongest association with symptom count, rs12912251 and its highly correlated variants (D′=1, r2≥ 0.95), has previously been associated with risk for bipolar disorder. PMID:23089632

  13. MSEBAG: a dynamic classifier ensemble generation based on `minimum-sufficient ensemble' and bagging

    Science.gov (United States)

    Chen, Lei; Kamel, Mohamed S.

    2016-01-01

    In this paper, we propose a dynamic classifier system, MSEBAG, which is characterised by searching for the 'minimum-sufficient ensemble' and bagging at the ensemble level. It adopts an 'over-generation and selection' strategy and aims to achieve a good bias-variance trade-off. In the training phase, MSEBAG first searches for the 'minimum-sufficient ensemble', which maximises the in-sample fitness with the minimal number of base classifiers. Then, starting from the 'minimum-sufficient ensemble', a backward stepwise algorithm is employed to generate a collection of ensembles. The objective is to create a collection of ensembles with a descending fitness on the data, as well as a descending complexity in the structure. MSEBAG dynamically selects the ensembles from the collection for the decision aggregation. The extended adaptive aggregation (EAA) approach, a bagging-style algorithm performed at the ensemble level, is employed for this task. EAA searches for the competent ensembles using a score function, which takes into consideration both the in-sample fitness and the confidence of the statistical inference, and averages the decisions of the selected ensembles to label the test pattern. The experimental results show that the proposed MSEBAG outperforms the benchmarks on average.

  14. Exploring ensemble visualization

    Science.gov (United States)

    Phadke, Madhura N.; Pinto, Lifford; Alabi, Oluwafemi; Harter, Jonathan; Taylor, Russell M., II; Wu, Xunlei; Petersen, Hannah; Bass, Steffen A.; Healey, Christopher G.

    2012-01-01

    An ensemble is a collection of related datasets. Each dataset, or member, of an ensemble is normally large, multidimensional, and spatio-temporal. Ensembles are used extensively by scientists and mathematicians, for example, by executing a simulation repeatedly with slightly different input parameters and saving the results in an ensemble to see how parameter choices affect the simulation. To draw inferences from an ensemble, scientists need to compare data both within and between ensemble members. We propose two techniques to support ensemble exploration and comparison: a pairwise sequential animation method that visualizes locally neighboring members simultaneously, and a screen door tinting method that visualizes subsets of members using screen space subdivision. We demonstrate the capabilities of both techniques, first using synthetic data, then with simulation data of heavy ion collisions in high-energy physics. Results show that both techniques are capable of supporting meaningful comparisons of ensemble data.

  15. Genome-Wide Copy Number Variation Analysis in Extended Families and Unrelated Individuals Characterized for Musical Aptitude and Creativity in Music

    OpenAIRE

    Ukkola-Vuoti, Liisa; Kanduri, Chakravarthi; Oikkonen, Jaana; Buck, Gemma; Blancher, Christine; Raijas, Pirre; Karma, Kai; Lähdesmäki, Harri; Järvelä, Irma

    2013-01-01

    Music perception and practice represent complex cognitive functions of the human brain. Recently, evidence for the molecular genetic background of music related phenotypes has been obtained. In order to further elucidate the molecular background of musical phenotypes we analyzed genome wide copy number variations (CNVs) in five extended pedigrees and in 172 unrelated subjects characterized for musical aptitude and creative functions in music. Musical aptitude was defined by combination of the...

  16. Spectral diagonal ensemble Kalman filters

    CERN Document Server

    Kasanický, Ivan; Vejmelka, Martin

    2015-01-01

    A new type of ensemble Kalman filter is developed, which is based on replacing the sample covariance in the analysis step by its diagonal in a spectral basis. It is proved that this technique improves the aproximation of the covariance when the covariance itself is diagonal in the spectral basis, as is the case, e.g., for a second-order stationary random field and the Fourier basis. The method is extended by wavelets to the case when the state variables are random fields, which are not spatially homogeneous. Efficient implementations by the fast Fourier transform (FFT) and discrete wavelet transform (DWT) are presented for several types of observations, including high-dimensional data given on a part of the domain, such as radar and satellite images. Computational experiments confirm that the method performs well on the Lorenz 96 problem and the shallow water equations with very small ensembles and over multiple analysis cycles.

  17. Making Tree Ensembles Interpretable

    OpenAIRE

    Hara, Satoshi; Hayashi, Kohei

    2016-01-01

    Tree ensembles, such as random forest and boosted trees, are renowned for their high prediction performance, whereas their interpretability is critically limited. In this paper, we propose a post processing method that improves the model interpretability of tree ensembles. After learning a complex tree ensembles in a standard way, we approximate it by a simpler model that is interpretable for human. To obtain the simpler model, we derive the EM algorithm minimizing the KL divergence from the ...

  18. Multilevel ensemble Kalman filter

    KAUST Repository

    Chernov, Alexey

    2016-01-06

    This work embeds a multilevel Monte Carlo (MLMC) sampling strategy into the Monte Carlo step of the ensemble Kalman filter (EnKF). In terms of computational cost vs. approximation error the asymptotic performance of the multilevel ensemble Kalman filter (MLEnKF) is superior to the EnKF s.

  19. The Ensembl REST API: Ensembl Data for Any Language.

    Science.gov (United States)

    Yates, Andrew; Beal, Kathryn; Keenan, Stephen; McLaren, William; Pignatelli, Miguel; Ritchie, Graham R S; Ruffier, Magali; Taylor, Kieron; Vullo, Alessandro; Flicek, Paul

    2015-01-01

    We present a Web service to access Ensembl data using Representational State Transfer (REST). The Ensembl REST server enables the easy retrieval of a wide range of Ensembl data by most programming languages, using standard formats such as JSON and FASTA while minimizing client work. We also introduce bindings to the popular Ensembl Variant Effect Predictor tool permitting large-scale programmatic variant analysis independent of any specific programming language. The Ensembl REST API can be accessed at http://rest.ensembl.org and source code is freely available under an Apache 2.0 license from http://github.com/Ensembl/ensembl-rest. © The Author 2014. Published by Oxford University Press.

  20. Extended analysis of a genome-wide association study in primary sclerosing cholangitis detects multiple novel risk loci

    NARCIS (Netherlands)

    Folseraas, Trine; Melum, Espen; Rausch, Philipp; Juran, Brian D.; Ellinghaus, Eva; Shiryaev, Alexey; Laerdahl, Jon K.; Ellinghaus, David; Schramm, Christoph; Weismueller, Tobias J.; Gotthard, Daniel Nils; Hov, Johannes Roksund; Clausen, Ole Petter; Weersma, Rinse K.; Janse, Marcel; Boberg, Kirsten Muri; Bjornsson, Einar; Marschall, Hanns-Ulrich; Cleynen, Isabelle; Rosenstiel, Philip; Holm, Kristian; Teufel, Andreas; Rust, Christian; Gieger, Christian; Wichmann, H-Erich; Bergquist, Annika; Ryu, Euijung; Ponsioen, Cyriel Y.; Runz, Heiko; Sterneck, Martina; Vermeire, Severine; Beuers, Ulrich; Wijmenga, Cisca; Schrumpf, Erik; Manns, Michael P.; Lazaridis, Konstantinos N.; Schreiber, Stefan; Baines, John F.; Franke, Andre; Karlsen, Tom H.

    Background & Aims: A limited number of genetic risk factors have been reported in primary sclerosing cholangitis (PSC). To discover further genetic susceptibility factors for PSC, we followed up on,a second tier of single nucleotide polymorphisms (SNPs) from a genome-wide association study (GWAS).

  1. Calibrating ensemble reliability whilst preserving spatial structure

    Directory of Open Access Journals (Sweden)

    Jonathan Flowerdew

    2014-03-01

    Full Text Available Ensemble forecasts aim to improve decision-making by predicting a set of possible outcomes. Ideally, these would provide probabilities which are both sharp and reliable. In practice, the models, data assimilation and ensemble perturbation systems are all imperfect, leading to deficiencies in the predicted probabilities. This paper presents an ensemble post-processing scheme which directly targets local reliability, calibrating both climatology and ensemble dispersion in one coherent operation. It makes minimal assumptions about the underlying statistical distributions, aiming to extract as much information as possible from the original dynamic forecasts and support statistically awkward variables such as precipitation. The output is a set of ensemble members preserving the spatial, temporal and inter-variable structure from the raw forecasts, which should be beneficial to downstream applications such as hydrological models. The calibration is tested on three leading 15-d ensemble systems, and their aggregation into a simple multimodel ensemble. Results are presented for 12 h, 1° scale over Europe for a range of surface variables, including precipitation. The scheme is very effective at removing unreliability from the raw forecasts, whilst generally preserving or improving statistical resolution. In most cases, these benefits extend to the rarest events at each location within the 2-yr verification period. The reliability and resolution are generally equivalent or superior to those achieved using a Local Quantile-Quantile Transform, an established calibration method which generalises bias correction. The value of preserving spatial structure is demonstrated by the fact that 3×3 averages derived from grid-scale precipitation calibration perform almost as well as direct calibration at 3×3 scale, and much better than a similar test neglecting the spatial relationships. Some remaining issues are discussed regarding the finite size of the output

  2. Genomic Dissection of Travel-Associated Extended-Spectrum-Beta-Lactamase-Producing Salmonella enterica Serovar Typhi Isolates Originating from the Philippines: a One-Off Occurrence or a Threat to Effective Treatment of Typhoid Fever?

    DEFF Research Database (Denmark)

    Hendriksen, Rene S.; Leekitcharoenphon, Pimlapas; Mikoleit, Matthew

    2015-01-01

    One unreported case of extended-spectrum-beta-lactamase (ESBL)-producing Salmonella enterica serovar Typhi was identified, whole-genome sequence typed, among other analyses, and compared to other available genomes of S. Typhi. The reported strain was similar to a previously published strain harbo...

  3. Ensemble data assimilation with an adjusted forecast spread

    Directory of Open Access Journals (Sweden)

    Sabrina Rainwater

    2013-04-01

    Full Text Available Ensemble data assimilation typically evolves an ensemble of model states whose spread is intended to represent the algorithm's uncertainty about the state of the physical system that produces the data. The analysis phase treats the forecast ensemble as a random sample from a background distribution, and it transforms the ensemble according to the background and observation error statistics to provide an appropriate sample for the next forecast phase. We find that in the presence of model nonlinearity and model error, it can be fruitful to rescale the ensemble spread prior to the forecast and then reverse this rescaling after the forecast. We call this approach forecast spread adjustment, which we discuss and test in this article using an ensemble Kalman filter and a 2005 model due to Lorenz. We argue that forecast spread adjustment provides a tunable parameter, that is, complementary to covariance inflation, which cumulatively increases ensemble spread to compensate for underestimation of uncertainty. We also show that as the adjustment parameter approaches zero, the filter approaches the extended Kalman filter if the ensemble size is sufficiently large. We find that varying the adjustment parameter can significantly reduce analysis and forecast errors in some cases. We evaluate how the improvement provided by forecast spread adjustment depends on ensemble size, observation error and model error. Our results indicate that the technique is most effective for small ensembles, small observation error and large model error, though the effectiveness depends significantly on the nature of the model error.

  4. Ensemble Enabled Weighted PageRank

    CERN Document Server

    Luo, Dongsheng; Hu, Renjun; Duan, Liang; Ma, Shuai

    2016-01-01

    This paper describes our solution for WSDM Cup 2016. Ranking the query independent importance of scholarly articles is a critical and challenging task, due to the heterogeneity and dynamism of entities involved. Our approach is called Ensemble enabled Weighted PageRank (EWPR). To do this, we first propose Time-Weighted PageRank that extends PageRank by introducing a time decaying factor. We then develop an ensemble method to assemble the authorities of the heterogeneous entities involved in scholarly articles. We finally propose to use external data sources to further improve the ranking accuracy. Our experimental study shows that our EWPR is a good choice for ranking scholarly articles.

  5. Ensemble Data Mining Methods

    Science.gov (United States)

    Oza, Nikunj C.

    2004-01-01

    Ensemble Data Mining Methods, also known as Committee Methods or Model Combiners, are machine learning methods that leverage the power of multiple models to achieve better prediction accuracy than any of the individual models could on their own. The basic goal when designing an ensemble is the same as when establishing a committee of people: each member of the committee should be as competent as possible, but the members should be complementary to one another. If the members are not complementary, Le., if they always agree, then the committee is unnecessary---any one member is sufficient. If the members are complementary, then when one or a few members make an error, the probability is high that the remaining members can correct this error. Research in ensemble methods has largely revolved around designing ensembles consisting of competent yet complementary models.

  6. Ensemble Data Mining Methods

    Data.gov (United States)

    National Aeronautics and Space Administration — Ensemble Data Mining Methods, also known as Committee Methods or Model Combiners, are machine learning methods that leverage the power of multiple models to achieve...

  7. Vibrio cholerae VttRA and VttRB Regulatory Influences Extend beyond the Type 3 Secretion System Genomic Island

    OpenAIRE

    Chaand, Mudit; Dziejman, Michelle

    2013-01-01

    A subset of non-O1/non-O139 serogroup strains of Vibrio cholerae cause disease using type 3 secretion system (T3SS)-mediated mechanisms. An ∼50-kb genomic island carries genes encoding the T3SS structural apparatus, effector proteins, and two transmembrane transcriptional regulators, VttRA and VttRB, which are ToxR homologues. Previous experiments demonstrated that VttRA and VttRB are necessary for colonization in vivo and promote bile-dependent T3SS gene expression in vitro. To better unders...

  8. Vibrio cholerae VttRA and VttRB Regulatory Influences Extend beyond the Type 3 Secretion System Genomic Island

    OpenAIRE

    Chaand, Mudit; Dziejman, Michelle

    2013-01-01

    A subset of non-O1/non-O139 serogroup strains of Vibrio cholerae cause disease using type 3 secretion system (T3SS)-mediated mechanisms. An ∼50-kb genomic island carries genes encoding the T3SS structural apparatus, effector proteins, and two transmembrane transcriptional regulators, VttRA and VttRB, which are ToxR homologues. Previous experiments demonstrated that VttRA and VttRB are necessary for colonization in vivo and promote bile-dependent T3SS gene expression in vitro. To better unders...

  9. Statistical ensembles and fragmentation of finite nuclei

    Science.gov (United States)

    Das, P.; Mallik, S.; Chaudhuri, G.

    2017-09-01

    Statistical models based on different ensembles are very commonly used to describe the nuclear multifragmentation reaction in heavy ion collisions at intermediate energies. Canonical model results are more appropriate for finite nuclei calculations while those obtained from the grand canonical ones are more easily calculable. A transformation relation has been worked out for converting results of finite nuclei from grand canonical to canonical and vice versa. The formula shows that, irrespective of the particle number fluctuation in the grand canonical ensemble, exact canonical results can be recovered for observables varying linearly or quadratically with the number of particles. This result is of great significance since the baryon and charge conservation constraints can make the exact canonical calculations extremely difficult in general. This concept developed in this work can be extended in future for transformation to ensembles where analytical solutions do not exist. The applicability of certain equations (isoscaling, etc.) in the regime of finite nuclei can also be tested using this transformation relation.

  10. DNA profiling of extended tracts of primitive DNA repeats: Direct identification of unstable simple repeat loci in complex genome

    Energy Technology Data Exchange (ETDEWEB)

    Rogaeva, E.A.; Korovaitseva, G.; St. George-Hyslop, P. [Univ. of Toronto (Canada)] [and others

    1994-09-01

    The most simple DNA repetitive elements, with repetitive monomer units of only 1-10 bp in tandem tracts, are an abundant component of the human genome. The expansion of at least one type of these repeats ((CCG)n and (CTG)n) have been detected for a several neurological diseases with anticipation in successive generations. We propose here a simple method for the identification of particularly expanded repeats and for the recovery of flanking sequences. We generated DNA probes using PCR to create long concatamers (n>100) by amplification of the di-, tri-, tetra-, penta- and hexa-nucleotide repeat oligonucleotide primer pairs. To reduce the complexity of the background band pattern, the genomic DNA was restricted with a mixture of at least five different endonucleases, thereby reducing the size of restriction fragments containing short simple repeat arrays while leaving intact the large fragments containing the longer simple repeats arrays. Direct blot hybridization has shown different {open_quotes}DNA fingerprint{close_quotes} patterns with all arbitrary selected di-hexa nucleotide repeat probes. Direct hybridization of the (CTG)n and (CCG)n probes revealed simple or multiple band patterns depending upon stringency conditions. We were able to detect the presence of expanded unstable tri-nucleotide alleles by (CCG)n probe for some FRAXA subjects and by (CTG)n probe for some myotonic dystrophy subjects which were not present in the parental DNA patterns. The cloning of the unstable alleles for simple repeats can be performed by direct recover from agarose gels of the aberrant unstable bands detected above. The recovered flanking regions can be cloned, sequenced and used for PCR detection of expanded alleles or can be used to screen cDNA. This method may be used for testing of small families with diseases thought to display clinical evidence of anticipation.

  11. Genomic Epidemiology of Gonococcal Resistance to Extended-Spectrum Cephalosporins, Macrolides, and Fluoroquinolones in the United States, 2000-2013.

    Science.gov (United States)

    Grad, Yonatan H; Harris, Simon R; Kirkcaldy, Robert D; Green, Anna G; Marks, Debora S; Bentley, Stephen D; Trees, David; Lipsitch, Marc

    2016-11-15

     Treatment of Neisseria gonorrhoeae infection is empirical and based on population-wide susceptibilities. Increasing antimicrobial resistance underscores the potential importance of rapid diagnostic tests, including sequence-based tests, to guide therapy. However, the usefulness of sequence-based diagnostic tests depends on the prevalence and dynamics of the resistance mechanisms.  We define the prevalence and dynamics of resistance markers to extended-spectrum cephalosporins, macrolides, and fluoroquinolones in 1102 resistant and susceptible clinical N. gonorrhoeae isolates collected from 2000 to 2013 via the Centers for Disease Control and Prevention's Gonococcal Isolate Surveillance Project.  Reduced extended-spectrum cephalosporin susceptibility is predominantly clonal and associated with the mosaic penA XXXIV allele and derivatives (sensitivity 98% for cefixime and 91% for ceftriaxone), but alternative resistance mechanisms have sporadically emerged. Reduced azithromycin susceptibility has arisen through multiple mechanisms and shows limited clonal spread; the basis for resistance in 36% of isolates with reduced azithromycin susceptibility is unclear. Quinolone-resistant N. gonorrhoeae has arisen multiple times, with extensive clonal spread.  Quinolone-resistant N. gonorrhoeae and reduced cefixime susceptibility appear amenable to development of sequence-based diagnostic tests, whereas the undefined mechanisms of resistance to ceftriaxone and azithromycin underscore the importance of phenotypic surveillance. The identification of multidrug-resistant isolates highlights the need for additional measures to respond to the threat of untreatable gonorrhea. © The Author 2016. Published by Oxford University Press for the Infectious Diseases Society of America.

  12. The semantic similarity ensemble

    Directory of Open Access Journals (Sweden)

    Andrea Ballatore

    2013-12-01

    Full Text Available Computational measures of semantic similarity between geographic terms provide valuable support across geographic information retrieval, data mining, and information integration. To date, a wide variety of approaches to geo-semantic similarity have been devised. A judgment of similarity is not intrinsically right or wrong, but obtains a certain degree of cognitive plausibility, depending on how closely it mimics human behavior. Thus selecting the most appropriate measure for a specific task is a significant challenge. To address this issue, we make an analogy between computational similarity measures and soliciting domain expert opinions, which incorporate a subjective set of beliefs, perceptions, hypotheses, and epistemic biases. Following this analogy, we define the semantic similarity ensemble (SSE as a composition of different similarity measures, acting as a panel of experts having to reach a decision on the semantic similarity of a set of geographic terms. The approach is evaluated in comparison to human judgments, and results indicate that an SSE performs better than the average of its parts. Although the best member tends to outperform the ensemble, all ensembles outperform the average performance of each ensemble's member. Hence, in contexts where the best measure is unknown, the ensemble provides a more cognitively plausible approach.

  13. Conveniently Pre-Tagged and Pre-Packaged: Extended Molecular Identification and Metagenomics Using Complete Metazoan Mitochondrial Genomes

    Science.gov (United States)

    Dettai, Agnes; Gallut, Cyril; Brouillet, Sophie; Pothier, Joel; Lecointre, Guillaume; Debruyne, Régis

    2012-01-01

    Background Researchers sorely need markers and approaches for biodiversity exploration (both specimen linked and metagenomics) using the full potential of next generation sequencing technologies (NGST). Currently, most studies rely on expensive multiple tagging, PCR primer universality and/or the use of few markers, sometimes with insufficient variability. Methodology/Principal Findings We propose a novel approach for the isolation and sequencing of a universal, useful and popular marker across distant, non-model metazoans: the complete mitochondrial genome. It relies on the properties of metazoan mitogenomes for enrichment, on careful choice of the organisms to multiplex, as well as on the wide collection of accumulated mitochondrial reference datasets for post-sequencing sorting and identification instead of individual tagging. Multiple divergent organisms can be sequenced simultaneously, and their complete mitogenome obtained at a very low cost. We provide in silico testing of dataset assembly for a selected set of example datasets. Conclusions/Significance This approach generates large mitogenome datasets. These sequences are useful for phylogenetics, molecular identification and molecular ecology studies, and are compatible with all existing projects or available datasets based on mitochondrial sequences, such as the Barcode of Life project. Our method can yield sequences both from identified samples and metagenomic samples. The use of the same datasets for both kinds of studies makes for a powerful approach, especially since the datasets have a high variability even at species level, and would be a useful complement to the less variable 18S rDNA currently prevailing in metagenomic studies. PMID:23251474

  14. Hierarchical Bayes Ensemble Kalman Filtering

    CERN Document Server

    Tsyrulnikov, Michael

    2015-01-01

    Ensemble Kalman filtering (EnKF), when applied to high-dimensional systems, suffers from an inevitably small affordable ensemble size, which results in poor estimates of the background error covariance matrix ${\\bf B}$. The common remedy is a kind of regularization, usually an ad-hoc spatial covariance localization (tapering) combined with artificial covariance inflation. Instead of using an ad-hoc regularization, we adopt the idea by Myrseth and Omre (2010) and explicitly admit that the ${\\bf B}$ matrix is unknown and random and estimate it along with the state (${\\bf x}$) in an optimal hierarchical Bayes analysis scheme. We separate forecast errors into predictability errors (i.e. forecast errors due to uncertainties in the initial data) and model errors (forecast errors due to imperfections in the forecast model) and include the two respective components ${\\bf P}$ and ${\\bf Q}$ of the ${\\bf B}$ matrix into the extended control vector $({\\bf x},{\\bf P},{\\bf Q})$. Similarly, we break the traditional backgrou...

  15. Visualizing ensembles in structural biology.

    Science.gov (United States)

    Melvin, Ryan L; Salsbury, Freddie R

    2016-06-01

    Displaying a single representative conformation of a biopolymer rather than an ensemble of states mistakenly conveys a static nature rather than the actual dynamic personality of biopolymers. However, there are few apparent options due to the fixed nature of print media. Here we suggest a standardized methodology for visually indicating the distribution width, standard deviation and uncertainty of ensembles of states with little loss of the visual simplicity of displaying a single representative conformation. Of particular note is that the visualization method employed clearly distinguishes between isotropic and anisotropic motion of polymer subunits. We also apply this method to ligand binding, suggesting a way to indicate the expected error in many high throughput docking programs when visualizing the structural spread of the output. We provide several examples in the context of nucleic acids and proteins with particular insights gained via this method. Such examples include investigating a therapeutic polymer of FdUMP (5-fluoro-2-deoxyuridine-5-O-monophosphate) - a topoisomerase-1 (Top1), apoptosis-inducing poison - and nucleotide-binding proteins responsible for ATP hydrolysis from Bacillus subtilis. We also discuss how these methods can be extended to any macromolecular data set with an underlying distribution, including experimental data such as NMR structures.

  16. Genomic dissection of travel-associated extended-spectrum-beta-lactamase-producing Salmonella enterica serovar typhi isolates originating from the Philippines: a one-off occurrence or a threat to effective treatment of typhoid fever?

    Science.gov (United States)

    Hendriksen, Rene S; Leekitcharoenphon, Pimlapas; Mikoleit, Matthew; Jensen, Jacob Dyring; Kaas, Rolf Sommer; Roer, Louise; Joshi, Heena B; Pornruangmong, Srirat; Pulsrikarn, Chaiwat; Gonzalez-Aviles, Gladys D; Reuland, E Ascelijn; Al Naiemi, Nashwan; Wester, Astrid Louise; Aarestrup, Frank M; Hasman, Henrik

    2015-02-01

    One unreported case of extended-spectrum-beta-lactamase (ESBL)-producing Salmonella enterica serovar Typhi was identified, whole-genome sequence typed, among other analyses, and compared to other available genomes of S. Typhi. The reported strain was similar to a previously published strain harboring blaSHV-12 from the Philippines and likely part of an undetected outbreak, the first of ESBL-producing S. Typhi. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  17. Protective Garment Ensemble

    Science.gov (United States)

    Wakefield, M. E.

    1982-01-01

    Protective garment ensemble with internally-mounted environmental- control unit contains its own air supply. Alternatively, a remote-environmental control unit or an air line is attached at the umbilical quick disconnect. Unit uses liquid air that is vaporized to provide both breathing air and cooling. Totally enclosed garment protects against toxic substances.

  18. Music Ensemble: Course Proposal.

    Science.gov (United States)

    Kovach, Brian

    A proposal is presented for a Music Ensemble course to be offered at the Community College of Philadelphia for music students who have had previous vocal or instrumental training. A standardized course proposal cover form is followed by a statement of purpose for the course, a list of major course goals, a course outline, and a bibliography. Next,…

  19. Neural Network Ensembles

    DEFF Research Database (Denmark)

    Hansen, Lars Kai; Salamon, Peter

    1990-01-01

    We propose several means for improving the performance an training of neural networks for classification. We use crossvalidation as a tool for optimizing network parameters and architecture. We show further that the remaining generalization error can be reduced by invoking ensembles of similar...... networks....

  20. Multilevel ensemble Kalman filtering

    KAUST Repository

    Hoel, Haakon

    2016-01-08

    The ensemble Kalman filter (EnKF) is a sequential filtering method that uses an ensemble of particle paths to estimate the means and covariances required by the Kalman filter by the use of sample moments, i.e., the Monte Carlo method. EnKF is often both robust and efficient, but its performance may suffer in settings where the computational cost of accurate simulations of particles is high. The multilevel Monte Carlo method (MLMC) is an extension of classical Monte Carlo methods which by sampling stochastic realizations on a hierarchy of resolutions may reduce the computational cost of moment approximations by orders of magnitude. In this work we have combined the ideas of MLMC and EnKF to construct the multilevel ensemble Kalman filter (MLEnKF) for the setting of finite dimensional state and observation spaces. The main ideas of this method is to compute particle paths on a hierarchy of resolutions and to apply multilevel estimators on the ensemble hierarchy of particles to compute Kalman filter means and covariances. Theoretical results and a numerical study of the performance gains of MLEnKF over EnKF will be presented. Some ideas on the extension of MLEnKF to settings with infinite dimensional state spaces will also be presented.

  1. Deterministic entanglement of Rydberg ensembles by engineered dissipation

    DEFF Research Database (Denmark)

    Dasari, Durga; Mølmer, Klaus

    2014-01-01

    We propose a scheme that employs dissipation to deterministically generate entanglement in an ensemble of strongly interacting Rydberg atoms. With a combination of microwave driving between different Rydberg levels and a resonant laser coupling to a short lived atomic state, the ensemble can...... be driven towards a dark steady state that entangles all atoms. The long-range resonant dipole-dipole interaction between different Rydberg states extends the entanglement beyond the van der Walls interaction range with perspectives for entangling large and distant ensembles....

  2. Stochastic and dynamical downscaling of ensemble precipitation forecasts

    Science.gov (United States)

    Brussolo, E.; von Hardenberg, J.; Rebora, N.

    2009-04-01

    Forecasting hydrogeological risk in small basins requires quantitative forecasts and an estimate of the probability of occurrence of severe, localized precipitation events at spatial scales of the order of tens of kilometers or less, significantly smaller than those currently provided by large scale, global, ensemble forecasting systems (EPS). Dynamically based forecasts at these scales can be obtained extending EPS scenarios with high-resolution, non-hydrostatic, limited area ensemble prediction systems. An alternative is represented by the direct application of stochastic downscaling techniques to the large scale ensemble forecasts. This work compares the performances of these two very different ensemble forecast downscaling approaches. To this purpose we consider ensemble forecasts provided by the ECMWF EPS, downscaled in space using the RainFARM stochastic technique [1], and ensembles of forecasts obtained from the COSMO-LEPS limited area prediction system (which also uses ECMWF EPS ensemble members as boundary conditions), for three intense precipitation events over northern Italy in 2006. The statistical properties of the fields produced with these two techniques are compared and the skill of the resulting ensembles is verified against direct precipitation measurements from a dense network of rain gauges. Reference: 1. Rebora, N., L. Ferraris, J. von Hardenberg, and A. Provenzale, 2006: The RainFARM: Rainfall Downscaling by a Filtered AutoRegressive Model. J. Hydrometeorol., 7, 724-738.

  3. Ensemble Forecast: A New Approach to Uncertainty and Predictability

    Institute of Scientific and Technical Information of China (English)

    2005-01-01

    Ensemble techniques have been used to generate daily numerical weather forecasts since the 1990s in numerical centers around the world due to the increase in computation ability. One of the main purposes of numerical ensemble forecasts is to try to assimilate the initial uncertainty (initial error) and the forecast uncertainty (forecast error) by applying either the initial perturbation method or the multi-model/multiphysics method. In fact, the mean of an ensemble forecast offers a better forecast than a deterministic (or control) forecast after a short lead time (3 5 days) for global modelling applications. There is about a 1-2-day improvement in the forecast skill when using an ensemble mean instead of a single forecast for longer lead-time. The skillful forecast (65% and above of an anomaly correlation) could be extended to 8 days (or longer) by present-day ensemble forecast systems. Furthermore, ensemble forecasts can deliver a probabilistic forecast to the users, which is based on the probability density function (PDF)instead of a single-value forecast from a traditional deterministic system. It has long been recognized that the ensemble forecast not only improves our weather forecast predictability but also offers a remarkable forecast for the future uncertainty, such as the relative measure of predictability (RMOP) and probabilistic quantitative precipitation forecast (PQPF). Not surprisingly, the success of the ensemble forecast and its wide application greatly increase the confidence of model developers and research communities.

  4. Extensive amplification of GI-VII-6, a multidrug resistance genomic island of Salmonella enterica serovar Typhimurium, increases resistance to extended-spectrum cephalosporins

    Directory of Open Access Journals (Sweden)

    Ken-ichi eLee

    2015-02-01

    Full Text Available GI-VII-6 is a chromosomally integrated multidrug resistance genomic island harbored by a specific clone of Salmonella enterica serovar Typhimurium (S. Typhimurium. It contains a gene encoding CMY-2 β-lactamase (blaCMY-2, and therefore contributes to extended-spectrum cephalosporin resistance. To elucidate the significance of GI-VII-6 on adaptive evolution, spontaneous mutants of S. Typhimurium strain L-3553 were selected on plates containing cefotaxime (CTX. The concentrations of CTX were higher than its minimum inhibition concentration to the parent strain. The mutants appeared on the plates containing 12.5 and 25 μg/ml CTX at a frequency of 10−6 and 10−8, respectively. No colonies were observed at higher CTX concentrations. The copy number of blaCMY-2 increased up to 85 per genome in the mutants, while the parent strain contains one copy of that in the chromosome. This elevation was accompanied by increased amount of transcription. The blaCMY-2 copy number in the mutants drastically decreased in the absence of antibiotic selection pressure. Southern hybridization analysis and short-read mapping indicated that the entire 125 kb GI-VII-6 or parts of it were tandemly amplified. GI-VII-6 amplification occurred at its original position, although it also transposed to other locations in the genome in some mutants, including an endogenous plasmid in some of the mutants, leading to the amplification of GI-VII-6 at different loci. Insertion sequences were observed at the junction of the amplified regions in the mutants, suggesting their significant roles in the transposition and amplification. Plasmid copy number in the selected mutants was 1.4 to 4.4 times higher than that of the parent strain. These data suggest that transposition and amplification of the blaCMY-2-containing region, along with the copy number variation of the plasmid, contributed to the extensive amplification of blaCMY-2 and increased resistance to CTX.

  5. Using regulatory and epistatic networks to extend the findings of a genome scan: identifying the gene drivers of pigmentation in merino sheep.

    Science.gov (United States)

    García-Gámez, Elsa; Reverter, Antonio; Whan, Vicki; McWilliam, Sean M; Arranz, Juan José; Kijas, James

    2011-01-01

    Extending genome wide association analysis by the inclusion of gene expression data may assist in the dissection of complex traits. We examined piebald, a pigmentation phenotype in both human and Merino sheep, by analysing multiple data types using a systems approach. First, a case control analysis of 49,034 ovine SNP was performed which confirmed a multigenic basis for the condition. We combined these results with gene expression data from five tissue types analysed with a skin-specific microarray. Promoter sequence analysis of differentially expressed genes allowed us to reverse-engineer a regulatory network. Likewise, by testing two-loci models derived from all pair-wise comparisons across piebald-associated SNP, we generated an epistatic network. At the intersection of both networks, we identified thirteen genes with insulin-like growth factor binding protein 7 (IGFBP7), platelet-derived growth factor alpha (PDGFRA) and the tetraspanin platelet activator CD9 at the kernel of the intersection. Further, we report a number of differentially expressed genes in regions containing highly associated SNP including ATRN, DOCK7, FGFR1OP, GLI3, SILV and TBX15. The application of network theory facilitated co-analysis of genetic variation with gene expression, recapitulated aspects of the known molecular biology of skin pigmentation and provided insights into the transcription regulation and epistatic interactions involved in piebald Merino sheep.

  6. Using regulatory and epistatic networks to extend the findings of a genome scan: identifying the gene drivers of pigmentation in merino sheep.

    Directory of Open Access Journals (Sweden)

    Elsa García-Gámez

    Full Text Available Extending genome wide association analysis by the inclusion of gene expression data may assist in the dissection of complex traits. We examined piebald, a pigmentation phenotype in both human and Merino sheep, by analysing multiple data types using a systems approach. First, a case control analysis of 49,034 ovine SNP was performed which confirmed a multigenic basis for the condition. We combined these results with gene expression data from five tissue types analysed with a skin-specific microarray. Promoter sequence analysis of differentially expressed genes allowed us to reverse-engineer a regulatory network. Likewise, by testing two-loci models derived from all pair-wise comparisons across piebald-associated SNP, we generated an epistatic network. At the intersection of both networks, we identified thirteen genes with insulin-like growth factor binding protein 7 (IGFBP7, platelet-derived growth factor alpha (PDGFRA and the tetraspanin platelet activator CD9 at the kernel of the intersection. Further, we report a number of differentially expressed genes in regions containing highly associated SNP including ATRN, DOCK7, FGFR1OP, GLI3, SILV and TBX15. The application of network theory facilitated co-analysis of genetic variation with gene expression, recapitulated aspects of the known molecular biology of skin pigmentation and provided insights into the transcription regulation and epistatic interactions involved in piebald Merino sheep.

  7. Statistical ensembles for money and debt

    Science.gov (United States)

    Viaggiu, Stefano; Lionetto, Andrea; Bargigli, Leonardo; Longo, Michele

    2012-10-01

    We build a statistical ensemble representation of two economic models describing respectively, in simplified terms, a payment system and a credit market. To this purpose we adopt the Boltzmann-Gibbs distribution where the role of the Hamiltonian is taken by the total money supply (i.e. including money created from debt) of a set of interacting economic agents. As a result, we can read the main thermodynamic quantities in terms of monetary ones. In particular, we define for the credit market model a work term which is related to the impact of monetary policy on credit creation. Furthermore, with our formalism we recover and extend some results concerning the temperature of an economic system, previously presented in the literature by considering only the monetary base as a conserved quantity. Finally, we study the statistical ensemble for the Pareto distribution.

  8. Genome-wide association analysis confirms and extends the association of SLC2A9 with serum uric acid levels to Mexican Americans

    Directory of Open Access Journals (Sweden)

    Venkata Saroja eVoruganti

    2013-12-01

    Full Text Available Increased serum uric acid (SUA is a risk factor for gout and renal and cardiovascular disease. The purpose of this study was to identify genetic factors that affect the variation in SUA in 632 Mexican Americans participants of the San Antonio Family Heart Study (SAFHS. A genome-wide association analysis was performed using the Illumina Human Hap 550K single nucleotide polymorphism (SNP microarray. We used a linear regression-based association test under an additive model of allelic effect, while accounting for non-independence among family members via a kinship variance component. All analyses were performed in the software package SOLAR. SNPs rs6832439, rs13131257 and rs737267 in solute carrier protein 2 family, member 9 (SLC2A9 were associated with SUA at genome-wide significance (p <1.3×10-7. The minor alleles of these SNPs had frequencies of 36.2%, 36.2%, and 38.2 %, respectively, and were associated with decreasing SUA levels. All of these SNPs were located in introns 3-7 of SLC2A9, the location of the previously reported associations in European populations. When analyzed for association with cardiovascular-renal disease risk factors, conditional on SLC2A9 SNPs strongly associated with SUA, significant associations were found for SLC2A9 SNPs with BMI, body weight and waist circumference (p < 1.4 x 10-3 and suggestive associations with albumin-creatinine ratio and total antioxidant status. The SLC2A9 gene encodes an urate transporter that has considerable influence on variation in SUA. In addition to the primary association locus, suggestive evidence (p<1.9×10-6 for joint linkage/association was found at a previously-reported urate quantitative trait locus (Logarithm of odds score = 3.6 on 3p26.3. In summary, our GWAS extends and confirms the association of SLC2A9 with SUA for the first time in a Mexican American cohort and also shows for the first time its association with cardiovascular-renal disease risk factors.

  9. Ensembles on configuration space classical, quantum, and beyond

    CERN Document Server

    Hall, Michael J W

    2016-01-01

    This book describes a promising approach to problems in the foundations of quantum mechanics, including the measurement problem. The dynamics of ensembles on configuration space is shown here to be a valuable tool for unifying the formalisms of classical and quantum mechanics, for deriving and extending the latter in various ways, and for addressing the quantum measurement problem. A description of physical systems by means of ensembles on configuration space can be introduced at a very fundamental level: the basic building blocks are a configuration space, probabilities, and Hamiltonian equations of motion for the probabilities. The formalism can describe both classical and quantum systems, and their thermodynamics, with the main difference being the choice of ensemble Hamiltonian. Furthermore, there is a natural way of introducing ensemble Hamiltonians that describe the evolution of hybrid systems; i.e., interacting systems that have distinct classical and quantum sectors, allowing for consistent descriptio...

  10. Imprinting and recalling cortical ensembles.

    Science.gov (United States)

    Carrillo-Reid, Luis; Yang, Weijian; Bando, Yuki; Peterka, Darcy S; Yuste, Rafael

    2016-08-12

    Neuronal ensembles are coactive groups of neurons that may represent building blocks of cortical circuits. These ensembles could be formed by Hebbian plasticity, whereby synapses between coactive neurons are strengthened. Here we report that repetitive activation with two-photon optogenetics of neuronal populations from ensembles in the visual cortex of awake mice builds neuronal ensembles that recur spontaneously after being imprinted and do not disrupt preexisting ones. Moreover, imprinted ensembles can be recalled by single- cell stimulation and remain coactive on consecutive days. Our results demonstrate the persistent reconfiguration of cortical circuits by two-photon optogenetics into neuronal ensembles that can perform pattern completion. Copyright © 2016, American Association for the Advancement of Science.

  11. Multilevel ensemble Kalman filtering

    KAUST Repository

    Hoel, Hakon

    2016-06-14

    This work embeds a multilevel Monte Carlo sampling strategy into the Monte Carlo step of the ensemble Kalman filter (EnKF) in the setting of finite dimensional signal evolution and noisy discrete-time observations. The signal dynamics is assumed to be governed by a stochastic differential equation (SDE), and a hierarchy of time grids is introduced for multilevel numerical integration of that SDE. The resulting multilevel EnKF is proved to asymptotically outperform EnKF in terms of computational cost versus approximation accuracy. The theoretical results are illustrated numerically.

  12. Critical behavior in topological ensembles

    CERN Document Server

    Bulycheva, K; Nechaev, S

    2014-01-01

    We consider the relation between three physical problems: 2D directed lattice random walks in an external magnetic field, ensembles of torus knots, and 5d Abelian SUSY gauge theory with massless hypermultiplet in $\\Omega$ background. All these systems exhibit the critical behavior typical for the "area+length" statistics of grand ensembles of 2D directed paths. In particular, using the combinatorial description, we have found the new critical behavior in the ensembles of the torus knots and in the instanton ensemble in 5d gauge theory. The relation with the integrable model is discussed.

  13. Genome-Wide Copy Number Variation Analysis in Extended Families and Unrelated Individuals Characterized for Musical Aptitude and Creativity in Music

    Science.gov (United States)

    Oikkonen, Jaana; Buck, Gemma; Blancher, Christine; Raijas, Pirre; Karma, Kai; Lähdesmäki, Harri; Järvelä, Irma

    2013-01-01

    Music perception and practice represent complex cognitive functions of the human brain. Recently, evidence for the molecular genetic background of music related phenotypes has been obtained. In order to further elucidate the molecular background of musical phenotypes we analyzed genome wide copy number variations (CNVs) in five extended pedigrees and in 172 unrelated subjects characterized for musical aptitude and creative functions in music. Musical aptitude was defined by combination of the scores of three music tests (COMB scores): auditory structuring ability, Seashores test for pitch and for time. Data on creativity in music (herein composing, improvising and/or arranging music) was surveyed using a web-based questionnaire. Several CNVRs containing genes that affect neurodevelopment, learning and memory were detected. A deletion at 5q31.1 covering the protocadherin-α gene cluster (Pcdha 1-9) was found co-segregating with low music test scores (COMB) in both sample sets. Pcdha is involved in neural migration, differentiation and synaptogenesis. Creativity in music was found to co-segregate with a duplication covering glucose mutarotase gene (GALM) at 2p22. GALM has influence on serotonin release and membrane trafficking of the human serotonin transporter. Interestingly, genes related to serotonergic systems have been shown to associate not only with psychiatric disorders but also with creativity and music perception. Both, Pcdha and GALM, are related to the serotonergic systems influencing cognitive and motor functions, important for music perception and practice. Finally, a 1.3 Mb duplication was identified in a subject with low COMB scores in the region previously linked with absolute pitch (AP) at 8q24. No differences in the CNV burden was detected among the high/low music test scores or creative/non-creative groups. In summary, CNVs and genes found in this study are related to cognitive functions. Our result suggests new candidate genes for music

  14. Genome-wide copy number variation analysis in extended families and unrelated individuals characterized for musical aptitude and creativity in music.

    Science.gov (United States)

    Ukkola-Vuoti, Liisa; Kanduri, Chakravarthi; Oikkonen, Jaana; Buck, Gemma; Blancher, Christine; Raijas, Pirre; Karma, Kai; Lähdesmäki, Harri; Järvelä, Irma

    2013-01-01

    Music perception and practice represent complex cognitive functions of the human brain. Recently, evidence for the molecular genetic background of music related phenotypes has been obtained. In order to further elucidate the molecular background of musical phenotypes we analyzed genome wide copy number variations (CNVs) in five extended pedigrees and in 172 unrelated subjects characterized for musical aptitude and creative functions in music. Musical aptitude was defined by combination of the scores of three music tests (COMB scores): auditory structuring ability, Seashores test for pitch and for time. Data on creativity in music (herein composing, improvising and/or arranging music) was surveyed using a web-based questionnaire.Several CNVRs containing genes that affect neurodevelopment, learning and memory were detected. A deletion at 5q31.1 covering the protocadherin-α gene cluster (Pcdha 1-9) was found co-segregating with low music test scores (COMB) in both sample sets. Pcdha is involved in neural migration, differentiation and synaptogenesis. Creativity in music was found to co-segregate with a duplication covering glucose mutarotase gene (GALM) at 2p22. GALM has influence on serotonin release and membrane trafficking of the human serotonin transporter. Interestingly, genes related to serotonergic systems have been shown to associate not only with psychiatric disorders but also with creativity and music perception. Both, Pcdha and GALM, are related to the serotonergic systems influencing cognitive and motor functions, important for music perception and practice. Finally, a 1.3 Mb duplication was identified in a subject with low COMB scores in the region previously linked with absolute pitch (AP) at 8q24. No differences in the CNV burden was detected among the high/low music test scores or creative/non-creative groups. In summary, CNVs and genes found in this study are related to cognitive functions. Our result suggests new candidate genes for music perception

  15. Genome-wide copy number variation analysis in extended families and unrelated individuals characterized for musical aptitude and creativity in music.

    Directory of Open Access Journals (Sweden)

    Liisa Ukkola-Vuoti

    Full Text Available Music perception and practice represent complex cognitive functions of the human brain. Recently, evidence for the molecular genetic background of music related phenotypes has been obtained. In order to further elucidate the molecular background of musical phenotypes we analyzed genome wide copy number variations (CNVs in five extended pedigrees and in 172 unrelated subjects characterized for musical aptitude and creative functions in music. Musical aptitude was defined by combination of the scores of three music tests (COMB scores: auditory structuring ability, Seashores test for pitch and for time. Data on creativity in music (herein composing, improvising and/or arranging music was surveyed using a web-based questionnaire.Several CNVRs containing genes that affect neurodevelopment, learning and memory were detected. A deletion at 5q31.1 covering the protocadherin-α gene cluster (Pcdha 1-9 was found co-segregating with low music test scores (COMB in both sample sets. Pcdha is involved in neural migration, differentiation and synaptogenesis. Creativity in music was found to co-segregate with a duplication covering glucose mutarotase gene (GALM at 2p22. GALM has influence on serotonin release and membrane trafficking of the human serotonin transporter. Interestingly, genes related to serotonergic systems have been shown to associate not only with psychiatric disorders but also with creativity and music perception. Both, Pcdha and GALM, are related to the serotonergic systems influencing cognitive and motor functions, important for music perception and practice. Finally, a 1.3 Mb duplication was identified in a subject with low COMB scores in the region previously linked with absolute pitch (AP at 8q24. No differences in the CNV burden was detected among the high/low music test scores or creative/non-creative groups. In summary, CNVs and genes found in this study are related to cognitive functions. Our result suggests new candidate genes for

  16. Deterministic Mean-Field Ensemble Kalman Filtering

    KAUST Repository

    Law, Kody J. H.

    2016-05-03

    The proof of convergence of the standard ensemble Kalman filter (EnKF) from Le Gland, Monbet, and Tran [Large sample asymptotics for the ensemble Kalman filter, in The Oxford Handbook of Nonlinear Filtering, Oxford University Press, Oxford, UK, 2011, pp. 598--631] is extended to non-Gaussian state-space models. A density-based deterministic approximation of the mean-field limit EnKF (DMFEnKF) is proposed, consisting of a PDE solver and a quadrature rule. Given a certain minimal order of convergence k between the two, this extends to the deterministic filter approximation, which is therefore asymptotically superior to standard EnKF for dimension d<2k. The fidelity of approximation of the true distribution is also established using an extension of the total variation metric to random measures. This is limited by a Gaussian bias term arising from nonlinearity/non-Gaussianity of the model, which arises in both deterministic and standard EnKF. Numerical results support and extend the theory.

  17. Botnet analysis using ensemble classifier

    Directory of Open Access Journals (Sweden)

    Anchit Bijalwan

    2016-09-01

    Full Text Available This paper analyses the botnet traffic using Ensemble of classifier algorithm to find out bot evidence. We used ISCX dataset for training and testing purpose. We extracted the features of both training and testing datasets. After extracting the features of this dataset, we bifurcated these features into two classes, normal traffic and botnet traffic and provide labelling. Thereafter using modern data mining tool, we have applied ensemble of classifier algorithm. Our experimental results show that the performance for finding bot evidence using ensemble of classifiers is better than single classifier. Ensemble based classifiers perform better than single classifier by either combining powers of multiple algorithms or introducing diversification to the same classifier by varying input in bot analysis. Our results are showing that by using voting method of ensemble based classifier accuracy is increased up to 96.41% from 93.37%.

  18. On Ensemble Nonlinear Kalman Filtering with Symmetric Analysis Ensembles

    KAUST Repository

    Luo, Xiaodong

    2010-09-19

    The ensemble square root filter (EnSRF) [1, 2, 3, 4] is a popular method for data assimilation in high dimensional systems (e.g., geophysics models). Essentially the EnSRF is a Monte Carlo implementation of the conventional Kalman filter (KF) [5, 6]. It is mainly different from the KF at the prediction steps, where it is some ensembles, rather then the means and covariance matrices, of the system state that are propagated forward. In doing this, the EnSRF is computationally more efficient than the KF, since propagating a covariance matrix forward in high dimensional systems is prohibitively expensive. In addition, the EnSRF is also very convenient in implementation. By propagating the ensembles of the system state, the EnSRF can be directly applied to nonlinear systems without any change in comparison to the assimilation procedures in linear systems. However, by adopting the Monte Carlo method, the EnSRF also incurs certain sampling errors. One way to alleviate this problem is to introduce certain symmetry to the ensembles, which can reduce the sampling errors and spurious modes in evaluation of the means and covariances of the ensembles [7]. In this contribution, we present two methods to produce symmetric ensembles. One is based on the unscented transform [8, 9], which leads to the unscented Kalman filter (UKF) [8, 9] and its variant, the ensemble unscented Kalman filter (EnUKF) [7]. The other is based on Stirling’s interpolation formula (SIF), which results in the divided difference filter (DDF) [10]. Here we propose a simplified divided difference filter (sDDF) in the context of ensemble filtering. The similarity and difference between the sDDF and the EnUKF will be discussed. Numerical experiments will also be conducted to investigate the performance of the sDDF and the EnUKF, and compare them to a well‐established EnSRF, the ensemble transform Kalman filter (ETKF) [2].

  19. Flood Forecasting Based on TIGGE Precipitation Ensemble Forecast

    Directory of Open Access Journals (Sweden)

    Jinyin Ye

    2016-01-01

    Full Text Available TIGGE (THORPEX International Grand Global Ensemble was a major part of the THORPEX (Observing System Research and Predictability Experiment. It integrates ensemble precipitation products from all the major forecast centers in the world and provides systematic evaluation on the multimodel ensemble prediction system. Development of meteorologic-hydrologic coupled flood forecasting model and early warning model based on the TIGGE precipitation ensemble forecast can provide flood probability forecast, extend the lead time of the flood forecast, and gain more time for decision-makers to make the right decision. In this study, precipitation ensemble forecast products from ECMWF, NCEP, and CMA are used to drive distributed hydrologic model TOPX. We focus on Yi River catchment and aim to build a flood forecast and early warning system. The results show that the meteorologic-hydrologic coupled model can satisfactorily predict the flow-process of four flood events. The predicted occurrence time of peak discharges is close to the observations. However, the magnitude of the peak discharges is significantly different due to various performances of the ensemble prediction systems. The coupled forecasting model can accurately predict occurrence of the peak time and the corresponding risk probability of peak discharge based on the probability distribution of peak time and flood warning, which can provide users a strong theoretical foundation and valuable information as a promising new approach.

  20. Gene finding in the chicken genome

    Directory of Open Access Journals (Sweden)

    Antonarakis Stylianos E

    2005-05-01

    Full Text Available Abstract Background Despite the continuous production of genome sequence for a number of organisms, reliable, comprehensive, and cost effective gene prediction remains problematic. This is particularly true for genomes for which there is not a large collection of known gene sequences, such as the recently published chicken genome. We used the chicken sequence to test comparative and homology-based gene-finding methods followed by experimental validation as an effective genome annotation method. Results We performed experimental evaluation by RT-PCR of three different computational gene finders, Ensembl, SGP2 and TWINSCAN, applied to the chicken genome. A Venn diagram was computed and each component of it was evaluated. The results showed that de novo comparative methods can identify up to about 700 chicken genes with no previous evidence of expression, and can correctly extend about 40% of homology-based predictions at the 5' end. Conclusions De novo comparative gene prediction followed by experimental verification is effective at enhancing the annotation of the newly sequenced genomes provided by standard homology-based methods.

  1. The function genomics study

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    @@ Genomics is a biology term appeared ten years ago, used to describe the researches of genomic mapping, sequencing, and structure analysis, etc. Genomics, the first journal for publishing papers on genomics research was born in 1986. In the past decade, the concept of genomics has been widely accepted by scientists who are engaging in biology research. Meanwhile, the research scope of genomics has been extended continuously, from simple gene mapping and sequencing to function genomics study. To reflect the change, genomics is divided into two parts now, the structure genomics and the function genomics.

  2. Disease-associated mutations that alter the RNA structural ensemble.

    Directory of Open Access Journals (Sweden)

    Matthew Halvorsen

    2010-08-01

    Full Text Available Genome-wide association studies (GWAS often identify disease-associated mutations in intergenic and non-coding regions of the genome. Given the high percentage of the human genome that is transcribed, we postulate that for some observed associations the disease phenotype is caused by a structural rearrangement in a regulatory region of the RNA transcript. To identify such mutations, we have performed a genome-wide analysis of all known disease-associated Single Nucleotide Polymorphisms (SNPs from the Human Gene Mutation Database (HGMD that map to the untranslated regions (UTRs of a gene. Rather than using minimum free energy approaches (e.g. mFold, we use a partition function calculation that takes into consideration the ensemble of possible RNA conformations for a given sequence. We identified in the human genome disease-associated SNPs that significantly alter the global conformation of the UTR to which they map. For six disease-states (Hyperferritinemia Cataract Syndrome, beta-Thalassemia, Cartilage-Hair Hypoplasia, Retinoblastoma, Chronic Obstructive Pulmonary Disease (COPD, and Hypertension, we identified multiple SNPs in UTRs that alter the mRNA structural ensemble of the associated genes. Using a Boltzmann sampling procedure for sub-optimal RNA structures, we are able to characterize and visualize the nature of the conformational changes induced by the disease-associated mutations in the structural ensemble. We observe in several cases (specifically the 5' UTRs of FTL and RB1 SNP-induced conformational changes analogous to those observed in bacterial regulatory Riboswitches when specific ligands bind. We propose that the UTR and SNP combinations we identify constitute a "RiboSNitch," that is a regulatory RNA in which a specific SNP has a structural consequence that results in a disease phenotype. Our SNPfold algorithm can help identify RiboSNitches by leveraging GWAS data and an analysis of the mRNA structural ensemble.

  3. Ensemble manifold regularization.

    Science.gov (United States)

    Geng, Bo; Tao, Dacheng; Xu, Chao; Yang, Linjun; Hua, Xian-Sheng

    2012-06-01

    We propose an automatic approximation of the intrinsic manifold for general semi-supervised learning (SSL) problems. Unfortunately, it is not trivial to define an optimization function to obtain optimal hyperparameters. Usually, cross validation is applied, but it does not necessarily scale up. Other problems derive from the suboptimality incurred by discrete grid search and the overfitting. Therefore, we develop an ensemble manifold regularization (EMR) framework to approximate the intrinsic manifold by combining several initial guesses. Algorithmically, we designed EMR carefully so it 1) learns both the composite manifold and the semi-supervised learner jointly, 2) is fully automatic for learning the intrinsic manifold hyperparameters implicitly, 3) is conditionally optimal for intrinsic manifold approximation under a mild and reasonable assumption, and 4) is scalable for a large number of candidate manifold hyperparameters, from both time and space perspectives. Furthermore, we prove the convergence property of EMR to the deterministic matrix at rate root-n. Extensive experiments over both synthetic and real data sets demonstrate the effectiveness of the proposed framework.

  4. Diurnal Ensemble Surface Meteorology Statistics

    Data.gov (United States)

    U.S. Environmental Protection Agency — Excel file containing diurnal ensemble statistics of 2-m temperature, 2-m mixing ratio and 10-m wind speed. This Excel file contains figures for Figure 2 in the...

  5. PSO-Ensemble Demo Application

    DEFF Research Database (Denmark)

    2004-01-01

    Within the framework of the PSO-Ensemble project (FU2101) a demo application has been created. The application use ECMWF ensemble forecasts. Two instances of the application are running; one for Nysted Offshore and one for the total production (except Horns Rev) in the Eltra area. The output...... is available via two password-protected web-pages hosted at IMM and is used daily by Elsam and E2....

  6. Similarity measures for protein ensembles

    DEFF Research Database (Denmark)

    Lindorff-Larsen, Kresten; Ferkinghoff-Borg, Jesper

    2009-01-01

    Analyses of similarities and changes in protein conformation can provide important information regarding protein function and evolution. Many scores, including the commonly used root mean square deviation, have therefore been developed to quantify the similarities of different protein conformations...... a synthetic example from molecular dynamics simulations. We then apply the algorithms to revisit the problem of ensemble averaging during structure determination of proteins, and find that an ensemble refinement method is able to recover the correct distribution of conformations better than standard single...

  7. PSO-Ensemble Demo Application

    DEFF Research Database (Denmark)

    2004-01-01

    Within the framework of the PSO-Ensemble project (FU2101) a demo application has been created. The application use ECMWF ensemble forecasts. Two instances of the application are running; one for Nysted Offshore and one for the total production (except Horns Rev) in the Eltra area. The output is a...... is available via two password-protected web-pages hosted at IMM and is used daily by Elsam and E2....

  8. Seasonal hydrological ensemble forecasts over Europe

    Science.gov (United States)

    Arnal, Louise; Wetterhall, Fredrik; Pappenberger, Florian

    2015-04-01

    Seasonal forecasts have an important socio-economic value in hydro-meteorological forecasting. The applications are for example hydropower management, spring flood prediction and water resources management. The latter includes prediction of low flows, primordial for navigation, water quality assessment, droughts and agricultural water needs. Traditionally, seasonal hydrological forecasts are done using the observed discharge from previous years, so called Ensemble Streamflow Prediction (ESP). With the recent increasing development of seasonal meteorological forecasts, the incentive for developing and improving seasonal hydrological forecasts is great. In this study, a seasonal hydrological forecast, driven by the ECMWF's System 4 (SEA), was compared with an ESP of modelled discharge using observations. The hydrological model used for both forecasts was the LISFLOOD model, run over a European domain with a spatial resolution of 5 km. The forecasts were produced from 1990 until the present time, with a daily time step. They were issued once a month with a lead time of seven months. The SEA forecasts are constituted of 15 ensemble members, extended to 51 members every three months. The ESP forecasts comprise 20 ensembles and served as a benchmark for this comparative study. The forecast systems were compared using a diverse set of verification metrics, such as continuous ranked probability scores, ROC curves, anomaly correlation coefficients and Nash-Sutcliffe efficiency coefficients. These metrics were computed over several time-scales, ranging from a weekly to a six-months basis, for each season. The evaluation enabled the investigation of several aspects of seasonal forecasting, such as limits of predictability, timing of high and low flows, as well as exceedance of percentiles. The analysis aimed at exploring the spatial distribution and timely evolution of the limits of predictability.

  9. Evaluating the trade-offs between ensemble size and ensemble resolution in an ensemble-variational data assimilation system

    Science.gov (United States)

    Lei, Lili; Whitaker, Jeffrey S.

    2017-06-01

    The current NCEP operational four-dimensional ensemble-variational data assimilation system uses a control forecast at T1534 resolution coupled with an 80 member ensemble at T574 resolution. Given an increase in computing resources, and assuming the control forecast resolution is fixed, would it be better to increase the ensemble size and keep the ensemble resolution the same, or increase the ensemble resolution and keep the ensemble size the same? To answer this question, experiments are conducted at reduced resolutions. Two sets of experiments are conducted which both use approximately four times more computational resources than the control experiment that uses a control forecast at T670 and an 80 member ensemble at T254. One increases the ensemble size to 320 but keeps the ensemble resolution at T254; and the other increases the ensemble resolution to T670 but retains an 80 ensemble size. When ensemble size increases to 320, turning off the static component of the background-error covariance does not degrade performance. When the data assimilation parameters are tuned for optimal performance, increasing either ensemble size or ensemble resolution can improve the forecast performance. Increasing ensemble resolution is slightly, but significantly better than increasing ensemble size for these experiments, particularly when considering errors at smaller scales. Much of the benefit of increasing ensemble resolution comes about by eliminating the need for a deterministic control forecast and running all of the background forecasts at the same resolution. In this "single-resolution" mode, the control forecast is replaced by an ensemble average, which reduces small-scale errors significantly.

  10. Algorithms on ensemble quantum computers.

    Science.gov (United States)

    Boykin, P Oscar; Mor, Tal; Roychowdhury, Vwani; Vatan, Farrokh

    2010-06-01

    In ensemble (or bulk) quantum computation, all computations are performed on an ensemble of computers rather than on a single computer. Measurements of qubits in an individual computer cannot be performed; instead, only expectation values (over the complete ensemble of computers) can be measured. As a result of this limitation on the model of computation, many algorithms cannot be processed directly on such computers, and must be modified, as the common strategy of delaying the measurements usually does not resolve this ensemble-measurement problem. Here we present several new strategies for resolving this problem. Based on these strategies we provide new versions of some of the most important quantum algorithms, versions that are suitable for implementing on ensemble quantum computers, e.g., on liquid NMR quantum computers. These algorithms are Shor's factorization algorithm, Grover's search algorithm (with several marked items), and an algorithm for quantum fault-tolerant computation. The first two algorithms are simply modified using a randomizing and a sorting strategies. For the last algorithm, we develop a classical-quantum hybrid strategy for removing measurements. We use it to present a novel quantum fault-tolerant scheme. More explicitly, we present schemes for fault-tolerant measurement-free implementation of Toffoli and σ(z)(¼) as these operations cannot be implemented "bitwise", and their standard fault-tolerant implementations require measurement.

  11. CME Ensemble Forecasting - A Primer

    Science.gov (United States)

    Pizzo, V. J.; de Koning, C. A.; Cash, M. D.; Millward, G. H.; Biesecker, D. A.; Codrescu, M.; Puga, L.; Odstrcil, D.

    2014-12-01

    SWPC has been evaluating various approaches for ensemble forecasting of Earth-directed CMEs. We have developed the software infrastructure needed to support broad-ranging CME ensemble modeling, including composing, interpreting, and making intelligent use of ensemble simulations. The first step is to determine whether the physics of the interplanetary propagation of CMEs is better described as chaotic (like terrestrial weather) or deterministic (as in tsunami propagation). This is important, since different ensemble strategies are to be pursued under the two scenarios. We present the findings of a comprehensive study of CME ensembles in uniform and structured backgrounds that reveals systematic relationships between input cone parameters and ambient flow states and resulting transit times and velocity/density amplitudes at Earth. These results clearly indicate that the propagation of single CMEs to 1 AU is a deterministic process. Thus, the accuracy with which one can forecast the gross properties (such as arrival time) of CMEs at 1 AU is determined primarily by the accuracy of the inputs. This is no tautology - it means specifically that efforts to improve forecast accuracy should focus upon obtaining better inputs, as opposed to developing better propagation models. In a companion paper (deKoning et al., this conference), we compare in situ solar wind data with forecast events in the SWPC operational archive to show how the qualitative and quantitative findings presented here are entirely consistent with the observations and may lead to improved forecasts of arrival time at Earth.

  12. Linking neuronal ensembles by associative synaptic plasticity.

    Directory of Open Access Journals (Sweden)

    Qi Yuan

    Full Text Available Synchronized activity in ensembles of neurons recruited by excitatory afferents is thought to contribute to the coding information in the brain. However, the mechanisms by which neuronal ensembles are generated and modified are not known. Here we show that in rat hippocampal slices associative synaptic plasticity enables ensembles of neurons to change by incorporating neurons belonging to different ensembles. Associative synaptic plasticity redistributes the composition of different ensembles recruited by distinct inputs such as to specifically increase the similarity between the ensembles. These results show that in the hippocampus, the ensemble of neurons recruited by a given afferent projection is fluid and can be rapidly and persistently modified to specifically include neurons from different ensembles. This linking of ensembles may contribute to the formation of associative memories.

  13. A mollified Ensemble Kalman filter

    CERN Document Server

    Bergemann, Kay

    2010-01-01

    It is well recognized that discontinuous analysis increments of sequential data assimilation systems, such as ensemble Kalman filters, might lead to spurious high frequency adjustment processes in the model dynamics. Various methods have been devised to continuously spread out the analysis increments over a fixed time interval centered about analysis time. Among these techniques are nudging and incremental analysis updates (IAU). Here we propose another alternative, which may be viewed as a hybrid of nudging and IAU and which arises naturally from a recently proposed continuous formulation of the ensemble Kalman analysis step. A new slow-fast extension of the popular Lorenz-96 model is introduced to demonstrate the properties of the proposed mollified ensemble Kalman filter.

  14. Excitation energies from ensemble DFT

    Science.gov (United States)

    Borgoo, Alex; Teale, Andy M.; Helgaker, Trygve

    2015-12-01

    We study the evaluation of the Gross-Oliveira-Kohn expression for excitation energies E1-E0=ɛ1-ɛ0+∂E/xc,w[ρ] ∂w | ρ =ρ0. This expression gives the difference between an excitation energy E1 - E0 and the corresponding Kohn-Sham orbital energy difference ɛ1 - ɛ0 as a partial derivative of the exchange-correlation energy of an ensemble of states Exc,w[ρ]. Through Lieb maximisation, on input full-CI density functions, the exchange-correlation energy is evaluated accurately and the partial derivative is evaluated numerically using finite difference. The equality is studied numerically for different geometries of the H2 molecule and different ensemble weights. We explore the adiabatic connection for the ensemble exchange-correlation energy. The latter may prove useful when modelling the unknown weight dependence of the exchange-correlation energy.

  15. Predicting protein dynamics from structural ensembles

    CERN Document Server

    Copperman, J

    2015-01-01

    The biological properties of proteins are uniquely determined by their structure and dynamics. A protein in solution populates a structural ensemble of metastable configurations around the global fold. From overall rotation to local fluctuations, the dynamics of proteins can cover several orders of magnitude in time scales. We propose a simulation-free coarse-grained approach which utilizes knowledge of the important metastable folded states of the protein to predict the protein dynamics. This approach is based upon the Langevin Equation for Protein Dynamics (LE4PD), a Langevin formalism in the coordinates of the protein backbone. The linear modes of this Langevin formalism organize the fluctuations of the protein, so that more extended dynamical cooperativity relates to increasing energy barriers to mode diffusion. The accuracy of the LE4PD is verified by analyzing the predicted dynamics across a set of seven different proteins for which both relaxation data and NMR solution structures are available. Using e...

  16. The Partition Ensemble Fallacy Fallacy

    CERN Document Server

    Nemoto, K; Nemoto, Kae; Braunstein, Samuel L.

    2002-01-01

    The Partition Ensemble Fallacy was recently applied to claim no quantum coherence exists in coherent states produced by lasers. We show that this claim relies on an untestable belief of a particular prior distribution of absolute phase. One's choice for the prior distribution for an unobservable quantity is a matter of `religion'. We call this principle the Partition Ensemble Fallacy Fallacy. Further, we show an alternative approach to construct a relative-quantity Hilbert subspace where unobservability of certain quantities is guaranteed by global conservation laws. This approach is applied to coherent states and constructs an approximate relative-phase Hilbert subspace.

  17. Thermalization and pseudolocality in extended quantum systems

    CERN Document Server

    Doyon, Benjamin

    2015-01-01

    Recently, it was understood that extended concepts of locality played important roles in the study of extended quantum systems out of equilibrium, in particular in so-called generalized Gibbs ensembles. In this paper, we rigorously study pseudolocal charges and their involvement in time evolutions and in the thermalization process of arbitrary states with strong enough clustering properties. We show that the densities of pseudolocal charges form a Hilbert space, with inner product determined by response functions. Using this, we define the family of pseudolocal states: clustering states connected to the infinite-temperature state by paths whose tangents are actions of pseudolocal charges. This family includes thermal Gibbs states, as well as (a precise definition of) generalized Gibbs ensembles. We prove that the family of pseudolocal states is preserved by finite time evolution, and that, under certain conditions, the stationary state emerging at infinite time is a generalized Gibbs ensemble with respect to ...

  18. Hierarchical Bayes Ensemble Kalman Filter for geophysical data assimilation

    Science.gov (United States)

    Tsyrulnikov, Michael; Rakitko, Alexander

    2016-04-01

    In the Ensemble Kalman Filter (EnKF), the forecast error covariance matrix B is estimated from a sample (ensemble), which inevitably implies a degree of uncertainty. This uncertainty is especially large in high dimensions, where the affordable ensemble size is orders of magnitude less than the dimensionality of the system. Common remedies include ad-hoc devices like variance inflation and covariance localization. The goal of this study is to optimize the account for the inherent uncertainty of the B matrix in EnKF. Following the idea by Myrseth and Omre (2010), we explicitly admit that the B matrix is unknown and random and estimate it along with the state (x) in an optimal hierarchical Bayes analysis scheme. We separate forecast errors into predictability errors (i.e. forecast errors due to uncertainties in the initial data) and model errors (forecast errors due to imperfections in the forecast model) and include the two respective components P and Q of the B matrix into the extended control vector (x,P,Q). Similarly, we break the traditional forecast ensemble into the predictability-error related ensemble and model-error related ensemble. The reason for the separation of model errors from predictability errors is the fundamental difference between the two sources of error. Model error are external (i.e. do not depend on the filter's performance) whereas predictability errors are internal to a filter (i.e. are determined by the filter's behavior). At the analysis step, we specify Inverse Wishart based priors for the random matrices P and Q and conditionally Gaussian prior for the state x. Then, we update the prior distribution of (x,P,Q) using both observation and ensemble data, so that ensemble members are used as generalized observations and ordinary observations are allowed to influence the covariances. We show that for linear dynamics and linear observation operators, conditional Gaussianity of the state is preserved in the course of filtering. At the forecast

  19. Multimodel ensembles of wheat growth

    DEFF Research Database (Denmark)

    Martre, Pierre; Wallach, Daniel; Asseng, Senthold

    2015-01-01

    , but such studies are difficult to organize and have only recently begun. We report on the largest ensemble study to date, of 27 wheat models tested in four contrasting locations for their accuracy in simulating multiple crop growth and yield variables. The relative error averaged over models was 24...

  20. Turbo Codes Extended with Outer BCH Code

    DEFF Research Database (Denmark)

    Andersen, Jakob Dahl

    1996-01-01

    The "error floor" observed in several simulations with the turbo codes is verified by calculation of an upper bound to the bit error rate for the ensemble of all interleavers. Also an easy way to calculate the weight enumerator used in this bound is presented. An extended coding scheme is proposed...

  1. Global Ensemble Forecast System (GEFS) [1 Deg.

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The Global Ensemble Forecast System (GEFS) is a weather forecast model made up of 21 separate forecasts, or ensemble members. The National Centers for Environmental...

  2. Characterizing and visualizing predictive uncertainty in numerical ensembles through Bayesian model averaging.

    Science.gov (United States)

    Gosink, Luke; Bensema, Kevin; Pulsipher, Trenton; Obermaier, Harald; Henry, Michael; Childs, Hank; Joy, Kenneth I

    2013-12-01

    Numerical ensemble forecasting is a powerful tool that drives many risk analysis efforts and decision making tasks. These ensembles are composed of individual simulations that each uniquely model a possible outcome for a common event of interest: e.g., the direction and force of a hurricane, or the path of travel and mortality rate of a pandemic. This paper presents a new visual strategy to help quantify and characterize a numerical ensemble's predictive uncertainty: i.e., the ability for ensemble constituents to accurately and consistently predict an event of interest based on ground truth observations. Our strategy employs a Bayesian framework to first construct a statistical aggregate from the ensemble. We extend the information obtained from the aggregate with a visualization strategy that characterizes predictive uncertainty at two levels: at a global level, which assesses the ensemble as a whole, as well as a local level, which examines each of the ensemble's constituents. Through this approach, modelers are able to better assess the predictive strengths and weaknesses of the ensemble as a whole, as well as individual models. We apply our method to two datasets to demonstrate its broad applicability.

  3. Squeezing of Collective Excitations in Spin Ensembles

    DEFF Research Database (Denmark)

    Kraglund Andersen, Christian; Mølmer, Klaus

    2012-01-01

    We analyse the possibility to create two-mode spin squeezed states of two separate spin ensembles by inverting the spins in one ensemble and allowing spin exchange between the ensembles via a near resonant cavity field. We investigate the dynamics of the system using a combination of numerical an...

  4. Optimizing Hybrid de Novo Transcriptome Assembly and Extending Genomic Resources for Giant Freshwater Prawns (Macrobrachium rosenbergii): The Identification of Genes and Markers Associated with Reproduction.

    Science.gov (United States)

    Jung, Hyungtaek; Yoon, Byung-Ha; Kim, Woo-Jin; Kim, Dong-Wook; Hurwood, David A; Lyons, Russell E; Salin, Krishna R; Kim, Heui-Soo; Baek, Ilseon; Chand, Vincent; Mather, Peter B

    2016-05-07

    The giant freshwater prawn, Macrobrachium rosenbergii, a sexually dimorphic decapod crustacean is currently the world's most economically important cultured freshwater crustacean species. Despite its economic importance, there is currently a lack of genomic resources available for this species, and this has limited exploration of the molecular mechanisms that control the M. rosenbergii sex-differentiation system more widely in freshwater prawns. Here, we present the first hybrid transcriptome from M. rosenbergii applying RNA-Seq technologies directed at identifying genes that have potential functional roles in reproductive-related traits. A total of 13,733,210 combined raw reads (1720 Mbp) were obtained from Ion-Torrent PGM and 454 FLX. Bioinformatic analyses based on three state-of-the-art assemblers, the CLC Genomic Workbench, Trans-ABySS, and Trinity, that use single and multiple k-mer methods respectively, were used to analyse the data. The influence of multiple k-mers on assembly performance was assessed to gain insight into transcriptome assembly from short reads. After optimisation, de novo assembly resulted in 44,407 contigs with a mean length of 437 bp, and the assembled transcripts were further functionally annotated to detect single nucleotide polymorphisms and simple sequence repeat motifs. Gene expression analysis was also used to compare expression patterns from ovary and testis tissue libraries to identify genes with potential roles in reproduction and sex differentiation. The large transcript set assembled here represents the most comprehensive set of transcriptomic resources ever developed for reproduction traits in M. rosenbergii, and the large number of genetic markers predicted should constitute an invaluable resource for future genetic research studies on M. rosenbergii and can be applied more widely on other freshwater prawn species in the genus Macrobrachium.

  5. Optimizing Hybrid de Novo Transcriptome Assembly and Extending Genomic Resources for Giant Freshwater Prawns (Macrobrachium rosenbergii: The Identification of Genes and Markers Associated with Reproduction

    Directory of Open Access Journals (Sweden)

    Hyungtaek Jung

    2016-05-01

    Full Text Available The giant freshwater prawn, Macrobrachium rosenbergii, a sexually dimorphic decapod crustacean is currently the world’s most economically important cultured freshwater crustacean species. Despite its economic importance, there is currently a lack of genomic resources available for this species, and this has limited exploration of the molecular mechanisms that control the M. rosenbergii sex-differentiation system more widely in freshwater prawns. Here, we present the first hybrid transcriptome from M. rosenbergii applying RNA-Seq technologies directed at identifying genes that have potential functional roles in reproductive-related traits. A total of 13,733,210 combined raw reads (1720 Mbp were obtained from Ion-Torrent PGM and 454 FLX. Bioinformatic analyses based on three state-of-the-art assemblers, the CLC Genomic Workbench, Trans-ABySS, and Trinity, that use single and multiple k-mer methods respectively, were used to analyse the data. The influence of multiple k-mers on assembly performance was assessed to gain insight into transcriptome assembly from short reads. After optimisation, de novo assembly resulted in 44,407 contigs with a mean length of 437 bp, and the assembled transcripts were further functionally annotated to detect single nucleotide polymorphisms and simple sequence repeat motifs. Gene expression analysis was also used to compare expression patterns from ovary and testis tissue libraries to identify genes with potential roles in reproduction and sex differentiation. The large transcript set assembled here represents the most comprehensive set of transcriptomic resources ever developed for reproduction traits in M. rosenbergii, and the large number of genetic markers predicted should constitute an invaluable resource for future genetic research studies on M. rosenbergii and can be applied more widely on other freshwater prawn species in the genus Macrobrachium.

  6. Classical and Quantum Ensembles via Multiresolution. II. Wigner Ensembles

    OpenAIRE

    2004-01-01

    We present the application of the variational-wavelet analysis to the analysis of quantum ensembles in Wigner framework. (Naive) deformation quantization, the multiresolution representations and the variational approach are the key points. We construct the solutions of Wigner-like equations via the multiscale expansions in the generalized coherent states or high-localized nonlinear eigenmodes in the base of the compactly supported wavelets and the wavelet packets. We demonstrate the appearanc...

  7. Hydrological Ensemble Prediction System (HEPS)

    Science.gov (United States)

    Thielen-Del Pozo, J.; Schaake, J.; Martin, E.; Pailleux, J.; Pappenberger, F.

    2010-09-01

    Flood forecasting systems form a key part of ‘preparedness' strategies for disastrous floods and provide hydrological services, civil protection authorities and the public with information of upcoming events. Provided the warning leadtime is sufficiently long, adequate preparatory actions can be taken to efficiently reduce the impacts of the flooding. Following on the success of the use of ensembles for weather forecasting, the hydrological community now moves increasingly towards Hydrological Ensemble Prediction Systems (HEPS) for improved flood forecasting using operationally available NWP products as inputs. However, these products are often generated on relatively coarse scales compared to hydrologically relevant basin units and suffer systematic biases that may have considerable impact when passed through the non-linear hydrological filters. Therefore, a better understanding on how best to produce, communicate and use hydrologic ensemble forecasts in hydrological short-, medium- und long term prediction of hydrological processes is necessary. The "Hydrologic Ensemble Prediction Experiment" (HEPEX), is an international initiative consisting of hydrologists, meteorologist and end-users to advance probabilistic hydrologic forecast techniques for flood, drought and water management applications. Different aspects of the hydrological ensemble processor are being addressed including • Production of useful meteorological products relevant for hydrological applications, ranging from nowcasting products to seasonal forecasts. The importance of hindcasts that are consistent with the operational weather forecasts will be discussed to support bias correction and downscaling, statistically meaningful verification of HEPS, and the development and testing of operating rules; • Need for downscaling and post-processing of weather ensembles to reduce bias before entering hydrological applications; • Hydrological model and parameter uncertainty and how to correct and

  8. Scale-free brain ensemble modulated by phase synchronization

    Institute of Scientific and Technical Information of China (English)

    Dan WU; Chao-yi LI; Jie LIU; Jing LU; De-zhong YAO

    2014-01-01

    To listen to brain activity as a piece of music, we proposed the scale-free brainwave music (SFBM) technology, which could translate the scalp electroencephalogram (EEG) into music notes according to the power law of both EEG and music. In the current study, this methodology was further extended to a musical ensemble of two channels. First, EEG data from two selected channels are translated into musical instrument digital interface (MIDI) sequences, where the EEG parameters modulate the pitch, duration, and volume of each musical note. The phase synchronization index of the two channels is computed by a Hilbert transform. Then the two MIDI sequences are integrated into a chorus according to the phase synchronization index. The EEG with a high synchronization index is represented by more consonant musical intervals, while the low index is expressed by inconsonant musical intervals. The brain ensemble derived from real EEG segments illustrates differences in harmony and pitch distribution during the eyes-closed and eyes-open states. Furthermore, the scale-free phenomena exist in the brainwave ensemble. Therefore, the scale-free brain ensemble modulated by phase synchronization is a new attempt to express the EEG through an auditory and musical way, and it can be used for EEG monitoring and bio-feedback.

  9. Ensemble prediction experiments using conditional nonlinear optimal perturbation

    Institute of Scientific and Technical Information of China (English)

    2009-01-01

    Two methods for initialization of ensemble forecasts are compared, namely, singular vector (SV) and conditional nonlinear optimal perturbation (CNOP). The comparison is done for forecast lengths of up to 10 days with a three-level quasi-geostrophic (QG) atmospheric model in a perfect model scenario. Ten cases are randomly selected from 1982/1983 winter to 1993/1994 winter (from December to the following February). Anomaly correlation coefficient (ACC) is adopted as a tool to measure the quality of the predicted ensembles on the Northern Hemisphere 500 hPa geopotential height. The results show that the forecast quality of ensemble samples in which the first SV is replaced by CNOP is higher than that of samples composed of only SVs in the medium range, based on the occurrence of weather re-gime transitions in Northern Hemisphere after about four days. Besides, the reliability of ensemble forecasts is evaluated by the Rank Histograms. The above conclusions confirm and extend those reached earlier by the authors, which stated that the introduction of CNOP improves the forecast skill under the condition that the analysis error belongs to a kind of fast-growing error by using a barotropic QG model.

  10. Ensemble prediction experiments using conditional nonlinear optimal perturbation

    Institute of Scientific and Technical Information of China (English)

    JIANG ZhiNa; MU Mu; WANG DongHai

    2009-01-01

    Two methods for initialization of ensemble forecasts are compared, namely, singular vector (SV) and conditional nonlinear optimal perturbation (CNOP). The comparison is done for forecast lengths of up to 10 days with a three-level quasi-geostrophic (QG) atmospheric model in a perfect model scenario. Ten cases are randomly selected from 1982/1983 winter to 1993/1994 winter (from 12 to the following February). Anomaly correlation coefficient (ACC) is adopted as a tool to measure the quality of the predicted ensembles on the Northern Hemisphere 500 hPa geopotential height. The results show that the forecast quality of ensemble samples in which the first SV is replaced by CNOP is higher than that of samples composed of only SVs in the medium range, based on the occurrence of weather re-gime transitions in Northern Hemisphere after about four days. Besides, the reliability of ensemble forecasts is evaluated by the Rank Histograms. The above conclusions confirm .and extend those reached earlier by the authors, which stated that the introduction of CNOP improves the forecast skill under the condition that the analysis error belongs to a kind of fast-growing error by using a barotropic QG model.

  11. Annotation-Based Whole Genomic Prediction and Selection

    DEFF Research Database (Denmark)

    Kadarmideen, Haja; Do, Duy Ngoc; Janss, Luc;

    in their contribution to estimated genomic variances and in prediction of genomic breeding values by applying SNP annotation approaches to feed efficiency. Ensembl Variant Predictor (EVP) and Pig QTL database were used as the source of genomic annotation for 60K chip. Genomic prediction was performed using the Bayes...... prove useful for less heritable traits such as diseases and fertility...

  12. Extended Emotions

    DEFF Research Database (Denmark)

    Krueger, Joel; Szanto, Thomas

    2016-01-01

    Until recently, philosophers and psychologists conceived of emotions as brain- and body-bound affairs. But researchers have started to challenge this internalist and individualist orthodoxy. A rapidly growing body of work suggests that some emotions incorporate external resources and thus extend...... beyond the neurophysiological confines of organisms; some even argue that emotions can be socially extended and shared by multiple agents. Call this the extended emotions thesis (ExE). In this article, we consider different ways of understanding ExE in philosophy, psychology, and the cognitive sciences....... First, we outline the background of the debate and discuss different argumentative strategies for ExE. In particular, we distinguish ExE from cognate but more moderate claims about the embodied and situated nature of cognition and emotion (Section 1). We then dwell upon two dimensions of ExE: emotions...

  13. Extended thermodynamics

    CERN Document Server

    Müller, Ingo

    1993-01-01

    Physicists firmly believe that the differential equations of nature should be hyperbolic so as to exclude action at a distance; yet the equations of irreversible thermodynamics - those of Navier-Stokes and Fourier - are parabolic. This incompatibility between the expectation of physicists and the classical laws of thermodynamics has prompted the formulation of extended thermodynamics. After describing the motifs and early evolution of this new branch of irreversible thermodynamics, the authors apply the theory to mon-atomic gases, mixtures of gases, relativistic gases, and "gases" of phonons and photons. The discussion brings into perspective the various phenomena called second sound, such as heat propagation, propagation of shear stress and concentration, and the second sound in liquid helium. The formal mathematical structure of extended thermodynamics is exposed and the theory is shown to be fully compatible with the kinetic theory of gases. The study closes with the testing of extended thermodynamics thro...

  14. Symanzik flow on HISQ ensembles

    CERN Document Server

    Bazavov, A; Brown, N; DeTar, C; Foley, J; Gottlieb, Steven; Heller, U M; Hetrick, J E; Laiho, J; Levkova, L; Oktay, M; Sugar, R L; Toussaint, D; Van de Water, R S; Zhou, R

    2013-01-01

    We report on a scale determination with gradient-flow techniques on the $N_f = 2 + 1 + 1$ HISQ ensembles generated by the MILC collaboration. The lattice scale $w_0/a$, originally proposed by the BMW collaboration, is computed using Symanzik flow at four lattice spacings ranging from 0.15 to 0.06 fm. With a Taylor series ansatz, the results are simultaneously extrapolated to the continuum and interpolated to physical quark masses. We give a preliminary determination of the scale $w_0$ in physical units, along with associated systematic errors, and compare with results from other groups. We also present a first estimate of autocorrelation lengths as a function of flowtime for these ensembles.

  15. Statistical Analysis of Protein Ensembles

    Science.gov (United States)

    Máté, Gabriell; Heermann, Dieter

    2014-04-01

    As 3D protein-configuration data is piling up, there is an ever-increasing need for well-defined, mathematically rigorous analysis approaches, especially that the vast majority of the currently available methods rely heavily on heuristics. We propose an analysis framework which stems from topology, the field of mathematics which studies properties preserved under continuous deformations. First, we calculate a barcode representation of the molecules employing computational topology algorithms. Bars in this barcode represent different topological features. Molecules are compared through their barcodes by statistically determining the difference in the set of their topological features. As a proof-of-principle application, we analyze a dataset compiled of ensembles of different proteins, obtained from the Ensemble Protein Database. We demonstrate that our approach correctly detects the different protein groupings.

  16. Statistical Analysis of Protein Ensembles

    Directory of Open Access Journals (Sweden)

    Gabriell eMáté

    2014-04-01

    Full Text Available As 3D protein-configuration data is piling up, there is an ever-increasing need for well-defined, mathematically rigorous analysis approaches, especially that the vast majority of the currently available methods rely heavily on heuristics. We propose an analysis framework which stems from topology, the field of mathematics which studies properties preserved under continuous deformations. First, we calculate a barcode representation of the molecules employing computational topology algorithms. Bars in this barcode represent different topological features. Molecules are compared through their barcodes by statistically determining the difference in the set of their topological features. As a proof-of-principle application, we analyze a dataset compiled of ensembles of different proteins, obtained from the Ensemble Protein Database. We demonstrate that our approach correctly detects the different protein groupings.

  17. Genomic and genotoxic responses to controlled weathered-oil exposures confirm and extend field studies on impacts of the Deepwater Horizon oil spill on native killifish.

    Directory of Open Access Journals (Sweden)

    Whitney Pilcher

    Full Text Available To understand the ecotoxicological impacts of the Deepwater Horizon oil spill, field studies provide a context for ecological realism but laboratory-based studies offer power for connecting biological effects with specific causes. As a complement to field studies, we characterized genome-wide gene expression responses of Gulf killifish (Fundulus grandis to oil-contaminated waters in controlled laboratory exposures. Transcriptional responses to the highest concentrations of oiled water in the laboratory were predictive of field-observed responses that coincided with the timing and location of major oiling. The transcriptional response to the low concentration (∼ 10-fold lower than the high concentration was distinct from the high concentration and was not predictive of major oiling in the field. The high concentration response was characterized by activation of the molecular signaling pathway that facilitates oil metabolism and oil toxicity. The high concentration also induced DNA damage. The low concentration invoked expression of genes that may support a compensatory response, including genes associated with regulation of transcription, cell cycle progression, RNA processing, DNA damage, and apoptosis. We conclude that the gene expression response detected in the field was a robust indicator of exposure to the toxic components of contaminating oil, that animals in the field were exposed to relatively high concentrations that are especially damaging to early life stages, and that such exposures can damage DNA.

  18. Mapping Transcription Factors on Extended DNA: A Single Molecule Approach

    Science.gov (United States)

    Ebenstein, Yuval; Gassman, Natalie; Weiss, Shimon

    The ability to determine the precise loci and distribution of nucleic acid binding proteins is instrumental to our detailed understanding of cellular processes such as transcription, replication, and chromatin reorganization. Traditional molecular biology approaches and above all Chromatin immunoprecipitation (ChIP) based methods have provided a wealth of information regarding protein-DNA interactions. Nevertheless, existing techniques can only provide average properties of these interactions, since they are based on the accumulation of data from numerous protein-DNA complexes analyzed at the ensemble level. We propose a single molecule approach for direct visualization of DNA binding proteins bound specifically to their recognition sites along a long stretch of DNA such as genomic DNA. Fluorescent Quantum dots are used to tag proteins bound to DNA, and the complex is deposited on a glass substrate by extending the DNA to a linear form. The sample is then imaged optically to determine the precise location of the protein binding site. The method is demonstrated by detecting individual, Quantum dot tagged T7-RNA polymerase enzymes on the bacteriophage T7 genomic DNA and assessing the relative occupancy of the different promoters.

  19. Comprehensive comparisons of the current human, mouse, and rat RefSeq, Ensembl, EST, and FANTOM3 datasets

    DEFF Research Database (Denmark)

    Nordström, Karl J V; Mirza, Majd A I; Larsson, Thomas P

    2006-01-01

    Our understanding of functional genetic elements in the genomes is continuously growing and new entries are entered in various databases on a regular basis. We have here merged the genetic elements in RefSeq, Ensembl, FANTOM3, HINV, and NCBI:s ESTdb using the genome assemblies in order to achieve...

  20. [Integration of different T-DNA structures of ACC oxidase gene into carnation genome extended cut flower vase-life differently].

    Science.gov (United States)

    Yu, Yi-Xun; Bao, Man-Zhu

    2004-09-01

    The cultivar 'Master' of carnation (Dianthus caryophyllus L.) was transformed with four T-DNA structures containing sense, antisense, sense direct repeat and antisense direct repeat gene of ACC oxidase mediated by Agrobacterium tumefaciens. Southern blotting detection showed that foreign gene was integrated into the carnation genome and 14 transgenic lines were obtained. The transgenic plants were transplanted to soil and grew normally in greenhouse. Of the 12 transgenic lines screened, the cut flower vase life of 8 transgenic lines is up to 11 days and the longest one is 12.8 days while the vase life of the control is 5.8 days under 25 degrees C. The vase life of 2 lines out of 3 with single sense ACO gene is same as that of the control, while the vase life of 3 lines out of 4 with single antisense ACO gene is prolonged. The vase life of cut flowers of 5 lines with direct repeat ACO genes is all prolonged by about 6 days, while the vase life of 3 out of 7 lines with single ACO gene is same as that of the control. During the senescence of cut flowers, the ethylene production of the most of the transgenic lines decreased significantly, and the production of ethylene is not detectable in lines T456, T556 and T575. The results of the research demonstrate that antisense foreign gene inhibits expression of endogenesis gene more significantly than sense one. Both sense direct repeat and antisense direct repeat foreign genes can suppress endogenous gene expression more significantly comparing to single foreign genes. The transgenic lines obtained from this research are useful to minimize carnation cut flower transportation and storage expenses.

  1. Conformational Ensemble of the Poliovirus 3CD Precursor Observed by MD Simulations and Confirmed by SAXS: A Strategy to Expand the Viral Proteome?

    Science.gov (United States)

    Moustafa, Ibrahim M; Gohara, David W; Uchida, Akira; Yennawar, Neela; Cameron, Craig E

    2015-11-23

    The genomes of RNA viruses are relatively small. To overcome the small-size limitation, RNA viruses assign distinct functions to the processed viral proteins and their precursors. This is exemplified by poliovirus 3CD protein. 3C protein is a protease and RNA-binding protein. 3D protein is an RNA-dependent RNA polymerase (RdRp). 3CD exhibits unique protease and RNA-binding activities relative to 3C and is devoid of RdRp activity. The origin of these differences is unclear, since crystal structure of 3CD revealed "beads-on-a-string" structure with no significant structural differences compared to the fully processed proteins. We performed molecular dynamics (MD) simulations on 3CD to investigate its conformational dynamics. A compact conformation of 3CD was observed that was substantially different from that shown crystallographically. This new conformation explained the unique properties of 3CD relative to the individual proteins. Interestingly, simulations of mutant 3CD showed altered interface. Additionally, accelerated MD simulations uncovered a conformational ensemble of 3CD. When we elucidated the 3CD conformations in solution using small-angle X-ray scattering (SAXS) experiments a range of conformations from extended to compact was revealed, validating the MD simulations. The existence of conformational ensemble of 3CD could be viewed as a way to expand the poliovirus proteome, an observation that may extend to other viruses.

  2. Ensemble models of neutrophil trafficking in severe sepsis.

    Directory of Open Access Journals (Sweden)

    Sang Ok Song

    Full Text Available A hallmark of severe sepsis is systemic inflammation which activates leukocytes and can result in their misdirection. This leads to both impaired migration to the locus of infection and increased infiltration into healthy tissues. In order to better understand the pathophysiologic mechanisms involved, we developed a coarse-grained phenomenological model of the acute inflammatory response in CLP (cecal ligation and puncture-induced sepsis in rats. This model incorporates distinct neutrophil kinetic responses to the inflammatory stimulus and the dynamic interactions between components of a compartmentalized inflammatory response. Ensembles of model parameter sets consistent with experimental observations were statistically generated using a Markov-Chain Monte Carlo sampling. Prediction uncertainty in the model states was quantified over the resulting ensemble parameter sets. Forward simulation of the parameter ensembles successfully captured experimental features and predicted that systemically activated circulating neutrophils display impaired migration to the tissue and neutrophil sequestration in the lung, consequently contributing to tissue damage and mortality. Principal component and multiple regression analyses of the parameter ensembles estimated from survivor and non-survivor cohorts provide insight into pathologic mechanisms dictating outcome in sepsis. Furthermore, the model was extended to incorporate hypothetical mechanisms by which immune modulation using extracorporeal blood purification results in improved outcome in septic rats. Simulations identified a sub-population (about 18% of the treated population that benefited from blood purification. Survivors displayed enhanced neutrophil migration to tissue and reduced sequestration of lung neutrophils, contributing to improved outcome. The model ensemble presented herein provides a platform for generating and testing hypotheses in silico, as well as motivating further experimental

  3. Classical and Quantum Ensembles via Multiresolution. II. Wigner Ensembles

    CERN Document Server

    Fedorova, A N; Fedorova, Antonina N.; Zeitlin, Michael G.

    2004-01-01

    We present the application of the variational-wavelet analysis to the analysis of quantum ensembles in Wigner framework. (Naive) deformation quantization, the multiresolution representations and the variational approach are the key points. We construct the solutions of Wigner-like equations via the multiscale expansions in the generalized coherent states or high-localized nonlinear eigenmodes in the base of the compactly supported wavelets and the wavelet packets. We demonstrate the appearance of (stable) localized patterns (waveletons) and consider entanglement and decoherence as possible applications.

  4. Control of inhomogeneous atomic ensembles of hyperfine qudits

    CERN Document Server

    Mischuck, Brian E; Deutsch, Ivan H

    2011-01-01

    We study the ability to control d-dimensional quantum systems (qudits) encoded in the hyperfine spin of alkali-metal atoms through the application of radio- and microwave-frequency magnetic fields in the presence of inhomogeneities in amplitude and detuning. Such a capability is essential to the design of robust pulses that mitigate the effects of experimental uncertainty and also for application to tomographic addressing of particular members of an extended ensemble. We study the problem of preparing an arbitrary state in the Hilbert space from an initial fiducial state. We prove that inhomogeneous control of qudit ensembles is possible based on a semi-analytic protocol that synthesizes the target through a sequence of alternating rf and microwave-driven SU(2) rotations in overlapping irreducible subspaces. Several examples of robust control are studied, and the semi-analytic protocol is compared to a brute force, full numerical search. For small inhomogeneities, < 1%, both approaches achieve average fide...

  5. Extending Puppet

    CERN Document Server

    Franceschi, Alessandro

    2014-01-01

    This book is a clear, detailed and practical guide to learn about designing and deploying you puppet architecture, with informative examples to highlight and explain concepts in a focused manner. This book is designed for users who already have good experience with Puppet, and will surprise experienced users with innovative topics that explore how to design, implement, adapt, and deploy a Puppet architecture. The key to extending Puppet is the development of types and providers, for which you must be familiar with Ruby.

  6. Unsupervised Learning in an Ensemble of Spiking Neural Networks Mediated by ITDP

    Science.gov (United States)

    Staras, Kevin

    2016-01-01

    We propose a biologically plausible architecture for unsupervised ensemble learning in a population of spiking neural network classifiers. A mixture of experts type organisation is shown to be effective, with the individual classifier outputs combined via a gating network whose operation is driven by input timing dependent plasticity (ITDP). The ITDP gating mechanism is based on recent experimental findings. An abstract, analytically tractable model of the ITDP driven ensemble architecture is derived from a logical model based on the probabilities of neural firing events. A detailed analysis of this model provides insights that allow it to be extended into a full, biologically plausible, computational implementation of the architecture which is demonstrated on a visual classification task. The extended model makes use of a style of spiking network, first introduced as a model of cortical microcircuits, that is capable of Bayesian inference, effectively performing expectation maximization. The unsupervised ensemble learning mechanism, based around such spiking expectation maximization (SEM) networks whose combined outputs are mediated by ITDP, is shown to perform the visual classification task well and to generalize to unseen data. The combined ensemble performance is significantly better than that of the individual classifiers, validating the ensemble architecture and learning mechanisms. The properties of the full model are analysed in the light of extensive experiments with the classification task, including an investigation into the influence of different input feature selection schemes and a comparison with a hierarchical STDP based ensemble architecture. PMID:27760125

  7. Unsupervised Learning in an Ensemble of Spiking Neural Networks Mediated by ITDP.

    Science.gov (United States)

    Shim, Yoonsik; Philippides, Andrew; Staras, Kevin; Husbands, Phil

    2016-10-01

    We propose a biologically plausible architecture for unsupervised ensemble learning in a population of spiking neural network classifiers. A mixture of experts type organisation is shown to be effective, with the individual classifier outputs combined via a gating network whose operation is driven by input timing dependent plasticity (ITDP). The ITDP gating mechanism is based on recent experimental findings. An abstract, analytically tractable model of the ITDP driven ensemble architecture is derived from a logical model based on the probabilities of neural firing events. A detailed analysis of this model provides insights that allow it to be extended into a full, biologically plausible, computational implementation of the architecture which is demonstrated on a visual classification task. The extended model makes use of a style of spiking network, first introduced as a model of cortical microcircuits, that is capable of Bayesian inference, effectively performing expectation maximization. The unsupervised ensemble learning mechanism, based around such spiking expectation maximization (SEM) networks whose combined outputs are mediated by ITDP, is shown to perform the visual classification task well and to generalize to unseen data. The combined ensemble performance is significantly better than that of the individual classifiers, validating the ensemble architecture and learning mechanisms. The properties of the full model are analysed in the light of extensive experiments with the classification task, including an investigation into the influence of different input feature selection schemes and a comparison with a hierarchical STDP based ensemble architecture.

  8. Extended Hamiltonian approach to continuous tempering.

    Science.gov (United States)

    Gobbo, Gianpaolo; Leimkuhler, Benedict J

    2015-06-01

    We introduce an enhanced sampling simulation technique based on continuous tempering, i.e., on continuously varying the temperature of the system under investigation. Our approach is mathematically straightforward, being based on an extended Hamiltonian formulation in which an auxiliary degree of freedom, determining the effective temperature, is coupled to the physical system. The physical system and its temperature evolve continuously in time according to the equations of motion derived from the extended Hamiltonian. Due to the Hamiltonian structure, it is easy to show that a particular subset of the configurations of the extended system is distributed according to the canonical ensemble for the physical system at the correct physical temperature.

  9. Ensembles

    OpenAIRE

    2012-01-01

    Licence; En 1935, un groupe de mathématiciens français eut l'ambition de reconstruire tout l'édifice mathématique (sans S pour bien montrer l'unité) selon la pensée formaliste de Hilbert. Les membres fondateurs ont été Henri Cartan, Claude Chevalley, Jean Delsarte, Jean Dieudonné, André Weil auxquels se joindra René de Possel. En juillet 1935 fut donc créé, lors d'un séminaire en Auvergne le groupe 'Nicolas Bourbaki'. Le nom de cette association fait référence en fait à une anecdote qui se pa...

  10. Analysis of mesoscale forecasts using ensemble methods

    CERN Document Server

    Gross, Markus

    2016-01-01

    Mesoscale forecasts are now routinely performed as elements of operational forecasts and their outputs do appear convincing. However, despite their realistic appearance at times the comparison to observations is less favorable. At the grid scale these forecasts often do not compare well with observations. This is partly due to the chaotic system underlying the weather. Another key problem is that it is impossible to evaluate the risk of making decisions based on these forecasts because they do not provide a measure of confidence. Ensembles provide this information in the ensemble spread and quartiles. However, running global ensembles at the meso or sub mesoscale involves substantial computational resources. National centers do run such ensembles, but the subject of this publication is a method which requires significantly less computation. The ensemble enhanced mesoscale system presented here aims not at the creation of an improved mesoscale forecast model. Also it is not to create an improved ensemble syste...

  11. Measuring social interaction in music ensembles.

    Science.gov (United States)

    Volpe, Gualtiero; D'Ausilio, Alessandro; Badino, Leonardo; Camurri, Antonio; Fadiga, Luciano

    2016-05-05

    Music ensembles are an ideal test-bed for quantitative analysis of social interaction. Music is an inherently social activity, and music ensembles offer a broad variety of scenarios which are particularly suitable for investigation. Small ensembles, such as string quartets, are deemed a significant example of self-managed teams, where all musicians contribute equally to a task. In bigger ensembles, such as orchestras, the relationship between a leader (the conductor) and a group of followers (the musicians) clearly emerges. This paper presents an overview of recent research on social interaction in music ensembles with a particular focus on (i) studies from cognitive neuroscience; and (ii) studies adopting a computational approach for carrying out automatic quantitative analysis of ensemble music performances.

  12. Gibbs Ensembles of Nonintersecting Paths

    CERN Document Server

    Borodin, Alexei

    2008-01-01

    We consider a family of determinantal random point processes on the two-dimensional lattice and prove that members of our family can be interpreted as a kind of Gibbs ensembles of nonintersecting paths. Examples include probability measures on lozenge and domino tilings of the plane, some of which are non-translation-invariant. The correlation kernels of our processes can be viewed as extensions of the discrete sine kernel, and we show that the Gibbs property is a consequence of simple linear relations satisfied by these kernels. The processes depend on infinitely many parameters, which are closely related to parametrization of totally positive Toeplitz matrices.

  13. Wind Power Prediction using Ensembles

    DEFF Research Database (Denmark)

    Giebel, Gregor; Badger, Jake; Landberg, Lars

    2005-01-01

    offshore wind farm and the whole Jutland/Funen area. The utilities used these forecasts for maintenance planning, fuel consumption estimates and over-the-weekend trading on the Leipzig power exchange. Othernotable scientific results include the better accuracy of forecasts made up from a simple...... superposition of two NWP provider (in our case, DMI and DWD), an investigation of the merits of a parameterisation of the turbulent kinetic energy within thedelivered wind speed forecasts, and the finding that a “naïve” downscaling of each of the coarse ECMWF ensemble members with higher resolution HIRLAM did...

  14. Ensemble Methods Foundations and Algorithms

    CERN Document Server

    Zhou, Zhi-Hua

    2012-01-01

    An up-to-date, self-contained introduction to a state-of-the-art machine learning approach, Ensemble Methods: Foundations and Algorithms shows how these accurate methods are used in real-world tasks. It gives you the necessary groundwork to carry out further research in this evolving field. After presenting background and terminology, the book covers the main algorithms and theories, including Boosting, Bagging, Random Forest, averaging and voting schemes, the Stacking method, mixture of experts, and diversity measures. It also discusses multiclass extension, noise tolerance, error-ambiguity a

  15. Quantum Repeaters and Atomic Ensembles

    DEFF Research Database (Denmark)

    Borregaard, Johannes

    a previous protocol, thereby enabling fast local processing, which greatly enhances the distribution rate. We then move on to describe our work on improving the stability of atomic clocks using entanglement. Entanglement can potentially push the stability of atomic clocks to the so-called Heisenberg limit......, which is the absolute upper limit of the stability allowed by the Heisenberg uncertainty relation. It has, however, been unclear whether entangled state’s enhanced sensitivity to noise would prevent reaching this limit. We have developed an adaptive measurement protocol, which circumvents this problem...... based on atomic ensembles....

  16. A Localized Ensemble Kalman Smoother

    Science.gov (United States)

    Butala, Mark D.

    2012-01-01

    Numerous geophysical inverse problems prove difficult because the available measurements are indirectly related to the underlying unknown dynamic state and the physics governing the system may involve imperfect models or unobserved parameters. Data assimilation addresses these difficulties by combining the measurements and physical knowledge. The main challenge in such problems usually involves their high dimensionality and the standard statistical methods prove computationally intractable. This paper develops and addresses the theoretical convergence of a new high-dimensional Monte-Carlo approach called the localized ensemble Kalman smoother.

  17. Heterogeneous versus Homogeneous Machine Learning Ensembles

    Directory of Open Access Journals (Sweden)

    Petrakova Aleksandra

    2015-12-01

    Full Text Available The research demonstrates efficiency of the heterogeneous model ensemble application for a cancer diagnostic procedure. Machine learning methods used for the ensemble model training are neural networks, random forest, support vector machine and offspring selection genetic algorithm. Training of models and the ensemble design is performed by means of HeuristicLab software. The data used in the research have been provided by the General Hospital of Linz, Austria.

  18. Interpreting Tree Ensembles with inTrees

    OpenAIRE

    Deng, Houtao

    2014-01-01

    Tree ensembles such as random forests and boosted trees are accurate but difficult to understand, debug and deploy. In this work, we provide the inTrees (interpretable trees) framework that extracts, measures, prunes and selects rules from a tree ensemble, and calculates frequent variable interactions. An rule-based learner, referred to as the simplified tree ensemble learner (STEL), can also be formed and used for future prediction. The inTrees framework can applied to both classification an...

  19. Extending Experiences

    DEFF Research Database (Denmark)

    A computer game's player is experiencing not only the game as a designer-made artefact, but also a multitude of social and cultural practices and contexts of both computer game play and everyday life. As a truly multidisciplinary anthology, Extending Experiences sheds new light on the mesh...... of possibilities and influences the player engages with. Part one, Experiential Structures of Play, considers some of the key concepts commonly used to address the experience of a computer game player. The second part, Bordering Play, discusses conceptual and practical overlaps of games and everyday life...

  20. Spatial-temporal fractions verification for high-resolution ensemble forecasts

    Directory of Open Access Journals (Sweden)

    Le Duc

    2013-04-01

    Full Text Available Experiments with two ensemble systems of resolutions 10 km (MF10km and 2 km (MF2km were designed to examine the value of cloud-resolving ensemble forecast in predicting precipitation on small spatio-temporal scales. Since the verification was performed on short-term precipitation at high resolution, uncertainties from small-scale processes caused the traditional verification methods to be inconsistent with the subjective evaluation. An extended verification method based on the Fractions Skill Score (FSS was introduced to account for these uncertainties. The main idea is to extend the concept of spatial neighbourhood in FSS to the time and ensemble dimension. The extension was carried out by recognising that even if ensemble forecast is used, small-scale variability still exists in forecasts and influences verification results. In addition to FSS, the neighbourhood concept was also incorporated into reliability diagrams and relative operating characteristics to verify the reliability and resolution of two systems. The extension of FSS in time dimension demonstrates the important role of temporal scales in short-term precipitation verification at small spatial scales. The extension of FSS in ensemble space is called the ensemble FSS, which is a good representative of FSS for ensemble forecast in comparison with the FSS of ensemble mean. The verification results show that MF2km outperforms MF10km in heavy rain forecasts. In contrast, MF10km was slightly better than MF2km in predicting light rains, suggesting that the horizontal resolution of 2 km is not necessarily enough to completely resolve convective cells.

  1. Analysis of peeling decoder for MET ensembles

    CERN Document Server

    Hinton, Ryan

    2009-01-01

    The peeling decoder introduced by Luby, et al. allows analysis of LDPC decoding for the binary erasure channel (BEC). For irregular ensembles, they analyze the decoder state as a Markov process and present a solution to the differential equations describing the process mean. Multi-edge type (MET) ensembles allow greater precision through specifying graph connectivity. We generalize the the peeling decoder for MET ensembles and derive analogous differential equations. We offer a new change of variables and solution to the node fraction evolutions in the general (MET) case. This result is preparatory to investigating finite-length ensemble behavior.

  2. A Novel Feature Extraction Scheme with Ensemble Coding for Protein–Protein Interaction Prediction

    Directory of Open Access Journals (Sweden)

    Xiuquan Du

    2014-07-01

    Full Text Available Protein–protein interactions (PPIs play key roles in most cellular processes, such as cell metabolism, immune response, endocrine function, DNA replication, and transcription regulation. PPI prediction is one of the most challenging problems in functional genomics. Although PPI data have been increasing because of the development of high-throughput technologies and computational methods, many problems are still far from being solved. In this study, a novel predictor was designed by using the Random Forest (RF algorithm with the ensemble coding (EC method. To reduce computational time, a feature selection method (DX was adopted to rank the features and search the optimal feature combination. The DXEC method integrates many features and physicochemical/biochemical properties to predict PPIs. On the Gold Yeast dataset, the DXEC method achieves 67.2% overall precision, 80.74% recall, and 70.67% accuracy. On the Silver Yeast dataset, the DXEC method achieves 76.93% precision, 77.98% recall, and 77.27% accuracy. On the human dataset, the prediction accuracy reaches 80% for the DXEC-RF method. We extended the experiment to a bigger and more realistic dataset that maintains 50% recall on the Yeast All dataset and 80% recall on the Human All dataset. These results show that the DXEC method is suitable for performing PPI prediction. The prediction service of the DXEC-RF classifier is available at http://ailab.ahu.edu.cn:8087/ DXECPPI/index.jsp.

  3. Model error estimation in ensemble data assimilation

    Directory of Open Access Journals (Sweden)

    S. Gillijns

    2007-01-01

    Full Text Available A new methodology is proposed to estimate and account for systematic model error in linear filtering as well as in nonlinear ensemble based filtering. Our results extend the work of Dee and Todling (2000 on constant bias errors to time-varying model errors. In contrast to existing methodologies, the new filter can also deal with the case where no dynamical model for the systematic error is available. In the latter case, the applicability is limited by a matrix rank condition which has to be satisfied in order for the filter to exist. The performance of the filter developed in this paper is limited by the availability and the accuracy of observations and by the variance of the stochastic model error component. The effect of these aspects on the estimation accuracy is investigated in several numerical experiments using the Lorenz (1996 model. Experimental results indicate that the availability of a dynamical model for the systematic error significantly reduces the variance of the model error estimates, but has only minor effect on the estimates of the system state. The filter is able to estimate additive model error of any type, provided that the rank condition is satisfied and that the stochastic errors and measurement errors are significantly smaller than the systematic errors. The results of this study are encouraging. However, it remains to be seen how the filter performs in more realistic applications.

  4. Ensemble positive unlabeled learning for disease gene identification.

    Directory of Open Access Journals (Sweden)

    Peng Yang

    Full Text Available An increasing number of genes have been experimentally confirmed in recent years as causative genes to various human diseases. The newly available knowledge can be exploited by machine learning methods to discover additional unknown genes that are likely to be associated with diseases. In particular, positive unlabeled learning (PU learning methods, which require only a positive training set P (confirmed disease genes and an unlabeled set U (the unknown candidate genes instead of a negative training set N, have been shown to be effective in uncovering new disease genes in the current scenario. Using only a single source of data for prediction can be susceptible to bias due to incompleteness and noise in the genomic data and a single machine learning predictor prone to bias caused by inherent limitations of individual methods. In this paper, we propose an effective PU learning framework that integrates multiple biological data sources and an ensemble of powerful machine learning classifiers for disease gene identification. Our proposed method integrates data from multiple biological sources for training PU learning classifiers. A novel ensemble-based PU learning method EPU is then used to integrate multiple PU learning classifiers to achieve accurate and robust disease gene predictions. Our evaluation experiments across six disease groups showed that EPU achieved significantly better results compared with various state-of-the-art prediction methods as well as ensemble learning classifiers. Through integrating multiple biological data sources for training and the outputs of an ensemble of PU learning classifiers for prediction, we are able to minimize the potential bias and errors in individual data sources and machine learning algorithms to achieve more accurate and robust disease gene predictions. In the future, our EPU method provides an effective framework to integrate the additional biological and computational resources for better disease

  5. JEnsembl: a version-aware Java API to Ensembl data systems

    Science.gov (United States)

    Paterson, Trevor; Law, Andy

    2012-01-01

    Motivation: The Ensembl Project provides release-specific Perl APIs for efficient high-level programmatic access to data stored in various Ensembl database schema. Although Perl scripts are perfectly suited for processing large volumes of text-based data, Perl is not ideal for developing large-scale software applications nor embedding in graphical interfaces. The provision of a novel Java API would facilitate type-safe, modular, object-orientated development of new Bioinformatics tools with which to access, analyse and visualize Ensembl data. Results: The JEnsembl API implementation provides basic data retrieval and manipulation functionality from the Core, Compara and Variation databases for all species in Ensembl and EnsemblGenomes and is a platform for the development of a richer API to Ensembl datasources. The JEnsembl architecture uses a text-based configuration module to provide evolving, versioned mappings from database schema to code objects. A single installation of the JEnsembl API can therefore simultaneously and transparently connect to current and previous database instances (such as those in the public archive) thus facilitating better analysis repeatability and allowing ‘through time’ comparative analyses to be performed. Availability: Project development, released code libraries, Maven repository and documentation are hosted at SourceForge (http://jensembl.sourceforge.net). Contact: jensembl-develop@lists.sf.net, andy.law@roslin.ed.ac.uk, trevor.paterson@roslin.ed.ac.uk PMID:22945789

  6. Consciousness extended

    DEFF Research Database (Denmark)

    Carrara-Augustenborg, Claudia

    2012-01-01

    There is no consensus yet regarding a conceptualization of consciousness able to accommodate all the features of such complex phenomenon. Different theoretical and empirical models lend strength to both the occurrence of a non-accessible informational broadcast, and to the mobilization of specific...... brain areas responsible for the emergence of the individual´s explicit and variable access to given segments of such broadcast. Rather than advocating one model over others, this chapter proposes to broaden the conceptualization of consciousness by letting it embrace both mechanisms. Within...... such extended framework, I propose conceptual and functional distinctions between consciousness (global broadcast of information), awareness (individual´s ability to access the content of such broadcast) and unconsciousness (focally isolated neural activations). My hypothesis is that a demarcation in terms...

  7. Canonical ensemble approach to graded-response perceptrons

    Science.gov (United States)

    Bollé, D.; Erichsen, R., Jr.

    1999-03-01

    Perceptrons with graded input-output relations and a limited output precision are studied within the Gardner-Derrida canonical ensemble approach. Soft non-negative error measures are introduced allowing for extended retrieval properties. In particular, the performance of these systems for a linear (quadratic) error measure, corresponding to the perceptron (adaline) learning algorithm, is compared with the performance for a rigid error measure, simply counting the number of errors. Replica-symmetry-breaking effects are evaluated, and the analytic results are compared with numerical simulations.

  8. Inner structure of vehicular ensembles and random matrix theory

    Science.gov (United States)

    Krbálek, Milan; Hobza, Tomáš

    2016-05-01

    We introduce a special class of random matrices (DUE) whose spectral statistics corresponds to statistics of microscopical quantities detected in vehicular flows. Comparing the level spacing distribution (for ordered eigenvalues in unfolded spectra of DUE matrices) with the time-clearance distribution extracted from various areas of the flux-density diagram (evaluated from original traffic data measured on Czech expressways with high occupancies) we demonstrate that the set of classical systems showing an universality associated with Random Matrix Ensembles can be extended by traffic systems.

  9. Light-mediated non-Gaussian entanglement of atomic ensembles

    Science.gov (United States)

    Pettersson, Olov; Byrnes, Tim

    2017-04-01

    We analyze a similar scheme for producing light-mediated entanglement between atomic ensembles, as first realized by Julsgaard, Kozhekin, and Polzik [Nature (London) 413, 400 (2001), 10.1038/35096524]. In the standard approach to modeling the scheme, a Holstein-Primakoff approximation is made, where the atomic ensembles are treated as bosonic modes, and is only valid for short interaction times. In this paper, we solve the time evolution without this approximation, which extends the region of validity of the interaction time. For short entangling times, we find that this produces a state with characteristics similar to those of a two-mode squeezed state, in agreement with standard predictions. For long entangling times, the state evolves into a non-Gaussian form, and the characteristics of the two-mode squeezed state start to diminish. This is attributed to more exotic types of entangled states being generated. We characterize the states by examining the Fock-state probability distributions, Husimi Q distributions, and nonlocal entanglement between the ensembles. We compare and connect several quantities obtained by using the Holstein-Primakoff approach and our exact time evolution methods.

  10. Accuracy/diversity and ensemble MLP classifier design.

    Science.gov (United States)

    Windeatt, Terry

    2006-09-01

    The difficulties of tuning parameters of multilayer perceptrons (MLP) classifiers are well known. In this paper, a measure is described that is capable of predicting the number of classifier training epochs for achieving optimal performance in an ensemble of MLP classifiers. The measure is computed between pairs of patterns on the training data and is based on a spectral representation of a Boolean function. This representation characterizes the mapping from classifier decisions to target label and allows accuracy and diversity to be incorporated within a single measure. Results on many benchmark problems, including the Olivetti Research Laboratory (ORL) face database demonstrate that the measure is well correlated with base-classifier test error, and may be used to predict the optimal number of training epochs. While correlation with ensemble test error is not quite as strong, it is shown in this paper that the measure may be used to predict number of epochs for optimal ensemble performance. Although the technique is only applicable to two-class problems, it is extended here to multiclass through output coding. For the output-coding technique, a random code matrix is shown to give better performance than one-per-class code, even when the base classifier is well-tuned.

  11. Ensembles of physical states and random quantum circuits on graphs

    CERN Document Server

    Hamma, Alioscia; Zanardi, Paolo

    2012-01-01

    In this paper we continue and extend the investigations of the ensembles of random physical states introduced in A. Hamma et al arXiv:1109.4391. These ensembles are constructed by finite-length random quantum circuits (RQC) acting on (hyper)edges of an underlying (hyper)graph structure. The latter encodes for the locality structure associated with finite-time quantum evolutions generated by physical i.e., local, Hamiltonians. Our goal is to analyze physical properties of typical states in these ensembles, in particular here we focus on proxies of quantum entanglement as purity and $\\alpha$-Renyi entropies. The problem is formulated in terms of matrix elements of superoperators which depend on the graph structure, choice of probability measure over the local unitaries and circuit length. In the $\\alpha=2$ case these superoperators act on a restricted multi-qubit space generated by permutation operators associated to the subsets of vertices of the graph. For permutationally invariant interactions the dynamics c...

  12. Beta ensembles, quantum Painlev\\'e equations and isomonodromy systems

    CERN Document Server

    Rumanov, Igor

    2014-01-01

    This is a review of recent developments in the theory of beta ensembles of random matrices and their relations with conformal filed theory (CFT). There are (almost) no new results here. This article can serve as a guide on appearances and studies of quantum Painlev\\'e and more general multidimensional linear equations of Belavin-Polyakov-Zamolodchikov (BPZ) type in literature. We demonstrate how BPZ equations of CFT arise from $\\beta$-ensemble eigenvalue integrals. Quantum Painlev\\'e equations are relatively simple instances of BPZ or confluent BPZ equations, they are PDEs in two independent variables ("time" and "space"). While CFT is known as quantum integrable theory, here we focus on the appearing links of $\\beta$-ensembles and CFT with {\\it classical} integrable structure and isomonodromy systems. The central point is to show on the example of quantum Painlev\\'e II (QPII)~\\cite{betaFP1} how classical integrable structure can be extended to general values of $\\beta$ (or CFT central charge $c$), beyond the...

  13. Improved customer choice predictions using ensemble methods

    NARCIS (Netherlands)

    M.C. van Wezel (Michiel); R. Potharst (Rob)

    2005-01-01

    textabstractIn this paper various ensemble learning methods from machine learning and statistics are considered and applied to the customer choice modeling problem. The application of ensemble learning usually improves the prediction quality of flexible models like decision trees and thus leads to

  14. Layered Ensemble Architecture for Time Series Forecasting.

    Science.gov (United States)

    Rahman, Md Mustafizur; Islam, Md Monirul; Murase, Kazuyuki; Yao, Xin

    2016-01-01

    Time series forecasting (TSF) has been widely used in many application areas such as science, engineering, and finance. The phenomena generating time series are usually unknown and information available for forecasting is only limited to the past values of the series. It is, therefore, necessary to use an appropriate number of past values, termed lag, for forecasting. This paper proposes a layered ensemble architecture (LEA) for TSF problems. Our LEA consists of two layers, each of which uses an ensemble of multilayer perceptron (MLP) networks. While the first ensemble layer tries to find an appropriate lag, the second ensemble layer employs the obtained lag for forecasting. Unlike most previous work on TSF, the proposed architecture considers both accuracy and diversity of the individual networks in constructing an ensemble. LEA trains different networks in the ensemble by using different training sets with an aim of maintaining diversity among the networks. However, it uses the appropriate lag and combines the best trained networks to construct the ensemble. This indicates LEAs emphasis on accuracy of the networks. The proposed architecture has been tested extensively on time series data of neural network (NN)3 and NN5 competitions. It has also been tested on several standard benchmark time series data. In terms of forecasting accuracy, our experimental results have revealed clearly that LEA is better than other ensemble and nonensemble methods.

  15. Ensemble methods for handwritten digit recognition

    DEFF Research Database (Denmark)

    Hansen, Lars Kai; Liisberg, Christian; Salamon, P.

    1992-01-01

    . It is further shown that it is possible to estimate the ensemble performance as well as the learning curve on a medium-size database. In addition the authors present preliminary analysis of experiments on a large database and show that state-of-the-art performance can be obtained using the ensemble approach...

  16. Nonextensivity in magnetic nanoparticle ensembles

    Science.gov (United States)

    Binek, Ch.; Polisetty, S.; He, Xi; Mukherjee, T.; Rajesh, R.; Redepenning, J.

    2006-08-01

    A superconducting quantum interference device and Faraday rotation technique are used to study dipolar interacting nanoparticles embedded in a polystyrene matrix. Magnetization isotherms are measured for three cylindrically shaped samples of constant diameter but various heights. Detailed analysis of the isotherms supports Tsallis’ conjecture of a magnetic equation of state that involves temperature and magnetic field variables scaled by the logarithm of the number of magnetic nanoparticles. This unusual scaling of thermodynamic variables, which are conventionally considered to be intensive, originates from the nonextensivity of the Gibbs free energy in three-dimensional dipolar interacting particle ensembles. Our experimental evidence for nonextensivity is based on the data collapse of various isotherms that require scaling of the field variable in accordance with Tsallis’ equation of state.

  17. Genomic Database Searching.

    Science.gov (United States)

    Hutchins, James R A

    2017-01-01

    The availability of reference genome sequences for virtually all species under active research has revolutionized biology. Analyses of genomic variations in many organisms have provided insights into phenotypic traits, evolution and disease, and are transforming medicine. All genomic data from publicly funded projects are freely available in Internet-based databases, for download or searching via genome browsers such as Ensembl, Vega, NCBI's Map Viewer, and the UCSC Genome Browser. These online tools generate interactive graphical outputs of relevant chromosomal regions, showing genes, transcripts, and other genomic landmarks, and epigenetic features mapped by projects such as ENCODE.This chapter provides a broad overview of the major genomic databases and browsers, and describes various approaches and the latest resources for searching them. Methods are provided for identifying genomic locus and sequence information using gene names or codes, identifiers for DNA and RNA molecules and proteins; also from karyotype bands, chromosomal coordinates, sequences, motifs, and matrix-based patterns. Approaches are also described for batch retrieval of genomic information, performing more complex queries, and analyzing larger sets of experimental data, for example from next-generation sequencing projects.

  18. Perception of ensemble statistics requires attention.

    Science.gov (United States)

    Jackson-Nielsen, Molly; Cohen, Michael A; Pitts, Michael A

    2017-02-01

    To overcome inherent limitations in perceptual bandwidth, many aspects of the visual world are represented as summary statistics (e.g., average size, orientation, or density of objects). Here, we investigated the relationship between summary (ensemble) statistics and visual attention. Recently, it was claimed that one ensemble statistic in particular, color diversity, can be perceived without focal attention. However, a broader debate exists over the attentional requirements of conscious perception, and it is possible that some form of attention is necessary for ensemble perception. To test this idea, we employed a modified inattentional blindness paradigm and found that multiple types of summary statistics (color and size) often go unnoticed without attention. In addition, we found attentional costs in dual-task situations, further implicating a role for attention in statistical perception. Overall, we conclude that while visual ensembles may be processed efficiently, some amount of attention is necessary for conscious perception of ensemble statistics.

  19. Popular Ensemble Methods: An Empirical Study

    CERN Document Server

    Maclin, R; 10.1613/jair.614

    2011-01-01

    An ensemble consists of a set of individually trained classifiers (such as neural networks or decision trees) whose predictions are combined when classifying novel instances. Previous research has shown that an ensemble is often more accurate than any of the single classifiers in the ensemble. Bagging (Breiman, 1996c) and Boosting (Freund and Shapire, 1996; Shapire, 1990) are two relatively new but popular methods for producing ensembles. In this paper we evaluate these methods on 23 data sets using both neural networks and decision trees as our classification algorithm. Our results clearly indicate a number of conclusions. First, while Bagging is almost always more accurate than a single classifier, it is sometimes much less accurate than Boosting. On the other hand, Boosting can create ensembles that are less accurate than a single classifier -- especially when using neural networks. Analysis indicates that the performance of the Boosting methods is dependent on the characteristics of the data set being exa...

  20. Extended Chameleons

    CERN Document Server

    Brax, Philippe

    2015-01-01

    We extend the chameleon models by considering Scalar-Fluid theories where the coupling between matter and the scalar field can be represented by a quadratic effective potential with density-dependent minimum and mass. In this context, we study the effects of the scalar field on Solar System tests of gravity and show that models passing these stringent constraints can still induce large modifications of Newton's law on galactic scales. On these scales we analyse models which could lead to a percent deviation of Newton's law outside the virial radius. We then model the dark matter halo as a Navarro-Frenk-White profile and explicitly find that the fifth force can give large contributions around the galactic core in a particular model where the scalar field mass is constant and the minimum of its potential varies linearly with the matter density. At cosmological distances, we find that this model does not alter the growth of large scale structures and therefore would be best tested on galactic scales, where inter...

  1. On the efficiency of biased sampling of the multiple state path ensemble

    NARCIS (Netherlands)

    Rogal, J.; Bolhuis, P.G.

    2010-01-01

    Developed for complex systems undergoing rare events involving many (meta)stable states, the multiple state transition path sampling aims to sample from an extended path ensemble including all possible trajectories between any pair of (meta)stable states. The key issue for an efficient sampling of t

  2. Generation of scenarios from calibrated ensemble forecasts with a dynamic ensemble copula coupling approach

    CERN Document Server

    Bouallegue, Zied Ben; Theis, Susanne E; Pinson, Pierre

    2015-01-01

    Probabilistic forecasts in the form of ensemble of scenarios are required for complex decision making processes. Ensemble forecasting systems provide such products but the spatio-temporal structures of the forecast uncertainty is lost when statistical calibration of the ensemble forecasts is applied for each lead time and location independently. Non-parametric approaches allow the reconstruction of spatio-temporal joint probability distributions at a low computational cost.For example, the ensemble copula coupling (ECC) method consists in rebuilding the multivariate aspect of the forecast from the original ensemble forecasts. Based on the assumption of error stationarity, parametric methods aim to fully describe the forecast dependence structures. In this study, the concept of ECC is combined with past data statistics in order to account for the autocorrelation of the forecast error. The new approach which preserves the dynamical development of the ensemble members is called dynamic ensemble copula coupling (...

  3. An Organic Computing Approach to Self-organising Robot Ensembles

    Directory of Open Access Journals (Sweden)

    Sebastian Albrecht von Mammen

    2016-11-01

    Full Text Available Similar to the Autonomous Computing initiative, that has mainly been advancing techniques for self-optimisation focussing on computing systems and infrastructures, Organic Computing (OC has been driving the development of system design concepts and algorithms for self-adaptive systems at large. Examples of application domains include, for instance, traffic management and control, cloud services, communication protocols, and robotic systems. Such an OC system typically consists of a potentially large set of autonomous and self-managed entities, where each entity acts with a local decision horizon. By means of cooperation of the individual entities, the behaviour of the entire ensemble system is derived. In this article, we present our work on how autonomous, adaptive robot ensembles can benefit from OC technology. Our elaborations are aligned with the different layers of an observer/controller framework which provides the foundation for the individuals' adaptivity at system design-level. Relying on an extended Learning Classifier System (XCS in combination with adequate simulation techniques, this basic system design empowers robot individuals to improve their individual and collaborative performances, e.g. by means of adapting to changing goals and conditions.Not only for the sake of generalisability, but also because of its enormous transformative potential, we stage our research in the domain of robot ensembles that are typically comprised of several quad-rotors and that organise themselves to fulfil spatial tasks such as maintenance of building facades or the collaborative search for mobile targets. Our elaborations detail the architectural concept, provide examples of individual self-optimisation as well as of the optimisation of collaborative efforts, and we show how the user can control the ensembles at multiple levels of abstraction. We conclude with a summary of our approach and an outlook on possible future steps.

  4. extendFromReads

    Energy Technology Data Exchange (ETDEWEB)

    2013-10-03

    This package assists in genome assembly. extendFromReads takes as input a set of Illumina (eg, MiSeq) DNA sequencing reads, a query seed sequence and a direction to extend the seed. The algorithm collects all seed--]matching reads (flipping reverse--]orientation hits), trims off the seed and additional sequence in the other direction, sorts the remaining sequences alphabetically, and prints them aligned without gaps from the point of seed trimming. This produces a visual display distinguishing the flanks of multi-]copy seeds. A companion script hitMates.pl collects the mates of seed--]hi]ng reads, whose alignment reveals longer extensions from the seed. The collect/trim/sort strategy was made iterative and scaled up in the script denovo.pl, for de novo contig assembly. An index is pre--]built using indexReads.pl that for each unique 21--]mer found in all the reads, records its gfateh of extension (whether extendable, blocked by low coverage, or blocked by branching after a duplicated sequence) and other characteristics. Importantly, denovo.pl records all branchings that follow a branching contig endpoint, providing contig-]extension information

  5. Calculations of canonical averages from the grand canonical ensemble.

    Science.gov (United States)

    Kosov, D S; Gelin, M F; Vdovin, A I

    2008-02-01

    Grand canonical and canonical ensembles become equivalent in the thermodynamic limit, but when the system size is finite the results obtained in the two ensembles deviate from each other. In many important cases, the canonical ensemble provides an appropriate physical description but it is often much easier to perform the calculations in the corresponding grand canonical ensemble. We present a method to compute averages in the canonical ensemble based on calculations of the expectation values in the grand canonical ensemble. The number of particles, which is fixed in the canonical ensemble, is not necessarily the same as the average number of particles in the grand canonical ensemble.

  6. Refining ensembles of predicted gene regulatory networks based on characteristic interaction sets.

    Directory of Open Access Journals (Sweden)

    Lukas Windhager

    Full Text Available Different ensemble voting approaches have been successfully applied for reverse-engineering of gene regulatory networks. They are based on the assumption that a good approximation of true network structure can be derived by considering the frequencies of individual interactions in a large number of predicted networks. Such approximations are typically superior in terms of prediction quality and robustness as compared to considering a single best scoring network only. Nevertheless, ensemble approaches only work well if the predicted gene regulatory networks are sufficiently similar to each other. If the topologies of predicted networks are considerably different, an ensemble of all networks obscures interesting individual characteristics. Instead, networks should be grouped according to local topological similarities and ensemble voting performed for each group separately. We argue that the presence of sets of co-occurring interactions is a suitable indicator for grouping predicted networks. A stepwise bottom-up procedure is proposed, where first mutual dependencies between pairs of interactions are derived from predicted networks. Pairs of co-occurring interactions are subsequently extended to derive characteristic interaction sets that distinguish groups of networks. Finally, ensemble voting is applied separately to the resulting topologically similar groups of networks to create distinct group-ensembles. Ensembles of topologically similar networks constitute distinct hypotheses about the reference network structure. Such group-ensembles are easier to interpret as their characteristic topology becomes clear and dependencies between interactions are known. The availability of distinct hypotheses facilitates the design of further experiments to distinguish between plausible network structures. The proposed procedure is a reasonable refinement step for non-deterministic reverse-engineering applications that produce a large number of candidate

  7. Hybrid Data Assimilation without Ensemble Filtering

    Science.gov (United States)

    Todling, Ricardo; Akkraoui, Amal El

    2014-01-01

    The Global Modeling and Assimilation Office is preparing to upgrade its three-dimensional variational system to a hybrid approach in which the ensemble is generated using a square-root ensemble Kalman filter (EnKF) and the variational problem is solved using the Grid-point Statistical Interpolation system. As in most EnKF applications, we found it necessary to employ a combination of multiplicative and additive inflations, to compensate for sampling and modeling errors, respectively and, to maintain the small-member ensemble solution close to the variational solution; we also found it necessary to re-center the members of the ensemble about the variational analysis. During tuning of the filter we have found re-centering and additive inflation to play a considerably larger role than expected, particularly in a dual-resolution context when the variational analysis is ran at larger resolution than the ensemble. This led us to consider a hybrid strategy in which the members of the ensemble are generated by simply converting the variational analysis to the resolution of the ensemble and applying additive inflation, thus bypassing the EnKF. Comparisons of this, so-called, filter-free hybrid procedure with an EnKF-based hybrid procedure and a control non-hybrid, traditional, scheme show both hybrid strategies to provide equally significant improvement over the control; more interestingly, the filter-free procedure was found to give qualitatively similar results to the EnKF-based procedure.

  8. Comparative genomics in cyprinids: Common carp EST's help the annotation of the zebrafish genome

    NARCIS (Netherlands)

    Christoffels, A.; Bartfai, R.; Srinivasan, H.; Komen, J.

    2006-01-01

    Background - Automatic annotation of sequenced eukaryotic genomes integrates a combination of methodologies such as ab-initio methods and alignment of homologous genes and/or proteins. For example, annotation of the zebrafish genome within Ensembl relies heavily on available cDNA and protein sequenc

  9. Extending particle filters to higher dimensional problems

    Science.gov (United States)

    Weir, B.; Miller, R.; Spitz, Y. H.

    2013-12-01

    Particle filters are attractive solutions to nonlinear and non-Gaussian data assimilation problems since they avoid making parametric assumptions. Nevertheless, in very many dimensions their ensembles collapse onto a single particle unless the number of particles grows exponentially as a function of the dimension. This talk investigates three techniques that, used in conjunction, show the potential of preventing ensemble collapse: optimization, mixture models, and covariance refinement. Optimization is the basis of implicit sampling algorithms. By itself, it significantly reduces the growth of the necessary ensemble size, yet not to a sub-exponential function of dimension. Mixture models, which introduce a semi-parametric assumption, allow the technique to adjust the initial position of the particles. For linear and Gaussian problems, the combination of optimization and mixture models reduces the necessary ensemble size to a sub-exponential function of dimension. Covariance refinement adjusts local approximations of the second moment of the particle distributions to account for its global variation. This is especially effective for problems that are strongly nonlinear. In numerical experiments, covariance refinement used alongside optimization and mixture models shows the potential to extend the prevention of collapse to a general class of nonlinear and non-Gaussian problems.

  10. 4DVAR by ensemble Kalman smoother

    CERN Document Server

    Mandel, Jan; Gratton, Serge

    2013-01-01

    We propose to use the ensemble Kalman smoother (EnKS) as linear least squares solver in the Gauss-Newton method for the large nonlinear least squares in incremental 4DVAR. The ensemble approach is naturally parallel over the ensemble members and no tangent or adjoint operators are needed. Further, adding a regularization term results in replacing the Gauss-Newton method, which may diverge, by^M the Levenberg-Marquardt method, which is known to be convergent. The regularization is implemented efficiently as an additional observation in the EnKS.

  11. Derivation of Mayer Series from Canonical Ensemble

    Science.gov (United States)

    Wang, Xian-Zhi

    2016-02-01

    Mayer derived the Mayer series from both the canonical ensemble and the grand canonical ensemble by use of the cluster expansion method. In 2002, we conjectured a recursion formula of the canonical partition function of a fluid (X.Z. Wang, Phys. Rev. E 66 (2002) 056102). In this paper we give a proof for this formula by developing an appropriate expansion of the integrand of the canonical partition function. We further derive the Mayer series solely from the canonical ensemble by use of this recursion formula.

  12. The impact of initial spread calibration on the RELO ensemble and its application to Lagrangian dynamics

    Directory of Open Access Journals (Sweden)

    M. Wei

    2013-09-01

    Full Text Available A number of real-time ocean model forecasts were carried out successfully at Naval Research Laboratory (NRL to provide modeling support and numerical guidance to the CARTHE GLAD at-sea experiment during summer 2012. Two RELO ensembles and three single models using NCOM and HYCOM with different resolutions were carried out. A calibrated ensemble system with enhanced spread and reliability was developed to better support this experiment. The calibrated ensemble is found to outperform the un-calibrated ensemble in forecasting accuracy, skill, and reliability for all the variables and observation spaces evaluated. The metrics used in this paper include RMS error, anomaly correlation, PECA, Brier score, spread reliability, and Talagrand rank histogram. It is also found that even the un-calibrated ensemble outperforms the single forecast from the model with the same resolution. The advantages of the ensembles are further extended to the Lagrangian framework. In contrast to a single model forecast, the RELO ensemble provides not only the most likely Lagrangian trajectory for a particle in the ocean, but also an uncertainty estimate that directly reflects the complicated ocean dynamics, which is valuable for decision makers. The examples show that the calibrated ensemble with more reliability can capture trajectories in different, even opposite, directions, which would be missed by the un-calibrated ensemble. The ensembles are applied to compute the repelling and attracting Lagrangian coherent structures (LCSs, and the uncertainties of the LCSs, which are hard to obtain from a single model forecast, are estimated. It is found that the spatial scales of the LCSs depend on the model resolution. The model with the highest resolution produces the finest, small-scale, LCS structures, while the model with lowest resolution generates only large-scale LCSs. The repelling and attracting LCSs are found to intersect at many locations and create complex mesoscale

  13. Nonextensivity in Magnetic Nanocluster Ensembles

    Science.gov (United States)

    Binek, Christian; Polisetty, Srinivas; He, Xi; Mukherjee, Tathagata; Rajasekeran, Rajesh; Redepenning, Jody

    2006-03-01

    We study the scaling behavior of dipolar interacting nanoparticles in 3D samples of various sizes but constant particle density. Ferromagnetic γ-Fe2O3 clusters embedded in a polystyrene matrix are fabricated by thermal decomposition of metal carbonyls. Transmission electron microscopy reveals a narrow size distribution of 12 nm clusters. They are randomly dispersed in the matrix with an average separation of 80 nm. Magnetization isotherms of these single domain particle ensembles are measured by SQUID magnetometry above the blocking temperature TB =115K where non-equilibrium effects are avoided. After demagnetization corrections which convert the applied magnetic fields into internal fields, H, a data collapse is achieved when scaling the magnetic moment, m, and H by appropriate factors. The latter are theoretically predicted functions of the number of particles and determined here numerically. Scaling of H takes into account the nonextensive (NE) behavior of dipolar interacting particles. In the case of long range interactions a scaling schema has been proposed by Tsallis and confirmed by simulations. The controversial field of NE thermodynamics requires however experimental evidence provided here.

  14. Ensemble Dynamics and Bred Vectors

    CERN Document Server

    Balci, Nusret; Restrepo, Juan M; Sell, George R

    2011-01-01

    We introduce the new concept of an EBV to assess the sensitivity of model outputs to changes in initial conditions for weather forecasting. The new algorithm, which we call the "Ensemble Bred Vector" or EBV, is based on collective dynamics in essential ways. By construction, the EBV algorithm produces one or more dominant vectors. We investigate the performance of EBV, comparing it to the BV algorithm as well as the finite-time Lyapunov Vectors. We give a theoretical justification to the observed fact that the vectors produced by BV, EBV, and the finite-time Lyapunov vectors are similar for small amplitudes. Numerical comparisons of BV and EBV for the 3-equation Lorenz model and for a forced, dissipative partial differential equation of Cahn-Hilliard type that arises in modeling the thermohaline circulation, demonstrate that the EBV yields a size-ordered description of the perturbation field, and is more robust than the BV in the higher nonlinear regime. The EBV yields insight into the fractal structure of th...

  15. A 4D-Ensemble-Variational System for Data Assimilation and Ensemble Initialization

    Science.gov (United States)

    Bowler, Neill; Clayton, Adam; Jardak, Mohamed; Lee, Eunjoo; Jermey, Peter; Lorenc, Andrew; Piccolo, Chiara; Pring, Stephen; Wlasak, Marek; Barker, Dale; Inverarity, Gordon; Swinbank, Richard

    2016-04-01

    The Met Office has been developing a four-dimensional ensemble variational (4DEnVar) data assimilation system over the past four years. The 4DEnVar system is intended both as data assimilation system in its own right and also an improved means of initializing the Met Office Global and Regional Ensemble Prediction System (MOGREPS). The global MOGREPS ensemble has been initialized by running an ensemble of 4DEnVars (En-4DEnVar). The scalability and maintainability of ensemble data assimilation methods make them increasingly attractive, and 4DEnVar may be adopted in the context of the Met Office's LFRic project to redevelop the technical infrastructure to enable its Unified Model (MetUM) to be run efficiently on massively parallel supercomputers. This presentation will report on the results of the 4DEnVar development project, including experiments that have been run using ensemble sizes of up to 200 members.

  16. Spatial Ensemble Postprocessing of Precipitation Forecasts Using High Resolution Analyses

    Science.gov (United States)

    Lang, Moritz N.; Schicker, Irene; Kann, Alexander; Wang, Yong

    2017-04-01

    shows a mean improvement of more than 40% in CRPS when compared to bilinearly interpolated uncalibrated ensemble forecasts. The validation on randomly selected grid points, representing the true height distribution over Austria, still indicates a mean improvement of 35%. The applied statistical model is currently set up for 6-hourly and daily accumulation periods, but will be extended to a temporal resolution of 1-3 hours within a new probabilistic nowcasting system operated by ZAMG.

  17. Transition from Poisson to circular unitary ensemble

    Indian Academy of Sciences (India)

    Vinayak; Akhilesh Pandey

    2009-09-01

    Transitions to universality classes of random matrix ensembles have been useful in the study of weakly-broken symmetries in quantum chaotic systems. Transitions involving Poisson as the initial ensemble have been particularly interesting. The exact two-point correlation function was derived by one of the present authors for the Poisson to circular unitary ensemble (CUE) transition with uniform initial density. This is given in terms of a rescaled symmetry breaking parameter Λ. The same result was obtained for Poisson to Gaussian unitary ensemble (GUE) transition by Kunz and Shapiro, using the contour-integral method of Brezin and Hikami. We show that their method is applicable to Poisson to CUE transition with arbitrary initial density. Their method is also applicable to the more general ℓ CUE to CUE transition where CUE refers to the superposition of ℓ independent CUE spectra in arbitrary ratio.

  18. Ensemble treatments of thermal pairing in nuclei

    Science.gov (United States)

    Hung, Nguyen Quang; Dang, Nguyen Dinh

    2009-10-01

    A systematic comparison is conducted for pairing properties of finite systems at nonzero temperature as predicted by the exact solutions of the pairing problem embedded in three principal statistical ensembles, namely the grandcanonical ensemble, canonical ensemble and microcanonical ensemble, as well as the unprojected (FTBCS1+SCQRPA) and Lipkin-Nogami projected (FTLN1+SCQRPA) theories that include the quasiparticle number fluctuation and coupling to pair vibrations within the self-consistent quasiparticle random-phase approximation. The numerical calculations are performed for the pairing gap, total energy, heat capacity, entropy, and microcanonical temperature within the doubly-folded equidistant multilevel pairing model. The FTLN1+SCQRPA predictions are found to agree best with the exact grand-canonical results. In general, all approaches clearly show that the superfluid-normal phase transition is smoothed out in finite systems. A novel formula is suggested for extracting the empirical pairing gap in reasonable agreement with the exact canonical results.

  19. Ensemble Machine Learning Methods and Applications

    CERN Document Server

    Ma, Yunqian

    2012-01-01

    It is common wisdom that gathering a variety of views and inputs improves the process of decision making, and, indeed, underpins a democratic society. Dubbed “ensemble learning” by researchers in computational intelligence and machine learning, it is known to improve a decision system’s robustness and accuracy. Now, fresh developments are allowing researchers to unleash the power of ensemble learning in an increasing range of real-world applications. Ensemble learning algorithms such as “boosting” and “random forest” facilitate solutions to key computational issues such as face detection and are now being applied in areas as diverse as object trackingand bioinformatics.   Responding to a shortage of literature dedicated to the topic, this volume offers comprehensive coverage of state-of-the-art ensemble learning techniques, including various contributions from researchers in leading industrial research labs. At once a solid theoretical study and a practical guide, the volume is a windfall for r...

  20. Ensemble Learning for Free with Evolutionary Algorithms ?

    CERN Document Server

    Gagné, Christian; Schoenauer, Marc; Tomassini, Marco

    2007-01-01

    Evolutionary Learning proceeds by evolving a population of classifiers, from which it generally returns (with some notable exceptions) the single best-of-run classifier as final result. In the meanwhile, Ensemble Learning, one of the most efficient approaches in supervised Machine Learning for the last decade, proceeds by building a population of diverse classifiers. Ensemble Learning with Evolutionary Computation thus receives increasing attention. The Evolutionary Ensemble Learning (EEL) approach presented in this paper features two contributions. First, a new fitness function, inspired by co-evolution and enforcing the classifier diversity, is presented. Further, a new selection criterion based on the classification margin is proposed. This criterion is used to extract the classifier ensemble from the final population only (Off-line) or incrementally along evolution (On-line). Experiments on a set of benchmark problems show that Off-line outperforms single-hypothesis evolutionary learning and state-of-art ...

  1. Tropical Cyclone Ensemble Data Assimilation

    Science.gov (United States)

    2012-09-30

    conditions. But this method was not used to process the entire QuikSCAT mission. In this study, we use quality information concerning the impact of rain...candidate for such an approach is described by Yoon et al. (2012). We are in the process of extending our reseach efforts on that topic in close

  2. Genomic libraries: I. Construction and screening of fosmid genomic libraries.

    Science.gov (United States)

    Quail, Mike A; Matthews, Lucy; Sims, Sarah; Lloyd, Christine; Beasley, Helen; Baxter, Simon W

    2011-01-01

    Large insert genome libraries have been a core resource required to sequence genomes, analyze haplotypes, and aid gene discovery. While next generation sequencing technologies are revolutionizing the field of genomics, traditional genome libraries will still be required for accurate genome assembly. Their utility is also being extended to functional studies for understanding DNA regulatory elements. Here, we present a detailed method for constructing genomic fosmid libraries, testing for common contaminants, gridding the library to nylon membranes, then hybridizing the library membranes with a radiolabeled probe to identify corresponding genomic clones. While this chapter focuses on fosmid libraries, many of these steps can also be applied to bacterial artificial chromosome libraries.

  3. Reversible Projective Measurement in Quantum Ensembles

    CERN Document Server

    Khitrin, Anatoly; Lee, Jae-Seung

    2010-01-01

    We present experimental NMR demonstration of a scheme of reversible projective measurement, which allows extracting information on outcomes and probabilities of a projective measurement in a non-destructive way, with a minimal net effect on the quantum state of an ensemble. The scheme uses reversible dynamics and weak measurement of the intermediate state. The experimental system is an ensemble of 133Cs (S = 7/2) nuclei in a liquid-crystalline matrix.

  4. Ozone ensemble forecast with machine learning algorithms

    OpenAIRE

    Mallet, Vivien; Stoltz, Gilles; Mauricette, Boris

    2009-01-01

    International audience; We apply machine learning algorithms to perform sequential aggregation of ozone forecasts. The latter rely on a multimodel ensemble built for ozone forecasting with the modeling system Polyphemus. The ensemble simulations are obtained by changes in the physical parameterizations, the numerical schemes, and the input data to the models. The simulations are carried out for summer 2001 over western Europe in order to forecast ozone daily peaks and ozone hourly concentrati...

  5. Cluster Ensemble-based Image Segmentation

    OpenAIRE

    Xiaoru Wang; Junping Du; Shuzhe Wu; Xu Li; Fu Li

    2013-01-01

    Image segmentation is the foundation of computer vision applications. In this paper, we propose a new cluster ensemble-based image segmentation algorithm, which overcomes several problems of traditional methods. We make two main contributions in this paper. First, we introduce the cluster ensemble concept to fuse the segmentation results from different types of visual features effectively, which can deliver a better final result and achieve a much more stable performance for broad categories ...

  6. Querying genomic databases

    Energy Technology Data Exchange (ETDEWEB)

    Baehr, A.; Hagstrom, R.; Joerg, D.; Overbeek, R.

    1991-09-01

    A natural-language interface has been developed that retrieves genomic information by using a simple subset of English. The interface spares the biologist from the task of learning database-specific query languages and computer programming. Currently, the interface deals with the E. coli genome. It can, however, be readily extended and shows promise as a means of easy access to other sequenced genomic databases as well.

  7. Impacts of calibration strategies and ensemble methods on ensemble flood forecasting over Lanjiang basin, Southeast China

    Science.gov (United States)

    Liu, Li; Xu, Yue-Ping

    2017-04-01

    Ensemble flood forecasting driven by numerical weather prediction products is becoming more commonly used in operational flood forecasting applications.In this study, a hydrological ensemble flood forecasting system based on Variable Infiltration Capacity (VIC) model and quantitative precipitation forecasts from TIGGE dataset is constructed for Lanjiang Basin, Southeast China. The impacts of calibration strategies and ensemble methods on the performance of the system are then evaluated.The hydrological model is optimized by parallel programmed ɛ-NSGAII multi-objective algorithm and two respectively parameterized models are determined to simulate daily flows and peak flows coupled with a modular approach.The results indicatethat the ɛ-NSGAII algorithm permits more efficient optimization and rational determination on parameter setting.It is demonstrated that the multimodel ensemble streamflow mean have better skills than the best singlemodel ensemble mean (ECMWF) and the multimodel ensembles weighted on members and skill scores outperform other multimodel ensembles. For typical flood event, it is proved that the flood can be predicted 3-4 days in advance, but the flows in rising limb can be captured with only 1-2 days ahead due to the flash feature. With respect to peak flows selected by Peaks Over Threshold approach, the ensemble means from either singlemodel or multimodels are generally underestimated as the extreme values are smoothed out by ensemble process.

  8. Towards a GME ensemble forecasting system: Ensemble initialization using the breeding technique

    Directory of Open Access Journals (Sweden)

    Jan D. Keller

    2008-12-01

    Full Text Available The quantitative forecast of precipitation requires a probabilistic background particularly with regard to forecast lead times of more than 3 days. As only ensemble simulations can provide useful information of the underlying probability density function, we built a new ensemble forecasting system (GME-EFS based on the GME model of the German Meteorological Service (DWD. For the generation of appropriate initial ensemble perturbations we chose the breeding technique developed by Toth and Kalnay (1993, 1997, which develops perturbations by estimating the regions of largest model error induced uncertainty. This method is applied and tested in the framework of quasi-operational forecasts for a three month period in 2007. The performance of the resulting ensemble forecasts are compared to the operational ensemble prediction systems ECMWF EPS and NCEP GFS by means of ensemble spread of free atmosphere parameters (geopotential and temperature and ensemble skill of precipitation forecasting. This comparison indicates that the GME ensemble forecasting system (GME-EFS provides reasonable forecasts with spread skill score comparable to that of the NCEP GFS. An analysis with the continuous ranked probability score exhibits a lack of resolution for the GME forecasts compared to the operational ensembles. However, with significant enhancements during the 3 month test period, the first results of our work with the GME-EFS indicate possibilities for further development as well as the potential for later operational usage.

  9. Minimal redefinition of the OSV ensemble

    CERN Document Server

    Parvizi, S; Parvizi, Shahrokh; Tavanfar, Alireza

    2005-01-01

    In the interesting conjecture, Z_{BH}=|Z_{top}|^2, proposed by Ooguri, Strominger and Vafa (OSV), the black hole ensemble is a mixed ensemble, and resulting degeneracy of states as obtained from the ensemble inverse-Laplace integration, suffer from prefactors which do not respect the (relevant) electric-magnetic dualities. One idea to overcome this deficiency, as claimed recently, is imposing a nontrivial measure for the ensemble sum. We address this problem and upon a redefinition of the OSV ensemble whose variables are as numerous as the electric potentials, show that for restoring the symmetry no non-Euclidean measure is needful. In detail, we rewrite the OSV free energy as a function of new variables which are combinations of the electric-potentials and the black hole charges. Subsequently the Legendre transformation which bridges between the entropy and the black hole free energy in terms of these variables, points to a generalized ensemble. In this context we will consider all the cases of relevance: sm...

  10. Level density for deformations of the Gaussian orthogonal ensemble

    CERN Document Server

    Bertuola, A C; Hussein, M S; Pato, M P; Sargeant, A J

    2004-01-01

    Formulas are derived for the average level density of deformed, or transition, Gaussian orthogonal random matrix ensembles. After some general considerations about Gaussian ensembles we derive formulas for the average level density for (i) the transition from the Gaussian orthogonal ensemble (GOE) to the Poisson ensemble and (ii) the transition from the GOE to $m$ GOEs.

  11. The classicality and quantumness of a quantum ensemble

    CERN Document Server

    Zhu, Xuanmin; Wu, Shengjun; Liu, Quanhui

    2010-01-01

    In this paper, we investigate the classicality and quantumness of a quantum ensemble. We define a quantity called classicality to characterize how classical a quantum ensemble is. An ensemble of commuting states that can be manipulated classically has a unit classicality, while a general ensemble has a classicality less than 1. We also study how quantum an ensemble is by defining a related quantity called quantumness. We find that the classicality of an ensemble is closely related to how perfectly the ensemble can be cloned, and that the quantumness of an ensemble is essentially responsible for the security of quantum key distribution(QKD) protocols using that ensemble. Furthermore, we show that the quantumness of an ensemble used in a QKD protocol is exactly the attainable lower bound of the error rate in the sifted key.

  12. Ensemble-averaged Rabi oscillations in a ferromagnetic CoFeB film

    Science.gov (United States)

    Capua, Amir; Rettner, Charles; Yang, See-Hun; Phung, Timothy; Parkin, Stuart S. P.

    2017-06-01

    Rabi oscillations describe the process whereby electromagnetic radiation interacts coherently with spin states in a non-equilibrium interaction. To date, Rabi oscillations have not been studied in one of the most common spin ensembles in nature: spins in ferromagnets. Here, using a combination of femtosecond laser pulses and microwave excitations, we report the classical analogue of Rabi oscillations in ensemble-averaged spins of a ferromagnet. The microwave stimuli are shown to extend the coherence-time resulting in resonant spin amplification. The results we present in a dense magnetic system are qualitatively similar to those reported previously in semiconductors which have five orders of magnitude fewer spins and which require resonant optical excitations to spin-polarize the ensemble. Our study is a step towards connecting concepts used in quantum processing with spin-transport effects in ferromagnets. For example, coherent control may become possible without the complications of driving an electromagnetic field but rather by using spin-polarized currents.

  13. Stochastic methods for light propagation and recurrent scattering in saturated and nonsaturated atomic ensembles

    CERN Document Server

    Lee, Mark D; Ruostekoski, Janne

    2016-01-01

    We derive equations for the strongly coupled system of light and dense atomic ensembles. The formalism includes an arbitrary internal level structure for the atoms and is not restricted to weak excitation of atoms by light. In the low light intensity limit for atoms with a single electronic ground state, the full quantum field-theoretical representation of the model can be solved exactly by means of classical stochastic electrodynamics simulations for stationary atoms that represent cold atomic ensembles. Simulations for the optical response of atoms in a quantum degenerate regime require one to synthesize a stochastic ensemble of atomic positions that generates the corresponding quantum statistical position correlations between the atoms. In the case of multiple ground levels or at light intensities where saturation becomes important, the classical simulations require approximations that neglect quantum fluctuations between the levels. We show how the model is extended to incorporate corrections due to quant...

  14. Ensemble postprocessing for probabilistic quantitative precipitation forecasts

    Science.gov (United States)

    Bentzien, S.; Friederichs, P.

    2012-12-01

    Precipitation is one of the most difficult weather variables to predict in hydrometeorological applications. In order to assess the uncertainty inherent in deterministic numerical weather prediction (NWP), meteorological services around the globe develop ensemble prediction systems (EPS) based on high-resolution NWP systems. With non-hydrostatic model dynamics and without parameterization of deep moist convection, high-resolution NWP models are able to describe convective processes in more detail and provide more realistic mesoscale structures. However, precipitation forecasts are still affected by displacement errors, systematic biases and fast error growth on small scales. Probabilistic guidance can be achieved from an ensemble setup which accounts for model error and uncertainty of initial and boundary conditions. The German Meteorological Service (Deutscher Wetterdienst, DWD) provides such an ensemble system based on the German-focused limited-area model COSMO-DE. With a horizontal grid-spacing of 2.8 km, COSMO-DE is the convection-permitting high-resolution part of the operational model chain at DWD. The COSMO-DE-EPS consists of 20 realizations of COSMO-DE, driven by initial and boundary conditions derived from 4 global models and 5 perturbations of model physics. Ensemble systems like COSMO-DE-EPS are often limited with respect to ensemble size due to the immense computational costs. As a consequence, they can be biased and exhibit insufficient ensemble spread, and probabilistic forecasts may be not well calibrated. In this study, probabilistic quantitative precipitation forecasts are derived from COSMO-DE-EPS and evaluated at more than 1000 rain gauges located all over Germany. COSMO-DE-EPS is a frequently updated ensemble system, initialized 8 times a day. We use the time-lagged approach to inexpensively increase ensemble spread, which results in more reliable forecasts especially for extreme precipitation events. Moreover, we will show that statistical

  15. Metal Oxide Gas Sensor Drift Compensation Using a Two-Dimensional Classifier Ensemble

    Directory of Open Access Journals (Sweden)

    Hang Liu

    2015-04-01

    Full Text Available Sensor drift is the most challenging problem in gas sensing at present. We propose a novel two-dimensional classifier ensemble strategy to solve the gas discrimination problem, regardless of the gas concentration, with high accuracy over extended periods of time. This strategy is appropriate for multi-class classifiers that consist of combinations of pairwise classifiers, such as support vector machines. We compare the performance of the strategy with those of competing methods in an experiment based on a public dataset that was compiled over a period of three years. The experimental results demonstrate that the two-dimensional ensemble outperforms the other methods considered. Furthermore, we propose a pre-aging process inspired by that applied to the sensors to improve the stability of the classifier ensemble. The experimental results demonstrate that the weight of each multi-class classifier model in the ensemble remains fairly static before and after the addition of new classifier models to the ensemble, when a pre-aging procedure is applied.

  16. Optimized expanded ensembles for simulations involving molecular insertions and deletions. II. Open systems

    Science.gov (United States)

    Escobedo, Fernando A.

    2007-11-01

    In the Grand Canonical, osmotic, and Gibbs ensembles, chemical potential equilibrium is attained via transfers of molecules between the system and either a reservoir or another subsystem. In this work, the expanded ensemble (EXE) methods described in part I [F. A. Escobedo and F. J. Martínez-Veracoechea, J. Chem. Phys. 127, 174103 (2007)] of this series are extended to these ensembles to overcome the difficulties associated with implementing such whole-molecule transfers. In EXE, such moves occur via a target molecule that undergoes transitions through a number of intermediate coupling states. To minimize the tunneling time between the fully coupled and fully decoupled states, the intermediate states could be either: (i) sampled with an optimal frequency distribution (the sampling problem) or (ii) selected with an optimal spacing distribution (staging problem). The sampling issue is addressed by determining the biasing weights that would allow generating an optimal ensemble; discretized versions of this algorithm (well suited for small number of coupling stages) are also presented. The staging problem is addressed by selecting the intermediate stages in such a way that a flat histogram is the optimized ensemble. The validity of the advocated methods is demonstrated by their application to two model problems, the solvation of large hard spheres into a fluid of small and large spheres, and the vapor-liquid equilibrium of a chain system.

  17. Ensemble DFT Approach to Excited States of Strongly Correlated Molecular Systems.

    Science.gov (United States)

    Filatov, Michael

    2016-01-01

    Ensemble density functional theory (DFT) is a novel time-independent formalism for obtaining excitation energies of many-body fermionic systems. A considerable advantage of ensemble DFT over the more common Kohn-Sham (KS) DFT and time-dependent DFT formalisms is that it enables one to account for strong non-dynamic electron correlation in the ground and excited states of molecular systems in a transparent and accurate fashion. Despite its positive aspects, ensemble DFT has not so far found its way into the repertoire of methods of modern computational chemistry, probably because of the perceived lack of practically affordable implementations of the theory. The spin-restricted ensemble-referenced KS (REKS) method is perhaps the first computationally feasible implementation of the ideas behind ensemble DFT which enables one to describe accurately electronic transitions in a wide class of molecular systems, including strongly correlated molecules (biradicals, molecules undergoing bond breaking/formation), extended π-conjugated systems, donor-acceptor charge transfer adducts, etc.

  18. Perturbation of convection-permitting NWP forecasts for flash-flood ensemble forecasting

    Directory of Open Access Journals (Sweden)

    B. Vincendon

    2011-05-01

    Full Text Available Mediterranean intense weather events often lead to devastating flash-floods. Extending the forecasting lead times further than the watershed response times, implies the use of numerical weather prediction (NWP to drive hydrological models. However, the nature of the precipitating events and the temporal and spatial scales of the watershed response make them difficult to forecast, even using a high-resolution convection-permitting NWP deterministic forecasting. This study proposes a new method to sample the uncertainties of high-resolution NWP precipitation forecasts in order to quantify the predictability of the streamflow forecasts. We have developed a perturbation method based on convection-permitting NWP-model error statistics. It produces short-term precipitation ensemble forecasts from single-value meteorological forecasts. These rainfall ensemble forecasts are then fed into a hydrological model dedicated to flash-flood forecasting to produce ensemble streamflow forecasts. The verification on two flash-flood events shows that this forecasting ensemble performs better than the deterministic forecast. The performance of the precipitation perturbation method has also been found to be broadly as good as that obtained using a state-of-the-art research convection-permitting NWP ensemble, while requiring less computing time.

  19. Uncertainty analysis of hydrological ensemble forecasts in a distributed model utilising short-range rainfall prediction

    Directory of Open Access Journals (Sweden)

    Y. Xuan

    2009-03-01

    Full Text Available Advances in mesoscale numerical weather predication make it possible to provide rainfall forecasts along with many other data fields at increasingly higher spatial resolutions. It is currently possible to incorporate high-resolution NWPs directly into flood forecasting systems in order to obtain an extended lead time. It is recognised, however, that direct application of rainfall outputs from the NWP model can contribute considerable uncertainty to the final river flow forecasts as the uncertainties inherent in the NWP are propagated into hydrological domains and can also be magnified by the scaling process. As the ensemble weather forecast has become operationally available, it is of particular interest to the hydrologist to investigate both the potential and implication of ensemble rainfall inputs to the hydrological modelling systems in terms of uncertainty propagation. In this paper, we employ a distributed hydrological model to analyse the performance of the ensemble flow forecasts based on the ensemble rainfall inputs from a short-range high-resolution mesoscale weather model. The results show that: (1 The hydrological model driven by QPF can produce forecasts comparable with those from a raingauge-driven one; (2 The ensemble hydrological forecast is able to disseminate abundant information with regard to the nature of the weather system and the confidence of the forecast itself; and (3 the uncertainties as well as systematic biases are sometimes significant and, as such, extra effort needs to be made to improve the quality of such a system.

  20. A seasonal hydrologic ensemble prediction system for water resource management

    Science.gov (United States)

    Luo, L.; Wood, E. F.

    2006-12-01

    A seasonal hydrologic ensemble prediction system, developed for the Ohio River basin, has been improved and expanded to several other regions including the Eastern U.S., Africa and East Asia. The prediction system adopts the traditional Extended Streamflow Prediction (ESP) approach, utilizing the VIC (Variable Infiltration Capacity) hydrological model as the central tool for producing ensemble prediction of soil moisture, snow and streamflow with lead times up to 6-month. VIC is forced by observed meteorology to estimate the hydrological initial condition prior to the forecast, but during the forecast period the atmospheric forcing comes from statistically downscaled, seasonal forecast from dynamic climate models. The seasonal hydrologic ensemble prediction system is currently producing realtime seasonal hydrologic forecast for these regions on a monthly basis. Using hindcasts from a 19-year period (1981-1999), during which seasonal hindcasts from NCEP Climate Forecast System (CFS) and European Union DEMETER project are available, we evaluate the performance of the forecast system over our forecast regions. The evaluation shows that the prediction system using the current forecast approach is able to produce reliable and accurate precipitation, soil moisture and streamflow predictions. The overall skill is much higher then the traditional ESP. In particular, forecasts based on multiple climate model forecast are more skillful than single model-based forecast. This emphasizes the significant need for producing seasonal climate forecast with multiple climate models for hydrologic applications. Forecast from this system is expected to provide very valuable information about future hydrologic states and associated risks for end users, including water resource management and financial sectors.

  1. Ensemble data assimilation in the Red Sea: sensitivity to ensemble selection and atmospheric forcing

    KAUST Repository

    Toye, Habib

    2017-05-26

    We present our efforts to build an ensemble data assimilation and forecasting system for the Red Sea. The system consists of the high-resolution Massachusetts Institute of Technology general circulation model (MITgcm) to simulate ocean circulation and of the Data Research Testbed (DART) for ensemble data assimilation. DART has been configured to integrate all members of an ensemble adjustment Kalman filter (EAKF) in parallel, based on which we adapted the ensemble operations in DART to use an invariant ensemble, i.e., an ensemble Optimal Interpolation (EnOI) algorithm. This approach requires only single forward model integration in the forecast step and therefore saves substantial computational cost. To deal with the strong seasonal variability of the Red Sea, the EnOI ensemble is then seasonally selected from a climatology of long-term model outputs. Observations of remote sensing sea surface height (SSH) and sea surface temperature (SST) are assimilated every 3 days. Real-time atmospheric fields from the National Center for Environmental Prediction (NCEP) and the European Center for Medium-Range Weather Forecasts (ECMWF) are used as forcing in different assimilation experiments. We investigate the behaviors of the EAKF and (seasonal-) EnOI and compare their performances for assimilating and forecasting the circulation of the Red Sea. We further assess the sensitivity of the assimilation system to various filtering parameters (ensemble size, inflation) and atmospheric forcing.

  2. Ensemble data assimilation in the Red Sea: sensitivity to ensemble selection and atmospheric forcing

    Science.gov (United States)

    Toye, Habib; Zhan, Peng; Gopalakrishnan, Ganesh; Kartadikaria, Aditya R.; Huang, Huang; Knio, Omar; Hoteit, Ibrahim

    2017-07-01

    We present our efforts to build an ensemble data assimilation and forecasting system for the Red Sea. The system consists of the high-resolution Massachusetts Institute of Technology general circulation model (MITgcm) to simulate ocean circulation and of the Data Research Testbed (DART) for ensemble data assimilation. DART has been configured to integrate all members of an ensemble adjustment Kalman filter (EAKF) in parallel, based on which we adapted the ensemble operations in DART to use an invariant ensemble, i.e., an ensemble Optimal Interpolation (EnOI) algorithm. This approach requires only single forward model integration in the forecast step and therefore saves substantial computational cost. To deal with the strong seasonal variability of the Red Sea, the EnOI ensemble is then seasonally selected from a climatology of long-term model outputs. Observations of remote sensing sea surface height (SSH) and sea surface temperature (SST) are assimilated every 3 days. Real-time atmospheric fields from the National Center for Environmental Prediction (NCEP) and the European Center for Medium-Range Weather Forecasts (ECMWF) are used as forcing in different assimilation experiments. We investigate the behaviors of the EAKF and (seasonal-) EnOI and compare their performances for assimilating and forecasting the circulation of the Red Sea. We further assess the sensitivity of the assimilation system to various filtering parameters (ensemble size, inflation) and atmospheric forcing.

  3. Generation of scenarios from calibrated ensemble forecasts with a dynamic ensemble copula coupling approach

    DEFF Research Database (Denmark)

    Ben Bouallègue, Zied; Heppelmann, Tobias; Theis, Susanne E.

    2015-01-01

    Probabilistic forecasts in the form of ensemble of scenarios are required for complex decision making processes. Ensemble forecasting systems provide such products but the spatio-temporal structures of the forecast uncertainty is lost when statistical calibration of the ensemble forecasts...... is applied for each lead time and location independently. Non-parametric approaches allow the reconstruction of spatio-temporal joint probability distributions at a low computational cost.For example, the ensemble copula coupling (ECC) method consists in rebuilding the multivariate aspect of the forecast...... from the original ensemble forecasts. Based on the assumption of error stationarity, parametric methods aim to fully describe the forecast dependence structures. In this study, the concept of ECC is combined with past data statistics in order to account for the autocorrelation of the forecast error...

  4. Generation of scenarios from calibrated ensemble forecasts with a dual ensemble copula coupling approach

    DEFF Research Database (Denmark)

    Ben Bouallègue, Zied; Heppelmann, Tobias; Theis, Susanne E.

    2016-01-01

    Probabilistic forecasts in the form of ensemble of scenarios are required for complex decision making processes. Ensemble forecasting systems provide such products but the spatio-temporal structures of the forecast uncertainty is lost when statistical calibration of the ensemble forecasts...... is applied for each lead time and location independently. Non-parametric approaches allow the reconstruction of spatio-temporal joint probability distributions at a low computational cost. For example, the ensemble copula coupling (ECC) method rebuilds the multivariate aspect of the forecast from...... the original ensemble forecasts. Based on the assumption of error stationarity, parametric methods aim to fully describe the forecast dependence structures. In this study, the concept of ECC is combined with past data statistics in order to account for the autocorrelation of the forecast error. The new...

  5. Multiscale macromolecular simulation: role of evolving ensembles.

    Science.gov (United States)

    Singharoy, A; Joshi, H; Ortoleva, P J

    2012-10-22

    Multiscale analysis provides an algorithm for the efficient simulation of macromolecular assemblies. This algorithm involves the coevolution of a quasiequilibrium probability density of atomic configurations and the Langevin dynamics of spatial coarse-grained variables denoted order parameters (OPs) characterizing nanoscale system features. In practice, implementation of the probability density involves the generation of constant OP ensembles of atomic configurations. Such ensembles are used to construct thermal forces and diffusion factors that mediate the stochastic OP dynamics. Generation of all-atom ensembles at every Langevin time step is computationally expensive. Here, multiscale computation for macromolecular systems is made more efficient by a method that self-consistently folds in ensembles of all-atom configurations constructed in an earlier step, history, of the Langevin evolution. This procedure accounts for the temporal evolution of these ensembles, accurately providing thermal forces and diffusions. It is shown that efficiency and accuracy of the OP-based simulations is increased via the integration of this historical information. Accuracy improves with the square root of the number of historical timesteps included in the calculation. As a result, CPU usage can be decreased by a factor of 3-8 without loss of accuracy. The algorithm is implemented into our existing force-field based multiscale simulation platform and demonstrated via the structural dynamics of viral capsomers.

  6. Substrate-Specific Reorganization of the Conformational Ensemble of CSK Implicates Novel Modes of Kinase Function

    Science.gov (United States)

    Jamros, Michael A.; Oliveira, Leandro C.; Whitford, Paul C.; Onuchic, José N.; Adams, Joseph A.; Jennings, Patricia A.

    2012-01-01

    Protein kinases use ATP as a phosphoryl donor for the posttranslational modification of signaling targets. It is generally thought that the binding of this nucleotide induces conformational changes leading to closed, more compact forms of the kinase domain that ideally orient active-site residues for efficient catalysis. The kinase domain is oftentimes flanked by additional ligand binding domains that up- or down-regulate catalytic function. C-terminal Src kinase (Csk) is a multidomain tyrosine kinase that is up-regulated by N-terminal SH2 and SH3 domains. Although the X-ray structure of Csk suggests the enzyme is compact, X-ray scattering studies indicate that the enzyme possesses both compact and open conformational forms in solution. Here, we investigated whether interactions with the ATP analog AMP-PNP and ADP can shift the conformational ensemble of Csk in solution using a combination of small angle x-ray scattering and molecular dynamics simulations. We find that binding of AMP-PNP shifts the ensemble towards more extended rather than more compact conformations. Binding of ADP further shifts the ensemble towards extended conformations, including highly extended conformations not adopted by the apo protein, nor by the AMP-PNP bound protein. These ensembles indicate that any compaction of the kinase domain induced by nucleotide binding does not extend to the overall multi-domain architecture. Instead, assembly of an ATP-bound kinase domain generates further extended forms of Csk that may have relevance for kinase scaffolding and Src regulation in the cell. PMID:23028292

  7. Using rainfall thresholds and ensemble precipitation forecasts to issue and improve urban inundation alerts

    Science.gov (United States)

    Yang, Tsun-Hua; Hwang, Gong-Do; Tsai, Chin-Cheng; Ho, Jui-Yi

    2016-11-01

    Urban inundation forecasting with extended lead times is useful in saving lives and property. This study proposes the integration of rainfall thresholds and ensemble precipitation forecasts to provide probabilistic urban inundation forecasts. Utilization of ensemble precipitation forecasts can extend forecast lead times to 72 h, predicting peak flows and to allow response agencies to take necessary preparatory measures. However, ensemble precipitation forecasting is time- and resource-intensive. Using rainfall thresholds to estimate urban areas' inundation risk can decrease this complexity and save computation time. This study evaluated the performance of this system using 352 townships in Taiwan and seven typhoons during the period 2013-2015. The levels of forecast probability needed to issue inundation alerts were addressed because ensemble forecasts are probability based. This study applied six levels of forecast probability and evaluated their performance using five measures. The results showed that this forecasting system performed better before a typhoon made landfall. Geography had a strong impact at the start of the numerical weather modeling, resulting in the underestimation of rainfall forecasts. Regardless of this finding, the inundation forecast performance was highly contingent on the rainfall forecast skill. This study then tested a hybrid approach of on-site observations and rainfall forecasts to decrease the influence of numerical weather predictions and improve the forecast performance. The results of this combined system showed that forecasts with a 24 h lead time improved significantly. These findings and the hybrid approach can be applied to other hydrometeorological early warning systems to improve hazard-related forecasts.

  8. Ensemble Averaged Probability Density Function (APDF) for Compressible Turbulent Reacting Flows

    Science.gov (United States)

    Shih, Tsan-Hsing; Liu, Nan-Suey

    2012-01-01

    In this paper, we present a concept of the averaged probability density function (APDF) for studying compressible turbulent reacting flows. The APDF is defined as an ensemble average of the fine grained probability density function (FG-PDF) with a mass density weighting. It can be used to exactly deduce the mass density weighted, ensemble averaged turbulent mean variables. The transport equation for APDF can be derived in two ways. One is the traditional way that starts from the transport equation of FG-PDF, in which the compressible Navier- Stokes equations are embedded. The resulting transport equation of APDF is then in a traditional form that contains conditional means of all terms from the right hand side of the Navier-Stokes equations except for the chemical reaction term. These conditional means are new unknown quantities that need to be modeled. Another way of deriving the transport equation of APDF is to start directly from the ensemble averaged Navier-Stokes equations. The resulting transport equation of APDF derived from this approach appears in a closed form without any need for additional modeling. The methodology of ensemble averaging presented in this paper can be extended to other averaging procedures: for example, the Reynolds time averaging for statistically steady flow and the Reynolds spatial averaging for statistically homogeneous flow. It can also be extended to a time or spatial filtering procedure to construct the filtered density function (FDF) for the large eddy simulation (LES) of compressible turbulent reacting flows.

  9. An ensemble self-training protein interaction article classifier.

    Science.gov (United States)

    Chen, Yifei; Hou, Ping; Manderick, Bernard

    2014-01-01

    Protein-protein interaction (PPI) is essential to understand the fundamental processes governing cell biology. The mining and curation of PPI knowledge are critical for analyzing proteomics data. Hence it is desired to classify articles PPI-related or not automatically. In order to build interaction article classification systems, an annotated corpus is needed. However, it is usually the case that only a small number of labeled articles can be obtained manually. Meanwhile, a large number of unlabeled articles are available. By combining ensemble learning and semi-supervised self-training, an ensemble self-training interaction classifier called EST_IACer is designed to classify PPI-related articles based on a small number of labeled articles and a large number of unlabeled articles. A biological background based feature weighting strategy is extended using the category information from both labeled and unlabeled data. Moreover, a heuristic constraint is put forward to select optimal instances from unlabeled data to improve the performance further. Experiment results show that the EST_IACer can classify the PPI related articles effectively and efficiently.

  10. IASI Radiance Data Assimilation in Local Ensemble Transform Kalman Filter

    Science.gov (United States)

    Cho, K.; Hyoung-Wook, C.; Jo, Y.

    2016-12-01

    Korea institute of Atmospheric Prediction Systems (KIAPS) is developing NWP model with data assimilation systems. Local Ensemble Transform Kalman Filter (LETKF) system, one of the data assimilation systems, has been developed for KIAPS Integrated Model (KIM) based on cubed-sphere grid and has successfully assimilated real data. LETKF data assimilation system has been extended to 4D- LETKF which considers time-evolving error covariance within assimilation window and IASI radiance data assimilation using KPOP (KIAPS package for observation processing) with RTTOV (Radiative Transfer for TOVS). The LETKF system is implementing semi operational prediction including conventional (sonde, aircraft) observation and AMSU-A (Advanced Microwave Sounding Unit-A) radiance data from April. Recently, the semi operational prediction system updated radiance observations including GPS-RO, AMV, IASI (Infrared Atmospheric Sounding Interferometer) data at July. A set of simulation of KIM with ne30np4 and 50 vertical levels (of top 0.3hPa) were carried out for short range forecast (10days) within semi operation prediction LETKF system with ensemble forecast 50 members. In order to only IASI impact, our experiments used only conventional and IAIS radiance data to same semi operational prediction set. We carried out sensitivity test for IAIS thinning method (3D and 4D). IASI observation number was increased by temporal (4D) thinning and the improvement of IASI radiance data impact on the forecast skill of model will expect.

  11. Ultrastable optical clock with two cold-atom ensembles

    Science.gov (United States)

    Schioppo, M.; Brown, R. C.; McGrew, W. F.; Hinkley, N.; Fasano, R. J.; Beloy, K.; Yoon, T. H.; Milani, G.; Nicolodi, D.; Sherman, J. A.; Phillips, N. B.; Oates, C. W.; Ludlow, A. D.

    2017-01-01

    Atomic clocks based on optical transitions are the most stable, and therefore precise, timekeepers available. These clocks operate by alternating intervals of atomic interrogation with the 'dead' time required for quantum state preparation and readout. This non-continuous interrogation of the atom system results in the Dick effect, an aliasing of frequency noise from the laser interrogating the atomic transition. Despite recent advances in optical clock stability that have been achieved by improving laser coherence, the Dick effect has continually limited the performance of optical clocks. Here we implement a robust solution to overcome this limitation: a zero-dead-time optical clock that is based on the interleaved interrogation of two cold-atom ensembles. This clock exhibits vanishingly small Dick noise, thereby achieving an unprecedented fractional frequency instability assessed to be for an averaging time τ in seconds. We also consider alternate dual-atom-ensemble schemes to extend laser coherence and reduce the standard quantum limit of clock stability, achieving a spectroscopy line quality factor of Q > 4 × 1015.

  12. Control and Synchronization of Neuron Ensembles

    CERN Document Server

    Li, Jr-Shin; Ruths, Justin

    2011-01-01

    Synchronization of oscillations is a phenomenon prevalent in natural, social, and engineering systems. Controlling synchronization of oscillating systems is motivated by a wide range of applications from neurological treatment of Parkinson's disease to the design of neurocomputers. In this article, we study the control of an ensemble of uncoupled neuron oscillators described by phase models. We examine controllability of such a neuron ensemble for various phase models and, furthermore, study the related optimal control problems. In particular, by employing Pontryagin's maximum principle, we analytically derive optimal controls for spiking single- and two-neuron systems, and analyze the applicability of the latter to an ensemble system. Finally, we present a robust computational method for optimal control of spiking neurons based on pseudospectral approximations. The methodology developed here is universal to the control of general nonlinear phase oscillators.

  13. On large deviations for ensembles of distributions

    Science.gov (United States)

    Khrychev, D. A.

    2013-11-01

    The paper is concerned with the large deviations problem in the Freidlin-Wentzell formulation without the assumption of the uniqueness of the solution to the equation involving white noise. In other words, it is assumed that for each \\varepsilon>0 the nonempty set \\mathscr P_\\varepsilon of weak solutions is not necessarily a singleton. Analogues of a number of concepts in the theory of large deviations are introduced for the set \\{\\mathscr P_\\varepsilon,\\,\\varepsilon>0\\}, hereafter referred to as an ensemble of distributions. The ensembles of weak solutions of an n-dimensional stochastic Navier-Stokes system and stochastic wave equation with power-law nonlinearity are shown to be uniformly exponentially tight. An idempotent Wiener process in a Hilbert space and idempotent partial differential equations are defined. The accumulation points in the sense of large deviations of the ensembles in question are shown to be weak solutions of the corresponding idempotent equations. Bibliography: 14 titles.

  14. Cavity cooling of an ensemble spin system.

    Science.gov (United States)

    Wood, Christopher J; Borneman, Troy W; Cory, David G

    2014-02-07

    We describe how sideband cooling techniques may be applied to large spin ensembles in magnetic resonance. Using the Tavis-Cummings model in the presence of a Rabi drive, we solve a Markovian master equation describing the joint spin-cavity dynamics to derive cooling rates as a function of ensemble size. Our calculations indicate that the coupled angular momentum subspaces of a spin ensemble containing roughly 10(11) electron spins may be polarized in a time many orders of magnitude shorter than the typical thermal relaxation time. The described techniques should permit efficient removal of entropy for spin-based quantum information processors and fast polarization of spin samples. The proposed application of a standard technique in quantum optics to magnetic resonance also serves to reinforce the connection between the two fields, which has recently begun to be explored in further detail due to the development of hybrid designs for manufacturing noise-resilient quantum devices.

  15. Characteristic polynomials in real Ginibre ensembles

    Energy Technology Data Exchange (ETDEWEB)

    Akemann, G; Phillips, M J [Department of Mathematical Sciences and BURSt Research Centre, Brunel University West London, UB8 3PH Uxbridge (United Kingdom); Sommers, H-J [Fachbereich Physik, Universitaet Duisburg-Essen, 47048 Duisburg (Germany)], E-mail: Gernot.Akemann@brunel.ac.uk, E-mail: Michael.Phillips@brunel.ac.uk, E-mail: H.J.Sommers@uni-due.de

    2009-01-09

    We calculate the average of two characteristic polynomials for the real Ginibre ensemble of asymmetric random matrices, and its chiral counterpart. Considered as quadratic forms they determine a skew-symmetric kernel from which all complex eigenvalue correlations can be derived. Our results are obtained in a very simple fashion without going to an eigenvalue representation, and are completely new in the chiral case. They hold for Gaussian ensembles which are partly symmetric, with kernels given in terms of Hermite and Laguerre polynomials respectively, depending on an asymmetry parameter. This allows us to interpolate between the maximally asymmetric real Ginibre and the Gaussian orthogonal ensemble, as well as their chiral counterparts. (fast track communication)

  16. Embedded random matrix ensembles in quantum physics

    CERN Document Server

    Kota, V K B

    2014-01-01

    Although used with increasing frequency in many branches of physics, random matrix ensembles are not always sufficiently specific to account for important features of the physical system at hand. One refinement which retains the basic stochastic approach but allows for such features consists in the use of embedded ensembles.  The present text is an exhaustive introduction to and survey of this important field. Starting with an easy-to-read introduction to general random matrix theory, the text then develops the necessary concepts from the beginning, accompanying the reader to the frontiers of present-day research. With some notable exceptions, to date these ensembles have primarily been applied in nuclear spectroscopy. A characteristic example is the use of a random two-body interaction in the framework of the nuclear shell model. Yet, topics in atomic physics, mesoscopic physics, quantum information science and statistical mechanics of isolated finite quantum systems can also be addressed using these ensemb...

  17. Total probabilities of ensemble runoff forecasts

    Science.gov (United States)

    Olav Skøien, Jon; Bogner, Konrad; Salamon, Peter; Smith, Paul; Pappenberger, Florian

    2017-04-01

    Ensemble forecasting has a long history from meteorological modelling, as an indication of the uncertainty of the forecasts. However, it is necessary to calibrate and post-process the ensembles as the they often exhibit both bias and dispersion errors. Two of the most common methods for this are Bayesian Model Averaging (Raftery et al., 2005) and Ensemble Model Output Statistics (EMOS) (Gneiting et al., 2005). There are also methods for regionalizing these methods (Berrocal et al., 2007) and for incorporating the correlation between lead times (Hemri et al., 2013). Engeland and Steinsland Engeland and Steinsland (2014) developed a framework which can estimate post-processing parameters varying in space and time, while giving a spatially and temporally consistent output. However, their method is computationally complex for our larger number of stations, which makes it unsuitable for our purpose. Our post-processing method of the ensembles is developed in the framework of the European Flood Awareness System (EFAS - http://www.efas.eu), where we are making forecasts for whole Europe, and based on observations from around 700 catchments. As the target is flood forecasting, we are also more interested in improving the forecast skill for high-flows rather than in a good prediction of the entire flow regime. EFAS uses a combination of ensemble forecasts and deterministic forecasts from different meteorological forecasters to force a distributed hydrologic model and to compute runoff ensembles for each river pixel within the model domain. Instead of showing the mean and the variability of each forecast ensemble individually, we will now post-process all model outputs to estimate the total probability, the post-processed mean and uncertainty of all ensembles. The post-processing parameters are first calibrated for each calibration location, but we are adding a spatial penalty in the calibration process to force a spatial correlation of the parameters. The penalty takes

  18. Circular β ensembles, CMV representation, characteristic polynomials

    Institute of Scientific and Technical Information of China (English)

    SU ZhongGen

    2009-01-01

    In this note we first briefly review some recent progress in the study of the circular β ensemble on the unit circle, where 0 > 0 is a model parameter. In the special cases β = 1,2 and 4, this ensemble describes the joint probability density of eigenvalues of random orthogonal, unitary and sympletic matrices, respectively. For general β, Killip and Nenciu discovered a five-diagonal sparse matrix model, the CMV representation. This representation is new even in the case β = 2; and it has become a powerful tool for studying the circular β ensemble. We then give an elegant derivation for the moment identities of characteristic polynomials via the link with orthogonal polynomials on the unit circle.

  19. Kinetic theory of nonequilibrium ensembles, irreversible thermodynamics, and generalized hydrodynamics

    CERN Document Server

    Eu, Byung Chan

    2016-01-01

    This book presents the fundamentals of irreversible thermodynamics for nonlinear transport processes in gases and liquids, as well as for generalized hydrodynamics extending the classical hydrodynamics of Navier, Stokes, Fourier, and Fick. Together with its companion volume on relativistic theories, it provides a comprehensive picture of the kinetic theory formulated from the viewpoint of nonequilibrium ensembles in both nonrelativistic and, in Vol. 2, relativistic contexts. Theories of macroscopic irreversible processes must strictly conform to the thermodynamic laws at every step and in all approximations that enter their derivation from the mechanical principles. Upholding this as the inviolable tenet, the author develops theories of irreversible transport processes in fluids (gases or liquids) on the basis of irreversible kinetic equations satisfying the H theorem. They apply regardless of whether the processes are near to or far removed from equilibrium, or whether they are linear or nonlinear with respe...

  20. Memory imperfections in atomic-ensemble-based quantum repeaters

    Science.gov (United States)

    Brask, Jonatan Bohr; Sørensen, Anders Søndberg

    2008-07-01

    Quantum repeaters promise to deliver long-distance entanglement overcoming loss in realistic quantum channels. A promising class of repeaters, based on atomic ensemble quantum memories and linear optics, follows the proposal by L.-M. Duan , Nature (London) 414, 413 (2001). Here we analyze this protocol in terms of a very general model for the quantum memories employed. We derive analytical expressions for scaling of entanglement with memory imperfections, dark counts, loss, and distance, and we apply our results to two specific quantum memory protocols. Our methods apply to any quantum memory with an interaction Hamiltonian at most quadratic in the mode operators and are in principle extendible to more recent modifications of the original proposal of Duan, Lukin, Cirac, and Zoller.

  1. Phase locking a clock oscillator to a coherent atomic ensemble

    CERN Document Server

    Kohlhaas, R; Cantin, E; Aspect, A; Landragin, A; Bouyer, P

    2015-01-01

    The sensitivity of an atomic interferometer increases when the phase evolution of its quantum superposition state is measured over a longer interrogation interval. In practice, a limit is set by the measurement process, which returns not the phase, but its projection in terms of population difference on two energetic levels. The phase interval over which the relation can be inverted is thus limited to the interval $[-\\pi/2,\\pi/2]$; going beyond it introduces an ambiguity in the read out, hence a sensitivity loss. Here, we extend the unambiguous interval to probe the phase evolution of an atomic ensemble using coherence preserving measurements and phase corrections, and demonstrate the phase lock of the clock oscillator to an atomic superposition state. We propose a protocol based on the phase lock to improve atomic clocks under local oscillator noise, and foresee the application to other atomic interferometers such as inertial sensors.

  2. Ensemble Eclipse: A Process for Prefab Development Environment for the Ensemble Project

    Science.gov (United States)

    Wallick, Michael N.; Mittman, David S.; Shams, Khawaja, S.; Bachmann, Andrew G.; Ludowise, Melissa

    2013-01-01

    This software simplifies the process of having to set up an Eclipse IDE programming environment for the members of the cross-NASA center project, Ensemble. It achieves this by assembling all the necessary add-ons and custom tools/preferences. This software is unique in that it allows developers in the Ensemble Project (approximately 20 to 40 at any time) across multiple NASA centers to set up a development environment almost instantly and work on Ensemble software. The software automatically has the source code repositories and other vital information and settings included. The Eclipse IDE is an open-source development framework. The NASA (Ensemble-specific) version of the software includes Ensemble-specific plug-ins as well as settings for the Ensemble project. This software saves developers the time and hassle of setting up a programming environment, making sure that everything is set up in the correct manner for Ensemble development. Existing software (i.e., standard Eclipse) requires an intensive setup process that is both time-consuming and error prone. This software is built once by a single user and tested, allowing other developers to simply download and use the software

  3. Total probabilities of ensemble runoff forecasts

    Science.gov (United States)

    Olav Skøien, Jon; Bogner, Konrad; Salamon, Peter; Smith, Paul; Pappenberger, Florian

    2016-04-01

    Ensemble forecasting has for a long time been used as a method in meteorological modelling to indicate the uncertainty of the forecasts. However, as the ensembles often exhibit both bias and dispersion errors, it is necessary to calibrate and post-process them. Two of the most common methods for this are Bayesian Model Averaging (Raftery et al., 2005) and Ensemble Model Output Statistics (EMOS) (Gneiting et al., 2005). There are also methods for regionalizing these methods (Berrocal et al., 2007) and for incorporating the correlation between lead times (Hemri et al., 2013). Engeland and Steinsland Engeland and Steinsland (2014) developed a framework which can estimate post-processing parameters which are different in space and time, but still can give a spatially and temporally consistent output. However, their method is computationally complex for our larger number of stations, and cannot directly be regionalized in the way we would like, so we suggest a different path below. The target of our work is to create a mean forecast with uncertainty bounds for a large number of locations in the framework of the European Flood Awareness System (EFAS - http://www.efas.eu) We are therefore more interested in improving the forecast skill for high-flows rather than the forecast skill of lower runoff levels. EFAS uses a combination of ensemble forecasts and deterministic forecasts from different forecasters to force a distributed hydrologic model and to compute runoff ensembles for each river pixel within the model domain. Instead of showing the mean and the variability of each forecast ensemble individually, we will now post-process all model outputs to find a total probability, the post-processed mean and uncertainty of all ensembles. The post-processing parameters are first calibrated for each calibration location, but assuring that they have some spatial correlation, by adding a spatial penalty in the calibration process. This can in some cases have a slight negative

  4. Thermalization and Pseudolocality in Extended Quantum Systems

    Science.gov (United States)

    Doyon, Benjamin

    2017-04-01

    Recently, it was understood that modified concepts of locality played an important role in the study of extended quantum systems out of equilibrium, in particular in so-called generalized Gibbs ensembles. In this paper, we rigorously study pseudolocal charges and their involvement in time evolutions and in the thermalization process of arbitrary states with strong enough clustering properties. We show that the densities of pseudolocal charges form a Hilbert space, with inner product determined by thermodynamic susceptibilities. Using this, we define the family of pseudolocal states, which are determined by pseudolocal charges. This family includes thermal Gibbs states at high enough temperatures, as well as (a precise definition of) generalized Gibbs ensembles. We prove that the family of pseudolocal states is preserved by finite time evolution, and that, under certain conditions, the stationary state emerging at infinite time is a generalized Gibbs ensemble with respect to the evolution dynamics. If the evolution dynamics does not admit any conserved pseudolocal charges other than the evolution Hamiltonian, we show that any stationary pseudolocal state with respect to these dynamics is a thermal Gibbs state, and that Gibbs thermalization occurs. The framework is that of translation-invariant states on hypercubic quantum lattices of any dimensionality (including quantum chains) and finite-range Hamiltonians, and does not involve integrability.

  5. Climate change hotspots in the CMIP5 global climate model ensemble

    OpenAIRE

    Diffenbaugh, Noah S; Giorgi, Filippo

    2012-01-01

    We use a statistical metric of multi-dimensional climate change to quantify the emergence of global climate change hotspots in the CMIP5 climate model ensemble. Our hotspot metric extends previous work through the inclusion of extreme seasonal temperature and precipitation, which exert critical influence on climate change impacts. The results identify areas of the Amazon, the Sahel and tropical West Africa, Indonesia, and the Tibetan Plateau as persistent regional climate change hotspots thro...

  6. An Ensemble Approach for Expanding Queries

    Science.gov (United States)

    2012-11-01

    vincristine; thalidomide; painful; cisplatin; oxaliplatin; charcot -marie-tooth disease ; drugs; neuropathy Ensemble expansion child of, asthma, kids...system disorders; peripheral nerve diseases ; peripheral neuropathies; peripheral nervous system disorder; peripheral nervous system disease ...peripheral nerve disease ; peripheral nerve disorders, peripheral nerve disorder Relation expansion offspring, child of, of child, child find

  7. NYYD Ensemble ja Riho Sibul / Anneli Remme

    Index Scriptorium Estoniae

    Remme, Anneli, 1968-

    2001-01-01

    Gavin Bryarsi teos "Jesus' Blood Never Failed Me Yet" NYYD Ensemble'i ja Riho Sibula esituses 27. detsembril Pauluse kirikus Tartus ja 28. detsembril Rootsi- Mihkli kirikus Tallinnas. Kaastegevad Tartu Ülikooli Kammerkoor (Tartus) ja kammerkoor Voces Musicales (Tallinnas). Kunstiline juht Olari Elts

  8. A method for ensemble wildland fire simulation

    Science.gov (United States)

    Mark A. Finney; Isaac C. Grenfell; Charles W. McHugh; Robert C. Seli; Diane Trethewey; Richard D. Stratton; Stuart Brittain

    2011-01-01

    An ensemble simulation system that accounts for uncertainty in long-range weather conditions and two-dimensional wildland fire spread is described. Fuel moisture is expressed based on the energy release component, a US fire danger rating index, and its variation throughout the fire season is modeled using time series analysis of historical weather data. This analysis...

  9. NYYD Ensemble ja Riho Sibul / Anneli Remme

    Index Scriptorium Estoniae

    Remme, Anneli, 1968-

    2001-01-01

    Gavin Bryarsi teos "Jesus' Blood Never Failed Me Yet" NYYD Ensemble'i ja Riho Sibula esituses 27. detsembril Pauluse kirikus Tartus ja 28. detsembril Rootsi- Mihkli kirikus Tallinnas. Kaastegevad Tartu Ülikooli Kammerkoor (Tartus) ja kammerkoor Voces Musicales (Tallinnas). Kunstiline juht Olari Elts

  10. Eigenstate Gibbs ensemble in integrable quantum systems

    Science.gov (United States)

    Nandy, Sourav; Sen, Arnab; Das, Arnab; Dhar, Abhishek

    2016-12-01

    The eigenstate thermalization hypothesis conjectures that for a thermodynamically large system in one of its energy eigenstates, the reduced density matrix describing any finite subsystem is determined solely by a set of relevant conserved quantities. In a chaotic quantum system, only the energy is expected to play that role and hence eigenstates appear locally thermal. Integrable systems, on the other hand, possess an extensive number of such conserved quantities and therefore the reduced density matrix requires specification of all the corresponding parameters (generalized Gibbs ensemble). However, here we show by unbiased statistical sampling of the individual eigenstates with a given finite energy density that the local description of an overwhelming majority of these states of even such an integrable system is actually Gibbs-like, i.e., requires only the energy density of the eigenstate. Rare eigenstates that cannot be represented by the Gibbs ensemble can also be sampled efficiently by our method and their local properties are then shown to be described by appropriately truncated generalized Gibbs ensembles. We further show that the presence of these rare eigenstates differentiates the model from the chaotic case and leads to the system being described by a generalized Gibbs ensemble at long time under a unitary dynamics following a sudden quench, even when the initial state is a typical (Gibbs-like) eigenstate of the prequench Hamiltonian.

  11. Locally Accessible Information from Multipartite Ensembles

    Institute of Scientific and Technical Information of China (English)

    SONG Wei

    2009-01-01

    We present a universal Holevo-like upper bound on the locally accessible information for arbitrary multipartite ensembles.This bound allows us to analyze the indistinguishability of a set of orthogonal states under local operations and classical communication.We also derive the upper bound for the capacity of distributed dense coding with multipartite senders and multipartite receivers.

  12. Canonical Ensemble Model for Black Hole Radiation

    Indian Academy of Sciences (India)

    Jingyi Zhang

    2014-09-01

    In this paper, a canonical ensemble model for the black hole quantum tunnelling radiation is introduced. In this model the probability distribution function corresponding to the emission shell is calculated to second order. The formula of pressure and internal energy of the thermal system is modified, and the fundamental equation of thermodynamics is also discussed.

  13. A Hierarchical Bayes Ensemble Kalman Filter

    Science.gov (United States)

    Tsyrulnikov, Michael; Rakitko, Alexander

    2017-01-01

    A new ensemble filter that allows for the uncertainty in the prior distribution is proposed and tested. The filter relies on the conditional Gaussian distribution of the state given the model-error and predictability-error covariance matrices. The latter are treated as random matrices and updated in a hierarchical Bayes scheme along with the state. The (hyper)prior distribution of the covariance matrices is assumed to be inverse Wishart. The new Hierarchical Bayes Ensemble Filter (HBEF) assimilates ensemble members as generalized observations and allows ordinary observations to influence the covariances. The actual probability distribution of the ensemble members is allowed to be different from the true one. An approximation that leads to a practicable analysis algorithm is proposed. The new filter is studied in numerical experiments with a doubly stochastic one-variable model of "truth". The model permits the assessment of the variance of the truth and the true filtering error variance at each time instance. The HBEF is shown to outperform the EnKF and the HEnKF by Myrseth and Omre (2010) in a wide range of filtering regimes in terms of performance of its primary and secondary filters.

  14. Statistical theory of hierarchical avalanche ensemble

    OpenAIRE

    Olemskoi, Alexander I.

    1999-01-01

    The statistical ensemble of avalanche intensities is considered to investigate diffusion in ultrametric space of hierarchically subordinated avalanches. The stationary intensity distribution and the steady-state current are obtained. The critical avalanche intensity needed to initiate the global avalanche formation is calculated depending on noise intensity. The large time asymptotic for the probability of the global avalanche appearance is derived.

  15. Marking up lattice QCD configurations and ensembles

    CERN Document Server

    Coddington, P; Maynard, C M; Pleiter, D; Yoshié, T

    2007-01-01

    QCDml is an XML-based markup language designed for sharing QCD configurations and ensembles world-wide via the International Lattice Data Grid (ILDG). Based on the latest release, we present key ingredients of the QCDml in order to provide some starting points for colleagues in this community to markup valuable configurations and submit them to the ILDG.

  16. A Theoretical Analysis of Why Hybrid Ensembles Work

    Directory of Open Access Journals (Sweden)

    Kuo-Wei Hsu

    2017-01-01

    Full Text Available Inspired by the group decision making process, ensembles or combinations of classifiers have been found favorable in a wide variety of application domains. Some researchers propose to use the mixture of two different types of classification algorithms to create a hybrid ensemble. Why does such an ensemble work? The question remains. Following the concept of diversity, which is one of the fundamental elements of the success of ensembles, we conduct a theoretical analysis of why hybrid ensembles work, connecting using different algorithms to accuracy gain. We also conduct experiments on classification performance of hybrid ensembles of classifiers created by decision tree and naïve Bayes classification algorithms, each of which is a top data mining algorithm and often used to create non-hybrid ensembles. Therefore, through this paper, we provide a complement to the theoretical foundation of creating and using hybrid ensembles.

  17. Global Ensemble Forecast System (GEFS) [2.5 Deg.

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The Global Ensemble Forecast System (GEFS) is a weather forecast model made up of 21 separate forecasts, or ensemble members. The National Centers for Environmental...

  18. An educational model for ensemble streamflow simulation and uncertainty analysis

    National Research Council Canada - National Science Library

    AghaKouchak, A; Nakhjiri, N; Habib, E

    2013-01-01

    ...) are interconnected. The educational toolbox includes a MATLAB Graphical User Interface (GUI) and an ensemble simulation scheme that can be used for teaching uncertainty analysis, parameter estimation, ensemble simulation and model sensitivity...

  19. Ensemble-based Kalman Filters in Strongly Nonlinear Dynamics

    Institute of Scientific and Technical Information of China (English)

    Zhaoxia PU; Joshua HACKER

    2009-01-01

    This study examines the effectiveness of ensemble Kalman filters in data assimilation with the strongly nonlinear dynamics of the Lorenz-63 model, and in particular their use in predicting the regime transition that occurs when the model jumps from one basin of attraction to the other. Four configurations of the ensemble-based Kalman filtering data assimilation techniques, including the ensemble Kalman filter, ensemble adjustment Kalman filter, ensemble square root filter and ensemble transform Kalman filter, are evaluated with their ability in predicting the regime transition (also called phase transition) and also are compared in terms of their sensitivity to both observational and sampling errors. The sensitivity of each ensemble-based filter to the size of the ensemble is also examined.

  20. Space Applications for Ensemble Detection and Analysis Project

    Data.gov (United States)

    National Aeronautics and Space Administration — Ensemble Detection is both a measurement technique and analysis tool. Like a prism that separates light into spectral bands, an ensemble detector mixes a signal with...

  1. Ensemble-based forecasting at Horns Rev: Ensemble conversion and kernel dressing

    DEFF Research Database (Denmark)

    Pinson, Pierre; Madsen, Henrik

    methodology. In a first stage, ensemble forecasts of meteorological variables are converted to power through a suitable power curve model. The relevance and benefits of employing a newly developed orthogonal fitting method for the power curve model over the traditional least-squares one are discussed...... predictive distributions. Such a methodology has the benefit of yielding predictive distributions that are of increased reliability (in a probabilistic sense) in comparison with the raw ensemble forecasts, while taking advantage of their high resolution....... of probabilistic forecasts, the resolution of which may be maximized by using meteorological ensemble predictions as input. The paper concentrates on the test case of the Horns Rev wind farm over a period of approximately one year, in order to describe, apply and discuss a complete ensemble-based forecasting...

  2. Theory and Practice of Phase-aware Ensemble Forecasting

    Science.gov (United States)

    Schulte, J. A.; Georgas, N.

    2016-12-01

    The timing of events represents a source of uncertainty in ensemble forecasting that can produce misleading ensemble statistics. A general theory is presented to overcome drawbacks of traditional ensemble forecasting statistics that perform poorly in the presence of timing disagreements among ensemble members. It was shown, in particular, that ensemble forecasts containing substantial uncertainty in timing can produce non-trivial higher-order statistical moments, rendering the ensemble mean inappropriate as a best available estimate of the future state of the forecast parameter in question. A set of theoretical experiments showed that the existence of large timing differences among ensemble members can produce negative ensemble skewness even when the ensemble members are sinusoids whose amplitudes are drawn from a normal distribution: Consistently, the ensemble mean will tend to fall on the left tail of the normal distribution representing the originally sampled amplitudes, rather than at the mean or median. To remedy the left-tail placement problem of the ensemble mean, a new generally applicable ensemble statistic - the phase-aware ensemble mean - is proposed that is more robust against ensemble skewness resulting from timing spread. The computation of the phase-aware mean involves the transformation of all ensemble members to wavelet space and the subsequent inverse wavelet transformation of the product of the ensemble mean wavelet phase and modulus back to the time domain. The new methods were applied to storm surge reforecasts for Hurricane Irene and Sandy at 8 stations located around the New York City metropolitan area. The phase-aware ensemble mean was found to perform better at detecting the magnitude of events compared to the traditional ensemble mean, consistent with the results from theoretical experiments. The ensemble mean, moreover, was found to be consistently located on the left tail of distributions representing future peak storm surge outcomes. A

  3. Quantum canonical ensemble: A projection operator approach

    Science.gov (United States)

    Magnus, Wim; Lemmens, Lucien; Brosens, Fons

    2017-09-01

    Knowing the exact number of particles N, and taking this knowledge into account, the quantum canonical ensemble imposes a constraint on the occupation number operators. The constraint particularly hampers the systematic calculation of the partition function and any relevant thermodynamic expectation value for arbitrary but fixed N. On the other hand, fixing only the average number of particles, one may remove the above constraint and simply factorize the traces in Fock space into traces over single-particle states. As is well known, that would be the strategy of the grand-canonical ensemble which, however, comes with an additional Lagrange multiplier to impose the average number of particles. The appearance of this multiplier can be avoided by invoking a projection operator that enables a constraint-free computation of the partition function and its derived quantities in the canonical ensemble, at the price of an angular or contour integration. Introduced in the recent past to handle various issues related to particle-number projected statistics, the projection operator approach proves beneficial to a wide variety of problems in condensed matter physics for which the canonical ensemble offers a natural and appropriate environment. In this light, we present a systematic treatment of the canonical ensemble that embeds the projection operator into the formalism of second quantization while explicitly fixing N, the very number of particles rather than the average. Being applicable to both bosonic and fermionic systems in arbitrary dimensions, transparent integral representations are provided for the partition function ZN and the Helmholtz free energy FN as well as for two- and four-point correlation functions. The chemical potential is not a Lagrange multiplier regulating the average particle number but can be extracted from FN+1 -FN, as illustrated for a two-dimensional fermion gas.

  4. Flood Forecasting Based on TIGGE Precipitation Ensemble Forecast

    OpenAIRE

    Jinyin Ye; Yuehong Shao; Zhijia Li

    2016-01-01

    TIGGE (THORPEX International Grand Global Ensemble) was a major part of the THORPEX (Observing System Research and Predictability Experiment). It integrates ensemble precipitation products from all the major forecast centers in the world and provides systematic evaluation on the multimodel ensemble prediction system. Development of meteorologic-hydrologic coupled flood forecasting model and early warning model based on the TIGGE precipitation ensemble forecast can provide flood probability fo...

  5. Ensembles of signal transduction models using Pareto Optimal Ensemble Techniques (POETs).

    Science.gov (United States)

    Song, Sang Ok; Chakrabarti, Anirikh; Varner, Jeffrey D

    2010-07-01

    Mathematical modeling of complex gene expression programs is an emerging tool for understanding disease mechanisms. However, identification of large models sometimes requires training using qualitative, conflicting or even contradictory data sets. One strategy to address this challenge is to estimate experimentally constrained model ensembles using multiobjective optimization. In this study, we used Pareto Optimal Ensemble Techniques (POETs) to identify a family of proof-of-concept signal transduction models. POETs integrate Simulated Annealing (SA) with Pareto optimality to identify models near the optimal tradeoff surface between competing training objectives. We modeled a prototypical-signaling network using mass-action kinetics within an ordinary differential equation (ODE) framework (64 ODEs in total). The true model was used to generate synthetic immunoblots from which the POET algorithm identified the 117 unknown model parameters. POET generated an ensemble of signaling models, which collectively exhibited population-like behavior. For example, scaled gene expression levels were approximately normally distributed over the ensemble following the addition of extracellular ligand. Also, the ensemble recovered robust and fragile features of the true model, despite significant parameter uncertainty. Taken together, these results suggest that experimentally constrained model ensembles could capture qualitatively important network features without exact parameter information.

  6. Data assimilation in integrated hydrological modeling using ensemble Kalman filtering

    DEFF Research Database (Denmark)

    Rasmussen, Jørn; Madsen, H.; Jensen, Karsten Høgh

    2015-01-01

    Groundwater head and stream discharge is assimilated using the ensemble transform Kalman filter in an integrated hydrological model with the aim of studying the relationship between the filter performance and the ensemble size. In an attempt to reduce the required number of ensemble members...

  7. Exploring and Listening to Chinese Classical Ensembles in General Music

    Science.gov (United States)

    Zhang, Wenzhuo

    2017-01-01

    Music diversity is valued in theory, but the extent to which it is efficiently presented in music class remains limited. Within this article, I aim to bridge this gap by introducing four genres of Chinese classical ensembles--Qin and Xiao duets, Jiang Nan bamboo and silk ensembles, Cantonese ensembles, and contemporary Chinese orchestras--into the…

  8. Data assimilation in integrated hydrological modeling using ensemble Kalman filtering

    DEFF Research Database (Denmark)

    Rasmussen, Jørn; Madsen, H.; Jensen, Karsten Høgh;

    2015-01-01

    Groundwater head and stream discharge is assimilated using the ensemble transform Kalman filter in an integrated hydrological model with the aim of studying the relationship between the filter performance and the ensemble size. In an attempt to reduce the required number of ensemble members...

  9. On generalized extending modules

    Institute of Scientific and Technical Information of China (English)

    ZENG Qing-yi

    2007-01-01

    A module M is called generalized extending if for any submodule N of M, there is a direct summand K of M such that N≤K and K/N is singular. Any extending module and any singular module are generalized extending. Any homomorphic image of a generalized extending module is generalized extending. Any direct sum of a singular (uniform) module and a semi-simple module is generalized extending. A ring R is a right Co-H-ring ifand only ifall right R modules are generalized extending modules.

  10. The role of ensemble post-processing for modeling the ensemble tail

    Science.gov (United States)

    Van De Vyver, Hans; Van Schaeybroeck, Bert; Vannitsem, Stéphane

    2016-04-01

    The past decades the numerical weather prediction community has witnessed a paradigm shift from deterministic to probabilistic forecast and state estimation (Buizza and Leutbecher, 2015; Buizza et al., 2008), in an attempt to quantify the uncertainties associated with initial-condition and model errors. An important benefit of a probabilistic framework is the improved prediction of extreme events. However, one may ask to what extent such model estimates contain information on the occurrence probability of extreme events and how this information can be optimally extracted. Different approaches have been proposed and applied on real-world systems which, based on extreme value theory, allow the estimation of extreme-event probabilities conditional on forecasts and state estimates (Ferro, 2007; Friederichs, 2010). Using ensemble predictions generated with a model of low dimensionality, a thorough investigation is presented quantifying the change of predictability of extreme events associated with ensemble post-processing and other influencing factors including the finite ensemble size, lead time and model assumption and the use of different covariates (ensemble mean, maximum, spread...) for modeling the tail distribution. Tail modeling is performed by deriving extreme-quantile estimates using peak-over-threshold representation (generalized Pareto distribution) or quantile regression. Common ensemble post-processing methods aim to improve mostly the ensemble mean and spread of a raw forecast (Van Schaeybroeck and Vannitsem, 2015). Conditional tail modeling, on the other hand, is a post-processing in itself, focusing on the tails only. Therefore, it is unclear how applying ensemble post-processing prior to conditional tail modeling impacts the skill of extreme-event predictions. This work is investigating this question in details. Buizza, Leutbecher, and Isaksen, 2008: Potential use of an ensemble of analyses in the ECMWF Ensemble Prediction System, Q. J. R. Meteorol

  11. Broad range of 2050 warming from an observationally constrained large climate model ensemble

    Science.gov (United States)

    Rowlands, Daniel J.; Frame, David J.; Ackerley, Duncan; Aina, Tolu; Booth, Ben B. B.; Christensen, Carl; Collins, Matthew; Faull, Nicholas; Forest, Chris E.; Grandey, Benjamin S.; Gryspeerdt, Edward; Highwood, Eleanor J.; Ingram, William J.; Knight, Sylvia; Lopez, Ana; Massey, Neil; McNamara, Frances; Meinshausen, Nicolai; Piani, Claudio; Rosier, Suzanne M.; Sanderson, Benjamin M.; Smith, Leonard A.; Stone, Dáithí A.; Thurston, Milo; Yamazaki, Kuniko; Hiro Yamazaki, Y.; Allen, Myles R.

    2012-04-01

    Incomplete understanding of three aspects of the climate system--equilibrium climate sensitivity, rate of ocean heat uptake and historical aerosol forcing--and the physical processes underlying them lead to uncertainties in our assessment of the global-mean temperature evolution in the twenty-first century. Explorations of these uncertainties have so far relied on scaling approaches, large ensembles of simplified climate models, or small ensembles of complex coupled atmosphere-ocean general circulation models which under-represent uncertainties in key climate system properties derived from independent sources. Here we present results from a multi-thousand-member perturbed-physics ensemble of transient coupled atmosphere-ocean general circulation model simulations. We find that model versions that reproduce observed surface temperature changes over the past 50 years show global-mean temperature increases of 1.4-3K by 2050, relative to 1961-1990, under a mid-range forcing scenario. This range of warming is broadly consistent with the expert assessment provided by the Intergovernmental Panel on Climate Change Fourth Assessment Report, but extends towards larger warming than observed in ensembles-of-opportunity typically used for climate impact assessments. From our simulations, we conclude that warming by the middle of the twenty-first century that is stronger than earlier estimates is consistent with recent observed temperature changes and a mid-range `no mitigation' scenario for greenhouse-gas emissions.

  12. Ensemble of classifiers for confidence-rated classification of NDE signal

    Science.gov (United States)

    Banerjee, Portia; Safdarnejad, Seyed; Udpa, Lalita; Udpa, Satish

    2016-02-01

    Ensemble of classifiers in general, aims to improve classification accuracy by combining results from multiple weak hypotheses into a single strong classifier through weighted majority voting. Improved versions of ensemble of classifiers generate self-rated confidence scores which estimate the reliability of each of its prediction and boost the classifier using these confidence-rated predictions. However, such a confidence metric is based only on the rate of correct classification. In existing works, although ensemble of classifiers has been widely used in computational intelligence, the effect of all factors of unreliability on the confidence of classification is highly overlooked. With relevance to NDE, classification results are affected by inherent ambiguity of classifica-tion, non-discriminative features, inadequate training samples and noise due to measurement. In this paper, we extend the existing ensemble classification by maximizing confidence of every classification decision in addition to minimizing the classification error. Initial results of the approach on data from eddy current inspection show improvement in classification performance of defect and non-defect indications.

  13. Forecasting European cold waves based on subsampling strategies of CMIP5 and Euro-CORDEX ensembles

    Science.gov (United States)

    Cordero-Llana, Laura; Braconnot, Pascale; Vautard, Robert; Vrac, Mathieu; Jezequel, Aglae

    2016-04-01

    Forecasting future extreme events under the present changing climate represents a difficult task. Currently there are a large number of ensembles of simulations for climate projections that take in account different models and scenarios. However, there is a need for reducing the size of the ensemble to make the interpretation of these simulations more manageable for impact studies or climate risk assessment. This can be achieved by developing subsampling strategies to identify a limited number of simulations that best represent the ensemble. In this study, cold waves are chosen to test different approaches for subsampling available simulations. The definition of cold waves depends on the criteria used, but they are generally defined using a minimum temperature threshold, the duration of the cold spell as well as their geographical extend. These climate indicators are not universal, highlighting the difficulty of directly comparing different studies. As part of the of the CLIPC European project, we use daily surface temperature data obtained from CMIP5 outputs as well as Euro-CORDEX simulations to predict future cold waves events in Europe. From these simulations a clustering method is applied to minimise the number of ensembles required. Furthermore, we analyse the different uncertainties that arise from the different model characteristics and definitions of climate indicators. Finally, we will test if the same subsampling strategy can be used for different climate indicators. This will facilitate the use of the subsampling results for a wide number of impact assessment studies.

  14. Improving medium-range ensemble streamflow forecasts through statistical post-processing

    Science.gov (United States)

    Mendoza, Pablo; Wood, Andy; Clark, Elizabeth; Nijssen, Bart; Clark, Martyn; Ramos, Maria-Helena; Nowak, Kenneth; Arnold, Jeffrey

    2017-04-01

    Probabilistic hydrologic forecasts are a powerful source of information for decision-making in water resources operations. A common approach is the hydrologic model-based generation of streamflow forecast ensembles, which can be implemented to account for different sources of uncertainties - e.g., from initial hydrologic conditions (IHCs), weather forecasts, and hydrologic model structure and parameters. In practice, hydrologic ensemble forecasts typically have biases and spread errors stemming from errors in the aforementioned elements, resulting in a degradation of probabilistic properties. In this work, we compare several statistical post-processing techniques applied to medium-range ensemble streamflow forecasts obtained with the System for Hydromet Applications, Research and Prediction (SHARP). SHARP is a fully automated prediction system for the assessment and demonstration of short-term to seasonal streamflow forecasting applications, developed by the National Center for Atmospheric Research, University of Washington, U.S. Army Corps of Engineers, and U.S. Bureau of Reclamation. The suite of post-processing techniques includes linear blending, quantile mapping, extended logistic regression, quantile regression, ensemble analogs, and the generalized linear model post-processor (GLMPP). We assess and compare these techniques using multi-year hindcasts in several river basins in the western US. This presentation discusses preliminary findings about the effectiveness of the techniques for improving probabilistic skill, reliability, discrimination, sharpness and resolution.

  15. Refining Disordered Peptide Ensembles with Computational Amide I Spectroscopy: Application to Elastin-Like Peptides

    Science.gov (United States)

    Reppert, Mike; Roy, Anish R.; Tempkin, Jeremy O. B.; Dinner, Aaron R.; Tokmakoff, Andrei

    2017-01-01

    The characterization of intrinsically disordered protein (IDP) ensembles is complicated both by inherent heterogeneity and by the fact that many common experimental techniques function poorly when applied to IDPs. For this reason, the development of alternative structural tools for probing IDP ensembles has attracted considerable attention. Here we describe our recent work in developing experimental and computational tools for characterizing IDP ensembles using Amide I (backbone carbonyl stretch) vibrational spectroscopy. In this approach, the infrared (IR) absorption frequencies of isotope-labeled amide bonds probe their local electrostatic environments and structures. Empirical frequency maps allow us to use this spectroscopic data as a direct experimental test of atomistic structural models. We apply these methods to a family of short elastin-like peptides (ELPs), fragments of the elastin protein based around the Pro-Gly turn motif characteristic of the elastomeric segments of the full protein. Using a maximum entropy analysis of experimental spectra on the basis of predicted spectra from molecular dynamics (MD) ensembles, we find that peptides with Ala or Val sidechains preceding the Pro-Gly turn unit exhibit a stronger tendency toward extended structures than do Gly-Pro-Gly motifs, suggesting an important role for steric interactions in tuning the molecular properties of elastin. PMID:27736076

  16. Demonstrating the value of larger ensembles in forecasting physical systems

    Directory of Open Access Journals (Sweden)

    Reason L. Machete

    2016-12-01

    Full Text Available Ensemble simulation propagates a collection of initial states forward in time in a Monte Carlo fashion. Depending on the fidelity of the model and the properties of the initial ensemble, the goal of ensemble simulation can range from merely quantifying variations in the sensitivity of the model all the way to providing actionable probability forecasts of the future. Whatever the goal is, success depends on the properties of the ensemble, and there is a longstanding discussion in meteorology as to the size of initial condition ensemble most appropriate for Numerical Weather Prediction. In terms of resource allocation: how is one to divide finite computing resources between model complexity, ensemble size, data assimilation and other components of the forecast system. One wishes to avoid undersampling information available from the model's dynamics, yet one also wishes to use the highest fidelity model available. Arguably, a higher fidelity model can better exploit a larger ensemble; nevertheless it is often suggested that a relatively small ensemble, say ~16 members, is sufficient and that larger ensembles are not an effective investment of resources. This claim is shown to be dubious when the goal is probabilistic forecasting, even in settings where the forecast model is informative but imperfect. Probability forecasts for a ‘simple’ physical system are evaluated at different lead times; ensembles of up to 256 members are considered. The pure density estimation context (where ensemble members are drawn from the same underlying distribution as the target differs from the forecasting context, where one is given a high fidelity (but imperfect model. In the forecasting context, the information provided by additional members depends also on the fidelity of the model, the ensemble formation scheme (data assimilation, the ensemble interpretation and the nature of the observational noise. The effect of increasing the ensemble size is quantified by

  17. Ensemble Forecasting of Major Solar Flares

    CERN Document Server

    Guerra, J A; Uritsky, V M

    2015-01-01

    We present the results from the first ensemble prediction model for major solar flares (M and X classes). Using the probabilistic forecasts from three models hosted at the Community Coordinated Modeling Center (NASA-GSFC) and the NOAA forecasts, we developed an ensemble forecast by linearly combining the flaring probabilities from all four methods. Performance-based combination weights were calculated using a Monte Carlo-type algorithm by applying a decision threshold $P_{th}$ to the combined probabilities and maximizing the Heidke Skill Score (HSS). Using the probabilities and events time series from 13 recent solar active regions (2012 - 2014), we found that a linear combination of probabilities can improve both probabilistic and categorical forecasts. Combination weights vary with the applied threshold and none of the tested individual forecasting models seem to provide more accurate predictions than the others for all values of $P_{th}$. According to the maximum values of HSS, a performance-based weights ...

  18. Quantum data compression of a qubit ensemble.

    Science.gov (United States)

    Rozema, Lee A; Mahler, Dylan H; Hayat, Alex; Turner, Peter S; Steinberg, Aephraim M

    2014-10-17

    Data compression is a ubiquitous aspect of modern information technology, and the advent of quantum information raises the question of what types of compression are feasible for quantum data, where it is especially relevant given the extreme difficulty involved in creating reliable quantum memories. We present a protocol in which an ensemble of quantum bits (qubits) can in principle be perfectly compressed into exponentially fewer qubits. We then experimentally implement our algorithm, compressing three photonic qubits into two. This protocol sheds light on the subtle differences between quantum and classical information. Furthermore, since data compression stores all of the available information about the quantum state in fewer physical qubits, it could allow for a vast reduction in the amount of quantum memory required to store a quantum ensemble, making even today's limited quantum memories far more powerful than previously recognized.

  19. Rotationally invariant ensembles of integrable matrices.

    Science.gov (United States)

    Yuzbashyan, Emil A; Shastry, B Sriram; Scaramazza, Jasen A

    2016-05-01

    We construct ensembles of random integrable matrices with any prescribed number of nontrivial integrals and formulate integrable matrix theory (IMT)-a counterpart of random matrix theory (RMT) for quantum integrable models. A type-M family of integrable matrices consists of exactly N-M independent commuting N×N matrices linear in a real parameter. We first develop a rotationally invariant parametrization of such matrices, previously only constructed in a preferred basis. For example, an arbitrary choice of a vector and two commuting Hermitian matrices defines a type-1 family and vice versa. Higher types similarly involve a random vector and two matrices. The basis-independent formulation allows us to derive the joint probability density for integrable matrices, similar to the construction of Gaussian ensembles in the RMT.

  20. Face Recognition using Optimal Representation Ensemble

    CERN Document Server

    Li, Hanxi; Gao, Yongsheng

    2011-01-01

    Recently, the face recognizers based on linear representations have been shown to deliver state-of-the-art performance. In real-world applications, however, face images usually suffer from expressions, disguises and random occlusions. The problematic facial parts undermine the validity of the linear-subspace assumption and thus the recognition performance deteriorates significantly. In this work, we address the problem in a learning-inference-mixed fashion. By observing that the linear-subspace assumption is more reliable on certain face patches rather than on the holistic face, some Bayesian Patch Representations (BPRs) are randomly generated and interpreted according to the Bayes' theory. We then train an ensemble model over the patch-representations by minimizing the empirical risk w.r.t the "leave-one-out margins". The obtained model is termed Optimal Representation Ensemble (ORE), since it guarantees the optimality from the perspective of Empirical Risk Minimization. To handle the unknown patterns in tes...

  1. Staying thermal with Hartree ensemble approximations

    Energy Technology Data Exchange (ETDEWEB)

    Salle, Mischa E-mail: msalle@science.uva.nl; Smit, Jan E-mail: jsmit@science.uva.nl; Vink, Jeroen C. E-mail: jcvink@science.uva.nl

    2002-03-25

    We study thermal behavior of a recently introduced Hartree ensemble approximation, which allows for non-perturbative inhomogeneous field configurations as well as for approximate thermalization, in the phi (cursive,open) Greek{sup 4} model in 1+1 dimensions. Using ensembles with a free field thermal distribution as out-of-equilibrium initial conditions we determine thermalization time scales. The time scale for which the system stays in approximate quantum thermal equilibrium is an indication of the time scales for which the approximation method stays reasonable. This time scale turns out to be two orders of magnitude larger than the time scale for thermalization, in the range of couplings and temperatures studied. We also discuss simplifications of our method which are numerically more efficient and make a comparison with classical dynamics.

  2. Entanglement in a Solid State Spin Ensemble

    CERN Document Server

    Simmons, Stephanie; Riemann, Helge; Abrosimov, Nikolai V; Becker, Peter; Pohl, Hans-Joachim; Thewalt, Mike L W; Itoh, Kohei M; Morton, John J L

    2010-01-01

    Entanglement is the quintessential quantum phenomenon and a necessary ingredient in most emerging quantum technologies, including quantum repeaters, quantum information processing (QIP) and the strongest forms of quantum cryptography. Spin ensembles, such as those in liquid state nuclear magnetic resonance, have been powerful in the development of quantum control methods, however, these demonstrations contained no entanglement and ultimately constitute classical simulations of quantum algorithms. Here we report the on-demand generation of entanglement between an ensemble of electron and nuclear spins in isotopically engineered phosphorus-doped silicon. We combined high field/low temperature electron spin resonance (3.4 T, 2.9 K) with hyperpolarisation of the 31P nuclear spin to obtain an initial state of sufficient purity to create a non-classical, inseparable state. The state was verified using density matrix tomography based on geometric phase gates, and had a fidelity of 98% compared with the ideal state a...

  3. Dysonian dynamics of the Ginibre ensemble.

    Science.gov (United States)

    Burda, Zdzislaw; Grela, Jacek; Nowak, Maciej A; Tarnowski, Wojciech; Warchoł, Piotr

    2014-09-05

    We study the time evolution of Ginibre matrices whose elements undergo Brownian motion. The non-Hermitian character of the Ginibre ensemble binds the dynamics of eigenvalues to the evolution of eigenvectors in a nontrivial way, leading to a system of coupled nonlinear equations resembling those for turbulent systems. We formulate a mathematical framework allowing simultaneous description of the flow of eigenvalues and eigenvectors, and we unravel a hidden dynamics as a function of a new complex variable, which in the standard description is treated as a regulator only. We solve the evolution equations for large matrices and demonstrate that the nonanalytic behavior of the Green's functions is associated with a shock wave stemming from a Burgers-like equation describing correlations of eigenvectors. We conjecture that the hidden dynamics that we observe for the Ginibre ensemble is a general feature of non-Hermitian random matrix models and is relevant to related physical applications.

  4. Rotationally invariant ensembles of integrable matrices

    Science.gov (United States)

    Yuzbashyan, Emil A.; Shastry, B. Sriram; Scaramazza, Jasen A.

    2016-05-01

    We construct ensembles of random integrable matrices with any prescribed number of nontrivial integrals and formulate integrable matrix theory (IMT)—a counterpart of random matrix theory (RMT) for quantum integrable models. A type-M family of integrable matrices consists of exactly N -M independent commuting N ×N matrices linear in a real parameter. We first develop a rotationally invariant parametrization of such matrices, previously only constructed in a preferred basis. For example, an arbitrary choice of a vector and two commuting Hermitian matrices defines a type-1 family and vice versa. Higher types similarly involve a random vector and two matrices. The basis-independent formulation allows us to derive the joint probability density for integrable matrices, similar to the construction of Gaussian ensembles in the RMT.

  5. Eigenstate Gibbs Ensemble in Integrable Quantum Systems

    CERN Document Server

    Nandy, Sourav; Das, Arnab; Dhar, Abhishek

    2016-01-01

    The Eigenstate Thermalization Hypothesis implies that for a thermodynamically large system in one of its eigenstates, the reduced density matrix describing any finite subsystem is determined solely by a set of {\\it relevant} conserved quantities. In a generic system, only the energy plays that role and hence eigenstates appear locally thermal. Integrable systems, on the other hand, possess an extensive number of such conserved quantities and hence the reduced density matrix requires specification of an infinite number of parameters (Generalized Gibbs Ensemble). However, here we show by unbiased statistical sampling of the individual eigenstates with a given finite energy density, that the local description of an overwhelming majority of these states of even such an integrable system is actually Gibbs-like, i.e. requires only the energy density of the eigenstate. Rare eigenstates that cannot be represented by the Gibbs ensemble can also be sampled efficiently by our method and their local properties are then s...

  6. ABCD of Beta Ensembles and Topological Strings

    CERN Document Server

    Krefl, Daniel

    2012-01-01

    We study beta-ensembles with Bn, Cn, and Dn eigenvalue measure and their relation with refined topological strings. Our results generalize the familiar connections between local topological strings and matrix models leading to An measure, and illustrate that all those classical eigenvalue ensembles, and their topological string counterparts, are related one to another via various deformations and specializations, quantum shifts and discrete quotients. We review the solution of the Gaussian models via Macdonald identities, and interpret them as conifold theories. The interpolation between the various models is plainly apparent in this case. For general polynomial potential, we calculate the partition function in the multi-cut phase in a perturbative fashion, beyond tree-level in the large-N limit. The relation to refined topological string orientifolds on the corresponding local geometry is discussed along the way.

  7. Support Vector Machine Ensemble Based on Genetic Algorithm

    Institute of Scientific and Technical Information of China (English)

    LI Ye; YIN Ru-po; CAI Yun-ze; XU Xiao-ming

    2006-01-01

    Support vector machines (SVMs) have been introduced as effective methods for solving classification problems.However, due to some limitations in practical applications,their generalization performance is sometimes far from the expected level. Therefore, it is meaningful to study SVM ensemble learning. In this paper, a novel genetic algorithm based ensemble learning method, namely Direct Genetic Ensemble (DGE), is proposed. DGE adopts the predictive accuracy of ensemble as the fitness function and searches a good ensemble from the ensemble space. In essence, DGE is also a selective ensemble learning method because the base classifiers of the ensemble are selected according to the solution of genetic algorithm. In comparison with other ensemble learning methods, DGE works on a higher level and is more direct. Different strategies of constructing diverse base classifiers can be utilized in DGE.Experimental results show that SVM ensembles constructed by DGE can achieve better performance than single SVMs,bagged and boosted SVM ensembles. In addition, some valuable conclusions are obtained.

  8. Various multistage ensembles for prediction of heating energy consumption

    Directory of Open Access Journals (Sweden)

    Radisa Jovanovic

    2015-04-01

    Full Text Available Feedforward neural network models are created for prediction of daily heating energy consumption of a NTNU university campus Gloshaugen using actual measured data for training and testing. Improvement of prediction accuracy is proposed by using neural network ensemble. Previously trained feed-forward neural networks are first separated into clusters, using k-means algorithm, and then the best network of each cluster is chosen as member of an ensemble. Two conventional averaging methods for obtaining ensemble output are applied; simple and weighted. In order to achieve better prediction results, multistage ensemble is investigated. As second level, adaptive neuro-fuzzy inference system with various clustering and membership functions are used to aggregate the selected ensemble members. Feedforward neural network in second stage is also analyzed. It is shown that using ensemble of neural networks can predict heating energy consumption with better accuracy than the best trained single neural network, while the best results are achieved with multistage ensemble.

  9. Spatially Coupled Ensembles Universally Achieve Capacity under Belief Propagation

    CERN Document Server

    Kudekar, Shrinivas; Urbanke, Ruediger

    2012-01-01

    We investigate spatially coupled code ensembles. For transmission over the binary erasure channel, it was recently shown that spatial coupling increases the belief propagation threshold of the ensemble to essentially the maximum a-priori threshold of the underlying component ensemble. This explains why convolutional LDPC ensembles, originally introduced by Felstrom and Zigangirov, perform so well over this channel. We show that the equivalent result holds true for transmission over general binary-input memoryless output-symmetric channels. More precisely, given a desired error probability and a gap to capacity, we can construct a spatially coupled ensemble which fulfills these constraints universally on this class of channels under belief propagation decoding. In fact, most codes in that ensemble have that property. The quantifier universal refers to the single ensemble/code which is good for all channels but we assume that the channel is known at the receiver. The key technical result is a proof that under b...

  10. Analysis and optimization of weighted ensemble sampling

    CERN Document Server

    Aristoff, David

    2016-01-01

    We give a mathematical framework for weighted ensemble (WE) sampling, a binning and resampling technique for efficiently computing probabilities in molecular dynamics. We prove that WE sampling is unbiased in a very general setting that includes adaptive binning. We show that when WE is used for stationary calculations in tandem with a Markov state model (MSM), the MSM can be used to optimize the allocation of replicas in the bins.

  11. Quantum Data Compression of a Qubit Ensemble

    OpenAIRE

    Rozema, Lee A.; Mahler, Dylan H.; Hayat, Alex; Turner, Peter S.; Steinberg, Aephraim M.

    2014-01-01

    Data compression is a ubiquitous aspect of modern information technology, and the advent of quantum information raises the question of what types of compression are feasible for quantum data, where it is especially relevant given the extreme difficulty involved in creating reliable quantum memories. We present a protocol in which an ensemble of quantum bits (qubits) can in principle be perfectly compressed into exponentially fewer qubits. We then experimentally implement our algorithm, compre...

  12. Multiscale ensemble filtering for reservoir engineering applications

    OpenAIRE

    Lawniczak, W.; Hanea, R.G.; Heemink, A.; Mclaughlin, D.

    2009-01-01

    Reservoir management requires periodic updates of the simulation models using the production data available over time. Traditionally, validation of reservoir models with production data is done using a history matching process. Uncertainties in the data, as well as in the model, lead to a nonunique history matching inverse problem. It has been shown that the ensemble Kalman filter (EnKF) is an adequate method for predicting the dynamics of the reservoir. The EnKF is a sequential Monte-Carlo a...

  13. Statistical Ensemble Theory of Gompertz Growth Model

    Directory of Open Access Journals (Sweden)

    Takuya Yamano

    2009-11-01

    Full Text Available An ensemble formulation for the Gompertz growth function within the framework of statistical mechanics is presented, where the two growth parameters are assumed to be statistically distributed. The growth can be viewed as a self-referential process, which enables us to use the Bose-Einstein statistics picture. The analytical entropy expression pertain to the law can be obtained in terms of the growth velocity distribution as well as the Gompertz function itself for the whole process.

  14. Staying Thermal with Hartree Ensemble Approximations

    CERN Document Server

    Salle, M; Vink, Jeroen C

    2000-01-01

    Using Hartree ensemble approximations to compute the real time dynamics of scalar fields in 1+1 dimension, we find that with suitable initial conditions, approximate thermalization is achieved much faster than found in our previous work. At large times, depending on the interaction strength and temperature, the particle distribution slowly changes: the Bose-Einstein distribution of the particle densities develops classical features. We also discuss variations of our method which are numerically more efficient.

  15. Protein folding by distributed computing and the denatured state ensemble.

    Science.gov (United States)

    Marianayagam, Neelan J; Fawzi, Nicolas L; Head-Gordon, Teresa

    2005-11-15

    The distributed computing (DC) paradigm in conjunction with the folding@home (FH) client server has been used to study the folding kinetics of small peptides and proteins, giving excellent agreement with experimentally measured folding rates, although pathways sampled in these simulations are not always consistent with the folding mechanism. In this study, we use a coarse-grain model of protein L, whose two-state kinetics have been characterized in detail by using long-time equilibrium simulations, to rigorously test a FH protocol using approximately 10,000 short-time, uncoupled folding simulations starting from an extended state of the protein. We show that the FH results give non-Poisson distributions and early folding events that are unphysical, whereas longer folding events experience a correct barrier to folding but are not representative of the equilibrium folding ensemble. Using short-time, uncoupled folding simulations started from an equilibrated denatured state ensemble (DSE), we also do not get agreement with the equilibrium two-state kinetics because of overrepresented folding events arising from higher energy subpopulations in the DSE. The DC approach using uncoupled short trajectories can make contact with traditionally measured experimental rates and folding mechanism when starting from an equilibrated DSE, when the simulation time is long enough to sample the lowest energy states of the unfolded basin and the simulated free-energy surface is correct. However, the DC paradigm, together with faster time-resolved and single-molecule experiments, can also reveal the breakdown in the two-state approximation due to observation of folding events from higher energy subpopulations in the DSE.

  16. Interplanetary magnetic field ensemble at 1 AU

    Energy Technology Data Exchange (ETDEWEB)

    Matthaeus, W.H.; Goldstein, M.L.; King, J.H.

    1985-04-01

    A method for calculation ensemble averages from magnetic field data is described. A data set comprising approximately 16 months of nearly continuous ISEE-3 magnetic field data is used in this study. Individual subintervals of this data, ranging from 15 hours to 15.6 days comprise the ensemble. The sole condition for including each subinterval in the averages is the degree to which it represents a weakly time-stationary process. Averages obtained by this method are appropriate for a turbulence description of the interplanetary medium. The ensemble average correlation length obtained from all subintervals is found to be 4.9 x 10 to the 11th cm. The average value of the variances of the magnetic field components are in the approximate ratio 8:9:10, where the third component is the local mean field direction. The correlation lengths and variances are found to have a systematic variation with subinterval duration, reflecting the important role of low-frequency fluctuations in the interplanetary medium.

  17. Gradient Flow Analysis on MILC HISQ Ensembles

    Energy Technology Data Exchange (ETDEWEB)

    Brown, Nathan [Washington U., St. Louis; Bazavov, Alexei [Brookhaven; Bernard, Claude [Washington U., St. Louis; DeTar, Carleton [Utah U.; Foley, Justin [Utah U.; Gottlieb, Steven [Indiana U.; Heller, Urs M. [APS, New York; Hetrick, J. E. [U. Pacific, Stockton; Komijani, Javad [Washington U., St. Louis; Laiho, Jack [Syracuse U.; Levkova, Ludmila [Utah U.; Oktay, M. B. [Utah U.; Sugar, Robert [UC, Santa Barbara; Toussaint, Doug [Arizona U.; Van de Water, Ruth S. [Fermilab; Zhou, Ran [Fermilab

    2014-11-14

    We report on a preliminary scale determination with gradient-flow techniques on the $N_f = 2 + 1 + 1$ HISQ ensembles generated by the MILC collaboration. The ensembles include four lattice spacings, ranging from 0.15 to 0.06 fm, and both physical and unphysical values of the quark masses. The scales $\\sqrt{t_0}/a$ and $w_0/a$ are computed using Symanzik flow and the cloverleaf definition of $\\langle E \\rangle$ on each ensemble. Then both scales and the meson masses $aM_\\pi$ and $aM_K$ are adjusted for mistunings in the charm mass. Using a combination of continuum chiral perturbation theory and a Taylor series ansatz in the lattice spacing, the results are simultaneously extrapolated to the continuum and interpolated to physical quark masses. Our preliminary results are $\\sqrt{t_0} = 0.1422(7)$fm and $w_0 = 0.1732(10)$fm. We also find the continuum mass-dependence of $w_0$.

  18. Cavity Cooling for Ensemble Spin Systems

    Science.gov (United States)

    Cory, David

    2015-03-01

    Recently there has been a surge of interest in exploring thermodynamics in quantum systems where dissipative effects can be exploited to perform useful work. One such example is quantum state engineering where a quantum state of high purity may be prepared by dissipative coupling through a cold thermal bath. This has been used to great effect in many quantum systems where cavity cooling has been used to cool mechanical modes to their quantum ground state through coupling to the resolved sidebands of a high-Q resonator. In this talk we explore how these techniques may be applied to an ensemble spin system. This is an attractive process as it potentially allows for parallel remove of entropy from a large number of quantum systems, enabling an ensemble to achieve a polarization greater than thermal equilibrium, and potentially on a time scale much shorter than thermal relaxation processes. This is achieved by the coupled angular momentum subspaces of the ensemble behaving as larger effective spins, overcoming the weak individual coupling of individual spins to a microwave resonator. Cavity cooling is shown to cool each of these subspaces to their respective ground state, however an additional algorithmic step or dissipative process is required to couple between these subspaces and enable cooling to the full ground state of the joint system.

  19. Multivariate localization methods for ensemble Kalman filtering

    KAUST Repository

    Roh, S.

    2015-05-08

    In ensemble Kalman filtering (EnKF), the small number of ensemble members that is feasible to use in a practical data assimilation application leads to sampling variability of the estimates of the background error covariances. The standard approach to reducing the effects of this sampling variability, which has also been found to be highly efficient in improving the performance of EnKF, is the localization of the estimates of the covariances. One family of localization techniques is based on taking the Schur (entry-wise) product of the ensemble-based sample covariance matrix and a correlation matrix whose entries are obtained by the discretization of a distance-dependent correlation function. While the proper definition of the localization function for a single state variable has been extensively investigated, a rigorous definition of the localization function for multiple state variables has been seldom considered. This paper introduces two strategies for the construction of localization functions for multiple state variables. The proposed localization functions are tested by assimilating simulated observations experiments into the bivariate Lorenz 95 model with their help.

  20. Gradient Flow Analysis on MILC HISQ Ensembles

    CERN Document Server

    Bazavov, A; Brown, N; DeTar, C; Foley, J; Gottlieb, Steven; Heller, U M; Hetrick, J E; Komijani, J; Laiho, J; Levkova, L; Oktay, M; Sugar, R L; Toussaint, D; Van de Water, R S; Zhou, R

    2014-01-01

    We report on a preliminary scale determination with gradient-flow techniques on the $N_f = 2 + 1 + 1$ HISQ ensembles generated by the MILC collaboration. The ensembles include four lattice spacings, ranging from 0.15 to 0.06 fm, and both physical and unphysical values of the quark masses. The scales $\\sqrt{t_0}/a$ and $w_0/a$ are computed using Symanzik flow and the cloverleaf definition of $\\langle E \\rangle$ on each ensemble. Then both scales and the meson masses $aM_\\pi$ and $aM_K$ are adjusted for mistunings in the charm mass. Using a combination of continuum chiral perturbation theory and a Taylor series ansatz in the lattice spacing, the results are simultaneously extrapolated to the continuum and interpolated to physical quark masses. Our preliminary results are $\\sqrt{t_0} = 0.1422(7)$fm and $w_0 = 0.1732(10)$fm. We also find the continuum mass-dependence of $w_0$.

  1. Multivariate localization methods for ensemble Kalman filtering

    KAUST Repository

    Roh, S.

    2015-12-03

    In ensemble Kalman filtering (EnKF), the small number of ensemble members that is feasible to use in a practical data assimilation application leads to sampling variability of the estimates of the background error covariances. The standard approach to reducing the effects of this sampling variability, which has also been found to be highly efficient in improving the performance of EnKF, is the localization of the estimates of the covariances. One family of localization techniques is based on taking the Schur (element-wise) product of the ensemble-based sample covariance matrix and a correlation matrix whose entries are obtained by the discretization of a distance-dependent correlation function. While the proper definition of the localization function for a single state variable has been extensively investigated, a rigorous definition of the localization function for multiple state variables that exist at the same locations has been seldom considered. This paper introduces two strategies for the construction of localization functions for multiple state variables. The proposed localization functions are tested by assimilating simulated observations experiments into the bivariate Lorenz 95 model with their help.

  2. Ensemble transform sensitivity method for adaptive observations

    Science.gov (United States)

    Zhang, Yu; Xie, Yuanfu; Wang, Hongli; Chen, Dehui; Toth, Zoltan

    2016-01-01

    The Ensemble Transform (ET) method has been shown to be useful in providing guidance for adaptive observation deployment. It predicts forecast error variance reduction for each possible deployment using its corresponding transformation matrix in an ensemble subspace. In this paper, a new ET-based sensitivity (ETS) method, which calculates the gradient of forecast error variance reduction in terms of analysis error variance reduction, is proposed to specify regions for possible adaptive observations. ETS is a first order approximation of the ET; it requires just one calculation of a transformation matrix, increasing computational efficiency (60%-80% reduction in computational cost). An explicit mathematical formulation of the ETS gradient is derived and described. Both the ET and ETS methods are applied to the Hurricane Irene (2011) case and a heavy rainfall case for comparison. The numerical results imply that the sensitive areas estimated by the ETS and ET are similar. However, ETS is much more efficient, particularly when the resolution is higher and the number of ensemble members is larger.

  3. Multivariate localization methods for ensemble Kalman filtering

    Directory of Open Access Journals (Sweden)

    S. Roh

    2015-05-01

    Full Text Available In ensemble Kalman filtering (EnKF, the small number of ensemble members that is feasible to use in a practical data assimilation application leads to sampling variability of the estimates of the background error covariances. The standard approach to reducing the effects of this sampling variability, which has also been found to be highly efficient in improving the performance of EnKF, is the localization of the estimates of the covariances. One family of localization techniques is based on taking the Schur (entry-wise product of the ensemble-based sample covariance matrix and a correlation matrix whose entries are obtained by the discretization of a distance-dependent correlation function. While the proper definition of the localization function for a single state variable has been extensively investigated, a rigorous definition of the localization function for multiple state variables has been seldom considered. This paper introduces two strategies for the construction of localization functions for multiple state variables. The proposed localization functions are tested by assimilating simulated observations experiments into the bivariate Lorenz 95 model with their help.

  4. On large deviations for ensembles of distributions

    Energy Technology Data Exchange (ETDEWEB)

    Khrychev, D A [Moscow State Institute of Radio-Engineering, Electronics and Automation (Technical University), Moscow (Russian Federation)

    2013-11-30

    The paper is concerned with the large deviations problem in the Freidlin-Wentzell formulation without the assumption of the uniqueness of the solution to the equation involving white noise. In other words, it is assumed that for each ε>0 the nonempty set P{sub ε} of weak solutions is not necessarily a singleton. Analogues of a number of concepts in the theory of large deviations are introduced for the set (P{sub ε}, ε>0), hereafter referred to as an ensemble of distributions. The ensembles of weak solutions of an n-dimensional stochastic Navier-Stokes system and stochastic wave equation with power-law nonlinearity are shown to be uniformly exponentially tight. An idempotent Wiener process in a Hilbert space and idempotent partial differential equations are defined. The accumulation points in the sense of large deviations of the ensembles in question are shown to be weak solutions of the corresponding idempotent equations. Bibliography: 14 titles.

  5. Multivariate localization methods for ensemble Kalman filtering

    Science.gov (United States)

    Roh, S.; Jun, M.; Szunyogh, I.; Genton, M. G.

    2015-12-01

    In ensemble Kalman filtering (EnKF), the small number of ensemble members that is feasible to use in a practical data assimilation application leads to sampling variability of the estimates of the background error covariances. The standard approach to reducing the effects of this sampling variability, which has also been found to be highly efficient in improving the performance of EnKF, is the localization of the estimates of the covariances. One family of localization techniques is based on taking the Schur (element-wise) product of the ensemble-based sample covariance matrix and a correlation matrix whose entries are obtained by the discretization of a distance-dependent correlation function. While the proper definition of the localization function for a single state variable has been extensively investigated, a rigorous definition of the localization function for multiple state variables that exist at the same locations has been seldom considered. This paper introduces two strategies for the construction of localization functions for multiple state variables. The proposed localization functions are tested by assimilating simulated observations experiments into the bivariate Lorenz 95 model with their help.

  6. Dynamic Analogue Initialization for Ensemble Forecasting

    Institute of Scientific and Technical Information of China (English)

    LI Shan; RONG Xingyao; LIU Yun; LIU Zhengyu; Klaus FRAEDRICH

    2013-01-01

    This paper introduces a new approach for the initialization of ensemble numerical forecasting:Dynamic Analogue Initialization (DAI).DAI assumes that the best model state trajectories for the past provide the initial conditions for the best forecasts in the future.As such,DAI performs the ensemble forecast using the best analogues from a full size ensemble.As a pilot study,the Lorenz63 and Lorenz96 models were used to test DAI's effectiveness independently.Results showed that DAI can improve the forecast significantly.Especially in lower-dimensional systems,DAI can reduce the forecast RMSE by ~50% compared to the Monte Carlo forecast (MC).This improvement is because DAI is able to recognize the direction of the analysis error through the embedding process and therefore selects those good trajectories with reduced initial error.Meanwhile,a potential improvement of DAI is also proposed,and that is to find the optimal range of embedding time based on the error's growing speed.

  7. EnsembleGraph: Interactive Visual Analysis of Spatial-Temporal Behavior for Ensemble Simulation Data

    Energy Technology Data Exchange (ETDEWEB)

    Shu, Qingya; Guo, Hanqi; Che, Limei; Yuan, Xiaoru; Liu, Junfeng; Liang, Jie

    2016-04-19

    We present a novel visualization framework—EnsembleGraph— for analyzing ensemble simulation data, in order to help scientists understand behavior similarities between ensemble members over space and time. A graph-based representation is used to visualize individual spatiotemporal regions with similar behaviors, which are extracted by hierarchical clustering algorithms. A user interface with multiple-linked views is provided, which enables users to explore, locate, and compare regions that have similar behaviors between and then users can investigate and analyze the selected regions in detail. The driving application of this paper is the studies on regional emission influences over tropospheric ozone, which is based on ensemble simulations conducted with different anthropogenic emission absences using the MOZART-4 (model of ozone and related tracers, version 4) model. We demonstrate the effectiveness of our method by visualizing the MOZART-4 ensemble simulation data and evaluating the relative regional emission influences on tropospheric ozone concentrations. Positive feedbacks from domain experts and two case studies prove efficiency of our method.

  8. A modified ensemble Kalman particle filter for non-Gaussian systems with nonlinear measurement functions

    Science.gov (United States)

    Shen, Zheqi; Tang, Youmin

    2016-04-01

    The ensemble Kalman particle filter (EnKPF) is a combination of two Bayesian-based algorithms, namely, the ensemble Kalman filter (EnKF) and the sequential importance resampling particle filter(SIR-PF). It was recently introduced to address non-Gaussian features in data assimilation for highly nonlinear systems, by providing a continuous interpolation between the EnKF and SIR-PF analysis schemes. In this paper, we first extend the EnKPF algorithm by modifying the formula for the computation of the covariancematrix, making it suitable for nonlinear measurement functions (we will call this extended algorithm nEnKPF). Further, a general form of the Kalman gain is introduced to the EnKPF to improve the performance of the nEnKPF when the measurement function is highly nonlinear (this improved algorithm is called mEnKPF). The Lorenz '63 model and Lorenz '96 model are used to test the two modified EnKPF algorithms. The experiments show that the mEnKPF and nEnKPF, given an affordable ensemble size, can perform better than the EnKF for the nonlinear systems with nonlinear observations. These results suggest a promising opportunity to develop a non-Gaussian scheme for realistic numerical models.

  9. Ensemble: a web-based system for psychology survey and experiment management.

    Science.gov (United States)

    Tomic, Stefan T; Janata, Petr

    2007-08-01

    We provide a description of Ensemble, a suite of Web-integrated modules for managing and analyzing data associated with psychology experiments in a small research lab. The system delivers interfaces via a Web browser for creating and presenting simple surveys without the need to author Web pages and with little or no programming effort. The surveys may be extended by selecting and presenting auditory and/or visual stimuli with MATLAB and Flash to enable a wide range of psychophysical and cognitive experiments which do not require the recording of precise reaction times. Additionally, one is provided with the ability to administer and present experiments remotely. The software technologies employed by the various modules of Ensemble are MySQL, PHP, MATLAB, and Flash. The code for Ensemble is open source and available to the public, so that its functions can be readily extended by users. We describe the architecture of the system, the functionality of each module, and provide basic examples of the interfaces.

  10. Ensemble of regional climate model projections for Ireland

    Science.gov (United States)

    Nolan, Paul; McGrath, Ray

    2016-04-01

    of over 35 days per year. Results show significant projected decreases in mean annual, spring and summer precipitation amounts by mid-century. The projected decreases are largest for summer, with "likely" reductions ranging from 0% to 20%. The frequencies of heavy precipitation events show notable increases (approximately 20%) during the winter and autumn months. The number of extended dry periods is projected to increase substantially during autumn and summer. Regional variations of projected precipitation change remain statistically elusive. The energy content of the wind is projected to significantly decrease for the future spring, summer and autumn months. Projected increases for winter were found to be statistically insignificant. The projected decreases were largest for summer, with "likely" values ranging from 3% to 15%. Results suggest that the tracks of intense storms are projected to extend further south over Ireland relative to those in the reference simulation. As extreme storm events are rare, the storm-tracking research needs to be extended. Future work will focus on analysing a larger ensemble, thus allowing a robust statistical analysis of extreme storm track projections.

  11. The scattering of $f$- and $p$-modes from ensembles of thin magnetic flux tubes - An analytical approach

    CERN Document Server

    Hanson, Chris S

    2014-01-01

    Motivated by the observational results of Braun (1995), we extend the model of Hanson & Cally (2014) to address the effect of multiple scattering of f and p-modes by an ensemble of thin vertical magnetic flux tubes in the surface layers of the Sun. As in observational Hankel analysis we measure the scatter and phase shift from an incident cylindrical wave in a coordinate system roughly centred in the core of the ensemble. It is demonstrated that, although thin flux tubes are unable to interact with high order fluting modes individually, they can indirectly absorb energy from these waves through the scatters of kink and sausage components. It is also shown how the distribution of absorption and phase shift across the azimuthal order m depends strongly on the tube position, as well as on the individual tube characteristics. This is the first analytical study into an ensembles multiple scattering regime, that is embedded within a strati?ed atmosphere.

  12. The scattering of f- and p-modes from ensembles of thin magnetic flux tubes: an analytical approach

    Energy Technology Data Exchange (ETDEWEB)

    Hanson, Chris S.; Cally, Paul S., E-mail: christopher.hanson@monash.edu [Monash Centre for Astrophysics and School of Mathematical Sciences, Monash University, Clayton, Victoria 3800 (Australia)

    2014-08-20

    Motivated by the observational results of Braun, we extend the model of Hanson and Cally to address the effect of multiple scattering of f and p modes by an ensemble of thin vertical magnetic flux tubes in the surface layers of the Sun. As in the observational Hankel analysis, we measure the scatter and phase shift from an incident cylindrical wave in a coordinate system roughly centered in the core of the ensemble. It is demonstrated that although thin flux tubes are unable to interact with high-order fluting modes individually, they can indirectly absorb energy from these waves through the scatters of kink and sausage components. It is also shown how the distribution of absorption and phase shift across the azimuthal order m depends strongly on the tube position as well as on the individual tube characteristics. This is the first analytical study into an ensembles multiple-scattering regime that is embedded within a stratified atmosphere.

  13. De praeceptis ferendis: good practice in multi-model ensembles

    Directory of Open Access Journals (Sweden)

    I. Kioutsioukis

    2014-06-01

    Full Text Available Ensembles of air quality models have been formally and empirically shown to outperform single models in many cases. Evidence suggests that ensemble error is reduced when the members form a diverse and accurate ensemble. Diversity and accuracy are hence two factors that should be taken care of while designing ensembles in order for them to provide better predictions. There exists a trade-off between diversity and accuracy for which one cannot be gained without expenses of the other. Theoretical aspects like the bias-variance-covariance decomposition and the accuracy-diversity decomposition are linked together and support the importance of creating ensemble that incorporates both the elements. Hence, the common practice of unconditional averaging of models without prior manipulation limits the advantages of ensemble averaging. We demonstrate the importance of ensemble accuracy and diversity through an inter-comparison of ensemble products for which a sound mathematical framework exists, and provide specific recommendations for model selection and weighting for multi model ensembles. To this end we have devised statistical tools that can be used for diagnostic evaluation of ensemble modelling products, complementing existing operational methods.

  14. Ensemble statistical and subspace clustering model for analysis of autism spectrum disorder phenotypes.

    Science.gov (United States)

    Al-Jabery, Khalid; Obafemi-Ajayi, Tayo; Olbricht, Gayla R; Takahashi, T Nicole; Kanne, Stephen; Wunsch, Donald

    2016-08-01

    Heterogeneity in Autism Spectrum Disorder (ASD) is complex including variability in behavioral phenotype as well as clinical, physiologic, and pathologic parameters. The fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) now diagnoses ASD using a 2-dimensional model based social communication deficits and fixated interests and repetitive behaviors. Sorting out heterogeneity is crucial for study of etiology, diagnosis, treatment and prognosis. In this paper, we present an ensemble model for analyzing ASD phenotypes using several machine learning techniques and a k-dimensional subspace clustering algorithm. Our ensemble also incorporates statistical methods at several stages of analysis. We apply this model to a sample of 208 probands drawn from the Simon Simplex Collection Missouri Site patients. The results provide useful evidence that is helpful in elucidating the phenotype complexity within ASD. Our model can be extended to other disorders that exhibit a diverse range of heterogeneity.

  15. Multiple Scattering of Seismic Waves from Ensembles of Upwardly Lossy Thin Flux Tubes

    CERN Document Server

    Hanson, Chris S

    2015-01-01

    Our previous semi-analytic treatment of f- and p-mode multiple scattering from ensembles of thin flux tubes (Hanson and Cally, Astrophys. J. 781, 125; 791, 129, 2014) is extended by allowing both sausage and kink waves to freely escape at the top of the model using a radiative boundary condition there. As expected, this additional avenue of escape, supplementing downward loss into the deep solar interior, results in substantially greater absorption of incident f- and p-modes. However, less intuitively, it also yields mildly to substantially smaller phase shifts in waves emerging from the ensemble. This may have implications for the interpretation of seismic data for solar plage regions, and in particular their small measured phase shifts.

  16. Ultra-stable optical clock with two cold-atom ensembles

    CERN Document Server

    Schioppo, M; McGrew, W F; Hinkley, N; Fasano, R J; Beloy, K; Yoon, T H; Milani, G; Nicolodi, D; Sherman, J A; Phillips, N B; Oates, C W; Ludlow, A D

    2016-01-01

    Atomic clocks based on optical transitions are the most stable, and therefore precise, timekeepers available. These clocks operate by alternating intervals of atomic interrogation with dead time required for quantum state preparation and readout. This non-continuous interrogation of the atom system results in the Dick effect, an aliasing of frequency noise of the laser interrogating the atomic transition. Despite recent advances in optical clock stability achieved by improving laser coherence, the Dick effect has continually limited optical clock performance. Here we implement a robust solution to overcome this limitation: a zero-dead-time optical clock based on the interleaved interrogation of two cold-atom ensembles. This clock exhibits vanishingly small Dick noise, thereby achieving an unprecedented fractional frequency instability of $6 \\times 10^{-17} / \\sqrt{\\tau}$ for an averaging time $\\tau$ in seconds. We also consider alternate dual-atom-ensemble schemes to extend laser coherence and reduce the stan...

  17. Self-consistent implementation of ensemble density functional theory method for multiple strongly correlated electron pairs

    Science.gov (United States)

    Filatov, Michael; Liu, Fang; Kim, Kwang S.; Martínez, Todd J.

    2016-12-01

    The spin-restricted ensemble-referenced Kohn-Sham (REKS) method is based on an ensemble representation of the density and is capable of correctly describing the non-dynamic electron correlation stemming from (near-)degeneracy of several electronic configurations. The existing REKS methodology describes systems with two electrons in two fractionally occupied orbitals. In this work, the REKS methodology is extended to treat systems with four fractionally occupied orbitals accommodating four electrons and self-consistent implementation of the REKS(4,4) method with simultaneous optimization of the orbitals and their fractional occupation numbers is reported. The new method is applied to a number of molecular systems where simultaneous dissociation of several chemical bonds takes place, as well as to the singlet ground states of organic tetraradicals 2,4-didehydrometaxylylene and 1,4,6,9-spiro[4.4]nonatetrayl.

  18. GEOWOW - Benefits of TIGGE ensemble forecast data for the GEOSS community

    Science.gov (United States)

    Zsoter, Ervin

    2014-05-01

    GEOWOW (GEOSS interoperability for Weather, Ocean and Water) is an EU-funded project with the main challenge to improve Earth Observation data discovery, accessibility and exploitability, and to evolve the Global Earth Observation System of Systems (GEOSS) for the benefit of all Societal Benefit Areas (SBAs) with focus on Weather, Ocean Ecosystems and Water. There is a particular focus on supporting multi-disciplinary interoperability across different SBAs, including the use of weather forecast ensembles in applications. The weather community in GEOWOW, led by ECMWF, addresses the elements of the GEO Capacity Building Strategy by improving the access to TIGGE (THORPEX Interactive Grand Global Ensemble) data in which global ensemble predictions from 10 leading global NWP centres are collected in near real-time to support research on ensemble prediction. GEOWOW extends the TIGGE archive with ensemble weather forecasts from limited area models and will build a multi-model ensemble time-series archive (Global and LAM containing data only for specific points) to increase the accessibility of TIGGE for a wider community. GEOWOW work also includes the development and demonstration of (multi-model) products using TIGGE data to support high impact weather forecasting in areas such as tropical cyclone tracks, heavy precipitation events and strong winds. Testing and delivery of these products is in collaboration with participants in the WMO Severe Weather Forecasting Demonstration Project (SWFDP). The SWFDP is a WMO capacity building project which uses a Cascading Forecasting Process to support a basic capability to issue severe weather warnings in developing and least-developed countries. A specific area of work covers the demonstration and documentation of the potential use of TIGGE data in high impact weather forecasting through mainly case studies. Analysed cases include various types of severe weather and different regions - for example rainfall and flooding in West

  19. Skill improvement of seasonal Arctic sea ice forecasts using bias-correction and ensemble calibration

    Science.gov (United States)

    Krikken, Folmer; Hazeleger, Wilco; Vlot, Willem; Schmeits, Maurice; Guemas, Virginie

    2016-04-01

    We explore the standard error and skill of dynamical seasonal sea ice forecasts of the Arctic using different bias-correction and ensemble calibration methods. The latter is often used in weather forecasting, but so far has not been applied to Arctic sea ice forecasts. We use seasonal predictions of Arctic sea ice of a 5-member ensemble forecast using the fully coupled GCM EC-Earth, with model initial states obtained by nudging towards ORAS4 and ERA-Interim. The raw model forecasts contain large biases in total sea ice area, especially during the summer months. This is mainly caused by a difference in average seasonal cycle between EC-Earth and observations, which translates directly into the forecasts yielding large biases. Further errors are introduced by the differences in long term trend between the observed sea ice, and the uninitialised EC-earth simulation. We find that extended logistic regression (ELR) and heteroscedastic extended logistic regression (HELR) both prove viable ensemble calibration methods, and improve the forecasts substantially compared to standard bias correction techniques. No clear distinction between ELR and HELR is found. Forecasts starting in May have higher skill (CRPSS > 0 up to 5 months lead time) than forecasts starting in August (2-3 months) and November (2-3 months), with trend-corrected climatology as reference. Analysis of regional skill in the Arctic shows distinct differences, where mainly the Arctic ocean and the Kara and Barents sea prove to be one of the more predictable regions with skilful forecasts starting in May up to 5-6 months lead time. Again, forecasts starting in August and November show much lower regional skill. Overall, it is still difficult to beat relative simple statistical forecasts, but by using ELR and HELR we are getting reasonably close to skilful seasonal forecasts up to 12 months lead time. These results show there is large potential, and need, for using ensemble calibration in seasonal forecasts of

  20. Extending reference assembly models

    DEFF Research Database (Denmark)

    Church, Deanna M.; Schneider, Valerie A.; Steinberg, Karyn Meltz

    2015-01-01

    The human genome reference assembly is crucial for aligning and analyzing sequence data, and for genome annotation, among other roles. However, the models and analysis assumptions that underlie the current assembly need revising to fully represent human sequence diversity. Improved analysis tools...

  1. Extending reference assembly models

    DEFF Research Database (Denmark)

    Church, Deanna M.; Schneider, Valerie A.; Steinberg, Karyn Meltz;

    2015-01-01

    The human genome reference assembly is crucial for aligning and analyzing sequence data, and for genome annotation, among other roles. However, the models and analysis assumptions that underlie the current assembly need revising to fully represent human sequence diversity. Improved analysis tools...

  2. A Multivariate Empirical Orthogonal Function-Based Scheme for the Balanced Initial Ensemble Generation of an Ensemble Kalman Filter

    Institute of Scientific and Technical Information of China (English)

    ZHENG Fei; ZHU Jiang

    2010-01-01

    The initial ensemble perturbations for an ensemble data assimilation system are expected to reasonably sample model uncertainty at the time of analysis to further reduce analysis uncertainty.Therefore,the careful choice of an initial ensemble perturbation method that dynamically cycles ensemble perturbations is required for the optimal performance of the system.Based on the multivariate empirical onhogonal function(MEOF)method,a new ensemble initialization scheme is developed to generate balanced initial perturbations for the ensemble Kalman filter(EnKF)data assimilation,with a reasonable consideration of the physical relationships between different model variables.The scheme is applied in assimilation experiments with a global spectral atmospheric model and with real observations.The proposed perturbation method is compared to the commonly used method of spatially-correlated random perturbations.The comparisons show that the model uncertainties prior to the first analysis time,which are forecasted from the balanced ensemble initial fields,maintain a much more reasonable spread and a more accurate forecast error covariance than those from the randomly perturbed initial fields.The analysis results are further improved by the balanced ensemble initialization scheme due to more accurate background information.Also,a 20-day continuous assimilation experiment shows that the ensemble spreads for each model variable are still retained in reasonable ranges without considering additional perturbations or inflations during the assimilation cycles,while the ensemble spreads from the randomly perturbed initialization scheme decrease and collapse rapidly.

  3. Exponential decay of matrix Φ -entropies on Markov semigroups with applications to dynamical evolutions of quantum ensembles

    Science.gov (United States)

    Cheng, Hao-Chung; Hsieh, Min-Hsiu; Tomamichel, Marco

    2017-09-01

    In this work, we extend the theory of quantum Markov processes on a single quantum state to a broader theory that covers Markovian evolution of an ensemble of quantum states, which generalizes Lindblad's formulation of quantum dynamical semigroups. Our results establish the equivalence between an exponential decrease of the matrix Φ -entropies and the Φ -Sobolev inequalities, which allows us to characterize the dynamical evolution of a quantum ensemble to its equilibrium. In particular, we study the convergence rates of two special semigroups, namely, the depolarizing channel and the phase-damping channel. In the former, since there exists a unique equilibrium state, we show that the matrix Φ -entropy of the resulting quantum ensemble decays exponentially as time goes on. Consequently, we obtain a stronger notion of monotonicity of the Holevo quantity—the Holevo quantity of the quantum ensemble decays exponentially in time and the convergence rate is determined by the modified log-Sobolev inequalities. However, in the latter, the matrix Φ -entropy of the quantum ensemble that undergoes the phase-damping Markovian evolution generally will not decay exponentially. There is no classical analogy for these different equilibrium situations. Finally, we also study a statistical mixing of Markov semigroups on matrix-valued functions. We can explicitly calculate the convergence rate of a Markovian jump process defined on Boolean hypercubes and provide upper bounds to the mixing time.

  4. Integrating weather and climate predictions for seamless hydrologic ensemble forecasting: A case study in the Yalong River basin

    Science.gov (United States)

    Ye, Aizhong; Deng, Xiaoxue; Ma, Feng; Duan, Qingyun; Zhou, Zheng; Du, Chao

    2017-04-01

    Despite the tremendous improvement made in numerical weather and climate models over the recent years, the forecasts generated by those models still cannot be used directly for hydrological forecasting. A post-processor like the Ensemble Pre-Processor (EPP) developed by U.S. National Weather Service must be used to remove various biases and to extract useful predictive information from those forecasts. In this paper, we investigate how different designs of canonical events in the EPP can help post-process precipitation forecasts from the Global Ensemble Forecast System (GEFS) and Climate Forecast System Version 2 (CFSv2). The use of canonical events allow those products to be linked seamlessly and then the post-processed ensemble precipitation forecasts can be generated using the Schaake Shuffle procedure. We used the post-processed ensemble precipitation forecasts to drive a distributed hydrological model to obtain ensemble streamflow forecasts and evaluated those forecasts against the observed streamflow. We found that the careful design of canonical events can help extract more useful information, especially when up-to-date observed precipitation is used to setup the canonical events. We also found that streamflow forecasts using post-processed precipitation forecasts have longer lead times and higher accuracy than streamflow forecasts made by traditional Extend Streamflow Prediction (ESP) and the forecasts based on original GEFS and CFSv2 precipitation forecasts.

  5. Seasonal hydrological ensemble forecasts over Europe

    Science.gov (United States)

    Arnal, Louise; Wetterhall, Fredrik; Stephens, Elisabeth; Cloke, Hannah; Pappenberger, Florian

    2016-04-01

    This study investigates the limits of predictability in dynamical seasonal discharge forecasting, in both space and time, over Europe. Seasonal forecasts have an important socioeconomic value. Applications are numerous and cover hydropower management, spring flood prediction, low flow prediction for navigation and agricultural water demands. Additionally, the constant increase in NWP skill for longer lead times and the predicted increase in the intensity and frequency of hydro-meteorological extremes, have amplified the incentive to promote and further improve hydrological forecasts on sub-seasonal to seasonal timescales. In this study, seasonal hydrological forecasts (SEA), driven by the ECMWF's System 4 in hindcast mode, were analysed against an Ensemble Streamflow Prediction (ESP) benchmark. The ESP was forced with an ensemble of resampled historical meteorological observations and started with perfect initial conditions. Both forecasts were produced by the LISFLOOD model, run on the pan-European scale with a spatial resolution of 5 by 5 km. The forecasts were issued monthly on a daily time step, from 1990 until the current time, up to a lead time of 7 months. The seasonal discharge forecasts were analysed against the ESP on a catchment scale in terms of their accuracy, skill and sharpness, using a diverse set of verification metrics (e.g. KGE, CRPSS and ROC). Additionally, a reverse-ESP was constructed by forcing the LISFLOOD model with a single perfect meteorological set of observations and initiated from an ensemble of resampled historical initial conditions. The comparison of the ESP with the reverse-ESP approach enabled the identification of the respective contribution of meteorological forcings and hydrologic initial conditions errors to seasonal discharge forecasting uncertainties in Europe. These results could help pinpoint target elements of the forecasting chain which, after being improved, could lead to substantial increase in discharge predictability

  6. A Framework for Non-Equilibrium Statistical Ensemble Theory

    Institute of Scientific and Technical Information of China (English)

    BI Qiao; HE Zu-Tan; LIU Jie

    2011-01-01

    Since Gibbs synthesized a general equilibrium statistical ensemble theory, many theorists have attempted to generalized the Gibbsian theory to non-equilibrium phenomena domain, however the status of the theory of nonequilibrium phenomena can not be said as firm as well established as the Gibbsian ensemble theory. In this work, we present a framework for the non-equilibrium statistical ensemble formalism based on a subdynamic kinetic equation (SKE) rooted from the Brussels-Austin school and followed by some up-to-date works. The constructed key is to use a similarity transformation between Gibbsian ensembles formalism based on Liouville equation and the subdynamic ensemble formalism based on the SKE. Using this formalism, we study the spin-Boson system, as cases of weak coupling or strongly coupling, and obtain the reduced density operators for the Canonical ensembles easily.

  7. Cluster ensembles, quantization and the dilogarithm

    DEFF Research Database (Denmark)

    Fock, Vladimir; Goncharov, Alexander B.

    2009-01-01

    , possibly degenerate, and the space has a Poisson structure. The map is compatible with these structures. The dilogarithm together with its motivic and quantum avatars plays a central role in the cluster ensemble structure. We define a non-commutative -deformation of the -space. When is a root of unity...... group . It is an algebraic-geometric avatar of higher Teichmüller theory on related to . We suggest that there exists a duality between the and spaces. In particular, we conjecture that the tropical points of one of the spaces parametrise a basis in the space of functions on the Langlands dual space. We...

  8. Accurate Atom Counting in Mesoscopic Ensembles

    CERN Document Server

    Hume, D B; Joos, M; Muessel, W; Strobel, H; Oberthaler, M K

    2013-01-01

    Many cold atom experiments rely on precise atom number detection, especially in the context of quantum-enhanced metrology where effects at the single particle level are important. Here, we investigate the limits of atom number counting via resonant fluorescence detection for mesoscopic samples of trapped atoms. We characterize the precision of these fluorescence measurements beginning from the single-atom level up to more than one thousand. By investigating the primary noise sources, we obtain single-atom resolution for atom numbers as high as 1200. This capability is an essential prerequisite for future experiments with highly entangled states of mesoscopic atomic ensembles.

  9. Accurate Atom Counting in Mesoscopic Ensembles

    Science.gov (United States)

    Hume, D. B.; Stroescu, I.; Joos, M.; Muessel, W.; Strobel, H.; Oberthaler, M. K.

    2013-12-01

    Many cold atom experiments rely on precise atom number detection, especially in the context of quantum-enhanced metrology where effects at the single particle level are important. Here, we investigate the limits of atom number counting via resonant fluorescence detection for mesoscopic samples of trapped atoms. We characterize the precision of these fluorescence measurements beginning from the single-atom level up to more than one thousand. By investigating the primary noise sources, we obtain single-atom resolution for atom numbers as high as 1200. This capability is an essential prerequisite for future experiments with highly entangled states of mesoscopic atomic ensembles.

  10. Supervised Ensemble Classification of Kepler Variable Stars

    CERN Document Server

    Bass, Gideon

    2016-01-01

    Variable star analysis and classification is an important task in the understanding of stellar features and processes. While historically classifications have been done manually by highly skilled experts, the recent and rapid expansion in the quantity and quality of data has demanded new techniques, most notably automatic classification through supervised machine learning. We present an expansion of existing work on the field by analyzing variable stars in the {\\em Kepler} field using an ensemble approach, combining multiple characterization and classification techniques to produce improved classification rates. Classifications for each of the roughly 150,000 stars observed by {\\em Kepler} are produced separating the stars into one of 14 variable star classes.

  11. Modeling Coordination Problems in a Music Ensemble

    DEFF Research Database (Denmark)

    Frimodt-Møller, Søren R.

    2008-01-01

    This paper considers in general terms, how musicians are able to coordinate through rational choices in a situation of (temporary) doubt in an ensemble performance. A fictitious example involving a 5-bar development in an unknown piece of music is analyzed in terms of epistemic logic, more...... specifically a multi-agent system, where it is shown that perfect coordination can only be certain to take place if the musicians have common knowledge of certain rules of the composition. We subsequently argue, however, that the musicians need not agree on the central features of the piece of music in order...

  12. Asymptotic expansions for the Gaussian unitary ensemble

    DEFF Research Database (Denmark)

    Haagerup, Uffe; Thorbjørnsen, Steen

    2012-01-01

    Let g : R ¿ C be a C8-function with all derivatives bounded and let trn denote the normalized trace on the n × n matrices. In Ref. 3 Ercolani and McLaughlin established asymptotic expansions of the mean value ¿{trn(g(Xn))} for a rather general class of random matrices Xn, including the Gaussian...... Unitary Ensemble (GUE). Using an analytical approach, we provide in the present paper an alternative proof of this asymptotic expansion in the GUE case. Specifically we derive for a random matrix Xn that where k is an arbitrary positive integer. Considered as mappings of g, we determine the coefficients...

  13. Accurate atom counting in mesoscopic ensembles.

    Science.gov (United States)

    Hume, D B; Stroescu, I; Joos, M; Muessel, W; Strobel, H; Oberthaler, M K

    2013-12-20

    Many cold atom experiments rely on precise atom number detection, especially in the context of quantum-enhanced metrology where effects at the single particle level are important. Here, we investigate the limits of atom number counting via resonant fluorescence detection for mesoscopic samples of trapped atoms. We characterize the precision of these fluorescence measurements beginning from the single-atom level up to more than one thousand. By investigating the primary noise sources, we obtain single-atom resolution for atom numbers as high as 1200. This capability is an essential prerequisite for future experiments with highly entangled states of mesoscopic atomic ensembles.

  14. Validation of the Air Force Weather Agency Ensemble Prediction Systems

    Science.gov (United States)

    2014-03-27

    to deterministic models. Results from ensemble weather input into operational risk management ( ORM ) destruction of enemy air defense simulations...growth during the analysis period (Toth and Kalnay, 1993; Toth and Kalnay, 1997). From this framework the ensemble transform bred vector, ensemble...features. Each of its 10 members is run independently using different configurations in the framework of the Weather Research and Forecasting (WRF

  15. Unconditional two-mode squeezing of separated atomic ensembles

    CERN Document Server

    Parkins, A S; Solano, E

    2005-01-01

    We propose schemes for the unconditional preparation of a two-mode squeezed state of effective bosonic modes realized in a pair of atomic ensembles interacting collectively with optical cavity and laser fields. The scheme uses Raman transitions between stable atomic ground states and under ideal conditions produces pure entangled states in the steady state. The scheme works both for ensembles confined within a single cavity and for ensembles confined in separate, cascaded cavities.

  16. The Moment Convergence Rates for Largest Eigenvalues of β Ensembles

    Institute of Scientific and Technical Information of China (English)

    Jun Shan XIE

    2013-01-01

    The paper focuses on the largest eigenvalues of the β-Hermite ensemble and theβ-Laguerre ensemble.In particular,we obtain the precise moment convergence rates of their largest eigenvalues.The results are motivated by the complete convergence for partial sums of i.i.d.random variables,and the proofs depend on the small deviations for largest eigenvalues of the β ensembles and tail inequalities of the general β Tracy-Widom law.

  17. Extracting Value from Ensembles for Cloud-Free Forecasting

    Science.gov (United States)

    2011-09-01

    for Medium range Weather Forecasting EMean Ensemble mean ETR Ensemble transform with rescaling EUMETSAT European Organization for the...transform method (ET) with rescaling ( ETR ) to define the initial atmospheric uncertainty (Wei et al. 2008). Adapted from the ET method devised by...variances of each grid point to further restrain the initial ensemble spread. The ETR method replaced the breeding method in GEFS during NCEP’s May

  18. On sequential observation processing in localized ensemble Kalman filters

    OpenAIRE

    Nerger, Lars

    2014-01-01

    The different variants of current ensemble square-root Kalman filters assimilate either all observations at once or perform a sequence in which batches of observations or each single observation is assimilated. The sequential observation processing is used in filter algorithms like the ensemble adjustment Kalman filter (EAKF) and the ensemble square-root filter (EnSRF) and can result in computationally efficient algorithms because matrix inversions in the observation space are reduced to the ...

  19. Sequential ensemble-based optimal design for parameter estimation: SEQUENTIAL ENSEMBLE-BASED OPTIMAL DESIGN

    Energy Technology Data Exchange (ETDEWEB)

    Man, Jun [Zhejiang Provincial Key Laboratory of Agricultural Resources and Environment, Institute of Soil and Water Resources and Environmental Science, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou China; Zhang, Jiangjiang [Zhejiang Provincial Key Laboratory of Agricultural Resources and Environment, Institute of Soil and Water Resources and Environmental Science, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou China; Li, Weixuan [Pacific Northwest National Laboratory, Richland Washington USA; Zeng, Lingzao [Zhejiang Provincial Key Laboratory of Agricultural Resources and Environment, Institute of Soil and Water Resources and Environmental Science, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou China; Wu, Laosheng [Department of Environmental Sciences, University of California, Riverside California USA

    2016-10-01

    The ensemble Kalman filter (EnKF) has been widely used in parameter estimation for hydrological models. The focus of most previous studies was to develop more efficient analysis (estimation) algorithms. On the other hand, it is intuitively understandable that a well-designed sampling (data-collection) strategy should provide more informative measurements and subsequently improve the parameter estimation. In this work, a Sequential Ensemble-based Optimal Design (SEOD) method, coupled with EnKF, information theory and sequential optimal design, is proposed to improve the performance of parameter estimation. Based on the first-order and second-order statistics, different information metrics including the Shannon entropy difference (SD), degrees of freedom for signal (DFS) and relative entropy (RE) are used to design the optimal sampling strategy, respectively. The effectiveness of the proposed method is illustrated by synthetic one-dimensional and two-dimensional unsaturated flow case studies. It is shown that the designed sampling strategies can provide more accurate parameter estimation and state prediction compared with conventional sampling strategies. Optimal sampling designs based on various information metrics perform similarly in our cases. The effect of ensemble size on the optimal design is also investigated. Overall, larger ensemble size improves the parameter estimation and convergence of optimal sampling strategy. Although the proposed method is applied to unsaturated flow problems in this study, it can be equally applied in any other hydrological problems.

  20. Ensemble-based forecasting at Horns Rev: Ensemble conversion and kernel dressing

    DEFF Research Database (Denmark)

    Pinson, Pierre; Madsen, Henrik

    . The obtained ensemble forecasts of wind power are then converted into predictive distributions with an original adaptive kernel dressing method. The shape of the kernels is driven by a mean-variance model, the parameters of which are recursively estimated in order to maximize the overall skill of obtained...

  1. Evaluating Tropical Cyclone Forecast Track Uncertainty Using A Grand Ensemble of Ensemble Prediction Systems

    Science.gov (United States)

    2011-09-01

    variable is appropriately sized for the region ( UCAR 2010). 4. An Isotropic Joint-Ensemble Majumdar and Finochio (2010) develop a probability circle...Forecasting, 22, 671–675. UCAR , cited 2010: NCEP Perturbation Method. [Available online at http://www.meted.ucar.edu/nwp/pcu2/ens_matrix

  2. The MIP Ensemble Simulation: Local Ensemble Statistics in the Cosmic Web

    CERN Document Server

    Aragon-Calvo, M A

    2012-01-01

    Here we present a novel N-body simulation technique that allows us to compute ensemble statistics on a local basis, directly relating halo properties to their environment. This is achieved by the use of an ensemble simulation in which the otherwise independent realizations share the same fluctuations above a given cut-off scale. This produces a constrained ensemble where the LSS is common to all realizations while having an independent halo population. By generating a large number of semi-independent realizations we can effectively increase the local halo density by an arbitrary factor thus breaking the fundamental limit of the finite halo density (for a given halo mass range) determined by the halo mass function. This technique allows us to compute local ensemble statistics of the matter/halo distribution at a particular position in space, removing the intrinsic stochasticity in the halo formation process and directly relating halo properties to their environment. This is a major improvement over global desc...

  3. Deformed Gaussian Orthogonal Ensemble Analysis of the Interacting Boson Model

    CERN Document Server

    Pato, M P; Lima, C L; Hussein, M S; Alhassid, Y

    1994-01-01

    A Deformed Gaussian Orthogonal Ensemble (DGOE) which interpolates between the Gaussian Orthogonal Ensemble and a Poissonian Ensemble is constructed. This new ensemble is then applied to the analysis of the chaotic properties of the low lying collective states of nuclei described by the Interacting Boson Model (IBM). This model undergoes a transition order-chaos-order from the $SU(3)$ limit to the $O(6)$ limit. Our analysis shows that the quantum fluctuations of the IBM Hamiltonian, both of the spectrum and the eigenvectors, follow the expected behaviour predicted by the DGOE when one goes from one limit to the other.

  4. Bayesian ensemble refinement by replica simulations and reweighting.

    Science.gov (United States)

    Hummer, Gerhard; Köfinger, Jürgen

    2015-12-28

    We describe different Bayesian ensemble refinement methods, examine their interrelation, and discuss their practical application. With ensemble refinement, the properties of dynamic and partially disordered (bio)molecular structures can be characterized by integrating a wide range of experimental data, including measurements of ensemble-averaged observables. We start from a Bayesian formulation in which the posterior is a functional that ranks different configuration space distributions. By maximizing this posterior, we derive an optimal Bayesian ensemble distribution. For discrete configurations, this optimal distribution is identical to that obtained by the maximum entropy "ensemble refinement of SAXS" (EROS) formulation. Bayesian replica ensemble refinement enhances the sampling of relevant configurations by imposing restraints on averages of observables in coupled replica molecular dynamics simulations. We show that the strength of the restraints should scale linearly with the number of replicas to ensure convergence to the optimal Bayesian result in the limit of infinitely many replicas. In the "Bayesian inference of ensembles" method, we combine the replica and EROS approaches to accelerate the convergence. An adaptive algorithm can be used to sample directly from the optimal ensemble, without replicas. We discuss the incorporation of single-molecule measurements and dynamic observables such as relaxation parameters. The theoretical analysis of different Bayesian ensemble refinement approaches provides a basis for practical applications and a starting point for further investigations.

  5. Adiabatic Passage of Collective Excitations in Atomic Ensembles

    Institute of Scientific and Technical Information of China (English)

    LIYong; MIAOYuan-Xiu; SUNChang-Pu

    2004-01-01

    We describe a theoretical scheme that allows for transfer of quantum states of atomic collective excitation between two macroscopic atomic ensembles localized in two spatially-separated domains. The conception is based on the occurrence of double-exciton dark states due to the collective destructive quantum interference of the emissions from the two atomic ensembles. With an adiabatically coherence manipulation for the atom-field couplings by stimulated Rmann scattering, the dark states will extrapolate from an exciton state of an ensemble to that of another. This realizes the transport of quantum information among atomic ensembles.

  6. Adiabatic Passage of Collective Excitations in Atomic Ensembles

    Institute of Scientific and Technical Information of China (English)

    LI Yong; MIAO Yuan-Xiu; SUN Chang-Pu

    2004-01-01

    We describe a theoretical scheme that allows for transfer of quantum states of atomic collective excitation between two macroscopic atomic ensembles localized in two spatially-separated domains. The conception is based on the occurrence of double-exciton dark states due to the collective destructive quantum interference of the emissions from the two atomic ensembles. With an adiabatically coherence manipulation for the atom-field couplings by stimulated Ramann scattering, the dark states will extrapolate from an exciton state of an ensemble to that of another. This realizes the transport of quantum information among atomic ensembles.

  7. Relation between native ensembles and experimental structures of proteins

    DEFF Research Database (Denmark)

    Best, R. B.; Lindorff-Larsen, Kresten; DePristo, M. A.

    2006-01-01

    Different experimental structures of the same protein or of proteins with high sequence similarity contain many small variations. Here we construct ensembles of "high-sequence similarity Protein Data Bank" (HSP) structures and consider the extent to which such ensembles represent the structural...... Data Bank ensembles; moreover, we show that the effects of uncertainties in structure determination are insufficient to explain the results. These results highlight the importance of accounting for native-state protein dynamics in making comparisons with ensemble-averaged experimental data and suggest...

  8. Fractional exclusion statistics and the Random Matrix Boson Ensemble

    CERN Document Server

    Hernández-Quiroz, Saul; Benet, Luis; Flores, Jorge; Cocho, Germinal

    2012-01-01

    The k-body Gaussian Embedded Ensemble of Random Matrices is considered for N bosons distributed on two single-particle levels. When k = N, the ensemble is equivalent to the Gaussian Orthogonal Ensemble (GOE), and when k = 2 it corresponds to the Two-body Random Ensemble (TBRE) for bosons. It is shown that the energy spectrum leads to a rank function which is of the form of a discrete generalized beta distribution. The same distribution is obtained assuming N non-interacting quasiparticles that obey the fractional exclusion statistics introduced by Haldane two decades ago.

  9. Ensembler: Enabling High-Throughput Molecular Simulations at the Superfamily Scale

    Science.gov (United States)

    Parton, Daniel L.; Grinaway, Patrick B.; Hanson, Sonya M.; Beauchamp, Kyle A.; Chodera, John D.

    2016-01-01

    The rapidly expanding body of available genomic and protein structural data provides a rich resource for understanding protein dynamics with biomolecular simulation. While computational infrastructure has grown rapidly, simulations on an omics scale are not yet widespread, primarily because software infrastructure to enable simulations at this scale has not kept pace. It should now be possible to study protein dynamics across entire (super)families, exploiting both available structural biology data and conformational similarities across homologous proteins. Here, we present a new tool for enabling high-throughput simulation in the genomics era. Ensembler takes any set of sequences—from a single sequence to an entire superfamily—and shepherds them through various stages of modeling and refinement to produce simulation-ready structures. This includes comparative modeling to all relevant PDB structures (which may span multiple conformational states of interest), reconstruction of missing loops, addition of missing atoms, culling of nearly identical structures, assignment of appropriate protonation states, solvation in explicit solvent, and refinement and filtering with molecular simulation to ensure stable simulation. The output of this pipeline is an ensemble of structures ready for subsequent molecular simulations using computer clusters, supercomputers, or distributed computing projects like Folding@home. Ensembler thus automates much of the time-consuming process of preparing protein models suitable for simulation, while allowing scalability up to entire superfamilies. A particular advantage of this approach can be found in the construction of kinetic models of conformational dynamics—such as Markov state models (MSMs)—which benefit from a diverse array of initial configurations that span the accessible conformational states to aid sampling. We demonstrate the power of this approach by constructing models for all catalytic domains in the human tyrosine

  10. Ensembler: Enabling High-Throughput Molecular Simulations at the Superfamily Scale.

    Science.gov (United States)

    Parton, Daniel L; Grinaway, Patrick B; Hanson, Sonya M; Beauchamp, Kyle A; Chodera, John D

    2016-06-01

    The rapidly expanding body of available genomic and protein structural data provides a rich resource for understanding protein dynamics with biomolecular simulation. While computational infrastructure has grown rapidly, simulations on an omics scale are not yet widespread, primarily because software infrastructure to enable simulations at this scale has not kept pace. It should now be possible to study protein dynamics across entire (super)families, exploiting both available structural biology data and conformational similarities across homologous proteins. Here, we present a new tool for enabling high-throughput simulation in the genomics era. Ensembler takes any set of sequences-from a single sequence to an entire superfamily-and shepherds them through various stages of modeling and refinement to produce simulation-ready structures. This includes comparative modeling to all relevant PDB structures (which may span multiple conformational states of interest), reconstruction of missing loops, addition of missing atoms, culling of nearly identical structures, assignment of appropriate protonation states, solvation in explicit solvent, and refinement and filtering with molecular simulation to ensure stable simulation. The output of this pipeline is an ensemble of structures ready for subsequent molecular simulations using computer clusters, supercomputers, or distributed computing projects like Folding@home. Ensembler thus automates much of the time-consuming process of preparing protein models suitable for simulation, while allowing scalability up to entire superfamilies. A particular advantage of this approach can be found in the construction of kinetic models of conformational dynamics-such as Markov state models (MSMs)-which benefit from a diverse array of initial configurations that span the accessible conformational states to aid sampling. We demonstrate the power of this approach by constructing models for all catalytic domains in the human tyrosine kinase

  11. Ensembler: Enabling High-Throughput Molecular Simulations at the Superfamily Scale.

    Directory of Open Access Journals (Sweden)

    Daniel L Parton

    2016-06-01

    Full Text Available The rapidly expanding body of available genomic and protein structural data provides a rich resource for understanding protein dynamics with biomolecular simulation. While computational infrastructure has grown rapidly, simulations on an omics scale are not yet widespread, primarily because software infrastructure to enable simulations at this scale has not kept pace. It should now be possible to study protein dynamics across entire (superfamilies, exploiting both available structural biology data and conformational similarities across homologous proteins. Here, we present a new tool for enabling high-throughput simulation in the genomics era. Ensembler takes any set of sequences-from a single sequence to an entire superfamily-and shepherds them through various stages of modeling and refinement to produce simulation-ready structures. This includes comparative modeling to all relevant PDB structures (which may span multiple conformational states of interest, reconstruction of missing loops, addition of missing atoms, culling of nearly identical structures, assignment of appropriate protonation states, solvation in explicit solvent, and refinement and filtering with molecular simulation to ensure stable simulation. The output of this pipeline is an ensemble of structures ready for subsequent molecular simulations using computer clusters, supercomputers, or distributed computing projects like Folding@home. Ensembler thus automates much of the time-consuming process of preparing protein models suitable for simulation, while allowing scalability up to entire superfamilies. A particular advantage of this approach can be found in the construction of kinetic models of conformational dynamics-such as Markov state models (MSMs-which benefit from a diverse array of initial configurations that span the accessible conformational states to aid sampling. We demonstrate the power of this approach by constructing models for all catalytic domains in the human

  12. Cluster Ensemble-based Image Segmentation

    Directory of Open Access Journals (Sweden)

    Xiaoru Wang

    2013-07-01

    Full Text Available Image segmentation is the foundation of computer vision applications. In this paper, we propose a new\tcluster ensemble-based image\tsegmentation algorithm, which overcomes several problems of traditional methods. We make two main contributions in this paper. First, we introduce the cluster ensemble concept to fuse the segmentation results from different types of visual features effectively, which can deliver a better final result and achieve a much more stable performance for broad categories of images. Second, we exploit the PageRank idea from Internet applications and apply it to the image segmentation task. This can improve the final segmentation results by combining the spatial information of the image and the semantic similarity of regions. Our experiments on four public image databases validate the superiority of our algorithm over conventional single type of feature or multiple types of features-based algorithms, since our algorithm can fuse multiple types of features effectively for better segmentation results. Moreover, our method is also proved to be very competitive in comparison with other state-of-the-art segmentation algorithms.

  13. Online cross-validation-based ensemble learning.

    Science.gov (United States)

    Benkeser, David; Ju, Cheng; Lendle, Sam; van der Laan, Mark

    2017-05-04

    Online estimators update a current estimate with a new incoming batch of data without having to revisit past data thereby providing streaming estimates that are scalable to big data. We develop flexible, ensemble-based online estimators of an infinite-dimensional target parameter, such as a regression function, in the setting where data are generated sequentially by a common conditional data distribution given summary measures of the past. This setting encompasses a wide range of time-series models and, as special case, models for independent and identically distributed data. Our estimator considers a large library of candidate online estimators and uses online cross-validation to identify the algorithm with the best performance. We show that by basing estimates on the cross-validation-selected algorithm, we are asymptotically guaranteed to perform as well as the true, unknown best-performing algorithm. We provide extensions of this approach including online estimation of the optimal ensemble of candidate online estimators. We illustrate excellent performance of our methods using simulations and a real data example where we make streaming predictions of infectious disease incidence using data from a large database. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

  14. Nanobiosensing with Arrays and Ensembles of Nanoelectrodes

    Directory of Open Access Journals (Sweden)

    Najmeh Karimian

    2016-12-01

    Full Text Available Since the first reports dating back to the mid-1990s, ensembles and arrays of nanoelectrodes (NEEs and NEAs, respectively have gained an important role as advanced electroanalytical tools thank to their unique characteristics which include, among others, dramatically improved signal/noise ratios, enhanced mass transport and suitability for extreme miniaturization. From the year 2000 onward, these properties have been exploited to develop electrochemical biosensors in which the surfaces of NEEs/NEAs have been functionalized with biorecognition layers using immobilization modes able to take the maximum advantage from the special morphology and composite nature of their surface. This paper presents an updated overview of this field. It consists of two parts. In the first, we discuss nanofabrication methods and the principles of functioning of NEEs/NEAs, focusing, in particular, on those features which are important for the development of highly sensitive and miniaturized biosensors. In the second part, we review literature references dealing the bioanalytical and biosensing applications of sensors based on biofunctionalized arrays/ensembles of nanoelectrodes, focusing our attention on the most recent advances, published in the last five years. The goal of this review is both to furnish fundamental knowledge to researchers starting their activity in this field and provide critical information on recent achievements which can stimulate new ideas for future developments to experienced scientists.

  15. Hsaing Waing: Classical Ensemble of Myanmar

    Directory of Open Access Journals (Sweden)

    Chalermkit Kengkeaw

    2013-09-01

    Full Text Available Hsaing Waing is a classical music ensemble and a prominent culturalidentity of Myanmar. The Hsaing Waing ensemble consists of many instruments such as the Pat Waing, Muang Hsaing, Hne, Chauk Lon Bat, Byaung, Wa, Wallet Kok, Yakin, Si, and Mong. The earliest historical record of the Hsaing Waing is in 1544 where the Pat Waing and possibly the Hsaing Waing, was in royal service at the court of King Tabinshwehti of the Taungoo dynasty and prospered under the Kaunbaun dynasty up to colonial rule. During colonization, Hsaing Waing’s popularity declined but other innovations were introduced such as modern recording mediums and broadcasts which transferred the popularity of Hsaing Waing to a broader public audience and brought innovation to religious music, ceremonial rituals, fusion of westernmusical instruments such as the piano, violin and mandolin. The wealth of knowledge and numbers of connoisseur during the Kaunbaun dynasty led to the transfer of knowledge to many apprentices which were responsible for the development and adaptation and continuation of Hsaing Waing during colonization, socialism and independence. The transfer of knowledge was carried out by previous generations through apprentices, family members, close relatives and inspired individuals. The factors for the successful inheritance of Hsaing Waing are management, education, musicians and opportunity.

  16. Ensemble Kalman filtering with residual nudging

    Directory of Open Access Journals (Sweden)

    Xiaodong Luo

    2012-10-01

    Full Text Available Covariance inflation and localisation are two important techniques that are used to improve the performance of the ensemble Kalman filter (EnKF by (in effect adjusting the sample covariances of the estimates in the state space. In this work, an additional auxiliary technique, called residual nudging, is proposed to monitor and, if necessary, adjust the residual norms of state estimates in the observation space. In an EnKF with residual nudging, if the residual norm of an analysis is larger than a pre-specified value, then the analysis is replaced by a new one whose residual norm is no larger than a pre-specified value. Otherwise, the analysis is considered as a reasonable estimate and no change is made. A rule for choosing the pre-specified value is suggested. Based on this rule, the corresponding new state estimates are explicitly derived in case of linear observations. Numerical experiments in the 40-dimensional Lorenz 96 model show that introducing residual nudging to an EnKF may improve its accuracy and/or enhance its stability against filter divergence, especially in the small ensemble scenario.

  17. Ensemble Kalman filtering with residual nudging

    KAUST Repository

    Luo, X.

    2012-10-03

    Covariance inflation and localisation are two important techniques that are used to improve the performance of the ensemble Kalman filter (EnKF) by (in effect) adjusting the sample covariances of the estimates in the state space. In this work, an additional auxiliary technique, called residual nudging, is proposed to monitor and, if necessary, adjust the residual norms of state estimates in the observation space. In an EnKF with residual nudging, if the residual norm of an analysis is larger than a pre-specified value, then the analysis is replaced by a new one whose residual norm is no larger than a pre-specified value. Otherwise, the analysis is considered as a reasonable estimate and no change is made. A rule for choosing the pre-specified value is suggested. Based on this rule, the corresponding new state estimates are explicitly derived in case of linear observations. Numerical experiments in the 40-dimensional Lorenz 96 model show that introducing residual nudging to an EnKF may improve its accuracy and/or enhance its stability against filter divergence, especially in the small ensemble scenario.

  18. Extended Life Coolant Testing

    Science.gov (United States)

    2016-06-06

    ELC – Extended Life Coolant SCA – Supplemental Coolant Additive SOW – Scope of Work SwRI – Southwest Research Institute TARDEC – Tank Automotive...ethylene or propylene glycol and 35% extended life coolant #1 (ELC1) with a balance of water. At a higher ELC1 content of 45% or 50%, the mass loss...UNCLASSIFIED TABLE OF CONTENTS EXTENDED LIFE COOLANT TESTING INTERIM REPORT TFLRF No. 478 by Gregory A. T. Hansen Edwin A

  19. Extended icosahedral structures

    CERN Document Server

    Jaric, Marko V

    1989-01-01

    Extended Icosahedral Structures discusses the concepts about crystal structures with extended icosahedral symmetry. This book is organized into six chapters that focus on actual modeling of extended icosahedral crystal structures. This text first presents a tiling approach to the modeling of icosahedral quasiperiodic crystals. It then describes the models for icosahedral alloys based on random connections between icosahedral units, with particular emphasis on diffraction properties. Other chapters examine the glassy structures with only icosahedral orientational order and the extent of tra

  20. Effect of Doubling the Ensemble Size on the Performance of Ensemble Prediction in the Warm Season Using MOGREPS Implemented at the KMA

    Institute of Scientific and Technical Information of China (English)

    Jun Kyung KAY; Hyun Mee KIM; Young-Youn PARK; Joohyung SON

    2013-01-01

    Using the Met Office Global and Regional Ensemble Prediction System (MOGREPS) implemented at the Korea Meteorological Administration (KMA),the effect of doubling the ensemble size on the performance of ensemble prediction in the warm season was evaluated.Because a finite ensemble size causes sampling error in the full forecast probability distribution function (PDF),ensemble size is closely related to the efficiency of the ensemble prediction system.Prediction capability according to doubling the ensemble size was evaluated by increasing the number of ensembles from 24 to 48 in MOGREPS implemented at the KMA.The initial analysis perturbations generated by the Ensemble Transform Kalman Filter (ETKF) were integrated for 10 days from 22 May to 23 June 2009.Several statistical verification scores were used to measure the accuracy,reliability,and resolution of ensemble probabilistic forecasts for 24 and 48 ensemble member forecasts.Even though the results were not significant,the accuracy of ensemble prediction improved slightly as ensemble size increased,especially for longer forecast times in the Northern Hemisphere.While increasing the number of ensemble members resulted in a slight improvement in resolution as forecast time increased,inconsistent results were obtained for the scores assessing the reliability of ensemble prediction.The overall performance of ensemble prediction in terms of accuracy,resolution,and reliability increased slightly with ensemble size,especially for longer forecast times.

  1. A unified gene catalog for the laboratory mouse reference genome.

    Science.gov (United States)

    Zhu, Y; Richardson, J E; Hale, P; Baldarelli, R M; Reed, D J; Recla, J M; Sinclair, R; Reddy, T B K; Bult, C J

    2015-08-01

    We report here a semi-automated process by which mouse genome feature predictions and curated annotations (i.e., genes, pseudogenes, functional RNAs, etc.) from Ensembl, NCBI and Vertebrate Genome Annotation database (Vega) are reconciled with the genome features in the Mouse Genome Informatics (MGI) database (http://www.informatics.jax.org) into a comprehensive and non-redundant catalog. Our gene unification method employs an algorithm (fjoin--feature join) for efficient detection of genome coordinate overlaps among features represented in two annotation data sets. Following the analysis with fjoin, genome features are binned into six possible categories (1:1, 1:0, 0:1, 1:n, n:1, n:m) based on coordinate overlaps. These categories are subsequently prioritized for assessment of annotation equivalencies and differences. The version of the unified catalog reported here contains more than 59,000 entries, including 22,599 protein-coding coding genes, 12,455 pseudogenes, and 24,007 other feature types (e.g., microRNAs, lincRNAs, etc.). More than 23,000 of the entries in the MGI gene catalog have equivalent gene models in the annotation files obtained from NCBI, Vega, and Ensembl. 12,719 of the features are unique to NCBI relative to Ensembl/Vega; 11,957 are unique to Ensembl/Vega relative to NCBI, and 3095 are unique to MGI. More than 4000 genome features fall into categories that require manual inspection to resolve structural differences in the gene models from different annotation sources. Using the MGI unified gene catalog, researchers can easily generate a comprehensive report of mouse genome features from a single source and compare the details of gene and transcript structure using MGI's mouse genome browser.

  2. Quantum Extended Supersymmetries

    CERN Document Server

    Grigore, D R; Grigore, Dan Radu; Scharf, Gunter

    2003-01-01

    We analyse some quantum multiplets associated with extended supersymmetries. We study in detail the general form of the causal (anti)commutation relations. The condition of positivity of the scalar product imposes severe restrictions on the (quantum) model. It is problematic if one can find out quantum extensions of the standard model with extended supersymmetries.

  3. Extended Theories of Gravitation

    Directory of Open Access Journals (Sweden)

    Fatibene Lorenzo

    2013-09-01

    Full Text Available Extended theories of gravitation are naturally singled out by an analysis inspired by the Ehelers-Pirani-Schild framework. In this framework the structure of spacetime is described by a Weyl geometry which is enforced by dynamics. Standard General Relativity is just one possible theory within the class of extended theories of gravitation. Also all Palatini f(R theories are shown to be extended theories of gravitation. This more general setting allows a more general interpretation scheme and more general possible couplings between gravity and matter. The definitions and constructions of extended theories will be reviewed. A general interpretation scheme will be considered for extended theories and some examples will be considered.

  4. Stochastic methods for light propagation and recurrent scattering in saturated and nonsaturated atomic ensembles

    Science.gov (United States)

    Lee, Mark D.; Jenkins, Stewart D.; Ruostekoski, Janne

    2016-06-01

    We derive equations for the strongly coupled system of light and dense atomic ensembles. The formalism includes an arbitrary internal-level structure for the atoms and is not restricted to weak excitation of atoms by light. In the low-light-intensity limit for atoms with a single electronic ground state, the full quantum field-theoretical representation of the model can be solved exactly by means of classical stochastic electrodynamics simulations for stationary atoms that represent cold atomic ensembles. Simulations for the optical response of atoms in a quantum degenerate regime require one to synthesize a stochastic ensemble of atomic positions that generates the corresponding quantum statistical position correlations between the atoms. In the case of multiple ground levels or at light intensities where saturation becomes important, the classical simulations require approximations that neglect quantum fluctuations between the levels. We show how the model is extended to incorporate corrections due to quantum fluctuations that result from virtual scattering processes. In the low-light-intensity limit, we illustrate the simulations in a system of atoms in a Mott-insulator state in a two-dimensional optical lattice, where recurrent scattering of light induces strong interatomic correlations. These correlations result in collective many-atom subradiant and superradiant states and a strong dependence of the response on the spatial confinement within the lattice sites.

  5. Expanding the validity of the ensemble Kalman filter without the intrinsic need for inflation

    Directory of Open Access Journals (Sweden)

    M. Bocquet

    2015-07-01

    Full Text Available The ensemble Kalman filter (EnKF is a powerful data assimilation method meant for high-dimensional nonlinear systems. But its implementation requires fixes such as localization and inflation. The recently developed finite-size ensemble Kalman filter (EnKF-N does not require multiplicative inflation meant to counteract sampling errors. Aside from the practical interest of avoiding the tuning of inflation in perfect model data assimilation experiments, it also offers theoretical insights and a unique perspective on the EnKF. Here, we revisit, clarify and correct several key points of the EnKF-N derivation. This simplifies the use of the method, and expands its validity. The EnKF is shown to not only rely on the observations and the forecast ensemble but also on an implicit prior assumption, termed hyperprior, that fills in the gap of missing information. In the EnKF-N framework, this assumption is made explicit through a Bayesian hierarchy. This hyperprior has been so far chosen to be the uninformative Jeffreys' prior. Here, this choice is revisited to improve the performance of the EnKF-N in the regime where the analysis strongly relaxes to the prior. Moreover, it is shown that the EnKF-N can be extended with a normal-inverse-Wishart informative hyperprior that additionally introduces climatological error statistics. This can be identified as a hybrid 3D-Var/EnKF counterpart to the EnKF-N.

  6. Predicting gene function using hierarchical multi-label decision tree ensembles

    Directory of Open Access Journals (Sweden)

    Kocev Dragi

    2010-01-01

    Full Text Available Abstract Background S. cerevisiae, A. thaliana and M. musculus are well-studied organisms in biology and the sequencing of their genomes was completed many years ago. It is still a challenge, however, to develop methods that assign biological functions to the ORFs in these genomes automatically. Different machine learning methods have been proposed to this end, but it remains unclear which method is to be preferred in terms of predictive performance, efficiency and usability. Results We study the use of decision tree based models for predicting the multiple functions of ORFs. First, we describe an algorithm for learning hierarchical multi-label decision trees. These can simultaneously predict all the functions of an ORF, while respecting a given hierarchy of gene functions (such as FunCat or GO. We present new results obtained with this algorithm, showing that the trees found by it exhibit clearly better predictive performance than the trees found by previously described methods. Nevertheless, the predictive performance of individual trees is lower than that of some recently proposed statistical learning methods. We show that ensembles of such trees are more accurate than single trees and are competitive with state-of-the-art statistical learning and functional linkage methods. Moreover, the ensemble method is computationally efficient and easy to use. Conclusions Our results suggest that decision tree based methods are a state-of-the-art, efficient and easy-to-use approach to ORF function prediction.

  7. Thermal Insulation Distribution Pattern of Layered Clothing Ensemble

    Institute of Scientific and Technical Information of China (English)

    李俊; 韦鸿发; 刘岩; 张渭源

    2004-01-01

    With a thermal manikin, the distribution pattern of thermal insulation in multi-layered clothing ensemble is studied. It is found that the thermal insulation of multi-layered clothing ensemble has certain statistical relationship with the thermal insulation of each layer, and the prediction equation has been established.

  8. Building Identity in Collegiate Midlevel Choral Ensembles: The Director's Perspective

    Science.gov (United States)

    Major, Marci L.

    2017-01-01

    This study was designed to explore the director's perspective on the role organizational images play in social identity development in midlevel choral ensembles. Using a phenomenological methodology, I interviewed 10 current or former directors of midlevel choral ensembles from eight midwestern U.S. colleges and universities. Directors cited…

  9. Calculation of the chemical potential in the Gibbs ensemble

    NARCIS (Netherlands)

    Smit, B.; Frenkel, D.

    1989-01-01

    An expression for the chemical potential in the Gibbs ensemble is derived. For finite system sizes this expression for the chemical potential differs system-atically from Widom's test particle insertion method for the N, V, T ensemble. In order to compare these two methods for calculating the chemic

  10. A Comparison of Ensemble Kalman Filters for Storm Surge Assimilation

    KAUST Repository

    Altaf, Muhammad

    2014-08-01

    This study evaluates and compares the performances of several variants of the popular ensembleKalman filter for the assimilation of storm surge data with the advanced circulation (ADCIRC) model. Using meteorological data from Hurricane Ike to force the ADCIRC model on a domain including the Gulf ofMexico coastline, the authors implement and compare the standard stochastic ensembleKalman filter (EnKF) and three deterministic square root EnKFs: the singular evolutive interpolated Kalman (SEIK) filter, the ensemble transform Kalman filter (ETKF), and the ensemble adjustment Kalman filter (EAKF). Covariance inflation and localization are implemented in all of these filters. The results from twin experiments suggest that the square root ensemble filters could lead to very comparable performances with appropriate tuning of inflation and localization, suggesting that practical implementation details are at least as important as the choice of the square root ensemble filter itself. These filters also perform reasonably well with a relatively small ensemble size, whereas the stochastic EnKF requires larger ensemble sizes to provide similar accuracy for forecasts of storm surge.

  11. Ensemble Bayesian model averaging using Markov Chain Monte Carlo sampling

    NARCIS (Netherlands)

    Vrugt, J.A.; Diks, C.G.H.; Clark, M.

    2008-01-01

    Bayesian model averaging (BMA) has recently been proposed as a statistical method to calibrate forecast ensembles from numerical weather models. Successful implementation of BMA however, requires accurate estimates of the weights and variances of the individual competing models in the ensemble. In t

  12. Conductor and Ensemble Performance Expressivity and State Festival Ratings

    Science.gov (United States)

    Price, Harry E.; Chang, E. Christina

    2005-01-01

    This study is the second in a series examining the relationship between conducting and ensemble performance. The purpose was to further examine the associations among conductor, ensemble performance expressivity, and festival ratings. Participants were asked to rate the expressivity of video-only conducting and parallel audio-only excerpts from a…

  13. Ensembles and their modules as objects of cartosemiotic inquiry

    Directory of Open Access Journals (Sweden)

    Hansgeorg Schlichtmann

    2010-01-01

    Full Text Available The structured set of signs in a map face -- here called map-face aggregate or MFA -- and the associated marginal notes make up an ensemble of modules or components (modular ensemble. Such ensembles are recognized where groups of entries are intuitively viewed as complex units, which includes the case that entries are consulted jointly and thus are involved in the same process of sign reception. Modular ensembles are amenable to semiotic study, just as are written or pictorial stories. Four kinds (one of them mentioned above are discussed in detail, two involving single MFAs, the other two being assemblages of maps, such as atlases. In terms of their internal structure, two types are recognized: the combinate (or grouping, in which modules are directly linked by combinatorial relations (example above, and the cumulate (or collection (of documents, in which modules are indirectly related through some conceptual commonality (example: series of geological maps. The discussion then turns to basic points concerning modular ensembles (identification of a module, internal organization of an ensemble, and characteristics which establish an ensemble as a unit and further to a few general semiotic concepts as they relate to the present research. Since this paper originated as a reaction to several of A. Wolodtschenko’s recent publications, it concludes with comments on some of his arguments which pertain to modular ensembles.

  14. An iterative ensemble Kalman filter for reservoir engineering applications

    NARCIS (Netherlands)

    Krymskaya, M.V.; Hanea, R.G.; Verlaan, M.

    2009-01-01

    The study has been focused on examining the usage and the applicability of ensemble Kalman filtering techniques to the history matching procedures. The ensemble Kalman filter (EnKF) is often applied nowadays to solving such a problem. Meanwhile, traditional EnKF requires assumption of the

  15. Ensemble Methods for Classification of Physical Activities from Wrist Accelerometry.

    Science.gov (United States)

    Chowdhury, Alok Kumar; Tjondronegoro, Dian; Chandran, Vinod; Trost, Stewart G

    2017-09-01

    To investigate whether the use of ensemble learning algorithms improve physical activity recognition accuracy compared to the single classifier algorithms, and to compare the classification accuracy achieved by three conventional ensemble machine learning methods (bagging, boosting, random forest) and a custom ensemble model comprising four algorithms commonly used for activity recognition (binary decision tree, k nearest neighbor, support vector machine, and neural network). The study used three independent data sets that included wrist-worn accelerometer data. For each data set, a four-step classification framework consisting of data preprocessing, feature extraction, normalization and feature selection, and classifier training and testing was implemented. For the custom ensemble, decisions from the single classifiers were aggregated using three decision fusion methods: weighted majority vote, naïve Bayes combination, and behavior knowledge space combination. Classifiers were cross-validated using leave-one subject out cross-validation and compared on the basis of average F1 scores. In all three data sets, ensemble learning methods consistently outperformed the individual classifiers. Among the conventional ensemble methods, random forest models provided consistently high activity recognition; however, the custom ensemble model using weighted majority voting demonstrated the highest classification accuracy in two of the three data sets. Combining multiple individual classifiers using conventional or custom ensemble learning methods can improve activity recognition accuracy from wrist-worn accelerometer data.

  16. Competitive Learning Neural Network Ensemble Weighted by Predicted Performance

    Science.gov (United States)

    Ye, Qiang

    2010-01-01

    Ensemble approaches have been shown to enhance classification by combining the outputs from a set of voting classifiers. Diversity in error patterns among base classifiers promotes ensemble performance. Multi-task learning is an important characteristic for Neural Network classifiers. Introducing a secondary output unit that receives different…

  17. Exact ensemble density-functional theory for excited states

    CERN Document Server

    Yang, Zeng-hui; Pribram-Jones, Aurora; Burke, Kieron; Needs, Richard J; Ullrich, Carsten A

    2014-01-01

    We construct exact Kohn-Sham potentials for the ensemble density-functional theory (EDFT) of excited states from the ground and excited states of helium. The exchange-correlation potential is compared with current approximations, which miss prominent features. The ensemble derivative discontinuity is tested, and the virial theorem is proven and illustrated.

  18. An iterative ensemble Kalman filter for reservoir engineering applications

    NARCIS (Netherlands)

    Krymskaya, M.V.; Hanea, R.G.; Verlaan, M.

    2009-01-01

    The study has been focused on examining the usage and the applicability of ensemble Kalman filtering techniques to the history matching procedures. The ensemble Kalman filter (EnKF) is often applied nowadays to solving such a problem. Meanwhile, traditional EnKF requires assumption of the distributi

  19. Competitive Learning Neural Network Ensemble Weighted by Predicted Performance

    Science.gov (United States)

    Ye, Qiang

    2010-01-01

    Ensemble approaches have been shown to enhance classification by combining the outputs from a set of voting classifiers. Diversity in error patterns among base classifiers promotes ensemble performance. Multi-task learning is an important characteristic for Neural Network classifiers. Introducing a secondary output unit that receives different…

  20. Modality-Driven Classification and Visualization of Ensemble Variance

    Energy Technology Data Exchange (ETDEWEB)

    Bensema, Kevin; Gosink, Luke; Obermaier, Harald; Joy, Kenneth I.

    2016-10-01

    Advances in computational power now enable domain scientists to address conceptual and parametric uncertainty by running simulations multiple times in order to sufficiently sample the uncertain input space. While this approach helps address conceptual and parametric uncertainties, the ensemble datasets produced by this technique present a special challenge to visualization researchers as the ensemble dataset records a distribution of possible values for each location in the domain. Contemporary visualization approaches that rely solely on summary statistics (e.g., mean and variance) cannot convey the detailed information encoded in ensemble distributions that are paramount to ensemble analysis; summary statistics provide no information about modality classification and modality persistence. To address this problem, we propose a novel technique that classifies high-variance locations based on the modality of the distribution of ensemble predictions. Additionally, we develop a set of confidence metrics to inform the end-user of the quality of fit between the distribution at a given location and its assigned class. We apply a similar method to time-varying ensembles to illustrate the relationship between peak variance and bimodal or multimodal behavior. These classification schemes enable a deeper understanding of the behavior of the ensemble members by distinguishing between distributions that can be described by a single tendency and distributions which reflect divergent trends in the ensemble.

  1. Effect of lateral boundary perturbations on the breeding method and the local ensemble transform Kalman filter for mesoscale ensemble prediction

    Directory of Open Access Journals (Sweden)

    Kazuo Saito

    2012-01-01

    Full Text Available The effect of lateral boundary perturbations (LBPs on the mesoscale breeding (MBD method and the local ensemble transform Kalman filter (LETKF as the initial perturbations generators for mesoscale ensemble prediction systems (EPSs was examined. A LBPs method using the Japan Meteorological Agency's (JMA's operational one-week global ensemble prediction was developed and applied to the mesoscale EPS of the Meteorological Research Institute for the World Weather Research Programme, Beijing 2008 Olympics Research and Development Project. The amplitude of the LBPs was adjusted based on the ensemble spread statistics considering the difference of the forecast times of the JMA's one-week EPS and the associated breeding/ensemble Kalman filter (EnKF cycles. LBPs in the ensemble forecast increase the ensemble spread and improve the accuracy of the ensemble mean forecast. In the MBD method, if LBPs were introduced in its breeding cycles, the growth rate of the generated bred vectors is increased, and the ensemble spread and the root mean square errors (RMSEs of the ensemble mean are further improved in the ensemble forecast. With LBPs in the breeding cycles, positional correspondences to the meteorological disturbances and the orthogonality of the bred vectors are improved. Brier Skill Scores (BSSs also showed a remarkable effect of LBPs in the breeding cycles. LBPs showed a similar effect with the LETKF. If LBPs were introduced in the EnKF data assimilation cycles, the ensemble spread, ensemble mean accuracy, and BSSs for precipitation were improved, although the relative advantage of LETKF as the initial perturbations generator against MDB was not necessarily clear. LBPs in the EnKF cycles contribute not to the orthogonalisation but to prevent the underestimation of the forecast error near the lateral boundary.The accuracy of the LETKF analyses was compared with that of the mesoscale 4D-VAR analyses. With LBPs in the LETKF cycles, the RMSEs of the

  2. Ensemble Streamflow Forecast Improvements in NYC's Operations Support Tool

    Science.gov (United States)

    Wang, L.; Weiss, W. J.; Porter, J.; Schaake, J. C.; Day, G. N.; Sheer, D. P.

    2013-12-01

    Like most other water supply utilities, New York City's Department of Environmental Protection (DEP) has operational challenges associated with drought and wet weather events. During drought conditions, DEP must maintain water supply reliability to 9 million customers as well as meet environmental release requirements downstream of its reservoirs. During and after wet weather events, DEP must maintain turbidity compliance in its unfiltered Catskill and Delaware reservoir systems and minimize spills to mitigate downstream flooding. Proactive reservoir management - such as release restrictions to prepare for a drought or preventative drawdown in advance of a large storm - can alleviate negative impacts associated with extreme events. It is important for water managers to understand the risks associated with proactive operations so unintended consequences such as endangering water supply reliability with excessive drawdown prior to a storm event are minimized. Probabilistic hydrologic forecasts are a critical tool in quantifying these risks and allow water managers to make more informed operational decisions. DEP has recently completed development of an Operations Support Tool (OST) that integrates ensemble streamflow forecasts, real-time observations, and a reservoir system operations model into a user-friendly graphical interface that allows its water managers to take robust and defensible proactive measures in the face of challenging system conditions. Since initial development of OST was first presented at the 2011 AGU Fall Meeting, significant improvements have been made to the forecast system. First, the monthly AR1 forecasts ('Hirsch method') were upgraded with a generalized linear model (GLM) utilizing historical daily correlations ('Extended Hirsch method' or 'eHirsch'). The development of eHirsch forecasts improved predictive skill over the Hirsch method in the first week to a month from the forecast date and produced more realistic hydrographs on the tail

  3. Data assimilation the ensemble Kalman filter

    CERN Document Server

    Evensen, Geir

    2007-01-01

    Data Assimilation comprehensively covers data assimilation and inverse methods, including both traditional state estimation and parameter estimation. This text and reference focuses on various popular data assimilation methods, such as weak and strong constraint variational methods and ensemble filters and smoothers. It is demonstrated how the different methods can be derived from a common theoretical basis, as well as how they differ and/or are related to each other, and which properties characterize them, using several examples. Rather than emphasize a particular discipline such as oceanography or meteorology, it presents the mathematical framework and derivations in a way which is common for any discipline where dynamics is merged with measurements. The mathematics level is modest, although it requires knowledge of basic spatial statistics, Bayesian statistics, and calculus of variations. Readers will also appreciate the introduction to the mathematical methods used and detailed derivations, which should b...

  4. China’s First Modern Dance Ensemble

    Institute of Scientific and Technical Information of China (English)

    1992-01-01

    After four years’hardwork by both Chineseand foreign artiststhe Guangdong ExperimentalModern Dance Ensemble,thefirst of its kind in China,wasestablished on June 6,1992,in the Friendship Theater ofGuangzhou.Ms、Yang Meiqi,a famous Chinese folk danceeducator,was chosen as headand Mr.Willy Tsao,a famousyoung Hongkong dancer,as artisticdirector.China’s Central TV Stationreported the news.Recommended by Ms.ChiangChing,a Chinese-American dancer,Yang Meiqi went to Durham,NorthCarolina,in the United States in thesummer of 1986.to attend the Amer-ican Dance Festival.The moderndances put on during the festivalfascinated her with their universal“language,”flexible movement,cho-reography and scientific training.“Isn’t this just What China’s dance

  5. ARM Cloud Retrieval Ensemble Data Set (ACRED)

    Energy Technology Data Exchange (ETDEWEB)

    Zhao, C; Xie, S; Klein, SA; McCoy, R; Comstock, JM; Delanoë, J; Deng, M; Dunn, M; Hogan, RJ; Jensen, MP; Mace, GG; McFarlane, SA; O’Connor, EJ; Protat, A; Shupe, MD; Turner, D; Wang, Z

    2011-09-12

    This document describes a new Atmospheric Radiation Measurement (ARM) data set, the ARM Cloud Retrieval Ensemble Data Set (ACRED), which is created by assembling nine existing ground-based cloud retrievals of ARM measurements from different cloud retrieval algorithms. The current version of ACRED includes an hourly average of nine ground-based retrievals with vertical resolution of 45 m for 512 layers. The techniques used for the nine cloud retrievals are briefly described in this document. This document also outlines the ACRED data availability, variables, and the nine retrieval products. Technical details about the generation of ACRED, such as the methods used for time average and vertical re-grid, are also provided.

  6. Hindcast experiments of ensemble streamflow forecasting for the Paraopeba river (Brazil)

    Science.gov (United States)

    Collischonn, W.; Meller, A.; Dias, P. L. S.; Moreira, D. S.

    2012-04-01

    Streamflow forecasts are routinely produced and used in Brazil to predict inflow to major hydropower reservoirs . In this field of application quantitative precipitation forecasts are becoming increasingly used to extend the range and increase the skill of streamflow forecasts. Forecasting systems designed to provide flood alert, on the other side, are relatively rare in Brazil, and are often based on simplified river routing models. However, a number of recent floods with significant loss of lives and economical impact is now motivating the creation of a new governmental institution dedicated to natural disaster and flood forecasting. This will further motivate the incorporation of numerical weather predictions (NWP) as input data to hydrological flood forecasting models, with the aim of increasing forecast lead time. In this context ensemble meteorological forecasts will be increasingly useful, since it is expected that ensembles can give some idea of the confidence level of the forecasts, and that extremes can be better captured by a high number of NWP runs with different initial conditions, or with different meteorological models. Silva Dias and Moreira (2006) organized a grand ensemble including several different models and model members for South America. We used forecasts of individual models of this grand ensemble to run a series of streamflow hindcast experiments (in forecast mode), using the MGB-IPH hydrological model. These tests were conducted in the Paraopeba river basin, which is a tributary of the São Francisco river, located in Minas Gerais State, in a Tropical region in the range from 21 S to 19 S. Results of 72 hour streamflow forecasts were compared to hourly observed discharge at Porto Mesquita gauging station, were the drainage area is 10280 square kilometers, during the Austral Summer of 2011. Results were assessed by visual inspection of hydrographs and by the analysis of a number of summary statistics. These preliminary results suggest that

  7. An educational model for ensemble streamflow simulation and uncertainty analysis

    Directory of Open Access Journals (Sweden)

    A. AghaKouchak

    2013-02-01

    Full Text Available This paper presents the hands-on modeling toolbox, HBV-Ensemble, designed as a complement to theoretical hydrology lectures, to teach hydrological processes and their uncertainties. The HBV-Ensemble can be used for in-class lab practices and homework assignments, and assessment of students' understanding of hydrological processes. Using this modeling toolbox, students can gain more insights into how hydrological processes (e.g., precipitation, snowmelt and snow accumulation, soil moisture, evapotranspiration and runoff generation are interconnected. The educational toolbox includes a MATLAB Graphical User Interface (GUI and an ensemble simulation scheme that can be used for teaching uncertainty analysis, parameter estimation, ensemble simulation and model sensitivity. HBV-Ensemble was administered in a class for both in-class instruction and a final project, and students submitted their feedback about the toolbox. The results indicate that this educational software had a positive impact on students understanding and knowledge of uncertainty in hydrological modeling.

  8. Ensemble inequivalence: Landau theory and the ABC model

    Science.gov (United States)

    Cohen, O.; Mukamel, D.

    2012-12-01

    It is well known that systems with long-range interactions may exhibit different phase diagrams when studied within two different ensembles. In many of the previously studied examples of ensemble inequivalence, the phase diagrams differ only when the transition in one of the ensembles is first order. By contrast, in a recent study of a generalized ABC model, the canonical and grand-canonical ensembles of the model were shown to differ even when they both exhibit a continuous transition. Here we show that the order of the transition where ensemble inequivalence may occur is related to the symmetry properties of the order parameter associated with the transition. This is done by analyzing the Landau expansion of a generic model with long-range interactions. The conclusions drawn from the generic analysis are demonstrated for the ABC model by explicit calculation of its Landau expansion.

  9. Excitations and benchmark ensemble density functional theory for two electrons

    CERN Document Server

    Pribram-Jones, Aurora; Trail, John R; Burke, Kieron; Needs, Richard J; Ullrich, Carsten A

    2014-01-01

    A new method for extracting ensemble Kohn-Sham potentials from accurate excited state densities is applied to a variety of two electron systems, exploring the behavior of exact ensemble density functional theory. The issue of separating the Hartree energy and the choice of degenerate eigenstates is explored. A new approximation, spin eigenstate Hartree-exchange (SEHX), is derived. Exact conditions that are proven include the signs of the correlation energy components, the virial theorem for both exchange and correlation, and the asymptotic behavior of the potential for small weights of the excited states. Many energy components are given as a function of the weights for two electrons in a one-dimensional flat box, in a box with a large barrier to create charge transfer excitations, in a three-dimensional harmonic well (Hooke's atom), and for the He atom singlet-triplet ensemble, singlet-triplet-singlet ensemble, and triplet bi-ensemble.

  10. Excitations and benchmark ensemble density functional theory for two electrons

    Energy Technology Data Exchange (ETDEWEB)

    Pribram-Jones, Aurora; Burke, Kieron [Department of Chemistry, University of California-Irvine, Irvine, California 92697 (United States); Yang, Zeng-hui; Ullrich, Carsten A. [Department of Physics and Astronomy, University of Missouri, Columbia, Missouri 65211 (United States); Trail, John R.; Needs, Richard J. [Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, Cambridge CB3 0HE (United Kingdom)

    2014-05-14

    A new method for extracting ensemble Kohn-Sham potentials from accurate excited state densities is applied to a variety of two-electron systems, exploring the behavior of exact ensemble density functional theory. The issue of separating the Hartree energy and the choice of degenerate eigenstates is explored. A new approximation, spin eigenstate Hartree-exchange, is derived. Exact conditions that are proven include the signs of the correlation energy components and the asymptotic behavior of the potential for small weights of the excited states. Many energy components are given as a function of the weights for two electrons in a one-dimensional flat box, in a box with a large barrier to create charge transfer excitations, in a three-dimensional harmonic well (Hooke's atom), and for the He atom singlet-triplet ensemble, singlet-triplet-singlet ensemble, and triplet bi-ensemble.

  11. Adaptive calibration of (u,v)‐wind ensemble forecasts

    DEFF Research Database (Denmark)

    Pinson, Pierre

    2012-01-01

    Ensemble forecasts of (u,v)‐wind are of crucial importance for a number of decision‐making problems related to e.g. air traffic control, ship routeing and energy management. The skill of these ensemble forecasts as generated by NWP‐based models can be maximised by correcting for their lack...... of sufficient reliability. The original framework introduced here allows for an adaptive bivariate calibration of these ensemble forecasts. The originality of this methodology lies in the fact that calibrated ensembles still consist of a set of (space–time) trajectories, after translation and dilation...... on the adaptive calibration of ECMWF ensemble forecasts of (u,v)‐wind at 10 m above ground level over Europe over a three‐year period between December 2006 and December 2009. Substantial improvements in (bivariate) reliability and in various deterministic/probabilistic scores are observed. Finally, the maps...

  12. Induced Ginibre ensemble of random matrices and quantum operations

    CERN Document Server

    Fischmann, J; Khoruzhenko, B A; Sommers, H -J; Zyczkowski, K

    2011-01-01

    A generalisation of the Ginibre ensemble of non-Hermitian random square matrices is introduced. The corresponding probability measure is induced by the ensemble of rectangular Gaussian matrices via a quadratisation procedure. We derive the joint probability density of eigenvalues for such induced Ginibre ensemble and study various spectral correlation functions for complex and real matrices, and analyse universal behaviour in the limit of large dimensions. In this limit the eigenvalues of the induced Ginibre ensemble cover uniformly a ring in the complex plane. The real induced Ginibre ensemble is shown to be useful to describe statistical properties of evolution operators associated with random quantum operations, for which the dimensions of the input state and the output state do differ.

  13. Discrete post-processing of total cloud cover ensemble forecasts

    Science.gov (United States)

    Hemri, Stephan; Haiden, Thomas; Pappenberger, Florian

    2017-04-01

    This contribution presents an approach to post-process ensemble forecasts for the discrete and bounded weather variable of total cloud cover. Two methods for discrete statistical post-processing of ensemble predictions are tested. The first approach is based on multinomial logistic regression, the second involves a proportional odds logistic regression model. Applying them to total cloud cover raw ensemble forecasts from the European Centre for Medium-Range Weather Forecasts improves forecast skill significantly. Based on station-wise post-processing of raw ensemble total cloud cover forecasts for a global set of 3330 stations over the period from 2007 to early 2014, the more parsimonious proportional odds logistic regression model proved to slightly outperform the multinomial logistic regression model. Reference Hemri, S., Haiden, T., & Pappenberger, F. (2016). Discrete post-processing of total cloud cover ensemble forecasts. Monthly Weather Review 144, 2565-2577.

  14. Emergence of overlap in ensembles of spatial multiplexes and statistical mechanics of spatial interacting networks ensembles

    CERN Document Server

    Halu, Arda; Bianconi, Ginestra

    2013-01-01

    Spatial networks range from the brain networks, to transportation networks and infrastructures. Recently interacting and multiplex networks are attracting great attention because their dynamics and robustness cannot be understood without treating at the same time several networks. Here we present maximal entropy ensembles of spatial multiplex and spatial interacting networks that can be used in order to model spatial multilayer network structures and to build null models of real datasets. We show that spatial multiplex naturally develop a significant overlap of the links, a noticeable property of many multiplexes that can affect significantly the dynamics taking place on them. Additionally, we characterize ensembles of spatial interacting networks and we analyse the structure of interacting airport and railway networks in India, showing the effect of space in determining the link probability.

  15. Evolution of precipitation extremes in two large ensembles of climate simulations

    Science.gov (United States)

    Martel, Jean-Luc; Mailhot, Alain; Talbot, Guillaume; Brissette, François; Ludwig, Ralf; Frigon, Anne; Leduc, Martin; Turcotte, Richard

    2017-04-01

    Recent studies project significant changes in the future distribution of precipitation extremes due to global warming. It is likely that extreme precipitation intensity will increase in a future climate and that extreme events will be more frequent. In this work, annual maxima daily precipitation series from the Canadian Earth System Model (CanESM2) 50-member large ensemble (spatial resolution of 2.8°x2.8°) and the Community Earth System Model (CESM1) 40-member large ensemble (spatial resolution of 1°x1°) are used to investigate extreme precipitation over the historical (1980-2010) and future (2070-2100) periods. The use of these ensembles results in respectively 1 500 (30 years x 50 members) and 1200 (30 years x 40 members) simulated years over both the historical and future periods. These large datasets allow the computation of empirical daily extreme precipitation quantiles for large return periods. Using the CanESM2 and CESM1 large ensembles, extreme daily precipitation with return periods ranging from 2 to 100 years are computed in historical and future periods to assess the impact of climate change. Results indicate that daily precipitation extremes generally increase in the future over most land grid points and that these increases will also impact the 100-year extreme daily precipitation. Considering that many public infrastructures have lifespans exceeding 75 years, the increase in extremes has important implications on service levels of water infrastructures and public safety. Estimated increases in precipitation associated to very extreme precipitation events (e.g. 100 years) will drastically change the likelihood of flooding and their extent in future climate. These results, although interesting, need to be extended to sub-daily durations, relevant for urban flooding protection and urban infrastructure design (e.g. sewer networks, culverts). Models and simulations at finer spatial and temporal resolution are therefore needed.

  16. Ensemble Bayesian forecasting system Part I: Theory and algorithms

    Science.gov (United States)

    Herr, Henry D.; Krzysztofowicz, Roman

    2015-05-01

    The ensemble Bayesian forecasting system (EBFS), whose theory was published in 2001, is developed for the purpose of quantifying the total uncertainty about a discrete-time, continuous-state, non-stationary stochastic process such as a time series of stages, discharges, or volumes at a river gauge. The EBFS is built of three components: an input ensemble forecaster (IEF), which simulates the uncertainty associated with random inputs; a deterministic hydrologic model (of any complexity), which simulates physical processes within a river basin; and a hydrologic uncertainty processor (HUP), which simulates the hydrologic uncertainty (an aggregate of all uncertainties except input). It works as a Monte Carlo simulator: an ensemble of time series of inputs (e.g., precipitation amounts) generated by the IEF is transformed deterministically through a hydrologic model into an ensemble of time series of outputs, which is next transformed stochastically by the HUP into an ensemble of time series of predictands (e.g., river stages). Previous research indicated that in order to attain an acceptable sampling error, the ensemble size must be on the order of hundreds (for probabilistic river stage forecasts and probabilistic flood forecasts) or even thousands (for probabilistic stage transition forecasts). The computing time needed to run the hydrologic model this many times renders the straightforward simulations operationally infeasible. This motivates the development of the ensemble Bayesian forecasting system with randomization (EBFSR), which takes full advantage of the analytic meta-Gaussian HUP and generates multiple ensemble members after each run of the hydrologic model; this auxiliary randomization reduces the required size of the meteorological input ensemble and makes it operationally feasible to generate a Bayesian ensemble forecast of large size. Such a forecast quantifies the total uncertainty, is well calibrated against the prior (climatic) distribution of

  17. The dialogically extended mind

    DEFF Research Database (Denmark)

    Fusaroli, Riccardo; Gangopadhyay, Nivedita; Tylén, Kristian

    2014-01-01

    A growing conceptual and empirical literature is advancing the idea that language extends our cognitive skills. One of the most influential positions holds that language – qua material symbols – facilitates individual thought processes by virtue of its material properties. Extending upon this model......, we argue that language enhances our cognitive capabilities in a much more radical way: The skilful engagement of public material symbols facilitates evolutionarily unprecedented modes of collective perception, action and reasoning (interpersonal synergies) creating dialogically extended minds. We...... relate our approach to other ideas about collective minds and review a number of empirical studies to identify the mechanisms enabling the constitution of interpersonal cognitive systems....

  18. The Extended Enterprise concept

    DEFF Research Database (Denmark)

    Larsen, Lars Bjørn; Vesterager, Johan; Gobbi, Chiara

    1999-01-01

    This paper provides an overview of the work that has been done regarding the Extended Enterprise concept in the Common Concept team of Globeman 21 including references to results deliverables concerning the development of the Extended Enterprise concept. The first section presents the basic concept...... picture from Globeman21, which illustrates the Globeman21 way of realising the Extended Enterprise concept. The second section presents the Globeman21 EE concept in a life cycle perspective, which to a large extent is based on the thoughts and ideas behind GERAM (ISO/DIS 15704)....

  19. The Lagrangian Ensemble metamodel for simulating plankton ecosystems

    Science.gov (United States)

    Woods, J. D.

    2005-10-01

    This paper presents a detailed account of the Lagrangian Ensemble (LE) metamodel for simulating plankton ecosystems. It uses agent-based modelling to describe the life histories of many thousands of individual plankters. The demography of each plankton population is computed from those life histories. So too is bio-optical and biochemical feedback to the environment. The resulting “virtual ecosystem” is a comprehensive simulation of the plankton ecosystem. It is based on phenotypic equations for individual micro-organisms. LE modelling differs significantly from population-based modelling. The latter uses prognostic equations to compute demography and biofeedback directly. LE modelling diagnoses them from the properties of individual micro-organisms, whose behaviour is computed from prognostic equations. That indirect approach permits the ecosystem to adjust gracefully to changes in exogenous forcing. The paper starts with theory: it defines the Lagrangian Ensemble metamodel and explains how LE code performs a number of computations “behind the curtain”. They include budgeting chemicals, and deriving biofeedback and demography from individuals. The next section describes the practice of LE modelling. It starts with designing a model that complies with the LE metamodel. Then it describes the scenario for exogenous properties that provide the computation with initial and boundary conditions. These procedures differ significantly from those used in population-based modelling. The next section shows how LE modelling is used in research, teaching and planning. The practice depends largely on hindcasting to overcome the limits to predictability of weather forecasting. The scientific method explains observable ecosystem phenomena in terms of finer-grained processes that cannot be observed, but which are controlled by the basic laws of physics, chemistry and biology. What-If? Prediction ( WIP), used for planning, extends hindcasting by adding events that describe

  20. Spin storage in quantum dot ensembles and single quantum dots

    Energy Technology Data Exchange (ETDEWEB)

    Heiss, Dominik

    2009-10-15

    This thesis deals with the investigation of spin relaxation of electrons and holes in small ensembles of self-assembled quantum dots using optical techniques. Furthermore, a method to detect the spin orientation in a single quantum dot was developed in the framework of this thesis. A spin storage device was used to optically generate oriented electron spins in small frequency selected quantum dot ensembles using circularly polarized optical excitation. The spin orientation can be determined by the polarization of the time delayed electroluminescence signal generated by the device after a continuously variable storage time. The degree of spin polarized initialization was found to be limited to 0.6 at high magnetic fields, where anisotropic effects are compensated. The spin relaxation was directly measured as a function of magnetic field, lattice temperature and s-shell transition energy of the quantum dot by varying the spin storage time up to 30 ms. Very long spin lifetimes are obtained with a lower limit of T{sub 1}=20 ms at B=4 T and T=1 K. A strong magnetic field dependence T{sub 1}{proportional_to}B{sup -5} has been observed for low temperatures of T=1 K which weakens as the temperature is increased. In addition, the temperature dependence has been determined with T{sub 1}{proportional_to}T{sup -1}. The characteristic dependencies on magnetic field and temperature lead to the identification of the spin relaxation mechanism, which is governed by spin-orbit coupling and mediated by single phonon scattering. This finding is qualitatively supported by the energy dependent measurements. The investigations were extended to a modified device design that enabled studying the spin relaxation dynamics of heavy holes in self-assembled quantum dots. The measurements show a polarization memory effect for holes with up to 0.1 degree of polarization. Furthermore, investigations of the time dynamics of the hole spin relaxation reveal surprisingly long lifetimes T{sub 1}{sup h

  1. Quantum Control of Open Systems and Dense Atomic Ensembles

    Science.gov (United States)

    DiLoreto, Christopher

    Controlling the dynamics of open quantum systems; i.e. quantum systems that decohere because of interactions with the environment, is an active area of research with many applications in quantum optics and quantum computation. My thesis expands the scope of this inquiry by seeking to control open systems in proximity to an additional system. The latter could be a classical system such as metal nanoparticles, or a quantum system such as a cluster of similar atoms. By modelling the interactions between the systems, we are able to expand the accessible state space of the quantum system in question. For a single, three-level quantum system, I examine isolated systems that have only normal spontaneous emission. I then show that intensity-intensity correlation spectra, which depend directly on the density matrix of the system, can be used detect whether transitions share a common energy level. This detection is possible due to the presence of quantum interference effects between two transitions if they are connected. This effect allows one to asses energy level structure diagrams in complex atoms/molecules. By placing an open quantum system near a nanoparticle dimer, I show that the spontaneous emission rate of the system can be changed "on demand" by changing the polarization of an incident, driving field. In a three-level, Lambda system, this allows a qubit to both retain high qubit fidelity when it is operating, and to be rapidly initialized to a pure state once it is rendered unusable by decoherence. This type of behaviour is not possible in a single open quantum system; therefore adding a classical system nearby extends the overall control space of the quantum system. An open quantum system near identical neighbours in a dense ensemble is another example of how the accessible state space can be expanded. I show that a dense ensemble of atoms rapidly becomes disordered with states that are not directly excited by an incident field becoming significantly populated

  2. Robust Ensemble Filtering and Its Relation to Covariance Inflation in the Ensemble Kalman Filter

    KAUST Repository

    Luo, Xiaodong

    2011-12-01

    A robust ensemble filtering scheme based on the H∞ filtering theory is proposed. The optimal H∞ filter is derived by minimizing the supremum (or maximum) of a predefined cost function, a criterion different from the minimum variance used in the Kalman filter. By design, the H∞ filter is more robust than the Kalman filter, in the sense that the estimation error in the H∞ filter in general has a finite growth rate with respect to the uncertainties in assimilation, except for a special case that corresponds to the Kalman filter. The original form of the H∞ filter contains global constraints in time, which may be inconvenient for sequential data assimilation problems. Therefore a variant is introduced that solves some time-local constraints instead, and hence it is called the time-local H∞ filter (TLHF). By analogy to the ensemble Kalman filter (EnKF), the concept of ensemble time-local H∞ filter (EnTLHF) is also proposed. The general form of the EnTLHF is outlined, and some of its special cases are discussed. In particular, it is shown that an EnKF with certain covariance inflation is essentially an EnTLHF. In this sense, the EnTLHF provides a general framework for conducting covariance inflation in the EnKF-based methods. Some numerical examples are used to assess the relative robustness of the TLHF–EnTLHF in comparison with the corresponding KF–EnKF method.

  3. Assessing the impact of land use change on hydrology by ensemble modelling (LUCHEM) II: Ensemble combinations and predictions

    Science.gov (United States)

    Viney, N.R.; Bormann, H.; Breuer, L.; Bronstert, A.; Croke, B.F.W.; Frede, H.; Graff, T.; Hubrechts, L.; Huisman, J.A.; Jakeman, A.J.; Kite, G.W.; Lanini, J.; Leavesley, G.; Lettenmaier, D.P.; Lindstrom, G.; Seibert, J.; Sivapalan, M.; Willems, P.

    2009-01-01

    This paper reports on a project to compare predictions from a range of catchment models applied to a mesoscale river basin in central Germany and to assess various ensemble predictions of catchment streamflow. The models encompass a large range in inherent complexity and input requirements. In approximate order of decreasing complexity, they are DHSVM, MIKE-SHE, TOPLATS, WASIM-ETH, SWAT, PRMS, SLURP, HBV, LASCAM and IHACRES. The models are calibrated twice using different sets of input data. The two predictions from each model are then combined by simple averaging to produce a single-model ensemble. The 10 resulting single-model ensembles are combined in various ways to produce multi-model ensemble predictions. Both the single-model ensembles and the multi-model ensembles are shown to give predictions that are generally superior to those of their respective constituent models, both during a 7-year calibration period and a 9-year validation period. This occurs despite a considerable disparity in performance of the individual models. Even the weakest of models is shown to contribute useful information to the ensembles they are part of. The best model combination methods are a trimmed mean (constructed using the central four or six predictions each day) and a weighted mean ensemble (with weights calculated from calibration performance) that places relatively large weights on the better performing models. Conditional ensembles, in which separate model weights are used in different system states (e.g. summer and winter, high and low flows) generally yield little improvement over the weighted mean ensemble. However a conditional ensemble that discriminates between rising and receding flows shows moderate improvement. An analysis of ensemble predictions shows that the best ensembles are not necessarily those containing the best individual models. Conversely, it appears that some models that predict well individually do not necessarily combine well with other models in

  4. Rational extended thermodynamics

    CERN Document Server

    Müller, Ingo

    1998-01-01

    Ordinary thermodynamics provides reliable results when the thermodynamic fields are smooth, in the sense that there are no steep gradients and no rapid changes. In fluids and gases this is the domain of the equations of Navier-Stokes and Fourier. Extended thermodynamics becomes relevant for rapidly varying and strongly inhomogeneous processes. Thus the propagation of high­ frequency waves, and the shape of shock waves, and the regression of small-scale fluctuation are governed by extended thermodynamics. The field equations of ordinary thermodynamics are parabolic while extended thermodynamics is governed by hyperbolic systems. The main ingredients of extended thermodynamics are • field equations of balance type, • constitutive quantities depending on the present local state and • entropy as a concave function of the state variables. This set of assumptions leads to first order quasi-linear symmetric hyperbolic systems of field equations; it guarantees the well-posedness of initial value problems and f...

  5. Ensemble Kalman filtering without the intrinsic need for inflation

    Directory of Open Access Journals (Sweden)

    M. Bocquet

    2011-10-01

    Full Text Available The main intrinsic source of error in the ensemble Kalman filter (EnKF is sampling error. External sources of error, such as model error or deviations from Gaussianity, depend on the dynamical properties of the model. Sampling errors can lead to instability of the filter which, as a consequence, often requires inflation and localization. The goal of this article is to derive an ensemble Kalman filter which is less sensitive to sampling errors. A prior probability density function conditional on the forecast ensemble is derived using Bayesian principles. Even though this prior is built upon the assumption that the ensemble is Gaussian-distributed, it is different from the Gaussian probability density function defined by the empirical mean and the empirical error covariance matrix of the ensemble, which is implicitly used in traditional EnKFs. This new prior generates a new class of ensemble Kalman filters, called finite-size ensemble Kalman filter (EnKF-N. One deterministic variant, the finite-size ensemble transform Kalman filter (ETKF-N, is derived. It is tested on the Lorenz '63 and Lorenz '95 models. In this context, ETKF-N is shown to be stable without inflation for ensemble size greater than the model unstable subspace dimension, at the same numerical cost as the ensemble transform Kalman filter (ETKF. One variant of ETKF-N seems to systematically outperform the ETKF with optimally tuned inflation. However it is shown that ETKF-N does not account for all sampling errors, and necessitates localization like any EnKF, whenever the ensemble size is too small. In order to explore the need for inflation in this small ensemble size regime, a local version of the new class of filters is defined (LETKF-N and tested on the Lorenz '95 toy model. Whatever the size of the ensemble, the filter is stable. Its performance without inflation is slightly inferior to that of LETKF with optimally tuned inflation for small interval between updates, and

  6. Decadal climate predictions improved by ocean ensemble dispersion filtering

    Science.gov (United States)

    Kadow, C.; Illing, S.; Kröner, I.; Ulbrich, U.; Cubasch, U.

    2017-06-01

    Decadal predictions by Earth system models aim to capture the state and phase of the climate several years in advance. Atmosphere-ocean interaction plays an important role for such climate forecasts. While short-term weather forecasts represent an initial value problem and long-term climate projections represent a boundary condition problem, the decadal climate prediction falls in-between these two time scales. In recent years, more precise initialization techniques of coupled Earth system models and increased ensemble sizes have improved decadal predictions. However, climate models in general start losing the initialized signal and its predictive skill from one forecast year to the next. Here we show that the climate prediction skill of an Earth system model can be improved by a shift of the ocean state toward the ensemble mean of its individual members at seasonal intervals. We found that this procedure, called ensemble dispersion filter, results in more accurate results than the standard decadal prediction. Global mean and regional temperature, precipitation, and winter cyclone predictions show an increased skill up to 5 years ahead. Furthermore, the novel technique outperforms predictions with larger ensembles and higher resolution. Our results demonstrate how decadal climate predictions benefit from ocean ensemble dispersion filtering toward the ensemble mean.type="synopsis">type="main">Plain Language SummaryDecadal predictions aim to predict the climate several years in advance. Atmosphere-ocean interaction plays an important role for such climate forecasts. The ocean memory due to its heat capacity holds big potential skill. In recent years, more precise initialization techniques of coupled Earth system models (incl. atmosphere and ocean) have improved decadal predictions. Ensembles are another important aspect. Applying slightly perturbed predictions to trigger the famous butterfly effect results in an ensemble. Instead of evaluating one prediction, but the

  7. Symmetric Extended Ockham Algebras

    Institute of Scientific and Technical Information of China (English)

    T.S. Blyth; Jie Fang

    2003-01-01

    The variety eO of extended Ockham algebras consists of those algealgebra with an additional endomorphism k such that the unary operations f and k commute. Here, we consider the cO-algebras which have a property of symmetry. We show that there are thirty two non-isomorphic subdirectly irreducible symmetric extended MS-algebras and give a complete description of them.2000 Mathematics Subject Classification: 06D15, 06D30

  8. Structured RNAs and synteny regions in the pig genome

    DEFF Research Database (Denmark)

    Anthon, Christian; Tafer, Hakim; Havgaard, Jakob H

    2014-01-01

    for Laurasiatheria (pig, cow, dolphin, horse, cat, dog, hedgehog). CONCLUSIONS: We have obtained one of the most comprehensive annotations for structured ncRNAs of a mammalian genome, which is likely to play central roles in both health modelling and production. The core annotation is available in Ensembl 70...

  9. An archaeal genomic signature

    Science.gov (United States)

    Graham, D. E.; Overbeek, R.; Olsen, G. J.; Woese, C. R.

    2000-01-01

    Comparisons of complete genome sequences allow the most objective and comprehensive descriptions possible of a lineage's evolution. This communication uses the completed genomes from four major euryarchaeal taxa to define a genomic signature for the Euryarchaeota and, by extension, the Archaea as a whole. The signature is defined in terms of the set of protein-encoding genes found in at least two diverse members of the euryarchaeal taxa that function uniquely within the Archaea; most signature proteins have no recognizable bacterial or eukaryal homologs. By this definition, 351 clusters of signature proteins have been identified. Functions of most proteins in this signature set are currently unknown. At least 70% of the clusters that contain proteins from all the euryarchaeal genomes also have crenarchaeal homologs. This conservative set, which appears refractory to horizontal gene transfer to the Bacteria or the Eukarya, would seem to reflect the significant innovations that were unique and fundamental to the archaeal "design fabric." Genomic protein signature analysis methods may be extended to characterize the evolution of any phylogenetically defined lineage. The complete set of protein clusters for the archaeal genomic signature is presented as supplementary material (see the PNAS web site, www.pnas.org).

  10. Alpha-synuclein oligomers and fibrils originate in two distinct conformer pools: a small angle X-ray scattering and ensemble optimisation modelling study.

    Science.gov (United States)

    Curtain, Cyril C; Kirby, Nigel M; Mertens, Haydyn D T; Barnham, Kevin J; Knott, Robert B; Masters, Colin L; Cappai, Roberto; Rekas, Agata; Kenche, Vijaya B; Ryan, Timothy

    2015-01-01

    The 140 residue intrinsically disordered protein α-synuclein (α-syn) self-associates to form fibrils that are the major constituent of the Lewy body intracellular protein inclusions, and neurotoxic oligomers. Both of these macromolecular structures are associated with a number of neurodegenerative diseases, including Parkinson's disease and dementia with Lewy bodies. Using ensemble optimisation modelling (EOM) and small angle X-ray scattering (SAXS) on a size-exclusion column equipped beamline, we studied how the distribution of structural conformers in α-syn may be influenced by the presence of the familial early-onset mutations A30P, E45K and A53T, by substituting the four methionine residues with alanines and by reaction with copper (Cu2+) or an anti-fibril organic platinum (Pt) complex. We found that the WT had two major conformer groups, representing ensembles of compact and extended structures. The population of the extended group was increased in the more rapidly fibril-forming E45K and A53T mutants, while the compact group was enlarged in the oligomer-forming A30P mutant. Addition of Cu2+ resulted in the formation of an ensemble of compact conformers, while the anti-fibril agent and alanine substitution substantially reduced the population of extended conformers. Since our observations with the mutants suggest that fibrils may be drawn from the extended conformer ensemble, we propose that the compact and extended ensembles represent the beginning of oligomer and fibril formation pathways respectively, both of which have been reported to lead to a toxic gain of function. Manipulating these pathways and monitoring the results by EOM and SAXS may be useful in the development of anti-Parkinson's disease therapies.

  11. A modified iterative ensemble Kalman filter data assimilation method

    Science.gov (United States)

    Xu, Baoxiong; Bai, Yulong; Wang, Yizhao; Li, Zhe; Ma, Boyang

    2017-08-01

    High nonlinearity is a typical characteristic associated with data assimilation systems. Additionally, iterative ensemble based methods have attracted a large amount of research attention, which has been focused on dealing with nonlinearity problems. To solve the local convergence problem of the iterative ensemble Kalman filter, a modified iterative ensemble Kalman filter algorithm was put forward, which was based on a global convergence strategy from the perspective of a Gauss-Newton iteration. Through self-adaption, the step factor was adjusted to enable every iteration to approach expected values during the process of the data assimilation. A sensitivity experiment was carried out in a low dimensional Lorenz-63 chaotic system, as well as a Lorenz-96 model. The new method was tested via ensemble size, observation variance, and inflation factor changes, along with other aspects. Meanwhile, comparative research was conducted with both a traditional ensemble Kalman filter and an iterative ensemble Kalman filter. The results showed that the modified iterative ensemble Kalman filter algorithm was a data assimilation method that was able to effectively estimate a strongly nonlinear system state.

  12. Progressive freezing of interacting spins in isolated finite magnetic ensembles

    Science.gov (United States)

    Bhattacharya, Kakoli; Dupuis, Veronique; Le-Roy, Damien; Deb, Pritam

    2017-02-01

    Self-organization of magnetic nanoparticles into secondary nanostructures provides an innovative way for designing functional nanomaterials with novel properties, different from the constituent primary nanoparticles as well as their bulk counterparts. Collective magnetic properties of such complex closed packing of magnetic nanoparticles makes them more appealing than the individual magnetic nanoparticles in many technological applications. This work reports the collective magnetic behaviour of magnetic ensembles comprising of single domain Fe3O4 nanoparticles. The present work reveals that the ensemble formation is based on the re-orientation and attachment of the nanoparticles in an iso-oriented fashion at the mesoscale regime. Comprehensive dc magnetic measurements show the prevalence of strong interparticle interactions in the ensembles. Due to the close range organization of primary Fe3O4 nanoparticles in the ensemble, the spins of the individual nanoparticles interact through dipolar interactions as realized from remnant magnetization measurements. Signature of super spin glass like behaviour in the ensembles is observed in the memory studies carried out in field cooled conditions. Progressive freezing of spins in the ensembles is corroborated from the Vogel-Fulcher fit of the susceptibility data. Dynamic scaling of relaxation reasserted slow spin dynamics substantiating cluster spin glass like behaviour in the ensembles.

  13. Cancer genomics

    DEFF Research Database (Denmark)

    Norrild, Bodil; Guldberg, Per; Ralfkiær, Elisabeth Methner

    2007-01-01

    Almost all cells in the human body contain a complete copy of the genome with an estimated number of 25,000 genes. The sequences of these genes make up about three percent of the genome and comprise the inherited set of genetic information. The genome also contains information that determines whe...

  14. Annotation-Based Whole Genomic Prediction and Selection

    DEFF Research Database (Denmark)

    Kadarmideen, Haja; Do, Duy Ngoc; Janss, Luc

    Cπ method and applied to 1,272 Duroc pigs with both genotypic and phenotypic records including residual (RFI) and daily feed intake (DFI), average daily gain (ADG) and back fat (BF)). Records were split into a training (968 pigs) and a validation dataset (304 pigs). SNPs were annotated by 14 different...... groups. Genomic prediction has accuracy comparable to an own phenotype and use of genomic prediction can be cost effective by replacing feed intake measurement. Use of genomic annotation of SNPs and QTL information had no largely significant impact on predictive accuracy for the current traits but may...... in their contribution to estimated genomic variances and in prediction of genomic breeding values by applying SNP annotation approaches to feed efficiency. Ensembl Variant Predictor (EVP) and Pig QTL database were used as the source of genomic annotation for 60K chip. Genomic prediction was performed using the Bayes...

  15. Soil texture reclassification by an ensemble model

    Science.gov (United States)

    Cisty, Milan; Hlavcova, Kamila

    2015-04-01

    a prerequisite for solving some subsequent task, this bias is propagated to the subsequent modelling or other work. Therefore, for the sake of achieving more general and precise outputs while solving such tasks, the authors of the present paper are proposing a hybrid approach, which has the potential for obtaining improved results. Although the authors continue recommending the use of the mentioned parametric PSD models in the proposed methodology, the final prediction is made by an ensemble machine learning algorithm based on regression trees, the so-called Random Forest algorithm, which is built on top of the outputs of such models, which serves as an ensemble members. An improvement in precision was proved, and it is documented in the paper that the ensemble model worked better than any of its constituents. References Nemes, A., Wosten, J.H.M., Lilly, A., Voshaar, J.H.O.: Evaluation of different procedures to interpolate particle-size distributions to achieve compatibility within soil databases. Geoderma 90, 187- 202 (1999) Hwang, S.: Effect of texture on the performance of soil particle-size distribution models. Geoderma 123, 363-371 (2004) Botula, Y.D., Cornelis, W.M., Baert, G., Mafuka, P., Van Ranst, E.: Particle size distribution models for soils of the humid tropics. J Soils Sediments. 13, 686-698 (2013)

  16. Accounting for model error due to unresolved scales within ensemble Kalman filtering

    CERN Document Server

    Mitchell, Lewis

    2014-01-01

    We propose a method to account for model error due to unresolved scales in the context of the ensemble transform Kalman filter (ETKF). The approach extends to this class of algorithms the deterministic model error formulation recently explored for variational schemes and extended Kalman filter. The model error statistic required in the analysis update is estimated using historical reanalysis increments and a suitable model error evolution law. Two different versions of the method are described; a time-constant model error treatment where the same model error statistical description is time-invariant, and a time-varying treatment where the assumed model error statistics is randomly sampled at each analysis step. We compare both methods with the standard method of dealing with model error through inflation and localization, and illustrate our results with numerical simulations on a low order nonlinear system exhibiting chaotic dynamics. The results show that the filter skill is significantly improved through th...

  17. Deterministic Methods for Filtering, part I: Mean-field Ensemble Kalman Filtering

    CERN Document Server

    Law, Kody J H; Tempone, Raul

    2014-01-01

    This paper provides a proof of convergence of the standard EnKF generalized to non-Gaussian state space models, based on the indistinguishability property of the joint distribution on the ensemble. A density-based deterministic approximation of the mean-field EnKF (MFEnKF) is proposed, consisting of a PDE solver and a quadrature rule. Given a certain minimal order of convergence k between the two, this extends to the deterministic filter approximation, which is therefore asymptotically superior to standard EnKF for d<2k. The fidelity of approximation of the true distribution is also established using an extension of total variation metric to random measures. This is limited by a Gaussian bias term arising from non-linearity/non-Gaussianity of the model, which arises in both deterministic and standard EnKF. Numerical results support and extend the theory.

  18. Early warning of drought in Europe using the monthly ensemble system from ECMWF

    Directory of Open Access Journals (Sweden)

    C. Lavaysse

    2015-02-01

    Full Text Available Timely forecasts of the onset or possible evolution of droughts are an important contribution to mitigate their manifold negative effects. In this paper we therefore analyse and compare the performance of the first month of the probabilistic extended range forecast and of the seasonal forecast from ECMWF in predicting droughts over the European continent. The Standardized Precipitation Index (SPI is used to quantify the onset and severity of droughts. It can be shown that on average the extended range forecast has greater skill than the seasonal forecast whilst both outperform climatology. No significant spatial or temporal patterns can be observed but the scores are improved when focussing on large-scale droughts. In a second step we then analyse several different methods to convert the probabilistic forecasts of SPI into a Boolean drought warning. It can be demonstrated that methodologies which convert low percentiles of the forecasted SPI cumulative distribution function into warnings are superior in comparison with alternatives such as the mean or the median of the ensemble. The paper demonstrates that up to 40% of droughts are correctly forecasted one month in advance. Nevertheless, during false alarms or misses, we did not find significant differences in the distribution of the ensemble members that would allow for a quantitative assessment of the uncertainty.

  19. Ensemble Deep Learning for Biomedical Time Series Classification

    Directory of Open Access Journals (Sweden)

    Lin-peng Jin

    2016-01-01

    Full Text Available Ensemble learning has been proved to improve the generalization ability effectively in both theory and practice. In this paper, we briefly outline the current status of research on it first. Then, a new deep neural network-based ensemble method that integrates filtering views, local views, distorted views, explicit training, implicit training, subview prediction, and Simple Average is proposed for biomedical time series classification. Finally, we validate its effectiveness on the Chinese Cardiovascular Disease Database containing a large number of electrocardiogram recordings. The experimental results show that the proposed method has certain advantages compared to some well-known ensemble methods, such as Bagging and AdaBoost.

  20. Ensemble Deep Learning for Biomedical Time Series Classification.

    Science.gov (United States)

    Jin, Lin-Peng; Dong, Jun

    2016-01-01

    Ensemble learning has been proved to improve the generalization ability effectively in both theory and practice. In this paper, we briefly outline the current status of research on it first. Then, a new deep neural network-based ensemble method that integrates filtering views, local views, distorted views, explicit training, implicit training, subview prediction, and Simple Average is proposed for biomedical time series classification. Finally, we validate its effectiveness on the Chinese Cardiovascular Disease Database containing a large number of electrocardiogram recordings. The experimental results show that the proposed method has certain advantages compared to some well-known ensemble methods, such as Bagging and AdaBoost.

  1. Ensemble Deep Learning for Biomedical Time Series Classification

    Science.gov (United States)

    2016-01-01

    Ensemble learning has been proved to improve the generalization ability effectively in both theory and practice. In this paper, we briefly outline the current status of research on it first. Then, a new deep neural network-based ensemble method that integrates filtering views, local views, distorted views, explicit training, implicit training, subview prediction, and Simple Average is proposed for biomedical time series classification. Finally, we validate its effectiveness on the Chinese Cardiovascular Disease Database containing a large number of electrocardiogram recordings. The experimental results show that the proposed method has certain advantages compared to some well-known ensemble methods, such as Bagging and AdaBoost.

  2. An Improved Particle Swarm Optimization Algorithm Based on Ensemble Technique

    Institute of Scientific and Technical Information of China (English)

    SHI Yan; HUANG Cong-ming

    2006-01-01

    An improved particle swarm optimization (PSO) algorithm based on ensemble technique is presented. The algorithm combines some previous best positions (pbest) of the particles to get an ensemble position (Epbest), which is used to replace the global best position (gbest). It is compared with the standard PSO algorithm invented by Kennedy and Eberhart and some improved PSO algorithms based on three different benchmark functions. The simulation results show that the improved PSO based on ensemble technique can get better solutions than the standard PSO and some other improved algorithms under all test cases.

  3. Extended spider cognition.

    Science.gov (United States)

    Japyassú, Hilton F; Laland, Kevin N

    2017-05-01

    There is a tension between the conception of cognition as a central nervous system (CNS) process and a view of cognition as extending towards the body or the contiguous environment. The centralised conception requires large or complex nervous systems to cope with complex environments. Conversely, the extended conception involves the outsourcing of information processing to the body or environment, thus making fewer demands on the processing power of the CNS. The evolution of extended cognition should be particularly favoured among small, generalist predators such as spiders, and here, we review the literature to evaluate the fit of empirical data with these contrasting models of cognition. Spiders do not seem to be cognitively limited, displaying a large diversity of learning processes, from habituation to contextual learning, including a sense of numerosity. To tease apart the central from the extended cognition, we apply the mutual manipulability criterion, testing the existence of reciprocal causal links between the putative elements of the system. We conclude that the web threads and configurations are integral parts of the cognitive systems. The extension of cognition to the web helps to explain some puzzling features of spider behaviour and seems to promote evolvability within the group, enhancing innovation through cognitive connectivity to variable habitat features. Graded changes in relative brain size could also be explained by outsourcing information processing to environmental features. More generally, niche-constructed structures emerge as prime candidates for extending animal cognition, generating the selective pressures that help to shape the evolving cognitive system.

  4. An extended day program

    Directory of Open Access Journals (Sweden)

    Ševkušić-Mandić Slavica G.

    2002-01-01

    Full Text Available The paper presents the results of a pilot project evaluation, carried out as an action investigation whose aim was to provide a better quality extended day for primary school students. The project included the training of teachers involved in extended day program, designing of special activities performed by teachers with children once a week as well as changes and equipping of premises where children stay. The aims of the program were conception and performance of activities in a less formal way than during regular instructional days, linking of learning at school and acquired knowledge to everyday experiences, and work on contents contributing to the development of child's interests and creativity. The program was accomplished in a Belgrade primary school during the 2001/2002 academic year, comprising students of 1st and 2nd grades (N=77. The effects of the program were monitored throughout the academic year (observation and teachers' reports on accomplished workshops and at the end of the academic year (teachers and students' opinions of the program, academic achievement and creativity of students attending the extended day program compared with students not attending it. Findings about positive effects of the program on students' broadening of interests and willingness to express themselves creatively, indicate unequivocally that there is a need for developing special extended day programs. The extended day program is an opportunity for school to exert greater educational influence that has yet to be tapped.

  5. Large margin classifier-based ensemble tracking

    Science.gov (United States)

    Wang, Yuru; Liu, Qiaoyuan; Yin, Minghao; Wang, ShengSheng

    2016-07-01

    In recent years, many studies consider visual tracking as a two-class classification problem. The key problem is to construct a classifier with sufficient accuracy in distinguishing the target from its background and sufficient generalize ability in handling new frames. However, the variable tracking conditions challenges the existing methods. The difficulty mainly comes from the confused boundary between the foreground and background. This paper handles this difficulty by generalizing the classifier's learning step. By introducing the distribution data of samples, the classifier learns more essential characteristics in discriminating the two classes. Specifically, the samples are represented in a multiscale visual model. For features with different scales, several large margin distribution machine (LDMs) with adaptive kernels are combined in a Baysian way as a strong classifier. Where, in order to improve the accuracy and generalization ability, not only the margin distance but also the sample distribution is optimized in the learning step. Comprehensive experiments are performed on several challenging video sequences, through parameter analysis and field comparison, the proposed LDM combined ensemble tracker is demonstrated to perform with sufficient accuracy and generalize ability in handling various typical tracking difficulties.

  6. Variety of synchronous regimes in neuronal ensembles

    Science.gov (United States)

    Komarov, M. A.; Osipov, G. V.; Suykens, J. A. K.

    2008-09-01

    We consider a Hodgkin-Huxley-type model of oscillatory activity in neurons of the snail Helix pomatia. This model has a distinctive feature: It demonstrates multistability in oscillatory and silent modes that is typical for the thalamocortical neurons. A single neuron cell can demonstrate a variety of oscillatory activity: Regular and chaotic spiking and bursting behavior. We study collective phenomena in small and large arrays of nonidentical cells coupled by models of electrical and chemical synapses. Two single elements coupled by electrical coupling show different types of synchronous behavior, in particular in-phase and antiphase synchronous regimes. In an ensemble of three inhibitory synaptically coupled elements, the phenomenon of sequential synchronous dynamics is observed. We study the synchronization phenomena in the chain of nonidentical neurons at different oscillatory behavior coupled with electrical and chemical synapses. Various regimes of phase synchronization are observed: (i) Synchronous regular and chaotic spiking; (ii) synchronous regular and chaotic bursting; and (iii) synchronous regular and chaotic bursting with different numbers of spikes inside the bursts. We detect and study the effect of collective synchronous burst generation due to the cluster formation and the oscillatory death.

  7. General approaches in ensemble quantum computing

    Indian Academy of Sciences (India)

    V Vimalan; N Chandrakumar

    2008-01-01

    We have developed methodology for NMR quantum computing focusing on enhancing the efficiency of initialization, of logic gate implementation and of readout. Our general strategy involves the application of rotating frame pulse sequences to prepare pseudopure states and to perform logic operations. We demonstrate experimentally our methodology for both homonuclear and heteronuclear spin ensembles. On model two-spin systems, the initialization time of one of our sequences is three-fourths (in the heteronuclear case) or one-fourth (in the homonuclear case), of the typical pulsed free precession sequences, attaining the same initialization efficiency. We have implemented the logical SWAP operation in homonuclear AMX spin systems using selective isotropic mixing, reducing the duration taken to a third compared to the standard re-focused INEPT-type sequence. We introduce the 1D version for readout of the rotating frame SWAP operation, in an attempt to reduce readout time. We further demonstrate the Hadamard mode of 1D SWAP, which offers 2N-fold reduction in experiment time for a system with -working bits, attaining the same sensitivity as the standard 1D version.

  8. Ensemble LUT classification for degraded document enhancement

    Science.gov (United States)

    Obafemi-Ajayi, Tayo; Agam, Gady; Frieder, Ophir

    2008-01-01

    The fast evolution of scanning and computing technologies have led to the creation of large collections of scanned paper documents. Examples of such collections include historical collections, legal depositories, medical archives, and business archives. Moreover, in many situations such as legal litigation and security investigations scanned collections are being used to facilitate systematic exploration of the data. It is almost always the case that scanned documents suffer from some form of degradation. Large degradations make documents hard to read and substantially deteriorate the performance of automated document processing systems. Enhancement of degraded document images is normally performed assuming global degradation models. When the degradation is large, global degradation models do not perform well. In contrast, we propose to estimate local degradation models and use them in enhancing degraded document images. Using a semi-automated enhancement system we have labeled a subset of the Frieder diaries collection.1 This labeled subset was then used to train an ensemble classifier. The component classifiers are based on lookup tables (LUT) in conjunction with the approximated nearest neighbor algorithm. The resulting algorithm is highly effcient. Experimental evaluation results are provided using the Frieder diaries collection.1

  9. Group Theory for Embedded Random Matrix Ensembles

    CERN Document Server

    Kota, V K B

    2014-01-01

    Embedded random matrix ensembles are generic models for describing statistical properties of finite isolated quantum many-particle systems. For the simplest spinless fermion (or boson) systems with say $m$ fermions (or bosons) in $N$ single particle states and interacting with say $k$-body interactions, we have EGUE($k$) [embedded GUE of $k$-body interactions) with GUE embedding and the embedding algebra is $U(N)$. In this paper, using EGUE($k$) representation for a Hamiltonian that is $k$-body and an independent EGUE($t$) representation for a transition operator that is $t$-body and employing the embedding $U(N)$ algebra, finite-$N$ formulas for moments up to order four are derived, for the first time, for the transition strength densities (transition strengths multiplied by the density of states at the initial and final energies). In the asymptotic limit, these formulas reduce to those derived for the EGOE version and establish that in general bivariate transition strength densities take bivariate Gaussian ...

  10. Emergent order in ensembles of active spinners

    Science.gov (United States)

    van Zuiden, Benjamin C.; Paulose, Jayson; Irvine, William T. M.; Bartolo, Denis; Vitelli, Vincenzo

    Interacting self-propelled particles is proxy to model many living systems from cytoskeletal motors to bird flocks, while also providing a framework to investigate fundamental questions in non equilibrium statistical mechanics. A surge of recent studies have shown that self-propulsion significantly modifies the phase behavior of particles interacting via potential interactions. A prototypical example is the so-called Motility Induced Phase Separation occurring in ensembles of self-propelled hard spheres. In stark contrast, our understanding of active spinning, as opposed to self-propulsion, remains very scarce. Here, we study a system of self-spinning dimers, interacting via soft repulsive forces. Upon varying the density and activity, we observe a range of emergent phases characterized by different degrees of spatiotemporal order in the position and orientation of the dimers. Changes in bulk properties, including crystallization, melting, and freezing, are reflected in the collective motion of the particles. We rationalize our numerical findings theoretically and demonstrate some of these concepts in a active granular experiment.

  11. Ensemble Kalman filtering with residual nudging

    CERN Document Server

    Luo, Xiaodong; 10.3402/tellusa.v64i0.17130

    2012-01-01

    Covariance inflation and localization are two important techniques that are used to improve the performance of the ensemble Kalman filter (EnKF) by (in effect) adjusting the sample covariances of the estimates in the state space. In this work an additional auxiliary technique, called residual nudging, is proposed to monitor and, if necessary, adjust the residual norms of state estimates in the observation space. In an EnKF with residual nudging, if the residual norm of an analysis is larger than a pre-specified value, then the analysis is replaced by a new one whose residual norm is no larger than a pre-specified value. Otherwise the analysis is considered as a reasonable estimate and no change is made. A rule for choosing the pre-specified value is suggested. Based on this rule, the corresponding new state estimates are explicitly derived in case of linear observations. Numerical experiments in the 40-dimensional Lorenz 96 model show that introducing residual nudging to an EnKF may improve its accuracy and/o...

  12. Orchestrating Distributed Resource Ensembles for Petascale Science

    Energy Technology Data Exchange (ETDEWEB)

    Baldin, Ilya; Mandal, Anirban; Ruth, Paul; Yufeng, Xin

    2014-04-24

    Distributed, data-intensive computational science applications of interest to DOE scientific com- munities move large amounts of data for experiment data management, distributed analysis steps, remote visualization, and accessing scientific instruments. These applications need to orchestrate ensembles of resources from multiple resource pools and interconnect them with high-capacity multi- layered networks across multiple domains. It is highly desirable that mechanisms are designed that provide this type of resource provisioning capability to a broad class of applications. It is also important to have coherent monitoring capabilities for such complex distributed environments. In this project, we addressed these problems by designing an abstract API, enabled by novel semantic resource descriptions, for provisioning complex and heterogeneous resources from multiple providers using their native provisioning mechanisms and control planes: computational, storage, and multi-layered high-speed network domains. We used an extensible resource representation based on semantic web technologies to afford maximum flexibility to applications in specifying their needs. We evaluated the effectiveness of provisioning using representative data-intensive ap- plications. We also developed mechanisms for providing feedback about resource performance to the application, to enable closed-loop feedback control and dynamic adjustments to resource allo- cations (elasticity). This was enabled through development of a novel persistent query framework that consumes disparate sources of monitoring data, including perfSONAR, and provides scalable distribution of asynchronous notifications.

  13. Quantum metrology with cold atomic ensembles

    Directory of Open Access Journals (Sweden)

    Mitchell Morgan W.

    2013-08-01

    Full Text Available Quantum metrology uses quantum features such as entanglement and squeezing to improve the sensitivity of quantum-limited measurements. Long established as a valuable technique in optical measurements such as gravitational-wave detection, quantum metrology is increasingly being applied to atomic instruments such as matter-wave interferometers, atomic clocks, and atomic magnetometers. Several of these new applications involve dual optical/atomic quantum systems, presenting both new challenges and new opportunities. Here we describe an optical magnetometry system that achieves both shot-noise-limited and projection-noise-limited performance, allowing study of optical magnetometry in a fully-quantum regime [1]. By near-resonant Faraday rotation probing, we demonstrate measurement-based spin squeezing in a magnetically-sensitive atomic ensemble [2-4]. The versatility of this system allows us also to design metrologically-relevant optical nonlinearities, and to perform quantum-noise-limited measurements with interacting photons. As a first interaction-based measurement [5], we implement a non-linear metrology scheme proposed by Boixo et al. with the surprising feature of precision scaling better than the 1/N “Heisenberg limit” [6].

  14. Evaluation of seasonal ensemble forecasts in Norway

    Science.gov (United States)

    Tore Sinnes, Svein; Engeland, Kolbjørn; Langsholt, Elin; Roar Sælthun, Nils

    2017-04-01

    Throughout the winter and spring season, seasonal forecasts are used by the Norwegian Water Resources and Energy Directorate (NVE) in order to assess the probability for sever floods or for low seasonal runoff volumes. The latter is especially important for hydropower production. The seasonal forecasts are generated by a set of 145 lumped, elevation distributed HBV models distributed all over Norway. The observed weather is used to establish the initial snow cover, soil moisture and groundwater levels in the HBV model. Subsequently, scenarios are created by using time series of observed weather the previous 50 years, creating a total of 50 ensembles. The predictability of this seasonal forecasting system depends therefore on the importance of the initial conditions, and in Norway the seasonal snow cover is especially important. The aim of this study is to evaluate the performance of the seasonal forecasts of flood peaks and seasonal runoff volumes and especially to evaluate of the predictability depends on (i) catchment climatology and (ii) issue dates and lead times. For achieving these aims, evaluation criterions assessing reliability and sharpness were used. The results shows that the predictability is the highest for catchments where the spring runoff is dominated by snow melt. The predictability is the highest for the shortest lead times (up to 1 months ahead).The predictive performance is higher for runoff volumes than for the flood peaks.

  15. A past discharge assimilation system for ensemble streamflow forecasts over France – Part 2: Impact on the ensemble streamflow forecasts

    Directory of Open Access Journals (Sweden)

    G. Thirel

    2010-08-01

    Full Text Available The use of ensemble streamflow forecasts is developing in the international flood forecasting services. Ensemble streamflow forecast systems can provide more accurate forecasts and useful information about the uncertainty of the forecasts, thus improving the assessment of risks. Nevertheless, these systems, like all hydrological forecasts, suffer from errors on initialization or on meteorological data, which lead to hydrological prediction errors. This article, which is the second part of a 2-part article, concerns the impacts of initial states, improved by a streamflow assimilation system, on an ensemble streamflow prediction system over France. An assimilation system was implemented to improve the streamflow analysis of the SAFRAN-ISBA-MODCOU (SIM hydro-meteorological suite, which initializes the ensemble streamflow forecasts at Météo-France. This assimilation system, using the Best Linear Unbiased Estimator (BLUE and modifying the initial soil moisture states, showed an improvement of the streamflow analysis with low soil moisture increments. The final states of this suite were used to initialize the ensemble streamflow forecasts of Météo-France, which are based on the SIM model and use the European Centre for Medium-range Weather Forecasts (ECMWF 10-day Ensemble Prediction System (EPS. Two different configurations of the assimilation system were used in this study: the first with the classical SIM model and the second using improved soil physics in ISBA. The effects of the assimilation system on the ensemble streamflow forecasts were assessed for these two configurations, and a comparison was made with the original (i.e. without data assimilation and without the improved physics ensemble streamflow forecasts. It is shown that the assimilation system improved most of the statistical scores usually computed for the validation of ensemble predictions (RMSE, Brier Skill Score and its decomposition, Ranked Probability Skill Score, False Alarm

  16. Extending Critical Performativity

    DEFF Research Database (Denmark)

    Spicer, André; Alvesson, Mats; Kärreman, Dan

    2016-01-01

    In this article we extend the debate about critical performativity. We begin by outlining the basic tenets of critical performativity and how this has been applied in the study of management and organization. We then address recent critiques of critical performance. We note these arguments suffer...... from an undue focus on intra-academic debates; engage in author-itarian theoretical policing; feign relevance through symbolic radicalism; and repackage common sense. We take these critiques as an opportunity to offer an extended model of critical performativity that involves focusing on issues...

  17. Developments in non-linear Kalman Ensemble and Particle Filtering techniques for hydrological data assimilation

    Science.gov (United States)

    Khaki, Mehdi; Forootan, Ehsan; Kuhn, Michael; Awange, Joseph; Pattiaratchi, Charitha

    2016-04-01

    long range spurious correlations and increase the effective ensemble size. To evaluate and compare the mentioned data assimilation methods and study their performance, numerical experiments are generated using assimilation of Gravity Recovery and Climate Experiment (GRACE) observations into AWRA (Australian Water Resource Assessment) model. Fundamental issues regarding the assimilation of water storage observations into the spatially extended chaotic system are addressed.

  18. Ensemble-Type Kalman Filter Algorithm conserving mass, total energy and enstrophy

    Science.gov (United States)

    Zeng, Yuefei; Janjic, Tijana; Ruckstuhl, Yvonne; Verlaan, Martin

    2017-04-01

    In a recent study (Zeng and Janjic 2016), we explored the effect on conservation properties of data assimilation using perfect model experiments with a 2D shallow water model preserving important properties of the true nonlinear flow. It was found that during the assimilation with the ensemble Kalman filter algorithm, the total energy of the analysis ensemble mean converges towards the nature run value with time. However, the enstrophy, divergence and energy spectra were strongly affected by the data assimilation settings. We tested the effects on the prediction depending on the type of error in the initial condition and showed that the accumulated noise during assimilation and the error of analysis are good indicators of the quality of the prediction. Having in mind that the conservation of both the kinetic energy and enstrophy by momentum advection schemes in the case of non-divergent flow prevents a systematic and unrealistic energy cascade towards the high wave numbers, we constructed the ensemble data assimilation algorithm that conserves both energy and enstrophy. This is done by extending QPEns (Janjic et al. 2014) to allow for nonlinear constraints using, instead of quadratic programming, the sequential quadratic programming algorithm. Experiments with the 2D shallow water model show similar RMSEs of the algorithm without constraints and the algorithm with only the total energy constrained. The algorithm which constraints enstrophy as well as energy and enstrophy during data assimilation showed smaller RMSE to the one without the constraint on enstrophy. Similar behavior can be seen in the energy spectrum where algorithms which include the constraint on enstrophy are closer to the true spectrum, in particular for wavelengths between 200 km and 1000 km. The enstrophy constraint resulted in a reduction of noise during data assimilation. Finally, the algorithm, with both energy and enstrophy constraint showed the smallest error growth during the two weeks

  19. Pathway analysis in attention deficit hyperactivity disorder: An ensemble approach.

    Science.gov (United States)

    Mooney, Michael A; McWeeney, Shannon K; Faraone, Stephen V; Hinney, Anke; Hebebrand, Johannes; Nigg, Joel T; Wilmot, Beth

    2016-09-01

    Despite a wealth of evidence for the role of genetics in attention deficit hyperactivity disorder (ADHD), specific and definitive genetic mechanisms have not been identified. Pathway analyses, a subset of gene-set analyses, extend the knowledge gained from genome-wide association studies (GWAS) by providing functional context for genetic associations. However, there are numerous methods for association testing of gene sets and no real consensus regarding the best approach. The present study applied six pathway analysis methods to identify pathways associated with ADHD in two GWAS datasets from the Psychiatric Genomics Consortium. Methods that utilize genotypes to model pathway-level effects identified more replicable pathway associations than methods using summary statistics. In addition, pathways implicated by more than one method were significantly more likely to replicate. A number of brain-relevant pathways, such as RhoA signaling, glycosaminoglycan biosynthesis, fibroblast growth factor receptor activity, and pathways containing potassium channel genes, were nominally significant by multiple methods in both datasets. These results support previous hypotheses about the role of regulation of neurotransmitter release, neurite outgrowth and axon guidance in contributing to the ADHD phenotype and suggest the value of cross-method convergence in evaluating pathway analysis results. © 2016 Wiley Periodicals, Inc.

  20. Infinite ensemble of support vector machines for prediction of ...

    African Journals Online (AJOL)

    user

    Many researchers have demonstrated the use of artificial neural networks (ANNs) to ..... Following section discusses the effect of infinite ensemble approach ..... major problem with artificial intelligence-based modeling approaches is their ...

  1. An educational model for ensemble streamflow simulation and uncertainty analysis

    Directory of Open Access Journals (Sweden)

    A. AghaKouchak

    2012-06-01

    Full Text Available This paper presents a hands-on modeling toolbox, HBV-Ensemble, designed as a complement to theoretical hydrology lectures, to teach hydrological processes and their uncertainties. The HBV-Ensemble can be used for in-class lab practices and homework assignments, and assessment of students' understanding of hydrological processes. Using this model, students can gain more insights into how hydrological processes (e.g., precipitation, snowmelt and snow accumulation, soil moisture, evapotranspiration and runoff generation are interconnected. The model includes a MATLAB Graphical User Interface (GUI and an ensemble simulation scheme that can be used for not only hydrological processes, but also for teaching uncertainty analysis, parameter estimation, ensemble simulation and model sensitivity.

  2. Relation between native ensembles and experimental structures of proteins

    DEFF Research Database (Denmark)

    Best, R. B.; Lindorff-Larsen, Kresten; DePristo, M. A.

    2006-01-01

    Data Bank ensembles; moreover, we show that the effects of uncertainties in structure determination are insufficient to explain the results. These results highlight the importance of accounting for native-state protein dynamics in making comparisons with ensemble-averaged experimental data and suggest......Different experimental structures of the same protein or of proteins with high sequence similarity contain many small variations. Here we construct ensembles of "high-sequence similarity Protein Data Bank" (HSP) structures and consider the extent to which such ensembles represent the structural...... heterogeneity of the native state in solution. We find that different NMR measurements probing structure and dynamics of given proteins in solution, including order parameters, scalar couplings, and residual dipolar couplings, are remarkably well reproduced by their respective high-sequence similarity Protein...

  3. Spectroscopic properties of inhomogeneously broadened spin ensembles in a cavity

    DEFF Research Database (Denmark)

    Kurucz, Zoltan; Wesenberg, Janus; Mølmer, Klaus

    2011-01-01

    The enhanced collective coupling to weak quantum fields may turn atomic or spin ensembles into an important component in quantum information processing architectures. Inhomogeneous broadening can, however, significantly reduce the coupling and the lifetime of the collective excitation...

  4. Trace formula for an ensemble of bumpy billiards

    CERN Document Server

    Pavloff, N

    1995-01-01

    We study the semiclassical quantization of an ensemble of billiards with a small random shape deformation. We derive a trace formula averaged over shape disorder. The results are illustrated by the study of supershells in rough metal clusters.

  5. Time and ensemble averaging in time series analysis

    CERN Document Server

    Latka, Miroslaw; Jernajczyk, Wojciech; West, Bruce J

    2010-01-01

    In many applications expectation values are calculated by partitioning a single experimental time series into an ensemble of data segments of equal length. Such single trajectory ensemble (STE) is a counterpart to a multiple trajectory ensemble (MTE) used whenever independent measurements or realizations of a stochastic process are available. The equivalence of STE and MTE for stationary systems was postulated by Wang and Uhlenbeck in their classic paper on Brownian motion (Rev. Mod. Phys. 17, 323 (1945)) but surprisingly has not yet been proved. Using the stationary and ergodic paradigm of statistical physics -- the Ornstein-Uhlenbeck (OU) Langevin equation, we revisit Wang and Uhlenbeck's postulate. In particular, we find that the variance of the solution of this equation is different for these two ensembles. While the variance calculated using the MTE quantifies the spreading of independent trajectories originating from the same initial point, the variance for STE measures the spreading of two correlated r...

  6. Phase-selective entrainment of nonlinear oscillator ensembles

    Science.gov (United States)

    Zlotnik, Anatoly; Nagao, Raphael; Kiss, István Z.; Li-Shin, Jr.

    2016-03-01

    The ability to organize and finely manipulate the hierarchy and timing of dynamic processes is important for understanding and influencing brain functions, sleep and metabolic cycles, and many other natural phenomena. However, establishing spatiotemporal structures in biological oscillator ensembles is a challenging task that requires controlling large collections of complex nonlinear dynamical units. In this report, we present a method to design entrainment signals that create stable phase patterns in ensembles of heterogeneous nonlinear oscillators without using state feedback information. We demonstrate the approach using experiments with electrochemical reactions on multielectrode arrays, in which we selectively assign ensemble subgroups into spatiotemporal patterns with multiple phase clusters. The experimentally confirmed mechanism elucidates the connection between the phases and natural frequencies of a collection of dynamical elements, the spatial and temporal information that is encoded within this ensemble, and how external signals can be used to retrieve this information.

  7. Ensemble vs. time averages in financial time series analysis

    Science.gov (United States)

    Seemann, Lars; Hua, Jia-Chen; McCauley, Joseph L.; Gunaratne, Gemunu H.

    2012-12-01

    Empirical analysis of financial time series suggests that the underlying stochastic dynamics are not only non-stationary, but also exhibit non-stationary increments. However, financial time series are commonly analyzed using the sliding interval technique that assumes stationary increments. We propose an alternative approach that is based on an ensemble over trading days. To determine the effects of time averaging techniques on analysis outcomes, we create an intraday activity model that exhibits periodic variable diffusion dynamics and we assess the model data using both ensemble and time averaging techniques. We find that ensemble averaging techniques detect the underlying dynamics correctly, whereas sliding intervals approaches fail. As many traded assets exhibit characteristic intraday volatility patterns, our work implies that ensemble averages approaches will yield new insight into the study of financial markets’ dynamics.

  8. Quantum Ensemble Classification: A Sampling-Based Learning Control Approach.

    Science.gov (United States)

    Chen, Chunlin; Dong, Daoyi; Qi, Bo; Petersen, Ian R; Rabitz, Herschel

    2017-06-01

    Quantum ensemble classification (QEC) has significant applications in discrimination of atoms (or molecules), separation of isotopes, and quantum information extraction. However, quantum mechanics forbids deterministic discrimination among nonorthogonal states. The classification of inhomogeneous quantum ensembles is very challenging, since there exist variations in the parameters characterizing the members within different classes. In this paper, we recast QEC as a supervised quantum learning problem. A systematic classification methodology is presented by using a sampling-based learning control (SLC) approach for quantum discrimination. The classification task is accomplished via simultaneously steering members belonging to different classes to their corresponding target states (e.g., mutually orthogonal states). First, a new discrimination method is proposed for two similar quantum systems. Then, an SLC method is presented for QEC. Numerical results demonstrate the effectiveness of the proposed approach for the binary classification of two-level quantum ensembles and the multiclass classification of multilevel quantum ensembles.

  9. Parameterization of extended systems

    DEFF Research Database (Denmark)

    Niemann, Hans Henrik

    2006-01-01

    The YJBK parameterization (of all stabilizing controllers) is extended to handle systems with additional sensors and/or actuators. It is shown that the closed loop transfer function is still an affine function in the YJBK parameters in the nominal case. Further, some closed-loop stability results...

  10. An automated approach to network features of protein structure ensembles.

    Science.gov (United States)

    Bhattacharyya, Moitrayee; Bhat, Chanda R; Vishveshwara, Saraswathi

    2013-10-01

    Network theory applied to protein structures provides insights into numerous problems of biological relevance. The explosion in structural data available from PDB and simulations establishes a need to introduce a standalone-efficient program that assembles network concepts/parameters under one hood in an automated manner. Herein, we discuss the development/application of an exhaustive, user-friendly, standalone program package named PSN-Ensemble, which can handle structural ensembles generated through molecular dynamics (MD) simulation/NMR studies or from multiple X-ray structures. The novelty in network construction lies in the explicit consideration of side-chain interactions among amino acids. The program evaluates network parameters dealing with topological organization and long-range allosteric communication. The introduction of a flexible weighing scheme in terms of residue pairwise cross-correlation/interaction energy in PSN-Ensemble brings in dynamical/chemical knowledge into the network representation. Also, the results are mapped on a graphical display of the structure, allowing an easy access of network analysis to a general biological community. The potential of PSN-Ensemble toward examining structural ensemble is exemplified using MD trajectories of an ubiquitin-conjugating enzyme (UbcH5b). Furthermore, insights derived from network parameters evaluated using PSN-Ensemble for single-static structures of active/inactive states of β2-adrenergic receptor and the ternary tRNA complexes of tyrosyl tRNA synthetases (from organisms across kingdoms) are discussed. PSN-Ensemble is freely available from http://vishgraph.mbu.iisc.ernet.in/PSN-Ensemble/psn_index.html.

  11. Clustering Categorical Data:A Cluster Ensemble Approach

    Institute of Scientific and Technical Information of China (English)

    He Zengyou(何增友); Xu Xiaofei; Deng Shengchun

    2003-01-01

    Clustering categorical data, an integral part of data mining,has attracted much attention recently. In this paper, the authors formally define the categorical data clustering problem as an optimization problem from the viewpoint of cluster ensemble, and apply cluster ensemble approach for clustering categorical data. Experimental results on real datasets show that better clustering accuracy can be obtained by comparing with existing categorical data clustering algorithms.

  12. Ensemble control of linear systems with parameter uncertainties

    Science.gov (United States)

    Kou, Kit Ian; Liu, Yang; Zhang, Dandan; Tu, Yanshuai

    2016-07-01

    In this paper, we study the optimal control problem for a class of four-dimensional linear systems based on quaternionic and Fourier analysis. When the control is unconstrained, the optimal ensemble controller for this linear ensemble control systems is given in terms of prolate spheroidal wave functions. For the constrained convex optimisation problem of such systems, the quadratic programming is presented to obtain the optimal control laws. Simulations are given to verity the effectiveness of the proposed theory.

  13. Pycobra: A Python Toolbox for Ensemble Learning and Visualisation

    OpenAIRE

    Guedj, Benjamin; Srinivasa Desikan, Bhargav

    2017-01-01

    We introduce \\texttt{pycobra}, a Python library devoted to ensemble learning (regression and classification) and visualisation. Its main assets are the implementation of several ensemble learning algorithms, a flexible and generic interface to compare and blend any existing machine learning algorithm available in Python libraries (as long as a \\texttt{predict} method is given), and visualisation tools such as Voronoi tessellations. \\texttt{pycobra} is fully \\texttt{scikit-learn} compatible an...

  14. Ensembles of probability estimation trees for customer churn prediction

    OpenAIRE

    2010-01-01

    Customer churn prediction is one of the most, important elements tents of a company's Customer Relationship Management, (CRM) strategy In tins study, two strategies are investigated to increase the lift. performance of ensemble classification models, i.e (1) using probability estimation trees (PETs) instead of standard decision trees as base classifiers; and (n) implementing alternative fusion rules based on lift weights lot the combination of ensemble member's outputs Experiments ale conduct...

  15. Evolutionary Ensemble for In Silico Prediction of Ames Test Mutagenicity

    Science.gov (United States)

    Chen, Huanhuan; Yao, Xin

    Driven by new regulations and animal welfare, the need to develop in silico models has increased recently as alternative approaches to safety assessment of chemicals without animal testing. This paper describes a novel machine learning ensemble approach to building an in silico model for the prediction of the Ames test mutagenicity, one of a battery of the most commonly used experimental in vitro and in vivo genotoxicity tests for safety evaluation of chemicals. Evolutionary random neural ensemble with negative correlation learning (ERNE) [1] was developed based on neural networks and evolutionary algorithms. ERNE combines the method of bootstrap sampling on training data with the method of random subspace feature selection to ensure diversity in creating individuals within an initial ensemble. Furthermore, while evolving individuals within the ensemble, it makes use of the negative correlation learning, enabling individual NNs to be trained as accurate as possible while still manage to maintain them as diverse as possible. Therefore, the resulting individuals in the final ensemble are capable of cooperating collectively to achieve better generalization of prediction. The empirical experiment suggest that ERNE is an effective ensemble approach for predicting the Ames test mutagenicity of chemicals.

  16. SVM and SVM Ensembles in Breast Cancer Prediction

    Science.gov (United States)

    Huang, Min-Wei; Chen, Chih-Wen; Lin, Wei-Chao; Ke, Shih-Wen; Tsai, Chih-Fong

    2017-01-01

    Breast cancer is an all too common disease in women, making how to effectively predict it an active research problem. A number of statistical and machine learning techniques have been employed to develop various breast cancer prediction models. Among them, support vector machines (SVM) have been shown to outperform many related techniques. To construct the SVM classifier, it is first necessary to decide the kernel function, and different kernel functions can result in different prediction performance. However, there have been very few studies focused on examining the prediction performances of SVM based on different kernel functions. Moreover, it is unknown whether SVM classifier ensembles which have been proposed to improve the performance of single classifiers can outperform single SVM classifiers in terms of breast cancer prediction. Therefore, the aim of this paper is to fully assess the prediction performance of SVM and SVM ensembles over small and large scale breast cancer datasets. The classification accuracy, ROC, F-measure, and computational times of training SVM and SVM ensembles are compared. The experimental results show that linear kernel based SVM ensembles based on the bagging method and RBF kernel based SVM ensembles with the boosting method can be the better choices for a small scale dataset, where feature selection should be performed in the data pre-processing stage. For a large scale dataset, RBF kernel based SVM ensembles based on boosting perform better than the other classifiers. PMID:28060807

  17. Monthly Ensembles in Algal Bloom Predictions on the Baltic Sea

    Science.gov (United States)

    Roiha, Petra; Westerlund, Antti; Stipa, Tapani

    2010-05-01

    In this work we explore the statistical features of monthly ensembles and their capability to predict biogeochemical conditions in the Baltic Sea. Operational marine environmental modelling has been considered hard, and consequently there are very few operational ecological models. Operational modelling of harmful algal blooms is harder still, since it is difficult to separate the algal species in models, and in general, very little is known of HAB properties. We present results of an ensemble approach to HAB forecasting in the Baltic, and discuss the applicability of the forecasting method to biochemical modelling. It turns out that HABs are indeed possible to forecast with useful accuracy. For modelling the algal blooms in Baltic Sea we used FMI operational 3-dimensional biogeochemical model to produce seasonal ensemble forecasts for different physical, chemical and biological variables. The modelled variables were temperature, salinity, velocity, silicate, phosphate, nitrate, diatoms, flagellates and two species of potentially toxic filamentous cyanobacteria nodularia spumigena and aphanizomenon flos-aquae. In this work we concentrate to the latter two. Ensembles were produced by running the biogeochemical model several times and forcing it on every run with different set of seasonal weather parameters from ECMWF's mathematically perturbed ensemble prediction forecasts. The ensembles were then analysed by statistical methods and the median, quartiles, minimum and maximum values were calculated for estimating the probable amounts of algae. Validation for the forecast method was made by comparing the final results against available and valid in-situ HAB data.

  18. SVM and SVM Ensembles in Breast Cancer Prediction.

    Science.gov (United States)

    Huang, Min-Wei; Chen, Chih-Wen; Lin, Wei-Chao; Ke, Shih-Wen; Tsai, Chih-Fong

    2017-01-01

    Breast cancer is an all too common disease in women, making how to effectively predict it an active research problem. A number of statistical and machine learning techniques have been employed to develop various breast cancer prediction models. Among them, support vector machines (SVM) have been shown to outperform many related techniques. To construct the SVM classifier, it is first necessary to decide the kernel function, and different kernel functions can result in different prediction performance. However, there have been very few studies focused on examining the prediction performances of SVM based on different kernel functions. Moreover, it is unknown whether SVM classifier ensembles which have been proposed to improve the performance of single classifiers can outperform single SVM classifiers in terms of breast cancer prediction. Therefore, the aim of this paper is to fully assess the prediction performance of SVM and SVM ensembles over small and large scale breast cancer datasets. The classification accuracy, ROC, F-measure, and computational times of training SVM and SVM ensembles are compared. The experimental results show that linear kernel based SVM ensembles based on the bagging method and RBF kernel based SVM ensembles with the boosting method can be the better choices for a small scale dataset, where feature selection should be performed in the data pre-processing stage. For a large scale dataset, RBF kernel based SVM ensembles based on boosting perform better than the other classifiers.

  19. Selecting, weeding, and weighting biased climate model ensembles

    Science.gov (United States)

    Jackson, C. S.; Picton, J.; Huerta, G.; Nosedal Sanchez, A.

    2012-12-01

    In the Bayesian formulation, the "log-likelihood" is a test statistic for selecting, weeding, or weighting climate model ensembles with observational data. This statistic has the potential to synthesize the physical and data constraints on quantities of interest. One of the thorny issues for formulating the log-likelihood is how one should account for biases. While in the past we have included a generic discrepancy term, not all biases affect predictions of quantities of interest. We make use of a 165-member ensemble CAM3.1/slab ocean climate models with different parameter settings to think through the issues that are involved with predicting each model's sensitivity to greenhouse gas forcing given what can be observed from the base state. In particular we use multivariate empirical orthogonal functions to decompose the differences that exist among this ensemble to discover what fields and regions matter to the model's sensitivity. We find that the differences that matter are a small fraction of the total discrepancy. Moreover, weighting members of the ensemble using this knowledge does a relatively poor job of adjusting the ensemble mean toward the known answer. This points out the shortcomings of using weights to correct for biases in climate model ensembles created by a selection process that does not emphasize the priorities of your log-likelihood.

  20. Genetic Programming Based Ensemble System for Microarray Data Classification

    Directory of Open Access Journals (Sweden)

    Kun-Hong Liu

    2015-01-01

    Full Text Available Recently, more and more machine learning techniques have been applied to microarray data analysis. The aim of this study is to propose a genetic programming (GP based new ensemble system (named GPES, which can be used to effectively classify different types of cancers. Decision trees are deployed as base classifiers in this ensemble framework with three operators: Min, Max, and Average. Each individual of the GP is an ensemble system, and they become more and more accurate in the evolutionary process. The feature selection technique and balanced subsampling technique are applied to increase the diversity in each ensemble system. The final ensemble committee is selected by a forward search algorithm, which is shown to be capable of fitting data automatically. The performance of GPES is evaluated using five binary class and six multiclass microarray datasets, and results show that the algorithm can achieve better results in most cases compared with some other ensemble systems. By using elaborate base classifiers or applying other sampling techniques, the performance of GPES may be further improved.

  1. Knowledge based cluster ensemble for cancer discovery from biomolecular data.

    Science.gov (United States)

    Yu, Zhiwen; Wongb, Hau-San; You, Jane; Yang, Qinmin; Liao, Hongying

    2011-06-01

    The adoption of microarray techniques in biological and medical research provides a new way for cancer diagnosis and treatment. In order to perform successful diagnosis and treatment of cancer, discovering and classifying cancer types correctly is essential. Class discovery is one of the most important tasks in cancer classification using biomolecular data. Most of the existing works adopt single clustering algorithms to perform class discovery from biomolecular data. However, single clustering algorithms have limitations, which include a lack of robustness, stability, and accuracy. In this paper, we propose a new cluster ensemble approach called knowledge based cluster ensemble (KCE) which incorporates the prior knowledge of the data sets into the cluster ensemble framework. Specifically, KCE represents the prior knowledge of a data set in the form of pairwise constraints. Then, the spectral clustering algorithm (SC) is adopted to generate a set of clustering solutions. Next, KCE transforms pairwise constraints into confidence factors for these clustering solutions. After that, a consensus matrix is constructed by considering all the clustering solutions and their corresponding confidence factors. The final clustering result is obtained by partitioning the consensus matrix. Comparison with single clustering algorithms and conventional cluster ensemble approaches, knowledge based cluster ensemble approaches are more robust, stable and accurate. The experiments on cancer data sets show that: 1) KCE works well on these data sets; 2) KCE not only outperforms most of the state-of-the-art single clustering algorithms, but also outperforms most of the state-of-the-art cluster ensemble approaches.

  2. Hybrid Intrusion Detection Using Ensemble of Classification Methods

    Directory of Open Access Journals (Sweden)

    M.Govindarajan

    2014-01-01

    Full Text Available One of the major developments in machine learning in the past decade is the ensemble method, which finds highly accurate classifier by combining many moderately accurate component classifiers. In this research work, new ensemble classification methods are proposed for homogeneous ensemble classifiers using bagging and heterogeneous ensemble classifiers using arcing classifier and their performances are analyzed in terms of accuracy. A Classifier ensemble is designed using Radial Basis Function (RBF and Support Vector Machine (SVM as base classifiers. The feasibility and the benefits of the proposed approaches are demonstrated by the means of real and benchmark data sets of intrusion detection. The main originality of the proposed approach is based on three main parts: preprocessing phase, classification phase and combining phase. A wide range of comparative experiments are conducted for real and benchmark data sets of intrusion detection. The accuracy of base classifiers is compared with homogeneous and heterogeneous models for data mining problem. The proposed ensemble methods provide significant improvement of accuracy compared to individual classifiers and also heterogeneous models exhibit better results than homogeneous models for real and benchmark data sets of intrusion detection.

  3. Long-range interacting systems in the unconstrained ensemble

    Science.gov (United States)

    Latella, Ivan; Pérez-Madrid, Agustín; Campa, Alessandro; Casetti, Lapo; Ruffo, Stefano

    2017-01-01

    Completely open systems can exchange heat, work, and matter with the environment. While energy, volume, and number of particles fluctuate under completely open conditions, the equilibrium states of the system, if they exist, can be specified using the temperature, pressure, and chemical potential as control parameters. The unconstrained ensemble is the statistical ensemble describing completely open systems and the replica energy is the appropriate free energy for these control parameters from which the thermodynamics must be derived. It turns out that macroscopic systems with short-range interactions cannot attain equilibrium configurations in the unconstrained ensemble, since temperature, pressure, and chemical potential cannot be taken as a set of independent variables in this case. In contrast, we show that systems with long-range interactions can reach states of thermodynamic equilibrium in the unconstrained ensemble. To illustrate this fact, we consider a modification of the Thirring model and compare the unconstrained ensemble with the canonical and grand-canonical ones: The more the ensemble is constrained by fixing the volume or number of particles, the larger the space of parameters defining the equilibrium configurations.

  4. Three-model ensemble wind prediction in southern Italy

    Science.gov (United States)

    Torcasio, Rosa Claudia; Federico, Stefano; Calidonna, Claudia Roberta; Avolio, Elenio; Drofa, Oxana; Landi, Tony Christian; Malguzzi, Piero; Buzzi, Andrea; Bonasoni, Paolo

    2016-03-01

    Quality of wind prediction is of great importance since a good wind forecast allows the prediction of available wind power, improving the penetration of renewable energies into the energy market. Here, a 1-year (1 December 2012 to 30 November 2013) three-model ensemble (TME) experiment for wind prediction is considered. The models employed, run operationally at National Research Council - Institute of Atmospheric Sciences and Climate (CNR-ISAC), are RAMS (Regional Atmospheric Modelling System), BOLAM (BOlogna Limited Area Model), and MOLOCH (MOdello LOCale in H coordinates). The area considered for the study is southern Italy and the measurements used for the forecast verification are those of the GTS (Global Telecommunication System). Comparison with observations is made every 3 h up to 48 h of forecast lead time. Results show that the three-model ensemble outperforms the forecast of each individual model. The RMSE improvement compared to the best model is between 22 and 30 %, depending on the season. It is also shown that the three-model ensemble outperforms the IFS (Integrated Forecasting System) of the ECMWF (European Centre for Medium-Range Weather Forecast) for the surface wind forecasts. Notably, the three-model ensemble forecast performs better than each unbiased model, showing the added value of the ensemble technique. Finally, the sensitivity of the three-model ensemble RMSE to the length of the training period is analysed.

  5. Ensembles of a small number of conformations with relative populations

    Energy Technology Data Exchange (ETDEWEB)

    Vammi, Vijay, E-mail: vsvammi@iastate.edu; Song, Guang, E-mail: gsong@iastate.edu [Iowa State University, Bioinformatics and Computational Biology Program, Department of Computer Science (United States)

    2015-12-15

    In our previous work, we proposed a new way to represent protein native states, using ensembles of a small number of conformations with relative Populations, or ESP in short. Using Ubiquitin as an example, we showed that using a small number of conformations could greatly reduce the potential of overfitting and assigning relative populations to protein ensembles could significantly improve their quality. To demonstrate that ESP indeed is an excellent alternative to represent protein native states, in this work we compare the quality of two ESP ensembles of Ubiquitin with several well-known regular ensembles or average structure representations. Extensive amount of significant experimental data are employed to achieve a thorough assessment. Our results demonstrate that ESP ensembles, though much smaller in size comparing to regular ensembles, perform equally or even better sometimes in all four different types of experimental data used in the assessment, namely, the residual dipolar couplings, residual chemical shift anisotropy, hydrogen exchange rates, and solution scattering profiles. This work further underlines the significance of having relative populations in describing the native states.

  6. Concrete ensemble Kalman filters with rigorous catastrophic filter divergence.

    Science.gov (United States)

    Kelly, David; Majda, Andrew J; Tong, Xin T

    2015-08-25

    The ensemble Kalman filter and ensemble square root filters are data assimilation methods used to combine high-dimensional, nonlinear dynamical models with observed data. Ensemble methods are indispensable tools in science and engineering and have enjoyed great success in geophysical sciences, because they allow for computationally cheap low-ensemble-state approximation for extremely high-dimensional turbulent forecast models. From a theoretical perspective, the dynamical properties of these methods are poorly understood. One of the central mysteries is the numerical phenomenon known as catastrophic filter divergence, whereby ensemble-state estimates explode to machine infinity, despite the true state remaining in a bounded region. In this article we provide a breakthrough insight into the phenomenon, by introducing a simple and natural forecast model that transparently exhibits catastrophic filter divergence under all ensemble methods and a large set of initializations. For this model, catastrophic filter divergence is not an artifact of numerical instability, but rather a true dynamical property of the filter. The divergence is not only validated numerically but also proven rigorously. The model cleanly illustrates mechanisms that give rise to catastrophic divergence and confirms intuitive accounts of the phenomena given in past literature.

  7. Long-range interacting systems in the unconstrained ensemble.

    Science.gov (United States)

    Latella, Ivan; Pérez-Madrid, Agustín; Campa, Alessandro; Casetti, Lapo; Ruffo, Stefano

    2017-01-01

    Completely open systems can exchange heat, work, and matter with the environment. While energy, volume, and number of particles fluctuate under completely open conditions, the equilibrium states of the system, if they exist, can be specified using the temperature, pressure, and chemical potential as control parameters. The unconstrained ensemble is the statistical ensemble describing completely open systems and the replica energy is the appropriate free energy for these control parameters from which the thermodynamics must be derived. It turns out that macroscopic systems with short-range interactions cannot attain equilibrium configurations in the unconstrained ensemble, since temperature, pressure, and chemical potential cannot be taken as a set of independent variables in this case. In contrast, we show that systems with long-range interactions can reach states of thermodynamic equilibrium in the unconstrained ensemble. To illustrate this fact, we consider a modification of the Thirring model and compare the unconstrained ensemble with the canonical and grand-canonical ones: The more the ensemble is constrained by fixing the volume or number of particles, the larger the space of parameters defining the equilibrium configurations.

  8. Concrete ensemble Kalman filters with rigorous catastrophic filter divergence

    Science.gov (United States)

    Kelly, David; Majda, Andrew J.; Tong, Xin T.

    2015-01-01

    The ensemble Kalman filter and ensemble square root filters are data assimilation methods used to combine high-dimensional, nonlinear dynamical models with observed data. Ensemble methods are indispensable tools in science and engineering and have enjoyed great success in geophysical sciences, because they allow for computationally cheap low-ensemble-state approximation for extremely high-dimensional turbulent forecast models. From a theoretical perspective, the dynamical properties of these methods are poorly understood. One of the central mysteries is the numerical phenomenon known as catastrophic filter divergence, whereby ensemble-state estimates explode to machine infinity, despite the true state remaining in a bounded region. In this article we provide a breakthrough insight into the phenomenon, by introducing a simple and natural forecast model that transparently exhibits catastrophic filter divergence under all ensemble methods and a large set of initializations. For this model, catastrophic filter divergence is not an artifact of numerical instability, but rather a true dynamical property of the filter. The divergence is not only validated numerically but also proven rigorously. The model cleanly illustrates mechanisms that give rise to catastrophic divergence and confirms intuitive accounts of the phenomena given in past literature. PMID:26261335

  9. EnsembleGASVR: A novel ensemble method for classifying missense single nucleotide polymorphisms

    KAUST Repository

    Rapakoulia, Trisevgeni

    2014-04-26

    Motivation: Single nucleotide polymorphisms (SNPs) are considered the most frequently occurring DNA sequence variations. Several computational methods have been proposed for the classification of missense SNPs to neutral and disease associated. However, existing computational approaches fail to select relevant features by choosing them arbitrarily without sufficient documentation. Moreover, they are limited to the problem ofmissing values, imbalance between the learning datasets and most of them do not support their predictions with confidence scores. Results: To overcome these limitations, a novel ensemble computational methodology is proposed. EnsembleGASVR facilitates a twostep algorithm, which in its first step applies a novel evolutionary embedded algorithm to locate close to optimal Support Vector Regression models. In its second step, these models are combined to extract a universal predictor, which is less prone to overfitting issues, systematizes the rebalancing of the learning sets and uses an internal approach for solving the missing values problem without loss of information. Confidence scores support all the predictions and the model becomes tunable by modifying the classification thresholds. An extensive study was performed for collecting the most relevant features for the problem of classifying SNPs, and a superset of 88 features was constructed. Experimental results show that the proposed framework outperforms well-known algorithms in terms of classification performance in the examined datasets. Finally, the proposed algorithmic framework was able to uncover the significant role of certain features such as the solvent accessibility feature, and the top-scored predictions were further validated by linking them with disease phenotypes. © The Author 2014.

  10. Ensemble Kalman filter versus ensemble smoother for assessing hydraulic conductivity via tracer test data assimilation

    Directory of Open Access Journals (Sweden)

    E. Crestani

    2013-04-01

    Full Text Available Estimating the spatial variability of hydraulic conductivity K in natural aquifers is important for predicting the transport of dissolved compounds. Especially in the nonreactive case, the plume evolution is mainly controlled by the heterogeneity of K. At the local scale, the spatial distribution of K can be inferred by combining the Lagrangian formulation of the transport with a Kalman-filter-based technique and assimilating a sequence of time-lapse concentration C measurements, which, for example, can be evaluated on site through the application of a geophysical method. The objective of this work is to compare the ensemble Kalman filter (EnKF and the ensemble smoother (ES capabilities to retrieve the hydraulic conductivity spatial distribution in a groundwater flow and transport modeling framework. The application refers to a two-dimensional synthetic aquifer in which a tracer test is simulated. Moreover, since Kalman-filter-based methods are optimal only if each of the involved variables fit to a Gaussian probability density function (pdf and since this condition may not be met by some of the flow and transport state variables, issues related to the non-Gaussianity of the variables are analyzed and different transformation of the pdfs are considered in order to evaluate their influence on the performance of the methods. The results show that the EnKF reproduces with good accuracy the hydraulic conductivity field, outperforming the ES regardless of the pdf of the concentrations.

  11. On evaluation of ensemble precipitation forecasts with observation-based ensembles

    Directory of Open Access Journals (Sweden)

    S. Jaun

    2007-04-01

    Full Text Available Spatial interpolation of precipitation data is uncertain. How important is this uncertainty and how can it be considered in evaluation of high-resolution probabilistic precipitation forecasts? These questions are discussed by experimental evaluation of the COSMO consortium's limited-area ensemble prediction system COSMO-LEPS. The applied performance measure is the often used Brier skill score (BSS. The observational references in the evaluation are (a analyzed rain gauge data by ordinary Kriging and (b ensembles of interpolated rain gauge data by stochastic simulation. This permits the consideration of either a deterministic reference (the event is observed or not with 100% certainty or a probabilistic reference that makes allowance for uncertainties in spatial averaging. The evaluation experiments show that the evaluation uncertainties are substantial even for the large area (41 300 km2 of Switzerland with a mean rain gauge distance as good as 7 km: the one- to three-day precipitation forecasts have skill decreasing with forecast lead time but the one- and two-day forecast performances differ not significantly.

  12. Extended Range Prediction of Indian Summer Monsoon: Current status

    Science.gov (United States)

    Sahai, A. K.; Abhilash, S.; Borah, N.; Joseph, S.; Chattopadhyay, R.; S, S.; Rajeevan, M.; Mandal, R.; Dey, A.

    2014-12-01

    The main focus of this study is to develop forecast consensus in the extended range prediction (ERP) of monsoon Intraseasonal oscillations using a suit of different variants of Climate Forecast system (CFS) model. In this CFS based Grand MME prediction system (CGMME), the ensemble members are generated by perturbing the initial condition and using different configurations of CFSv2. This is to address the role of different physical mechanisms known to have control on the error growth in the ERP in the 15-20 day time scale. The final formulation of CGMME is based on 21 ensembles of the standalone Global Forecast System (GFS) forced with bias corrected forecasted SST from CFS, 11 low resolution CFST126 and 11 high resolution CFST382. Thus, we develop the multi-model consensus forecast for the ERP of Indian summer monsoon (ISM) using a suite of different variants of CFS model. This coordinated international effort lead towards the development of specific tailor made regional forecast products over Indian region. Skill of deterministic and probabilistic categorical rainfall forecast as well the verification of large-scale low frequency monsoon intraseasonal oscillations has been carried out using hindcast from 2001-2012 during the monsoon season in which all models are initialized at every five days starting from 16May to 28 September. The skill of deterministic forecast from CGMME is better than the best participating single model ensemble configuration (SME). The CGMME approach is believed to quantify the uncertainty in both initial conditions and model formulation. Main improvement is attained in probabilistic forecast which is because of an increase in the ensemble spread, thereby reducing the error due to over-confident ensembles in a single model configuration. For probabilistic forecast, three tercile ranges are determined by ranking method based on the percentage of ensemble members from all the participating models falls in those three categories. CGMME further

  13. Comparing climate change impacts on crops in Belgium based on CMIP3 and EU-ENSEMBLES multi-model ensembles

    Science.gov (United States)

    Vanuytrecht, E.; Raes, D.; Willems, P.; Semenov, M.

    2012-04-01

    Global Circulation Models (GCMs) are sophisticated tools to study the future evolution of the climate. Yet, the coarse scale of GCMs of hundreds of kilometers raises questions about the suitability for agricultural impact assessments. These assessments are often made at field level and require consideration of interactions at sub-GCM grid scale (e.g., elevation-dependent climatic changes). Regional climate models (RCMs) were developed to provide climate projections at a spatial scale of 25-50 km for limited regions, e.g. Europe (Giorgi and Mearns, 1991). Climate projections from GCMs or RCMs are available as multi-model ensembles. These ensembles are based on large data sets of simulations produced by modelling groups worldwide, who performed a set of coordinated climate experiments in which climate models were run for a common set of experiments and various emissions scenarios (Knutti et al., 2010). The use of multi-model ensembles in climate change studies is an important step in quantifying uncertainty in impact predictions, which will underpin more informed decisions for adaptation and mitigation to changing climate (Semenov and Stratonovitch, 2010). The objective of our study was to evaluate the effect of the spatial scale of climate projections on climate change impacts for cereals in Belgium. Climate scenarios were based on two multi-model ensembles, one comprising 15 GCMs of the Coupled Model Intercomparison Project phase 3 (CMIP3; Meehl et al., 2007) with spatial resolution of 200-300 km, the other comprising 9 RCMs of the EU-ENSEMBLES project (van der Linden and Mitchell, 2009) with spatial resolution of 25 km. To be useful for agricultural impact assessments, the projections of GCMs and RCMs were downscaled to the field level. Long series (240 cropping seasons) of local-scale climate scenarios were generated by the LARS-WG weather generator (Semenov et al., 2010) via statistical inference. Crop growth and development were simulated with the Aqua

  14. A past discharge assimilation system for ensemble streamflow forecasts over France – Part 2: Impact on the ensemble streamflow forecasts

    Directory of Open Access Journals (Sweden)

    G. Thirel

    2010-04-01

    Full Text Available The use of ensemble streamflow forecasts is developing in the international flood forecasting services. Such systems can provide more accurate forecasts and useful information about the uncertainty of the forecasts, thus improving the assessment of risks. Nevertheless, these systems, like all hydrological forecasts, suffer from errors on initialization or on meteorological data, which lead to hydrological prediction errors. This article, which is the second part of a 2-part article, concerns the impacts of initial states, improved by a streamflow assimilation system, on an ensemble streamflow prediction system over France. An assimilation system was implemented to improve the streamflow analysis of the SAFRAN-ISBA-MODCOU (SIM hydro-meteorological suite, which initializes the ensemble streamflow forecasts at Météo-France. This assimilation system, using the Best Linear Unbiased Estimator (BLUE and modifying the initial soil moisture states, showed an improvement of the streamflow analysis with low soil moisture increments. The final states of this suite were used to initialize the ensemble streamflow forecasts of Météo-France, which are based on the SIM model and use the European Centre for Medium-range Weather Forecasts (ECMWF 10-day Ensemble Prediction System (EPS. Two different configurations of the assimilation system were used in this study: the first with the classical SIM model and the second using improved soil physics in ISBA. The effects of the assimilation system on the ensemble streamflow forecasts were assessed for these two configurations, and a comparison was made with the original (i.e. without data assimilation and without the improved physics ensemble streamflow forecasts. It is shown that the assimilation system improved most of the statistical scores usually computed for the validation of ensemble predictions (RMSE, Brier Skill Score and its decomposition, Ranked Probability Skill Score, False Alarm Rate, etc., especially

  15. Classification of Subcellular Phenotype Images by Decision Templates for Classifier Ensemble

    Science.gov (United States)

    Zhang, Bailing

    2010-01-01

    Subcellular localization is a key functional characteristic of proteins. An automatic, reliable and efficient prediction system for protein subcellular localization is needed for large-scale genome analysis. The automated cell phenotype image classification problem is an interesting "bioimage informatics" application. It can be used for establishing knowledge of the spatial distribution of proteins within living cells and permits to screen systems for drug discovery or for early diagnosis of a disease. In this paper, three well-known texture feature extraction methods including local binary patterns (LBP), Gabor filtering and Gray Level Coocurrence Matrix (GLCM) have been applied to cell phenotype images and the multiple layer perceptron (MLP) method has been used to classify cell phenotype image. After classification of the extracted features, decision-templates ensemble algorithm (DT) is used to combine base classifiers built on the different feature sets. Different texture feature sets can provide sufficient diversity among base classifiers, which is known as a necessary condition for improvement in ensemble performance. For the HeLa cells, the human classification error rate on this task is of 17% as reported in previous publications. We obtain with our method an error rate of 4.8%.

  16. Mean-field Ensemble Kalman Filter

    KAUST Repository

    Law, Kody

    2015-01-07

    A proof of convergence of the standard EnKF generalized to non-Gaussian state space models is provided. A density-based deterministic approximation of the mean-field limiting EnKF (MFEnKF) is proposed, consisting of a PDE solver and a quadrature rule. Given a certain minimal order of convergence between the two, this extends to the deterministic filter approximation, which is therefore asymptotically superior to standard EnKF for d < 2 . The fidelity of approximation of the true distribution is also established using an extension of total variation metric to random measures. This is limited by a Gaussian bias term arising from non-linearity/non-Gaussianity of the model, which arises in both deterministic and standard EnKF. Numerical results support and extend the theory.

  17. Combining 2-m temperature nowcasting and short range ensemble forecasting

    Directory of Open Access Journals (Sweden)

    A. Kann

    2011-12-01

    Full Text Available During recent years, numerical ensemble prediction systems have become an important tool for estimating the uncertainties of dynamical and physical processes as represented in numerical weather models. The latest generation of limited area ensemble prediction systems (LAM-EPSs allows for probabilistic forecasts at high resolution in both space and time. However, these systems still suffer from systematic deficiencies. Especially for nowcasting (0–6 h applications the ensemble spread is smaller than the actual forecast error. This paper tries to generate probabilistic short range 2-m temperature forecasts by combining a state-of-the-art nowcasting method and a limited area ensemble system, and compares the results with statistical methods. The Integrated Nowcasting Through Comprehensive Analysis (INCA system, which has been in operation at the Central Institute for Meteorology and Geodynamics (ZAMG since 2006 (Haiden et al., 2011, provides short range deterministic forecasts at high temporal (15 min–60 min and spatial (1 km resolution. An INCA Ensemble (INCA-EPS of 2-m temperature forecasts is constructed by applying a dynamical approach, a statistical approach, and a combined dynamic-statistical method. The dynamical method takes uncertainty information (i.e. ensemble variance from the operational limited area ensemble system ALADIN-LAEF (Aire Limitée Adaptation Dynamique Développement InterNational Limited Area Ensemble Forecasting which is running operationally at ZAMG (Wang et al., 2011. The purely statistical method assumes a well-calibrated spread-skill relation and applies ensemble spread according to the skill of the INCA forecast of the most recent past. The combined dynamic-statistical approach adapts the ensemble variance gained from ALADIN-LAEF with non-homogeneous Gaussian regression (NGR which yields a statistical mbox{correction} of the first and second moment (mean bias and dispersion for Gaussian distributed continuous

  18. Motor-motor interactions in ensembles of muscle myosin: using theory to connect single molecule to ensemble measurements

    Science.gov (United States)

    Walcott, Sam

    2013-03-01

    Interactions between the proteins actin and myosin drive muscle contraction. Properties of a single myosin interacting with an actin filament are largely known, but a trillion myosins work together in muscle. We are interested in how single-molecule properties relate to ensemble function. Myosin's reaction rates depend on force, so ensemble models keep track of both molecular state and force on each molecule. These models make subtle predictions, e.g. that myosin, when part of an ensemble, moves actin faster than when isolated. This acceleration arises because forces between molecules speed reaction kinetics. Experiments support this prediction and allow parameter estimates. A model based on this analysis describes experiments from single molecule to ensemble. In vivo, actin is regulated by proteins that, when present, cause the binding of one myosin to speed the binding of its neighbors; binding becomes cooperative. Although such interactions preclude the mean field approximation, a set of linear ODEs describes these ensembles under simplified experimental conditions. In these experiments cooperativity is strong, with the binding of one molecule affecting ten neighbors on either side. We progress toward a description of myosin ensembles under physiological conditions.

  19. Extended Irreversible Thermodynamics

    CERN Document Server

    Jou, David

    2010-01-01

    This is the 4th edition of the highly acclaimed monograph on Extended Irreversible Thermodynamics, a theory that goes beyond the classical theory of irreversible processes. In contrast to the classical approach, the basic variables describing the system are complemented by non-equilibrium quantities. The claims made for extended thermodynamics are confirmed by the kinetic theory of gases and statistical mechanics. The book covers a wide spectrum of applications, and also contains a thorough discussion of the foundations and the scope of the current theories on non-equilibrium thermodynamics. For this new edition, the authors critically revised existing material while taking into account the most recent developments in fast moving fields such as heat transport in micro- and nanosystems or fast solidification fronts in materials sciences. Several fundamental chapters have been revisited emphasizing physics and applications over mathematical derivations. Also, fundamental questions on the definition of non-equil...

  20. Introduction to Extended Electrodynamics

    CERN Document Server

    Donev, S

    1997-01-01

    This paper summarizes the motivations and results obtained so far in the frame of a particular non-linearization of Classical Electrodynamics, which was called Extended Electrodynamics. The main purpose pursued with this non-linear extension of the classical Maxwell's equations is to have a reliable field-theoretical approach in describing (3+1) soliton-like electromagnetic formations, in particular, to build an extended and finite field model of free photons and photon complexes. The first chapter gives a corresponding analysis of Maxwell theory and introduces the new equations. The second chapter gives a full account of the results, including the photon-like solutions, in the vacuum case. A new concept, called scale factor, is defined and successfully used. Two ways for describing the intrinsic angular momentum are given. Interference of two photon-like solutions is also considered. The third chapter considers interaction with external fields (continuous media) on the base of establishing correspondence bet...

  1. Extended Theories of Gravitation

    Directory of Open Access Journals (Sweden)

    Fatibene Lorenzo

    2013-09-01

    Full Text Available Within the framework of extended theories of gravitation we shall discuss physical equivalences among different formalisms and classical tests. As suggested by the Ehlers-Pirani-Schild framework, the conformal invariance will be preserved and its effect on observational protocols discussed. Accordingly, we shall review standard tests showing how Palatini f(R-theories naturally passes solar system tests. Observation protocols will be discussed in this wider framework.

  2. Propelling Extended Objects

    Science.gov (United States)

    Humbert, Richard

    2010-01-01

    A force acting on just part of an extended object (either a solid or a volume of a liquid) can cause all of it to move. That motion is due to the transmission of the force through the object by its material. This paper discusses how the force is distributed to all of the object by a gradient of stress or pressure in it, which creates the local…

  3. The Fly Printer - Extended

    DEFF Research Database (Denmark)

    Beloff, Laura; Klaus, Malena

    2016-01-01

    Artist talk / Work-in-progress What is the purpose of a machine or an artifact, like the Fly Printer, that is dislocated, that produces images that have no meaning, no instrumentality, that depict nothing in the world? The biological and the cultural are reunited in this apparatus as a possibilit...... the results. The extended version of the Fly Printer containing the technological perception and DNNs is a collaboration between Laura Beloff and Malene Theres Klaus...

  4. Structured RNAs and synteny regions in the pig genome

    DEFF Research Database (Denmark)

    Anthon, Christian; Tafer, Hakim; Havgaard, Jakob Hull;

    2014-01-01

    for Laurasiatheria (pig, cow, dolphin, horse, cat, dog, hedgehog). CONCLUSIONS: We have obtained one of the most comprehensive annotations for structured ncRNAs of a mammalian genome, which is likely to play central roles in both health modelling and production. The core annotation is available in Ensembl 70......BACKGROUND: Annotating mammalian genomes for noncoding RNAs (ncRNAs) is nontrivial since far from all ncRNAs are known and the computational models are resource demanding. Currently, the human genome holds the best mammalian ncRNA annotation, a result of numerous efforts by several groups. However...

  5. Flexible modeling frameworks to replace small ensembles of hydrological models and move toward large ensembles?

    Science.gov (United States)

    Addor, Nans; Clark, Martyn P.; Mizukami, Naoki

    2017-04-01

    Climate change impacts on hydrological processes are typically assessed using small ensembles of hydrological models. That is, a handful of hydrological models are typically driven by a larger number of climate models. Such a setup has several limitations. Because the number of hydrological models is small, only a small proportion of the model space is sampled, likely leading to an underestimation of the uncertainties in the projections. Further, sampling is arbitrary: although hydrological models should be selected to provide a representative sample of existing models (in terms of complexity and governing hypotheses), they are instead usually selected based on legacy reasons. Furthermore, running several hydrological models currently constitutes a practical challenge because each model must be setup and calibrated individually. Finally, and probably most importantly, the differences between the projected impacts cannot be directly related to differences between hydrological models, because the models are different in almost every possible aspect. We are hence in a situation in which different hydrological models deliver different projections, but for reasons that are mostly unclear, and in which the uncertainty in the projections is probably underestimated. To overcome these limitations, we are experimenting with the flexible modeling framework FUSE (Framework for Understanding Model Errors). FUSE enables to construct conceptual models piece by piece (in a "pick and mix" approach), so it can be used to generate a large number of models that mimic existing models and/or models that differ from other models in single targeted respect (e.g. how baseflow is generated). FUSE hence allows for controlled modeling experiments, and for a more systematic and exhaustive sampling of the model space. Here we explore climate change impacts over the contiguous USA on a 12km grid using two groups of three models: the first group involves the commonly used models VIC, PRMS and HEC

  6. Quantifying Monte Carlo uncertainty in ensemble Kalman filter

    Energy Technology Data Exchange (ETDEWEB)

    Thulin, Kristian; Naevdal, Geir; Skaug, Hans Julius; Aanonsen, Sigurd Ivar

    2009-01-15

    This report is presenting results obtained during Kristian Thulin PhD study, and is a slightly modified form of a paper submitted to SPE Journal. Kristian Thulin did most of his portion of the work while being a PhD student at CIPR, University of Bergen. The ensemble Kalman filter (EnKF) is currently considered one of the most promising methods for conditioning reservoir simulation models to production data. The EnKF is a sequential Monte Carlo method based on a low rank approximation of the system covariance matrix. The posterior probability distribution of model variables may be estimated fram the updated ensemble, but because of the low rank covariance approximation, the updated ensemble members become correlated samples from the posterior distribution. We suggest using multiple EnKF runs, each with smaller ensemble size to obtain truly independent samples from the posterior distribution. This allows a point-wise confidence interval for the posterior cumulative distribution function (CDF) to be constructed. We present a methodology for finding an optimal combination of ensemble batch size (n) and number of EnKF runs (m) while keeping the total number of ensemble members ( m x n) constant. The optimal combination of n and m is found through minimizing the integrated mean square error (MSE) for the CDFs and we choose to define an EnKF run with 10.000 ensemble members as having zero Monte Carlo error. The methodology is tested on a simplistic, synthetic 2D model, but should be applicable also to larger, more realistic models. (author). 12 refs., figs.,tabs

  7. Bayesian Processor of Ensemble for Precipitation Forecasting: A Development Plan

    Science.gov (United States)

    Toth, Z.; Krzysztofowicz, R.

    2006-05-01

    The Bayesian Processor of Ensemble (BPE) is a new, theoretically-based technique for probabilistic forecasting of weather variates. It is a generalization of the Bayesian Processor of Output (BPO) developed by Krzysztofowicz and Maranzano for processing single values of multiple predictors into a posterior distribution function of a predictand. The BPE processes an ensemble of a predictand generated by multiple integrations of a numerical weather prediction (NWP) model, and optimally fuses the ensemble with climatic data in order to quantify uncertainty about the predictand. As is well known, Bayes theorem provides the optimal theoretical framework for fusing information from different sources and for obtaining the posterior distribution function of a predictand. Using a family of such distribution functions, a given raw ensemble can be mapped into a posterior ensemble, which is well calibrated, has maximum informativeness, and preserves the spatio-temporal and cross-variate dependence structure of the NWP output fields. The challenge is to develop and test the BPE suitable for operational forecasting. This talk will present the basic design components of the BPE, along with a discussion of the climatic and training data to be used in its potential application at the National Centers for Environmental Prediction (NCEP). The technique will be tested first on quasi-normally distributed variates and next on precipitation variates. For reasons of economy, the BPE will be applied on the relatively coarse resolution grid corresponding to the ensemble output, and then the posterior ensemble will be downscaled to finer grids such as that of the National Digital Forecast Database (NDFD).

  8. An assessment of a multi-model ensemble of decadal climate predictions

    Science.gov (United States)

    Bellucci, A.; Haarsma, R.; Gualdi, S.; Athanasiadis, P. J.; Caian, M.; Cassou, C.; Fernandez, E.; Germe, A.; Jungclaus, J.; Kröger, J.; Matei, D.; Müller, W.; Pohlmann, H.; Salas y Melia, D.; Sanchez, E.; Smith, D.; Terray, L.; Wyser, K.; Yang, S.

    2015-05-01

    A multi-model ensemble of decadal prediction experiments, performed in the framework of the EU-funded COMBINE (Comprehensive Modelling of the Earth System for Better Climate Prediction and Projection) Project following the 5th Coupled Model Intercomparison Project protocol is examined. The ensemble combines a variety of dynamical models, initialization and perturbation strategies, as well as data assimilation products employed to constrain the initial state of the system. Taking advantage of the multi-model approach, several aspects of decadal climate predictions are assessed, including predictive skill, impact of the initialization strategy and the level of uncertainty characterizing the predicted fluctuations of key climate variables. The present analysis adds to the growing evidence that the current generation of climate models adequately initialized have significant skill in predicting years ahead not only the anthropogenic warming but also part of the internal variability of the climate system. An important finding is that the multi-model ensemble mean does generally outperform the individual forecasts, a well-documented result for seasonal forecasting, supporting the need to extend the multi-model framework to real-time decadal predictions in order to maximize the predictive capabilities of currently available decadal forecast systems. The multi-model perspective did also allow a more robust assessment of the impact of the initialization strategy on the quality of decadal predictions, providing hints of an improved forecast skill under full-value (with respect to anomaly) initialization in the near-term range, over the Indo-Pacific equatorial region. Finally, the consistency across the different model predictions was assessed. Specifically, different systems reveal a general agreement in predicting the near-term evolution of surface temperatures, displaying positive correlations between different decadal hindcasts over most of the global domain.

  9. Inequalities of extended beta and extended hypergeometric functions.

    Science.gov (United States)

    Mondal, Saiful R

    2017-01-01

    We study the log-convexity of the extended beta functions. As a consequence, we establish Turán-type inequalities. The monotonicity, log-convexity, log-concavity of extended hypergeometric functions are deduced by using the inequalities on extended beta functions. The particular cases of those results also give the Turán-type inequalities for extended confluent and extended Gaussian hypergeometric functions. Some reverses of Turán-type inequalities are also derived.

  10. Ensemble Kalman filter versus ensemble smoother for assessing hydraulic conductivity via tracer test data assimilation

    Directory of Open Access Journals (Sweden)

    E. Crestani

    2012-11-01

    Full Text Available The significance of estimating the spatial variability of the hydraulic conductivity K in natural aquifers is relevant to the possibility of defining the space and time evolution of a non-reactive plume, since the transport of a solute is mainly controlled by the heterogeneity of K. At the local scale, the spatial distribution of K can be inferred by combining the Lagrangian formulation of the transport with a Kalman filter-based technique and assimilating a sequence of time-lapse concentration C measurements, which, for example, can be evaluated on-site through the application of a geophysical method. The objective of this work is to compare the ensemble Kalman filter (EnKF and the ensemble smoother (ES capabilities to retrieve the hydraulic conductivity spatial distribution in a groundwater flow and transport modeling framework. The application refers to a two-dimensional synthetic aquifer in which a tracer test is simulated. Moreover, since Kalman filter-based methods are optimal only if each of the involved variables fit to a Gaussian probability density function (pdf and since this condition may not be met by some of the flow and transport state variables, issues related to the non-Gaussianity of the variables are analyzed and different transformation of the pdfs are considered in order to evaluate their influence on the performance of the methods. The results show that the EnKF reproduces with good accuracy the hydraulic conductivity field, outperforming the ES regardless of the pdf of the concentrations.

  11. Extended Testability Analysis Tool

    Science.gov (United States)

    Melcher, Kevin; Maul, William A.; Fulton, Christopher

    2012-01-01

    The Extended Testability Analysis (ETA) Tool is a software application that supports fault management (FM) by performing testability analyses on the fault propagation model of a given system. Fault management includes the prevention of faults through robust design margins and quality assurance methods, or the mitigation of system failures. Fault management requires an understanding of the system design and operation, potential failure mechanisms within the system, and the propagation of those potential failures through the system. The purpose of the ETA Tool software is to process the testability analysis results from a commercial software program called TEAMS Designer in order to provide a detailed set of diagnostic assessment reports. The ETA Tool is a command-line process with several user-selectable report output options. The ETA Tool also extends the COTS testability analysis and enables variation studies with sensor sensitivity impacts on system diagnostics and component isolation using a single testability output. The ETA Tool can also provide extended analyses from a single set of testability output files. The following analysis reports are available to the user: (1) the Detectability Report provides a breakdown of how each tested failure mode was detected, (2) the Test Utilization Report identifies all the failure modes that each test detects, (3) the Failure Mode Isolation Report demonstrates the system s ability to discriminate between failure modes, (4) the Component Isolation Report demonstrates the system s ability to discriminate between failure modes relative to the components containing the failure modes, (5) the Sensor Sensor Sensitivity Analysis Report shows the diagnostic impact due to loss of sensor information, and (6) the Effect Mapping Report identifies failure modes that result in specified system-level effects.

  12. Precipitation and temperature ensemble forecasts from single-value forecasts

    Directory of Open Access Journals (Sweden)

    J. Schaake

    2007-04-01

    Full Text Available A procedure is presented to construct ensemble forecasts from single-value forecasts of precipitation and temperature. This involves dividing the spatial forecast domain and total forecast period into a number of parts that are treated as separate forecast events. The spatial domain is divided into hydrologic sub-basins. The total forecast period is divided into time periods, one for each model time step. For each event archived values of forecasts and corresponding observations are used to model the joint distribution of forecasts and observations. The conditional distribution of observations for a given single-value forecast is used to represent the corresponding probability distribution of events that may occur for that forecast. This conditional forecast distribution subsequently is used to create ensemble members that vary in space and time using the "Schaake Shuffle" (Clark et al, 2004. The resulting ensemble members have the same space-time patterns as historical observations so that space-time joint relationships between events that have a significant effect on hydrological response tend to be preserved.

    Forecast uncertainty is space and time-scale dependent. For a given lead time to the beginning of the valid period of an event, forecast uncertainty depends on the length of the forecast valid time period and the spatial area to which the forecast applies. Although the "Schaake Shuffle" procedure, when applied to construct ensemble members from a time-series of single value forecasts, may preserve some of this scale dependency, it may not be sufficient without additional constraint. To account more fully for the time-dependent structure of forecast uncertainty, events for additional "aggregate" forecast periods are defined as accumulations of different "base" forecast periods.

    The generated ensemble members can be ingested by an Ensemble Streamflow Prediction system to produce ensemble forecasts of streamflow and other

  13. Seamless Hourly Rainfall Ensemble Forecasts for 0 - 10 days

    Science.gov (United States)

    Cooper, Shaun; Seed, Alan

    2014-05-01

    The Australian Bureau of Meteorology uses a number of Numerical Weather Prediction (NWP) models to generate deterministic rainfall forecasts over a range of lead-times, each with a different resolution in space and time and with different forecast domains. High resolution regional NWP models are used to generate forecasts for the first three days, and are typically more accurate than lower resolution Global NWP models that produce forecasts for longer lead times. Consequently, there is a requirement for a seamless forecast system that is able to blend the various NWP forecasts into a single forecast with a uniform resolution over the entire forecast period. NWP rainfall forecasts contain errors at scales that are significant for even large river basins, and ensemble hydrological prediction systems require ensembles of the order of 100 members, which is well beyond the size that can be generated by NWP ensemble systems. The idea, therefore, is to blend the NWP models in such a way that recognises the skill of the NWP at a particular scale and lead time and to use a stochastic model of forecast errors to perturb the blended deterministic forecast to generate a large ensemble. NWP uncertainties are scale and forecast lead time dependent, especially at long forecast lead times, and are characteristic to each model. By blending the models scale by scale it is possible to recognise the increased skill of the models at larger spatial scales and shorter lead times. The stochastic model is applied at each scale, adding increasingly more variability at smaller spatial scales, while preserving the space-time structure of rain. This process allows an ensemble to be generated by blending deterministic forecasts. Two NWP models from the Bureau, ACCESS-G (Global) (~40 km by ~40 km, 3 hourly out to 10 days) and ACCESS-R (Regional) (~12 km by ~12 km, 1 hourly out to 3 days), are downscaled and blended with the stochastic model to produce an ensemble of hourly forecasts out to 10

  14. Trends in the predictive performance of raw ensemble weather forecasts

    Science.gov (United States)

    Hemri, Stephan; Scheuerer, Michael; Pappenberger, Florian; Bogner, Konrad; Haiden, Thomas

    2015-04-01

    Over the last two decades the paradigm in weather forecasting has shifted from being deterministic to probabilistic. Accordingly, numerical weather prediction (NWP) models have been run increasingly as ensemble forecasting systems. The goal of such ensemble forecasts is to approximate the forecast probability distribution by a finite sample of scenarios. Global ensemble forecast systems, like the European Centre for Medium-Range Weather Forecasts (ECMWF) ensemble, are prone to probabilistic biases, and are therefore not reliable. They particularly tend to be underdispersive for surface weather parameters. Hence, statistical post-processing is required in order to obtain reliable and sharp forecasts. In this study we apply statistical post-processing to ensemble forecasts of near-surface temperature, 24-hour precipitation totals, and near-surface wind speed from the global ECMWF model. Our main objective is to evaluate the evolution of the difference in skill between the raw ensemble and the post-processed forecasts. The ECMWF ensemble is under continuous development, and hence its forecast skill improves over time. Parts of these improvements may be due to a reduction of probabilistic bias. Thus, we first hypothesize that the gain by post-processing decreases over time. Based on ECMWF forecasts from January 2002 to March 2014 and corresponding observations from globally distributed stations we generate post-processed forecasts by ensemble model output statistics (EMOS) for each station and variable. Parameter estimates are obtained by minimizing the Continuous Ranked Probability Score (CRPS) over rolling training periods that consist of the n days preceding the initialization dates. Given the higher average skill in terms of CRPS of the post-processed forecasts for all three variables, we analyze the evolution of the difference in skill between raw ensemble and EMOS forecasts. The fact that the gap in skill remains almost constant over time, especially for near

  15. Extended associative memories

    OpenAIRE

    J.H Sossa Azuela; R. Barrón Fernández

    2007-01-01

    The #945; #946; associative memories recently developed in Ref 10 have proven to be powerful tools for memorizing and recalling patterns when they appear distorted by noise. However they are only useful in the binary case. In this paper we show that it is possible to extend these memories now to the gray-level case. To get the desired extension, we take the original operators #945; and #946;, foundation of the #945; #946; memories, and propose a more general family of operators. We find t...

  16. Branes in extended spacetime

    CERN Document Server

    Sakatani, Yuho

    2016-01-01

    We propose a novel approach to the brane worldvolume theory based on the geometry of extended field theories; double field theory and exceptional field theory. We demonstrate the effectiveness of this approach by showing that one can reproduce the conventional bosonic string/membrane actions, and the M5-brane action in the weak field approximation. At a glance, the proposed 5-brane action without approximation looks different from the known M5-brane actions, but it is consistent with the known non-linear self-duality relation, and it may provide a new formulation of a single M5-brane action. Actions for exotic branes are also discussed.

  17. Ensemble size impact on the decadal predictive skill assessment

    Directory of Open Access Journals (Sweden)

    Frank Sienz

    2016-12-01

    Full Text Available Retrospective prediction experiments have to be performed to estimate the skill of decadal prediction systems. These are necessarily restricted in the number due to the computational constraints. From weather and seasonal prediction it is known that the ensemble size is crucial to yield reliable predictions. Differences are expected for decadal predictions due to the differing time-scales of the involved processes and the longer prediction horizon. A conceptual model is applied that enables the systematic analysis of ensemble size dependencies in a framework close to that of decadal predictions. Differences are quantified in terms of the confidence intervals coverage and the power of statistical tests for prediction scores. In addition, the concepts are applied to decadal predicitions of the MiKlip Baseline1 system. It is shown that small ensemble, as well as hindcast sample sizes lead to biased test performances in a way that the detection of a present prediction skill is hampered. Experiments with ensemble sizes smaller than 10 are not recommended to evaluate decadal prediction skill or as basis for the prediction system developement. For regions with low signal-to-noise ratios much larger ensembles are required and it is shown that in this case successful decadal predictions are possible for the Central European summer temperatures.

  18. Application of evolutionary computation on ensemble forecast of quantitative precipitation

    Science.gov (United States)

    Dufek, Amanda S.; Augusto, Douglas A.; Dias, Pedro L. S.; Barbosa, Helio J. C.

    2017-09-01

    An evolutionary computation algorithm known as genetic programming (GP) has been explored as an alternative tool for improving the ensemble forecast of 24-h accumulated precipitation. Three GP versions and six ensembles' languages were applied to several real-world datasets over southern, southeastern and central Brazil during the rainy period from October to February of 2008-2013. According to the results, the GP algorithms performed better than two traditional statistical techniques, with errors 27-57% lower than simple ensemble mean and the MASTER super model ensemble system. In addition, the results revealed that GP algorithms outperformed the best individual forecasts, reaching an improvement of 34-42%. On the other hand, the GP algorithms had a similar performance with respect to each other and to the Bayesian model averaging, but the former are far more versatile techniques. Although the results for the six ensembles' languages are almost indistinguishable, our most complex linear language turned out to be the best overall proposal. Moreover, some meteorological attributes, including the weather patterns over Brazil, seem to play an important role in the prediction of daily rainfall amount.

  19. Ensemble of Thermostatically Controlled Loads: Statistical Physics Approach

    Energy Technology Data Exchange (ETDEWEB)

    Chertkov, Michael [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Skolkovo Inst. of Science and Technology, Moscow (Russia); Chernyak, Vladimir [Wayne State Univ., Detroit, MI (United States). Dept. of Chemistry

    2017-01-17

    Thermostatically Controlled Loads (TCL), e.g. air-conditioners and heaters, are by far the most wide-spread consumers of electricity. Normally the devices are calibrated to provide the so-called bang-bang control of temperature - changing from on to off , and vice versa, depending on temperature. Aggregation of a large group of similar devices into a statistical ensemble is considered, where the devices operate following the same dynamics subject to stochastic perturbations and randomized, Poisson on/off switching policy. We analyze, using theoretical and computational tools of statistical physics, how the ensemble relaxes to a stationary distribution and establish relation between the re- laxation and statistics of the probability flux, associated with devices' cycling in the mixed (discrete, switch on/off , and continuous, temperature) phase space. This allowed us to derive and analyze spec- trum of the non-equilibrium (detailed balance broken) statistical system. and uncover how switching policy affects oscillatory trend and speed of the relaxation. Relaxation of the ensemble is of a practical interest because it describes how the ensemble recovers from significant perturbations, e.g. forceful temporary switching o aimed at utilizing flexibility of the ensemble in providing "demand response" services relieving consumption temporarily to balance larger power grid. We discuss how the statistical analysis can guide further development of the emerging demand response technology.

  20. Are paleoclimate model ensembles consistent with the MARGO data synthesis?

    Directory of Open Access Journals (Sweden)

    J. C. Hargreaves

    2011-03-01

    Full Text Available We investigate the consistency of various ensembles of model simulations with the Multiproxy Approach for the Reconstruction of the Glacial Ocean Surface (MARGO sea surface temperature data synthesis. We discover that while two multi-model ensembles, created through the Paleoclimate Model Intercomparison Projects (PMIP and PMIP2, pass our simple tests of reliability, an ensemble based on parameter variation in a single model does not perform so well. We show that accounting for observational uncertainty in the MARGO database is of prime importance for correctly evaluating the ensembles. Perhaps surprisingly, the inclusion of a coupled dynamical ocean (compared to the use of a slab ocean does not appear to cause a wider spread in the sea surface temperature anomalies, but rather causes systematic changes with more heat transported north in the Atlantic. There is weak evidence that the sea surface temperature data may be more consistent with meridional overturning in the North Atlantic being similar for the LGM and the present day, however, the small size of the PMIP2 ensemble prevents any statistically significant results from being obtained.