WorldWideScience

Sample records for hierarchical clustering revealed

  1. Global Considerations in Hierarchical Clustering Reveal Meaningful Patterns in Data

    Science.gov (United States)

    Varshavsky, Roy; Horn, David; Linial, Michal

    2008-01-01

    Background A hierarchy, characterized by tree-like relationships, is a natural method of organizing data in various domains. When considering an unsupervised machine learning routine, such as clustering, a bottom-up hierarchical (BU, agglomerative) algorithm is used as a default and is often the only method applied. Methodology/Principal Findings We show that hierarchical clustering that involve global considerations, such as top-down (TD, divisive), or glocal (global-local) algorithms are better suited to reveal meaningful patterns in the data. This is demonstrated, by testing the correspondence between the results of several algorithms (TD, glocal and BU) and the correct annotations provided by experts. The correspondence was tested in multiple domains including gene expression experiments, stock trade records and functional protein families. The performance of each of the algorithms is evaluated by statistical criteria that are assigned to clusters (nodes of the hierarchy tree) based on expert-labeled data. Whereas TD algorithms perform better on global patterns, BU algorithms perform well and are advantageous when finer granularity of the data is sought. In addition, a novel TD algorithm that is based on genuine density of the data points is presented and is shown to outperform other divisive and agglomerative methods. Application of the algorithm to more than 500 protein sequences belonging to ion-channels illustrates the potential of the method for inferring overlooked functional annotations. ClustTree, a graphical Matlab toolbox for applying various hierarchical clustering algorithms and testing their quality is made available. Conclusions Although currently rarely used, global approaches, in particular, TD or glocal algorithms, should be considered in the exploratory process of clustering. In general, applying unsupervised clustering methods can leverage the quality of manually-created mapping of proteins families. As demonstrated, it can also provide

  2. Hierarchical clustering of breast cancer methylomes revealed differentially methylated and expressed breast cancer genes.

    Directory of Open Access Journals (Sweden)

    I-Hsuan Lin

    Full Text Available Oncogenic transformation of normal cells often involves epigenetic alterations, including histone modification and DNA methylation. We conducted whole-genome bisulfite sequencing to determine the DNA methylomes of normal breast, fibroadenoma, invasive ductal carcinomas and MCF7. The emergence, disappearance, expansion and contraction of kilobase-sized hypomethylated regions (HMRs and the hypomethylation of the megabase-sized partially methylated domains (PMDs are the major forms of methylation changes observed in breast tumor samples. Hierarchical clustering of HMR revealed tumor-specific hypermethylated clusters and differential methylated enhancers specific to normal or breast cancer cell lines. Joint analysis of gene expression and DNA methylation data of normal breast and breast cancer cells identified differentially methylated and expressed genes associated with breast and/or ovarian cancers in cancer-specific HMR clusters. Furthermore, aberrant patterns of X-chromosome inactivation (XCI was found in breast cancer cell lines as well as breast tumor samples in the TCGA BRCA (breast invasive carcinoma dataset. They were characterized with differentially hypermethylated XIST promoter, reduced expression of XIST, and over-expression of hypomethylated X-linked genes. High expressions of these genes were significantly associated with lower survival rates in breast cancer patients. Comprehensive analysis of the normal and breast tumor methylomes suggests selective targeting of DNA methylation changes during breast cancer progression. The weak causal relationship between DNA methylation and gene expression observed in this study is evident of more complex role of DNA methylation in the regulation of gene expression in human epigenetics that deserves further investigation.

  3. Neutrosophic Hierarchical Clustering Algoritms

    Directory of Open Access Journals (Sweden)

    Rıdvan Şahin

    2014-03-01

    Full Text Available Interval neutrosophic set (INS is a generalization of interval valued intuitionistic fuzzy set (IVIFS, whose the membership and non-membership values of elements consist of fuzzy range, while single valued neutrosophic set (SVNS is regarded as extension of intuitionistic fuzzy set (IFS. In this paper, we extend the hierarchical clustering techniques proposed for IFSs and IVIFSs to SVNSs and INSs respectively. Based on the traditional hierarchical clustering procedure, the single valued neutrosophic aggregation operator, and the basic distance measures between SVNSs, we define a single valued neutrosophic hierarchical clustering algorithm for clustering SVNSs. Then we extend the algorithm to classify an interval neutrosophic data. Finally, we present some numerical examples in order to show the effectiveness and availability of the developed clustering algorithms.

  4. Hierarchical clustering for graph visualization

    CERN Document Server

    Clémençon, Stéphan; Rossi, Fabrice; Tran, Viet Chi

    2012-01-01

    This paper describes a graph visualization methodology based on hierarchical maximal modularity clustering, with interactive and significant coarsening and refining possibilities. An application of this method to HIV epidemic analysis in Cuba is outlined.

  5. Hierarchical Formation of Galactic Clusters

    CERN Document Server

    Elmegreen, B G

    2006-01-01

    Young stellar groupings and clusters have hierarchical patterns ranging from flocculent spiral arms and star complexes on the largest scale to OB associations, OB subgroups, small loose groups, clusters and cluster subclumps on the smallest scales. There is no obvious transition in morphology at the cluster boundary, suggesting that clusters are only the inner parts of the hierarchy where stars have had enough time to mix. The power-law cluster mass function follows from this hierarchical structure: n(M_cl) M_cl^-b for b~2. This value of b is independently required by the observation that the summed IMFs from many clusters in a galaxy equals approximately the IMF of each cluster.

  6. Intuitionistic fuzzy hierarchical clustering algorithms

    Institute of Scientific and Technical Information of China (English)

    Xu Zeshui

    2009-01-01

    Intuitionistic fuzzy set (IFS) is a set of 2-tuple arguments, each of which is characterized by a mem-bership degree and a nonmembership degree. The generalized form of IFS is interval-valued intuitionistic fuzzy set (IVIFS), whose components are intervals rather than exact numbers. IFSs and IVIFSs have been found to be very useful to describe vagueness and uncertainty. However, it seems that little attention has been focused on the clus-tering analysis of IFSs and IVIFSs. An intuitionistic fuzzy hierarchical algorithm is introduced for clustering IFSs, which is based on the traditional hierarchical clustering procedure, the intuitionistic fuzzy aggregation operator, and the basic distance measures between IFSs: the Hamming distance, normalized Hamming, weighted Hamming, the Euclidean distance, the normalized Euclidean distance, and the weighted Euclidean distance. Subsequently, the algorithm is extended for clustering IVIFSs. Finally the algorithm and its extended form are applied to the classifications of building materials and enterprises respectively.

  7. Convex Clustering: An Attractive Alternative to Hierarchical Clustering

    Science.gov (United States)

    Chen, Gary K.; Chi, Eric C.; Ranola, John Michael O.; Lange, Kenneth

    2015-01-01

    The primary goal in cluster analysis is to discover natural groupings of objects. The field of cluster analysis is crowded with diverse methods that make special assumptions about data and address different scientific aims. Despite its shortcomings in accuracy, hierarchical clustering is the dominant clustering method in bioinformatics. Biologists find the trees constructed by hierarchical clustering visually appealing and in tune with their evolutionary perspective. Hierarchical clustering operates on multiple scales simultaneously. This is essential, for instance, in transcriptome data, where one may be interested in making qualitative inferences about how lower-order relationships like gene modules lead to higher-order relationships like pathways or biological processes. The recently developed method of convex clustering preserves the visual appeal of hierarchical clustering while ameliorating its propensity to make false inferences in the presence of outliers and noise. The solution paths generated by convex clustering reveal relationships between clusters that are hidden by static methods such as k-means clustering. The current paper derives and tests a novel proximal distance algorithm for minimizing the objective function of convex clustering. The algorithm separates parameters, accommodates missing data, and supports prior information on relationships. Our program CONVEXCLUSTER incorporating the algorithm is implemented on ATI and nVidia graphics processing units (GPUs) for maximal speed. Several biological examples illustrate the strengths of convex clustering and the ability of the proximal distance algorithm to handle high-dimensional problems. CONVEXCLUSTER can be freely downloaded from the UCLA Human Genetics web site at http://www.genetics.ucla.edu/software/ PMID:25965340

  8. Hierarchical Clustering and Active Galaxies

    CERN Document Server

    Hatziminaoglou, E; Manrique, A

    2000-01-01

    The growth of Super Massive Black Holes and the parallel development of activity in galactic nuclei are implemented in an analytic code of hierarchical clustering. The evolution of the luminosity function of quasars and AGN will be computed with special attention paid to the connection between quasars and Seyfert galaxies. One of the major interests of the model is the parallel study of quasar formation and evolution and the History of Star Formation.

  9. Galaxy formation through hierarchical clustering

    Science.gov (United States)

    White, Simon D. M.; Frenk, Carlos S.

    1991-01-01

    Analytic methods for studying the formation of galaxies by gas condensation within massive dark halos are presented. The present scheme applies to cosmogonies where structure grows through hierarchical clustering of a mixture of gas and dissipationless dark matter. The simplest models consistent with the current understanding of N-body work on dissipationless clustering, and that of numerical and analytic work on gas evolution and cooling are adopted. Standard models for the evolution of the stellar population are also employed, and new models for the way star formation heats and enriches the surrounding gas are constructed. Detailed results are presented for a cold dark matter universe with Omega = 1 and H(0) = 50 km/s/Mpc, but the present methods are applicable to other models. The present luminosity functions contain significantly more faint galaxies than are observed.

  10. PERFORMANCE OF SELECTED AGGLOMERATIVE HIERARCHICAL CLUSTERING METHODS

    Directory of Open Access Journals (Sweden)

    Nusa Erman

    2015-01-01

    Full Text Available A broad variety of different methods of agglomerative hierarchical clustering brings along problems how to choose the most appropriate method for the given data. It is well known that some methods outperform others if the analysed data have a specific structure. In the presented study we have observed the behaviour of the centroid, the median (Gower median method, and the average method (unweighted pair-group method with arithmetic mean – UPGMA; average linkage between groups. We have compared them with mostly used methods of hierarchical clustering: the minimum (single linkage clustering, the maximum (complete linkage clustering, the Ward, and the McQuitty (groups method average, weighted pair-group method using arithmetic averages - WPGMA methods. We have applied the comparison of these methods on spherical, ellipsoid, umbrella-like, “core-and-sphere”, ring-like and intertwined three-dimensional data structures. To generate the data and execute the analysis, we have used R statistical software. Results show that all seven methods are successful in finding compact, ball-shaped or ellipsoid structures when they are enough separated. Conversely, all methods except the minimum perform poor on non-homogenous, irregular and elongated ones. Especially challenging is a circular double helix structure; it is being correctly revealed only by the minimum method. We can also confirm formerly published results of other simulation studies, which usually favour average method (besides Ward method in cases when data is assumed to be fairly compact and well separated.

  11. Assembling hierarchical cluster solids with atomic precision.

    Science.gov (United States)

    Turkiewicz, Ari; Paley, Daniel W; Besara, Tiglet; Elbaz, Giselle; Pinkard, Andrew; Siegrist, Theo; Roy, Xavier

    2014-11-12

    Hierarchical solids created from the binary assembly of cobalt chalcogenide and iron oxide molecular clusters are reported. Six different molecular clusters based on the octahedral Co6E8 (E = Se or Te) and the expanded cubane Fe8O4 units are used as superatomic building blocks to construct these crystals. The formation of the solid is driven by the transfer of charge between complementary electron-donating and electron-accepting clusters in solution that crystallize as binary ionic compounds. The hierarchical structures are investigated by single-crystal X-ray diffraction, providing atomic and superatomic resolution. We report two different superstructures: a superatomic relative of the CsCl lattice type and an unusual packing arrangement based on the double-hexagonal close-packed lattice. Within these superstructures, we demonstrate various compositions and orientations of the clusters.

  12. Hesitant fuzzy agglomerative hierarchical clustering algorithms

    Science.gov (United States)

    Zhang, Xiaolu; Xu, Zeshui

    2015-02-01

    Recently, hesitant fuzzy sets (HFSs) have been studied by many researchers as a powerful tool to describe and deal with uncertain data, but relatively, very few studies focus on the clustering analysis of HFSs. In this paper, we propose a novel hesitant fuzzy agglomerative hierarchical clustering algorithm for HFSs. The algorithm considers each of the given HFSs as a unique cluster in the first stage, and then compares each pair of the HFSs by utilising the weighted Hamming distance or the weighted Euclidean distance. The two clusters with smaller distance are jointed. The procedure is then repeated time and again until the desirable number of clusters is achieved. Moreover, we extend the algorithm to cluster the interval-valued hesitant fuzzy sets, and finally illustrate the effectiveness of our clustering algorithms by experimental results.

  13. A New Metrics for Hierarchical Clustering

    Institute of Scientific and Technical Information of China (English)

    YANGGuangwen; SHIShuming; WANGDingxing

    2003-01-01

    Hierarchical clustering is a popular method of performing unsupervised learning. Some metric must be used to determine the similarity between pairs of clusters in hierarchical clustering. Traditional similarity metrics either can deal with simple shapes (i.e. spherical shapes) only or are very sensitive to outliers (the chaining effect). The main contribution of this paper is to propose some potential-based similarity metrics (APES and AMAPES) between clusters in hierarchical clustering, inspired by the concepts of the electric potential and the gravitational potential in electromagnetics and astronomy. The main features of these metrics are: the first, they have strong antijamming capability; the second, they are capable of finding clusters of different shapes such as spherical, spiral, chain, circle, sigmoid, U shape or other complex irregular shapes; the third, existing algorithms and research fruits for classical metrics can be adopted to deal with these new potential-based metrics with no or little modification. Experiments showed that the new metrics are more superior to traditional ones. Different potential functions are compared, and the sensitivity to parameters is also analyzed in this paper.

  14. Managing Clustered Data Using Hierarchical Linear Modeling

    Science.gov (United States)

    Warne, Russell T.; Li, Yan; McKyer, E. Lisako J.; Condie, Rachel; Diep, Cassandra S.; Murano, Peter S.

    2012-01-01

    Researchers in nutrition research often use cluster or multistage sampling to gather participants for their studies. These sampling methods often produce violations of the assumption of data independence that most traditional statistics share. Hierarchical linear modeling is a statistical method that can overcome violations of the independence…

  15. Managing Clustered Data Using Hierarchical Linear Modeling

    Science.gov (United States)

    Warne, Russell T.; Li, Yan; McKyer, E. Lisako J.; Condie, Rachel; Diep, Cassandra S.; Murano, Peter S.

    2012-01-01

    Researchers in nutrition research often use cluster or multistage sampling to gather participants for their studies. These sampling methods often produce violations of the assumption of data independence that most traditional statistics share. Hierarchical linear modeling is a statistical method that can overcome violations of the independence…

  16. Robust Pseudo-Hierarchical Support Vector Clustering

    DEFF Research Database (Denmark)

    Hansen, Michael Sass; Sjöstrand, Karl; Olafsdóttir, Hildur

    2007-01-01

    Support vector clustering (SVC) has proven an efficient algorithm for clustering of noisy and high-dimensional data sets, with applications within many fields of research. An inherent problem, however, has been setting the parameters of the SVC algorithm. Using the recent emergence of a method...... for calculating the entire regularization path of the support vector domain description, we propose a fast method for robust pseudo-hierarchical support vector clustering (HSVC). The method is demonstrated to work well on generated data, as well as for detecting ischemic segments from multidimensional myocardial...

  17. Hierarchical Control for Multiple DC Microgrids Clusters

    DEFF Research Database (Denmark)

    Shafiee, Qobad; Dragicevic, Tomislav; Vasquez, Juan Carlos;

    2014-01-01

    This paper presents a distributed hierarchical control framework to ensure reliable operation of dc Microgrid (MG) clusters. In this hierarchy, primary control is used to regulate the common bus voltage inside each MG locally. An adaptive droop method is proposed for this level which determines....... Another distributed policy is employed then to regulate the power flow among the MGs according to their local SOCs. The proposed distributed controllers on each MG communicate with only the neighbor MGs through a communication infrastructure. Finally, the small signal model is expanded for dc MG clusters...

  18. Constructing storyboards based on hierarchical clustering analysis

    Science.gov (United States)

    Hasebe, Satoshi; Sami, Mustafa M.; Muramatsu, Shogo; Kikuchi, Hisakazu

    2005-07-01

    There are growing needs for quick preview of video contents for the purpose of improving accessibility of video archives as well as reducing network traffics. In this paper, a storyboard that contains a user-specified number of keyframes is produced from a given video sequence. It is based on hierarchical cluster analysis of feature vectors that are derived from wavelet coefficients of video frames. Consistent use of extracted feature vectors is the key to avoid a repetition of computationally-intensive parsing of the same video sequence. Experimental results suggest that a significant reduction in computational time is gained by this strategy.

  19. Technique for fast and efficient hierarchical clustering

    Science.gov (United States)

    Stork, Christopher

    2013-10-08

    A fast and efficient technique for hierarchical clustering of samples in a dataset includes compressing the dataset to reduce a number of variables within each of the samples of the dataset. A nearest neighbor matrix is generated to identify nearest neighbor pairs between the samples based on differences between the variables of the samples. The samples are arranged into a hierarchy that groups the samples based on the nearest neighbor matrix. The hierarchy is rendered to a display to graphically illustrate similarities or differences between the samples.

  20. Magnetic susceptibilities of cluster-hierarchical models

    Science.gov (United States)

    McKay, Susan R.; Berker, A. Nihat

    1984-02-01

    The exact magnetic susceptibilities of hierarchical models are calculated near and away from criticality, in both the ordered and disordered phases. The mechanism and phenomenology are discussed for models with susceptibilities that are physically sensible, e.g., nondivergent away from criticality. Such models are found based upon the Niemeijer-van Leeuwen cluster renormalization. A recursion-matrix method is presented for the renormalization-group evaluation of response functions. Diagonalization of this matrix at fixed points provides simple criteria for well-behaved densities and response functions.

  1. Hierarchical Aligned Cluster Analysis for Temporal Clustering of Human Motion.

    Science.gov (United States)

    Zhou, Feng; De la Torre, Fernando; Hodgins, Jessica K

    2013-03-01

    Temporal segmentation of human motion into plausible motion primitives is central to understanding and building computational models of human motion. Several issues contribute to the challenge of discovering motion primitives: the exponential nature of all possible movement combinations, the variability in the temporal scale of human actions, and the complexity of representing articulated motion. We pose the problem of learning motion primitives as one of temporal clustering, and derive an unsupervised hierarchical bottom-up framework called hierarchical aligned cluster analysis (HACA). HACA finds a partition of a given multidimensional time series into m disjoint segments such that each segment belongs to one of k clusters. HACA combines kernel k-means with the generalized dynamic time alignment kernel to cluster time series data. Moreover, it provides a natural framework to find a low-dimensional embedding for time series. HACA is efficiently optimized with a coordinate descent strategy and dynamic programming. Experimental results on motion capture and video data demonstrate the effectiveness of HACA for segmenting complex motions and as a visualization tool. We also compare the performance of HACA to state-of-the-art algorithms for temporal clustering on data of a honey bee dance. The HACA code is available online.

  2. A Hierarchical Clustering Methodology for the Estimation of Toxicity

    Science.gov (United States)

    A Quantitative Structure Activity Relationship (QSAR) methodology based on hierarchical clustering was developed to predict toxicological endpoints. This methodology utilizes Ward's method to divide a training set into a series of structurally similar clusters. The structural sim...

  3. Hierarchical Cluster Assembly in Globally Collapsing Clouds

    CERN Document Server

    Vazquez-Semadeni, Enrique; Colin, Pedro

    2016-01-01

    We discuss the mechanism of cluster formation in a numerical simulation of a molecular cloud (MC) undergoing global hierarchical collapse (GHC). The global nature of the collapse implies that the SFR increases over time. The hierarchical nature of the collapse consists of small-scale collapses within larger-scale ones. The large-scale collapses culminate a few Myr later than the small-scale ones and consist of filamentary flows that accrete onto massive central clumps. The small-scale collapses form clumps that are embedded in the filaments and falling onto the large-scale collapse centers. The stars formed in the early, small-scale collapses share the infall motion of their parent clumps. Thus, the filaments feed both gaseous and stellar material to the massive central clump. This leads to the presence of a few older stars in a region where new protostars are forming, and also to a self-similar structure, in which each unit is composed of smaller-scale sub-units that approach each other and may merge. Becaus...

  4. Hierarchical clustering using correlation metric and spatial continuity constraint

    Science.gov (United States)

    Stork, Christopher L.; Brewer, Luke N.

    2012-10-02

    Large data sets are analyzed by hierarchical clustering using correlation as a similarity measure. This provides results that are superior to those obtained using a Euclidean distance similarity measure. A spatial continuity constraint may be applied in hierarchical clustering analysis of images.

  5. Fast, Linear Time Hierarchical Clustering using the Baire Metric

    CERN Document Server

    Contreras, Pedro

    2011-01-01

    The Baire metric induces an ultrametric on a dataset and is of linear computational complexity, contrasted with the standard quadratic time agglomerative hierarchical clustering algorithm. In this work we evaluate empirically this new approach to hierarchical clustering. We compare hierarchical clustering based on the Baire metric with (i) agglomerative hierarchical clustering, in terms of algorithm properties; (ii) generalized ultrametrics, in terms of definition; and (iii) fast clustering through k-means partititioning, in terms of quality of results. For the latter, we carry out an in depth astronomical study. We apply the Baire distance to spectrometric and photometric redshifts from the Sloan Digital Sky Survey using, in this work, about half a million astronomical objects. We want to know how well the (more costly to determine) spectrometric redshifts can predict the (more easily obtained) photometric redshifts, i.e. we seek to regress the spectrometric on the photometric redshifts, and we use clusterwi...

  6. Hierarchical Approach in Clustering to Euclidean Traveling Salesman Problem

    Science.gov (United States)

    Fajar, Abdulah; Herman, Nanna Suryana; Abu, Nur Azman; Shahib, Sahrin

    There has been growing interest in studying combinatorial optimization problems by clustering strategy, with a special emphasis on the traveling salesman problem (TSP). TSP naturally arises as a sub problem in much transportation, manufacturing and logistics application, this problem has caught much attention of mathematicians and computer scientists. A clustering approach will decompose TSP into sub graph and form cluster, so it may reduce problem size into smaller problem. Impact of hierarchical approach will be investigated to produce a better clustering strategy that fit into Euclidean TSP. Clustering strategy to Euclidean TSP consist of two main step, there are; clustering and tour construction. The significant of this research is clustering approach solution result has error less than 10% compare to best known solution (TSPLIB) and there is improvement to a hierarchical clustering algorithm in order to fit in such Euclidean TSP solution method.

  7. The Hierarchical Clustering of Tax Burden in the EU27

    Directory of Open Access Journals (Sweden)

    Simkova Nikola

    2015-09-01

    Full Text Available The issue of taxation has become more important due to a significant share of the government revenue. There are several ways of expressing the tax burden of countries. This paper describes the traditional approach as a share of tax revenue to GDP which is applied to the total taxation and the capital taxation as a part of tax systems affecting investment decisions. The implicit tax rate on capital created by Eurostat also offers a possible explanation of the tax burden on capital, so its components are analysed in detail. This study uses one of the econometric methods called the hierarchical clustering. The data on which the clustering is based comprises countries in the EU27 for the period of 1995 – 2012. The aim of this paper is to reveal clusters of countries in the EU27 with similar tax burden or tax changes. The findings suggest that mainly newly acceding countries (2004 and 2007 are in a group of countries with a low tax burden which tried to encourage investors by favourable tax rates. On the other hand, there are mostly countries from the original EU15. Some clusters may be explained by similar historical development, geographic and demographic characteristics.

  8. Hierarchical Clustering and the Concept of Space Distortion.

    Science.gov (United States)

    Hubert, Lawrence; Schultz, James

    An empirical assesssment of the space distortion properties of two prototypic hierarchical clustering procedures is given in terms of an occupancy model developed from combinatorics. Using one simple example, the single-link and complete-link clustering strategies now in common use in the behavioral sciences are empirically shown to be space…

  9. The Hierarchical Distribution of Young Stellar Clusters in Nearby Galaxies

    Science.gov (United States)

    Grasha, Kathryn; Calzetti, Daniela

    2017-01-01

    We investigate the spatial distributions of young stellar clusters in six nearby galaxies to trace the large scale hierarchical star-forming structures. The six galaxies are drawn from the Legacy ExtraGalactic UV Survey (LEGUS). We quantify the strength of the clustering among stellar clusters as a function of spatial scale and age to establish the survival timescale of the substructures. We separate the clusters into different classes, compact (bound) clusters and associations (unbound), and compare the clustering among them. We find that younger star clusters are more strongly clustered over small spatial scales and that the clustering disappears rapidly for ages as young as a few tens of Myr, consistent with clusters slowly losing the fractal dimension inherited at birth from their natal molecular clouds.

  10. Hierarchical Clustering Given Confidence Intervals of Metric Distances

    CERN Document Server

    Huang, Weiyu

    2016-01-01

    This paper considers metric spaces where distances between a pair of nodes are represented by distance intervals. The goal is to study methods for the determination of hierarchical clusters, i.e., a family of nested partitions indexed by a resolution parameter, induced from the given distance intervals of the metric spaces. Our construction of hierarchical clustering methods is based on defining admissible methods to be those methods that abide to the axioms of value - nodes in a metric space with two nodes are clustered together at the convex combination of the distance bounds between them - and transformation - when both distance bounds are reduced, the output may become more clustered but not less. Two admissible methods are constructed and are shown to provide universal upper and lower bounds in the space of admissible methods. Practical implications are explored by clustering moving points via snapshots and by clustering networks representing brain structural connectivity using the lower and upper bounds...

  11. Hierarchical modeling of cluster size in wildlife surveys

    Science.gov (United States)

    Royle, J. Andrew

    2008-01-01

    Clusters or groups of individuals are the fundamental unit of observation in many wildlife sampling problems, including aerial surveys of waterfowl, marine mammals, and ungulates. Explicit accounting of cluster size in models for estimating abundance is necessary because detection of individuals within clusters is not independent and detectability of clusters is likely to increase with cluster size. This induces a cluster size bias in which the average cluster size in the sample is larger than in the population at large. Thus, failure to account for the relationship between delectability and cluster size will tend to yield a positive bias in estimates of abundance or density. I describe a hierarchical modeling framework for accounting for cluster-size bias in animal sampling. The hierarchical model consists of models for the observation process conditional on the cluster size distribution and the cluster size distribution conditional on the total number of clusters. Optionally, a spatial model can be specified that describes variation in the total number of clusters per sample unit. Parameter estimation, model selection, and criticism may be carried out using conventional likelihood-based methods. An extension of the model is described for the situation where measurable covariates at the level of the sample unit are available. Several candidate models within the proposed class are evaluated for aerial survey data on mallard ducks (Anas platyrhynchos).

  12. Update Legal Documents Using Hierarchical Ranking Models and Word Clustering

    OpenAIRE

    Pham, Minh Quang Nhat; Nguyen, Minh Le; Shimazu, Akira

    2010-01-01

    Our research addresses the task of updating legal documents when newinformation emerges. In this paper, we employ a hierarchical ranking model tothe task of updating legal documents. Word clustering features are incorporatedto the ranking models to exploit semantic relations between words. Experimentalresults on legal data built from the United States Code show that the hierarchicalranking model with word clustering outperforms baseline methods using VectorSpace Model, and word cluster-based ...

  13. Exploiting Homogeneity of Density in Incremental Hierarchical Clustering

    Directory of Open Access Journals (Sweden)

    Dwi H. Widiyantoro

    2006-11-01

    Full Text Available Hierarchical clustering is an important tool in many applications. As it involves a large data set that proliferates over time, reclustering the data set periodically is not an efficient process. Therefore, the ability to incorporate a new data set incrementally into an existing hierarchy becomes increasingly demanding. This article describes Homogen, a system that employs a new algorithm for generating a hierarchy of concepts and clusters incrementally from a stream of observations. The system aims to construct a hierarchy that satisfies the homogeneity and the monotonicity properties. Working in a bottom-up fashion, a new observation is placed in the hierarchy and a sequence of hierarchy restructuring processes is performed only in regions that have been affected by the presence of the new observation. Additionally, it combines multiple restructuring techniques that address different restructuring objectives to get a synergistic effect. The system has been tested on a variety of domains including structured and unstructured data sets. The experimental results reveal that the system is able to construct a concept hierarchy that is consistent regardless of the input data order and whose quality is comparable to the quality of those produced by non incremental clustering algorithms.

  14. MultiDendrograms: Variable-Group Agglomerative Hierarchical Clustering

    CERN Document Server

    Gomez, Sergio; Montiel, Justo; Torres, David

    2012-01-01

    MultiDendrograms is a Java-written application that computes agglomerative hierarchical clusterings of data. Starting from a distances (or weights) matrix, MultiDendrograms is able to calculate its dendrograms using the most common agglomerative hierarchical clustering methods. The application implements a variable-group algorithm that solves the non-uniqueness problem found in the standard pair-group algorithm. This problem arises when two or more minimum distances between different clusters are equal during the agglomerative process, because then different output clusterings are possible depending on the criterion used to break ties between distances. MultiDendrograms solves this problem implementing a variable-group algorithm that groups more than two clusters at the same time when ties occur.

  15. Hierarchical Overlapping Clustering of Network Data Using Cut Metrics

    CERN Document Server

    Gama, Fernando; Ribeiro, Alejandro

    2016-01-01

    A novel method to obtain hierarchical and overlapping clusters from network data -i.e., a set of nodes endowed with pairwise dissimilarities- is presented. The introduced method is hierarchical in the sense that it outputs a nested collection of groupings of the node set depending on the resolution or degree of similarity desired, and it is overlapping since it allows nodes to belong to more than one group. Our construction is rooted on the facts that a hierarchical (non-overlapping) clustering of a network can be equivalently represented by a finite ultrametric space and that a convex combination of ultrametrics results in a cut metric. By applying a hierarchical (non-overlapping) clustering method to multiple dithered versions of a given network and then convexly combining the resulting ultrametrics, we obtain a cut metric associated to the network of interest. We then show how to extract a hierarchical overlapping clustering structure from the aforementioned cut metric. Furthermore, the so-called overlappi...

  16. Performance Analysis of Hierarchical Clustering Algorithm

    Directory of Open Access Journals (Sweden)

    K.Ranjini

    2011-07-01

    Full Text Available Clustering is the classification of objects into different groups, or more precisely, the partitioning of a data set into subsets (clusters, so that the data in each subset (ideally share some common trait - often proximity according to some defined distance measure. Data clustering is a common technique for statistical data analysis, which is used in many fields, including machine learning, data mining, pattern recognition, image analysis and bioinformatics. This paper explains the implementation of agglomerative and divisive clustering algorithms applied on various types of data. The details of the victims of Tsunami in Thailand during the year 2004, was taken as the test data. Visual programming is used for implementation and running time of the algorithms using different linkages (agglomerative to different types of data are taken for analysis.

  17. Properties of hierarchically forming star clusters

    CERN Document Server

    Maschberger, Th; Bonnell, I A; Kroupa, P

    2010-01-01

    We undertake a systematic analysis of the early (< 0.5 Myr) evolution of clustering and the stellar initial mass function in turbulent fragmentation simulations. These large scale simulations for the first time offer the opportunity for a statistical analysis of IMF variations and correlations between stellar properties and cluster richness. The typical evolutionary scenario involves star formation in small-n clusters which then progressively merge; the first stars to form are seeds of massive stars and achieve a headstart in mass acquisition. These massive seeds end up in the cores of clusters and a large fraction of new stars of lower mass is formed in the outer parts of the clusters. The resulting clusters are therefore mass segregated at an age of 0.5 Myr, although the signature of mass segregation is weakened during mergers. We find that the resulting IMF has a smaller exponent (alpha=1.8-2.2) than the Salpeter value (alpha=2.35). The IMFs in subclusters are truncated at masses only somewhat larger th...

  18. Evaluation by hierarchical clustering of multiple cytokine expression after phytohemagglutinin stimulation

    Directory of Open Access Journals (Sweden)

    Yang Chunhe

    2016-01-01

    Full Text Available The hierarchical clustering method has been used for exploration of gene expression and proteomic profiles; however, little research into its application in the examination of expression of multiplecytokine/chemokine responses to stimuli has been reported. Thus, little progress has been made on how phytohemagglutinin(PHA affects cytokine expression profiling on a large scale in the human hematological system. To investigate the characteristic expression pattern under PHA stimulation, Luminex, a multiplex bead-based suspension array, was performed. The data set collected from human peripheral blood mononuclear cells (PBMC was analyzed using the hierarchical clustering method. It was revealed that two specific chemokines (CCL3 andCCL4 underwent significantly greater quantitative changes during induction of expression than other tested cytokines/chemokines after PHA stimulation. This result indicates that hierarchical clustering is a useful tool for detecting fine patterns during exploration of biological data, and that it can play an important role in comparative studies.

  19. Hierarchical clusters of phytoplankton variables in dammed water bodies

    Science.gov (United States)

    Silva, Eliana Costa e.; Lopes, Isabel Cristina; Correia, Aldina; Gonçalves, A. Manuela

    2017-06-01

    In this paper a dataset containing biological variables of the water column of several Portuguese reservoirs is analyzed. Hierarchical cluster analysis is used to obtain clusters of phytoplankton variables of the phylum Cyanophyta, with the objective of validating the classification of Portuguese reservoirs previewly presented in [1] which were divided into three clusters: (1) Interior Tagus and Aguieira; (2) Douro; and (3) Other rivers. Now three new clusters of Cyanophyta variables were found. Kruskal-Wallis and Mann-Whitney tests are used to compare the now obtained Cyanophyta clusters and the previous Reservoirs clusters, in order to validate the classification of the water quality of reservoirs. The amount of Cyanophyta algae present in the reservoirs from the three clusters is significantly different, which validates the previous classification.

  20. A Novel Divisive Hierarchical Clustering Algorithm for Geospatial Analysis

    Directory of Open Access Journals (Sweden)

    Shaoning Li

    2017-01-01

    Full Text Available In the fields of geographic information systems (GIS and remote sensing (RS, the clustering algorithm has been widely used for image segmentation, pattern recognition, and cartographic generalization. Although clustering analysis plays a key role in geospatial modelling, traditional clustering methods are limited due to computational complexity, noise resistant ability and robustness. Furthermore, traditional methods are more focused on the adjacent spatial context, which makes it hard for the clustering methods to be applied to multi-density discrete objects. In this paper, a new method, cell-dividing hierarchical clustering (CDHC, is proposed based on convex hull retraction. The main steps are as follows. First, a convex hull structure is constructed to describe the global spatial context of geospatial objects. Then, the retracting structure of each borderline is established in sequence by setting the initial parameter. The objects are split into two clusters (i.e., “sub-clusters” if the retracting structure intersects with the borderlines. Finally, clusters are repeatedly split and the initial parameter is updated until the terminate condition is satisfied. The experimental results show that CDHC separates the multi-density objects from noise sufficiently and also reduces complexity compared to the traditional agglomerative hierarchical clustering algorithm.

  1. Hierarchical clustering techniques for image database organization and summarization

    Science.gov (United States)

    Vellaikal, Asha; Kuo, C.-C. Jay

    1998-10-01

    This paper investigates clustering techniques as a method of organizing image databases to support popular visual management functions such as searching, browsing and navigation. Different types of hierarchical agglomerative clustering techniques are studied as a method of organizing features space as well as summarizing image groups by the selection of a few appropriate representatives. Retrieval performance using both single and multiple level hierarchies are experimented with and the algorithms show an interesting relationship between the top k correct retrievals and the number of comparisons required. Some arguments are given to support the use of such cluster-based techniques for managing distributed image databases.

  2. Breaking the hierarchy - a new cluster selection mechanism for hierarchical clustering methods

    Directory of Open Access Journals (Sweden)

    Zweig Katharina A

    2009-10-01

    Full Text Available Abstract Background Hierarchical clustering methods like Ward's method have been used since decades to understand biological and chemical data sets. In order to get a partition of the data set, it is necessary to choose an optimal level of the hierarchy by a so-called level selection algorithm. In 2005, a new kind of hierarchical clustering method was introduced by Palla et al. that differs in two ways from Ward's method: it can be used on data on which no full similarity matrix is defined and it can produce overlapping clusters, i.e., allow for multiple membership of items in clusters. These features are optimal for biological and chemical data sets but until now no level selection algorithm has been published for this method. Results In this article we provide a general selection scheme, the level independent clustering selection method, called LInCS. With it, clusters can be selected from any level in quadratic time with respect to the number of clusters. Since hierarchically clustered data is not necessarily associated with a similarity measure, the selection is based on a graph theoretic notion of cohesive clusters. We present results of our method on two data sets, a set of drug like molecules and set of protein-protein interaction (PPI data. In both cases the method provides a clustering with very good sensitivity and specificity values according to a given reference clustering. Moreover, we can show for the PPI data set that our graph theoretic cohesiveness measure indeed chooses biologically homogeneous clusters and disregards inhomogeneous ones in most cases. We finally discuss how the method can be generalized to other hierarchical clustering methods to allow for a level independent cluster selection. Conclusion Using our new cluster selection method together with the method by Palla et al. provides a new interesting clustering mechanism that allows to compute overlapping clusters, which is especially valuable for biological and

  3. A Framework for Hierarchical Clustering Based Indexing in Search Engines

    Directory of Open Access Journals (Sweden)

    Parul Gupta

    2011-01-01

    Full Text Available Granting efficient and fast accesses to the index is a key issuefor performances of Web Search Engines. In order to enhancememory utilization and favor fast query resolution, WSEs useInverted File (IF indexes that consist of an array of theposting lists where each posting list is associated with a termand contains the term as well as the identifiers of the documentscontaining the term. Since the document identifiers are stored insorted order, they can be stored as the difference between thesuccessive documents so as to reduce the size of the index. Thispaper describes a clustering algorithm that aims atpartitioning the set of documents into ordered clusters so thatthe documents within the same cluster are similar and are beingassigned the closer document identifiers. Thus the averagevalue of the differences between the successive documents willbe minimized and hence storage space would be saved. Thepaper further presents the extension of this clustering algorithmto be applied for the hierarchical clustering in which similarclusters are clubbed to form a mega cluster and similar megaclusters are then combined to form super cluster. Thus thepaper describes the different levels of clustering whichoptimizes the search process by directing the searchto a specific path from higher levels of clustering to the lowerlevels i.e. from super clusters to mega clusters, then to clustersand finally to the individual documents so that the user gets thebest possible matching results in minimum possible time.

  4. Active Clustering: Robust and Efficient Hierarchical Clustering using Adaptively Selected Similarities

    CERN Document Server

    Eriksson, Brian; Singh, Aarti; Nowak, Robert

    2011-01-01

    Hierarchical clustering based on pairwise similarities is a common tool used in a broad range of scientific applications. However, in many problems it may be expensive to obtain or compute similarities between the items to be clustered. This paper investigates the hierarchical clustering of N items based on a small subset of pairwise similarities, significantly less than the complete set of N(N-1)/2 similarities. First, we show that if the intracluster similarities exceed intercluster similarities, then it is possible to correctly determine the hierarchical clustering from as few as 3N log N similarities. We demonstrate this order of magnitude savings in the number of pairwise similarities necessitates sequentially selecting which similarities to obtain in an adaptive fashion, rather than picking them at random. We then propose an active clustering method that is robust to a limited fraction of anomalous similarities, and show how even in the presence of these noisy similarity values we can resolve the hierar...

  5. Hierarchical cluster-tendency analysis of the group structure in the foreign exchange market

    Science.gov (United States)

    Wu, Xin-Ye; Zheng, Zhi-Gang

    2013-08-01

    A hierarchical cluster-tendency (HCT) method in analyzing the group structure of networks of the global foreign exchange (FX) market is proposed by combining the advantages of both the minimal spanning tree (MST) and the hierarchical tree (HT). Fifty currencies of the top 50 World GDP in 2010 according to World Bank's database are chosen as the underlying system. By using the HCT method, all nodes in the FX market network can be "colored" and distinguished. We reveal that the FX networks can be divided into two groups, i.e., the Asia-Pacific group and the Pan-European group. The results given by the hierarchical cluster-tendency method agree well with the formerly observed geographical aggregation behavior in the FX market. Moreover, an oil-resource aggregation phenomenon is discovered by using our method. We find that gold could be a better numeraire for the weekly-frequency FX data.

  6. Hierarchical Cluster Analysis – Various Approaches to Data Preparation

    Directory of Open Access Journals (Sweden)

    Z. Pacáková

    2013-09-01

    Full Text Available The article deals with two various approaches to data preparation to avoid multicollinearity. The aim of the article is to find similarities among the e-communication level of EU states using hierarchical cluster analysis. The original set of fourteen indicators was first reduced on the basis of correlation analysis while in case of high correlation indicator of higher variability was included in further analysis. Secondly the data were transformed using principal component analysis while the principal components are poorly correlated. For further analysis five principal components explaining about 92% of variance were selected. Hierarchical cluster analysis was performed both based on the reduced data set and the principal component scores. Both times three clusters were assumed following Pseudo t-Squared and Pseudo F Statistic, but the final clusters were not identical. An important characteristic to compare the two results found was to look at the proportion of variance accounted for by the clusters which was about ten percent higher for the principal component scores (57.8% compared to 47%. Therefore it can be stated, that in case of using principal component scores as an input variables for cluster analysis with explained proportion high enough (about 92% for in our analysis, the loss of information is lower compared to data reduction on the basis of correlation analysis.

  7. Concept Association and Hierarchical Hamming Clustering Model in Text Classification

    Institute of Scientific and Technical Information of China (English)

    Su Gui-yang; Li Jian-hua; Ma Ying-hua; Li Sheng-hong; Yin Zhong-hang

    2004-01-01

    We propose two models in this paper. The concept of association model is put forward to obtain the co-occurrence relationships among keywords in the documents and the hierarchical Hamming clustering model is used to reduce the dimensionality of the category feature vector space which can solve the problem of the extremely high dimensionality of the documents' feature space. The results of experiment indicate that it can obtain the co-occurrence relations among keywords in the documents which promote the recall of classification system effectively. The hierarchical Hamming clustering model can reduce the dimensionality of the category feature vector efficiently, the size of the vector space is only about 10% of the primary dimensionality.

  8. Non-hierarchical clustering methods on factorial subspaces

    OpenAIRE

    Tortora, Cristina

    2011-01-01

    Cluster analysis (CA) aims at finding homogeneous group of individuals, where homogeneous is referred to individuals that present similar characteristics. Many CA techniques already exist, among the non-hierarchical ones the most known, thank to its simplicity and computational property, is k-means method. However, the method is unstable when the number of variables is large and when variables are correlated. This problem leads to the development of two-step methods, they perform a linear tra...

  9. Image Segmentation by Hierarchical Spatial and Color Spaces Clustering

    Institute of Scientific and Technical Information of China (English)

    YU Wei

    2005-01-01

    Image segmentation, as a basic building block for many high-level image analysis problems, has attracted many research attentions over years. Existing approaches, however, are mainly focusing on the clustering analysis in the single channel information, i.e., either in color or spatial space, which may lead to unsatisfactory segmentation performance. Considering the spatial and color spaces jointly, this paper proposes a new hierarchical image segmentation algorithm, which alternately clusters the image regions in color and spatial spaces in a fine to coarse manner. Without losing the perceptual consistence, the proposed algorithm achieves the segmentation result using only very few number of colors according to user specification.

  10. Hierarchical structure of the Sicilian goats revealed by Bayesian analyses of microsatellite information.

    Science.gov (United States)

    Siwek, M; Finocchiaro, R; Curik, I; Portolano, B

    2011-02-01

    Genetic structure and relationship amongst the main goat populations in Sicily (Girgentana, Derivata di Siria, Maltese and Messinese) were analysed using information from 19 microsatellite markers genotyped on 173 individuals. A posterior Bayesian approach implemented in the program STRUCTURE revealed a hierarchical structure with two clusters at the first level (Girgentana vs. Messinese, Derivata di Siria and Maltese), explaining 4.8% of variation (amovaФ(ST) estimate). Seven clusters nested within these first two clusters (further differentiations of Girgentana, Derivata di Siria and Maltese), explaining 8.5% of variation (amovaФ(SC) estimate). The analyses and methods applied in this study indicate their power to detect subtle population structure.

  11. A fast quad-tree based two dimensional hierarchical clustering.

    Science.gov (United States)

    Rajadurai, Priscilla; Sankaranarayanan, Swamynathan

    2012-01-01

    Recently, microarray technologies have become a robust technique in the area of genomics. An important step in the analysis of gene expression data is the identification of groups of genes disclosing analogous expression patterns. Cluster analysis partitions a given dataset into groups based on specified features. Euclidean distance is a widely used similarity measure for gene expression data that considers the amount of changes in gene expression. However, the huge number of genes and the intricacy of biological networks have highly increased the challenges of comprehending and interpreting the resulting group of data, increasing processing time. The proposed technique focuses on a QT based fast 2-dimensional hierarchical clustering algorithm to perform clustering. The construction of the closest pair data structure is an each level is an important time factor, which determines the processing time of clustering. The proposed model reduces the processing time and improves analysis of gene expression data.

  12. Extending stability through hierarchical clusters in Echo State Networks

    Directory of Open Access Journals (Sweden)

    Sarah Jarvis

    2010-07-01

    Full Text Available Echo State Networks (ESN are reservoir networks that satisfy well-established criteria for stability when constructed as feedforward networks. Recent evidence suggests that stability criteria are altered in the presence of reservoir substructures, such as clusters. Understanding how the reservoir architecture affects stability is thus important for the appropriate design of any ESN. To quantitatively determine the influence of the most relevant network parameters, we analysed the impact of reservoir substructures on stability in hierarchically clustered ESNs (HESN, as they allow a smooth transition from highly structured to increasingly homogeneous reservoirs. Previous studies used the largest eigenvalue of the reservoir connectivity matrix (spectral radius as a predictor for stable network dynamics. Here, we evaluate the impact of clusters, hierarchy and intercluster connectivity on the predictive power of the spectral radius for stability. Both hierarchy and low relative cluster sizes extend the range of spectral radius values, leading to stable networks, while increasing intercluster connectivity decreased maximal spectral radius.

  13. Multi-mode clustering model for hierarchical wireless sensor networks

    Science.gov (United States)

    Hu, Xiangdong; Li, Yongfu; Xu, Huifen

    2017-03-01

    The topology management, i.e., clusters maintenance, of wireless sensor networks (WSNs) is still a challenge due to its numerous nodes, diverse application scenarios and limited resources as well as complex dynamics. To address this issue, a multi-mode clustering model (M2 CM) is proposed to maintain the clusters for hierarchical WSNs in this study. In particular, unlike the traditional time-trigger model based on the whole-network and periodic style, the M2 CM is proposed based on the local and event-trigger operations. In addition, an adaptive local maintenance algorithm is designed for the broken clusters in the WSNs using the spatial-temporal demand changes accordingly. Numerical experiments are performed using the NS2 network simulation platform. Results validate the effectiveness of the proposed model with respect to the network maintenance costs, node energy consumption and transmitted data as well as the network lifetime.

  14. Multilevel compression of random walks on networks reveals hierarchical organization in large integrated systems

    CERN Document Server

    Rosvall, M

    2010-01-01

    To comprehend the hierarchical organization of large integrated systems, we introduce the hierarchical map equation that reveals multilevel structures in networks. In this information-theoretic approach, we exploit the duality between compression and pattern detection; by compressing a description of a random walker as a proxy for real flow on a network, we find regularities in the network that induce this system-wide flow. Finding the shortest multilevel description of the random walker therefore gives us the best hierarchical clustering of the network, the optimal number of levels and modular partition at each level, with respect to the dynamics on the network. With a novel search algorithm, we extract and illustrate the rich multilevel organization of several large social and biological networks. For example, from the global air traffic network we uncover countries and continents, and from the pattern of scientific communication we reveal more than 100 scientific fields organized in four major disciplines:...

  15. Globular cluster formation with multiple stellar populations from hierarchical star cluster complexes

    Science.gov (United States)

    Bekki, Kenji

    2017-01-01

    Most old globular clusters (GCs) in the Galaxy are observed to have internal chemical abundance spreads in light elements. We discuss a new GC formation scenario based on hierarchical star formation within fractal molecular clouds. In the new scenario, a cluster of bound and unbound star clusters (`star cluster complex', SCC) that have a power-law cluster mass function with a slope (β) of 2 is first formed from a massive gas clump developed in a dwarf galaxy. Such cluster complexes and β = 2 are observed and expected from hierarchical star formation. The most massive star cluster (`main cluster'), which is the progenitor of a GC, can accrete gas ejected from asymptotic giant branch (AGB) stars initially in the cluster and other low-mass clusters before the clusters are tidally stripped or destroyed to become field stars in the dwarf. The SCC is initially embedded in a giant gas hole created by numerous supernovae of the SCC so that cold gas outside the hole can be accreted onto the main cluster later. New stars formed from the accreted gas have chemical abundances that are different from those of the original SCC. Using hydrodynamical simulations of GC formation based on this scenario, we show that the main cluster with the initial mass as large as [2 - 5] × 105M⊙ can accrete more than 105M⊙ gas from AGB stars of the SCC. We suggest that merging of hierarchical star cluster complexes can play key roles in stellar halo formation around GCs and self-enrichment processes in the early phase of GC formation.

  16. Hierarchically Clustered Star Formation in the Magellanic Clouds

    CERN Document Server

    Gouliermis, Dimitrios A; Ossenkopf, Volker; Klessen, Ralf S; Dolphin, Andrew E

    2012-01-01

    We present a cluster analysis of the bright main-sequence and faint pre--main-sequence stellar populations of a field ~ 90 x 90 pc centered on the HII region NGC 346/N66 in the Small Magellanic Cloud, from imaging with HST/ACS. We extend our earlier analysis on the stellar cluster population in the region to characterize the structuring behavior of young stars in the region as a whole with the use of stellar density maps interpreted through techniques designed for the study of the ISM structuring. In particular, we demonstrate with Cartwrigth & Whitworth's Q parameter, dendrograms, and the Delta-variance wavelet transform technique that the young stellar populations in the region NGC 346/N66 are hierarchically clustered, in agreement with other regions in the Magellanic Clouds observed with HST. The origin of this hierarchy is currently under investigation.

  17. Multiscale stochastic hierarchical image segmentation by spectral clustering

    Institute of Scientific and Technical Information of China (English)

    LI XiaoBin; TIAN Zheng

    2007-01-01

    This paper proposes a sampling based hierarchical approach for solving the computational demands of the spectral clustering methods when applied to the problem of image segmentation. The authors first define the distance between a pixel and a cluster, and then derive a new theorem to estimate the number of samples needed for clustering. Finally, by introducing a scale parameter into the similarity function, a novel spectral clustering based image segmentation method has been developed. An important characteristic of the approach is that in the course of image segmentation one needs not only to tune the scale parameter to merge the small size clusters or split the large size clusters but also take samples from the data set at the different scales. The multiscale and stochastic nature makes it feasible to apply the method to very large grouping problem. In addition, it also makes the segmentation compute in time that is linear in the size of the image. The experimental results on various synthetic and real world images show the effectiveness of the approach.

  18. An agglomerative hierarchical approach to visualization in Bayesian clustering problems.

    Science.gov (United States)

    Dawson, K J; Belkhir, K

    2009-07-01

    Clustering problems (including the clustering of individuals into outcrossing populations, hybrid generations, full-sib families and selfing lines) have recently received much attention in population genetics. In these clustering problems, the parameter of interest is a partition of the set of sampled individuals--the sample partition. In a fully Bayesian approach to clustering problems of this type, our knowledge about the sample partition is represented by a probability distribution on the space of possible sample partitions. As the number of possible partitions grows very rapidly with the sample size, we cannot visualize this probability distribution in its entirety, unless the sample is very small. As a solution to this visualization problem, we recommend using an agglomerative hierarchical clustering algorithm, which we call the exact linkage algorithm. This algorithm is a special case of the maximin clustering algorithm that we introduced previously. The exact linkage algorithm is now implemented in our software package PartitionView. The exact linkage algorithm takes the posterior co-assignment probabilities as input and yields as output a rooted binary tree, or more generally, a forest of such trees. Each node of this forest defines a set of individuals, and the node height is the posterior co-assignment probability of this set. This provides a useful visual representation of the uncertainty associated with the assignment of individuals to categories. It is also a useful starting point for a more detailed exploration of the posterior distribution in terms of the co-assignment probabilities.

  19. Clustering-based classification of road traffic accidents using hierarchical clustering and artificial neural networks.

    Science.gov (United States)

    Taamneh, Madhar; Taamneh, Salah; Alkheder, Sharaf

    2017-09-01

    Artificial neural networks (ANNs) have been widely used in predicting the severity of road traffic crashes. All available information about previously occurred accidents is typically used for building a single prediction model (i.e., classifier). Too little attention has been paid to the differences between these accidents, leading, in most cases, to build less accurate predictors. Hierarchical clustering is a well-known clustering method that seeks to group data by creating a hierarchy of clusters. Using hierarchical clustering and ANNs, a clustering-based classification approach for predicting the injury severity of road traffic accidents was proposed. About 6000 road accidents occurred over a six-year period from 2008 to 2013 in Abu Dhabi were used throughout this study. In order to reduce the amount of variation in data, hierarchical clustering was applied on the data set to organize it into six different forms, each with different number of clusters (i.e., clusters from 1 to 6). Two ANN models were subsequently built for each cluster of accidents in each generated form. The first model was built and validated using all accidents (training set), whereas only 66% of the accidents were used to build the second model, and the remaining 34% were used to test it (percentage split). Finally, the weighted average accuracy was computed for each type of models in each from of data. The results show that when testing the models using the training set, clustering prior to classification achieves (11%-16%) more accuracy than without using clustering, while the percentage split achieves (2%-5%) more accuracy. The results also suggest that partitioning the accidents into six clusters achieves the best accuracy if both types of models are taken into account.

  20. Kinematic gait patterns in healthy runners: A hierarchical cluster analysis.

    Science.gov (United States)

    Phinyomark, Angkoon; Osis, Sean; Hettinga, Blayne A; Ferber, Reed

    2015-11-01

    Previous studies have demonstrated distinct clusters of gait patterns in both healthy and pathological groups, suggesting that different movement strategies may be represented. However, these studies have used discrete time point variables and usually focused on only one specific joint and plane of motion. Therefore, the first purpose of this study was to determine if running gait patterns for healthy subjects could be classified into homogeneous subgroups using three-dimensional kinematic data from the ankle, knee, and hip joints. The second purpose was to identify differences in joint kinematics between these groups. The third purpose was to investigate the practical implications of clustering healthy subjects by comparing these kinematics with runners experiencing patellofemoral pain (PFP). A principal component analysis (PCA) was used to reduce the dimensionality of the entire gait waveform data and then a hierarchical cluster analysis (HCA) determined group sets of similar gait patterns and homogeneous clusters. The results show two distinct running gait patterns were found with the main between-group differences occurring in frontal and sagittal plane knee angles (Pgait strategies. These results suggest care must be taken when selecting samples of subjects in order to investigate the pathomechanics of injured runners.

  1. Hierarchical star cluster assembly in globally collapsing molecular clouds

    Science.gov (United States)

    Vázquez-Semadeni, Enrique; González-Samaniego, Alejandro; Colín, Pedro

    2017-05-01

    We discuss the mechanism of cluster formation in a numerical simulation of a molecular cloud (MC) undergoing global hierarchical collapse, focusing on how the gas motions in the parent cloud control the assembly of the cluster. The global collapse implies that the star formation rate (SFR) increases over time. The collapse is hierarchical because it consists of small-scale collapses within larger scale ones. The latter culminate a few Myr later than the first small-scale ones and consist of filamentary flows that accrete on to massive central clumps. The small-scale collapses consist of clumps that are embedded in the filaments and falling on to the large-scale collapse centres. The stars formed in the early, small-scale collapses share the infall motion of their parent clumps, so that the filaments feed both gas and stars to the massive central clump. This process leads to the presence of a few older stars in a region where new protostars are forming, and also to a self-similar structure, in which each unit is composed of smaller scale subunits that approach each other and may merge. Because the older stars formed in the filaments share the infall motion of the gas on to the central clump, they tend to have larger velocities and to be distributed over larger areas than the younger stars formed in the central clump. Finally, interpreting the initial mass function (IMF) simply as a probability distribution implies that massive stars only form once the local SFR is large enough to sample the IMF up to high masses. In combination with the increase of the SFR, this implies that massive stars tend to appear late in the evolution of the MC, and only in the central massive clumps. We discuss the correspondence of these features with observed properties of young stellar clusters, finding very good qualitative agreement.

  2. Multilevel compression of random walks on networks reveals hierarchical organization in large integrated systems.

    Directory of Open Access Journals (Sweden)

    Martin Rosvall

    Full Text Available To comprehend the hierarchical organization of large integrated systems, we introduce the hierarchical map equation, which reveals multilevel structures in networks. In this information-theoretic approach, we exploit the duality between compression and pattern detection; by compressing a description of a random walker as a proxy for real flow on a network, we find regularities in the network that induce this system-wide flow. Finding the shortest multilevel description of the random walker therefore gives us the best hierarchical clustering of the network--the optimal number of levels and modular partition at each level--with respect to the dynamics on the network. With a novel search algorithm, we extract and illustrate the rich multilevel organization of several large social and biological networks. For example, from the global air traffic network we uncover countries and continents, and from the pattern of scientific communication we reveal more than 100 scientific fields organized in four major disciplines: life sciences, physical sciences, ecology and earth sciences, and social sciences. In general, we find shallow hierarchical structures in globally interconnected systems, such as neural networks, and rich multilevel organizations in systems with highly separated regions, such as road networks.

  3. Lyman Alpha Emitters in the Hierarchically Clustering Galaxy Formation

    CERN Document Server

    Kobayashi, Masakazu A R; Nagashima, Masahiro

    2007-01-01

    We present a new theoretical model for the luminosity functions (LFs) of Lyman alpha (Lya) emitting galaxies in the framework of hierarchical galaxy formation. We extend a semi-analytic model of galaxy formation that reproduces a number of observations for local galaxies, without changing the original model parameters but introducing a physically-motivated modelling to describe the escape fraction of Lya photons from host galaxies (f_esc). Though a previous study using a hierarchical clustering model simply assumed a constant and universal value of f_esc, we incorporate two new effects on f_esc: extinction by interstellar dust and galaxy-scale outflow induced as a star formation feedback. It is found that the new model nicely reproduces all the observed Lya LFs of the Lya emitters (LAEs) at different redshifts in z ~ 3--6. Our model predicts that galaxies with strong outflows and f_esc ~ 1 are dominant in the observed LFs, which is consistent with available observations while the simple universal f_esc model ...

  4. The structure of dark matter halos in hierarchical clustering theories

    CERN Document Server

    Subramanian, K; Ostriker, J P; Subramanian, Kandaswamy; Cen, Renyue; Ostriker, Jeremiah P.

    1999-01-01

    During hierarchical clustering, smaller masses generally collapse earlier than larger masses and so are denser on the average. The core of a small mass halo could be dense enough to resist disruption and survive undigested, when it is incorporated into a bigger object. We explore the possibility that a nested sequence of undigested cores in the center of the halo, which have survived the hierarchical, inhomogeneous collapse to form larger and larger objects, determines the halo structure in the inner regions. For a flat universe with $P(k) \\propto k^n$, scaling arguments then suggest that the core density profile is, $\\rho \\propto r^{-\\alpha}$ with $\\alpha = (9+3n)/(5+n)$. But whether such behaviour obtains depends on detailed dynamics. We first examine the dynamics using a fluid approach to the self-similar collapse solutions for the dark matter phase space density, including the effect of velocity dispersions. We highlight the importance of tangential velocity dispersions to obtain density profiles shallowe...

  5. Hierarchical Compressed Sensing for Cluster Based Wireless Sensor Networks

    Directory of Open Access Journals (Sweden)

    Vishal Krishna Singh

    2016-02-01

    Full Text Available Data transmission consumes significant amount of energy in large scale wireless sensor networks (WSNs. In such an environment, reducing the in-network communication and distributing the load evenly over the network can reduce the overall energy consumption and maximize the network lifetime significantly. In this work, the aforementioned problem of network lifetime and uneven energy consumption in large scale wireless sensor networks is addressed. This work proposes a hierarchical compressed sensing (HCS scheme to reduce the in-network communication during the data gathering process. Co-related sensor readings are collected via a hierarchical clustering scheme. A compressed sensing (CS based data processing scheme is devised to transmit the data from the source to the sink. The proposed HCS is able to identify the optimal position for the application of CS to achieve reduced and similar number of transmissions on all the nodes in the network. An activity map is generated to validate the reduced and uniformly distributed communication load of the WSN. Based on the number of transmissions per data gathering round, the bit-hop metric model is used to analyse the overall energy consumption. Simulation results validate the efficiency of the proposed method over the existing CS based approaches.

  6. Hand Tracking based on Hierarchical Clustering of Range Data

    CERN Document Server

    Cespi, Roberto; Lindner, Marvin

    2011-01-01

    Fast and robust hand segmentation and tracking is an essential basis for gesture recognition and thus an important component for contact-less human-computer interaction (HCI). Hand gesture recognition based on 2D video data has been intensively investigated. However, in practical scenarios purely intensity based approaches suffer from uncontrollable environmental conditions like cluttered background colors. In this paper we present a real-time hand segmentation and tracking algorithm using Time-of-Flight (ToF) range cameras and intensity data. The intensity and range information is fused into one pixel value, representing its combined intensity-depth homogeneity. The scene is hierarchically clustered using a GPU based parallel merging algorithm, allowing a robust identification of both hands even for inhomogeneous backgrounds. After the detection, both hands are tracked on the CPU. Our tracking algorithm can cope with the situation that one hand is temporarily covered by the other hand.

  7. Identifying Reference Objects by Hierarchical Clustering in Java Environment

    Directory of Open Access Journals (Sweden)

    RAHUL SAHA

    2011-09-01

    Full Text Available Recently Java programming environment has become so popular. Java programming language is a language that is designed to be portable enough to be executed in wide range of computers ranging from cell phones to supercomputers. Computer programs written in Java are compiled into Java Byte code instructions that are suitable for execution by a Java Virtual Machine implementation. Java virtual Machine is commonly implemented in software by means of an interpreter for the Java Virtual Machine instruction set. As an object oriented language, Java utilizes the concept of objects. Our idea is to identify the candidate objects references in a Java environment through hierarchical cluster analysis using reference stack and execution stack.

  8. Novel density-based and hierarchical density-based clustering algorithms for uncertain data.

    Science.gov (United States)

    Zhang, Xianchao; Liu, Han; Zhang, Xiaotong

    2017-09-01

    Uncertain data has posed a great challenge to traditional clustering algorithms. Recently, several algorithms have been proposed for clustering uncertain data, and among them density-based techniques seem promising for handling data uncertainty. However, some issues like losing uncertain information, high time complexity and nonadaptive threshold have not been addressed well in the previous density-based algorithm FDBSCAN and hierarchical density-based algorithm FOPTICS. In this paper, we firstly propose a novel density-based algorithm PDBSCAN, which improves the previous FDBSCAN from the following aspects: (1) it employs a more accurate method to compute the probability that the distance between two uncertain objects is less than or equal to a boundary value, instead of the sampling-based method in FDBSCAN; (2) it introduces new definitions of probability neighborhood, support degree, core object probability, direct reachability probability, thus reducing the complexity and solving the issue of nonadaptive threshold (for core object judgement) in FDBSCAN. Then, we modify the algorithm PDBSCAN to an improved version (PDBSCANi), by using a better cluster assignment strategy to ensure that every object will be assigned to the most appropriate cluster, thus solving the issue of nonadaptive threshold (for direct density reachability judgement) in FDBSCAN. Furthermore, as PDBSCAN and PDBSCANi have difficulties for clustering uncertain data with non-uniform cluster density, we propose a novel hierarchical density-based algorithm POPTICS by extending the definitions of PDBSCAN, adding new definitions of fuzzy core distance and fuzzy reachability distance, and employing a new clustering framework. POPTICS can reveal the cluster structures of the datasets with different local densities in different regions better than PDBSCAN and PDBSCANi, and it addresses the issues in FOPTICS. Experimental results demonstrate the superiority of our proposed algorithms over the existing

  9. The reflection of hierarchical cluster analysis of co-occurrence matrices in SPSS

    NARCIS (Netherlands)

    Zhou, Q.; Leng, F.; Leydesdorff, L.

    2015-01-01

    Purpose: To discuss the problems arising from hierarchical cluster analysis of co-occurrence matrices in SPSS, and the corresponding solutions. Design/methodology/approach: We design different methods of using the SPSS hierarchical clustering module for co-occurrence matrices in order to compare the

  10. The reflection of hierarchical cluster analysis of co-occurrence matrices in SPSS

    NARCIS (Netherlands)

    Zhou, Q.; Leng, F.; Leydesdorff, L.

    2015-01-01

    Purpose: To discuss the problems arising from hierarchical cluster analysis of co-occurrence matrices in SPSS, and the corresponding solutions. Design/methodology/approach: We design different methods of using the SPSS hierarchical clustering module for co-occurrence matrices in order to compare

  11. [Study of the clinical phenotype of symptomatic chronic airways disease by hierarchical cluster analysis and two-step cluster analyses].

    Science.gov (United States)

    Ning, P; Guo, Y F; Sun, T Y; Zhang, H S; Chai, D; Li, X M

    2016-09-01

    To study the distinct clinical phenotype of chronic airway diseases by hierarchical cluster analysis and two-step cluster analysis. A population sample of adult patients in Donghuamen community, Dongcheng district and Qinghe community, Haidian district, Beijing from April 2012 to January 2015, who had wheeze within the last 12 months, underwent detailed investigation, including a clinical questionnaire, pulmonary function tests, total serum IgE levels, blood eosinophil level and a peak flow diary. Nine variables were chosen as evaluating parameters, including pre-salbutamol forced expired volume in one second(FEV1)/forced vital capacity(FVC) ratio, pre-salbutamol FEV1, percentage of post-salbutamol change in FEV1, residual capacity, diffusing capacity of the lung for carbon monoxide/alveolar volume adjusted for haemoglobin level, peak expiratory flow(PEF) variability, serum IgE level, cumulative tobacco cigarette consumption (pack-years) and respiratory symptoms (cough and expectoration). Subjects' different clinical phenotype by hierarchical cluster analysis and two-step cluster analysis was identified. (1) Four clusters were identified by hierarchical cluster analysis. Cluster 1 was chronic bronchitis in smokers with normal pulmonary function. Cluster 2 was chronic bronchitis or mild chronic obstructive pulmonary disease (COPD) patients with mild airflow limitation. Cluster 3 included COPD patients with heavy smoking, poor quality of life and severe airflow limitation. Cluster 4 recognized atopic patients with mild airflow limitation, elevated serum IgE and clinical features of asthma. Significant differences were revealed regarding pre-salbutamol FEV1/FVC%, pre-salbutamol FEV1% pred, post-salbutamol change in FEV1%, maximal mid-expiratory flow curve(MMEF)% pred, carbon monoxide diffusing capacity per liter of alveolar(DLCO)/(VA)% pred, residual volume(RV)% pred, total serum IgE level, smoking history (pack-years), St.George's respiratory questionnaire

  12. Evolutionary-Hierarchical Bases of the Formation of Cluster Model of Innovation Economic Development

    Directory of Open Access Journals (Sweden)

    Yuliya Vladimirovna Dubrovskaya

    2016-10-01

    Full Text Available The functioning of a modern economic system is based on the interaction of objects of different hierarchical levels. Thus, the problem of the study of innovation processes taking into account the mutual influence of the activities of these economic actors becomes important. The paper dwells evolutionary basis for the formation of models of innovation development on the basis of micro and macroeconomic analysis. Most of the concepts recognized that despite a big number of diverse models, the coordination of the relations between economic agents is of crucial importance for the successful innovation development. According to the results of the evolutionary-hierarchical analysis, the authors reveal key phases of the development of forms of business cooperation, science and government in the domestic economy. It has become the starting point of the conception of the characteristics of the interaction in the cluster models of innovation development of the economy. Considerable expectancies on improvement of the national innovative system are connected with the development of cluster and network structures. The main objective of government authorities is the formation of mechanisms and institutions that will foster cooperation between members of the clusters. The article explains that the clusters cannot become the factors in the growth of the national economy, not being an effective tool for interaction between the actors of the regional innovative systems.

  13. A Framework for Analyzing Software Quality using Hierarchical Clustering

    Directory of Open Access Journals (Sweden)

    Arashdeep Kaur

    2011-02-01

    Full Text Available Fault proneness data available in the early software life cycle from previous releases or similar kind of projects will aid in improving software quality estimations. Various techniques have been proposed in the literature which includes statistical method, machine learning methods, neural network techniques and clustering techniques for the prediction of faulty and non faulty modules in the project. In this study, Hierarchical clustering algorithm is being trained and tested with lifecycle data collected from NASA projects namely, CM1, PC1 and JM1 as predictive models. These predictive models contain requirement metrics and static code metrics. We have combined requirement metric model with static code metric model to get fusion metric model. Further we have investigated that which of the three prediction models is found to be the best prediction model on the basis of fault detection. The basic hypothesis of software quality estimation is that automatic quality prediction models enable verificationexperts to concentrate their attention and resources at problem areas of the system under development. The proposed approach has been implemented in MATLAB 7.4. The results show that when all the prediction techniques are evaluated, the best prediction model is found to be the fusion metric model. This proposed model is also compared with other quality models available in the literature and is found to be efficient for predicting faulty modules.

  14. Applying of hierarchical clustering to analysis of protein patterns in the human cancer-associated liver.

    Directory of Open Access Journals (Sweden)

    Natalia A Petushkova

    Full Text Available There are two ways that statistical methods can learn from biomedical data. One way is to learn classifiers to identify diseases and to predict outcomes using the training dataset with established diagnosis for each sample. When the training dataset is not available the task can be to mine for presence of meaningful groups (clusters of samples and to explore underlying data structure (unsupervised learning.We investigated the proteomic profiles of the cytosolic fraction of human liver samples using two-dimensional electrophoresis (2DE. Samples were resected upon surgical treatment of hepatic metastases in colorectal cancer. Unsupervised hierarchical clustering of 2DE gel images (n = 18 revealed a pair of clusters, containing 11 and 7 samples. Previously we used the same specimens to measure biochemical profiles based on cytochrome P450-dependent enzymatic activities and also found that samples were clearly divided into two well-separated groups by cluster analysis. It turned out that groups by enzyme activity almost perfectly match to the groups identified from proteomic data. Of the 271 reproducible spots on our 2DE gels, we selected 15 to distinguish the human liver cytosolic clusters. Using MALDI-TOF peptide mass fingerprinting, we identified 12 proteins for the selected spots, including known cancer-associated species.Our results highlight the importance of hierarchical cluster analysis of proteomic data, and showed concordance between results of biochemical and proteomic approaches. Grouping of the human liver samples and/or patients into differing clusters may provide insights into possible molecular mechanism of drug metabolism and creates a rationale for personalized treatment.

  15. Determination of genetic structure of germplasm collections: are traditional hierarchical clustering methods appropriate for molecular marker data?

    NARCIS (Netherlands)

    Odong, T.L.; Heerwaarden, van J.; Jansen, J.; Hintum, van T.J.L.; Eeuwijk, van F.A.

    2011-01-01

    Despite the availability of newer approaches, traditional hierarchical clustering remains very popular in genetic diversity studies in plants. However, little is known about its suitability for molecular marker data. We studied the performance of traditional hierarchical clustering techniques using

  16. A combined multidimensional scaling and hierarchical clustering view for the exploratory analysis of multidimensional data

    Science.gov (United States)

    Craig, Paul; Roa-Seïler, Néna

    2013-01-01

    This paper describes a novel information visualization technique that combines multidimensional scaling and hierarchical clustering to support the exploratory analysis of multidimensional data. The technique displays the results of multidimensional scaling using a scatter plot where the proximity of any two items' representations is approximate to their similarity according to a Euclidean distance metric. The results of hierarchical clustering are overlaid onto this view by drawing smoothed outlines around each nested cluster. The difference in similarity between successive cluster combinations is used to colour code clusters and make stronger natural clusters more prominent in the display. When a cluster or group of items is selected, multidimensional scaling and hierarchical clustering are re-applied to a filtered subset of the data, and animation is used to smooth the transition between successive filtered views. As a case study we demonstrate the technique being used to analyse survey data relating to the appropriateness of different phrases to different emotionally charged situations.

  17. Hierarchical clusters in families with type 2 diabetes

    Science.gov (United States)

    García-Solano, Beatriz; Gallegos-Cabriales, Esther C; Gómez-Meza, Marco V; García-Madrid, Guillermina; Flores-Merlo, Marcela; García-Solano, Mauro

    2015-01-01

    Families represent more than a set of individuals; family is more than a sum of its individual members. With this classification, nurses can identify the family health-illness beliefs obey family as a unit concept, and plan family inclusion into the type 2 diabetes treatment, whom is not considered in public policy, despite families share diet, exercise, and self-monitoring with a member who suffers type 2 diabetes. The aim of this study was to determine whether the characteristics, functionality, routines, and family and individual health in type 2 diabetes describes the differences and similarities between families to consider them as a unit. We performed an exploratory, descriptive hierarchical cluster analysis of 61 families using three instruments and a questionnaire, in addition to weight, height, body fat percentage, hemoglobin A1c, total cholesterol, triglycerides, low-density lipoprotein and high-density lipoprotein. The analysis produced three groups of families. Wilk’s lambda demonstrated statistically significant differences provided by age (Λ = 0.778, F = 2.098, p = 0.010) and family health (Λ = 0.813, F = 2.650, p = 0.023). A post hoc Tukey test coincided with the three subsets. Families with type 2 diabetes have common elements that make them similar, while sharing differences that make them unique. PMID:27347419

  18. The formation of NGC 3603 young starburst cluster: "prompt" hierarchical assembly or monolithic starburst?

    CERN Document Server

    Banerjee, Sambaran

    2014-01-01

    The formation of very young massive clusters or "starburst" clusters is currently one of the most widely debated topic in astronomy. The classical notion dictates that a star cluster is formed in-situ in a dense molecular gas clump followed by a substantial residual gas expulsion. On the other hand, based on the observed morphologies of many young stellar associations, a hierarchical formation scenario is alternatively suggested. A very young (age $\\approx$ 1 Myr), massive ($>10^4M_\\odot$) star cluster like the Galactic NGC 3603 young cluster (HD 97950) is an appropriate testbed for distinguishing between such "monolithic" and "hierarchical" formation scenarios. A recent study by Banerjee and Kroupa (2014) demonstrates that the monolithic scenario remarkably reproduces the HD 97950 cluster. In the present work, we explore the possibility of the formation of the above cluster via hierarchical assembly of subclusters. These subclusters are initially distributed over a wide range of spatial volumes and have vari...

  19. A COMPARISON BETWEEN SINGLE LINKAGE AND COMPLETE LINKAGE IN AGGLOMERATIVE HIERARCHICAL CLUSTER ANALYSIS FOR IDENTIFYING TOURISTS SEGMENTS

    OpenAIRE

    Noor Rashidah Rashid

    2012-01-01

    Cluster Analysis is a multivariate method in statistics. Agglomerative Hierarchical Cluster Analysis is one of approaches in Cluster Analysis. There are two linkage methods in Agglomerative Hierarchical Cluster Analysis which are Single Linkage and Complete Linkage. The purpose of this study is to compare between Single Linkage and Complete Linkage in Agglomerative Hierarchical Cluster Analysis. The comparison of performances between these linkage methods was shown by using Kruskal-Wallis tes...

  20. A dynamic hierarchical clustering method for trajectory-based unusual video event detection.

    Science.gov (United States)

    Jiang, Fan; Wu, Ying; Katsaggelos, Aggelos K

    2009-04-01

    The proposed unusual video event detection method is based on unsupervised clustering of object trajectories, which are modeled by hidden Markov models (HMM). The novelty of the method includes a dynamic hierarchical process incorporated in the trajectory clustering algorithm to prevent model overfitting and a 2-depth greedy search strategy for efficient clustering.

  1. THE EVOLUTION OF BRIGHTEST CLUSTER GALAXIES IN A HIERARCHICAL UNIVERSE

    Energy Technology Data Exchange (ETDEWEB)

    Tonini, Chiara; Bernyk, Maksym; Croton, Darren [Centre for Astrophysics and Supercomputing, Swinburne University of Technology, Melbourne, VIC 3122 (Australia); Maraston, Claudia; Thomas, Daniel [Institute of Cosmology and Gravitation, University of Portsmouth, Portsmouth PO1 3FX (United Kingdom)

    2012-11-01

    We investigate the evolution of brightest cluster galaxies (BCGs) from redshift z {approx} 1.6 to z = 0. We upgrade the hierarchical semi-analytic model of Croton et al. with a new spectro-photometric model that produces realistic galaxy spectra, making use of the Maraston stellar populations and a new recipe for the dust extinction. We compare the model predictions of the K-band luminosity evolution and the J - K, V - I, and I - K color evolution with a series of data sets, including those of Collins et al. who argued that semi-analytic models based on the Millennium simulation cannot reproduce the red colors and high luminosity of BCGs at z > 1. We show instead that the model is well in range of the observed luminosity and correctly reproduces the color evolution of BCGs in the whole redshift range up to z {approx} 1.6. We argue that the success of the semi-analytic model is in large part due to the implementation of a more sophisticated spectro-photometric model. An analysis of the model BCGs shows an increase in mass by a factor of 2-3 since z {approx} 1, and star formation activity down to low redshifts. While the consensus regarding BCGs is that they are passively evolving, we argue that this conclusion is affected by the degeneracy between star formation history and stellar population models used in spectral energy distribution fitting, and by the inefficacy of toy models of passive evolution to capture the complexity of real galaxies, especially those with rich merger histories like BCGs. Following this argument, we also show that in the semi-analytic model the BCGs show a realistic mix of stellar populations, and that these stellar populations are mostly old. In addition, the age-redshift relation of the model BCGs follows that of the universe, meaning that given their merger history and star formation history, the ageing of BCGs is always dominated by the ageing of their stellar populations. In a {Lambda}CDM universe, we define such evolution as &apos

  2. Microglia Morphological Categorization in a Rat Model of Neuroinflammation by Hierarchical Cluster and Principal Components Analysis

    Science.gov (United States)

    Fernández-Arjona, María del Mar; Grondona, Jesús M.; Granados-Durán, Pablo; Fernández-Llebrez, Pedro; López-Ávalos, María D.

    2017-01-01

    It is known that microglia morphology and function are closely related, but only few studies have objectively described different morphological subtypes. To address this issue, morphological parameters of microglial cells were analyzed in a rat model of aseptic neuroinflammation. After the injection of a single dose of the enzyme neuraminidase (NA) within the lateral ventricle (LV) an acute inflammatory process occurs. Sections from NA-injected animals and sham controls were immunolabeled with the microglial marker IBA1, which highlights ramifications and features of the cell shape. Using images obtained by section scanning, individual microglial cells were sampled from various regions (septofimbrial nucleus, hippocampus and hypothalamus) at different times post-injection (2, 4 and 12 h). Each cell yielded a set of 15 morphological parameters by means of image analysis software. Five initial parameters (including fractal measures) were statistically different in cells from NA-injected rats (most of them IL-1β positive, i.e., M1-state) compared to those from control animals (none of them IL-1β positive, i.e., surveillant state). However, additional multimodal parameters were revealed more suitable for hierarchical cluster analysis (HCA). This method pointed out the classification of microglia population in four clusters. Furthermore, a linear discriminant analysis (LDA) suggested three specific parameters to objectively classify any microglia by a decision tree. In addition, a principal components analysis (PCA) revealed two extra valuable variables that allowed to further classifying microglia in a total of eight sub-clusters or types. The spatio-temporal distribution of these different morphotypes in our rat inflammation model allowed to relate specific morphotypes with microglial activation status and brain location. An objective method for microglia classification based on morphological parameters is proposed. Main points Microglia undergo a quantifiable

  3. Hierarchical Clustering Algorithm based on Attribute Dependency for Attention Deficit Hyperactive Disorder

    Directory of Open Access Journals (Sweden)

    J Anuradha

    2014-05-01

    Full Text Available Attention Deficit Hyperactive Disorder (ADHD is a disruptive neurobehavioral disorder characterized by abnormal behavioral patterns in attention, perusing activity, acting impulsively and combined types. It is predominant among school going children and it is tricky to differentiate between an active and an ADHD child. Misdiagnosis and undiagnosed cases are very common. Behavior patterns are identified by the mentors in the academic environment who lack skills in screening those kids. Hence an unsupervised learning algorithm can cluster the behavioral patterns of children at school for diagnosis of ADHD. In this paper, we propose a hierarchical clustering algorithm to partition the dataset based on attribute dependency (HCAD. HCAD forms clusters of data based on the high dependent attributes and their equivalence relation. It is capable of handling large volumes of data with reasonably faster clustering than most of the existing algorithms. It can work on both labeled and unlabelled data sets. Experimental results reveal that this algorithm has higher accuracy in comparison to other algorithms. HCAD achieves 97% of cluster purity in diagnosing ADHD. Empirical analysis of application of HCAD on different data sets from UCI repository is provided.

  4. D Nearest Neighbour Search Using a Clustered Hierarchical Tree Structure

    Science.gov (United States)

    Suhaibah, A.; Uznir, U.; Anton, F.; Mioc, D.; Rahman, A. A.

    2016-06-01

    Locating and analysing the location of new stores or outlets is one of the common issues facing retailers and franchisers. This is due to assure that new opening stores are at their strategic location to attract the highest possible number of customers. Spatial information is used to manage, maintain and analyse these store locations. However, since the business of franchising and chain stores in urban areas runs within high rise multi-level buildings, a three-dimensional (3D) method is prominently required in order to locate and identify the surrounding information such as at which level of the franchise unit will be located or is the franchise unit located is at the best level for visibility purposes. One of the common used analyses used for retrieving the surrounding information is Nearest Neighbour (NN) analysis. It uses a point location and identifies the surrounding neighbours. However, with the immense number of urban datasets, the retrieval and analysis of nearest neighbour information and their efficiency will become more complex and crucial. In this paper, we present a technique to retrieve nearest neighbour information in 3D space using a clustered hierarchical tree structure. Based on our findings, the proposed approach substantially showed an improvement of response time analysis compared to existing approaches of spatial access methods in databases. The query performance was tested using a dataset consisting of 500,000 point locations building and franchising unit. The results are presented in this paper. Another advantage of this structure is that it also offers a minimal overlap and coverage among nodes which can reduce repetitive data entry.

  5. Can Clustering in Genotype Space Reveal "Niches"?

    Science.gov (United States)

    D'Andrea, Rafael; Ostling, Annette

    2016-01-01

    Community ecology lacks the success enjoyed by population genetics to quantify the relative roles played by deterministic and stochastic processes. It has been proposed that clustered patterns of abundance in genotype space provide evidence of selection in microbial communities, since no such clustering would arise in the absence of selection. We critique this test for its unrealistic null hypothesis. We show mathematically and with simulations that point mutations alone lead to clustering in genotype space by causing correlations between abundances of similar genotypes. We also show potential deviations from the mutation-only pattern caused by immigration from a source pool. Clustered patterns in genotype space may still be revealing of selection if analyzed quantitatively but only if neutral and selective regimes can be distinguished once mutation and immigration are included in the null model.

  6. The Hierarchical Distribution of the Young Stellar Clusters in Six Local Star-forming Galaxies

    Science.gov (United States)

    Grasha, K.; Calzetti, D.; Adamo, A.; Kim, H.; Elmegreen, B. G.; Gouliermis, D. A.; Dale, D. A.; Fumagalli, M.; Grebel, E. K.; Johnson, K. E.; Kahre, L.; Kennicutt, R. C.; Messa, M.; Pellerin, A.; Ryon, J. E.; Smith, L. J.; Shabani, F.; Thilker, D.; Ubeda, L.

    2017-05-01

    We present a study of the hierarchical clustering of the young stellar clusters in six local (3-15 Mpc) star-forming galaxies using Hubble Space Telescope broadband WFC3/UVIS UV and optical images from the Treasury Program LEGUS (Legacy ExtraGalactic UV Survey). We identified 3685 likely clusters and associations, each visually classified by their morphology, and we use the angular two-point correlation function to study the clustering of these stellar systems. We find that the spatial distribution of the young clusters and associations are clustered with respect to each other, forming large, unbound hierarchical star-forming complexes that are in general very young. The strength of the clustering decreases with increasing age of the star clusters and stellar associations, becoming more homogeneously distributed after ˜40-60 Myr and on scales larger than a few hundred parsecs. In all galaxies, the associations exhibit a global behavior that is distinct and more strongly correlated from compact clusters. Thus, populations of clusters are more evolved than associations in terms of their spatial distribution, traveling significantly from their birth site within a few tens of Myr, whereas associations show evidence of disruption occurring very quickly after their formation. The clustering of the stellar systems resembles that of a turbulent interstellar medium that drives the star formation process, correlating the components in unbound star-forming complexes in a hierarchical manner, dispersing shortly after formation, suggestive of a single, continuous mode of star formation across all galaxies.

  7. Content Based Image Retrieval using Hierarchical and K-Means Clustering Techniques

    Directory of Open Access Journals (Sweden)

    V.S.V.S. Murthy

    2010-03-01

    Full Text Available In this paper we present an image retrieval system that takes an image as the input query and retrieves images based on image content. Content Based Image Retrieval is an approach for retrieving semantically-relevant images from an image database based on automatically-derived image features. The unique aspect of the system is the utilization of hierarchical and k-means clustering techniques. The proposed procedure consists of two stages. First, here we are going to filter most of the images in the hierarchical clustering and then apply the clustered images to KMeans, so that we can get better favored image results.

  8. Hierarchical Control for Multiple DC-Microgrids Clusters

    DEFF Research Database (Denmark)

    Shafiee, Qobad; Dragicevic, Tomislav; Vasquez, Juan Carlos

    2014-01-01

    DC microgrids (MGs) have gained research interest during the recent years because of many potential advantages as compared to the ac system. To ensure reliable operation of a low-voltage dc MG as well as its intelligent operation with the other DC MGs, a hierarchical control is proposed in this p......DC microgrids (MGs) have gained research interest during the recent years because of many potential advantages as compared to the ac system. To ensure reliable operation of a low-voltage dc MG as well as its intelligent operation with the other DC MGs, a hierarchical control is proposed...

  9. Hierarchical Cluster Analysis: Comparison of Three Linkage Measures and Application to Psychological Data

    Directory of Open Access Journals (Sweden)

    Odilia Yim

    2015-02-01

    Full Text Available Cluster analysis refers to a class of data reduction methods used for sorting cases, observations, or variables of a given dataset into homogeneous groups that differ from each other. The present paper focuses on hierarchical agglomerative cluster analysis, a statistical technique where groups are sequentially created by systematically merging similar clusters together, as dictated by the distance and linkage measures chosen by the researcher. Specific distance and linkage measures are reviewed, including a discussion of how these choices can influence the clustering process by comparing three common linkage measures (single linkage, complete linkage, average linkage. The tutorial guides researchers in performing a hierarchical cluster analysis using the SPSS statistical software. Through an example, we demonstrate how cluster analysis can be used to detect meaningful subgroups in a sample of bilinguals by examining various language variables.

  10. Hierarchical cluster analysis of progression patterns in open-angle glaucoma patients with medical treatment.

    Science.gov (United States)

    Bae, Hyoung Won; Rho, Seungsoo; Lee, Hye Sun; Lee, Naeun; Hong, Samin; Seong, Gong Je; Sung, Kyung Rim; Kim, Chan Yun

    2014-04-29

    To classify medically treated open-angle glaucoma (OAG) by the pattern of progression using hierarchical cluster analysis, and to determine OAG progression characteristics by comparing clusters. Ninety-five eyes of 95 OAG patients who received medical treatment, and who had undergone visual field (VF) testing at least once per year for 5 or more years. OAG was classified into subgroups using hierarchical cluster analysis based on the following five variables: baseline mean deviation (MD), baseline visual field index (VFI), MD slope, VFI slope, and Glaucoma Progression Analysis (GPA) printout. After that, other parameters were compared between clusters. Two clusters were made after a hierarchical cluster analysis. Cluster 1 showed -4.06 ± 2.43 dB baseline MD, 92.58% ± 6.27% baseline VFI, -0.28 ± 0.38 dB per year MD slope, -0.52% ± 0.81% per year VFI slope, and all "no progression" cases in GPA printout, whereas cluster 2 showed -8.68 ± 3.81 baseline MD, 77.54 ± 12.98 baseline VFI, -0.72 ± 0.55 MD slope, -2.22 ± 1.89 VFI slope, and seven "possible" and four "likely" progression cases in GPA printout. There were no significant differences in age, sex, mean IOP, central corneal thickness, and axial length between clusters. However, cluster 2 included more high-tension glaucoma patients and used a greater number of antiglaucoma eye drops significantly compared with cluster 1. Hierarchical cluster analysis of progression patterns divided OAG into slow and fast progression groups, evidenced by assessing the parameters of glaucomatous progression in VF testing. In the fast progression group, the prevalence of high-tension glaucoma was greater and the number of antiglaucoma medications administered was increased versus the slow progression group. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.

  11. A Cluster in the Making: ALMA Reveals the Initial Conditions for High-mass Cluster Formation

    Science.gov (United States)

    Rathborne, J. M.; Longmore, S. N.; Jackson, J. M.; Alves, J. F.; Bally, J.; Bastian, N.; Contreras, Y.; Foster, J. B.; Garay, G.; Kruijssen, J. M. D.; Testi, L.; Walsh, A. J.

    2015-04-01

    G0.253+0.016 is a molecular clump that appears to be on the verge of forming a high-mass cluster: its extremely low dust temperature, high mass, and high density, combined with its lack of prevalent star formation, make it an excellent candidate for an Arches-like cluster in a very early stage of formation. Here we present new Atacama Large Millimeter/Sub-millimeter Array observations of its small-scale (∼0.07 pc) 3 mm dust continuum and molecular line emission from 17 different species that probe a range of distinct physical and chemical conditions. The data reveal a complex network of emission features with a complicated velocity structure: there is emission on all spatial scales, the morphology of which ranges from small, compact regions to extended, filamentary structures that are seen in both emission and absorption. The dust column density is well traced by molecules with higher excitation energies and critical densities, consistent with a clump that has a denser interior. A statistical analysis supports the idea that turbulence shapes the observed gas structure within G0.253+0.016. We find a clear break in the turbulent power spectrum derived from the optically thin dust continuum emission at a spatial scale of ∼0.1 pc, which may correspond to the spatial scale at which gravity has overcome the thermal pressure. We suggest that G0.253+0.016 is on the verge of forming a cluster from hierarchical, filamentary structures that arise from a highly turbulent medium. Although the stellar distribution within high-mass Arches-like clusters is compact, centrally condensed, and smooth, the observed gas distribution within G0.253+0.016 is extended, with no high-mass central concentration, and has a complex, hierarchical structure. If this clump gives rise to a high-mass cluster and its stars are formed from this initially hierarchical gas structure, then the resulting cluster must evolve into a centrally condensed structure via a dynamical process.

  12. A CLUSTER IN THE MAKING: ALMA REVEALS THE INITIAL CONDITIONS FOR HIGH-MASS CLUSTER FORMATION

    Energy Technology Data Exchange (ETDEWEB)

    Rathborne, J. M.; Contreras, Y. [CSIRO Astronomy and Space Science, P.O. Box 76, Epping NSW, 1710 (Australia); Longmore, S. N.; Bastian, N. [Astrophysics Research Institute, Liverpool John Moores University, 146 Brownlow Hill, Liverpool L3 5RF (United Kingdom); Jackson, J. M. [Institute for Astrophysical Research, Boston University, Boston, MA 02215 (United States); Alves, J. F. [University of Vienna, Türkenschanzstrasse 17, A-1180 Vienna (Austria); Bally, J. [Center for Astrophysics and Space Astronomy, University of Colorado, UCB 389, Boulder, CO 8030 (United States); Foster, J. B. [Department of Astronomy, Yale University, P.O. Box 208101 New Haven, CT 06520-8101 (United States); Garay, G. [Universidad de Chile, Camino El Observatorio1515, Las Condes, Santiago (Chile); Kruijssen, J. M. D. [Max-Planck Institut fur Astrophysik, Karl-Schwarzschild-Strasse 1, D-85748, Garching (Germany); Testi, L. [European Southern Observatory, Karl-Schwarzschild-Str. 2, D-85748 Garching bei Munchen (Germany); Walsh, A. J., E-mail: Jill.Rathborne@csiro.au [International Centre for Radio Astronomy Research, Curtin University, GPO Box U1987, Perth (Australia)

    2015-04-01

    G0.253+0.016 is a molecular clump that appears to be on the verge of forming a high-mass cluster: its extremely low dust temperature, high mass, and high density, combined with its lack of prevalent star formation, make it an excellent candidate for an Arches-like cluster in a very early stage of formation. Here we present new Atacama Large Millimeter/Sub-millimeter Array observations of its small-scale (∼0.07 pc) 3 mm dust continuum and molecular line emission from 17 different species that probe a range of distinct physical and chemical conditions. The data reveal a complex network of emission features with a complicated velocity structure: there is emission on all spatial scales, the morphology of which ranges from small, compact regions to extended, filamentary structures that are seen in both emission and absorption. The dust column density is well traced by molecules with higher excitation energies and critical densities, consistent with a clump that has a denser interior. A statistical analysis supports the idea that turbulence shapes the observed gas structure within G0.253+0.016. We find a clear break in the turbulent power spectrum derived from the optically thin dust continuum emission at a spatial scale of ∼0.1 pc, which may correspond to the spatial scale at which gravity has overcome the thermal pressure. We suggest that G0.253+0.016 is on the verge of forming a cluster from hierarchical, filamentary structures that arise from a highly turbulent medium. Although the stellar distribution within high-mass Arches-like clusters is compact, centrally condensed, and smooth, the observed gas distribution within G0.253+0.016 is extended, with no high-mass central concentration, and has a complex, hierarchical structure. If this clump gives rise to a high-mass cluster and its stars are formed from this initially hierarchical gas structure, then the resulting cluster must evolve into a centrally condensed structure via a dynamical process.

  13. Periorbital melasma: Hierarchical cluster analysis of clinical features in Asian patients.

    Science.gov (United States)

    Jung, Y S; Bae, J M; Kim, B J; Kang, J-S; Cho, S B

    2017-03-19

    Studies have shown melasma lesions to be distributed across the face in centrofacial, malar, and mandibular patterns. Meanwhile, however, melasma lesions of the periorbital area have yet to be thoroughly described. We analyzed normal and ultraviolet light-exposed photographs of patients with melasma. The periorbital melasma lesions were measured according to anatomical reference points and a hierarchical cluster analysis was performed. The periorbital melasma lesions showed clinical features of fine and homogenous melasma pigmentation, involving both the upper and lower eyelids that extended to other anatomical sites with a darker and coarser appearance. The hierarchical cluster analysis indicated that patients with periorbital melasma can be categorized into two clusters according to the surface anatomy of the face. Significant differences between cluster 1 and cluster 2 were found in lateral distance and inferolateral distance, but not in medial distance and superior distance. Comparing the two clusters, patients in cluster 2 were found to be significantly older and more commonly accompanied by melasma lesions of the temple and medial cheek. Our hierarchical cluster analysis of periorbital melasma lesions demonstrated that Asian patients with periorbital melasma can be categorized into two clusters according to the surface anatomy of the face. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  14. Hierarchical trie packet classification algorithm based on expectation-maximization clustering

    Science.gov (United States)

    Bi, Xia-an; Zhao, Junxia

    2017-01-01

    With the development of computer network bandwidth, packet classification algorithms which are able to deal with large-scale rule sets are in urgent need. Among the existing algorithms, researches on packet classification algorithms based on hierarchical trie have become an important packet classification research branch because of their widely practical use. Although hierarchical trie is beneficial to save large storage space, it has several shortcomings such as the existence of backtracking and empty nodes. This paper proposes a new packet classification algorithm, Hierarchical Trie Algorithm Based on Expectation-Maximization Clustering (HTEMC). Firstly, this paper uses the formalization method to deal with the packet classification problem by means of mapping the rules and data packets into a two-dimensional space. Secondly, this paper uses expectation-maximization algorithm to cluster the rules based on their aggregate characteristics, and thereby diversified clusters are formed. Thirdly, this paper proposes a hierarchical trie based on the results of expectation-maximization clustering. Finally, this paper respectively conducts simulation experiments and real-environment experiments to compare the performances of our algorithm with other typical algorithms, and analyzes the results of the experiments. The hierarchical trie structure in our algorithm not only adopts trie path compression to eliminate backtracking, but also solves the problem of low efficiency of trie updates, which greatly improves the performance of the algorithm. PMID:28704476

  15. The Evolution of Galaxy Clustering in Hierarchical Models

    OpenAIRE

    1999-01-01

    The main ingredients of recent semi-analytic models of galaxy formation are summarised. We present predictions for the galaxy clustering properties of a well specified LCDM model whose parameters are constrained by observed local galaxy properties. We present preliminary predictions for evolution of clustering that can be probed with deep pencil beam surveys.

  16. Hierarchicality of trade flow networks reveals complexity of products.

    Science.gov (United States)

    Shi, Peiteng; Zhang, Jiang; Yang, Bo; Luo, Jingfei

    2014-01-01

    With globalization, countries are more connected than before by trading flows, which amounts to at least 36 trillion dollars today. Interestingly, around 30-60 percents of exports consist of intermediate products in global. Therefore, the trade flow network of particular product with high added values can be regarded as value chains. The problem is weather we can discriminate between these products from their unique flow network structure? This paper applies the flow analysis method developed in ecology to 638 trading flow networks of different products. We claim that the allometric scaling exponent η can be used to characterize the degree of hierarchicality of a flow network, i.e., whether the trading products flow on long hierarchical chains. Then, it is pointed out that the flow networks of products with higher added values and complexity like machinary, transport equipment etc. have larger exponents, meaning that their trade flow networks are more hierarchical. As a result, without the extra data like global input-output table, we can identify the product categories with higher complexity, and the relative importance of a country in the global value chain by the trading network solely.

  17. Hierarchicality of trade flow networks reveals complexity of products.

    Directory of Open Access Journals (Sweden)

    Peiteng Shi

    Full Text Available With globalization, countries are more connected than before by trading flows, which amounts to at least 36 trillion dollars today. Interestingly, around 30-60 percents of exports consist of intermediate products in global. Therefore, the trade flow network of particular product with high added values can be regarded as value chains. The problem is weather we can discriminate between these products from their unique flow network structure? This paper applies the flow analysis method developed in ecology to 638 trading flow networks of different products. We claim that the allometric scaling exponent η can be used to characterize the degree of hierarchicality of a flow network, i.e., whether the trading products flow on long hierarchical chains. Then, it is pointed out that the flow networks of products with higher added values and complexity like machinary, transport equipment etc. have larger exponents, meaning that their trade flow networks are more hierarchical. As a result, without the extra data like global input-output table, we can identify the product categories with higher complexity, and the relative importance of a country in the global value chain by the trading network solely.

  18. Clinical fracture risk evaluated by hierarchical agglomerative clustering

    DEFF Research Database (Denmark)

    Kruse, Christian; Eiken, P; Vestergaard, P

    2017-01-01

    profiles. INTRODUCTION: The purposes of this study were to establish and quantify patient clusters of high, average and low fracture risk using an unsupervised machine learning algorithm. METHODS: Regional and national Danish patient data on dual-energy X-ray absorptiometry (DXA) scans, medication...... containing less than 250 subjects. Clusters were identified as high, average or low fracture risk based on bone mineral density (BMD) characteristics. Cluster-based descriptive statistics and relative Z-scores for variable means were computed. RESULTS: Ten thousand seven hundred seventy-five women were...... as low fracture risk with high to very high BMD. A mean age of 60 years was the earliest that allowed for separation of high-risk clusters. DXA scan results could identify high-risk subjects with different antiresorptive treatment compliance levels based on similarities and differences in lumbar spine...

  19. Mapping informative clusters in a hierarchical [corrected] framework of FMRI multivariate analysis.

    Directory of Open Access Journals (Sweden)

    Rui Xu

    Full Text Available Pattern recognition methods have become increasingly popular in fMRI data analysis, which are powerful in discriminating between multi-voxel patterns of brain activities associated with different mental states. However, when they are used in functional brain mapping, the location of discriminative voxels varies significantly, raising difficulties in interpreting the locus of the effect. Here we proposed a hierarchical framework of multivariate approach that maps informative clusters rather than voxels to achieve reliable functional brain mapping without compromising the discriminative power. In particular, we first searched for local homogeneous clusters that consisted of voxels with similar response profiles. Then, a multi-voxel classifier was built for each cluster to extract discriminative information from the multi-voxel patterns. Finally, through multivariate ranking, outputs from the classifiers were served as a multi-cluster pattern to identify informative clusters by examining interactions among clusters. Results from both simulated and real fMRI data demonstrated that this hierarchical approach showed better performance in the robustness of functional brain mapping than traditional voxel-based multivariate methods. In addition, the mapped clusters were highly overlapped for two perceptually equivalent object categories, further confirming the validity of our approach. In short, the hierarchical framework of multivariate approach is suitable for both pattern classification and brain mapping in fMRI studies.

  20. Kendall’s tau and agglomerative clustering for structure determination of hierarchical Archimedean copulas

    Directory of Open Access Journals (Sweden)

    Górecki J.

    2017-01-01

    Full Text Available Several successful approaches to structure determination of hierarchical Archimedean copulas (HACs proposed in the literature rely on agglomerative clustering and Kendall’s correlation coefficient. However, there has not been presented any theoretical proof justifying such approaches. This work fills this gap and introduces a theorem showing that, given the matrix of the pairwise Kendall correlation coefficients corresponding to a HAC, its structure can be recovered by an agglomerative clustering technique.

  1. Prediction of in vitro and in vivo oestrogen receptor activity using hierarchical clustering

    Science.gov (United States)

    In this study, hierarchical clustering classification models were developed to predict in vitro and in vivo oestrogen receptor (ER) activity. Classification models were developed for binding, agonist, and antagonist in vitro ER activity and for mouse in vivo uterotrophic ER bindi...

  2. Non-Hierarchical Clustering as a method to analyse an open-ended ...

    African Journals Online (AJOL)

    Apple

    tests, provide instructors with tools to probe students' conceptual knowledge of various fields of science and ... quantitative non-hierarchical clustering analysis method known as k-means (Everitt, Landau, Leese & Stahl, ...... undergraduate engineering students in creating ... mathematics-formal reasoning and the contextual.

  3. Prediction of in vitro and in vivo oestrogen receptor activity using hierarchical clustering

    Science.gov (United States)

    In this study, hierarchical clustering classification models were developed to predict in vitro and in vivo oestrogen receptor (ER) activity. Classification models were developed for binding, agonist, and antagonist in vitro ER activity and for mouse in vivo uterotrophic ER bindi...

  4. Analysis of genomic signatures in prokaryotes using multinomial regression and hierarchical clustering

    DEFF Research Database (Denmark)

    Ussery, David; Bohlin, Jon; Skjerve, Eystein

    2009-01-01

    Recently there has been an explosion in the availability of bacterial genomic sequences, making possible now an analysis of genomic signatures across more than 800 hundred different bacterial chromosomes, from a wide variety of environments. Using genomic signatures, we pair-wise compared 867...... different genomic DNA sequences, taken from chromosomes and plasmids more than 100,000 base-pairs in length. Hierarchical clustering was performed on the outcome of the comparisons before a multinomial regression model was fitted. The regression model included the cluster groups as the response variable...... AT content. Small improvements to the regression model, although significant, were also obtained by factors such as sequence size, habitat, growth temperature, selective pressure measured as oligonucleotide usage variance, and oxygen requirement.The statistics obtained using hierarchical clustering...

  5. Signatures of Hierarchical Clustering in Dark Matter Detection Experiments

    CERN Document Server

    Stiff, D; Frieman, Joshua A

    2001-01-01

    In the cold dark matter model of structure formation, galaxies are assembled hierarchically from mergers and the accretion of subclumps. This process is expected to leave residual substructure in the Galactic dark halo, including partially disrupted clumps and their associated tidal debris. We develop a model for such halo substructure and study its implications for dark matter (WIMP and axion) detection experiments. We combine the Press-Schechter model for the distribution of halo subclump masses with N-body simulations of the evolution and disruption of individual clumps as they orbit through the evolving Galaxy to derive the probability that the Earth is passing through a subclump or stream of a given density. Our results suggest that it is likely that the local complement of dark matter particles includes a 1-5% contribution from a single clump. The implications for dark matter detection experiments are significant, since the disrupted clump is composed of a `cold' flow of high-velocity particles. We desc...

  6. Clustering dynamic textures with the hierarchical em algorithm for modeling video.

    Science.gov (United States)

    Mumtaz, Adeel; Coviello, Emanuele; Lanckriet, Gert R G; Chan, Antoni B

    2013-07-01

    Dynamic texture (DT) is a probabilistic generative model, defined over space and time, that represents a video as the output of a linear dynamical system (LDS). The DT model has been applied to a wide variety of computer vision problems, such as motion segmentation, motion classification, and video registration. In this paper, we derive a new algorithm for clustering DT models that is based on the hierarchical EM algorithm. The proposed clustering algorithm is capable of both clustering DTs and learning novel DT cluster centers that are representative of the cluster members in a manner that is consistent with the underlying generative probabilistic model of the DT. We also derive an efficient recursive algorithm for sensitivity analysis of the discrete-time Kalman smoothing filter, which is used as the basis for computing expectations in the E-step of the HEM algorithm. Finally, we demonstrate the efficacy of the clustering algorithm on several applications in motion analysis, including hierarchical motion clustering, semantic motion annotation, and learning bag-of-systems (BoS) codebooks for dynamic texture recognition.

  7. Improving the Decision Value of Hierarchical Text Clustering Using Term Overlap Detection

    Directory of Open Access Journals (Sweden)

    Nilupulee Nathawitharana

    2015-09-01

    Full Text Available Humans are used to expressing themselves with written language and language provides a medium with which we can describe our experiences in detail incorporating individuality. Even though documents provide a rich source of information, it becomes very difficult to identify, extract, summarize and search when vast amounts of documents are collected especially over time. Document clustering is a technique that has been widely used to group documents based on similarity of content represented by the words used. Once key groups are identified further drill down into sub-groupings is facilitated by the use of hierarchical clustering. Clustering and hierarchical clustering are very useful when applied to numerical and categorical data and cluster accuracy and purity measures exist to evaluate the outcomes of a clustering exercise. Although the same measures have been applied to text clustering, text clusters are based on words or terms which can be repeated across documents associated with different topics. Therefore text data cannot be considered as a direct ‘coding’ of a particular experience or situation in contrast to numerical and categorical data and term overlap is a very common characteristic in text clustering. In this paper we propose a new technique and methodology for term overlap capture from text documents, highlighting the different situations such overlap could signify and discuss why such understanding is important for obtaining value from text clustering. Experiments were conducted using a widely used text document collection where the proposed methodology allowed exploring the term diversity for a given document collection and obtain clusters with minimum term overlap.

  8. Proteomic properties reveal phyloecological clusters of Archaea.

    Directory of Open Access Journals (Sweden)

    Nela Nikolic

    Full Text Available In this study, we propose a novel way to describe the variety of environmental adaptations of Archaea. We have clustered 57 Archaea by using a non-redundant set of proteomic features, and verified that the clusters correspond to environmental adaptations to the archaeal habitats. The first cluster consists dominantly of hyperthermophiles and hyperthermoacidophilic aerobes. The second cluster joins together halophilic and extremely halophilic Archaea, while the third cluster contains mesophilic (mostly methanogenic Archaea together with thermoacidophiles. The non-redundant subset of proteomic features was found to consist of five features: the ratio of charged residues to uncharged, average protein size, normalized frequency of beta-sheet, normalized frequency of extended structure and number of hydrogen bond donors. We propose this clustering to be termed phyloecological clustering. This approach could give additional insights into relationships among archaeal species that may be hidden by sole phylogenetic analysis.

  9. Proteomic Properties Reveal Phyloecological Clusters of Archaea

    Science.gov (United States)

    Nikolic, Nela; Smole, Zlatko; Krisko, Anita

    2012-01-01

    In this study, we propose a novel way to describe the variety of environmental adaptations of Archaea. We have clustered 57 Archaea by using a non-redundant set of proteomic features, and verified that the clusters correspond to environmental adaptations to the archaeal habitats. The first cluster consists dominantly of hyperthermophiles and hyperthermoacidophilic aerobes. The second cluster joins together halophilic and extremely halophilic Archaea, while the third cluster contains mesophilic (mostly methanogenic) Archaea together with thermoacidophiles. The non-redundant subset of proteomic features was found to consist of five features: the ratio of charged residues to uncharged, average protein size, normalized frequency of beta-sheet, normalized frequency of extended structure and number of hydrogen bond donors. We propose this clustering to be termed phyloecological clustering. This approach could give additional insights into relationships among archaeal species that may be hidden by sole phylogenetic analysis. PMID:23133575

  10. Multilevel hierarchical kernel spectral clustering for real-life large scale complex networks.

    Directory of Open Access Journals (Sweden)

    Raghvendra Mall

    Full Text Available Kernel spectral clustering corresponds to a weighted kernel principal component analysis problem in a constrained optimization framework. The primal formulation leads to an eigen-decomposition of a centered Laplacian matrix at the dual level. The dual formulation allows to build a model on a representative subgraph of the large scale network in the training phase and the model parameters are estimated in the validation stage. The KSC model has a powerful out-of-sample extension property which allows cluster affiliation for the unseen nodes of the big data network. In this paper we exploit the structure of the projections in the eigenspace during the validation stage to automatically determine a set of increasing distance thresholds. We use these distance thresholds in the test phase to obtain multiple levels of hierarchy for the large scale network. The hierarchical structure in the network is determined in a bottom-up fashion. We empirically showcase that real-world networks have multilevel hierarchical organization which cannot be detected efficiently by several state-of-the-art large scale hierarchical community detection techniques like the Louvain, OSLOM and Infomap methods. We show that a major advantage of our proposed approach is the ability to locate good quality clusters at both the finer and coarser levels of hierarchy using internal cluster quality metrics on 7 real-life networks.

  11. HCsnip: An R Package for Semi-supervised Snipping of the Hierarchical Clustering Tree.

    Science.gov (United States)

    Obulkasim, Askar; van de Wiel, Mark A

    2015-01-01

    Hierarchical clustering (HC) is one of the most frequently used methods in computational biology in the analysis of high-dimensional genomics data. Given a data set, HC outputs a binary tree leaves of which are the data points and internal nodes represent clusters of various sizes. Normally, a fixed-height cut on the HC tree is chosen, and each contiguous branch of data points below that height is considered as a separate cluster. However, the fixed-height branch cut may not be ideal in situations where one expects a complicated tree structure with nested clusters. Furthermore, due to lack of utilization of related background information in selecting the cutoff, induced clusters are often difficult to interpret. This paper describes a novel procedure that aims to automatically extract meaningful clusters from the HC tree in a semi-supervised way. The procedure is implemented in the R package HCsnip available from Bioconductor. Rather than cutting the HC tree at a fixed-height, HCsnip probes the various way of snipping, possibly at variable heights, to tease out hidden clusters ensconced deep down in the tree. The cluster extraction process utilizes, along with the data set from which the HC tree is derived, commonly available background information. Consequently, the extracted clusters are highly reproducible and robust against various sources of variations that "haunted" high-dimensional genomics data. Since the clustering process is guided by the background information, clusters are easy to interpret. Unlike existing packages, no constraint is placed on the data type on which clustering is desired. Particularly, the package accepts patient follow-up data for guiding the cluster extraction process. To our knowledge, HCsnip is the first package that is able to decomposes the HC tree into clusters with piecewise snipping under the guidance of patient time-to-event information. Our implementation of the semi-supervised HC tree snipping framework is generic, and can

  12. A Bayesian Alternative to Mutual Information for the Hierarchical Clustering of Dependent Random Variables.

    Directory of Open Access Journals (Sweden)

    Guillaume Marrelec

    Full Text Available The use of mutual information as a similarity measure in agglomerative hierarchical clustering (AHC raises an important issue: some correction needs to be applied for the dimensionality of variables. In this work, we formulate the decision of merging dependent multivariate normal variables in an AHC procedure as a Bayesian model comparison. We found that the Bayesian formulation naturally shrinks the empirical covariance matrix towards a matrix set a priori (e.g., the identity, provides an automated stopping rule, and corrects for dimensionality using a term that scales up the measure as a function of the dimensionality of the variables. Also, the resulting log Bayes factor is asymptotically proportional to the plug-in estimate of mutual information, with an additive correction for dimensionality in agreement with the Bayesian information criterion. We investigated the behavior of these Bayesian alternatives (in exact and asymptotic forms to mutual information on simulated and real data. An encouraging result was first derived on simulations: the hierarchical clustering based on the log Bayes factor outperformed off-the-shelf clustering techniques as well as raw and normalized mutual information in terms of classification accuracy. On a toy example, we found that the Bayesian approaches led to results that were similar to those of mutual information clustering techniques, with the advantage of an automated thresholding. On real functional magnetic resonance imaging (fMRI datasets measuring brain activity, it identified clusters consistent with the established outcome of standard procedures. On this application, normalized mutual information had a highly atypical behavior, in the sense that it systematically favored very large clusters. These initial experiments suggest that the proposed Bayesian alternatives to mutual information are a useful new tool for hierarchical clustering.

  13. A Bayesian Alternative to Mutual Information for the Hierarchical Clustering of Dependent Random Variables.

    Science.gov (United States)

    Marrelec, Guillaume; Messé, Arnaud; Bellec, Pierre

    2015-01-01

    The use of mutual information as a similarity measure in agglomerative hierarchical clustering (AHC) raises an important issue: some correction needs to be applied for the dimensionality of variables. In this work, we formulate the decision of merging dependent multivariate normal variables in an AHC procedure as a Bayesian model comparison. We found that the Bayesian formulation naturally shrinks the empirical covariance matrix towards a matrix set a priori (e.g., the identity), provides an automated stopping rule, and corrects for dimensionality using a term that scales up the measure as a function of the dimensionality of the variables. Also, the resulting log Bayes factor is asymptotically proportional to the plug-in estimate of mutual information, with an additive correction for dimensionality in agreement with the Bayesian information criterion. We investigated the behavior of these Bayesian alternatives (in exact and asymptotic forms) to mutual information on simulated and real data. An encouraging result was first derived on simulations: the hierarchical clustering based on the log Bayes factor outperformed off-the-shelf clustering techniques as well as raw and normalized mutual information in terms of classification accuracy. On a toy example, we found that the Bayesian approaches led to results that were similar to those of mutual information clustering techniques, with the advantage of an automated thresholding. On real functional magnetic resonance imaging (fMRI) datasets measuring brain activity, it identified clusters consistent with the established outcome of standard procedures. On this application, normalized mutual information had a highly atypical behavior, in the sense that it systematically favored very large clusters. These initial experiments suggest that the proposed Bayesian alternatives to mutual information are a useful new tool for hierarchical clustering.

  14. Using hierarchical clustering of secreted protein families to classify and rank candidate effectors of rust fungi.

    Directory of Open Access Journals (Sweden)

    Diane G O Saunders

    Full Text Available Rust fungi are obligate biotrophic pathogens that cause considerable damage on crop plants. Puccinia graminis f. sp. tritici, the causal agent of wheat stem rust, and Melampsora larici-populina, the poplar leaf rust pathogen, have strong deleterious impacts on wheat and poplar wood production, respectively. Filamentous pathogens such as rust fungi secrete molecules called disease effectors that act as modulators of host cell physiology and can suppress or trigger host immunity. Current knowledge on effectors from other filamentous plant pathogens can be exploited for the characterisation of effectors in the genome of recently sequenced rust fungi. We designed a comprehensive in silico analysis pipeline to identify the putative effector repertoire from the genome of two plant pathogenic rust fungi. The pipeline is based on the observation that known effector proteins from filamentous pathogens have at least one of the following properties: (i contain a secretion signal, (ii are encoded by in planta induced genes, (iii have similarity to haustorial proteins, (iv are small and cysteine rich, (v contain a known effector motif or a nuclear localization signal, (vi are encoded by genes with long intergenic regions, (vii contain internal repeats, and (viii do not contain PFAM domains, except those associated with pathogenicity. We used Markov clustering and hierarchical clustering to classify protein families of rust pathogens and rank them according to their likelihood of being effectors. Using this approach, we identified eight families of candidate effectors that we consider of high value for functional characterization. This study revealed a diverse set of candidate effectors, including families of haustorial expressed secreted proteins and small cysteine-rich proteins. This comprehensive classification of candidate effectors from these devastating rust pathogens is an initial step towards probing plant germplasm for novel resistance components.

  15. Using Hierarchical Clustering of Secreted Protein Families to Classify and Rank Candidate Effectors of Rust Fungi

    Science.gov (United States)

    Saunders, Diane G. O.; Win, Joe; Cano, Liliana M.; Szabo, Les J.; Kamoun, Sophien; Raffaele, Sylvain

    2012-01-01

    Rust fungi are obligate biotrophic pathogens that cause considerable damage on crop plants. Puccinia graminis f. sp. tritici, the causal agent of wheat stem rust, and Melampsora larici-populina, the poplar leaf rust pathogen, have strong deleterious impacts on wheat and poplar wood production, respectively. Filamentous pathogens such as rust fungi secrete molecules called disease effectors that act as modulators of host cell physiology and can suppress or trigger host immunity. Current knowledge on effectors from other filamentous plant pathogens can be exploited for the characterisation of effectors in the genome of recently sequenced rust fungi. We designed a comprehensive in silico analysis pipeline to identify the putative effector repertoire from the genome of two plant pathogenic rust fungi. The pipeline is based on the observation that known effector proteins from filamentous pathogens have at least one of the following properties: (i) contain a secretion signal, (ii) are encoded by in planta induced genes, (iii) have similarity to haustorial proteins, (iv) are small and cysteine rich, (v) contain a known effector motif or a nuclear localization signal, (vi) are encoded by genes with long intergenic regions, (vii) contain internal repeats, and (viii) do not contain PFAM domains, except those associated with pathogenicity. We used Markov clustering and hierarchical clustering to classify protein families of rust pathogens and rank them according to their likelihood of being effectors. Using this approach, we identified eight families of candidate effectors that we consider of high value for functional characterization. This study revealed a diverse set of candidate effectors, including families of haustorial expressed secreted proteins and small cysteine-rich proteins. This comprehensive classification of candidate effectors from these devastating rust pathogens is an initial step towards probing plant germplasm for novel resistance components. PMID:22238666

  16. The evolution of Brightest Cluster Galaxies in a hierarchical universe

    CERN Document Server

    Tonini, Chiara; Croton, Darren; Maraston, Claudia; Thomas, Daniel

    2012-01-01

    We investigate the evolution of Brightest Cluster Galaxies (BCGs) from redshift z~1.6 to z~0. We use the semi-analytic model of Croton et al. (2006) with a new spectro-photometric model based on the Maraston (2005) stellar populations and a new recipe for the dust extinction. We compare the model predictions of the K-band luminosity evolution and the J-K, V-I and I-K colour evolution with a series of datasets, including Collins et al. (Nature, 2009) who argued that semi-analytic models based on the Millennium simulation cannot reproduce the red colours and high luminosity of BCGs at z>1. We show instead that the model is well in range of the observed luminosity and correctly reproduces the colour evolution of BCGs in the whole redshift range up to z~1.6. We argue that the success of the semi-analytic model is in large part due to the implementation of a more sophisticated spectro-photometric model. An analysis of the model BCGs shows an increase in mass by a factor ~2 since z~1, and star formation activity do...

  17. Hierarchical clustering of ryanodine receptors enables emergence of a calcium clock in sinoatrial node cells.

    Science.gov (United States)

    Stern, Michael D; Maltseva, Larissa A; Juhaszova, Magdalena; Sollott, Steven J; Lakatta, Edward G; Maltsev, Victor A

    2014-05-01

    The sinoatrial node, whose cells (sinoatrial node cells [SANCs]) generate rhythmic action potentials, is the primary pacemaker of the heart. During diastole, calcium released from the sarcoplasmic reticulum (SR) via ryanodine receptors (RyRs) interacts with membrane currents to control the rate of the heartbeat. This "calcium clock" takes the form of stochastic, partially periodic, localized calcium release (LCR) events that propagate, wave-like, for limited distances. The detailed mechanisms controlling the calcium clock are not understood. We constructed a computational model of SANCs, including three-dimensional diffusion and buffering of calcium in the cytosol and SR; explicit, stochastic gating of individual RyRs and L-type calcium channels; and a full complement of voltage- and calcium-dependent membrane currents. We did not include an anatomical submembrane space or inactivation of RyRs, the two heuristic components that have been used in prior models but are not observed experimentally. When RyRs were distributed in discrete clusters separated by >1 µm, only isolated sparks were produced in this model and LCR events did not form. However, immunofluorescent staining of SANCs for RyR revealed the presence of bridging RyR groups between large clusters, forming an irregular network. Incorporation of this architecture into the model led to the generation of propagating LCR events. Partial periodicity emerged from the interaction of LCR events, as observed experimentally. This calcium clock becomes entrained with membrane currents to accelerate the beating rate, which therefore was controlled by the activity of the SERCA pump, RyR sensitivity, and L-type current amplitude, all of which are targets of β-adrenergic-mediated phosphorylation. Unexpectedly, simulations revealed the existence of a pathological mode at high RyR sensitivity to calcium, in which the calcium clock loses synchronization with the membrane, resulting in a paradoxical decrease in beating

  18. Bayesian latent variable models for hierarchical clustered count outcomes with repeated measures in microbiome studies.

    Science.gov (United States)

    Xu, Lizhen; Paterson, Andrew D; Xu, Wei

    2017-04-01

    Motivated by the multivariate nature of microbiome data with hierarchical taxonomic clusters, counts that are often skewed and zero inflated, and repeated measures, we propose a Bayesian latent variable methodology to jointly model multiple operational taxonomic units within a single taxonomic cluster. This novel method can incorporate both negative binomial and zero-inflated negative binomial responses, and can account for serial and familial correlations. We develop a Markov chain Monte Carlo algorithm that is built on a data augmentation scheme using Pólya-Gamma random variables. Hierarchical centering and parameter expansion techniques are also used to improve the convergence of the Markov chain. We evaluate the performance of our proposed method through extensive simulations. We also apply our method to a human microbiome study.

  19. Bayesian hierarchical clustering for studying cancer gene expression data with unknown statistics.

    Directory of Open Access Journals (Sweden)

    Korsuk Sirinukunwattana

    Full Text Available Clustering analysis is an important tool in studying gene expression data. The Bayesian hierarchical clustering (BHC algorithm can automatically infer the number of clusters and uses Bayesian model selection to improve clustering quality. In this paper, we present an extension of the BHC algorithm. Our Gaussian BHC (GBHC algorithm represents data as a mixture of Gaussian distributions. It uses normal-gamma distribution as a conjugate prior on the mean and precision of each of the Gaussian components. We tested GBHC over 11 cancer and 3 synthetic datasets. The results on cancer datasets show that in sample clustering, GBHC on average produces a clustering partition that is more concordant with the ground truth than those obtained from other commonly used algorithms. Furthermore, GBHC frequently infers the number of clusters that is often close to the ground truth. In gene clustering, GBHC also produces a clustering partition that is more biologically plausible than several other state-of-the-art methods. This suggests GBHC as an alternative tool for studying gene expression data. The implementation of GBHC is available at https://sites.google.com/site/gaussianbhc/

  20. Hierarchical and Non-Hierarchical Linear and Non-Linear Clustering Methods to “Shakespeare Authorship Question”

    Directory of Open Access Journals (Sweden)

    Refat Aljumily

    2015-09-01

    Full Text Available A few literary scholars have long claimed that Shakespeare did not write some of his best plays (history plays and tragedies and proposed at one time or another various suspect authorship candidates. Most modern-day scholars of Shakespeare have rejected this claim, arguing that strong evidence that Shakespeare wrote the plays and poems being his name appears on them as the author. This has caused and led to an ongoing scholarly academic debate for quite some long time. Stylometry is a fast-growing field often used to attribute authorship to anonymous or disputed texts. Stylometric attempts to resolve this literary puzzle have raised interesting questions over the past few years. The following paper contributes to “the Shakespeare authorship question” by using a mathematically-based methodology to examine the hypothesis that Shakespeare wrote all the disputed plays traditionally attributed to him. More specifically, the mathematically based methodology used here is based on Mean Proximity, as a linear hierarchical clustering method, and on Principal Components Analysis, as a non-hierarchical linear clustering method. It is also based, for the first time in the domain, on Self-Organizing Map U-Matrix and Voronoi Map, as non-linear clustering methods to cover the possibility that our data contains significant non-linearities. Vector Space Model (VSM is used to convert texts into vectors in a high dimensional space. The aim of which is to compare the degrees of similarity within and between limited samples of text (the disputed plays. The various works and plays assumed to have been written by Shakespeare and possible authors notably, Sir Francis Bacon, Christopher Marlowe, John Fletcher, and Thomas Kyd, where “similarity” is defined in terms of correlation/distance coefficient measure based on the frequency of usage profiles of function words, word bi-grams, and character triple-grams. The claim that Shakespeare authored all the disputed

  1. An Exactly Soluble Hierarchical Clustering Model Inverse Cascades, Self-Similarity, and Scaling

    CERN Document Server

    Gabrielov, A; Turcotte, D L

    1999-01-01

    We show how clustering as a general hierarchical dynamical process proceeds via a sequence of inverse cascades to produce self-similar scaling, as an intermediate asymptotic, which then truncates at the largest spatial scales. We show how this model can provide a general explanation for the behavior of several models that has been described as ``self-organized critical,'' including forest-fire, sandpile, and slider-block models.

  2. Recursive Hierarchical Image Segmentation by Region Growing and Constrained Spectral Clustering

    Science.gov (United States)

    Tilton, James C.

    2002-01-01

    This paper describes an algorithm for hierarchical image segmentation (referred to as HSEG) and its recursive formulation (referred to as RHSEG). The HSEG algorithm is a hybrid of region growing and constrained spectral clustering that produces a hierarchical set of image segmentations based on detected convergence points. In the main, HSEG employs the hierarchical stepwise optimization (HS WO) approach to region growing, which seeks to produce segmentations that are more optimized than those produced by more classic approaches to region growing. In addition, HSEG optionally interjects between HSWO region growing iterations merges between spatially non-adjacent regions (i.e., spectrally based merging or clustering) constrained by a threshold derived from the previous HSWO region growing iteration. While the addition of constrained spectral clustering improves the segmentation results, especially for larger images, it also significantly increases HSEG's computational requirements. To counteract this, a computationally efficient recursive, divide-and-conquer, implementation of HSEG (RHSEG) has been devised and is described herein. Included in this description is special code that is required to avoid processing artifacts caused by RHSEG s recursive subdivision of the image data. Implementations for single processor and for multiple processor computer systems are described. Results with Landsat TM data are included comparing HSEG with classic region growing. Finally, an application to image information mining and knowledge discovery is discussed.

  3. Hierarchical Adaptive Means (HAM) clustering for hardware-efficient, unsupervised and real-time spike sorting.

    Science.gov (United States)

    Paraskevopoulou, Sivylla E; Wu, Di; Eftekhar, Amir; Constandinou, Timothy G

    2014-09-30

    This work presents a novel unsupervised algorithm for real-time adaptive clustering of neural spike data (spike sorting). The proposed Hierarchical Adaptive Means (HAM) clustering method combines centroid-based clustering with hierarchical cluster connectivity to classify incoming spikes using groups of clusters. It is described how the proposed method can adaptively track the incoming spike data without requiring any past history, iteration or training and autonomously determines the number of spike classes. Its performance (classification accuracy) has been tested using multiple datasets (both simulated and recorded) achieving a near-identical accuracy compared to k-means (using 10-iterations and provided with the number of spike classes). Also, its robustness in applying to different feature extraction methods has been demonstrated by achieving classification accuracies above 80% across multiple datasets. Last but crucially, its low complexity, that has been quantified through both memory and computation requirements makes this method hugely attractive for future hardware implementation. Copyright © 2014 Elsevier B.V. All rights reserved.

  4. Permutation Tests of Hierarchical Cluster Analyses of Carrion Communities and Their Potential Use in Forensic Entomology.

    Science.gov (United States)

    van der Ham, Joris L

    2016-05-19

    Forensic entomologists can use carrion communities' ecological succession data to estimate the postmortem interval (PMI). Permutation tests of hierarchical cluster analyses of these data provide a conceptual method to estimate part of the PMI, the post-colonization interval (post-CI). This multivariate approach produces a baseline of statistically distinct clusters that reflect changes in the carrion community composition during the decomposition process. Carrion community samples of unknown post-CIs are compared with these baseline clusters to estimate the post-CI. In this short communication, I use data from previously published studies to demonstrate the conceptual feasibility of this multivariate approach. Analyses of these data produce series of significantly distinct clusters, which represent carrion communities during 1- to 20-day periods of the decomposition process. For 33 carrion community samples, collected over an 11-day period, this approach correctly estimated the post-CI within an average range of 3.1 days.

  5. An energy efficient cooperative hierarchical MIMO clustering scheme for wireless sensor networks.

    Science.gov (United States)

    Nasim, Mehwish; Qaisar, Saad; Lee, Sungyoung

    2012-01-01

    In this work, we present an energy efficient hierarchical cooperative clustering scheme for wireless sensor networks. Communication cost is a crucial factor in depleting the energy of sensor nodes. In the proposed scheme, nodes cooperate to form clusters at each level of network hierarchy ensuring maximal coverage and minimal energy expenditure with relatively uniform distribution of load within the network. Performance is enhanced by cooperative multiple-input multiple-output (MIMO) communication ensuring energy efficiency for WSN deployments over large geographical areas. We test our scheme using TOSSIM and compare the proposed scheme with cooperative multiple-input multiple-output (CMIMO) clustering scheme and traditional multihop Single-Input-Single-Output (SISO) routing approach. Performance is evaluated on the basis of number of clusters, number of hops, energy consumption and network lifetime. Experimental results show significant energy conservation and increase in network lifetime as compared to existing schemes.

  6. To Aggregate or Not and Potentially Better Questions for Clustered Data: The Need for Hierarchical Linear Modeling in CTE Research

    Science.gov (United States)

    Nimon, Kim

    2012-01-01

    Using state achievement data that are openly accessible, this paper demonstrates the application of hierarchical linear modeling within the context of career technical education research. Three prominent approaches to analyzing clustered data (i.e., modeling aggregated data, modeling disaggregated data, modeling hierarchical data) are discussed…

  7. Determination of genetic structure of germplasm collections: are traditional hierarchical clustering methods appropriate for molecular marker data?

    Science.gov (United States)

    Odong, T L; van Heerwaarden, J; Jansen, J; van Hintum, T J L; van Eeuwijk, F A

    2011-07-01

    Despite the availability of newer approaches, traditional hierarchical clustering remains very popular in genetic diversity studies in plants. However, little is known about its suitability for molecular marker data. We studied the performance of traditional hierarchical clustering techniques using real and simulated molecular marker data. Our study also compared the performance of traditional hierarchical clustering with model-based clustering (STRUCTURE). We showed that the cophenetic correlation coefficient is directly related to subgroup differentiation and can thus be used as an indicator of the presence of genetically distinct subgroups in germplasm collections. Whereas UPGMA performed well in preserving distances between accessions, Ward excelled in recovering groups. Our results also showed a close similarity between clusters obtained by Ward and by STRUCTURE. Traditional cluster analysis can provide an easy and effective way of determining structure in germplasm collections using molecular marker data, and, the output can be used for sampling core collections or for association studies.

  8. Hierarchical Clustering Multi-Task Learning for Joint Human Action Grouping and Recognition.

    Science.gov (United States)

    Liu, An-An; Su, Yu-Ting; Nie, Wei-Zhi; Kankanhalli, Mohan

    2017-01-01

    This paper proposes a hierarchical clustering multi-task learning (HC-MTL) method for joint human action grouping and recognition. Specifically, we formulate the objective function into the group-wise least square loss regularized by low rank and sparsity with respect to two latent variables, model parameters and grouping information, for joint optimization. To handle this non-convex optimization, we decompose it into two sub-tasks, multi-task learning and task relatedness discovery. First, we convert this non-convex objective function into the convex formulation by fixing the latent grouping information. This new objective function focuses on multi-task learning by strengthening the shared-action relationship and action-specific feature learning. Second, we leverage the learned model parameters for the task relatedness measure and clustering. In this way, HC-MTL can attain both optimal action models and group discovery by alternating iteratively. The proposed method is validated on three kinds of challenging datasets, including six realistic action datasets (Hollywood2, YouTube, UCF Sports, UCF50, HMDB51 & UCF101), two constrained datasets (KTH & TJU), and two multi-view datasets (MV-TJU & IXMAS). The extensive experimental results show that: 1) HC-MTL can produce competing performances to the state of the arts for action recognition and grouping; 2) HC-MTL can overcome the difficulty in heuristic action grouping simply based on human knowledge; 3) HC-MTL can avoid the possible inconsistency between the subjective action grouping depending on human knowledge and objective action grouping based on the feature subspace distributions of multiple actions. Comparison with the popular clustered multi-task learning further reveals that the discovered latent relatedness by HC-MTL aids inducing the group-wise multi-task learning and boosts the performance. To the best of our knowledge, ours is the first work that breaks the assumption that all actions are either

  9. SHIPS: Spectral Hierarchical clustering for the Inference of Population Structure in genetic studies.

    Science.gov (United States)

    Bouaziz, Matthieu; Paccard, Caroline; Guedj, Mickael; Ambroise, Christophe

    2012-01-01

    Inferring the structure of populations has many applications for genetic research. In addition to providing information for evolutionary studies, it can be used to account for the bias induced by population stratification in association studies. To this end, many algorithms have been proposed to cluster individuals into genetically homogeneous sub-populations. The parametric algorithms, such as Structure, are very popular but their underlying complexity and their high computational cost led to the development of faster parametric alternatives such as Admixture. Alternatives to these methods are the non-parametric approaches. Among this category, AWclust has proven efficient but fails to properly identify population structure for complex datasets. We present in this article a new clustering algorithm called Spectral Hierarchical clustering for the Inference of Population Structure (SHIPS), based on a divisive hierarchical clustering strategy, allowing a progressive investigation of population structure. This method takes genetic data as input to cluster individuals into homogeneous sub-populations and with the use of the gap statistic estimates the optimal number of such sub-populations. SHIPS was applied to a set of simulated discrete and admixed datasets and to real SNP datasets, that are data from the HapMap and Pan-Asian SNP consortium. The programs Structure, Admixture, AWclust and PCAclust were also investigated in a comparison study. SHIPS and the parametric approach Structure were the most accurate when applied to simulated datasets both in terms of individual assignments and estimation of the correct number of clusters. The analysis of the results on the real datasets highlighted that the clusterings of SHIPS were the more consistent with the population labels or those produced by the Admixture program. The performances of SHIPS when applied to SNP data, along with its relatively low computational cost and its ease of use make this method a promising

  10. Hierarchical clustering

    Directory of Open Access Journals (Sweden)

    L. Infante

    2002-01-01

    Full Text Available En esta contribuci on presento resultados recientes sobre las propiedades de acumulaci on de galaxias, grupos, c umulos y superc umulos de bajo redshift (z 1. Presento, a su vez, lo esperado y lo medido con respecto al grado de evoluci on de la acumulaci on de galaxias. Hemos usado el cat alogo fotom etrico de galaxias extra do de las primeras im agenes del \\Sloan Digital Sky Survey", para estudiar las propiedades de acumulaci on de peque~nas estructuras de galaxias, pares, tr os, cuartetos, quintetos, etc. Un an alisis de la funci on de correlaci on de dos puntos, en un area de 250 grados cuadrados del cielo, muestra que estos objetos, al parecer, est an mucho m as acumulados que galaxias individuales.

  11. Hierarchical Clustering of Large Databases and Classification of Antibiotics at High Noise Levels

    Directory of Open Access Journals (Sweden)

    Alexander V. Yarkov

    2008-12-01

    Full Text Available A new algorithm for divisive hierarchical clustering of chemical compounds based on 2D structural fragments is suggested. The algorithm is deterministic, and given a random ordering of the input, will always give the same clustering and can process a database up to 2 million records on a standard PC. The algorithm was used for classification of 1,183 antibiotics mixed with 999,994 random chemical structures. Similarity threshold, at which best separation of active and non active compounds took place, was estimated as 0.6. 85.7% of the antibiotics were successfully classified at this threshold with 0.4% of inaccurate compounds. A .sdf file was created with the probe molecules for clustering of external databases.

  12. A novel approach to the problem of non-uniqueness of the solution in hierarchical clustering.

    Science.gov (United States)

    Cattinelli, Isabella; Valentini, Giorgio; Paulesu, Eraldo; Borghese, Nunzio Alberto

    2013-07-01

    The existence of multiple solutions in clustering, and in hierarchical clustering in particular, is often ignored in practical applications. However, this is a non-trivial problem, as different data orderings can result in different cluster sets that, in turns, may lead to different interpretations of the same data. The method presented here offers a solution to this issue. It is based on the definition of an equivalence relation over dendrograms that allows developing all and only the significantly different dendrograms for the same dataset, thus reducing the computational complexity to polynomial from the exponential obtained when all possible dendrograms are considered. Experimental results in the neuroimaging and bioinformatics domains show the effectiveness of the proposed method.

  13. A supplier selection using a hybrid grey based hierarchical clustering and artificial bee colony

    Directory of Open Access Journals (Sweden)

    Farshad Faezy Razi

    2014-06-01

    Full Text Available Selection of one or a combination of the most suitable potential providers and outsourcing problem is the most important strategies in logistics and supply chain management. In this paper, selection of an optimal combination of suppliers in inventory and supply chain management are studied and analyzed via multiple attribute decision making approach, data mining and evolutionary optimization algorithms. For supplier selection in supply chain, hierarchical clustering according to the studied indexes first clusters suppliers. Then, according to its cluster, each supplier is evaluated through Grey Relational Analysis. Then the combination of suppliers’ Pareto optimal rank and costs are obtained using Artificial Bee Colony meta-heuristic algorithm. A case study is conducted for a better description of a new algorithm to select a multiple source of suppliers.

  14. Using Dynamic Quantum Clustering to Analyze Hierarchically Heterogeneous Samples on the Nanoscale

    Energy Technology Data Exchange (ETDEWEB)

    Hume, Allison; /Princeton U. /SLAC

    2012-09-07

    Dynamic Quantum Clustering (DQC) is an unsupervised, high visual data mining technique. DQC was tested as an analysis method for X-ray Absorption Near Edge Structure (XANES) data from the Transmission X-ray Microscopy (TXM) group. The TXM group images hierarchically heterogeneous materials with nanoscale resolution and large field of view. XANES data consists of energy spectra for each pixel of an image. It was determined that DQC successfully identifies structure in data of this type without prior knowledge of the components in the sample. Clusters and sub-clusters clearly reflected features of the spectra that identified chemical component, chemical environment, and density in the image. DQC can also be used in conjunction with the established data analysis technique, which does require knowledge of components present.

  15. A hierarchical cluster analysis of normal-tension glaucoma using spectral-domain optical coherence tomography parameters.

    Science.gov (United States)

    Bae, Hyoung Won; Ji, Yongwoo; Lee, Hye Sun; Lee, Naeun; Hong, Samin; Seong, Gong Je; Sung, Kyung Rim; Kim, Chan Yun

    2015-01-01

    Normal-tension glaucoma (NTG) is a heterogenous disease, and there is still controversy about subclassifications of this disorder. On the basis of spectral-domain optical coherence tomography (SD-OCT), we subdivided NTG with hierarchical cluster analysis using optic nerve head (ONH) parameters and retinal nerve fiber layer (RNFL) thicknesses. A total of 200 eyes of 200 NTG patients between March 2011 and June 2012 underwent SD-OCT scans to measure ONH parameters and RNFL thicknesses. We classified NTG into homogenous subgroups based on these variables using a hierarchical cluster analysis, and compared clusters to evaluate diverse NTG characteristics. Three clusters were found after hierarchical cluster analysis. Cluster 1 (62 eyes) had the thickest RNFL and widest rim area, and showed early glaucoma features. Cluster 2 (60 eyes) was characterized by the largest cup/disc ratio and cup volume, and showed advanced glaucomatous damage. Cluster 3 (78 eyes) had small disc areas in SD-OCT and were comprised of patients with significantly younger age, longer axial length, and greater myopia than the other 2 groups. A hierarchical cluster analysis of SD-OCT scans divided NTG patients into 3 groups based upon ONH parameters and RNFL thicknesses. It is anticipated that the small disc area group comprised of younger and more myopic patients may show unique features unlike the other 2 groups.

  16. Water quality assessment with hierarchical cluster analysis based on Mahalanobis distance.

    Science.gov (United States)

    Du, Xiangjun; Shao, Fengjing; Wu, Shunyao; Zhang, Hanlin; Xu, Si

    2017-07-01

    Water quality assessment is crucial for assessment of marine eutrophication, prediction of harmful algal blooms, and environment protection. Previous studies have developed many numeric modeling methods and data driven approaches for water quality assessment. The cluster analysis, an approach widely used for grouping data, has also been employed. However, there are complex correlations between water quality variables, which play important roles in water quality assessment but have always been overlooked. In this paper, we analyze correlations between water quality variables and propose an alternative method for water quality assessment with hierarchical cluster analysis based on Mahalanobis distance. Further, we cluster water quality data collected form coastal water of Bohai Sea and North Yellow Sea of China, and apply clustering results to evaluate its water quality. To evaluate the validity, we also cluster the water quality data with cluster analysis based on Euclidean distance, which are widely adopted by previous studies. The results show that our method is more suitable for water quality assessment with many correlated water quality variables. To our knowledge, it is the first attempt to apply Mahalanobis distance for coastal water quality assessment.

  17. Prioritizing the risk of plant pests by clustering methods; self-organising maps, k-means and hierarchical clustering

    Directory of Open Access Journals (Sweden)

    Susan Worner

    2013-09-01

    -means, hierarchical clustering and the incorporation of the SOM analysis into criteria based approaches to assess pest risk.

  18. Intensity-based hierarchical clustering in CT-scans: application to interactive segmentation in cardiology

    Science.gov (United States)

    Hadida, Jonathan; Desrosiers, Christian; Duong, Luc

    2011-03-01

    The segmentation of anatomical structures in Computed Tomography Angiography (CTA) is a pre-operative task useful in image guided surgery. Even though very robust and precise methods have been developed to help achieving a reliable segmentation (level sets, active contours, etc), it remains very time consuming both in terms of manual interactions and in terms of computation time. The goal of this study is to present a fast method to find coarse anatomical structures in CTA with few parameters, based on hierarchical clustering. The algorithm is organized as follows: first, a fast non-parametric histogram clustering method is proposed to compute a piecewise constant mask. A second step then indexes all the space-connected regions in the piecewise constant mask. Finally, a hierarchical clustering is achieved to build a graph representing the connections between the various regions in the piecewise constant mask. This step builds up a structural knowledge about the image. Several interactive features for segmentation are presented, for instance association or disassociation of anatomical structures. A comparison with the Mean-Shift algorithm is presented.

  19. Validation of hierarchical cluster analysis for identification of bacterial species using 42 bacterial isolates

    Science.gov (United States)

    Ghebremedhin, Meron; Yesupriya, Shubha; Luka, Janos; Crane, Nicole J.

    2015-03-01

    Recent studies have demonstrated the potential advantages of the use of Raman spectroscopy in the biomedical field due to its rapidity and noninvasive nature. In this study, Raman spectroscopy is applied as a method for differentiating between bacteria isolates for Gram status and Genus species. We created models for identifying 28 bacterial isolates using spectra collected with a 785 nm laser excitation Raman spectroscopic system. In order to investigate the groupings of these samples, partial least squares discriminant analysis (PLSDA) and hierarchical cluster analysis (HCA) was implemented. In addition, cluster analyses of the isolates were performed using various data types consisting of, biochemical tests, gene sequence alignment, high resolution melt (HRM) analysis and antimicrobial susceptibility tests of minimum inhibitory concentration (MIC) and degree of antimicrobial resistance (SIR). In order to evaluate the ability of these models to correctly classify bacterial isolates using solely Raman spectroscopic data, a set of 14 validation samples were tested using the PLSDA models and consequently the HCA models. External cluster evaluation criteria of purity and Rand index were calculated at different taxonomic levels to compare the performance of clustering using Raman spectra as well as the other datasets. Results showed that Raman spectra performed comparably, and in some cases better than, the other data types with Rand index and purity values up to 0.933 and 0.947, respectively. This study clearly demonstrates that the discrimination of bacterial species using Raman spectroscopic data and hierarchical cluster analysis is possible and has the potential to be a powerful point-of-care tool in clinical settings.

  20. Hierarchical Regional Disparities and Potential Sector Identification Using Modified Agglomerative Clustering

    Science.gov (United States)

    Munandar, T. A.; Azhari; Mushdholifah, A.; Arsyad, L.

    2017-03-01

    Disparities in regional development methods are commonly identified using the Klassen Typology and Location Quotient. Both methods typically use the data on the gross regional domestic product (GRDP) sectors of a particular region. The Klassen approach can identify regional disparities by classifying the GRDP sector data into four classes, namely Quadrants I, II, III, and IV. Each quadrant indicates a certain level of regional disparities based on the GRDP sector value of the said region. Meanwhile, the Location Quotient (LQ) is usually used to identify potential sectors in a particular region so as to determine which sectors are potential and which ones are not potential. LQ classifies each sector into three classes namely, the basic sector, the non-basic sector with a competitive advantage, and the non-basic sector which can only meet its own necessities. Both Klassen Typology and LQ are unable to visualize the relationship of achievements in the development clearly of each region and sector. This research aimed to develop a new approach to the identification of disparities in regional development in the form of hierarchical clustering. The method of Hierarchical Agglomerative Clustering (HAC) was employed as the basis of the hierarchical clustering model for identifying disparities in regional development. Modifications were made to HAC using the Klassen Typology and LQ. Then, HAC which had been modified using the Klassen Typology was called MHACK while HAC which had been modified using LQ was called MACLoQ. Both algorithms can be used to identify regional disparities (MHACK) and potential sectors (MACLoQ), respectively, in the form of hierarchical clusters. Based on the MHACK in 31 regencies in Central Java Province, it is identified that 3 regencies (Demak, Jepara, and Magelang City) fall into the category of developed and rapidly-growing regions, while the other 28 regencies fall into the category of developed but depressed regions. Results of the MACLo

  1. Diversity of Xiphinema americanum-group Species and Hierarchical Cluster Analysis of Morphometrics.

    Science.gov (United States)

    Lamberti, F; Ciancio, A

    1993-09-01

    Of the 39 species composing the Xiphinema americanum group, 14 were described originally from North America and two others have been reported from this region. Many species are very similar morphologically and can be distinguished only by a difficult comparison of various combinations of some morphometric characters. Study of morphometrics of 49 populations, including the type populations of the 39 species attributed to this group, by principal component analysis and hierarchical cluster analysis placed the populations into five subgroups, proposed here as the X. brevicolle subgroup (seven species), the X. americanum subgroup (17 species), the X. taylori subgroup (two species), the X. pachtaicum subgroup (eight species), and the X. lambertii subgroup (five species).

  2. Iterative Maps with Hierarchical Clustering for the Observed Scales of Astrophysical and Cosmological Structures

    CERN Document Server

    Capozziello, S; De Siena, S; Guerra, F; Illuminati, F

    2000-01-01

    We derive, in order of magnitude, the observed astrophysical and cosmologicalscales in the Universe, from neutron stars to superclusters of galaxies, up to,asymptotically, the observed radius of the Universe. This result is obtained byintroducing a recursive scheme of alternating hierachical mechanisms ofthree-dimensional and two-dimensional close packings of gravitationallyinteracting objects. The iterative scheme yields a rapidly converging geometricsequence, which can be described as a hierarchical clustering of aggregates,having the observed radius of the Universe as its fixed point.

  3. CLUSTAG & WCLUSTAG: Hierarchical Clustering Algorithms for Efficient Tag-SNP Selection

    Science.gov (United States)

    Ao, Sio-Iong

    More than 6 million single nucleotide polymorphisms (SNPs) in the human genome have been genotyped by the HapMap project. Although only a pro portion of these SNPs are functional, all can be considered as candidate markers for indirect association studies to detect disease-related genetic variants. The complete screening of a gene or a chromosomal region is nevertheless an expensive undertak ing for association studies. A key strategy for improving the efficiency of association studies is to select a subset of informative SNPs, called tag SNPs, for analysis. In the chapter, hierarchical clustering algorithms have been proposed for efficient tag SNP selection.

  4. Hierarchical Agglomerative Clustering Schemes for Energy-Efficiency in Wireless Sensor Networks

    Directory of Open Access Journals (Sweden)

    Taleb Tariq

    2017-06-01

    Full Text Available Extending the lifetime of wireless sensor networks (WSNs while delivering the expected level of service remains a hot research topic. Clustering has been identified in the literature as one of the primary means to save communication energy. In this paper, we argue that hierarchical agglomerative clustering (HAC provides a suitable foundation for designing highly energy efficient communication protocols for WSNs. To this end, we study a new mechanism for selecting cluster heads (CHs based both on the physical location of the sensors and their residual energy. Furthermore, we study different patterns of communications between the CHs and the base station depending on the possible transmission ranges and the ability of the sensors to act as traffic relays. Simulation results show that our proposed clustering and communication schemes outperform well-knows existing approaches by comfortable margins. In particular, networks lifetime is increased by more than 60% compared to LEACH and HEED, and by more than 30% compared to K-means clustering.

  5. Evaluation of hierarchical agglomerative cluster analysis methods for discrimination of primary biological aerosol

    Directory of Open Access Journals (Sweden)

    I. Crawford

    2015-07-01

    Full Text Available In this paper we present improved methods for discriminating and quantifying Primary Biological Aerosol Particles (PBAP by applying hierarchical agglomerative cluster analysis to multi-parameter ultra violet-light induced fluorescence (UV-LIF spectrometer data. The methods employed in this study can be applied to data sets in excess of 1×106 points on a desktop computer, allowing for each fluorescent particle in a dataset to be explicitly clustered. This reduces the potential for misattribution found in subsampling and comparative attribution methods used in previous approaches, improving our capacity to discriminate and quantify PBAP meta-classes. We evaluate the performance of several hierarchical agglomerative cluster analysis linkages and data normalisation methods using laboratory samples of known particle types and an ambient dataset. Fluorescent and non-fluorescent polystyrene latex spheres were sampled with a Wideband Integrated Bioaerosol Spectrometer (WIBS-4 where the optical size, asymmetry factor and fluorescent measurements were used as inputs to the analysis package. It was found that the Ward linkage with z-score or range normalisation performed best, correctly attributing 98 and 98.1 % of the data points respectively. The best performing methods were applied to the BEACHON-RoMBAS ambient dataset where it was found that the z-score and range normalisation methods yield similar results with each method producing clusters representative of fungal spores and bacterial aerosol, consistent with previous results. The z-score result was compared to clusters generated with previous approaches (WIBS AnalysiS Program, WASP where we observe that the subsampling and comparative attribution method employed by WASP results in the overestimation of the fungal spore concentration by a factor of 1.5 and the underestimation of bacterial aerosol concentration by a factor of 5. We suggest that this likely due to errors arising from misatrribution

  6. Evaluation of hierarchical agglomerative cluster analysis methods for discrimination of primary biological aerosol

    Directory of Open Access Journals (Sweden)

    I. Crawford

    2015-11-01

    Full Text Available In this paper we present improved methods for discriminating and quantifying primary biological aerosol particles (PBAPs by applying hierarchical agglomerative cluster analysis to multi-parameter ultraviolet-light-induced fluorescence (UV-LIF spectrometer data. The methods employed in this study can be applied to data sets in excess of 1 × 106 points on a desktop computer, allowing for each fluorescent particle in a data set to be explicitly clustered. This reduces the potential for misattribution found in subsampling and comparative attribution methods used in previous approaches, improving our capacity to discriminate and quantify PBAP meta-classes. We evaluate the performance of several hierarchical agglomerative cluster analysis linkages and data normalisation methods using laboratory samples of known particle types and an ambient data set. Fluorescent and non-fluorescent polystyrene latex spheres were sampled with a Wideband Integrated Bioaerosol Spectrometer (WIBS-4 where the optical size, asymmetry factor and fluorescent measurements were used as inputs to the analysis package. It was found that the Ward linkage with z-score or range normalisation performed best, correctly attributing 98 and 98.1 % of the data points respectively. The best-performing methods were applied to the BEACHON-RoMBAS (Bio–hydro–atmosphere interactions of Energy, Aerosols, Carbon, H2O, Organics and Nitrogen–Rocky Mountain Biogenic Aerosol Study ambient data set, where it was found that the z-score and range normalisation methods yield similar results, with each method producing clusters representative of fungal spores and bacterial aerosol, consistent with previous results. The z-score result was compared to clusters generated with previous approaches (WIBS AnalysiS Program, WASP where we observe that the subsampling and comparative attribution method employed by WASP results in the overestimation of the fungal spore concentration by a factor of 1.5 and the

  7. Asteroid family identification using the Hierarchical Clustering Method and WISE/NEOWISE physical properties

    CERN Document Server

    Masiero, Joseph R; Bauer, J M; Grav, T; Nugent, C R; Stevenson, R

    2013-01-01

    Using albedos from WISE/NEOWISE to separate distinct albedo groups within the Main Belt asteroids, we apply the Hierarchical Clustering Method to these subpopulations and identify dynamically associated clusters of asteroids. While this survey is limited to the ~35% of known Main Belt asteroids that were detected by NEOWISE, we present the families linked from these objects as higher confidence associations than can be obtained from dynamical linking alone. We find that over one-third of the observed population of the Main Belt is represented in the high-confidence cores of dynamical families. The albedo distribution of family members differs significantly from the albedo distribution of background objects in the same region of the Main Belt, however interpretation of this effect is complicated by the incomplete identification of lower-confidence family members. In total we link 38,298 asteroids into 76 distinct families. This work represents a critical step necessary to debias the albedo and size distributio...

  8. Analysis of genetic association in Listeria and Diabetes using Hierarchical Clustering and Silhouette Index

    Science.gov (United States)

    Pagnuco, Inti A.; Pastore, Juan I.; Abras, Guillermo; Brun, Marcel; Ballarin, Virginia L.

    2016-04-01

    It is usually assumed that co-expressed genes suggest co-regulation in the underlying regulatory network. Determining sets of co-expressed genes is an important task, where significative groups of genes are defined based on some criteria. This task is usually performed by clustering algorithms, where the whole family of genes, or a subset of them, are clustered into meaningful groups based on their expression values in a set of experiment. In this work we used a methodology based on the Silhouette index as a measure of cluster quality for individual gene groups, and a combination of several variants of hierarchical clustering to generate the candidate groups, to obtain sets of co-expressed genes for two real data examples. We analyzed the quality of the best ranked groups, obtained by the algorithm, using an online bioinformatics tool that provides network information for the selected genes. Moreover, to verify the performance of the algorithm, considering the fact that it doesn’t find all possible subsets, we compared its results against a full search, to determine the amount of good co-regulated sets not detected.

  9. On the Formation of Cool, Non-Flowing Cores in Galaxy Clusters via Hierarchical Mergers

    CERN Document Server

    Burns, J O; Norman, M L; Bryan, G L

    2003-01-01

    We present a new model for the creation of cool cores in rich galaxy clusters within a LambdaCDM cosmological framework using the results from high spatial dynamic range, adaptive mesh hydro/N-body simulations. It is proposed that cores of cool gas first form in subclusters and these subclusters merge to create rich clusters with cool, central X-Ray excesses. The rich cool clusters do not possess ``cooling flows'' due to the presence of bulk velocities in the intracluster medium in excess of 1000 km/sec produced by on-going accretion of gas from supercluster filaments. This new model has several attractive features including the presence of substantial core substructure within the cool cores, and it predicts the appearance of cool bullets, cool fronts, and cool filaments all of which have been recently observed with X-Ray satellites. This hierarchical formation model is also consistent with the observation that cool cores in Abell clusters occur preferentially in dense supercluster environments. On the other ...

  10. Ananke: temporal clustering reveals ecological dynamics of microbial communities

    Directory of Open Access Journals (Sweden)

    Michael W. Hall

    2017-09-01

    Full Text Available Taxonomic markers such as the 16S ribosomal RNA gene are widely used in microbial community analysis. A common first step in marker-gene analysis is grouping genes into clusters to reduce data sets to a more manageable size and potentially mitigate the effects of sequencing error. Instead of clustering based on sequence identity, marker-gene data sets collected over time can be clustered based on temporal correlation to reveal ecologically meaningful associations. We present Ananke, a free and open-source algorithm and software package that complements existing sequence-identity-based clustering approaches by clustering marker-gene data based on time-series profiles and provides interactive visualization of clusters, including highlighting of internal OTU inconsistencies. Ananke is able to cluster distinct temporal patterns from simulations of multiple ecological patterns, such as periodic seasonal dynamics and organism appearances/disappearances. We apply our algorithm to two longitudinal marker gene data sets: faecal communities from the human gut of an individual sampled over one year, and communities from a freshwater lake sampled over eleven years. Within the gut, the segregation of the bacterial community around a food-poisoning event was immediately clear. In the freshwater lake, we found that high sequence identity between marker genes does not guarantee similar temporal dynamics, and Ananke time-series clusters revealed patterns obscured by clustering based on sequence identity or taxonomy. Ananke is free and open-source software available at https://github.com/beiko-lab/ananke.

  11. Using hierarchical clustering methods to classify motor activities of COPD patients from wearable sensor data

    Directory of Open Access Journals (Sweden)

    Reilly John J

    2005-06-01

    Full Text Available Abstract Background Advances in miniature sensor technology have led to the development of wearable systems that allow one to monitor motor activities in the field. A variety of classifiers have been proposed in the past, but little has been done toward developing systematic approaches to assess the feasibility of discriminating the motor tasks of interest and to guide the choice of the classifier architecture. Methods A technique is introduced to address this problem according to a hierarchical framework and its use is demonstrated for the application of detecting motor activities in patients with chronic obstructive pulmonary disease (COPD undergoing pulmonary rehabilitation. Accelerometers were used to collect data for 10 different classes of activity. Features were extracted to capture essential properties of the data set and reduce the dimensionality of the problem at hand. Cluster measures were utilized to find natural groupings in the data set and then construct a hierarchy of the relationships between clusters to guide the process of merging clusters that are too similar to distinguish reliably. It provides a means to assess whether the benefits of merging for performance of a classifier outweigh the loss of resolution incurred through merging. Results Analysis of the COPD data set demonstrated that motor tasks related to ambulation can be reliably discriminated from tasks performed in a seated position with the legs in motion or stationary using two features derived from one accelerometer. Classifying motor tasks within the category of activities related to ambulation requires more advanced techniques. While in certain cases all the tasks could be accurately classified, in others merging clusters associated with different motor tasks was necessary. When merging clusters, it was found that the proposed method could lead to more than 12% improvement in classifier accuracy while retaining resolution of 4 tasks. Conclusion Hierarchical

  12. Using hierarchical clustering methods to classify motor activities of COPD patients from wearable sensor data

    Science.gov (United States)

    Sherrill, Delsey M; Moy, Marilyn L; Reilly, John J; Bonato, Paolo

    2005-01-01

    Background Advances in miniature sensor technology have led to the development of wearable systems that allow one to monitor motor activities in the field. A variety of classifiers have been proposed in the past, but little has been done toward developing systematic approaches to assess the feasibility of discriminating the motor tasks of interest and to guide the choice of the classifier architecture. Methods A technique is introduced to address this problem according to a hierarchical framework and its use is demonstrated for the application of detecting motor activities in patients with chronic obstructive pulmonary disease (COPD) undergoing pulmonary rehabilitation. Accelerometers were used to collect data for 10 different classes of activity. Features were extracted to capture essential properties of the data set and reduce the dimensionality of the problem at hand. Cluster measures were utilized to find natural groupings in the data set and then construct a hierarchy of the relationships between clusters to guide the process of merging clusters that are too similar to distinguish reliably. It provides a means to assess whether the benefits of merging for performance of a classifier outweigh the loss of resolution incurred through merging. Results Analysis of the COPD data set demonstrated that motor tasks related to ambulation can be reliably discriminated from tasks performed in a seated position with the legs in motion or stationary using two features derived from one accelerometer. Classifying motor tasks within the category of activities related to ambulation requires more advanced techniques. While in certain cases all the tasks could be accurately classified, in others merging clusters associated with different motor tasks was necessary. When merging clusters, it was found that the proposed method could lead to more than 12% improvement in classifier accuracy while retaining resolution of 4 tasks. Conclusion Hierarchical clustering methods are relevant

  13. Clustering of galaxies in a hierarchical universe - II. Evolution to high redshift

    Science.gov (United States)

    Kauffmann, Guinevere; Colberg, Jörg M.; Diaferio, Antonaldo; White, Simon D. M.

    1999-08-01

    In hierarchical cosmologies the evolution of galaxy clustering depends both on cosmological quantities such as Omega, Lambda and P(k), which determine how collapsed structures - dark matter haloes - form and evolve, and on the physical processes - cooling, star formation, radiative and hydrodynamic feedback - which drive the formation of galaxies within these merging haloes. In this paper we combine dissipationless cosmological N-body simulations and semi-analytic models of galaxy formation in order to study how these two aspects interact. We focus on the differences in clustering predicted for galaxies of differing luminosity, colour, morphology and star formation rate, and on what these differences can teach us about the galaxy formation process. We show that a `dip' in the amplitude of galaxy correlations between z=0 and z=1 can be an important diagnostic. Such a dip occurs in low-density CDM models, because structure forms early, and dark matter haloes of mass ~10^12M_solar, containing galaxies with luminosities ~L_*, are unbiased tracers of the dark matter over this redshift range; their clustering amplitude then evolves similarly to that of the dark matter. At higher redshifts, bright galaxies become strongly biased and the clustering amplitude increases again. In high density models, structure forms late, and bias evolves much more rapidly. As a result, the clustering amplitude of L_* galaxies remains constant from z=0 to z=1. The strength of these effects is sensitive to sample selection. The dip becomes weaker for galaxies with lower star formation rates, redder colours, higher luminosities and earlier morphological types. We explain why this is the case, and how it is related to the variation with redshift of the abundance and environment of the observed galaxies. We also show that the relative peculiar velocities of galaxies are biased low in our models, but that this effect is never very strong. Studies of clustering evolution as a function of galaxy

  14. Probabilistic inference: Task dependency and individual differences of probability weighting revealed by hierarchical Bayesian modelling

    Directory of Open Access Journals (Sweden)

    Moritz eBoos

    2016-05-01

    Full Text Available Cognitive determinants of probabilistic inference were examined using hierarchical Bayesian modelling techniques. A classic urn-ball paradigm served as experimental strategy, involving a factorial two (prior probabilities by two (likelihoods design. Five computational models of cognitive processes were compared with the observed behaviour. Parameter-free Bayesian posterior probabilities and parameter-free base rate neglect provided inadequate models of probabilistic inference. The introduction of distorted subjective probabilities yielded more robust and generalizable results. A general class of (inverted S-shaped probability weighting functions had been proposed; however, the possibility of large differences in probability distortions not only across experimental conditions, but also across individuals, seems critical for the model’s success. It also seems advantageous to consider individual differences in parameters of probability weighting as being sampled from weakly informative prior distributions of individual parameter values. Thus, the results from hierarchical Bayesian modelling converge with previous results in revealing that probability weighting parameters show considerable task dependency and individual differences. Methodologically, this work exemplifies the usefulness of hierarchical Bayesian modelling techniques for cognitive psychology. Theoretically, human probabilistic inference might be best described as the application of individualized strategic policies for Bayesian belief revision.

  15. Probabilistic Inference: Task Dependency and Individual Differences of Probability Weighting Revealed by Hierarchical Bayesian Modeling.

    Science.gov (United States)

    Boos, Moritz; Seer, Caroline; Lange, Florian; Kopp, Bruno

    2016-01-01

    Cognitive determinants of probabilistic inference were examined using hierarchical Bayesian modeling techniques. A classic urn-ball paradigm served as experimental strategy, involving a factorial two (prior probabilities) by two (likelihoods) design. Five computational models of cognitive processes were compared with the observed behavior. Parameter-free Bayesian posterior probabilities and parameter-free base rate neglect provided inadequate models of probabilistic inference. The introduction of distorted subjective probabilities yielded more robust and generalizable results. A general class of (inverted) S-shaped probability weighting functions had been proposed; however, the possibility of large differences in probability distortions not only across experimental conditions, but also across individuals, seems critical for the model's success. It also seems advantageous to consider individual differences in parameters of probability weighting as being sampled from weakly informative prior distributions of individual parameter values. Thus, the results from hierarchical Bayesian modeling converge with previous results in revealing that probability weighting parameters show considerable task dependency and individual differences. Methodologically, this work exemplifies the usefulness of hierarchical Bayesian modeling techniques for cognitive psychology. Theoretically, human probabilistic inference might be best described as the application of individualized strategic policies for Bayesian belief revision.

  16. DATA CLASSIFICATION WITH NEURAL CLASSIFIER USING RADIAL BASIS FUNCTION WITH DATA REDUCTION USING HIERARCHICAL CLUSTERING

    Directory of Open Access Journals (Sweden)

    M. Safish Mary

    2012-04-01

    Full Text Available Classification of large amount of data is a time consuming process but crucial for analysis and decision making. Radial Basis Function networks are widely used for classification and regression analysis. In this paper, we have studied the performance of RBF neural networks to classify the sales of cars based on the demand, using kernel density estimation algorithm which produces classification accuracy comparable to data classification accuracy provided by support vector machines. In this paper, we have proposed a new instance based data selection method where redundant instances are removed with help of a threshold thus improving the time complexity with improved classification accuracy. The instance based selection of the data set will help reduce the number of clusters formed thereby reduces the number of centers considered for building the RBF network. Further the efficiency of the training is improved by applying a hierarchical clustering technique to reduce the number of clusters formed at every step. The paper explains the algorithm used for classification and for conditioning the data. It also explains the complexities involved in classification of sales data for analysis and decision-making.

  17. 3D Nearest Neighbour Search Using a Clustered Hierarchical Tree Structure

    DEFF Research Database (Denmark)

    Suhaibah, A.; Uznir, U.; Antón Castro, Francesc/François

    2016-01-01

    , with the immense number of urban datasets, the retrieval and analysis of nearest neighbour information and their efficiency will become more complex and crucial. In this paper, we present a technique to retrieve nearest neighbour information in 3D space using a clustered hierarchical tree structure. Based on our...... findings, the proposed approach substantially showed an improvement of response time analysis compared to existing approaches of spatial access methods in databases. The query performance was tested using a dataset consisting of 500,000 point locations building and franchising unit. The results...... of the franchise unit will be located or is the franchise unit located is at the best level for visibility purposes. One of the common used analyses used for retrieving the surrounding information is Nearest Neighbour (NN) analysis. It uses a point location and identifies the surrounding neighbours. However...

  18. Modeling Hierarchically Clustered Longitudinal Survival Processes with Applications to Child Mortality and Maternal Health

    Directory of Open Access Journals (Sweden)

    Kuate-Defo, Bathélémy

    2001-01-01

    Full Text Available EnglishThis paper merges two parallel developments since the 1970s of newstatistical tools for data analysis: statistical methods known as hazard models that are used foranalyzing event-duration data and statistical methods for analyzing hierarchically clustered dataknown as multilevel models. These developments have rarely been integrated in research practice andthe formalization and estimation of models for hierarchically clustered survival data remain largelyuncharted. I attempt to fill some of this gap and demonstrate the merits of formulating and estimatingmultilevel hazard models with longitudinal data.FrenchCette étude intègre deux approches statistiques de pointe d'analyse des donnéesquantitatives depuis les années 70: les méthodes statistiques d'analyse desdonnées biographiques ou méthodes de survie et les méthodes statistiquesd'analyse des données hiérarchiques ou méthodes multi-niveaux. Ces deuxapproches ont été très peu mis en symbiose dans la pratique de recherche et parconséquent, la formulation et l'estimation des modèles appropriés aux donnéeslongitudinales et hiérarchiquement nichées demeure essentiellement un champd'investigation vierge. J'essaye de combler ce vide et j'utilise des données réellesen santé publique pour démontrer les mérites et contextes de formulation etd'estimation des modèles multi-niveaux et multi-états des données biographiqueset longitudinales.

  19. Biomolecule-Assisted Hydrothermal Synthesis and Self-Assembly of Bi2Te3 Nanostring-Cluster Hierarchical Structure

    DEFF Research Database (Denmark)

    Mi, Jianli; Lock, Nina; Sun, Ting;

    2010-01-01

    A simple biomolecule-assisted hydrothermal approach has been developed for the fabrication of Bi2Te3 thermoelectric nanomaterials. The product has a nanostring-cluster hierarchical structure which is composed of ordered and aligned platelet-like crystals. The platelets are100 nm in diameter...

  20. Comparing chemistry to outcome: the development of a chemical distance metric, coupled with clustering and hierarchal visualization applied to macromolecular crystallography.

    Directory of Open Access Journals (Sweden)

    Andrew E Bruno

    Full Text Available Many bioscience fields employ high-throughput methods to screen multiple biochemical conditions. The analysis of these becomes tedious without a degree of automation. Crystallization, a rate limiting step in biological X-ray crystallography, is one of these fields. Screening of multiple potential crystallization conditions (cocktails is the most effective method of probing a proteins phase diagram and guiding crystallization but the interpretation of results can be time-consuming. To aid this empirical approach a cocktail distance coefficient was developed to quantitatively compare macromolecule crystallization conditions and outcome. These coefficients were evaluated against an existing similarity metric developed for crystallization, the C6 metric, using both virtual crystallization screens and by comparison of two related 1,536-cocktail high-throughput crystallization screens. Hierarchical clustering was employed to visualize one of these screens and the crystallization results from an exopolyphosphatase-related protein from Bacteroides fragilis, (BfR192 overlaid on this clustering. This demonstrated a strong correlation between certain chemically related clusters and crystal lead conditions. While this analysis was not used to guide the initial crystallization optimization, it led to the re-evaluation of unexplained peaks in the electron density map of the protein and to the insertion and correct placement of sodium, potassium and phosphate atoms in the structure. With these in place, the resulting structure of the putative active site demonstrated features consistent with active sites of other phosphatases which are involved in binding the phosphoryl moieties of nucleotide triphosphates. The new distance coefficient, CDcoeff, appears to be robust in this application, and coupled with hierarchical clustering and the overlay of crystallization outcome, reveals information of biological relevance. While tested with a single example the

  1. Clustering of Galaxies in a Hierarchical Universe 2 evolution to High Redshift

    CERN Document Server

    Kauffmann, G; Diaferio, A; White, S D M; Kauffmann, Guinevere; Colberg, Joerg M.; Diaferio, Antonaldo; White, Simon D.M.

    1998-01-01

    In hierarchical cosmologies the evolution of galaxy clustering depends both on cosmological quantities such as Omega and Lambda, which determine how dark matter halos form and evolve, and on the physical processes - cooling, star formation and feedback - which drive the formation of galaxies within these merging halos. In this paper, we combine dissipationless cosmological N-body simulations and semi-analytic models of galaxy formation in order to study how these two aspects interact. We focus on the differences in clustering predicted for galaxies of differing luminosity, colour, morphology and star formation rate and on what these differences can teach us about the galaxy formation process. We show that a "dip" in the amplitude of galaxy correlations between z=0 and z=1 can be an important diagnostic. Such a dip occurs in low-density CDM models because structure forms early and dark matter halos of 10**12 solar masses, containing galaxies with luminosities around L*, are unbiased tracers of the dark matter ...

  2. Quality Assured Optimal Resource Provisioning and Scheduling Technique Based on Improved Hierarchical Agglomerative Clustering Algorithm (IHAC

    Directory of Open Access Journals (Sweden)

    A. Meenakshi

    2016-08-01

    Full Text Available Resource allocation is the task of convenient resources to different uses. In the context of an resources, entire economy, can be assigned by different means, such as markets or central planning. Cloud computing has become a new age technology that has got huge potentials in enterprises and markets. Clouds can make it possible to access applications and associated data from anywhere. The fundamental motive of the resource allocation is to allot the available resource in the most effective manner. In the initial phase, a representative resource usage distribution for a group of nodes with identical resource usage patterns is evaluated as resource bundle which can be easily employed to locate a group of nodes fulfilling a standard criterion. In the document, an innovative clustering-based resource aggregation viz. the Improved Hierarchal Agglomerative Clustering Algorithm (IHAC is elegantly launched to realize the compact illustration of a set of identically behaving nodes for scalability. In the subsequent phase concerned with energetic resource allocation procedure, the hybrid optimization technique is brilliantly brought in. The novel technique is devised for scheduling functions to cloud resources which duly consider both financial and evaluation expenses. The efficiency of the novel Resource allocation system is assessed by means of several parameters such the reliability, reusability and certain other metrics. The optimal path choice is the consequence of the hybrid optimization approach. The new-fangled technique allocates the available resource based on the optimal path.

  3. Demographic Data Assessment using Novel 3DCCOM Spatial Hierarchical Clustering: A Case Study of Sonipat Block, Haryana

    Directory of Open Access Journals (Sweden)

    Mamta Malik

    2011-09-01

    Full Text Available Cluster detection is a tool employed by GIS scientists who specialize in the field of spatial analysis. This study employed a combination of GIS, RS and a novel 3DCCOM spatial data clustering algorithm to assess the rural demographic development strategies of Sonepat block, Haryana, India. This Study is undertaken in the rural and rural-based district in India to demonstrate the integration of village-level spatial and non-spatial data in GIS environment using Hierarchical Clustering. Spatial clusters of living standard parameters, including family members, male and female population, sex ratio, total male and female education ratio etc. The paper also envisages future development and usefulness of this community GIS, Spatial data clustering tool for grass-root level planning. Any data that showsgeographic (spatial variability can be subject to cluster analysis.

  4. Multichannel biomedical time series clustering via hierarchical probabilistic latent semantic analysis.

    Science.gov (United States)

    Wang, Jin; Sun, Xiangping; Nahavandi, Saeid; Kouzani, Abbas; Wu, Yuchuan; She, Mary

    2014-11-01

    Biomedical time series clustering that automatically groups a collection of time series according to their internal similarity is of importance for medical record management and inspection such as bio-signals archiving and retrieval. In this paper, a novel framework that automatically groups a set of unlabelled multichannel biomedical time series according to their internal structural similarity is proposed. Specifically, we treat a multichannel biomedical time series as a document and extract local segments from the time series as words. We extend a topic model, i.e., the Hierarchical probabilistic Latent Semantic Analysis (H-pLSA), which was originally developed for visual motion analysis to cluster a set of unlabelled multichannel time series. The H-pLSA models each channel of the multichannel time series using a local pLSA in the first layer. The topics learned in the local pLSA are then fed to a global pLSA in the second layer to discover the categories of multichannel time series. Experiments on a dataset extracted from multichannel Electrocardiography (ECG) signals demonstrate that the proposed method performs better than previous state-of-the-art approaches and is relatively robust to the variations of parameters including length of local segments and dictionary size. Although the experimental evaluation used the multichannel ECG signals in a biometric scenario, the proposed algorithm is a universal framework for multichannel biomedical time series clustering according to their structural similarity, which has many applications in biomedical time series management. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  5. CHIMERA: Top-down model for hierarchical, overlapping and directed cluster structures in directed and weighted complex networks

    Science.gov (United States)

    Franke, R.

    2016-11-01

    In many networks discovered in biology, medicine, neuroscience and other disciplines special properties like a certain degree distribution and hierarchical cluster structure (also called communities) can be observed as general organizing principles. Detecting the cluster structure of an unknown network promises to identify functional subdivisions, hierarchy and interactions on a mesoscale. It is not trivial choosing an appropriate detection algorithm because there are multiple network, cluster and algorithmic properties to be considered. Edges can be weighted and/or directed, clusters overlap or build a hierarchy in several ways. Algorithms differ not only in runtime, memory requirements but also in allowed network and cluster properties. They are based on a specific definition of what a cluster is, too. On the one hand, a comprehensive network creation model is needed to build a large variety of benchmark networks with different reasonable structures to compare algorithms. On the other hand, if a cluster structure is already known, it is desirable to separate effects of this structure from other network properties. This can be done with null model networks that mimic an observed cluster structure to improve statistics on other network features. A third important application is the general study of properties in networks with different cluster structures, possibly evolving over time. Currently there are good benchmark and creation models available. But what is left is a precise sandbox model to build hierarchical, overlapping and directed clusters for undirected or directed, binary or weighted complex random networks on basis of a sophisticated blueprint. This gap shall be closed by the model CHIMERA (Cluster Hierarchy Interconnection Model for Evaluation, Research and Analysis) which will be introduced and described here for the first time.

  6. Typing of unknown microorganisms based on quantitative analysis of fatty acids by mass spectrometry and hierarchical clustering

    Energy Technology Data Exchange (ETDEWEB)

    Li Tingting; Dai Ling; Li Lun; Hu Xuejiao; Dong Linjie; Li Jianjian; Salim, Sule Khalfan; Fu Jieying [Key Laboratory of Pesticides and Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, Hubei 430079 (China); Zhong Hongying, E-mail: hyzhong@mail.ccnu.edu.cn [Key Laboratory of Pesticides and Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, Hubei 430079 (China)

    2011-01-17

    Rapid identification of unknown microorganisms of clinical and agricultural importance is not only critical for accurate diagnosis of infections but also essential for appropriate and prompt treatment. We describe here a rapid method for microorganisms typing based on quantitative analysis of fatty acids by iFAT approach (Isotope-coded Fatty Acid Transmethylation). In this work, lyophilized cell lysates were directly mixed with 0.5 M NaOH solution in d3-methanol and n-hexane. After 1 min of ultrasonication, the top n-hexane layer was combined with a mixture of standard d0-methanol derived fatty acid methylesters with known concentration. Measurement of intensity ratios of d3/d0 labeled fragment ion and molecular ion pairs at the corresponding target fatty acids provides a quantitative basis for hierarchical clustering. In the resultant dendrogram, the Euclidean distance between unknown species and known species quantitatively reveals their differences or shared similarities in fatty acid related pathways. It is of particular interest to apply this method for typing fungal species because fungi has distinguished lipid biosynthetic pathways that have been targeted for lots of drugs or fungicides compared with bacteria and animals. The proposed method has no dependence on the availability of genome or proteome databases. Therefore, it is can be applicable for a broad range of unknown microorganisms or mutant species.

  7. New Alzheimer amyloid beta responsive genes identified in human neuroblastoma cells by hierarchical clustering.

    Directory of Open Access Journals (Sweden)

    Markus Uhrig

    Full Text Available Alzheimer's disease (AD is characterized by neuronal degeneration and cell loss. Abeta(42, in contrast to Abeta(40, is thought to be the pathogenic form triggering the pathological cascade in AD. In order to unravel overall gene regulation we monitored the transcriptomic responses to increased or decreased Abeta(40 and Abeta(42 levels, generated and derived from its precursor C99 (C-terminal fragment of APP comprising 99 amino acids in human neuroblastoma cells. We identified fourteen differentially expressed transcripts by hierarchical clustering and discussed their involvement in AD. These fourteen transcripts were grouped into two main clusters each showing distinct differential expression patterns depending on Abeta(40 and Abeta(42 levels. Among these transcripts we discovered an unexpected inverse and strong differential expression of neurogenin 2 (NEUROG2 and KIAA0125 in all examined cell clones. C99-overexpression had a similar effect on NEUROG2 and KIAA0125 expression as a decreased Abeta(42/Abeta(40 ratio. Importantly however, an increased Abeta(42/Abeta(40 ratio, which is typical of AD, had an inverse expression pattern of NEUROG2 and KIAA0125: An increased Abeta(42/Abeta(40 ratio up-regulated NEUROG2, but down-regulated KIAA0125, whereas the opposite regulation pattern was observed for a decreased Abeta(42/Abeta(40 ratio. We discuss the possibilities that the so far uncharacterized KIAA0125 might be a counter player of NEUROG2 and that KIAA0125 could be involved in neurogenesis, due to the involvement of NEUROG2 in developmental neural processes.

  8. Symptom Clusters in People Living with HIV Attending Five Palliative Care Facilities in Two Sub-Saharan African Countries: A Hierarchical Cluster Analysis.

    Science.gov (United States)

    Moens, Katrien; Siegert, Richard J; Taylor, Steve; Namisango, Eve; Harding, Richard

    2015-01-01

    Symptom research across conditions has historically focused on single symptoms, and the burden of multiple symptoms and their interactions has been relatively neglected especially in people living with HIV. Symptom cluster studies are required to set priorities in treatment planning, and to lessen the total symptom burden. This study aimed to identify and compare symptom clusters among people living with HIV attending five palliative care facilities in two sub-Saharan African countries. Data from cross-sectional self-report of seven-day symptom prevalence on the 32-item Memorial Symptom Assessment Scale-Short Form were used. A hierarchical cluster analysis was conducted using Ward's method applying squared Euclidean Distance as the similarity measure to determine the clusters. Contingency tables, X2 tests and ANOVA were used to compare the clusters by patient specific characteristics and distress scores. Among the sample (N=217) the mean age was 36.5 (SD 9.0), 73.2% were female, and 49.1% were on antiretroviral therapy (ART). The cluster analysis produced five symptom clusters identified as: 1) dermatological; 2) generalised anxiety and elimination; 3) social and image; 4) persistently present; and 5) a gastrointestinal-related symptom cluster. The patients in the first three symptom clusters reported the highest physical and psychological distress scores. Patient characteristics varied significantly across the five clusters by functional status (worst functional physical status in cluster one, pclusters two and three, p=0.012); global distress (F=26.8, pcluster one, best for cluster four). The greatest burden is associated with cluster one, and should be prioritised in clinical management. Further symptom cluster research in people living with HIV with longitudinally collected symptom data to test cluster stability and identify common symptom trajectories is recommended.

  9. Structural system identification using degree of freedom-based reduction and hierarchical clustering algorithm

    Science.gov (United States)

    Chang, Seongmin; Baek, Sungmin; Kim, Ki-Ook; Cho, Maenghyo

    2015-06-01

    A system identification method has been proposed to validate finite element models of complex structures using measured modal data. Finite element method is used for the system identification as well as the structural analysis. In perturbation methods, the perturbed system is expressed as a combination of the baseline structure and the related perturbations. The changes in dynamic responses are applied to determine the structural modifications so that the equilibrium may be satisfied in the perturbed system. In practical applications, the dynamic measurements are carried out on a limited number of accessible nodes and associated degrees of freedom. The equilibrium equation is, in principle, expressed in terms of the measured (master, primary) and unmeasured (slave, secondary) degrees of freedom. Only the specified degrees of freedom are included in the equation formulation for identification and the unspecified degrees of freedom are eliminated through the iterative improved reduction scheme. A large number of system parameters are included as the unknown variables in the system identification of large-scaled structures. The identification problem with large number of system parameters requires a large amount of computation time and resources. In the present study, a hierarchical clustering algorithm is applied to reduce the number of system parameters effectively. Numerical examples demonstrate that the proposed method greatly improves the accuracy and efficiency in the inverse problem of identification.

  10. Hierarchical black hole triples in young star clusters: impact of Kozai-Lidov resonance on mergers

    CERN Document Server

    Kimpson, Thomas O; Mapelli, Michela; Ziosi, Brunetto M

    2016-01-01

    Mergers of compact object binaries are one of the most powerful sources of gravitational waves (GWs) in the frequency range of second-generation ground-based gravitational wave detectors (Advanced LIGO and Virgo). Dynamical simulations of young dense star clusters (SCs) indicate that ~27 per cent of all double compact object binaries are members of hierarchical triple systems (HTs). In this paper, we consider 570 HTs composed of three compact objects (black holes or neutron stars) that formed dynamically in N-body simulations of young dense SCs. We simulate them for a Hubble time with a new code based on the Mikkola's algorithmic regularization scheme, including the 2.5 post-Newtonian term. We find that ~88 per cent of the simulated systems develop Kozai-Lidov (KL) oscillations. KL resonance triggers the merger of the inner binary in three systems (corresponding to 0.5 per cent of the simulated HTs), by increasing the eccentricity of the inner binary. Accounting for KL oscillations leads to an increase of the...

  11. Hierarchical black hole triples in young star clusters: impact of Kozai-Lidov resonance on mergers

    Science.gov (United States)

    Kimpson, Thomas O.; Spera, Mario; Mapelli, Michela; Ziosi, Brunetto M.

    2016-12-01

    Mergers of compact-object binaries are one of the most powerful sources of gravitational waves (GWs) in the frequency range of second-generation ground-based GW detectors (advanced LIGO and Virgo). Dynamical simulations of young dense star clusters (SCs) indicate that ˜27 per cent of all double compact-object binaries are members of hierarchical triple systems (HTs). In this paper, we consider 570 HTs composed of three compact objects (black holes or neutron stars) that formed dynamically in N-body simulations of young dense SCs. We simulate them for a Hubble time with a new code based on the Mikkola's algorithmic regularization scheme, including the 2.5 post-Newtonian term. We find that ˜88 per cent of the simulated systems develop Kozai-Lidov (KL) oscillations. KL resonance triggers the merger of the inner binary in three systems (corresponding to 0.5 per cent of the simulated HTs), by increasing the eccentricity of the inner binary. Accounting for KL oscillations leads to an increase of the total expected merger rate by ≈50 per cent. All binaries that merge because of KL oscillations were formed by dynamical exchanges (i.e. none is a primordial binary) and have chirp mass >20 M⊙. This result might be crucial to interpret the formation channel of the first recently detected GW events.

  12. Ingredients and Process Standardization of Thepla: An Indian Unleavened Vegetable Flatbread using Hierarchical Cluster Analysis

    Directory of Open Access Journals (Sweden)

    S.S. Arya

    2012-10-01

    Full Text Available Thepla is an Indian unleavened flatbread made from whole-wheat flour with added spices and vegetables. It is particularly consumed in western zone of the India. The preparation of thepla is tedious, time consuming and requires skill. In the present study standardization of thepla ingredients were carried out by standardizing each ingredient on the basis of Overall Acceptability (OA score. Sensory analysis was carried out using nine-point hedonic rating scale with ten trained panellists. Standardized ingredients of thepla were: salt 3%, red chili powder 2.5%, fenugreek leaves 12%, cumin seed powder 0.6%, coriander seed powder 0.6%, ginger garlic paste (1:1 6%, asafoetida 0.6% and oil 3% w/w of whole wheat flour on the basis of highest sensory OA score. Further thepla process parameters such as time, temperature, diameter of thepla and weight of dough were standardized on the basis of sensory OA score. Obtained sensory score data was processed for Hierarchical Cluster Analysis (HCA.

  13. A new Hierarchical Group Key Management based on Clustering Scheme for Mobile Ad Hoc Networks

    Directory of Open Access Journals (Sweden)

    Ayman EL-SAYED

    2014-05-01

    Full Text Available The migration from wired network to wireless network has been a global trend in the past few decades because they provide anytime-anywhere networking services. The wireless networks are rapidly deployed in the future, secure wireless environment will be mandatory. As well, The mobility and scalability brought by wireless network made it possible in many applications. Among all the contemporary wireless networks,Mobile Ad hoc Networks (MANET is one of the most important and unique applications. MANET is a collection of autonomous nodes or terminals which communicate with each other by forming a multihop radio network and maintaining connectivity in a decentralized manner. Due to the nature of unreliable wireless medium data transfer is a major problem in MANET and it lacks security and reliability of data. The most suitable solution to provide the expected level of security to these services is the provision of a key management protocol. A Key management is vital part of security. This issue is even bigger in wireless network compared to wired network. The distribution of keys in an authenticated manner is a difficult task in MANET. When a member leaves or joins the group, it needs to generate a new key to maintain forward and backward secrecy. In this paper, we propose a new group key management schemes namely a Hierarchical, Simple, Efficient and Scalable Group Key (HSESGK based on clustering management scheme for MANETs and different other schemes are classified. Group members deduce the group key in a distributed manner.

  14. MAP-Based Underdetermined Blind Source Separation of Convolutive Mixtures by Hierarchical Clustering and -Norm Minimization

    Directory of Open Access Journals (Sweden)

    Kellermann Walter

    2007-01-01

    Full Text Available We address the problem of underdetermined BSS. While most previous approaches are designed for instantaneous mixtures, we propose a time-frequency-domain algorithm for convolutive mixtures. We adopt a two-step method based on a general maximum a posteriori (MAP approach. In the first step, we estimate the mixing matrix based on hierarchical clustering, assuming that the source signals are sufficiently sparse. The algorithm works directly on the complex-valued data in the time-frequency domain and shows better convergence than algorithms based on self-organizing maps. The assumption of Laplacian priors for the source signals in the second step leads to an algorithm for estimating the source signals. It involves the -norm minimization of complex numbers because of the use of the time-frequency-domain approach. We compare a combinatorial approach initially designed for real numbers with a second-order cone programming (SOCP approach designed for complex numbers. We found that although the former approach is not theoretically justified for complex numbers, its results are comparable to, or even better than, the SOCP solution. The advantage is a lower computational cost for problems with low input/output dimensions.

  15. Taxonomy of Manufacturing Flexibility at Manufacturing Companies Using Imperialist Competitive Algorithms, Support Vector Machines and Hierarchical Cluster Analysis

    Directory of Open Access Journals (Sweden)

    M. Khoobiyan

    2017-04-01

    Full Text Available Manufacturing flexibility is a multidimensional concept and manufacturing companies act differently in using these dimensions. The purpose of this study is to investigate taxonomy and identify dominant groups of manufacturing flexibility. Dimensions of manufacturing flexibility are extracted by content analysis of literature and expert judgements. Manufacturing flexibility was measured by using a questionnaire developed to survey managers of manufacturing companies. The sample size was set at 379. To identify dominant groups of flexibility based on dimensions of flexibility determined, Hierarchical Cluster Analysis (HCA, Imperialist Competitive Algorithms (ICAs and Support Vector Machines (SVMs were used by cluster validity indices. The best algorithm for clustering was SVMs with three clusters, designated as leading delivery-based flexibility, frugal flexibility and sufficient plan-based flexibility.

  16. 3D NEAREST NEIGHBOUR SEARCH USING A CLUSTERED HIERARCHICAL TREE STRUCTURE

    Directory of Open Access Journals (Sweden)

    A. Suhaibah

    2016-06-01

    Full Text Available Locating and analysing the location of new stores or outlets is one of the common issues facing retailers and franchisers. This is due to assure that new opening stores are at their strategic location to attract the highest possible number of customers. Spatial information is used to manage, maintain and analyse these store locations. However, since the business of franchising and chain stores in urban areas runs within high rise multi-level buildings, a three-dimensional (3D method is prominently required in order to locate and identify the surrounding information such as at which level of the franchise unit will be located or is the franchise unit located is at the best level for visibility purposes. One of the common used analyses used for retrieving the surrounding information is Nearest Neighbour (NN analysis. It uses a point location and identifies the surrounding neighbours. However, with the immense number of urban datasets, the retrieval and analysis of nearest neighbour information and their efficiency will become more complex and crucial. In this paper, we present a technique to retrieve nearest neighbour information in 3D space using a clustered hierarchical tree structure. Based on our findings, the proposed approach substantially showed an improvement of response time analysis compared to existing approaches of spatial access methods in databases. The query performance was tested using a dataset consisting of 500,000 point locations building and franchising unit. The results are presented in this paper. Another advantage of this structure is that it also offers a minimal overlap and coverage among nodes which can reduce repetitive data entry.

  17. A Resting-State Brain Functional Network Study in MDD Based on Minimum Spanning Tree Analysis and the Hierarchical Clustering

    Directory of Open Access Journals (Sweden)

    Xiaowei Li

    2017-01-01

    Full Text Available A large number of studies demonstrated that major depressive disorder (MDD is characterized by the alterations in brain functional connections which is also identifiable during the brain’s “resting-state.” But, in the present study, the approach of constructing functional connectivity is often biased by the choice of the threshold. Besides, more attention was paid to the number and length of links in brain networks, and the clustering partitioning of nodes was unclear. Therefore, minimum spanning tree (MST analysis and the hierarchical clustering were first used for the depression disease in this study. Resting-state electroencephalogram (EEG sources were assessed from 15 healthy and 23 major depressive subjects. Then the coherence, MST, and the hierarchical clustering were obtained. In the theta band, coherence analysis showed that the EEG coherence of the MDD patients was significantly higher than that of the healthy controls especially in the left temporal region. The MST results indicated the higher leaf fraction in the depressed group. Compared with the normal group, the major depressive patients lost clustering in frontal regions. Our findings suggested that there was a stronger brain interaction in the MDD group and a left-right functional imbalance in the frontal regions for MDD controls.

  18. Robust hierarchical state-space models reveal diel variation in travel rates of migrating leatherback turtles.

    Science.gov (United States)

    Jonsen, Ian D; Myers, Ransom A; James, Michael C

    2006-09-01

    1. Biological and statistical complexity are features common to most ecological data that hinder our ability to extract meaningful patterns using conventional tools. Recent work on implementing modern statistical methods for analysis of such ecological data has focused primarily on population dynamics but other types of data, such as animal movement pathways obtained from satellite telemetry, can also benefit from the application of modern statistical tools. 2. We develop a robust hierarchical state-space approach for analysis of multiple satellite telemetry pathways obtained via the Argos system. State-space models are time-series methods that allow unobserved states and biological parameters to be estimated from data observed with error. We show that the approach can reveal important patterns in complex, noisy data where conventional methods cannot. 3. Using the largest Atlantic satellite telemetry data set for critically endangered leatherback turtles, we show that the diel pattern in travel rates of these turtles changes over different phases of their migratory cycle. While foraging in northern waters the turtles show similar travel rates during day and night, but on their southward migration to tropical waters travel rates are markedly faster during the day. These patterns are generally consistent with diving data, and may be related to changes in foraging behaviour. Interestingly, individuals that migrate southward to breed generally show higher daytime travel rates than individuals that migrate southward in a non-breeding year. 4. Our approach is extremely flexible and can be applied to many ecological analyses that use complex, sequential data.

  19. Symptom Clusters in People Living with HIV Attending Five Palliative Care Facilities in Two Sub-Saharan African Countries: A Hierarchical Cluster Analysis.

    Directory of Open Access Journals (Sweden)

    Katrien Moens

    Full Text Available Symptom research across conditions has historically focused on single symptoms, and the burden of multiple symptoms and their interactions has been relatively neglected especially in people living with HIV. Symptom cluster studies are required to set priorities in treatment planning, and to lessen the total symptom burden. This study aimed to identify and compare symptom clusters among people living with HIV attending five palliative care facilities in two sub-Saharan African countries.Data from cross-sectional self-report of seven-day symptom prevalence on the 32-item Memorial Symptom Assessment Scale-Short Form were used. A hierarchical cluster analysis was conducted using Ward's method applying squared Euclidean Distance as the similarity measure to determine the clusters. Contingency tables, X2 tests and ANOVA were used to compare the clusters by patient specific characteristics and distress scores.Among the sample (N=217 the mean age was 36.5 (SD 9.0, 73.2% were female, and 49.1% were on antiretroviral therapy (ART. The cluster analysis produced five symptom clusters identified as: 1 dermatological; 2 generalised anxiety and elimination; 3 social and image; 4 persistently present; and 5 a gastrointestinal-related symptom cluster. The patients in the first three symptom clusters reported the highest physical and psychological distress scores. Patient characteristics varied significantly across the five clusters by functional status (worst functional physical status in cluster one, p<0.001; being on ART (highest proportions for clusters two and three, p=0.012; global distress (F=26.8, p<0.001, physical distress (F=36.3, p<0.001 and psychological distress subscale (F=21.8, p<0.001 (all subscales worst for cluster one, best for cluster four.The greatest burden is associated with cluster one, and should be prioritised in clinical management. Further symptom cluster research in people living with HIV with longitudinally collected symptom data to

  20. Principal component analysis vs. self-organizing maps combined with hierarchical clustering for pattern recognition in volcano seismic spectra

    Science.gov (United States)

    Unglert, K.; Radić, V.; Jellinek, A. M.

    2016-06-01

    Variations in the spectral content of volcano seismicity related to changes in volcanic activity are commonly identified manually in spectrograms. However, long time series of monitoring data at volcano observatories require tools to facilitate automated and rapid processing. Techniques such as self-organizing maps (SOM) and principal component analysis (PCA) can help to quickly and automatically identify important patterns related to impending eruptions. For the first time, we evaluate the performance of SOM and PCA on synthetic volcano seismic spectra constructed from observations during two well-studied eruptions at Klauea Volcano, Hawai'i, that include features observed in many volcanic settings. In particular, our objective is to test which of the techniques can best retrieve a set of three spectral patterns that we used to compose a synthetic spectrogram. We find that, without a priori knowledge of the given set of patterns, neither SOM nor PCA can directly recover the spectra. We thus test hierarchical clustering, a commonly used method, to investigate whether clustering in the space of the principal components and on the SOM, respectively, can retrieve the known patterns. Our clustering method applied to the SOM fails to detect the correct number and shape of the known input spectra. In contrast, clustering of the data reconstructed by the first three PCA modes reproduces these patterns and their occurrence in time more consistently. This result suggests that PCA in combination with hierarchical clustering is a powerful practical tool for automated identification of characteristic patterns in volcano seismic spectra. Our results indicate that, in contrast to PCA, common clustering algorithms may not be ideal to group patterns on the SOM and that it is crucial to evaluate the performance of these tools on a control dataset prior to their application to real data.

  1. Topological cluster analysis reveals the systemic organization of the Caenorhabditis elegans connectome.

    Directory of Open Access Journals (Sweden)

    Yunkyu Sohn

    2011-05-01

    Full Text Available The modular organization of networks of individual neurons interwoven through synapses has not been fully explored due to the incredible complexity of the connectivity architecture. Here we use the modularity-based community detection method for directed, weighted networks to examine hierarchically organized modules in the complete wiring diagram (connectome of Caenorhabditis elegans (C. elegans and to investigate their topological properties. Incorporating bilateral symmetry of the network as an important cue for proper cluster assignment, we identified anatomical clusters in the C. elegans connectome, including a body-spanning cluster, which correspond to experimentally identified functional circuits. Moreover, the hierarchical organization of the five clusters explains the systemic cooperation (e.g., mechanosensation, chemosensation, and navigation that occurs among the structurally segregated biological circuits to produce higher-order complex behaviors.

  2. Comparison of multianalyte proficiency test results by sum of ranking differences, principal component analysis, and hierarchical cluster analysis.

    Science.gov (United States)

    Škrbić, Biljana; Héberger, Károly; Durišić-Mladenović, Nataša

    2013-10-01

    Sum of ranking differences (SRD) was applied for comparing multianalyte results obtained by several analytical methods used in one or in different laboratories, i.e., for ranking the overall performances of the methods (or laboratories) in simultaneous determination of the same set of analytes. The data sets for testing of the SRD applicability contained the results reported during one of the proficiency tests (PTs) organized by EU Reference Laboratory for Polycyclic Aromatic Hydrocarbons (EU-RL-PAH). In this way, the SRD was also tested as a discriminant method alternative to existing average performance scores used to compare mutlianalyte PT results. SRD should be used along with the z scores--the most commonly used PT performance statistics. SRD was further developed to handle the same rankings (ties) among laboratories. Two benchmark concentration series were selected as reference: (a) the assigned PAH concentrations (determined precisely beforehand by the EU-RL-PAH) and (b) the averages of all individual PAH concentrations determined by each laboratory. Ranking relative to the assigned values and also to the average (or median) values pointed to the laboratories with the most extreme results, as well as revealed groups of laboratories with similar overall performances. SRD reveals differences between methods or laboratories even if classical test(s) cannot. The ranking was validated using comparison of ranks by random numbers (a randomization test) and using seven folds cross-validation, which highlighted the similarities among the (methods used in) laboratories. Principal component analysis and hierarchical cluster analysis justified the findings based on SRD ranking/grouping. If the PAH-concentrations are row-scaled, (i.e., z scores are analyzed as input for ranking) SRD can still be used for checking the normality of errors. Moreover, cross-validation of SRD on z scores groups the laboratories similarly. The SRD technique is general in nature, i.e., it can

  3. Teaching a machine to see: unsupervised image segmentation and categorisation using growing neural gas and hierarchical clustering

    CERN Document Server

    Hocking, Alex; Davey, Neil; Sun, Yi

    2015-01-01

    We present a novel unsupervised learning approach to automatically segment and label images in astronomical surveys. Automation of this procedure will be essential as next-generation surveys enter the petabyte scale: data volumes will exceed the capability of even large crowd-sourced analyses. We demonstrate how a growing neural gas (GNG) can be used to encode the feature space of imaging data. When coupled with a technique called hierarchical clustering, imaging data can be automatically segmented and labelled by organising nodes in the GNG. The key distinction of unsupervised learning is that these labels need not be known prior to training, rather they are determined by the algorithm itself. Importantly, after training a network can be be presented with images it has never 'seen' before and provide consistent categorisation of features. As a proof-of-concept we demonstrate application on data from the Hubble Space Telescope Frontier Fields: images of clusters of galaxies containing a mixture of galaxy type...

  4. Investigating the effects of climate variations on bacillary dysentery incidence in northeast China using ridge regression and hierarchical cluster analysis

    Directory of Open Access Journals (Sweden)

    Guo Junqiao

    2008-09-01

    Full Text Available Abstract Background The effects of climate variations on bacillary dysentery incidence have gained more recent concern. However, the multi-collinearity among meteorological factors affects the accuracy of correlation with bacillary dysentery incidence. Methods As a remedy, a modified method to combine ridge regression and hierarchical cluster analysis was proposed for investigating the effects of climate variations on bacillary dysentery incidence in northeast China. Results All weather indicators, temperatures, precipitation, evaporation and relative humidity have shown positive correlation with the monthly incidence of bacillary dysentery, while air pressure had a negative correlation with the incidence. Ridge regression and hierarchical cluster analysis showed that during 1987–1996, relative humidity, temperatures and air pressure affected the transmission of the bacillary dysentery. During this period, all meteorological factors were divided into three categories. Relative humidity and precipitation belonged to one class, temperature indexes and evaporation belonged to another class, and air pressure was the third class. Conclusion Meteorological factors have affected the transmission of bacillary dysentery in northeast China. Bacillary dysentery prevention and control would benefit from by giving more consideration to local climate variations.

  5. Energy Efficient Backoff Hierarchical Clustering Algorithms for Multi-Hop Wireless Sensor Networks

    Institute of Scientific and Technical Information of China (English)

    Jun Wang; Yong-Tao Cao; Jun-Yuan Xie; Shi-Fu Chen

    2011-01-01

    Compared with flat routing protocols, clustering is a fundamental performance improvement technique in wireless sensor networks, which can increase network scalability and lifetime. In this paper, we integrate the multi-hop technique with a backoff-based clustering algorithm to organize sensors. By using an adaptive backoff strategy, the algorithm not only realizes load balance among sensor node, but also achieves fairly uniform cluster head distribution across the network. Simulation results also demonstrate our algorithm is more energy-efficient than classical ones. Our algorithm is also easily extended to generate a hierarchy of cluster heads to obtain better network management and energy-efficiency.

  6. [The hierarchical clustering analysis of hyperspectral image based on probabilistic latent semantic analysis].

    Science.gov (United States)

    Yi, Wen-Bin; Shen, Li; Qi, Yin-Feng; Tang, Hong

    2011-09-01

    The paper introduces the Probabilistic Latent Semantic Analysis (PLSA) to the image clustering and an effective image clustering algorithm using the semantic information from PLSA is proposed which is used for hyperspectral images. Firstly, the ISODATA algorithm is used to obtain the initial clustering result of hyperspectral image and the clusters of the initial clustering result are considered as the visual words of the PLSA. Secondly, the object-oriented image segmentation algorithm is used to partition the hyperspectral image and segments with relatively pure pixels are regarded as documents in PLSA. Thirdly, a variety of identification methods which can estimate the best number of cluster centers is combined to get the number of latent semantic topics. Then the conditional distributions of visual words in topics and the mixtures of topics in different documents are estimated by using PLSA. Finally, the conditional probabilistic of latent semantic topics are distinguished using statistical pattern recognition method, the topic type for each visual in each document will be given and the clustering result of hyperspectral image are then achieved. Experimental results show the clusters of the proposed algorithm are better than K-MEANS and ISODATA in terms of object-oriented property and the clustering result is closer to the distribution of real spatial distribution of surface.

  7. Manual hierarchical clustering of regional geochemical data using a Bayesian finite mixture model

    Science.gov (United States)

    Ellefsen, Karl J.; Smith, David

    2016-01-01

    Interpretation of regional scale, multivariate geochemical data is aided by a statistical technique called “clustering.” We investigate a particular clustering procedure by applying it to geochemical data collected in the State of Colorado, United States of America. The clustering procedure partitions the field samples for the entire survey area into two clusters. The field samples in each cluster are partitioned again to create two subclusters, and so on. This manual procedure generates a hierarchy of clusters, and the different levels of the hierarchy show geochemical and geological processes occurring at different spatial scales. Although there are many different clustering methods, we use Bayesian finite mixture modeling with two probability distributions, which yields two clusters. The model parameters are estimated with Hamiltonian Monte Carlo sampling of the posterior probability density function, which usually has multiple modes. Each mode has its own set of model parameters; each set is checked to ensure that it is consistent both with the data and with independent geologic knowledge. The set of model parameters that is most consistent with the independent geologic knowledge is selected for detailed interpretation and partitioning of the field samples.

  8. Energy Efficient Zone Division Multihop Hierarchical Clustering Algorithm for Load Balancing in Wireless Sensor Network

    Directory of Open Access Journals (Sweden)

    Ashim Kumar Ghosh

    2011-12-01

    Full Text Available Wireless sensor nodes are use most embedded computing application. Multihop cluster hierarchy has been presented for large wireless sensor networks (WSNs that can provide scalable routing, data aggregation, and querying. The energy consumption rate for sensors in a WSN varies greatly based on the protocols the sensors use for communications. In this paper we present a cluster based routing algorithm. One of our main goals is to design the energy efficient routing protocol. Here we try to solve the usual problems of WSNs. We know the efficiency of WSNs depend upon the distance between node to base station and the amount of data to be transferred and the performance of clustering is greatly influenced by the selection of cluster-heads, which are in charge of creating clusters and controlling member nodes. This algorithm makes the best use of node with low number of cluster head know as super node. Here we divided the full region in four equal zones and the centre area of the region is used to select for super node. Each zone is considered separately and the zone may be or not divided further that’s depending upon the density of nodes in that zone and capability of the super node. This algorithm forms multilayer communication. The no of layer depends on the network current load and statistics. Our algorithm is easily extended to generate a hierarchy of cluster heads to obtain better network management and energy efficiency.

  9. Classification of cancer cell lines using an automated two-dimensional liquid mapping method with hierarchical clustering techniques.

    Science.gov (United States)

    Wang, Yanfei; Wu, Rong; Cho, Kathleen R; Shedden, Kerby A; Barder, Timothy J; Lubman, David M

    2006-01-01

    A two-dimensional liquid mapping method was used to map the protein expression of eight ovarian serous carcinoma cell lines and three immortalized ovarian surface epithelial cell lines. Maps were produced using pI as the separation parameter in the first dimension and hydrophobicity based upon reversed-phase HPLC separation in the second dimension. The method can be reproducibly used to produce protein expression maps over a pH range from 4.0 to 8.5. A dynamic programming method was used to correct for minor shifts in peaks during the HPLC gradient between sample runs. The resulting corrected maps can then be compared using hierarchical clustering to produce dendrograms indicating the relationship between different cell lines. It was found that several of the ovarian surface epithelial cell lines clustered together, whereas specific groups of serous carcinoma cell lines clustered with each other. Although there is limited information on the current biology of these cell lines, it was shown that the protein expression of certain cell lines is closely related to each other. Other cell lines, including one ovarian clear cell carcinoma cell line, two endometrioid carcinoma cell lines, and three breast epithelial cell lines, were also mapped for comparison to show that their protein profiles cluster differently than the serous samples and to study how they cluster relative to each other. In addition, comparisons can be made between proteins differentially expressed between cell lines that may serve as markers of ovarian serous carcinomas. The automation of the method allows reproducible comparison of many samples, and the use of differential analysis limits the number of proteins that might require further analysis by mass spectrometry techniques.

  10. Cluster based hierarchical resource searching model in P2P network

    Institute of Scientific and Technical Information of China (English)

    Yang Ruijuan; Liu Jian; Tian Jingwen

    2007-01-01

    For the problem of large network load generated by the Gnutella resource-searching model in Peer to Peer (P2P) network, a improved model to decrease the network expense is proposed, which establishes a duster in P2P network,auto-organizes logical layers, and applies a hybrid mechanism of directional searching and flooding. The performance analysis and simulation results show that the proposed hierarchical searching model has availably reduced the generated message load and that its searching-response time performance is as fairly good as that of the Gnutella model.

  11. Inter-Cluster Routing Authentication for Ad Hoc Networks by a Hierarchical Key Scheme

    Institute of Scientific and Technical Information of China (English)

    Yueh-Min Huang; Hua-Yi Lin; Tzone-I Wang

    2006-01-01

    Dissimilar to traditional networks, the features of mobile wireless devices that can actively form a network without any infrastructure mean that mobile ad hoc networks frequently display partition due to node mobility or link failures. These indicate that an ad hoc network is difficult to provide on-line access to a trusted authority server. Therefore,applying traditional Public Key Infrastructure (PKI) security framework to mobile ad hoc networks will cause insecurities.This study proposes a scalable and elastic key management scheme integrated into Cluster Based Secure Routing Protocol (CBSRP) to enhance security and non-repudiation of routing authentication, and introduces an ID-Based internal routing authentication scheme to enhance the routing performance in an internal cluster. Additionally, a method of performing routing authentication between internal and external clusters, as well as inter-cluster routing authentication, is developed.The proposed cluster-based key management scheme distributes trust to an aggregation of cluster heads using a threshold scheme faculty, provides Certificate Authority (CA) with a fault tolerance mechanism to prevent a single point of compromise or failure, and saves CA large repositories from maintaining member certificates, making ad hoc networks robust to malicious behaviors and suitable for numerous mobile devices.

  12. From Snakes to Stars, the Statistics of Collapsed Objects - II. Testing a Generic Scaling Ansatz for Hierarchical Clustering

    CERN Document Server

    Munshi, D; Melott, A L; Munshi, Dipak; Coles, Peter; Melott, Adrian L.

    1999-01-01

    We develop a diagrammatic technique to represent the multi-point cumulative probability density function (CPDF) of mass fluctuations in terms of the statistical properties of individual collapsed objects and relate this to other statistical descriptors such as cumulants, cumulant correlators and factorial moments. We use this approach to establish key scaling relations describing various measurable statistical quantities if clustering follows a simple general scaling ansatz, as expected in hierarchical models. We test these detailed predictions against high-resolution numerical simulations. We show that, when appropriate variables are used, the count probability distribution function (CPDF) and void probability distribution function (VPF) shows clear scaling properties in the non-linear regime. Generalising the results to the two-point count probability distribution function (2CPDF), and the bivariate void probability function (2VPF) we find good match with numerical simulations. We explore the behaviour of t...

  13. Hierarchical compression of Caenorhabditis elegans locomotion reveals phenotypic differences in the organization of behaviour

    Science.gov (United States)

    2016-01-01

    Regularities in animal behaviour offer insights into the underlying organizational and functional principles of nervous systems and automated tracking provides the opportunity to extract features of behaviour directly from large-scale video data. Yet how to effectively analyse such behavioural data remains an open question. Here, we explore whether a minimum description length principle can be exploited to identify meaningful behaviours and phenotypes. We apply a dictionary compression algorithm to behavioural sequences from the nematode worm Caenorhabditis elegans freely crawling on an agar plate both with and without food and during chemotaxis. We find that the motifs identified by the compression algorithm are rare but relevant for comparisons between worms in different environments, suggesting that hierarchical compression can be a useful step in behaviour analysis. We also use compressibility as a new quantitative phenotype and find that the behaviour of wild-isolated strains of C. elegans is more compressible than that of the laboratory strain N2 as well as the majority of mutant strains examined. Importantly, in distinction to more conventional phenotypes such as overall motor activity or aggregation behaviour, the increased compressibility of wild isolates is not explained by the loss of function of the gene npr-1, which suggests that erratic locomotion is a laboratory-derived trait with a novel genetic basis. Because hierarchical compression can be applied to any sequence, we anticipate that compressibility can offer insights into the organization of behaviour in other animals including humans. PMID:27581484

  14. Hierarchical compression of Caenorhabditis elegans locomotion reveals phenotypic differences in the organization of behaviour.

    Science.gov (United States)

    Gomez-Marin, Alex; Stephens, Greg J; Brown, André E X

    2016-08-01

    Regularities in animal behaviour offer insights into the underlying organizational and functional principles of nervous systems and automated tracking provides the opportunity to extract features of behaviour directly from large-scale video data. Yet how to effectively analyse such behavioural data remains an open question. Here, we explore whether a minimum description length principle can be exploited to identify meaningful behaviours and phenotypes. We apply a dictionary compression algorithm to behavioural sequences from the nematode worm Caenorhabditis elegans freely crawling on an agar plate both with and without food and during chemotaxis. We find that the motifs identified by the compression algorithm are rare but relevant for comparisons between worms in different environments, suggesting that hierarchical compression can be a useful step in behaviour analysis. We also use compressibility as a new quantitative phenotype and find that the behaviour of wild-isolated strains of C. elegans is more compressible than that of the laboratory strain N2 as well as the majority of mutant strains examined. Importantly, in distinction to more conventional phenotypes such as overall motor activity or aggregation behaviour, the increased compressibility of wild isolates is not explained by the loss of function of the gene npr-1, which suggests that erratic locomotion is a laboratory-derived trait with a novel genetic basis. Because hierarchical compression can be applied to any sequence, we anticipate that compressibility can offer insights into the organization of behaviour in other animals including humans.

  15. AGN-selected clusters as revealed by weak lensing

    Science.gov (United States)

    Wold, M.; Lacy, M.; Dahle, H.; Lilje, P. B.; Ridgway, S. E.

    2002-01-01

    We discuss the results in light of the cooling flow and the merger/interaction scenarios for triggering and fuelling AGN in clusters, but find that the data do not point unambiguously to neither of the two.

  16. Hierarchical structure and modules in the Escherichia coli transcriptional regulatory network revealed by a new top-down approach

    Directory of Open Access Journals (Sweden)

    Buer Jan

    2004-12-01

    Full Text Available Abstract Background Cellular functions are coordinately carried out by groups of genes forming functional modules. Identifying such modules in the transcriptional regulatory network (TRN of organisms is important for understanding the structure and function of these fundamental cellular networks and essential for the emerging modular biology. So far, the global connectivity structure of TRN has not been well studied and consequently not applied for the identification of functional modules. Moreover, network motifs such as feed forward loop are recently proposed to be basic building blocks of TRN. However, their relationship to functional modules is not clear. Results In this work we proposed a top-down approach to identify modules in the TRN of E. coli. By studying the global connectivity structure of the regulatory network, we first revealed a five-layer hierarchical structure in which all the regulatory relationships are downward. Based on this regulatory hierarchy, we developed a new method to decompose the regulatory network into functional modules and to identify global regulators governing multiple modules. As a result, 10 global regulators and 39 modules were identified and shown to have well defined functions. We then investigated the distribution and composition of the two basic network motifs (feed forward loop and bi-fan motif in the hierarchical structure of TRN. We found that most of these network motifs include global regulators, indicating that these motifs are not basic building blocks of modules since modules should not contain global regulators. Conclusion The transcriptional regulatory network of E. coli possesses a multi-layer hierarchical modular structure without feedback regulation at transcription level. This hierarchical structure builds the basis for a new and simple decomposition method which is suitable for the identification of functional modules and global regulators in the transcriptional regulatory network of E

  17. A comparison of hierarchical cluster analysis and league table rankings as methods for analysis and presentation of district health system performance data in Uganda.

    Science.gov (United States)

    Tashobya, Christine K; Dubourg, Dominique; Ssengooba, Freddie; Speybroeck, Niko; Macq, Jean; Criel, Bart

    2016-03-01

    In 2003, the Uganda Ministry of Health introduced the district league table for district health system performance assessment. The league table presents district performance against a number of input, process and output indicators and a composite index to rank districts. This study explores the use of hierarchical cluster analysis for analysing and presenting district health systems performance data and compares this approach with the use of the league table in Uganda. Ministry of Health and district plans and reports, and published documents were used to provide information on the development and utilization of the Uganda district league table. Quantitative data were accessed from the Ministry of Health databases. Statistical analysis using SPSS version 20 and hierarchical cluster analysis, utilizing Wards' method was used. The hierarchical cluster analysis was conducted on the basis of seven clusters determined for each year from 2003 to 2010, ranging from a cluster of good through moderate-to-poor performers. The characteristics and membership of clusters varied from year to year and were determined by the identity and magnitude of performance of the individual variables. Criticisms of the league table include: perceived unfairness, as it did not take into consideration district peculiarities; and being oversummarized and not adequately informative. Clustering organizes the many data points into clusters of similar entities according to an agreed set of indicators and can provide the beginning point for identifying factors behind the observed performance of districts. Although league table ranking emphasize summation and external control, clustering has the potential to encourage a formative, learning approach. More research is required to shed more light on factors behind observed performance of the different clusters. Other countries especially low-income countries that share many similarities with Uganda can learn from these experiences. © The Author 2015

  18. Heads First: Visual Aftereffects Reveal Hierarchical Integration of Cues to Social Attention.

    Directory of Open Access Journals (Sweden)

    Sarah Cooney

    Full Text Available Determining where another person is attending is an important skill for social interaction that relies on various visual cues, including the turning direction of the head and body. This study reports a novel high-level visual aftereffect that addresses the important question of how these sources of information are combined in gauging social attention. We show that adapting to images of heads turned 25° to the right or left produces a perceptual bias in judging the turning direction of subsequently presented bodies. In contrast, little to no change in the judgment of head orientation occurs after adapting to extremely oriented bodies. The unidirectional nature of the aftereffect suggests that cues from the human body signaling social attention are combined in a hierarchical fashion and is consistent with evidence from single-cell recording studies in nonhuman primates showing that information about head orientation can override information about body posture when both are visible.

  19. Exploring cognitive heterogeneity in first-episode psychosis: What cluster analysis can reveal.

    Science.gov (United States)

    Reser, Maree P; Allott, Kelly A; Killackey, Eóin; Farhall, John; Cotton, Susan M

    2015-10-30

    Variable outcomes in first-episode psychosis (FEP) are partly attributable to heterogeneity in cognitive functioning. To aid identification of those likely to have poorer or better outcomes, we examined whether purported cognitive profiles identified through use of cluster analysis in chronic schizophrenia were evident in FEP. We also aimed to assess whether there was a relationship between cognitive profile and factors independent of the solution, providing external validation that the cognitive profiles represented distinct subgroups. Ward's method hierarchical cluster analysis, verified by a k-means cluster solution, was performed using data obtained from a cognitive test battery administered to 128 participants aged 15-25 years. Four cognitive profiles were identified. A continuity element was evident; participants in cluster four were more cognitively impaired compared to participants in cluster three, who appeared more cognitively intact. Clusters one and two were distinguishable across measures of attention and working memory and visual recognition memory, most likely reflecting sample specific patterns of deficit. Participants in cluster four had significantly lower premorbid and current IQ and higher negative symptoms compared to participants in cluster three. The distinct levels and patterns of cognition found in chronic schizophrenia cohorts are also evident across diagnostic categories in FEP. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  20. Phylogenetic clusters of rhizobia revealed by genome structures

    Institute of Scientific and Technical Information of China (English)

    ZHENG Junfang; LIU Guirong; ZHU Wanfu; ZHOU Yuguang; LIU Shulin

    2004-01-01

    Rhizobia, bacteria that fix atmospheric nitrogen, are important agricultural resources. In order to establish the evolutionary relationships among rhizobia isolated from different geographic regions and different plant hosts for systematic studies, we evaluated the use of physical structure of the rhizobial genomes as a phylogenetic marker to categorize these bacteria. In this work, we analyzed the features of genome structures of 64 rhizobial strains. These rhizobial strains were divided into 21 phylogenetic clusters according to the features of genome structures evaluated by the endonuclease I-CeuI. These clusters were supported by 16S rRNA comparisons and genomic sequences of four rhizobial strains, but they are largely different from those based on the current taxonomic scheme (except 16S rRNA).

  1. Network based approaches reveal clustering in protein point patterns

    Science.gov (United States)

    Parker, Joshua; Barr, Valarie; Aldridge, Joshua; Samelson, Lawrence E.; Losert, Wolfgang

    2014-03-01

    Recent advances in super-resolution imaging have allowed for the sub-diffraction measurement of the spatial location of proteins on the surfaces of T-cells. The challenge is to connect these complex point patterns to the internal processes and interactions, both protein-protein and protein-membrane. We begin analyzing these patterns by forming a geometric network amongst the proteins and looking at network measures, such the degree distribution. This allows us to compare experimentally observed patterns to models. Specifically, we find that the experimental patterns differ from heterogeneous Poisson processes, highlighting an internal clustering structure. Further work will be to compare our results to simulated protein-protein interactions to determine clustering mechanisms.

  2. 建筑物层次空间聚类方法研究%Hierarchical spatial clustering of buildings

    Institute of Scientific and Technical Information of China (English)

    邓敏; 孙前虎; 文小岳; 徐枫

    2011-01-01

    建筑物空间聚类是实现居民地地图自动综合的有效方法.基于图论和Gestalt原理,发展了一种层次的建筑物聚类方法.该方法可以深层次地挖掘建筑物图形的视觉特性,将面状地物信息充分合理地表达在聚类结果中.依据视觉感知原理,借助Dealaunay三角网构建方法,分析了地图上建筑物的自身形状特性和相互间的邻接关系,并依据建筑物间的可视区域均值距离建立了加权邻近结构图,确定了建筑物的邻近关系(定性约束).根据Gestalt准则将邻近性、方向性和几何特征等量化为旋转卡壳距离约束和几何相似度约束.通过实例验证了层次聚类方法得到更加符合人类认知的建筑物聚类结果.%Spatial clustering provides an effective approach for generalization of residential area in automated cartographic generalization.Based on graph theory and Gestalt principle, a hierarchical approach is proposed in this paper.This approach can be utilized to discover the graphical structure formed by buildings, which is obtained with the consideration of shape, size and neighboring relations.The neighboring relations are determined by Dclaunay triangulation, which is a qualitative constraint among buildings.A weighted neighboring structural graph is obtained by setting visual distance as the weight of the linking edge between adjacent buildings.Two levels of quantitative constraints are developed by considering the Gestalt factors, I.e.proximity, orientation and geometry of buildings.One is the rotating calipers minimum distance;the other is the geometric similarity measure.Through experiments it is illustrated that the results by the hierarchical spatial clustering proposed in this paper are consistent with human perception.

  3. Analytical relations concerning the collapse time in hierarchically clustered cosmological models

    CERN Document Server

    Gambera, M

    1997-01-01

    By means of numerical methods, we solve the equations of motion for the collapse of a shell of baryonic matter, made of galaxies and substructure falling into the central regions of a cluster of galaxies, taking into account the effect of the dynamical friction. The parameters on which the dynamical friction mainly depends are: the peaks' height, the number of peaks inside a protocluster multiplied by the correlation function evaluated at the origin, the filtering radius and the nucleus radius of the protocluster of galaxies. We show how the collapse time (Tau) of the shell depends on these parameters. We give a formula that links the dynamical friction coefficient (Eta) o the parameters mentioned above and an analytic relation between the collapse time and (Eta). Finally, we obtain an analytical relation between (Tau) and the mean overdensity (mean Delta) within the shell. All the analytical relations that we find are in excellent agreement with the numerical integration.

  4. Hierarchical cluster analysis of labour market regulations and population health: a taxonomy of low- and middle-income countries

    Directory of Open Access Journals (Sweden)

    Muntaner Carles

    2012-04-01

    Full Text Available Abstract Background An important contribution of the social determinants of health perspective has been to inquire about non-medical determinants of population health. Among these, labour market regulations are of vital significance. In this study, we investigate the labour market regulations among low- and middle-income countries (LMICs and propose a labour market taxonomy to further understand population health in a global context. Methods Using Gross National Product per capita, we classify 113 countries into either low-income (n = 71 or middle-income (n = 42 strata. Principal component analysis of three standardized indicators of labour market inequality and poverty is used to construct 2 factor scores. Factor score reliability is evaluated with Cronbach's alpha. Using these scores, we conduct a hierarchical cluster analysis to produce a labour market taxonomy, conduct zero-order correlations, and create box plots to test their associations with adult mortality, healthy life expectancy, infant mortality, maternal mortality, neonatal mortality, under-5 mortality, and years of life lost to communicable and non-communicable diseases. Labour market and health data are retrieved from the International Labour Organization's Key Indicators of Labour Markets and World Health Organization's Statistical Information System. Results Six labour market clusters emerged: Residual (n = 16, Emerging (n = 16, Informal (n = 10, Post-Communist (n = 18, Less Successful Informal (n = 22, and Insecure (n = 31. Primary findings indicate: (i labour market poverty and population health is correlated in both LMICs; (ii association between labour market inequality and health indicators is significant only in low-income countries; (iii Emerging (e.g., East Asian and Eastern European countries and Insecure (e.g., sub-Saharan African nations clusters are the most advantaged and disadvantaged, respectively, with the remaining clusters experiencing levels of population

  5. Formation of an O-Star Cluster by Hierarchical Accretion in G20.08-0.14 N

    CERN Document Server

    Galván-Madrid, Roberto; Zhang, Qizhou; Kurtz, Stan; Rodríguez, Luis F; Ho, Paul T P

    2009-01-01

    Spectral line and continuum observations of the ionized and molecular gas in G20.08-0.14 N explore the dynamics of accretion over a range of spatial scales in this massive star forming region. Very Large Array observations of NH_3 at 4'' angular resolution show a large scale (0.5 pc) molecular accretion flow around and into a star cluster with three small, bright HII regions. Higher resolution (0.4'') observations with the Submillimeter Array in hot core molecules (CH_3CN, OCS, and SO_2) and the VLA in NH_3, show that the two brightest and smallest HII regions are themselves surrounded by smaller scale (0.05 pc) accretion flows. The axes of rotation of the large and small scale flows are aligned, and the time scale for the contraction of the cloud is short enough, 0.1 Myr, for the large scale accretion flow to deliver significant mass to the smaller scales within the star formation time scale. The flow structure appears to be continuous and hierarchical from larger to smaller scales. Millimeter radio recombin...

  6. Hierarchical Network Design

    DEFF Research Database (Denmark)

    Thomadsen, Tommy

    2005-01-01

    Communication networks are immensely important today, since both companies and individuals use numerous services that rely on them. This thesis considers the design of hierarchical (communication) networks. Hierarchical networks consist of layers of networks and are well-suited for coping...... the clusters. The design of hierarchical networks involves clustering of nodes, hub selection, and network design, i.e. selection of links and routing of ows. Hierarchical networks have been in use for decades, but integrated design of these networks has only been considered for very special types of networks....... The thesis investigates models for hierarchical network design and methods used to design such networks. In addition, ring network design is considered, since ring networks commonly appear in the design of hierarchical networks. The thesis introduces hierarchical networks, including a classification scheme...

  7. Clusters of internally primed transcripts reveal novel long noncoding RNAs.

    Directory of Open Access Journals (Sweden)

    2006-04-01

    Full Text Available Non-protein-coding RNAs (ncRNAs are increasingly being recognized as having important regulatory roles. Although much recent attention has focused on tiny 22- to 25-nucleotide microRNAs, several functional ncRNAs are orders of magnitude larger in size. Examples of such macro ncRNAs include Xist and Air, which in mouse are 18 and 108 kilobases (Kb, respectively. We surveyed the 102,801 FANTOM3 mouse cDNA clones and found that Air and Xist were present not as single, full-length transcripts but as a cluster of multiple, shorter cDNAs, which were unspliced, had little coding potential, and were most likely primed from internal adenine-rich regions within longer parental transcripts. We therefore conducted a genome-wide search for regional clusters of such cDNAs to find novel macro ncRNA candidates. Sixty-six regions were identified, each of which mapped outside known protein-coding loci and which had a mean length of 92 Kb. We detected several known long ncRNAs within these regions, supporting the basic rationale of our approach. In silico analysis showed that many regions had evidence of imprinting and/or antisense transcription. These regions were significantly associated with microRNAs and transcripts from the central nervous system. We selected eight novel regions for experimental validation by northern blot and RT-PCR and found that the majority represent previously unrecognized noncoding transcripts that are at least 10 Kb in size and predominantly localized in the nucleus. Taken together, the data not only identify multiple new ncRNAs but also suggest the existence of many more macro ncRNAs like Xist and Air.

  8. K-shell decomposition reveals hierarchical cortical organization of the human brain

    Science.gov (United States)

    Lahav, Nir; Ksherim, Baruch; Ben-Simon, Eti; Maron-Katz, Adi; Cohen, Reuven; Havlin, Shlomo

    2016-08-01

    In recent years numerous attempts to understand the human brain were undertaken from a network point of view. A network framework takes into account the relationships between the different parts of the system and enables to examine how global and complex functions might emerge from network topology. Previous work revealed that the human brain features ‘small world’ characteristics and that cortical hubs tend to interconnect among themselves. However, in order to fully understand the topological structure of hubs, and how their profile reflect the brain’s global functional organization, one needs to go beyond the properties of a specific hub and examine the various structural layers that make up the network. To address this topic further, we applied an analysis known in statistical physics and network theory as k-shell decomposition analysis. The analysis was applied on a human cortical network, derived from MRI\\DSI data of six participants. Such analysis enables us to portray a detailed account of cortical connectivity focusing on different neighborhoods of inter-connected layers across the cortex. Our findings reveal that the human cortex is highly connected and efficient, and unlike the internet network contains no isolated nodes. The cortical network is comprised of a nucleus alongside shells of increasing connectivity that formed one connected giant component, revealing the human brain’s global functional organization. All these components were further categorized into three hierarchies in accordance with their connectivity profile, with each hierarchy reflecting different functional roles. Such a model may explain an efficient flow of information from the lowest hierarchy to the highest one, with each step enabling increased data integration. At the top, the highest hierarchy (the nucleus) serves as a global interconnected collective and demonstrates high correlation with consciousness related regions, suggesting that the nucleus might serve as a

  9. Classifying airborne radiometry data with Agglomerative Hierarchical Clustering: A tool for geological mapping in context of rainforest (French Guiana)

    Science.gov (United States)

    Martelet, G.; Truffert, C.; Tourlière, B.; Ledru, P.; Perrin, J.

    2006-09-01

    In highly weathered environments, it is crucial that geological maps provide information concerning both the regolith and the bedrock, for societal needs, such as land-use, mineral or water resources management. Often, geologists are facing the challenge of upgrading existing maps, as relevant information concerning weathering processes and pedogenesis is currently missing. In rugged areas in particular, where access to the field is difficult, ground observations are sparsely available, and need therefore to be complemented using methods based on remotely sensed data. For this purpose, we discuss the use of Agglomerative Hierarchical Clustering (AHC) on eU, K and eTh airborne gamma-ray spectrometry grids. The AHC process allows primarily to segment the geophysical maps into zones having coherent U, K and Th contents. The analysis of these contents are discussed in terms of geochemical signature for lithological attribution of classes, as well as the use of a dendrogram, which gives indications on the hierarchical relations between classes. Unsupervised classification maps resulting from AHC can be considered as spatial models of the distribution of the radioelement content in surface and sub-surface formations. The source of gamma rays emanating from the ground is primarily related to the geochemistry of the bedrock and secondarily to modifications of the radioelement distribution by weathering and other secondary mechanisms, such as mobilisation by wind or water. The interpretation of the obtained predictive classified maps, their U, K, Th contents, and the dendrogram, in light of available geological knowledge, allows to separate signatures related to regolith and solid geology. Consequently, classification maps can be integrated within a GIS environment and used by the geologist as a support for mapping bedrock lithologies and their alteration. We illustrate the AHC classification method in the region of Cayenne using high-resolution airborne radiometric data

  10. Rapid recognition of drug-resistance/sensitivity in leukemic cells by Fourier transform infrared microspectroscopy and unsupervised hierarchical cluster analysis.

    Science.gov (United States)

    Bellisola, Giuseppe; Cinque, Gianfelice; Vezzalini, Marzia; Moratti, Elisabetta; Silvestri, Giovannino; Redaelli, Sara; Gambacorti Passerini, Carlo; Wehbe, Katia; Sorio, Claudio

    2013-07-21

    We tested the ability of Fourier Transform (FT) InfraRed (IR) microspectroscopy (microFTIR) in combination with unsupervised Hierarchical Cluster Analysis (HCA) in identifying drug-resistance/sensitivity in leukemic cells exposed to tyrosine kinase inhibitors (TKIs). Experiments were carried out in a well-established mouse model of human Chronic Myelogenous Leukemia (CML). Mouse-derived pro-B Ba/F3 cells transfected with and stably expressing the human p210(BCR-ABL) drug-sensitive wild-type BCR-ABL or the V299L or T315I p210(BCR-ABL) drug-resistant BCR-ABL mutants were exposed to imatinib-mesylate (IMA) or dasatinib (DAS). MicroFTIR was carried out at the Diamond IR beamline MIRIAM where the mid-IR absorbance spectra of individual Ba/F3 cells were acquired using the high brilliance IR synchrotron radiation (SR) via aperture of 15 × 15 μm(2) in sizes. A conventional IR source (globar) was used to compare average spectra over 15 cells or more. IR signatures of drug actions were identified by supervised analyses in the spectra of TKI-sensitive cells. Unsupervised HCA applied to selected intervals of wavenumber allowed us to classify the IR patterns of viable (drug-resistant) and apoptotic (drug-sensitive) cells with an accuracy of >95%. The results from microFTIR + HCA analysis were cross-validated with those obtained via immunochemical methods, i.e. immunoblotting and flow cytometry (FC) that resulted directly and significantly correlated. We conclude that this combined microFTIR + HCA method potentially represents a rapid, convenient and robust screening approach to study the impact of drugs in leukemic cells as well as in peripheral blasts from patients in clinical trials with new anti-leukemic drugs.

  11. Diagnostic phylogenetics reveals a new Porcine circovirus 2 cluster.

    Science.gov (United States)

    Davies, Brendan; Wang, Xiong; Dvorak, Cheryl M T; Marthaler, Douglas; Murtaugh, Michael P

    2016-06-02

    Porcine circovirus 2 (PCV2) was prevalent in swine in the United States before PCV2-associated disease (PCVAD) appeared in 2006. Limited nucleotide sequencing of open reading frame 2 (ORF2) encoding capsid, the only structural protein, revealed the presence of two genotypes, PCV2a and PCV2b. Later, PCV2c and mutant PCV2b, or PCV2d, were also described. However, extensive PCV2 ORF2 sequence databases in veterinary diagnostic laboratories have not been analyzed systematically to determine the genetic diversity of field isolates. Here, we interrogated >1100 PCV2 ORF2 nucleotide sequences to assess population diversity and genetic variation. We detected a novel PCV2 genotype that is substantially different, primarily in ORF2, from all known PCV2. Notably, ORF2 contains a unique carboxyl terminal amino acid insertion resulting in a 238 amino acid ORF2. All other PCV2 ORF2 proteins are 233 or 234 aa in length. Phylogenetic analysis indicates that it is more ancient than other PCV2 genotypes. The findings demonstrate the value of analyzing routine diagnostic laboratory sequence databases in population genetic analyses of animal pathogens.

  12. Hierarchical multiple bit clusters and patterned media enabled by novel nanofabrication techniques -- High resolution electron beam lithography and block polymer self assembly

    Science.gov (United States)

    Xiao, Qijun

    This thesis discusses the full scope of a project exploring the physics of hierarchical clusters of interacting nanomagnets. These clusters may be relevant for novel applications such as multilevel data storage devices. The work can be grouped into three main activities: micromagnetic simulation, fabrication and characterization of proof-of-concept prototype devices, and efforts to scale down the structures by creating the hierarchical structures with the aid of diblock copolymer self assembly. Theoretical micromagnetic studies and simulations based on Landau-Lifshitz-Gilbert (LLG) equation were conducted on nanoscale single domain magnetic entities. For the simulated nanomagnet clusters with perpendicular uniaxial anisotropy, the simulation showed the switching field distributions, the stability of the magnetostatic states with distinctive total cluster perpendicular moments, and the stepwise magnetic switching curves. For simulated nanomagnet clusters with in-plane shape anisotropy, the simulation showed the stepwise switching behaviors governed by thermal agitation and cluster configurations. Proof-of-concept cluster devices with three interacting Co nanomagnets were fabricated by e-beam lithography (EBL) and pulse-reverse electrochemical deposition (PRECD). EBL patterning on a suspended 100 nm SiN membrane showed improved lateral lithography resolution to 30 nm. The Co nanomagnets deposited using the PRECD method showed perpendicular anisotropy. The switching experiments with external applied fields were able to switch the Co nanomagnets through the four magnetostatic states with distinctive total perpendicular cluster magnetization, and proved the feasibility of multilevel data storage devices based on the cluster concept. Shrinking the structures size was experimented by the aid of diblock copolymer. Thick poly(styrene)-b-poly(methyl methacrylate) (PS-b-PMMA) diblock copolymer templates aligned with external electrical field were used to fabricate long Ni

  13. Hierarchical Fragmentation and Jet-like Outflows in IRDC G28.34+0.06, a Growing Massive Protostar Cluster

    CERN Document Server

    Wang, Ke; Wu, Yuefang; Zhang, Huawei

    2011-01-01

    We present Submillimeter Array (SMA) \\lambda = 0.88mm observations of an infrared dark cloud (IRDC) G28.34+0.06. Located in the quiescent southern part of the G28.34 cloud, the region of interest is a massive ($>10^3$\\,\\msun) molecular clump P1 with a luminosity of $\\sim 10^3$ \\lsun, where our previous SMA observations at 1.3mm have revealed a string of five dust cores of 22-64 \\msun\\ along the 1 pc IR-dark filament. The cores are well aligned at a position angle of 48 degrees and regularly spaced at an average projected separation of 0.16 pc. The new high-resolution, high-sensitivity 0.88\\,mm image further resolves the five cores into ten compact condensations of 1.4-10.6 \\msun, with sizes a few thousands AU. The spatial structure at clump ($\\sim 1$ pc) and core ($\\sim 0.1$ pc) scales indicates a hierarchical fragmentation. While the clump fragmentation is consistent with a cylindrical collapse, the observed fragment masses are much larger than the expected thermal Jeans masses. All the cores are driving CO(...

  14. Delineation of Stenotrophomonas maltophilia isolates from cystic fibrosis patients by fatty acid methyl ester profiles and matrix-assisted laser desorption/ionization time-of-flight mass spectra using hierarchical cluster analysis and principal component analysis.

    Science.gov (United States)

    Vidigal, Pedrina Gonçalves; Mosel, Frank; Koehling, Hedda Luise; Mueller, Karl Dieter; Buer, Jan; Rath, Peter Michael; Steinmann, Joerg

    2014-12-01

    Stenotrophomonas maltophilia is an opportunist multidrug-resistant pathogen that causes a wide range of nosocomial infections. Various cystic fibrosis (CF) centres have reported an increasing prevalence of S. maltophilia colonization/infection among patients with this disease. The purpose of this study was to assess specific fingerprints of S. maltophilia isolates from CF patients (n = 71) by investigating fatty acid methyl esters (FAMEs) through gas chromatography (GC) and highly abundant proteins by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS), and to compare them with isolates obtained from intensive care unit (ICU) patients (n = 20) and the environment (n = 11). Principal component analysis (PCA) of GC-FAME patterns did not reveal a clustering corresponding to distinct CF, ICU or environmental types. Based on the peak area index, it was observed that S. maltophilia isolates from CF patients produced significantly higher amounts of fatty acids in comparison with ICU patients and the environmental isolates. Hierarchical cluster analysis (HCA) based on the MALDI-TOF MS peak profiles of S. maltophilia revealed the presence of five large clusters, suggesting a high phenotypic diversity. Although HCA of MALDI-TOF mass spectra did not result in distinct clusters predominantly composed of CF isolates, PCA revealed the presence of a distinct cluster composed of S. maltophilia isolates from CF patients. Our data suggest that S. maltophilia colonizing CF patients tend to modify not only their fatty acid patterns but also their protein patterns as a response to adaptation in the unfavourable environment of the CF lung. © 2014 The Authors.

  15. Hierarchical clustering of genetic diversity associated to different levels of mutation and recombination in Escherichia coli: a study based on Mexican isolates.

    Science.gov (United States)

    González-González, Andrea; Sánchez-Reyes, Luna L; Delgado Sapien, Gabriela; Eguiarte, Luis E; Souza, Valeria

    2013-01-01

    Escherichia coli occur as either free-living microorganisms, or within the colons of mammals and birds as pathogenic or commensal bacteria. Although the Mexican population of intestinal E. coli maintains high levels of genetic diversity, the exact mechanisms by which this occurs remain unknown. We therefore investigated the role of homologous recombination and point mutation in the genetic diversification and population structure of Mexican strains of E. coli. This was explored using a multi locus sequence typing (MLST) approach in a non-outbreak related, host-wide sample of 128 isolates. Overall, genetic diversification in this sample appears to be driven primarily by homologous recombination, and to a lesser extent, by point mutation. Since genetic diversity is hierarchically organized according to the MLST genealogy, we observed that there is not a homogeneous recombination rate, but that different rates emerge at different clustering levels such as phylogenetic group, lineage and clonal complex (CC). Moreover, we detected clear signature of substructure among the A+B1 phylogenetic group, where the majority of isolates were differentiated into four discrete lineages. Substructure pattern is revealed by the presence of several CCs associated to a particular life style and host as well as to different genetic diversification mechanisms. We propose these findings as an alternative explanation for the maintenance of the clear phylogenetic signal of this species despite the prevalence of homologous recombination. Finally, we corroborate using both phylogenetic and genetic population approaches as an effective mean to establish epidemiological surveillance tailored to the ecological specificities of each geographic region.

  16. [Cluster analysis in biomedical researches].

    Science.gov (United States)

    Akopov, A S; Moskovtsev, A A; Dolenko, S A; Savina, G D

    2013-01-01

    Cluster analysis is one of the most popular methods for the analysis of multi-parameter data. The cluster analysis reveals the internal structure of the data, group the separate observations on the degree of their similarity. The review provides a definition of the basic concepts of cluster analysis, and discusses the most popular clustering algorithms: k-means, hierarchical algorithms, Kohonen networks algorithms. Examples are the use of these algorithms in biomedical research.

  17. A Performance-Prediction Model for PIC Applications on Clusters of Symmetric MultiProcessors: Validation with Hierarchical HPF+OpenMP Implementation

    Directory of Open Access Journals (Sweden)

    Sergio Briguglio

    2003-01-01

    Full Text Available A performance-prediction model is presented, which describes different hierarchical workload decomposition strategies for particle in cell (PIC codes on Clusters of Symmetric MultiProcessors. The devised workload decomposition is hierarchically structured: a higher-level decomposition among the computational nodes, and a lower-level one among the processors of each computational node. Several decomposition strategies are evaluated by means of the prediction model, with respect to the memory occupancy, the parallelization efficiency and the required programming effort. Such strategies have been implemented by integrating the high-level languages High Performance Fortran (at the inter-node stage and OpenMP (at the intra-node one. The details of these implementations are presented, and the experimental values of parallelization efficiency are compared with the predicted results.

  18. Functional annotation of hierarchical modularity.

    Directory of Open Access Journals (Sweden)

    Kanchana Padmanabhan

    Full Text Available In biological networks of molecular interactions in a cell, network motifs that are biologically relevant are also functionally coherent, or form functional modules. These functionally coherent modules combine in a hierarchical manner into larger, less cohesive subsystems, thus revealing one of the essential design principles of system-level cellular organization and function-hierarchical modularity. Arguably, hierarchical modularity has not been explicitly taken into consideration by most, if not all, functional annotation systems. As a result, the existing methods would often fail to assign a statistically significant functional coherence score to biologically relevant molecular machines. We developed a methodology for hierarchical functional annotation. Given the hierarchical taxonomy of functional concepts (e.g., Gene Ontology and the association of individual genes or proteins with these concepts (e.g., GO terms, our method will assign a Hierarchical Modularity Score (HMS to each node in the hierarchy of functional modules; the HMS score and its p-value measure functional coherence of each module in the hierarchy. While existing methods annotate each module with a set of "enriched" functional terms in a bag of genes, our complementary method provides the hierarchical functional annotation of the modules and their hierarchically organized components. A hierarchical organization of functional modules often comes as a bi-product of cluster analysis of gene expression data or protein interaction data. Otherwise, our method will automatically build such a hierarchy by directly incorporating the functional taxonomy information into the hierarchy search process and by allowing multi-functional genes to be part of more than one component in the hierarchy. In addition, its underlying HMS scoring metric ensures that functional specificity of the terms across different levels of the hierarchical taxonomy is properly treated. We have evaluated our

  19. The Processing of Causal and Hierarchical Relations in Semantic Memory as Revealed by N400 and Frontal Negativity.

    Directory of Open Access Journals (Sweden)

    Xiuling Liang

    Full Text Available Most current studies investigating semantic memory have focused on associative (ring-emerald or taxonomic relations (bird-sparrow. Little is known about the question of how causal relations (virus-epidemic are stored and accessed in semantic memory. The goal of this study was to examine the processing of causally related, general associatively related and hierarchically related word pairs when participants were required to evaluate whether pairs of words were related in any way. The ERP data showed that the N400 amplitude (200-500 ms elicited by unrelated related words was more negative than all related words. Furthermore, the late frontal distributed negativity (500-700 ms elicited by causally related words was smaller than hierarchically related words, but not for general associated words. These results suggested the processing of causal relations and hierarchical relations in semantic memory recruited different degrees of cognitive resources, especially for role binding.

  20. Gene-Set Local Hierarchical Clustering (GSLHC--A Gene Set-Based Approach for Characterizing Bioactive Compounds in Terms of Biological Functional Groups.

    Directory of Open Access Journals (Sweden)

    Feng-Hsiang Chung

    Full Text Available Gene-set-based analysis (GSA, which uses the relative importance of functional gene-sets, or molecular signatures, as units for analysis of genome-wide gene expression data, has exhibited major advantages with respect to greater accuracy, robustness, and biological relevance, over individual gene analysis (IGA, which uses log-ratios of individual genes for analysis. Yet IGA remains the dominant mode of analysis of gene expression data. The Connectivity Map (CMap, an extensive database on genomic profiles of effects of drugs and small molecules and widely used for studies related to repurposed drug discovery, has been mostly employed in IGA mode. Here, we constructed a GSA-based version of CMap, Gene-Set Connectivity Map (GSCMap, in which all the genomic profiles in CMap are converted, using gene-sets from the Molecular Signatures Database, to functional profiles. We showed that GSCMap essentially eliminated cell-type dependence, a weakness of CMap in IGA mode, and yielded significantly better performance on sample clustering and drug-target association. As a first application of GSCMap we constructed the platform Gene-Set Local Hierarchical Clustering (GSLHC for discovering insights on coordinated actions of biological functions and facilitating classification of heterogeneous subtypes on drug-driven responses. GSLHC was shown to tightly clustered drugs of known similar properties. We used GSLHC to identify the therapeutic properties and putative targets of 18 compounds of previously unknown characteristics listed in CMap, eight of which suggest anti-cancer activities. The GSLHC website http://cloudr.ncu.edu.tw/gslhc/ contains 1,857 local hierarchical clusters accessible by querying 555 of the 1,309 drugs and small molecules listed in CMap. We expect GSCMap and GSLHC to be widely useful in providing new insights in the biological effect of bioactive compounds, in drug repurposing, and in function-based classification of complex diseases.

  1. Avoiding progenitor bias: The structural and mass evolution of Brightest Group and Cluster Galaxies in Hierarchical models since z~1

    CERN Document Server

    Shankar, Francesco; Rettura, Alessandro; Bouillot, Vincent; Moreno, Jorge; Licitra, Rossella; Bernardi, Mariangela; Huertas-Company, Marc; Mei, Simona; Ascaso, Begoña; Sheth, Ravi; Delaye, Lauriane; Raichoor, Anand

    2015-01-01

    The mass and structural evolution of massive galaxies is one of the hottest topics in galaxy formation. This is because it may reveal invaluable insights into the still debated evolutionary processes governing the growth and assembly of spheroids. However, direct comparison between models and observations is usually prevented by the so-called "progenitor bias", i.e., new galaxies entering the observational selection at later epochs, thus eluding a precise study of how pre-existing galaxies actually evolve in size. To limit this effect, we here gather data on high-redshift brightest group and cluster galaxies, evolve their (mean) host halo masses down to z=0 along their main progenitors, and assign as their "descendants" local SDSS central galaxies matched in host halo mass. At face value, the comparison between high redshift and local data suggests a noticeable increase in stellar mass of a factor of >2 since z~1, and of >2.5 in mean effective radius. We then compare the inferred stellar mass and size growth ...

  2. The mechanics of bacterial cluster formation on plant leaf surfaces as revealed by bioreporter technology.

    Science.gov (United States)

    Tecon, Robin; Leveau, Johan H J

    2012-05-01

    Bacteria that colonize the leaves of terrestrial plants often occur in clusters whose size varies from a few to thousands of cells. For the formation of such bacterial clusters, two non-mutually exclusive but very different mechanisms may be proposed: aggregation of multiple cells or clonal reproduction of a single cell. Here we assessed the contribution of both mechanisms on the leaves of bean plants that were colonized by the bacterium Pantoea agglomerans. In one approach, we used a mixture of green and red fluorescent P. agglomerans cells to populate bean leaves. We observed that this resulted in clusters made up of only one colour as well as two-colour clusters, thus providing evidence for both mechanisms. Another P. agglomerans bioreporter, designed to quantify the reproductive success of bacterial colonizers by proxy to the rate at which green fluorescent protein is diluted from dividing cells, revealed that during the first hours on the leaf surface, many bacteria were dividing, but not staying together and forming clusters, which is suggestive of bacterial relocation. Together, these findings support a dynamic model of leaf surface colonization, where both aggregative and reproductive mechanisms take place. The bioreporter-based approach we employed here should be broadly applicable towards a more quantitative and mechanistic understanding of bacterial colonization of surfaces in general. © 2012 Society for Applied Microbiology and Blackwell Publishing Ltd.

  3. Environmental quenching and hierarchical cluster assembly: Evidence from spectroscopic ages of red-sequence galaxies in Coma

    CERN Document Server

    Smith, Russell J; Price, James; Hudson, Michael J; Phillipps, Steven

    2011-01-01

    We explore the variation in stellar population ages for Coma cluster galaxies as a function of projected cluster-centric distance, using a sample of 362 red-sequence galaxies with high signal-to-noise spectroscopy. The sample spans a wide range in luminosity (0.02-4 L*) and extends from the cluster core to near the virial radius. We find a clear distinction in the observed trends of the giant and dwarf galaxies. The ages of red-sequence giants are primarily determined by galaxy mass, with only weak modulation by environment, in the sense that galaxies at larger cluster-centric distance are slightly younger. For red-sequence dwarfs (with mass <10^10 Msun), the roles of mass and environment as predictors of age are reversed: there is little dependence on mass, but strong trends with projected cluster-centric radius are observed. The average age of dwarfs at the 2.5 Mpc limit of our sample is approximately half that of dwarfs near the cluster centre. The gradient in dwarf galaxy ages is a global cluster-centr...

  4. Hierarchical rutile TiO2 flower cluster-based high efficiency dye-sensitized solar cells via direct hydrothermal growth on conducting substrates.

    Science.gov (United States)

    Ye, Meidan; Liu, Hsiang-Yu; Lin, Changjian; Lin, Zhiqun

    2013-01-28

    Dye-sensitized solar cells (DSSCs) based on hierarchical rutile TiO(2) flower clusters prepared by a facile, one-pot hydrothermal process exhibit a high efficiency. Complex yet appealing rutile TiO(2) flower films are, for the first time, directly hydrothermally grown on a transparent conducting fluorine-doped tin oxide (FTO) substrate. The thickness and density of as-grown flower clusters can be readily tuned by tailoring growth parameters, such as growth time, the addition of cations of different valence and size, initial concentrations of precursor and cation, growth temperature, and acidity. Notably, the small lattice mismatch between the FTO substrate and rutile TiO(2) renders the epitaxial growth of a compact rutile TiO(2) layer on the FTO glass. Intriguingly, these TiO(2) flower clusters can then be exploited as photoanodes to produce DSSCs, yielding a power conversion efficiency of 2.94% despite their rutile nature, which is further increased to 4.07% upon the TiCl(4) treatment.

  5. Transmission permeability of tuberculosis involving immigrants, revealed by a multicentre analysis of clusters.

    Science.gov (United States)

    Alonso Rodríguez, N; Chaves, F; Iñigo, J; Bouza, E; García de Viedma, D; Andrés, S; Cías, R; Daza, R; Domingo, D; Esteban, J; García, J; Gómez Mampaso, E; Herranz, M; Palenque, E; Ruiz Serrano, M J

    2009-05-01

    In recent years, the number of cases of tuberculosis (TB) among immigrants in Spain has increased markedly, and led to this analysis of the recent transmission patterns of TB in the immigrant population in Madrid. The countries from which the highest number of immigrant cases have been reported were Ecuador (21%), Romania (16%), Morocco (12%), Peru (11%) and Bolivia (9%). Fifty-one per cent of the cases were from South America. In a multicentre study (2004-2006), IS6110 restriction fragment length polymorphism and spoligotyping were used to genotype the Mycobacterium tuberculosis isolates from 632 immigrant cases from 47 countries. A total of 183 cases (29%) were grouped into 59 clusters, which are markers of potential transmission events. Most of the clusters (81%) included patients living in different healthcare districts, and 54% of the clusters were multinational. When a sample of 478 autochthonous cases was included, 53% of the clusters involving immigrants also included autochthonous cases. This study revealed marked transmission permeability among nationalities and between the immigrant and the autochthonous populations.

  6. 一种层次聚类的RDF图语义检索方法研究%Hierarchical clustering-based semantic retrieval of RDF graph

    Institute of Scientific and Technical Information of China (English)

    刘宁; 左凤华; 张俊

    2012-01-01

    The cun-ent research related RDF graph retrieve exists some problems, such as low efficiency of memory usage, low search efficiency and so on. This paper proposed a hierarchical clustering semantic retrieval model on RDF graph and the method based on the model to solve aforesaid problems. That extracting entities from RDF graph and hierarchical clustering by the guidance of the ontology library made the complex graph structure into a tree structure for efficient retrieval. Orientating target object which was one of nodes in the model in RDF conducted the semantic expansion queries. Retrieval efficiency increased because retrieval scope narrow down as construction of retrieval model and recall ratio increased by the semantic expansion queries.%针对当前信息资源描述框架(RDF)检索过程中存在的内存使用过大及检索效率低等问题,提出一个RDF图的层次聚类语义检索模型,设计并实现了相应的检索方法.首先从RDF图中抽取实体数据,在本体库的指导下,通过层次聚类,将复杂的图形结构转换为适合检索的树型结构;根据在树中查找到的目标对象,确定其在RDF图中的位置,进行语义扩充查询.检索模型的构建缩小了检索范围,从而提高了检索效率,其语义扩充查询还可以得到较好的查全率.

  7. Evolutionary divergence in the fungal response to fluconazole revealed by soft clustering

    KAUST Repository

    Kuo, Dwight

    2010-07-23

    Background: Fungal infections are an emerging health risk, especially those involving yeast that are resistant to antifungal agents. To understand the range of mechanisms by which yeasts can respond to anti-fungals, we compared gene expression patterns across three evolutionarily distant species - Saccharomyces cerevisiae, Candida glabrata and Kluyveromyces lactis - over time following fluconazole exposure. Results: Conserved and diverged expression patterns were identified using a novel soft clustering algorithm that concurrently clusters data from all species while incorporating sequence orthology. The analysis suggests complementary strategies for coping with ergosterol depletion by azoles - Saccharomyces imports exogenous ergosterol, Candida exports fluconazole, while Kluyveromyces does neither, leading to extreme sensitivity. In support of this hypothesis we find that only Saccharomyces becomes more azole resistant in ergosterol-supplemented media; that this depends on sterol importers Aus1 and Pdr11; and that transgenic expression of sterol importers in Kluyveromyces alleviates its drug sensitivity. Conclusions: We have compared the dynamic transcriptional responses of three diverse yeast species to fluconazole treatment using a novel clustering algorithm. This approach revealed significant divergence among regulatory programs associated with fluconazole sensitivity. In future, such approaches might be used to survey a wider range of species, drug concentrations and stimuli to reveal conserved and divergent molecular response pathways.

  8. Clusters of Insomnia Disorder: An Exploratory Cluster Analysis of Objective Sleep Parameters Reveals Differences in Neurocognitive Functioning, Quantitative EEG, and Heart Rate Variability.

    Science.gov (United States)

    Miller, Christopher B; Bartlett, Delwyn J; Mullins, Anna E; Dodds, Kirsty L; Gordon, Christopher J; Kyle, Simon D; Kim, Jong Won; D'Rozario, Angela L; Lee, Rico S C; Comas, Maria; Marshall, Nathaniel S; Yee, Brendon J; Espie, Colin A; Grunstein, Ronald R

    2016-11-01

    To empirically derive and evaluate potential clusters of Insomnia Disorder through cluster analysis from polysomnography (PSG). We hypothesized that clusters would differ on neurocognitive performance, sleep-onset measures of quantitative (q)-EEG and heart rate variability (HRV). Research volunteers with Insomnia Disorder (DSM-5) completed a neurocognitive assessment and overnight PSG measures of total sleep time (TST), wake time after sleep onset (WASO), and sleep onset latency (SOL) were used to determine clusters. From 96 volunteers with Insomnia Disorder, cluster analysis derived at least two clusters from objective sleep parameters: Insomnia with normal objective sleep duration (I-NSD: n = 53) and Insomnia with short sleep duration (I-SSD: n = 43). At sleep onset, differences in HRV between I-NSD and I-SSD clusters suggest attenuated parasympathetic activity in I-SSD (P clusters by retaining the I-NSD and splitting the I-SSD cluster into two: I-SSD A (n = 29): defined by high WASO and I-SSD B (n = 14): a second I-SSD cluster with high SOL and medium WASO. The I-SSD B cluster performed worse than I-SSD A and I-NSD for sustained attention (P ≤ 0.05). In an exploratory analysis, q-EEG revealed reduced spectral power also in I-SSD B before (Delta, Alpha, Beta-1) and after sleep-onset (Beta-2) compared to I-SSD A and I-NSD (P ≤ 0.05). Two insomnia clusters derived from cluster analysis differ in sleep onset HRV. Preliminary data suggest evidence for three clusters in insomnia with differences for sustained attention and sleep-onset q-EEG. Insomnia 100 sleep study: Australia New Zealand Clinical Trials Registry (ANZCTR) identification number 12612000049875. URL: https://www.anzctr.org.au/Trial/Registration/TrialReview.aspx?id=347742.

  9. 基于类轮廓层次聚类方法的研究%RESEARCH ON CLASS-PROFILE-BASED HIERARCHICAL CLUSTERING METHOD

    Institute of Scientific and Technical Information of China (English)

    孟海东; 唐旋

    2011-01-01

    传统的聚类算法在考虑类与类之间的连通性特征和近似性特征上往往顾此失彼.首先给出类边界点和类轮廓的基本定义以及寻求方法,然后基于类间连通性特征和近似性特征的综合考虑,拟定一些类间相似性度量标准和方法,最后提出一种基于类轮廓的层次聚类算法.该算法能够有效处理任意形状的簇,且能够区分孤立点和噪声数据.通过对图像数据集和Iris标准数据集的聚类分析,验证了该算法的可行性和有效性.%Traditional clustering algorithms are often incapable of roundly considering the connectivity and similarity characteristics among classes. The thesis firstly presents the fundamental definition of class boundary point and class profile; secondly, with comprehensive consideration based on connectivity characteristics and similarity characteristics among classes, defines some standards and methods for inter class similarity measurement; thirdly, proposes a class-profile-based hierarchical clustering algorithm, which is able to effectively process arbitrary shaped clusters and distinguish isolated points from noise data. The feasibility and effectiveness of the algorithm is validated through clustering analysis on image data sets and Iris standard data sets.

  10. Patterns of hybrid loss of imprinting reveal tissue- and cluster-specific regulation.

    Directory of Open Access Journals (Sweden)

    Christopher D Wiley

    Full Text Available BACKGROUND: Crosses between natural populations of two species of deer mice, Peromyscus maniculatus (BW, and P. polionotus (PO, produce parent-of-origin effects on growth and development. BW females mated to PO males (bwxpo produce growth-retarded but otherwise healthy offspring. In contrast, PO females mated to BW males (POxBW produce overgrown and severely defective offspring. The hybrid phenotypes are pronounced in the placenta and include POxBW conceptuses which lack embryonic structures. Evidence to date links variation in control of genomic imprinting with the hybrid defects, particularly in the POxBW offspring. Establishment of genomic imprinting is typically mediated by gametic DNA methylation at sites known as gDMRs. However, imprinted gene clusters vary in their regulation by gDMR sequences. METHODOLOGY/PRINCIPAL FINDINGS: Here we further assess imprinted gene expression and DNA methylation at different cluster types in order to discern patterns. These data reveal POxBW misexpression at the Kcnq1ot1 and Peg3 clusters, both of which lose ICR methylation in placental tissues. In contrast, some embryonic transcripts (Peg10, Kcnq1ot1 reactivated the silenced allele with little or no loss of DNA methylation. Hybrid brains also display different patterns of imprinting perturbations. Several cluster pairs thought to use analogous regulatory mechanisms are differentially affected in the hybrids. CONCLUSIONS/SIGNIFICANCE: These data reinforce the hypothesis that placental and somatic gene regulation differs significantly, as does that between imprinted gene clusters and between species. That such epigenetic regulatory variation exists in recently diverged species suggests a role in reproductive isolation, and that this variation is likely to be adaptive.

  11. Ultrathin mesoporous Co3O4 nanosheets-constructed hierarchical clusters as high rate capability and long life anode materials for lithium-ion batteries

    Science.gov (United States)

    Wu, Shengming; Xia, Tian; Wang, Jingping; Lu, Feifei; Xu, Chunbo; Zhang, Xianfa; Huo, Lihua; Zhao, Hui

    2017-06-01

    Herein, Ultrathin mesoporous Co3O4 nanosheets-constructed hierarchical clusters (UMCN-HCs) have been successfully synthesized via a facile hydrothermal method followed by a subsequent thermolysis treatment at 600 °C in air. The products consist of cluster-like Co3O4 microarchitectures, which are assembled by numerous ultrathin mesoporous Co3O4 nanosheets. When tested as anode materials for lithium-ion batteries, UMCN-HCs deliver a high reversible capacity of 1067 mAh g-1 at a current density of 100 mA g-1 after 100 cycles. Even at 2 A g-1, a stable capacity as high as 507 mAh g-1 can be achieved after 500 cycles. The high reversible capacity, excellent cycling stability, and good rate capability of UMCN-HCs may be attributed to their mesoporous sheet-like nanostructure. The sheet-layered structure of UMCN-HCs may buffer the volume change during the lithiation-delithiation process, and the mesoporous characteristic make lithium-ion transfer more easily at the interface between the active electrode and the electrolyte.

  12. Fingerprint analysis of Hibiscus mutabilis L. leaves based on ultra performance liquid chromatography with photodiode array detector combined with similarity analysis and hierarchical clustering analysis methods

    Directory of Open Access Journals (Sweden)

    Xianrui Liang

    2013-01-01

    Full Text Available Background: A method for chemical fingerprint analysis of Hibiscus mutabilis L. leaves was developed based on ultra performance liquid chromatography with photodiode array detector (UPLC-PAD combined with similarity analysis (SA and hierarchical clustering analysis (HCA. Materials and Methods: 10 batches of Hibiscus mutabilis L. leaves samples were collected from different regions of China. UPLC-PAD was employed to collect chemical fingerprints of Hibiscus mutabilis L. leaves. Results: The relative standard deviations (RSDs of the relative retention times (RRT and relative peak areas (RPA of 10 characteristic peaks (one of them was identified as rutin in precision, repeatability and stability test were less than 3%, and the method of fingerprint analysis was validated to be suitable for the Hibiscus mutabilis L. leaves. Conclusions: The chromatographic fingerprints showed abundant diversity of chemical constituents qualitatively in the 10 batches of Hibiscus mutabilis L. leaves samples from different locations by similarity analysis on basis of calculating the correlation coefficients between each two fingerprints. Moreover, the HCA method clustered the samples into four classes, and the HCA dendrogram showed the close or distant relations among the 10 samples, which was consistent to the SA result to some extent.

  13. Investigating the provenance of iron artifacts of the Royal Iron Factory of Sao Joao de Ipanema by hierarchical cluster analysis of EDS microanalyses of slag inclusions

    Energy Technology Data Exchange (ETDEWEB)

    Mamani-Calcina, Elmer Antonio; Landgraf, Fernando Jose Gomes; Azevedo, Cesar Roberto de Farias, E-mail: c.azevedo@usp.br [Universidade de Sao Paulo (USP), Sao Paulo, SP (Brazil). Escola Politecnica. Departmento de Engenharia Metalurgica e de Materiais

    2017-01-15

    Microstructural characterization techniques, including EDX (Energy Dispersive X-ray Analysis) microanalyses, were used to investigate the slag inclusions in the microstructure of ferrous artifacts of the Royal Iron Factory of Sao Joao de Ipanema (first steel plant of Brazil, XIX century), the D. Pedro II Bridge (located in Bahia, assembled in XIX century and produced in Scotland) and the archaeological sites of Sao Miguel de Missoes (Rio Grande do Sul, Brazil, production site of iron artifacts, the XVIII century) and Afonso Sardinha (Sao Paulo, Brazil production site of iron artifacts, XVI century). The microanalyses results of the main micro constituents of the microstructure of the slag inclusions were investigated by hierarchical cluster analysis and the dendrogram with the microanalyses results of the wüstite phase (using as critical variables the contents of MnO, MgO, Al{sub 2}O{sub 3}, V{sub 2}O{sub 5} and TiO{sub 2}) allowed the identification of four clusters, which successfully represented the samples of the four investigated sites (Ipanema, Sardinha, Missoes and Bahia). Finally, the comparatively low volumetric fraction of slag inclusions in the samples of Ipanema (∼1%) suggested the existence of technological expertise at the iron making processing in the Royal Iron Factory of Sao Joao de Ipanema. (author)

  14. Critérios de formação de carteiras de ativos por meio de Hierarchical Clusters

    Directory of Open Access Journals (Sweden)

    Pierre Lucena

    2010-04-01

    Full Text Available Este artigo tem como objetivo principal apresentar e testar uma ferramenta de estatística multivariada em modelos financeiros. Essa metodologia, conhecida como análise de clusters, separa as observações em grupos com suas determinadas características, em contraste com a metodologia tradicional, que é somente a ordem com os quantis. Foi aplicada essa ferramenta em 213 ações negociadas na Bolsa de São Paulo (Bovespa, separando os grupos por tamanho e book-tomarket. Depois, as novas carteiras foram aplicadas no modelo de Fama e French (1996, comparando os resultados numa formação de carteira para quantil e análise de cluster. Foram encontrados melhores resultados na segunda metodologia. Os autores concluem que a análise de cluster pode ser mais adequada porque tende a formar grupos mais homogeneizados, sendo sua aplicação útil para a formação de carteiras e para a teoria financeira.

  15. Complete Genome Sequence of the Filamentous Fungus Aspergillus westerdijkiae Reveals the Putative Biosynthetic Gene Cluster of Ochratoxin A

    Science.gov (United States)

    Chakrabortti, Alolika; Li, Jinming

    2016-01-01

    Ochratoxin A (OTA) is a common mycotoxin that contaminates food and agricultural products. Sequencing of the complete genome of Aspergillus westerdijkiae, a major producer of OTA, reveals more than 50 biosynthetic gene clusters, including a putative OTA biosynthetic gene cluster that encodes a dozen of enzymes, transporters, and regulatory proteins. PMID:27635003

  16. HUBBLE SPACE TELESCOPE REVEALS MULTIPLE SUB-GIANT BRANCH IN EIGHT GLOBULAR CLUSTERS

    Energy Technology Data Exchange (ETDEWEB)

    Piotto, G.; Nascimbeni, V. [Dipartimento di Astronomia, Universita di Padova, Vicolo dell' Osservatorio 3, I-35122 Padova (Italy); Milone, A. P.; Aparicio, A. [Instituto de Astrofisica de Canarias, E-38200 La Laguna, Tenerife, Canary Islands (Spain); Anderson, J.; Bellini, A. [Space Telescope Science Institute, 3800 San Martin Drive, Baltimore, MD 21218 (United States); Bedin, L. R. [INAF-Osservatorio Astronomico di Padova, Vicolo dell' Osservatorio 5, I-35122 Padova (Italy); Cassisi, S. [INAF-Osservatorio Astronomico di Collurania, via Mentore Maggini, I-64100 Teramo (Italy); Marino, A. F., E-mail: giampaolo.piotto@unipd.it, E-mail: luigi.bedin@oapd.inaf.it, E-mail: milone@iac.es, E-mail: aparicio@iac.es, E-mail: jayander@stsci.edu, E-mail: bellini@stsci.edu, E-mail: cassisi@oa-teramo.inaf.it, E-mail: amarino@MPA-Garching.MPG.DE [Max Planck Institute for Astrophysics, Postfach 1317, D-85741 Garching (Germany)

    2012-11-20

    In the last few years many globular clusters (GCs) have revealed complex color-magnitude diagrams, with the presence of multiple main sequences (MSs), broad or multiple sub-giant branches (SGBs) and MS turnoffs, and broad or split red giant branches (RGBs). After a careful correction for differential reddening, high-accuracy photometry with the Hubble Space Telescope (HST) presented in this paper reveals a broadened or even split SGB in five additional Milky Way GCs: NGC 362, NGC 5286, NGC 6656, NGC 6715, and NGC 7089. In addition, we confirm (with new and archival HST data) the presence of a split SGB in 47 Tuc, NGC 1851, and NGC 6388. The fraction of faint SGB stars with respect to the entire SGB population varies from one cluster to another and ranges from {approx}0.03 for NGC 362 to {approx}0.50 for NGC 6715. The average magnitude difference between the bright SGB and the faint SGB is almost the same at different wavelengths. This peculiarity is consistent with the presence of two groups of stars with either an age difference of about 1-2 Gyr or a significant difference in their overall C+N+O content.

  17. 一种分层分簇的组密钥管理方案%A HIERARCHICAL CLUSTERING-BASED GROUP KEY MANAGEMENT SCHEME

    Institute of Scientific and Technical Information of China (English)

    李珍格; 游林

    2014-01-01

    为了满足无线传感器网络组通信的安全,提出一种分层分簇的组密钥管理方案。该方案采用分层的体系结构,将组中节点分为管理层和普通层。BS通过构造特殊的组密钥多项式更新普通层组密钥,而管理层则采用二元单向函数进行组密钥的协商。分析表明,该方案很好满足了无线传感器网络中组密钥管理的前向安全性,后向安全性,并且减小了存储开销、计算开销和通信开销。%In this paper,a hierarchical clustering-based group key management scheme is proposed in order to satisfy the secure group communication in wireless sensor network.The proposed scheme adopts the hierarchical architecture and divides the nodes in the group into master-node layer and terminal layer.The group key of terminal layer is updated by constructing a special group key polynomial in BS,and the binary one-way function is used by the master-node layer for group key negotiation.Analysis demonstrates that the scheme well satisfies the forward security and backward security of the group key management in WSN,and reduces the storage overhead,computation overhead and communication overhead as well.

  18. Micromechanics of hierarchical materials

    DEFF Research Database (Denmark)

    Mishnaevsky, Leon, Jr.

    2012-01-01

    A short overview of micromechanical models of hierarchical materials (hybrid composites, biomaterials, fractal materials, etc.) is given. Several examples of the modeling of strength and damage in hierarchical materials are summarized, among them, 3D FE model of hybrid composites...... with nanoengineered matrix, fiber bundle model of UD composites with hierarchically clustered fibers and 3D multilevel model of wood considered as a gradient, cellular material with layered composite cell walls. The main areas of research in micromechanics of hierarchical materials are identified, among them......, the investigations of the effects of load redistribution between reinforcing elements at different scale levels, of the possibilities to control different material properties and to ensure synergy of strengthening effects at different scale levels and using the nanoreinforcement effects. The main future directions...

  19. Proteomic and bioinformatic analysis of epithelial tight junction reveals an unexpected cluster of synaptic molecules

    Directory of Open Access Journals (Sweden)

    Tang Vivian W

    2006-12-01

    Full Text Available Abstract Background Zonula occludens, also known as the tight junction, is a specialized cell-cell interaction characterized by membrane "kisses" between epithelial cells. A cytoplasmic plaque of ~100 nm corresponding to a meshwork of densely packed proteins underlies the tight junction membrane domain. Due to its enormous size and difficulties in obtaining a biochemically pure fraction, the molecular composition of the tight junction remains largely unknown. Results A novel biochemical purification protocol has been developed to isolate tight junction protein complexes from cultured human epithelial cells. After identification of proteins by mass spectroscopy and fingerprint analysis, candidate proteins are scored and assessed individually. A simple algorithm has been devised to incorporate transmembrane domains and protein modification sites for scoring membrane proteins. Using this new scoring system, a total of 912 proteins have been identified. These 912 hits are analyzed using a bioinformatics approach to bin the hits in 4 categories: configuration, molecular function, cellular function, and specialized process. Prominent clusters of proteins related to the cytoskeleton, cell adhesion, and vesicular traffic have been identified. Weaker clusters of proteins associated with cell growth, cell migration, translation, and transcription are also found. However, the strongest clusters belong to synaptic proteins and signaling molecules. Localization studies of key components of synaptic transmission have confirmed the presence of both presynaptic and postsynaptic proteins at the tight junction domain. To correlate proteomics data with structure, the tight junction has been examined using electron microscopy. This has revealed many novel structures including end-on cytoskeletal attachments, vesicles fusing/budding at the tight junction membrane domain, secreted substances encased between the tight junction kisses, endocytosis of tight junction

  20. Hierarchical Cluster-based Partial Least Squares Regression (HC-PLSR is an efficient tool for metamodelling of nonlinear dynamic models

    Directory of Open Access Journals (Sweden)

    Omholt Stig W

    2011-06-01

    Full Text Available Abstract Background Deterministic dynamic models of complex biological systems contain a large number of parameters and state variables, related through nonlinear differential equations with various types of feedback. A metamodel of such a dynamic model is a statistical approximation model that maps variation in parameters and initial conditions (inputs to variation in features of the trajectories of the state variables (outputs throughout the entire biologically relevant input space. A sufficiently accurate mapping can be exploited both instrumentally and epistemically. Multivariate regression methodology is a commonly used approach for emulating dynamic models. However, when the input-output relations are highly nonlinear or non-monotone, a standard linear regression approach is prone to give suboptimal results. We therefore hypothesised that a more accurate mapping can be obtained by locally linear or locally polynomial regression. We present here a new method for local regression modelling, Hierarchical Cluster-based PLS regression (HC-PLSR, where fuzzy C-means clustering is used to separate the data set into parts according to the structure of the response surface. We compare the metamodelling performance of HC-PLSR with polynomial partial least squares regression (PLSR and ordinary least squares (OLS regression on various systems: six different gene regulatory network models with various types of feedback, a deterministic mathematical model of the mammalian circadian clock and a model of the mouse ventricular myocyte function. Results Our results indicate that multivariate regression is well suited for emulating dynamic models in systems biology. The hierarchical approach turned out to be superior to both polynomial PLSR and OLS regression in all three test cases. The advantage, in terms of explained variance and prediction accuracy, was largest in systems with highly nonlinear functional relationships and in systems with positive feedback

  1. Segmentation Algorithm for Oil Spill SAR Images Based on Hierarchical Agglomerative Clustering%基于HAC的溢油SAR图像分割算法

    Institute of Scientific and Technical Information of China (English)

    苏腾飞; 孟俊敏; 张晰

    2013-01-01

    图像分割是SAR溢油检测中的关键步骤,但由于SAR影像中存在斑点噪声,使得一般的图像分割算法难以收到理想的效果,严重影响溢油检测的精度.发展一种基于凝聚层次聚类(Hierarchical Agglomerative Clustering,HAC)的溢油SAR图像分割算法.该算法利用多尺度分割的思想,能够有效保持SAR影像中溢油斑块的形状特征,并能减少细碎斑块的产生.利用2010年墨西哥湾的Envisat ASAR影像开展了溢油SAR图像分割实验,并将该算法和Canny边缘检测、OTSU阈值分割、FCM分割、水平集分割等方法进行了对比.结果显示,HAC方法可以有效减少细碎斑块的产生,有助于提高SAR溢油检测的精度.%Image segmentation is a crucial stage in the SAR oil spill detection.However,the common image segmentation algorithms can hardly achieve satisfactory results due to speckle noise in the SAR images,thus affecting seriously the accuracy of oil spill detection.For this reason,an image segmentation algorithm which is based on HAC (Hierarchical Agglomerative Clustering) is developed for the oil spill SAR images.This method takes advantage of multi-resolution segmentation to maintain effectively the shape property of oil spill patches,and can reduce the formation of small patches.By using Envisat ASAR images of the Gulf of Mexico obtained in 2010,an experiment of SAR oil spill image segmentation has been conducted.Comparing with other approaches such as Canny,OTSU,FCM and Levelset,the results show that HAC can effectively reduce the producing of small patches,which is helpful to improve the accuracy of SAR oil spill detection.

  2. Neural Mechanisms of Hierarchical Planning in a Virtual Subway Network.

    Science.gov (United States)

    Balaguer, Jan; Spiers, Hugo; Hassabis, Demis; Summerfield, Christopher

    2016-05-18

    Planning allows actions to be structured in pursuit of a future goal. However, in natural environments, planning over multiple possible future states incurs prohibitive computational costs. To represent plans efficiently, states can be clustered hierarchically into "contexts". For example, representing a journey through a subway network as a succession of individual states (stations) is more costly than encoding a sequence of contexts (lines) and context switches (line changes). Here, using functional brain imaging, we asked humans to perform a planning task in a virtual subway network. Behavioral analyses revealed that humans executed a hierarchically organized plan. Brain activity in the dorsomedial prefrontal cortex and premotor cortex scaled with the cost of hierarchical plan representation and unique neural signals in these regions signaled contexts and context switches. These results suggest that humans represent hierarchical plans using a network of caudal prefrontal structures. VIDEO ABSTRACT.

  3. Partial least square and hierarchical clustering in ADMET modeling: prediction of blood-brain barrier permeation of α-adrenergic and imidazoline receptor ligands.

    Science.gov (United States)

    Nikolic, Katarina; Filipic, Slavica; Smoliński, Adam; Kaliszan, Roman; Agbaba, Danica

    2013-01-01

    PURPOSE. Rate of brain penetration (logPS), brain/plasma equilibration rate (logPS-brain), and extent of blood-brain barrier permeation (logBB) of 29 α-adrenergic and imidazoline-receptors ligands were examined in Quantitative-Structure-Property Relationship (QSPR) study. METHODS. Experimentally determined chromatographic retention data (logKw at pH 4.4, slope (S) at pH 4.4, logKw at pH 7.4, slope (S) at pH 7.4, logKw at pH 9.1, and slope (S) at pH 9.1) and capillary electrophoresis migration parameters (μeff at pH 4.4, μeff at pH 7.4, and μeff at pH 9.1), together with calculated molecular descriptors, were used as independent variables in the QSPR study by use of partial least square (PLS) methodology. RESULTS. Predictive potential of the formed QSPR models, QSPR(logPS), QSPR(logPS-brain), QSPR(logBB), was confirmed by cross- and external validation. Hydrophilicity (Hy) and H-indices (H7m) were selected as significant parameters negatively correlated with both logPS and logPS-brain, while topological polar surface area (TPSA(NO)) was chosen as molecular descriptor negatively correlated with both logPS and logBB. The principal component analysis (PCA) and hierarchical clustering analysis (HCA) were applied to cluster examined drugs based on their chromatographic, electrophoretic and molecular properties. Significant positive correlations were obtained between the slope (S) at pH 7.4 and logBB in A/B cluster and between the logKw at pH 9.1 and logPS in C/D cluster. CONCLUSIONS. Results of the QSPR, clustering and correlation studies could be used as novel tool for evaluation of blood-brain barrier permeation of related α-adrenergic/imidazoline receptor ligands.This article is open to POST-PUBLICATION REVIEW. Registered readers (see "For Readers") may comment by clicking on ABSTRACT on the issue's contents page.PURPOSE. Rate of brain penetration (logPS), brain/plasma equilibration rate (logPS-brain), and extent of blood-brain barrier permeation (logBB) of 29

  4. Microarray analysis of colorectal cancer stromal tissue reveals upregulation of two oncogenic miRNA clusters.

    Science.gov (United States)

    Nishida, Naohiro; Nagahara, Makoto; Sato, Tetsuya; Mimori, Koshi; Sudo, Tomoya; Tanaka, Fumiaki; Shibata, Kohei; Ishii, Hideshi; Sugihara, Kenichi; Doki, Yuichiro; Mori, Masaki

    2012-06-01

    Cancer stroma plays an important role in the progression of cancer. Although alterations in miRNA expression have been explored in various kinds of cancers, the expression of miRNAs in cancer stroma has not been explored in detail. Using a laser microdissection technique, we collected RNA samples specific for epithelium or stroma from 13 colorectal cancer tissues and four normal tissues, and miRNA microarray and gene expression microarray were carried out. The expression status of miRNAs was confirmed by reverse transcriptase PCR. Furthermore, we investigated whether miRNA expression status in stromal tissue could influence the clinicopathologic factors. Oncogenic miRNAs, including two miRNA clusters, miR-17-92a and miR-106b-25 cluster, were upregulated in cancer stromal tissues compared with normal stroma. Gene expression profiles from cDNA microarray analyses of the same stromal tissue samples revealed that putative targets of these miRNA clusters, predicted by Target Scan, such as TGFBR2, SMAD2, and BMP family genes, were significantly downregulated in cancer stromal tissue. Downregulated putative targets were also found to be involved in cytokine interaction and cellular adhesion. Importantly, expression of miR-25 and miR-92a in stromal tissues was associated with a variety of clinicopathologic factors. Oncogenic miRNAs were highly expressed in cancer stroma. Although further validation is required, the finding that stromal miRNA expression levels were associated with clinicopathologic factors suggests the possibility that miRNAs in cancer stroma are crucially involved in cancer progression.

  5. Revealing the multi-bonding state between hydrogen and graphene-supported Ti clusters

    CERN Document Server

    Takahashi, Keisuke; Omori, Kengo; Mashoff, Torge; Convertino, Domenica; Miseikis, Vaidotas; Coletti, Camilla; Tozzini, Valentina; Heun, Stefan

    2016-01-01

    Hydrogen adsorption on graphene-supported metal clusters has brought much controversy due to the complex nature of the bonding between hydrogen and metal clusters. The bond types of hydrogen and graphene-supported Ti clusters are experimentally and theoretically investigated. Transmission electron microscopy shows that Ti clusters of nanometer-size are formed on graphene. Thermal desorption spectroscopy captures three hydrogen desorption peaks from hydrogenated graphene-supported Ti clusters. First principle calculations also found three types of interaction: Two types of bonds with different partial ionic character and physisorption. The physical origin for this rests on the charge state of the Ti clusters: when Ti clusters are neutral, H2 is dissociated, and H forms bonds with the Ti cluster. On the other hand, H2 is adsorbed in molecular form on positively charged Ti clusters, resulting in physisorption. Thus, this work clarifies the bonding mechanisms of hydrogen on graphene-supported Ti clusters.

  6. STED super-resolution microscopy reveals an array of MINOS clusters along human mitochondria.

    Science.gov (United States)

    Jans, Daniel C; Wurm, Christian A; Riedel, Dietmar; Wenzel, Dirk; Stagge, Franziska; Deckers, Markus; Rehling, Peter; Jakobs, Stefan

    2013-05-28

    The mitochondrial inner membrane organizing system (MINOS) is a conserved large hetero-oligomeric protein complex in the mitochondrial inner membrane, crucial for the maintenance of cristae morphology. MINOS has been suggested to represent the core of an extended protein network that controls mitochondrial function and structure, and has been linked to several human diseases. The spatial arrangement of MINOS within mitochondria is ill-defined, however. Using super-resolution stimulated emission depletion (STED) microscopy and immunogold electron microscopy, we determined the distribution of three known human MINOS subunits (mitofilin, MINOS1, and CHCHD3) in mammalian cells. Super-resolution microscopy revealed that all three subunits form similar clusters within mitochondria, and that MINOS is more abundant in mitochondria around the nucleus than in peripheral mitochondria. At the submitochondrial level, mitofilin, a core MINOS subunit, is preferentially localized at cristae junctions. In primary human fibroblasts, mitofilin labeling uncovered a regularly spaced pattern of clusters arranged in parallel to the cell growth surfaces. We suggest that this array of MINOS complexes might explain the observed phenomenon of largely horizontally arranged cristae junctions that connect the inner boundary membrane to lamellar cristae. The super-resolution images demonstrate an unexpectedly high level of regularity in the nanoscale distribution of the MINOS complex in human mitochondria, supporting an integrating role of MINOS in the structural organization of the organelle.

  7. Characterization of the Biosynthetic Gene Cluster for Benzoxazole Antibiotics A33853 Reveals Unusual Assembly Logic.

    Science.gov (United States)

    Lv, Meinan; Zhao, Junfeng; Deng, Zixin; Yu, Yi

    2015-10-22

    A33853, which shows excellent bioactivity against Leishmania, is a benzoxazole-family compound formed from two moieties of 3-hydroxyanthranilic acid and one 3-hydroxypicolinic acid. In this study, we have identified the gene cluster responsible for the biosynthesis of A33853 in Streptomyces sp. NRRL12068 through genome mining and heterologous expression. Bioinformatics analysis and functional characterization of the orfs contained in the gene cluster revealed that the biosynthesis of A33853 is directed by a group of unusual enzymes. In particular, BomK, annotated as a ketosynthase, was found to catalyze the amide bond formation between 3-hydroxypicolinic and 3-hydroxyanthranilic acid during the assembly of A33853. BomJ, a putative ATP-dependent coenzyme A ligase, and BomN, a putative amidohydrolase, were further proposed to be involved in the benzoxazole formation in A33853 according to gene deletion experiments. Finally, we have successfully utilized mutasynthesis to generate two analogs of A33853, which were reported previously to possess excellent anti-leishmanial activity.

  8. Clustering of neural code words revealed by a first-order phase transition

    Science.gov (United States)

    Huang, Haiping; Toyoizumi, Taro

    2016-06-01

    A network of neurons in the central nervous system collectively represents information by its spiking activity states. Typically observed states, i.e., code words, occupy only a limited portion of the state space due to constraints imposed by network interactions. Geometrical organization of code words in the state space, critical for neural information processing, is poorly understood due to its high dimensionality. Here, we explore the organization of neural code words using retinal data by computing the entropy of code words as a function of Hamming distance from a particular reference codeword. Specifically, we report that the retinal code words in the state space are divided into multiple distinct clusters separated by entropy-gaps, and that this structure is shared with well-known associative memory networks in a recallable phase. Our analysis also elucidates a special nature of the all-silent state. The all-silent state is surrounded by the densest cluster of code words and located within a reachable distance from most code words. This code-word space structure quantitatively predicts typical deviation of a state-trajectory from its initial state. Altogether, our findings reveal a non-trivial heterogeneous structure of the code-word space that shapes information representation in a biological network.

  9. Isolation of Hox cluster genes from insects reveals an accelerated sequence evolution rate.

    Directory of Open Access Journals (Sweden)

    Heike Hadrys

    Full Text Available Among gene families it is the Hox genes and among metazoan animals it is the insects (Hexapoda that have attracted particular attention for studying the evolution of development. Surprisingly though, no Hox genes have been isolated from 26 out of 35 insect orders yet, and the existing sequences derive mainly from only two orders (61% from Hymenoptera and 22% from Diptera. We have designed insect specific primers and isolated 37 new partial homeobox sequences of Hox cluster genes (lab, pb, Hox3, ftz, Antp, Scr, abd-a, Abd-B, Dfd, and Ubx from six insect orders, which are crucial to insect phylogenetics. These new gene sequences provide a first step towards comparative Hox gene studies in insects. Furthermore, comparative distance analyses of homeobox sequences reveal a correlation between gene divergence rate and species radiation success with insects showing the highest rate of homeobox sequence evolution.

  10. Hierarchical cluster analysis and chemical characterisation of Myrtus communis L. essential oil from Yemen region and its antimicrobial, antioxidant and anti-colorectal adenocarcinoma properties.

    Science.gov (United States)

    Anwar, Sirajudheen; Crouch, Rebecca A; Awadh Ali, Nasser A; Al-Fatimi, Mohamed A; Setzer, William N; Wessjohann, Ludger

    2017-01-09

    The hydrodistilled essential oil obtained from the dried leaves of Myrtus communis, collected in Yemen, was analysed by GC-MS. Forty-one compounds were identified, representing 96.3% of the total oil. The major constituents of essential oil were oxygenated monoterpenoids (87.1%), linalool (29.1%), 1,8-cineole (18.4%), α-terpineol (10.8%), geraniol (7.3%) and linalyl acetate (7.4%). The essential oil was assessed for its antimicrobial activity using a disc diffusion assay and resulted in moderate to potent antibacterial and antifungal activities targeting mainly Bacillus subtilis, Staphylococcus aureus and Candida albicans. The oil moderately reduced the diphenylpicrylhydrazyl radical (IC50 = 4.2 μL/mL or 4.1 mg/mL). In vitro cytotoxicity evaluation against HT29 (human colonic adenocarcinoma cells) showed that the essential oil exhibited a moderate antitumor effect with IC50 of 110 ± 4 μg/mL. Hierarchical cluster analysis of M. communis has been carried out based on the chemical compositions of 99 samples reported in the literature, including Yemeni sample.

  11. Ultra high performance liquid chromatography with electrospray ionization tandem mass spectrometry coupled with hierarchical cluster analysis to evaluate Wikstroemia indica (L.) C. A. Mey. from different geographical regions.

    Science.gov (United States)

    Wei, Lan; Wang, Xiaobo; Mu, Shanxue; Sun, Lixin; Yu, Zhiguo

    2015-06-01

    A sensitive, rapid and simple ultra high performance liquid chromatography with electrospray ionization tandem mass spectrometry method was developed to determine seven constituents (umbelliferone, apigenin, triumbelletin, daphnoretin, arctigenin, genkwanin and emodin) in Wikstroemia indica (L.) C. A. Mey. The chromatographic analysis was performed on an ACQUITY UPLC® BEH C18 column (2.1 × 50 mm, 1.7 μm) by gradient elution with the mobile phase of 0.05% formic acid aqueous solution (A) and acetonitrile (B). Multiple reaction monitoring mode with positive and negative electrospray ionization interface was carried out to detect the components. This method was validated in terms of specificity, linearity, accuracy, precision and stability. Excellent linear behavior was observed over the certain concentration ranges with the correlation coefficient values higher than 0.999. The intraday and innerday precisions were within 2.0%. The recoveries of seven analytes were 99.4-101.1% with relative standard deviation less than 1.2%. The 18 Wikstroemia indica samples from different origins were classified by hierarchical clustering analysis according to the contents of seven components. The results demonstrated that the developed method could successfully be used to quantify simultaneously of seven components in Wikstroemia indica and could be a helpful tool for the detection and confirmation of the quality of traditional Chinese medicines.

  12. Principal factor and hierarchical cluster analyses for the performance assessment of an urban wastewater treatment plant in the Southeast of Spain.

    Science.gov (United States)

    Bayo, Javier; López-Castellanos, Joaquín

    2016-07-01

    Process performance and operation of wastewater treatment plants (WWTP) are carried out to ensure their compliance with legislative requirements imposed by European Union. Because a high amount of variables are daily measured, a coherent and structured approach of such a system is required to understand its inherent behavior and performance efficiency. In this sense, both principal factor analysis (PFA) and hierarchical cluster analysis (HCA) are multivariate techniques that have been widely applied to extract and structure information for different purposes. In this paper, both statistical tools are applied in an urban WWTP situated in the Southeast of Spain, a zone with special characteristics related to the geochemical background composition of water and an important use of fertilizers. Four main factors were extracted in association with nutrients, the ionic component, the organic load to the WWTP, and the efficiency of the whole process. HCA allowed distinguish between influent and effluent parameters, although a deeper examination resulted in a dendrogram with groupings similar to those previously reported for PFA.

  13. Quantitative and Chemical Fingerprint Analysis for the Quality Evaluation of Receptaculum Nelumbinis by RP-HPLC Coupled with Hierarchical Clustering Analysis

    Directory of Open Access Journals (Sweden)

    Jin-Zhong Wu

    2013-01-01

    Full Text Available A simple and reliable method of high-performance liquid chromatography with photodiode array detection (HPLC-DAD was developed to evaluate the quality of Receptaculum Nelumbinis (dried receptacle of Nelumbo nucifera through establishing chromatographic fingerprint and simultaneous determination of five flavonol glycosides, including hyperoside, isoquercitrin, quercetin-3-O-β-d-glucuronide, isorhamnetin-3-O-β-d-galactoside and syringetin-3-O-β-d-glucoside. In quantitative analysis, the five components showed good regression (R > 0.9998 within linear ranges, and their recoveries were in the range of 98.31%–100.32%. In the chromatographic fingerprint, twelve peaks were selected as the characteristic peaks to assess the similarities of different samples collected from different origins in China according to the State Food and Drug Administration (SFDA requirements. Furthermore, hierarchical cluster analysis (HCA was also applied to evaluate the variation of chemical components among different sources of Receptaculum Nelumbinis in China. This study indicated that the combination of quantitative and chromatographic fingerprint analysis can be readily utilized as a quality control method for Receptaculum Nelumbinis and its related traditional Chinese medicinal preparations.

  14. HILIC-UPLC-MS/MS combined with hierarchical clustering analysis to rapidly analyze and evaluate nucleobases and nucleosides in Ginkgo biloba leaves.

    Science.gov (United States)

    Yao, Xin; Zhou, Guisheng; Tang, Yuping; Guo, Sheng; Qian, Dawei; Duan, Jin-Ao

    2015-02-01

    Ginkgo biloba leaf extract has been widely used in dietary supplements and more recently in some foods and beverages. In addition to the well-known flavonol glycosides and terpene lactones, G. biloba leaves are also rich in nucleobases and nucleosides. To determine the content of nucleobases and nucleosides in G. biloba leaves at trace levels, a reliable method has been established by using hydrophilic interaction ultra performance liquid chromatography coupled with triple-quadrupole tandem mass spectrometry (HILIC-UPLC-TQ-MS/MS) working in multiple reaction monitoring mode. Eleven nucleobases and nucleosides were simultaneously determined in seven min. The proposed method was fully validated in terms of linearity, sensitivity, and repeatability, as well as recovery. Furthermore, hierarchical clustering analysis (HCA) was performed to evaluate and classify the samples according to the contents of the eleven chemical constituents. The established approach could be helpful for evaluation of the potential values as dietary supplements and the quality control of G. biloba leaves, which might also be utilized for the investigation of other medicinal herbs containing nucleobases and nucleosides.

  15. Divisive Analysis (DIANA of hierarchical clustering and GPS data for level of service criteria of urban streets

    Directory of Open Access Journals (Sweden)

    Ashish Kumar Patnaik

    2016-03-01

    Full Text Available Level of Service (LOS for heterogeneous traffic flow on urban streets is not well defined in Indian context. Hence in this study an attempt is taken to classify urban road networks into number of street classes and average travel speeds on street segments into LOS categories. Divisive Analysis (DIANA Clustering is used for such classification of large amount of speed data collected using GPS receiver. DIANA algorithm and silhouette validation parameter are used to classify Free Flow Speeds (FFS into optimal number of classes and the same algorithm is applied on speed data to determine ranges of different LOS categories. Speed ranges for LOS categories (A–F expressed in percentage of FFS are found to be 90, 70, 50, 40, 25 and 20–25 respectively in the present study. On the other hand, in HCM (2000 it has been mentioned these values are 85 and above, 67–85, 50–67, 40–50, 30–40 and 30 and less percent respectively.

  16. Single-molecule atomic force microscopy reveals clustering of the yeast plasma-membrane sensor Wsc1.

    Science.gov (United States)

    Heinisch, Jürgen J; Dupres, Vincent; Wilk, Sabrina; Jendretzki, Arne; Dufrêne, Yves F

    2010-06-14

    Signalling is a key feature of living cells which frequently involves the local clustering of specific proteins in the plasma membrane. How such protein clustering is achieved within membrane microdomains ("rafts") is an important, yet largely unsolved problem in cell biology. The plasma membrane of yeast cells represents a good model to address this issue, since it features protein domains that are sufficiently large and stable to be observed by fluorescence microscopy. Here, we demonstrate the ability of single-molecule atomic force microscopy to resolve lateral clustering of the cell integrity sensor Wsc1 in living Saccharomyces cerevisiae cells. We first localize individual wild-type sensors on the cell surface, revealing that they form clusters of approximately 200 nm size. Analyses of three different mutants indicate that the cysteine-rich domain of Wsc1 has a crucial, not yet anticipated function in sensor clustering and signalling. Clustering of Wsc1 is strongly enhanced in deionized water or at elevated temperature, suggesting its relevance in proper stress response. Using in vivo GFP-localization, we also find that non-clustering mutant sensors accumulate in the vacuole, indicating that clustering may prevent endocytosis and sensor turnover. This study represents the first in vivo single-molecule demonstration for clustering of a transmembrane protein in S. cerevisiae. Our findings indicate that in yeast, like in higher eukaryotes, signalling is coupled to the localized enrichment of sensors and receptors within membrane patches.

  17. Single-molecule atomic force microscopy reveals clustering of the yeast plasma-membrane sensor Wsc1.

    Directory of Open Access Journals (Sweden)

    Jürgen J Heinisch

    Full Text Available Signalling is a key feature of living cells which frequently involves the local clustering of specific proteins in the plasma membrane. How such protein clustering is achieved within membrane microdomains ("rafts" is an important, yet largely unsolved problem in cell biology. The plasma membrane of yeast cells represents a good model to address this issue, since it features protein domains that are sufficiently large and stable to be observed by fluorescence microscopy. Here, we demonstrate the ability of single-molecule atomic force microscopy to resolve lateral clustering of the cell integrity sensor Wsc1 in living Saccharomyces cerevisiae cells. We first localize individual wild-type sensors on the cell surface, revealing that they form clusters of approximately 200 nm size. Analyses of three different mutants indicate that the cysteine-rich domain of Wsc1 has a crucial, not yet anticipated function in sensor clustering and signalling. Clustering of Wsc1 is strongly enhanced in deionized water or at elevated temperature, suggesting its relevance in proper stress response. Using in vivo GFP-localization, we also find that non-clustering mutant sensors accumulate in the vacuole, indicating that clustering may prevent endocytosis and sensor turnover. This study represents the first in vivo single-molecule demonstration for clustering of a transmembrane protein in S. cerevisiae. Our findings indicate that in yeast, like in higher eukaryotes, signalling is coupled to the localized enrichment of sensors and receptors within membrane patches.

  18. Comparative analyses of vertebrate posterior HoxD clusters reveal atypical cluster architecture in the caecilian Typhlonectes natans

    Directory of Open Access Journals (Sweden)

    Amemiya Chris T

    2010-11-01

    Full Text Available Abstract Background The posterior genes of the HoxD cluster play a crucial role in the patterning of the tetrapod limb. This region is under the control of a global, long-range enhancer that is present in all vertebrates. Variation in limb types, as is the case in amphibians, can probably not only be attributed to variation in Hox genes, but is likely to be the product of differences in gene regulation. With a collection of vertebrate genome sequences available today, we used a comparative genomics approach to study the posterior HoxD cluster of amphibians. A frog and a caecilian were included in the study to compare coding sequences as well as to determine the gain and loss of putative regulatory sequences. Results We sequenced the posterior end of the HoxD cluster of a caecilian and performed comparative analyses of this region using HoxD clusters of other vertebrates. We determined the presence of conserved non-coding sequences and traced gains and losses of these footprints during vertebrate evolution, with particular focus on amphibians. We found that the caecilian HoxD cluster is almost three times larger than its mammalian counterpart. This enlargement is accompanied with the loss of one gene and the accumulation of repeats in that area. A similar phenomenon was observed in the coelacanth, where a different gene was lost and expansion of the area where the gene was lost has occurred. At least one phylogenetic footprint present in all vertebrates was lost in amphibians. This conserved region is a known regulatory element and functions as a boundary element in neural tissue to prevent expression of Hoxd genes. Conclusion The posterior part of the HoxD cluster of Typhlonectes natans is among the largest known today. The loss of Hoxd-12 and the expansion of the intergenic region may exert an influence on the limb enhancer, by having to bypass a distance seven times that of regular HoxD clusters. Whether or not there is a correlation with the

  19. Weighted Clustering

    DEFF Research Database (Denmark)

    Ackerman, Margareta; Ben-David, Shai; Branzei, Simina

    2012-01-01

    We investigate a natural generalization of the classical clustering problem, considering clustering tasks in which different instances may have different weights.We conduct the first extensive theoretical analysis on the influence of weighted data on standard clustering algorithms in both...... the partitional and hierarchical settings, characterizing the conditions under which algorithms react to weights. Extending a recent framework for clustering algorithm selection, we propose intuitive properties that would allow users to choose between clustering algorithms in the weighted setting and classify...

  20. Regulators of skeletal development: a cluster analysis of 206 bone tumors reveals diagnostically useful markers.

    Science.gov (United States)

    Horvai, Andrew E; Roy, Ritu; Borys, Dariusz; O'Donnell, Richard J

    2012-11-01

    The molecules Indian hedgehog (IHH), SP7 (also known as osterix), sex-determining region Y-box 9 (SOX9), runt-related transcription factor 2 (RUNX2) and TWIST1 regulate the normal differentiation of osteo- and chondrogenic cells from precursors during skeletal development and remodeling. The aberrant function of the same molecules has been implicated in the pathogenesis of bone tumors. Preliminary studies suggest that antibodies against these molecules have practical, diagnostic or prognostic utility in tumors. However, a comprehensive analysis of the expression of these molecules in a large, diverse set of bone tumors has yet to be reported. The goals of this study were to compare the immunohistochemical profiles of IHH, SP7, SOX9, RUNX2 and TWIST1 among bone tumors and to determine the optimum panel for diagnostic utility. Tissue microarrays prepared from 206 undecalcified tumors (71 osteosarcomas, 26 osteoblastomas/osteoid osteomas, 50 giant cell tumors, 5 chondromyxoid fibromas and 54 chondroblastomas) were stained with antibodies to IHH, SP7, SOX9, RUNX2 and TWIST1. The stains were scored for intensity (0-3+) and distribution. The results were analyzed by cluster analysis. Optimum antibody panels for diagnostic sensitivity and specificity were calculated. Analysis revealed six main clusters that corresponded well to tumor types and suggested a close relationship between the stromal cells of giant cell tumor and the osteoblasts of osteosarcoma. The expression profile of chondromyxoid fibroma and chondroblastoma also suggested related differentiation. The distribution of osteoblastomas and osteoid osteomas was more heterogeneous. RUNX2, SOX9 and TWIST1 represented the most sensitive and specific immunohistochemical panel to distinguish among these diagnoses with the limitation that no result could discriminate between chondroblastoma and chondromyxoid fibroma. IHH and SP7 did not yield additional utility.

  1. Discovering hierarchical structure in normal relational data

    DEFF Research Database (Denmark)

    Schmidt, Mikkel Nørgaard; Herlau, Tue; Mørup, Morten

    2014-01-01

    Hierarchical clustering is a widely used tool for structuring and visualizing complex data using similarity. Traditionally, hierarchical clustering is based on local heuristics that do not explicitly provide assessment of the statistical saliency of the extracted hierarchy. We propose a non-param...

  2. Discovery of Overlapping and Hierarchical Communities Based on Extended Link Cluster Sequence%基于增广边簇序列的重叠层次社区发现

    Institute of Scientific and Technical Information of China (English)

    郭红; 黄佳鑫; 郭昆

    2015-01-01

    The mining and discovery of overlapping and hierarchical communities is a hot topic in the area of social network research. Firstly, an algorithm, discovery of link conmunities based on extended link cluster sequence ( DLC ECS) , is proposed to detect overlapping and hierarchical communities in social networks efficiently. Based on the extended link cluster sequence corresponding to community structures with various densities, the optimal link community is detected after searching for the global optimal density. The link communities are transformed into the node communities, and thus the overlapping communities can be found out. Then, hierarchical link communities extraction based on extended link cluster sequence ( HLCE ECS ) is designed. Hierarchical link communities from the extended link cluster sequence is found by the proposed algorithm. The link communities are transformed into the node communities to find out the overlapping and hierarchical communities. Experimental results on are artificial and real-world datasets demonstrate that DLC ECS algorithm significantly improves the community quality and HLCE ECS algorithm effectively discovers meaningful hierarchical communities.%高质量重叠层次社区的挖掘和发现已成为社会网络研究热点,为更有效地发现社会网络中具有重叠层次性的社区结构,提出基于增广边簇序列的边社区发现算法( DLC ECS)。在产生包含所有可能密度参数对应的社区结构的增广边簇序列的基础上,找出全局最优的密度参数,发现全局最优的边社区结构,将识别的边社区结构转化为节点社区结构,发现具有重叠结构的社区。在该序列的基础上,提出层次边社区提取算法( HLCE ECS),快速发现序列中的层次边社区结构,将识别的边社区结构转化为节点社区结构,发现同时具有重叠和层次结构的社区。在真实数据集和人工数据集上的实验表明,DLC ECS具有

  3. Interpolation centers' selection using hierarchical curvature-based clustering Selección de centros de interpolacion mediante agrupamiento jerárquico basado en curvatura

    Directory of Open Access Journals (Sweden)

    Juan C. Rodríguez

    2010-07-01

    Full Text Available Es ampliamente conocido que algunos campos relacionados con aplicaciones de gráficos realistas requieren modelos tridimensionales altamente detallados. Las tecnologías para esto están bien desarrolladas, sin embargo, en algunos casos los escáneres láser obtienen modelos complejos formados por millones de puntos, por lo que son computacionalmente intratables. En estos casos es conveniente obtener un conjunto reducido de estas muestras con las que reconstruir la superficie de la función. Obtener un enfoque de reducción adecuado que posea un equilibrio entre la pérdida de precisión de la función reconstruida, y el costo computacional es un problema no trivial. En este artículo presentamos un método jerárquico de aglomeración a través de la selección de centros mediante la geométrica, la distribución y la estimación de curvatura de las muestras en el espacio 3D.It is widely known that some fields related to graphic applications require realistic and full detailed three-dimensional models. Technologies for this kind of applications exist. However, in some cases, laser scanner get complex models composed of million of points, making its computationally difficult. In these cases, it is desirable to obtain a reduced set of these samples to reconstruct the function's surface. An appropriate reduction approach with a non-significant loss of accuracy in the reconstructed function with a good balance of computational load is usually a non-trivial problem. In this article, a hierarchical clustering based method by the selection of center using the geometric distribution and curvature estimation of the samples in the 3D space is described.

  4. Cluster analysis reveals a binary effect of storage on boar sperm motility function.

    Science.gov (United States)

    Henning, Heiko; Petrunkina, Anna M; Harrison, Robin A P; Waberski, Dagmar

    2014-06-01

    Storage of liquid-preserved boar spermatozoa is associated with a loss of fertilising ability of the preserved spermatozoa, which standard semen parameters barely reflect. Monitoring responses to molecular effectors of sperm function (e.g. bicarbonate) has proven to be a more sensitive approach to investigating storage effects. Bicarbonate not only initiates capacitation in spermatozoa, but also induces motility activation. This occurs at ejaculation, but also happens throughout passage through the oviduct. In the present study we tested whether the specific response of boar sperm subpopulations to bicarbonate, as assessed by motility activation, is altered with the duration of storage in vitro. Three ejaculates from each of seven boars were diluted in Beltsville thawing solution and stored at 17°C. Only minor changes in the parameters of diluted semen were revealed over a period of 72h storage. For assessment of bicarbonate responses, subsamples of diluted spermatozoa were centrifuged through a discontinuous Percoll gradient after 12, 24 and 72h storage. Subsequently, spermatozoa were incubated in two Ca2+-free variants of Tyrode's medium either without (TyrControl) or with (TyrBic) 15mM bicarbonate, and computer-aided sperm analysis motility measurements were made. Cluster analysis of imaging data from motile spermatozoa revealed the presence of five major sperm subpopulations with distinct motility characteristics, differing between TyrBic and TyrControl at any given time (Psperm motility function descriptors to storage: although the quantitative descriptor (percentage of motile spermatozoa) declines in washed semen samples, the qualitative descriptor (percentage of spermatozoa stimulated into fast linear motion by bicarbonate) is sustained independent of the duration of storage.

  5. Newly developed SSR markers reveal genetic diversity and geographical clustering in spinach (Spinacia oleracea).

    Science.gov (United States)

    Göl, Şurhan; Göktay, Mehmet; Allmer, Jens; Doğanlar, Sami; Frary, Anne

    2017-08-01

    Spinach is a popular leafy green vegetable due to its nutritional composition. It contains high concentrations of vitamins A, E, C, and K, and folic acid. Development of genetic markers for spinach is important for diversity and breeding studies. In this work, Next Generation Sequencing (NGS) technology was used to develop genomic simple sequence repeat (SSR) markers. After cleaning and contig assembly, the sequence encompassed 2.5% of the 980 Mb spinach genome. The contigs were mined for SSRs. A total of 3852 SSRs were detected. Of these, 100 primer pairs were tested and 85% were found to yield clear, reproducible amplicons. These 85 markers were then applied to 48 spinach accessions from worldwide origins, resulting in 389 alleles with 89% polymorphism. The average gene diversity (GD) value of the markers (based on a GD calculation that ranges from 0 to 0.5) was 0.25. Our results demonstrated that the newly developed SSR markers are suitable for assessing genetic diversity and population structure of spinach germplasm. The markers also revealed clustering of the accessions based on geographical origin with clear separation of Far Eastern accessions which had the overall highest genetic diversity when compared with accessions from Persia, Turkey, Europe, and the USA. Thus, the SSR markers have good potential to provide valuable information for spinach breeding and germplasm management. Also they will be helpful for genome mapping and core collection establishment.

  6. Methylome sequencing in triple-negative breast cancer reveals distinct methylation clusters with prognostic value.

    Science.gov (United States)

    Stirzaker, Clare; Zotenko, Elena; Song, Jenny Z; Qu, Wenjia; Nair, Shalima S; Locke, Warwick J; Stone, Andrew; Armstong, Nicola J; Robinson, Mark D; Dobrovic, Alexander; Avery-Kiejda, Kelly A; Peters, Kate M; French, Juliet D; Stein, Sandra; Korbie, Darren J; Trau, Matt; Forbes, John F; Scott, Rodney J; Brown, Melissa A; Francis, Glenn D; Clark, Susan J

    2015-02-02

    Epigenetic alterations in the cancer methylome are common in breast cancer and provide novel options for tumour stratification. Here, we perform whole-genome methylation capture sequencing on small amounts of DNA isolated from formalin-fixed, paraffin-embedded tissue from triple-negative breast cancer (TNBC) and matched normal samples. We identify differentially methylated regions (DMRs) enriched with promoters associated with transcription factor binding sites and DNA hypersensitive sites. Importantly, we stratify TNBCs into three distinct methylation clusters associated with better or worse prognosis and identify 17 DMRs that show a strong association with overall survival, including DMRs located in the Wilms tumour 1 (WT1) gene, bi-directional-promoter and antisense WT1-AS. Our data reveal that coordinated hypermethylation can occur in oestrogen receptor-negative disease, and that characterizing the epigenetic framework provides a potential signature to stratify TNBCs. Together, our findings demonstrate the feasibility of profiling the cancer methylome with limited archival tissue to identify regulatory regions associated with cancer.

  7. Silicon Monoxide Observations Reveal a Cluster of Hidden Compact Outflows in the OMC1 South Region

    CERN Document Server

    Zapata, L A; Rodríguez, L F; O'Dell, C R; Zhang, Q; Muench, A; Zapata, Luis A.; Ho, Paul T. P.; Rodriguez, Luis F.; Zhang, Qizhou; Muench, August

    2006-01-01

    We present high angular resolution ($2\\rlap.{''}8 \\times 1\\rlap.{''}7$) SiO J=5$\\to$4; $v=0$ line observations of the OMC1S region in the Orion Nebula made using the Submillimeter Array (SMA). We detect for the first time a cluster of four compact bipolar and monopolar outflows that show high, moderate and low velocity gas and appear to be energized by millimeter and infrared sources associated with this region. The SiO molecular outflows are compact ($<$ 3500 AU), and in most of the cases, they are located very close to their exciting sources. We thus propose that the SiO thermal emission is tracing the youngest and most highly excited parts of the outflows which cannot be detected by other molecules. Moreover, since the ambient cloud is weak in the SiO line emission, these observations can reveal flows that in other molecular transitions will be confused with the ambient velocity cloud emission. Analysis of their positional-velocity diagrams show that some components of these outflows may be driven by wi...

  8. The mechanics of bacterial cluster formation on plant leaf surfaces as revealed by bioreporter technology

    NARCIS (Netherlands)

    Tecon, R.; Leveau, J.H.J.

    2012-01-01

    Bacteria that colonize the leaves of terrestrial plants often occur in clusters whose size varies from a few to thousands of cells. For the formation of such bacterial clusters, two non-mutually exclusive but very different mechanisms may be proposed: aggregation of multiple cells or clonal

  9. Simultaneous clustering of gene expression data with clinical chemistry and pathological evaluations reveals phenotypic prototypes

    Directory of Open Access Journals (Sweden)

    Wolfinger Russell D

    2007-02-01

    Full Text Available Abstract Background Commonly employed clustering methods for analysis of gene expression data do not directly incorporate phenotypic data about the samples. Furthermore, clustering of samples with known phenotypes is typically performed in an informal fashion. The inability of clustering algorithms to incorporate biological data in the grouping process can limit proper interpretation of the data and its underlying biology. Results We present a more formal approach, the modk-prototypes algorithm, for clustering biological samples based on simultaneously considering microarray gene expression data and classes of known phenotypic variables such as clinical chemistry evaluations and histopathologic observations. The strategy involves constructing an objective function with the sum of the squared Euclidean distances for numeric microarray and clinical chemistry data and simple matching for histopathology categorical values in order to measure dissimilarity of the samples. Separate weighting terms are used for microarray, clinical chemistry and histopathology measurements to control the influence of each data domain on the clustering of the samples. The dynamic validity index for numeric data was modified with a category utility measure for determining the number of clusters in the data sets. A cluster's prototype, formed from the mean of the values for numeric features and the mode of the categorical values of all the samples in the group, is representative of the phenotype of the cluster members. The approach is shown to work well with a simulated mixed data set and two real data examples containing numeric and categorical data types. One from a heart disease study and another from acetaminophen (an analgesic exposure in rat liver that causes centrilobular necrosis. Conclusion The modk-prototypes algorithm partitioned the simulated data into clusters with samples in their respective class group and the heart disease samples into two groups (sick and

  10. Identifying Two Groups of Entitled Individuals: Cluster Analysis Reveals Emotional Stability and Self-Esteem Distinction.

    Science.gov (United States)

    Crowe, Michael L; LoPilato, Alexander C; Campbell, W Keith; Miller, Joshua D

    2016-12-01

    The present study hypothesized that there exist two distinct groups of entitled individuals: grandiose-entitled, and vulnerable-entitled. Self-report scores of entitlement were collected for 916 individuals using an online platform. Model-based cluster analyses were conducted on the individuals with scores one standard deviation above mean (n = 159) using the five-factor model dimensions as clustering variables. The results support the existence of two groups of entitled individuals categorized as emotionally stable and emotionally vulnerable. The emotionally stable cluster reported emotional stability, high self-esteem, more positive affect, and antisocial behavior. The emotionally vulnerable cluster reported low self-esteem and high levels of neuroticism, disinhibition, conventionality, psychopathy, negative affect, childhood abuse, intrusive parenting, and attachment difficulties. Compared to the control group, both clusters reported being more antagonistic, extraverted, Machiavellian, and narcissistic. These results suggest important differences are missed when simply examining the linear relationships between entitlement and various aspects of its nomological network.

  11. Differential cytokine profiles in juvenile idiopathic arthritis subtypes revealed by cluster analysis.

    Science.gov (United States)

    van den Ham, Henk-Jan; de Jager, Wilco; Bijlsma, Johannes W J; Prakken, Berent J; de Boer, Rob J

    2009-08-01

    With the introduction of high-throughput biomarker measurements, traditional analysis of these markers is increasingly difficult. Using samples from a diverse group of patients, we tested the applicability of cluster analysis to these data. Using this method, we aim to visualize some of the patterns specific to certain disease groups. In particular, we focus on juvenile idiopathic arthritis (JIA), a multifactorial autoimmune disorder that ultimately leads to chronic inflammation of the joints. Cytokine measurements were performed using multiplex immunoassays. Using heuristic clustering methods, we set out to compare the pattern of 30 cytokines in plasma and SF of JIA, RA, OA, or diabetes type II patients and healthy controls. Analysis shows that oligo- and polyarticular JIA have similar biomarker profiles, both in plasma and SF. Systemic onset JIA (SoJIA) has a profile distinct from other JIA subtypes, suggesting that they involve different inflammatory processes. SoJIA samples do, however, cluster together with RA in SF, suggesting that these two conditions have similar cytokine profiles. Furthermore, we identify several clusters of ILs and chemokines that are co-expressed, suggesting that they are co-regulated. We show that previously undetected clusters of cytokines and patients can be identified by applying cluster analysis to multiplex data. Cytokine clusters identified in plasma and SF samples were quite different, which underscore the differential cytokine signalling in these two compartments, and suggest that plasma samples may not be suitable for estimating joint biomarker profiles and inflammation.

  12. Mechanistic insights into the distribution of carbohydrate clusters on cell membranes revealed by dSTORM imaging

    Science.gov (United States)

    Chen, Junling; Gao, Jing; Cai, Mingjun; Xu, Haijiao; Jiang, Junguang; Tian, Zhiyuan; Wang, Hongda

    2016-07-01

    Cell surface carbohydrates play significant roles in many physiological processes and act as primary markers to indicate various cellular physiological states. The functions of carbohydrates are always associated with their expression and distribution on cell membranes. Based on our previous work, we found that carbohydrates tend to form clusters; however, the underlying mechanism of these clusters remains unknown. Through the direct stochastic optical reconstruction microscopy (dSTORM) strategy, we found that with the contributions of lipid raft as a stable factor and actin cytoskeleton as a restrictive factor, carbohydrate clusters can stably exist with restricted size. Additionally, we revealed that the formation of most carbohydrate clusters (Gal and GlcANc clusters) depended on the carbohydrate-binding proteins (i.e., galectins) cross-linking their specific carbohydrate ligands. Our results clarify the organizational mechanism of carbohydrates on cell surfaces from their formation, stable existence and size-restriction, which promotes a better understanding of the relationship between the function and distribution of carbohydrates, as well as the structure of cell membranes.Cell surface carbohydrates play significant roles in many physiological processes and act as primary markers to indicate various cellular physiological states. The functions of carbohydrates are always associated with their expression and distribution on cell membranes. Based on our previous work, we found that carbohydrates tend to form clusters; however, the underlying mechanism of these clusters remains unknown. Through the direct stochastic optical reconstruction microscopy (dSTORM) strategy, we found that with the contributions of lipid raft as a stable factor and actin cytoskeleton as a restrictive factor, carbohydrate clusters can stably exist with restricted size. Additionally, we revealed that the formation of most carbohydrate clusters (Gal and GlcANc clusters) depended on the

  13. Sequencing and comparative analysis of fugu protocadherin clusters reveal diversity of protocadherin genes among teleosts

    Directory of Open Access Journals (Sweden)

    Rajasegaran Vikneswari

    2007-03-01

    Full Text Available Abstract Background The synaptic cell adhesion molecules, protocadherins, are a vertebrate innovation that accompanied the emergence of the neural tube and the elaborate central nervous system. In mammals, the protocadherins are encoded by three closely-linked clusters (α, β and γ of tandem genes and are hypothesized to provide a molecular code for specifying the remarkably-diverse neural connections in the central nervous system. Like mammals, the coelacanth, a lobe-finned fish, contains a single protocadherin locus, also arranged into α, β and γ clusters. Zebrafish, however, possesses two protocadherin loci that contain more than twice the number of genes as the coelacanth, but arranged only into α and γ clusters. To gain further insight into the evolutionary history of protocadherin clusters, we have sequenced and analyzed protocadherin clusters from the compact genome of the pufferfish, Fugu rubripes. Results Fugu contains two unlinked protocadherin loci, Pcdh1 and Pcdh2, that collectively consist of at least 77 genes. The fugu Pcdh1 locus has been subject to extensive degeneration, resulting in the complete loss of Pcdh1γ cluster. The fugu Pcdh genes have undergone lineage-specific regional gene conversion processes that have resulted in a remarkable regional sequence homogenization among paralogs in the same subcluster. Phylogenetic analyses show that most protocadherin genes are orthologous between fugu and zebrafish either individually or as paralog groups. Based on the inferred phylogenetic relationships of fugu and zebrafish genes, we have reconstructed the evolutionary history of protocadherin clusters in the teleost fish lineage. Conclusion Our results demonstrate the exceptional evolutionary dynamism of protocadherin genes in vertebrates in general, and in teleost fishes in particular. Besides the 'fish-specific' whole genome duplication, the evolution of protocadherin genes in teleost fishes is influenced by lineage

  14. Sequencing rare marine actinomycete genomes reveals high density of unique natural product biosynthetic gene clusters.

    Science.gov (United States)

    Schorn, Michelle A; Alanjary, Mohammad M; Aguinaldo, Kristen; Korobeynikov, Anton; Podell, Sheila; Patin, Nastassia; Lincecum, Tommie; Jensen, Paul R; Ziemert, Nadine; Moore, Bradley S

    2016-12-01

    Traditional natural product discovery methods have nearly exhausted the accessible diversity of microbial chemicals, making new sources and techniques paramount in the search for new molecules. Marine actinomycete bacteria have recently come into the spotlight as fruitful producers of structurally diverse secondary metabolites, and remain relatively untapped. In this study, we sequenced 21 marine-derived actinomycete strains, rarely studied for their secondary metabolite potential and under-represented in current genomic databases. We found that genome size and phylogeny were good predictors of biosynthetic gene cluster diversity, with larger genomes rivalling the well-known marine producers in the Streptomyces and Salinispora genera. Genomes in the Micrococcineae suborder, however, had consistently the lowest number of biosynthetic gene clusters. By networking individual gene clusters into gene cluster families, we were able to computationally estimate the degree of novelty each genus contributed to the current sequence databases. Based on the similarity measures between all actinobacteria in the Joint Genome Institute's Atlas of Biosynthetic gene Clusters database, rare marine genera show a high degree of novelty and diversity, with Corynebacterium, Gordonia, Nocardiopsis, Saccharomonospora and Pseudonocardia genera representing the highest gene cluster diversity. This research validates that rare marine actinomycetes are important candidates for exploration, as they are relatively unstudied, and their relatives are historically rich in secondary metabolites.

  15. The formation history of massive cluster galaxies as revealed by CARLA

    CERN Document Server

    Cooke, E A; Rettura, A; Wylezalek, D; Galametz, A; Stern, D; Brodwin, M; Muldrew, S I; Almaini, O; Conselice, C J; Eisenhardt, P R; Hartley, W G; Jarvis, M; Seymour, N; Stanford, S A

    2015-01-01

    We use a sample of 37 of the densest clusters and protoclusters across $1.3 \\le z \\le 3.2$ from the Clusters Around Radio-Loud AGN (CARLA) survey to study the formation of massive cluster galaxies. We use optical $i'$-band and infrared 3.6$\\mu$m and 4.5$\\mu$m images to statistically select sources within these protoclusters and measure their median observed colours; $\\langle i'-[3.6] \\rangle$. We find the abundance of massive galaxies within the protoclusters increases with decreasing redshift, suggesting these objects may form an evolutionary sequence, with the lower redshift clusters in the sample having similar properties to the descendants of the high redshift protoclusters. We find that the protocluster galaxies have an approximately unevolving observed-frame $i'-[3.6]$ colour across the examined redshift range. We compare the evolution of the $\\langle i'-[3.6] \\rangle$ colour of massive cluster galaxies with simplistic galaxy formation models. Taking the full cluster population into account, we show tha...

  16. A systematic screen reveals MicroRNA clusters that significantly regulate four major signaling pathways.

    Directory of Open Access Journals (Sweden)

    Lindsey E Becker

    Full Text Available MicroRNAs (miRNAs are encoded in the genome as individual miRNA genes or as gene clusters transcribed as polycistronic units. About 50% of all miRNAs are estimated to be co-expressed with neighboring miRNAs. Recent studies have begun to illuminate the importance of the clustering of miRNAs from an evolutionary, as well as a functional standpoint. Many miRNA clusters coordinately regulate multiple members of cellular signaling pathways or protein interaction networks. This cooperative method of targeting could produce effects on an overall process that are much more dramatic than the smaller effects often associated with regulation by an individual miRNA. In this study, we screened 366 human miRNA minigenes to determine their effects on the major signaling pathways culminating in AP-1, NF-κB, c-Myc, or p53 transcriptional activity. By stratifying these data into miRNA clusters, this systematic screen provides experimental evidence for the combined effects of clustered miRNAs on these signaling pathways. We also verify p53 as a direct target of miR-200a. This study is the first to provide a panoramic view of miRNA clusters' effects on cellular pathways.

  17. 基于多空间多层次谱聚类的非监督SAR图像分割算法%Segmentation method for SAR images based on unsupervised spectral clustering of multi-hierarchical region

    Institute of Scientific and Technical Information of China (English)

    田玲; 邓旌波; 廖紫纤; 石博; 何楚

    2013-01-01

    提出了一种基于多层区域谱聚类的非监督SAR图像分割算法(multi-space and multi-hierarchical region based spectral clustering,MSMHSC).该算法首先在特征与几何空间求距离,快速获得初始过分割区域,然后在过分割区域的谱空间上进行聚类,最终实现非监督的SAR图像分割.该方法计算复杂度小,无须训练样本,使用层次化思想使其能更充分地利用SAR图像各类先验与似然信息.在MSTAR真实SAR数据集上的实验验证了该算法的快速性和有效性.%This paper proposed a method based on the hierarchical clustering concept.First,it over-segmented the source image into many small regions.And then,it conducted a spectral clustering algorithm on those regions.The algorithm was tested on the MSTAR SAR data set,and was proved to be fast and efficient.

  18. Research of Parallel Programming Techniques of Hierarchical Model Based on SMP Clusters%基于SMP机群的层次化并行编程技术的研究

    Institute of Scientific and Technical Information of China (English)

    祝永志; 张丹丹; 曹宝香; 禹继国

    2012-01-01

    针对多核SMP机群的体系结构特点,讨论了MPI+ OpenMP混合并行程序设计技术.提出了一种多层次化混合设计新方法.设计了N-body问题的多层次化并行算法,并在曙光5000A机群上与传统的混合算法作了性能方面的比较.结果表明,该层次化混合并行算法具有更好的扩展性和加速比.%For multi-core SMP cluster systems, this paper discusses hybrid parallel programming techniques based on MPI and OpenMP.We propose a new hybrid parallel programming methods lhat are aware of architecture hierarchy on SMP cluster systems. We design a hierarchically parallel algorithm on the N-body problem, and compared its performance with traditional hybrid parallel algorithms on the Dawning 5000A cluster. The results indicate that our hierarchically hybrid parallel algorithm has better scalability and speedup than others.

  19. Intrusion Detection Method Based on Improved Growing Hierarchical Self-Organizing Map

    Institute of Scientific and Technical Information of China (English)

    张亚平; 布文秀; 苏畅; 王璐瑶; 许涵

    2016-01-01

    Considering that growing hierarchical self-organizing map(GHSOM) ignores the influence of individ-ual component in sample vector analysis, and its accurate rate in detecting unknown network attacks is relatively lower, an improved GHSOM method combined with mutual information is proposed. After theoretical analysis, experiments are conducted to illustrate the effectiveness of the proposed method by accurately clustering the input data. Based on different clusters, the complex relationship within the data can be revealed effectively.

  20. Hierarchical photocatalysts.

    Science.gov (United States)

    Li, Xin; Yu, Jiaguo; Jaroniec, Mietek

    2016-05-01

    As a green and sustainable technology, semiconductor-based heterogeneous photocatalysis has received much attention in the last few decades because it has potential to solve both energy and environmental problems. To achieve efficient photocatalysts, various hierarchical semiconductors have been designed and fabricated at the micro/nanometer scale in recent years. This review presents a critical appraisal of fabrication methods, growth mechanisms and applications of advanced hierarchical photocatalysts. Especially, the different synthesis strategies such as two-step templating, in situ template-sacrificial dissolution, self-templating method, in situ template-free assembly, chemically induced self-transformation and post-synthesis treatment are highlighted. Finally, some important applications including photocatalytic degradation of pollutants, photocatalytic H2 production and photocatalytic CO2 reduction are reviewed. A thorough assessment of the progress made in photocatalysis may open new opportunities in designing highly effective hierarchical photocatalysts for advanced applications ranging from thermal catalysis, separation and purification processes to solar cells.

  1. Dynamical age differences among coeval star clusters as revealed by blue stragglers

    CERN Document Server

    Ferraro, F R; Dalessandro, E; Beccari, G; Pasquato, M; Miocchi, P; Rood, R T; Sigurdsson, S; Sills, A; Vesperini, E; Mapelli, M; Contreras, R; Sanna, N; Mucciarelli, A; 10.1038/nature11686

    2012-01-01

    Globular star clusters that formed at the same cosmic time may have evolved rather differently from a dynamical point of view (because that evolution depends on the internal environment) through a variety of processes that tend progressively to segregate stars more massive than the average towards the cluster centre. Therefore clusters with the same chronological age may have reached quite different stages of their dynamical history (that is, they may have different dynamical ages). Blue straggler stars have masses greater than those at the turn-off point on the main sequence and therefore must be the result of either a collision or a mass-transfer event. Because they are among the most massive and luminous objects in old clusters, they can be used as test particles with which to probe dynamical evolution. Here we report that globular clusters can be grouped into a few distinct families on the basis of the radial distribution of blue stragglers. This grouping corresponds well to an effective ranking of the dy...

  2. Global Analysis of miRNA Gene Clusters and Gene Families Reveals Dynamic and Coordinated Expression

    Directory of Open Access Journals (Sweden)

    Li Guo

    2014-01-01

    Full Text Available To further understand the potential expression relationships of miRNAs in miRNA gene clusters and gene families, a global analysis was performed in 4 paired tumor (breast cancer and adjacent normal tissue samples using deep sequencing datasets. The compositions of miRNA gene clusters and families are not random, and clustered and homologous miRNAs may have close relationships with overlapped miRNA species. Members in the miRNA group always had various expression levels, and even some showed larger expression divergence. Despite the dynamic expression as well as individual difference, these miRNAs always indicated consistent or similar deregulation patterns. The consistent deregulation expression may contribute to dynamic and coordinated interaction between different miRNAs in regulatory network. Further, we found that those clustered or homologous miRNAs that were also identified as sense and antisense miRNAs showed larger expression divergence. miRNA gene clusters and families indicated important biological roles, and the specific distribution and expression further enrich and ensure the flexible and robust regulatory network.

  3. Clustering of 770,000 genomes reveals post-colonial population structure of North America

    Science.gov (United States)

    Han, Eunjung; Carbonetto, Peter; Curtis, Ross E.; Wang, Yong; Granka, Julie M.; Byrnes, Jake; Noto, Keith; Kermany, Amir R.; Myres, Natalie M.; Barber, Mathew J.; Rand, Kristin A.; Song, Shiya; Roman, Theodore; Battat, Erin; Elyashiv, Eyal; Guturu, Harendra; Hong, Eurie L.; Chahine, Kenneth G.; Ball, Catherine A.

    2017-02-01

    Despite strides in characterizing human history from genetic polymorphism data, progress in identifying genetic signatures of recent demography has been limited. Here we identify very recent fine-scale population structure in North America from a network of over 500 million genetic (identity-by-descent, IBD) connections among 770,000 genotyped individuals of US origin. We detect densely connected clusters within the network and annotate these clusters using a database of over 20 million genealogical records. Recent population patterns captured by IBD clustering include immigrants such as Scandinavians and French Canadians; groups with continental admixture such as Puerto Ricans; settlers such as the Amish and Appalachians who experienced geographic or cultural isolation; and broad historical trends, including reduced north-south gene flow. Our results yield a detailed historical portrait of North America after European settlement and support substantial genetic heterogeneity in the United States beyond that uncovered by previous studies.

  4. Hierarchical video summarization

    Science.gov (United States)

    Ratakonda, Krishna; Sezan, M. Ibrahim; Crinon, Regis J.

    1998-12-01

    We address the problem of key-frame summarization of vide in the absence of any a priori information about its content. This is a common problem that is encountered in home videos. We propose a hierarchical key-frame summarization algorithm where a coarse-to-fine key-frame summary is generated. A hierarchical key-frame summary facilitates multi-level browsing where the user can quickly discover the content of the video by accessing its coarsest but most compact summary and then view a desired segment of the video with increasingly more detail. At the finest level, the summary is generated on the basis of color features of video frames, using an extension of a recently proposed key-frame extraction algorithm. The finest level key-frames are recursively clustered using a novel pairwise K-means clustering approach with temporal consecutiveness constraint. We also address summarization of MPEG-2 compressed video without fully decoding the bitstream. We also propose efficient mechanisms that facilitate decoding the video when the hierarchical summary is utilized in browsing and playback of video segments starting at selected key-frames.

  5. Gravity in the Crossfire: Revealing the Properties of Dark Matter in Bullet-like Clusters

    Science.gov (United States)

    Clowe, Douglas

    2010-09-01

    We propose to study the physical nature of dark matter by using massive, merging clusters of galaxies. As shown with the Bullet Cluster {1E0657-56}, such massive well-measured systems are critical for our understanding of dark matter. By more than doubling the number of clusters in the sample and obtaining systems at different observation angles, impact parameters, geometrical arrangements, and merger velocities, the systematic uncertainties in the dark matter cross section calculations can be improved substantially, allowing us to move from rough order of magnitude estimates to measurements with quantifiable uncertainties that can be compared usefully with the predictions from numerical simulations, and the constraints on alternate gravity models become unambiguous. Our proposed targets are three extraordinary, merging galaxy clusters with X-ray and optical offsets that are placed at ideal redshifts for such a study; A520, A1758N, and A2163. To pin down the position of the dark matter component we require high resolution, absolutely calibrated mass maps. High resolution gravitational lensing data is needed to attain this goal, which can only be achieved with the excellent resolving power of the HST.

  6. Hierarchical Affinity Propagation

    CERN Document Server

    Givoni, Inmar; Frey, Brendan J

    2012-01-01

    Affinity propagation is an exemplar-based clustering algorithm that finds a set of data-points that best exemplify the data, and associates each datapoint with one exemplar. We extend affinity propagation in a principled way to solve the hierarchical clustering problem, which arises in a variety of domains including biology, sensor networks and decision making in operational research. We derive an inference algorithm that operates by propagating information up and down the hierarchy, and is efficient despite the high-order potentials required for the graphical model formulation. We demonstrate that our method outperforms greedy techniques that cluster one layer at a time. We show that on an artificial dataset designed to mimic the HIV-strain mutation dynamics, our method outperforms related methods. For real HIV sequences, where the ground truth is not available, we show our method achieves better results, in terms of the underlying objective function, and show the results correspond meaningfully to geographi...

  7. 水声传感器网络簇头分层通信模式路由算法%Routing Protocol of Hierarchical Cluster-Communication Model in the Underwater Acoustic Sensor Network

    Institute of Scientific and Technical Information of China (English)

    马绅惟; 刘广钟

    2014-01-01

    Routing protocol plays a very important role in underwater acoustic sensor networks. Based on the traditional TEEN protocol, a new routing protocol named HCM-TEEN(Hierarchical Cluster-communication Model on TEEN) has been put forward. The improved algorithm sets a new threshold function on the basis of the process of cluster candidate and the cluster elimination, and then introduces a Hierarchical Cluster-communication model in the period of data transmission to optimize the routing process. The experiment by the Matlab proved that HCM-TEEN performed better than the traditional algorithm on the network lifetime and the network average residual energy.%路由协议在水声传感器网络研究领域中扮演着非常重要的角色。基于传统的TEEN协议路由算法,提出了水声传感器网络中簇头分层通信模式的路由算法(HCM-TEEN)。新算法从簇头候选与淘汰过程入手,设置新的阈值函数。在簇头确定完成后,在数据传输阶段引入簇头分层通信模式,从距离和能量的角度上优化路由选择。通过Matlab仿真实验显示, HCM-TEEN算法与传统的算法相比在网络生命周期和节点平均剩余能量上都更具优越性。

  8. Hierarchical Cont-Bouchaud model

    CERN Document Server

    Paluch, Robert; Holyst, Janusz A

    2015-01-01

    We extend the well-known Cont-Bouchaud model to include a hierarchical topology of agent's interactions. The influence of hierarchy on system dynamics is investigated by two models. The first one is based on a multi-level, nested Erdos-Renyi random graph and individual decisions by agents according to Potts dynamics. This approach does not lead to a broad return distribution outside a parameter regime close to the original Cont-Bouchaud model. In the second model we introduce a limited hierarchical Erdos-Renyi graph, where merging of clusters at a level h+1 involves only clusters that have merged at the previous level h and we use the original Cont-Bouchaud agent dynamics on resulting clusters. The second model leads to a heavy-tail distribution of cluster sizes and relative price changes in a wide range of connection densities, not only close to the percolation threshold.

  9. ALMA Reveals Potential Localized Dust Enrichment from Massive Star Clusters in II Zw 40

    Science.gov (United States)

    Consiglio, S. Michelle; Turner, Jean L.; Beck, Sara; Meier, David S.

    2016-12-01

    We present subarcsecond images of submillimeter CO and continuum emission from a local galaxy forming massive star clusters: the blue compact dwarf galaxy II Zw 40. At ˜0.″4 resolution (20 pc), the CO(3-2), CO(1-0), 3 mm, and 870 μm continuum maps illustrate star formation on the scales of individual molecular clouds. Dust contributes about one-third of the 870 μm continuum emission, with free-free accounting for the rest. On these scales, there is not a good correspondence between gas, dust, and free-free emission. Dust continuum is enhanced toward the star-forming region as compared to the CO emission. We suggest that an unexpectedly low and spatially variable gas-to-dust ratio is the result of rapid and localized dust enrichment of clouds by the massive clusters of the starburst.

  10. Global analysis of biosynthetic gene clusters reveals vast potential of secondary metabolite production in Penicillium species

    DEFF Research Database (Denmark)

    Nielsen, Jens Christian; Grijseels, Sietske; Prigent, Sylvain

    2017-01-01

    Filamentous fungi produce a wide range of bioactive compounds with important pharmaceutical applications, such as antibiotic penicillins and cholesterol-lowering statins. However, less attention has been paid to fungal secondary metabolites compared to those from bacteria. In this study, we...... sequenced the genomes of 9 Penicillium species and, together with 15 published genomes, we investigated the secondary metabolism of Penicillium and identified an immense, unexploited potential for producing secondary metabolites by this genus. A total of 1,317 putative biosynthetic gene clusters (BGCs) were...... identified, and polyketide synthase and non-ribosomal peptide synthetase based BGCs were grouped into gene cluster families and mapped to known pathways. The grouping of BGCs allowed us to study the evolutionary trajectory of pathways based on 6-methylsalicylic acid (6-MSA) synthases. Finally, we cross...

  11. The nature of FS CMa stars as revealed by host young clusters

    Science.gov (United States)

    de la Fuente, D.; Najarro, F.; Trombley, C.; Davies, B.; Figer, D. F.

    2015-05-01

    The nature and evolutionary state of the diverse objects displaying the B[e] phenomenon are reasonably known, except for a rare subtype named FS CMa stars. These are surrounded by compact disks of warm dust whose origin is unclear. Although the luminosity of these objects corresponds to main-sequence stars, mass loss rates derived from emission lines are 2 orders of magnitude larger than predicted by wind theory. Hitherto, FS CMa stars have been only found in isolation, which hinders the study of their nature. In this contribution, we present the discovery of FS CMa stars in two young Galactic clusters, which host Wolf-Rayet stars and OB supergiants. Membership to these coeval populations allows us to constrain the luminosity, circumstellar extinction and age of FS CMa stars in an unprecedented way. Due to their relatively low brightness when compared with coeval evolved massive stars, a high number of these objects may remain unnoticed in young clusters.

  12. ALMA Reveals Potential Localized Dust Enrichment from Massive Star Clusters in II Zw 40

    CERN Document Server

    Consiglio, S Michelle; Beck, Sara; Meier, David S

    2016-01-01

    We present subarcsecond images of submillimeter CO and continuum emission from a local galaxy forming massive star clusters: the blue compact dwarf galaxy II Zw 40. At $\\sim$0.4" resolution (20 pc), the CO(3-2), CO(1-0), 3mm and 870${\\mu}$m continuum maps illustrate star formation on the scales of individual molecular clouds. Dust contributes about a third of the 870${\\mu}$m continuum emission, with free-free accounting for the rest. On these scales, there is not a good correspondence between gas, dust, and free-free emission. Dust continuum is enhanced toward the star-forming region as compared to the CO emission. We suggest that an unexpectedly low and spatially variable gas-to-dust ratio is the result of massive clusters of the starburst.

  13. Coxiella burnetii transcriptional analysis reveals serendipity clusters of regulation in intracellular bacteria.

    Directory of Open Access Journals (Sweden)

    Quentin Leroy

    Full Text Available Coxiella burnetii, the causative agent of the zoonotic disease Q fever, is mainly transmitted to humans through an aerosol route. A spore-like form allows C. burnetii to resist different environmental conditions. Because of this, analysis of the survival strategies used by this bacterium to adapt to new environmental conditions is critical for our understanding of C. burnetii pathogenicity. Here, we report the early transcriptional response of C. burnetii under temperature stresses. Our data show that C. burnetii exhibited minor changes in gene regulation under short exposure to heat or cold shock. While small differences were observed, C. burnetii seemed to respond similarly to cold and heat shock. The expression profiles obtained using microarrays produced in-house were confirmed by quantitative RT-PCR. Under temperature stresses, 190 genes were differentially expressed in at least one condition, with a fold change of up to 4. Globally, the differentially expressed genes in C. burnetii were associated with bacterial division, (pppGpp synthesis, wall and membrane biogenesis and, especially, lipopolysaccharide and peptidoglycan synthesis. These findings could be associated with growth arrest and witnessed transformation of the bacteria to a spore-like form. Unexpectedly, clusters of neighboring genes were differentially expressed. These clusters do not belong to operons or genetic networks; they have no evident associated functions and are not under the control of the same promoters. We also found undescribed but comparable clusters of regulation in previously reported transcriptomic analyses of intracellular bacteria, including Rickettsia sp. and Listeria monocytogenes. The transcriptomic patterns of C. burnetii observed under temperature stresses permits the recognition of unpredicted clusters of regulation for which the trigger mechanism remains unidentified but which may be the result of a new mechanism of epigenetic regulation.

  14. Negative Ion Photoelectron Spectroscopy Reveals Thermodynamic Advantage of Organic Acids in Facilitating Formation of Bisulfate Ion Clusters: Atmospheric Implications

    Energy Technology Data Exchange (ETDEWEB)

    Hou, Gao-Lei; Lin, Wei; Deng, Shihu; Zhang, Jian; Zheng, Weijun; Paesani, Francesco; Wang, Xue B.

    2013-03-07

    Recent lab and field measurements have indicated critical roles of organic acids in enhancing new atmospheric aerosol formation. Such findings have stimulated theoretical studies with the aim of understanding interaction of organic acids with common aerosol nucleation precursors like bisulfate (HSO4-). In this Letter, we report a combined negative ion photoelectron spectroscopic and theoretical investigation of molecular clusters formed by HSO4- with succinic acid (SUA, HO2C(CH2)2CO2H), HSO4-(SUA)n (n = 0-2), along with HSO4-(H2O)n and HSO4-(H2SO4)n. It is found that one SUA molecule can stabilize HSO4- by ca. 39 kcal/mol, triple the corresponding value that one water molecule is capable of (ca. 13 kcal/mol). Molecular dynamics simulations and quantum chemical calculations reveal the most plausible structures of these clusters and attribute the stability of these clusters due to formation of strong hydrogen bonds. This work provides direct experimental evidence showing significant thermodynamic advantage by involving organic acid molecules to promote formation and growth in bisulfate clusters and aerosols.

  15. Emerging Massive Star Clusters Revealed: High Resolution Imaging of NGC 4449 from the Radio to the Ultraviolet

    CERN Document Server

    Reines, Amy E; Goss, W M

    2008-01-01

    We present a multi-wavelength study of embedded massive clusters in the nearby (3.9 Mpc) starburst galaxy NGC 4449 in an effort to uncover the earliest phases of massive cluster evolution. By combining high resolution imaging from the radio to the ultraviolet, we reveal these clusters in the process of emerging from their gaseous and dusty birth cocoons. We use Very Large Array (VLA) observations at centimeter wavelengths to identify young clusters surrounded by ultra-dense HII regions, detectable via their production of thermal free-free radio continuum. Ultraviolet, optical and infrared observations are obtained from the Hubble and Spitzer Space Telescope archives for comparison. We detect 39 compact radio sources towards NGC 4449 at 3.6 cm using the highest resolution (1.3") and sensitivity (RMS ~ 12 uJy) VLA image of the galaxy to date. We reliably identify 13 thermal radio sources and their physical properties are derived using both nebular emission from the HII regions and SED fitting to the stellar con...

  16. Classification of microvascular patterns via cluster analysis reveals their prognostic significance in glioblastoma.

    Science.gov (United States)

    Chen, Long; Lin, Zhi-Xiong; Lin, Guo-Shi; Zhou, Chang-Fu; Chen, Yu-Peng; Wang, Xing-Fu; Zheng, Zong-Qing

    2015-01-01

    There are limited researches focusing on microvascular patterns (MVPs) in human glioblastoma and their prognostic impact. We evaluated MVPs of 78 glioblastomas by CD34/periodic acid-Schiff dual staining and by cluster analysis of the percentage of microvascular area for distinct microvascular formations. The distribution of 5 types of basic microvascular formations, that is, microvascular sprouting (MS), vascular cluster (VC), vascular garland (VG), glomeruloid vascular proliferation (GVP), and vasculogenic mimicry (VM), was variable. Accordingly, cluster analysis classified MVPs into 2 types: type I MVP displayed prominent MSs and VCs, whereas type II MVP had numerous VGs, GVPs, and VMs. By analyzing the proportion of microvascular area for each type of formation, we determined that glioblastomas with few MSs and VCs had many GVPs and VMs, and vice versa. VG seemed to be a transitional type of formation. In case of type I MVP, expression of Ki-67 and p53 but not MGMT was significantly higher as compared with those of type II MVP (P analysis showed that the type of MVPs presented as an independent prognostic factor of progression-free survival (PFS) and overall survival (OS) (both P < .001). Type II MVP had a more negative influence on PFS and OS than did type I MVP. We conclude that the heterogeneous MVPs in glioblastoma can be categorized properly by certain histopathologic and statistical analyses and may influence clinical outcome. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  17. Structural fragment clustering reveals novel structural and functional motifs in α-helical transmembrane proteins

    Directory of Open Access Journals (Sweden)

    Vassilev Boris

    2010-04-01

    Full Text Available Abstract Background A large proportion of an organism's genome encodes for membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably identify membrane proteins from sequence, to functionally annotate them, and to correctly predict their topology. Results We introduce a technique called structural fragment clustering, which learns sequential motifs from 3D structural fragments. From over 500,000 fragments, we obtain 213 statistically significant, non-redundant, and novel motifs that are highly specific to α-helical transmembrane proteins. From these 213 motifs, 58 of them were assigned to function and checked in the scientific literature for a biological assessment. Seventy percent of the motifs are found in co-factor, ligand, and ion binding sites, 30% at protein interaction interfaces, and 12% bind specific lipids such as glycerol or cardiolipins. The vast majority of motifs (94% appear across evolutionarily unrelated families, highlighting the modularity of functional design in membrane proteins. We describe three novel motifs in detail: (1 a dimer interface motif found in voltage-gated chloride channels, (2 a proton transfer motif found in heme-copper oxidases, and (3 a convergently evolved interface helix motif found in an aspartate symporter, a serine protease, and cytochrome b. Conclusions Our findings suggest that functional modules exist in membrane proteins, and that they occur in completely different evolutionary contexts and cover different binding sites. Structural fragment clustering allows us to link sequence motifs to function through clusters of structural fragments. The sequence motifs can be applied to identify and characterize membrane proteins in novel genomes.

  18. Large-scale SNP analysis reveals clustered and continuous patterns of human genetic variation

    Directory of Open Access Journals (Sweden)

    Shriver Mark D

    2005-06-01

    Full Text Available Abstract Understanding the distribution of human genetic variation is an important foundation for research into the genetics of common diseases. Some of the alleles that modify common disease risk are themselves likely to be common and, thus, amenable to identification using gene-association methods. A problem with this approach is that the large sample sizes required for sufficient statistical power to detect alleles with moderate effect make gene-association studies susceptible to false-positive findings as the result of population stratification 12. Such type I errors can be eliminated by using either family-based association tests or methods that sufficiently adjust for population stratification 345. These methods require the availability of genetic markers that can detect and, thus, control for sources of genetic stratification among populations. In an effort to investigate population stratification and identify appropriate marker panels, we have analysed 11,555 single nucleotide polymorphisms in 203 individuals from 12 diverse human populations. Individuals in each population cluster to the exclusion of individuals from other populations using two clustering methods. Higher-order branching and clustering of the populations are consistent with the geographic origins of populations and with previously published genetic analyses. These data provide a valuable resource for the definition of marker panels to detect and control for population stratification in population-based gene identification studies. Using three US resident populations (European-American, African-American and Puerto Rican, we demonstrate how such studies can proceed, quantifying proportional ancestry levels and detecting significant admixture structure in each of these populations.

  19. Eating or meeting? Cluster analysis reveals intricacies of white shark (Carcharodon carcharias migration and offshore behavior.

    Directory of Open Access Journals (Sweden)

    Salvador J Jorgensen

    Full Text Available Elucidating how mobile ocean predators utilize the pelagic environment is vital to understanding the dynamics of oceanic species and ecosystems. Pop-up archival transmitting (PAT tags have emerged as an important tool to describe animal migrations in oceanic environments where direct observation is not feasible. Available PAT tag data, however, are for the most part limited to geographic position, swimming depth and environmental temperature, making effective behavioral observation challenging. However, novel analysis approaches have the potential to extend the interpretive power of these limited observations. Here we developed an approach based on clustering analysis of PAT daily time-at-depth histogram records to distinguish behavioral modes in white sharks (Carcharodon carcharias. We found four dominant and distinctive behavioral clusters matching previously described behavioral patterns, including two distinctive offshore diving modes. Once validated, we mapped behavior mode occurrence in space and time. Our results demonstrate spatial, temporal and sex-based structure in the diving behavior of white sharks in the northeastern Pacific previously unrecognized including behavioral and migratory patterns resembling those of species with lek mating systems. We discuss our findings, in combination with available life history and environmental data, and propose specific testable hypotheses to distinguish between mating and foraging in northeastern Pacific white sharks that can provide a framework for future work. Our methodology can be applied to similar datasets from other species to further define behaviors during unobservable phases.

  20. Genetic Algorithm for Hierarchical Wireless Sensor Networks

    Directory of Open Access Journals (Sweden)

    Sajid Hussain

    2007-09-01

    Full Text Available Large scale wireless sensor networks (WSNs can be used for various pervasive and ubiquitous applications such as security, health-care, industry automation, agriculture, environment and habitat monitoring. As hierarchical clusters can reduce the energy consumption requirements for WSNs, we investigate intelligent techniques for cluster formation and management. A genetic algorithm (GA is used to create energy efficient clusters for data dissemination in wireless sensor networks. The simulation results show that the proposed intelligent hierarchical clustering technique can extend the network lifetime for different network deployment environments.

  1. 1 Hierarchical Approaches to the Analysis of Genetic Diversity in ...

    African Journals Online (AJOL)

    2015-04-14

    Apr 14, 2015 ... Keywords: Genetic diversity, Hierarchical approach, Plant, Clustering,. Descriptive ... utilization) or by clustering (based on a phonetic analysis of individual ...... Improvement of Food Crop Preservatives for the next Millennium.

  2. 基于改进层次聚类的同家族变压器状态变化规律分析%Condition evolution regularity analysis of power transformer in the same family based on improved hierarchical clustering

    Institute of Scientific and Technical Information of China (English)

    李新叶; 李新芳

    2011-01-01

    Family quality default history affects the healthy condition of power transformer greatly in integrated condition assessment. And now, it is usually subjectively decided by expert's experience. A new quantitatively computing method is proposed, that is, using hierarchical clustering technology to analyze the potential evolution regularity and then computing the influence degree of family quality default history on healthy condition of power transformer. To make the clustering result more accurate, line slope distance of condition evolution is proposed as line shape similarity criterion, both data distance criterion and line slope distance criterion are used to cluster data. The experimental result shows that our method is better than traditional hierarchical clustering method, and it is more reasonable to use clustering analysis to calculate the influence degree of family quality default history on power transformer healthy condition.%在变压器状态综合评估的研究中,家族质量缺陷史对变压器健康状态有重要影响,目前多是凭专家经验主观确定.提出利用层次聚类分析技术对同家族变压器状态变化规律进行分析,根据分析结果定量计算家族质量缺陷史对变压器健康状态的影响程度.为提高聚类的准确性,提出用变压器状态变化曲线的斜率距离作为曲线形状的相似性判据,同时用曲线间点数值距离和斜率距离构成交集约束判据进行聚类.实例分析表明改进的层次聚类算法优于传统的层次聚类算法,由聚类分析结果计算家族质量缺陷史对变压器健康状态的影响得出的结果更合理.

  3. Magnetospherically reflected chorus waves revealed by ray tracing with CLUSTER data

    Directory of Open Access Journals (Sweden)

    M. Parrot

    Full Text Available This paper is related to the propagation characteristics of a chorus emission recorded simultaneously by the 4 satellites of the CLUSTER mission on 29 October 2001 between 01:00 and 05:00 UT. During this day, the spacecraft (SC 1, 2, and 4 are relatively close to each other but SC3 has been delayed by half an hour. We use the data recorded aboard CLUSTER by the STAFF spectrum analyser. This instrument provides the cross spectral matrix of three magnetic and two electric field components. Dedicated software processes this spectral matrix in order to determine the wave normal directions relative to the Earth’s magnetic field. This calculation is done for the 4 satellites at different times and different frequencies and allows us to check the directions of these waves. Measurements around the magnetic equator show that the parallel component of the Poynting vector changes its sign when the satellites cross the equator region. It indicates that the chorus waves propagate away from this region which is considered as the source area of these emissions. This is valid for the most intense waves observed on the magnetic and electric power spectrograms. But it is also observed on SC1, SC2, and SC4 that lower intensity waves propagate toward the equator simultaneously with the SC3 intense chorus waves propagating away from the equator. Both waves are at the same frequency. Using the wave normal directions of these waves, a ray tracing study shows that the waves observed by SC1, SC2, and SC4 cross the equatorial plane at the same location as the waves observed by SC3. SC3 which is 30 minutes late observes the waves that originate first from the equator; meanwhile, SC1, SC2, and SC4 observe the same waves that have suffered a Lower Hybrid Resonance (LHR reflection at low altitudes (based on the ray tracing analysis and now return to the equator at a different location with a lower intensity. Similar phenomenon is observed when all SC are on the other side

  4. Nanoscale Imaging Reveals a Tetraspanin-CD9 Coordinated Elevation of Endothelial ICAM-1 Clusters.

    Directory of Open Access Journals (Sweden)

    Jonas Franz

    Full Text Available Endothelial barriers have a central role in inflammation as they allow or deny the passage of leukocytes from the vasculature into the tissue. To bind leukocytes, endothelial cells form adhesive clusters containing tetraspanins and ICAM-1, so-called endothelial adhesive platforms (EAPs. Upon leukocyte binding, EAPs evolve into docking structures that emanate from the endothelial surface while engulfing the leukocyte. Here, we show that TNF-α is sufficient to induce apical protrusions in the absence of leukocytes. Using advanced quantitation of atomic force microscopy (AFM recordings, we found these structures to protrude by 160 ± 80 nm above endothelial surface level. Confocal immunofluorescence microscopy proved them positive for ICAM-1, JAM-A, tetraspanin CD9 and f-actin. Microvilli formation was inhibited in the absence of CD9. Our findings indicate that stimulation with TNF-α induces nanoscale changes in endothelial surface architecture and that--via a tetraspanin CD9 depending mechanism--the EAPs rise above the surface to facilitate leukocyte capture.

  5. Revealing the Physics and Evolution of Galaxies and Galaxy Clusters with SKA Continuum Surveys

    CERN Document Server

    Prandoni, I

    2014-01-01

    In this chapter we provide an overview of the science enabled by radio continuum surveys in the SKA era, focusing on galaxy/galaxy cluster physics and evolution studies, and other relevant continuum science in the >2020 scientific framework. We outline a number of 'reference' radio-continuum surveys for SKA1 that can address such topics, and comprehensively discuss the most critical science requirements that we have identified. We highlight what should be achieved by SKA1, to guarantee a major leap forwards with respect to the pre-SKA era, considering the science advances expected in the coming years with existing and upcoming telescopes (JVLA, LOFAR, eMERLIN, and the three SKA precursors: MWA, ASKAP and MeerKAT). In this exercise we take in due account also the other waveband facilities coming online at the same time (e.g. Euclid, LSST, etc.), which tackle overlapping scientific goals, but in a different manner. In this respect particular attention has been payed to ensure that the proposed reference surveys...

  6. Revealing O VII from stacked X-ray grating spectra of clusters, groups and elliptical galaxies

    CERN Document Server

    Sanders, J S

    2010-01-01

    We stack 4.6 Ms of high spectral resolution XMM-Newton Reflection Grating Spectrometer spectra from galaxy clusters, groups of galaxies and elliptical galaxies. For those objects with a central temperature of less than 1 keV, we detect O VII for the first time, with a probability of false detection of 2.5x10^-4. The flux ratio of the O VII to Fe XVII lines is 1/4 to 1/8 of the emission expected for isobaric radiative cooling in the absence of heating. There is either a process preventing cooling below 0.5 keV, anomalous O/Fe abundance ratios, absorbing material around the coolest X-ray emitting gas or non-radiative cooling taking place. The mean N VII emission line is strong in the sub-keV sample. As the ratio of the hydrogenic N and O lines is largely independent of temperature, we measure a mean N/O ratio of 4.0 +- 0.6 Solar. Although the continuum around the C VI lines is difficult to measure we can similarly estimate that the C/O ratio is 0.9 +- 0.3 Solar.

  7. Multivariate clustering of progression profiles reveals different depression patterns in prodromal Huntington disease.

    Science.gov (United States)

    Kim, Ji-in; Long, Jeffrey D; Mills, James A; McCusker, Elizabeth; Paulsen, Jane S

    2015-11-01

    Although Huntington disease (HD) is caused by an autosomal dominant mutation, its phenotypic presentation differs widely. Variability in clinical phenotypes of HD may reflect the existence of disease subtypes. This hypothesis was tested in prodromal participants from the longitudinal Neurobiological Predictors of Huntington Disease (PREDICT-HD) study. We performed clustering using longitudinal data assessing motor, cognitive, and depression symptoms. Using data from 521 participants with 2,716 data points, we fit growth mixture models (GMM) that identify groups based on multivariate trajectories. In various GMM, different phases of disease progression were partitioned by progression trajectories of motor and cognitive signs, and by overall level of depression symptoms. More progressed motor signs were accompanied by more progressed cognitive signs, but not always by higher levels of depressive symptoms. In several models, there were at least 2 groups with similar trajectories for motor and cognitive signs that showed different levels for depression symptoms-one with a very low level of depression and the other with a higher level of depression. Findings indicate that at least intermediate HD progression might be associated with different levels of depression. Depression is one of the few symptoms that is treatable in HD and has implications for clinical care. Identification of potential depression subtypes may also help to select appropriate patients for clinical trials. (c) 2015 APA, all rights reserved).

  8. Hierarchical Cluster Analysis of Three-Dimensional Reconstructions of Unbiased Sampled Microglia Shows not Continuous Morphological Changes from Stage 1 to 2 after Multiple Dengue Infections in Callithrix penicillata

    Science.gov (United States)

    Diniz, Daniel G.; Silva, Geane O.; Naves, Thaís B.; Fernandes, Taiany N.; Araújo, Sanderson C.; Diniz, José A. P.; de Farias, Luis H. S.; Sosthenes, Marcia C. K.; Diniz, Cristovam G.; Anthony, Daniel C.; da Costa Vasconcelos, Pedro F.; Picanço Diniz, Cristovam W.

    2016-01-01

    It is known that microglial morphology and function are related, but few studies have explored the subtleties of microglial morphological changes in response to specific pathogens. In the present report we quantitated microglia morphological changes in a monkey model of dengue disease with virus CNS invasion. To mimic multiple infections that usually occur in endemic areas, where higher dengue infection incidence and abundant mosquito vectors carrying different serotypes coexist, subjects received once a week subcutaneous injections of DENV3 (genotype III)-infected culture supernatant followed 24 h later by an injection of anti-DENV2 antibody. Control animals received either weekly anti-DENV2 antibodies, or no injections. Brain sections were immunolabeled for DENV3 antigens and IBA-1. Random and systematic microglial samples were taken from the polymorphic layer of dentate gyrus for 3-D reconstructions, where we found intense immunostaining for TNFα and DENV3 virus antigens. We submitted all bi- or multimodal morphological parameters of microglia to hierarchical cluster analysis and found two major morphological phenotypes designated types I and II. Compared to type I (stage 1), type II microglia were more complex; displaying higher number of nodes, processes and trees and larger surface area and volumes (stage 2). Type II microglia were found only in infected monkeys, whereas type I microglia was found in both control and infected subjects. Hierarchical cluster analysis of morphological parameters of 3-D reconstructions of random and systematic selected samples in control and ADE dengue infected monkeys suggests that microglia morphological changes from stage 1 to stage 2 may not be continuous. PMID:27047345

  9. A Hierarchical Clustering Method Based on the Threshold of Semantic Feature in Big Data%大数据中一种基于语义特征阈值的层次聚类方法

    Institute of Scientific and Technical Information of China (English)

    罗恩韬; 王国军

    2015-01-01

    云计算、健康医疗、街景地图服务、推荐系统等新兴服务促使数据的种类和规模以前所未有的速度增长,数据量的激增会导致很多共性问题.例如数据的可表示,可处理和可靠性问题.如何有效处理和分析数据之间的关系,提高数据的划分效率,建立数据的聚类分析模型,已经成为学术界和企业界共同亟待解决的问题.该文提出一种基于语义特征的层次聚类方法,首先根据数据的语义特征进行训练,然后在每个子集上利用训练结果进行层次聚类,最终产生整体数据的密度中心点,提高了数据聚类效率和准确性.此方法采样复杂度低,数据分析准确,易于实现,具有良好的判定性.%The type and scale of data has been promoted with a hitherto unknown speed by the emerging services including cloud computing, health care, street view services recommendation system and so on. However, the surge in the volume of data may lead to many common problems, such as the representability, reliability and handlability of data. Therefore, how to effectively handle the relationship between the data and the analysis to improve the efficiency of classification of the data and establish the data clustering analysis model has become an academic and business problem, which needs to be solved urgently. A hierarchical clustering method based on semantic feature is proposed. Firstly, the data should be trained according to the semantic features of data, and then is used the training result to process hierarchical clustering in each subset; finally, the density center point is produced. This method can improve the efficiency and accuracy of data clustering. This algorithm is of low complexity about sampling, high accuracy of data analysis and good judgment. Furthermore, the algorithm is easy to realize.

  10. Single-particle analysis reveals shutoff control of the Arabidopsis ammonium transporter AMT1;3 by clustering and internalization.

    Science.gov (United States)

    Wang, Qinli; Zhao, Yuanyuan; Luo, Wangxi; Li, Ruili; He, Qihua; Fang, Xiaohong; Michele, Roberto De; Ast, Cindy; von Wirén, Nicolaus; Lin, Jinxing

    2013-08-01

    Ammonium is a preferred source of nitrogen for plants but is toxic at high levels. Plant ammonium transporters (AMTs) play an essential role in NH4(+) uptake, but the mechanism by which AMTs are regulated remains unclear. To study how AMTs are regulated in the presence of ammonium, we used variable-angle total internal reflection fluorescence microscopy and fluorescence cross-correlation spectroscopy for single-particle fluorescence imaging of EGFP-tagged AMT1;3 on the plasma membrane of Arabidopsis root cells at various ammonium levels. We demonstrated that AMT1;3-EGFP dynamically appeared and disappeared on the plasma membrane as moving fluorescent spots in low oligomeric states under N-deprived and N-sufficient conditions. Under external high-ammonium stress, however, AMT1;3-EGFPs were found to amass into clusters, which were then internalized into the cytoplasm. A similar phenomenon also occurred in the glutamine synthetase mutant gln1;2 background. Single-particle analysis of AMT1;3-EGFPs in the clathrin heavy chain 2 mutant (chc2 mutant) and Flotllin1 artificial microRNA (Flot1 amiRNA) backgrounds, together with chemical inhibitor treatments, demonstrated that the endocytosis of AMT1;3 clusters induced by high-ammonium stress could occur mainly through clathrin-mediated endocytic pathways, but the contribution of microdomain-associated endocytic pathway cannot be excluded in the internalization. Our results revealed that the clustering and endocytosis of AMT1;3 provides an effective mechanism by which plant cells can avoid accumulation of toxic levels of ammonium by eliminating active AMT1;3 from the plasma membrane.

  11. In vivo mutational analysis of the mupirocin gene cluster reveals labile points in the biosynthetic pathway: the "leaky hosepipe" mechanism.

    Science.gov (United States)

    Wu, Ji'en; Hothersall, Joanne; Mazzetti, Carlo; O'Connell, Yvonne; Shields, Jennifer A; Rahman, Ayesha S; Cox, Russell J; Crosby, John; Simpson, Thomas J; Thomas, Christopher M; Willis, Christine L

    2008-06-16

    A common feature of the mupirocin and other gene clusters of the AT-less polyketide synthase (PKS) family of metabolites is the introduction of carbon branches by a gene cassette that contains a beta-hydroxy-beta-methylglutaryl CoA synthase (HMC) homologue and acyl carrier protein (ACP), ketosynthase (KS) and two crotonase superfamily homologues. In vivo studies of Pseudomonas fluorescens strains in which any of these components have been mutated reveal a common phenotype in which the two major isolable metabolites are the truncated hexaketide mupirocin H and the tetraketide mupiric acid. The structure of the latter has been confirmed by stereoselective synthesis. Mupiric acid is also the major metabolite arising from inactivation of the ketoreductase (KR) domain of module 4 of the modular PKS. A number of other mutations in the tailoring region of the mupirocin gene cluster also result in production of both mupirocin H and mupiric acid. To explain this common phenotype we propose a mechanistic rationale in which both mupirocin H and mupiric acid represent the products of selective and spontaneous release from labile points in the pathway that occur at significant levels when mutations block the pathway either close to or distant from the labile points.

  12. Improved annotation of antibiotic resistance determinants reveals microbial resistomes cluster by ecology.

    Science.gov (United States)

    Gibson, Molly K; Forsberg, Kevin J; Dantas, Gautam

    2015-01-01

    Antibiotic resistance is a dire clinical problem with important ecological dimensions. While antibiotic resistance in human pathogens continues to rise at alarming rates, the impact of environmental resistance on human health is still unclear. To investigate the relationship between human-associated and environmental resistomes, we analyzed functional metagenomic selections for resistance against 18 clinically relevant antibiotics from soil and human gut microbiota as well as a set of multidrug-resistant cultured soil isolates. These analyses were enabled by Resfams, a new curated database of protein families and associated highly precise and accurate profile hidden Markov models, confirmed for antibiotic resistance function and organized by ontology. We demonstrate that the antibiotic resistance functions that give rise to the resistance profiles observed in environmental and human-associated microbial communities significantly differ between ecologies. Antibiotic resistance functions that most discriminate between ecologies provide resistance to β-lactams and tetracyclines, two of the most widely used classes of antibiotics in the clinic and agriculture. We also analyzed the antibiotic resistance gene composition of over 6000 sequenced microbial genomes, revealing significant enrichment of resistance functions by both ecology and phylogeny. Together, our results indicate that environmental and human-associated microbial communities harbor distinct resistance genes, suggesting that antibiotic resistance functions are largely constrained by ecology.

  13. Individual Movement Strategies Revealed through Novel Clustering of Emergent Movement Patterns

    Science.gov (United States)

    Valle, Denis; Cvetojevic, Sreten; Robertson, Ellen P.; Reichert, Brian E.; Hochmair, Hartwig H.; Fletcher, Robert J.

    2017-03-01

    Understanding movement is critical in several disciplines but analysis methods often neglect key information by adopting each location as sampling unit, rather than each individual. We introduce a novel statistical method that, by focusing on individuals, enables better identification of temporal dynamics of connectivity, traits of individuals that explain emergent movement patterns, and sites that play a critical role in connecting subpopulations. We apply this method to two examples that span movement networks that vary considerably in size and questions: movements of an endangered raptor, the snail kite (Rostrhamus sociabilis plumbeus), and human movement in Florida inferred from Twitter. For snail kites, our method reveals substantial differences in movement strategies for different bird cohorts and temporal changes in connectivity driven by the invasion of an exotic food resource, illustrating the challenge of identifying critical connectivity sites for conservation in the presence of global change. For human movement, our method is able to reliably determine the origin of Florida visitors and identify distinct movement patterns within Florida for visitors from different places, providing near real-time information on the spatial and temporal patterns of tourists. These results emphasize the need to integrate individual variation to generate new insights when modeling movement data.

  14. 一种基于分层结构的Ad Hoc网络分簇路由协议研究%Research based on the hierarchical structure of the Ad Hoc network clustering routing protocol

    Institute of Scientific and Technical Information of China (English)

    冯永亮

    2015-01-01

    The traditional Ad Hoc network clustering routing protocol has low packet delivery ratio problem, this paper proposes a clustering routing protocol based on hierarchical structure. The advanced network layer using AODV routing protocol based backup, and the lower network layer adopts a smaller delay DSDV protocol. The simulation results show that the improved routing protocol improves the packet delivery rate, Shortening the end to end delay.%传统Ad Hoc网络分簇路由协议存在分组投递率低的问题,论文提出一种基于分层结构的分簇路由协议.高级网络层采用基于备份路由的AODV协议,而低级网络层则采用时延较小的DSDV协议.仿真结果显示,改进后的路由协议提高了分组投递率,缩短了端到端时延.

  15. Cluster analysis of spontaneous preterm birth phenotypes identifies potential associations among preterm birth mechanisms.

    Science.gov (United States)

    Esplin, M Sean; Manuck, Tracy A; Varner, Michael W; Christensen, Bryce; Biggio, Joseph; Bukowski, Radek; Parry, Samuel; Zhang, Heping; Huang, Hao; Andrews, William; Saade, George; Sadovsky, Yoel; Reddy, Uma M; Ilekis, John

    2015-09-01

    We sought to use an innovative tool that is based on common biologic pathways to identify specific phenotypes among women with spontaneous preterm birth (SPTB) to enhance investigators' ability to identify and to highlight common mechanisms and underlying genetic factors that are responsible for SPTB. We performed a secondary analysis of a prospective case-control multicenter study of SPTB. All cases delivered a preterm singleton at SPTB ≤34.0 weeks' gestation. Each woman was assessed for the presence of underlying SPTB causes. A hierarchic cluster analysis was used to identify groups of women with homogeneous phenotypic profiles. One of the phenotypic clusters was selected for candidate gene association analysis with the use of VEGAS software. One thousand twenty-eight women with SPTB were assigned phenotypes. Hierarchic clustering of the phenotypes revealed 5 major clusters. Cluster 1 (n = 445) was characterized by maternal stress; cluster 2 (n = 294) was characterized by premature membrane rupture; cluster 3 (n = 120) was characterized by familial factors, and cluster 4 (n = 63) was characterized by maternal comorbidities. Cluster 5 (n = 106) was multifactorial and characterized by infection (INF), decidual hemorrhage (DH), and placental dysfunction (PD). These 3 phenotypes were correlated highly by χ(2) analysis (PD and DH, P cluster 3 of SPTB. We identified 5 major clusters of SPTB based on a phenotype tool and hierarch clustering. There was significant correlation between several of the phenotypes. The INS gene was associated with familial factors that were underlying SPTB. Copyright © 2015 Elsevier Inc. All rights reserved.

  16. Cluster analysis for applications

    CERN Document Server

    Anderberg, Michael R

    1973-01-01

    Cluster Analysis for Applications deals with methods and various applications of cluster analysis. Topics covered range from variables and scales to measures of association among variables and among data units. Conceptual problems in cluster analysis are discussed, along with hierarchical and non-hierarchical clustering methods. The necessary elements of data analysis, statistics, cluster analysis, and computer implementation are integrated vertically to cover the complete path from raw data to a finished analysis.Comprised of 10 chapters, this book begins with an introduction to the subject o

  17. High-performance supercapacitor and lithium-ion battery based on 3D hierarchical NH4F-induced nickel cobaltate nanosheet-nanowire cluster arrays as self-supported electrodes

    Science.gov (United States)

    Chen, Yuejiao; Qu, Baihua; Hu, Lingling; Xu, Zhi; Li, Qiuhong; Wang, Taihong

    2013-09-01

    A facile hydrothermal method is developed for large-scale production of three-dimensional (3D) hierarchical porous nickel cobaltate nanowire cluster arrays derived from nanosheet arrays with robust adhesion on Ni foam. Based on the morphology evolution upon reaction time, a possible formation process is proposed. The role of NH4F in formation of the structure has also been investigated based on different NH4F amounts. This unique structure significantly enhances the electroactive surface areas of the NiCo2O4 arrays, leading to better interfacial/chemical distributions at the nanoscale, fast ion and electron transfer and good strain accommodation. Thus, when it is used for supercapacitor testing, a specific capacitance of 1069 F g-1 at a very high current density of 100 A g-1 was obtained. Even after more than 10 000 cycles at various large current densities, a capacitance of 2000 F g-1 at 10 A g-1 with 93.8% retention can be achieved. It also exhibits a high-power density (26.1 kW kg-1) at a discharge current density of 80 A g-1. When used as an anode material for lithium-ion batteries (LIBs), it presents a high reversible capacity of 976 mA h g-1 at a rate of 200 mA g-1 with good cycling stability and rate capability. This array material is rarely used as an anode material. Our results show that this unique 3D hierarchical porous nickel cobaltite is promising for electrochemical energy applications.A facile hydrothermal method is developed for large-scale production of three-dimensional (3D) hierarchical porous nickel cobaltate nanowire cluster arrays derived from nanosheet arrays with robust adhesion on Ni foam. Based on the morphology evolution upon reaction time, a possible formation process is proposed. The role of NH4F in formation of the structure has also been investigated based on different NH4F amounts. This unique structure significantly enhances the electroactive surface areas of the NiCo2O4 arrays, leading to better interfacial/chemical distributions

  18. High-performance supercapacitor and lithium-ion battery based on 3D hierarchical NH4F-induced nickel cobaltate nanosheet-nanowire cluster arrays as self-supported electrodes.

    Science.gov (United States)

    Chen, Yuejiao; Qu, Baihua; Hu, Lingling; Xu, Zhi; Li, Qiuhong; Wang, Taihong

    2013-10-21

    A facile hydrothermal method is developed for large-scale production of three-dimensional (3D) hierarchical porous nickel cobaltate nanowire cluster arrays derived from nanosheet arrays with robust adhesion on Ni foam. Based on the morphology evolution upon reaction time, a possible formation process is proposed. The role of NH4F in formation of the structure has also been investigated based on different NH4F amounts. This unique structure significantly enhances the electroactive surface areas of the NiCo2O4 arrays, leading to better interfacial/chemical distributions at the nanoscale, fast ion and electron transfer and good strain accommodation. Thus, when it is used for supercapacitor testing, a specific capacitance of 1069 F g(-1) at a very high current density of 100 A g(-1) was obtained. Even after more than 10,000 cycles at various large current densities, a capacitance of 2000 F g(-1) at 10 A g(-1) with 93.8% retention can be achieved. It also exhibits a high-power density (26.1 kW kg(-1)) at a discharge current density of 80 A g(-1). When used as an anode material for lithium-ion batteries (LIBs), it presents a high reversible capacity of 976 mA h g(-1) at a rate of 200 mA g(-1) with good cycling stability and rate capability. This array material is rarely used as an anode material. Our results show that this unique 3D hierarchical porous nickel cobaltite is promising for electrochemical energy applications.

  19. Interactive visual exploration and refinement of cluster assignments.

    Science.gov (United States)

    Kern, Michael; Lex, Alexander; Gehlenborg, Nils; Johnson, Chris R

    2017-09-12

    With ever-increasing amounts of data produced in biology research, scientists are in need of efficient data analysis methods. Cluster analysis, combined with visualization of the results, is one such method that can be used to make sense of large data volumes. At the same time, cluster analysis is known to be imperfect and depends on the choice of algorithms, parameters, and distance measures. Most clustering algorithms don't properly account for ambiguity in the source data, as records are often assigned to discrete clusters, even if an assignment is unclear. While there are metrics and visualization techniques that allow analysts to compare clusterings or to judge cluster quality, there is no comprehensive method that allows analysts to evaluate, compare, and refine cluster assignments based on the source data, derived scores, and contextual data. In this paper, we introduce a method that explicitly visualizes the quality of cluster assignments, allows comparisons of clustering results and enables analysts to manually curate and refine cluster assignments. Our methods are applicable to matrix data clustered with partitional, hierarchical, and fuzzy clustering algorithms. Furthermore, we enable analysts to explore clustering results in context of other data, for example, to observe whether a clustering of genomic data results in a meaningful differentiation in phenotypes. Our methods are integrated into Caleydo StratomeX, a popular, web-based, disease subtype analysis tool. We show in a usage scenario that our approach can reveal ambiguities in cluster assignments and produce improved clusterings that better differentiate genotypes and phenotypes.

  20. Hierarchical Star Formation Across Galactic Disks

    Science.gov (United States)

    Gouliermis, Dimitrios

    2016-09-01

    Most stars form in clusters. This fact has emerged from the finding that "embedded clusters account for the 70 - 90% fraction of all stars formed in Giant Molecular Clouds (GMCs)." While this is the case at scales of few 10 parsecs, typical for GMCs, a look at star-forming galaxies in the Local Group (LG) shows significant populations of enormous loose complexes of early-type stars extending at scales from few 100 to few 1000 parsecs. The fact that these stellar complexes host extremely large numbers of loosely distributed massive blue stars implies either that stars form also in an unbound fashion or they are immediately dislocated from their original compact birthplaces or both. The Legacy Extra-Galactic UV Survey (LEGUS) has produced remarkable collections of resolved early-type stars in 50 star-forming LG galaxies, suited for testing ideas about recent star formation. I will present results from our ongoing project on star formation across LEGUS disk galaxies. We characterize the global clustering behavior of the massive young stars in order to understand the morphology of star formation over galactic scales. This morphology appears to be self-similar with fractal dimensions comparable to those of the molecular interstellar medium, apparently driven by large-scale turbulence. Our clustering analysis reveals compact stellar systems nested in larger looser concentrations, which themselves are the dense parts of unbound complexes and super-structures, giving evidence of hierarchical star formation up to galactic scales. We investigate the structural and star formation parameters demographics of the star-forming complexes revealed at various levels of compactness. I will discuss the outcome of our correlation and regression analyses on these parameters in an attempt to understand the link between galactic disk dynamics and morphological structure in spiral and ring galaxies of the local universe.

  1. Weighted Clustering

    CERN Document Server

    Ackerman, Margareta; Branzei, Simina; Loker, David

    2011-01-01

    In this paper we investigate clustering in the weighted setting, in which every data point is assigned a real valued weight. We conduct a theoretical analysis on the influence of weighted data on standard clustering algorithms in each of the partitional and hierarchical settings, characterising the precise conditions under which such algorithms react to weights, and classifying clustering methods into three broad categories: weight-responsive, weight-considering, and weight-robust. Our analysis raises several interesting questions and can be directly mapped to the classical unweighted setting.

  2. Community detection algorithm based on hierarchical clustering under signal missing in propagating process%传播过程中信号缺失的层次聚类社区发现算法

    Institute of Scientific and Technical Information of China (English)

    康茜; 李德玉; 王素格; 冀庆斌

    2015-01-01

    社区发现是社会网络分析的一个基本任务,而社区结构探测是社区发现的一个关键问题。将社区结构中的结点看作信号源,针对信号传递过程中存在信号缺失情况,提出了一种层次聚类社区发现算法。该算法通过度中心性来度量节点接收信号的概率,用于量化节点接受信号过程中的缺失值。经过信号传递,使网络的拓扑结构转化为向量间的几何关系,在此基础上,使用层次聚类算法用于发现社区。为了验证SMHC算法的有效性,通过在三个数据集上与SHC算法、CNM算法、GN算法、Similar算法进行比较,实验结果表明,SMHC算法在一定程度上提高了社区发现的正确率。%Community identification is a basic task of social network analysis, meanwhile the community structure detec-tion is a key problem of community identification. Each node in the community structure is regarded as the signal source. A hierarchical clustering community algorithm is proposed in order to settle the problem of signal missing in the process of signal transmission. The algorithm measures the probability of receiving signals of nodes by degree centrality to quantify the signal missing values. After the signal transmission, the topology of the network is transformed into geometric relation-ships among the vectors. On the basis, the hierarchical clustering algorithm is used to find the community structure. In order to validate the proposed method, this paper compares it with SHC algorithm, CNM algorithm, GN algorithm and Similar algorithm. Under three real networks, the Zachary Club, American Football and Netscience, the experimental results indi-cate that SMHC algorithm can effectively improve precision.

  3. Relation between financial market structure and the real economy: comparison between clustering methods.

    Directory of Open Access Journals (Sweden)

    Nicoló Musmeci

    Full Text Available We quantify the amount of information filtered by different hierarchical clustering methods on correlations between stock returns comparing the clustering structure with the underlying industrial activity classification. We apply, for the first time to financial data, a novel hierarchical clustering approach, the Directed Bubble Hierarchical Tree and we compare it with other methods including the Linkage and k-medoids. By taking the industrial sector classification of stocks as a benchmark partition, we evaluate how the different methods retrieve this classification. The results show that the Directed Bubble Hierarchical Tree can outperform other methods, being able to retrieve more information with fewer clusters. Moreover,we show that the economic information is hidden at different levels of the hierarchical structures depending on the clustering method. The dynamical analysis on a rolling window also reveals that the different methods show different degrees of sensitivity to events affecting financial markets, like crises. These results can be of interest for all the applications of clustering methods to portfolio optimization and risk hedging [corrected].

  4. Relation between financial market structure and the real economy: comparison between clustering methods.

    Science.gov (United States)

    Musmeci, Nicoló; Aste, Tomaso; Di Matteo, T

    2015-01-01

    We quantify the amount of information filtered by different hierarchical clustering methods on correlations between stock returns comparing the clustering structure with the underlying industrial activity classification. We apply, for the first time to financial data, a novel hierarchical clustering approach, the Directed Bubble Hierarchical Tree and we compare it with other methods including the Linkage and k-medoids. By taking the industrial sector classification of stocks as a benchmark partition, we evaluate how the different methods retrieve this classification. The results show that the Directed Bubble Hierarchical Tree can outperform other methods, being able to retrieve more information with fewer clusters. Moreover,we show that the economic information is hidden at different levels of the hierarchical structures depending on the clustering method. The dynamical analysis on a rolling window also reveals that the different methods show different degrees of sensitivity to events affecting financial markets, like crises. These results can be of interest for all the applications of clustering methods to portfolio optimization and risk hedging [corrected].

  5. Statistical properties of convex clustering

    OpenAIRE

    Tan, Kean Ming; Witten, Daniela

    2015-01-01

    In this manuscript, we study the statistical properties of convex clustering. We establish that convex clustering is closely related to single linkage hierarchical clustering and $k$-means clustering. In addition, we derive the range of the tuning parameter for convex clustering that yields a non-trivial solution. We also provide an unbiased estimator of the degrees of freedom, and provide a finite sample bound for the prediction error for convex clustering. We compare convex clustering to so...

  6. Block-based logical hierarchical cluster for distributed multimedia architecture on demand server%基于块的逻辑层次集群:一种分布式多媒体点播服务器的体系结构

    Institute of Scientific and Technical Information of China (English)

    熊旭辉; 余胜生; 周敬利

    2006-01-01

    A structure of logical hierarchical cluster for the distributed multimedia on demand server is proposed. The architecture is mainly composed of the network topology and the resource management of all server nodes. Instead of the physical network hierarchy or the independent management hierarchy, the nodes are organized into a logically hierarchical cluster according to the multimedia block they caches in the midderware layer. The process of a member joining/leaving or the structure adjustment cooperatively implemented by all members is concerned with decentralized maintenance of the logical cluster hierarchy. As the root of each logically hierarchical cluster is randomly mapped into the system, the logical structure of a multimedia block is dynamically expanded across some regions by the two replication policies in different load state respectively. The local load diversion is applied to fine-tune the load of nodes within a local region but belongs to different logical hierarchies. Guaranteed by the dynamic expansion of a logical structure and the load diversion of a local region, the users always select a closest idle node from the logical hierarchy under the condition of topology integration with resource management.

  7. The Immersive Virtual Reality Experience: A Typology of Users Revealed Through Multiple Correspondence Analysis Combined with Cluster Analysis Technique.

    Science.gov (United States)

    Rosa, Pedro J; Morais, Diogo; Gamito, Pedro; Oliveira, Jorge; Saraiva, Tomaz

    2016-03-01

    Immersive virtual reality is thought to be advantageous by leading to higher levels of presence. However, and despite users getting actively involved in immersive three-dimensional virtual environments that incorporate sound and motion, there are individual factors, such as age, video game knowledge, and the predisposition to immersion, that may be associated with the quality of virtual reality experience. Moreover, one particular concern for users engaged in immersive virtual reality environments (VREs) is the possibility of side effects, such as cybersickness. The literature suggests that at least 60% of virtual reality users report having felt symptoms of cybersickness, which reduces the quality of the virtual reality experience. The aim of this study was thus to profile the right user to be involved in a VRE through head-mounted display. To examine which user characteristics are associated with the most effective virtual reality experience (lower cybersickness), a multiple correspondence analysis combined with cluster analysis technique was performed. Results revealed three distinct profiles, showing that the PC gamer profile is more associated with higher levels of virtual reality effectiveness, that is, higher predisposition to be immersed and reduced cybersickness symptoms in the VRE than console gamer and nongamer. These findings can be a useful orientation in clinical practice and future research as they help identify which users are more predisposed to benefit from immersive VREs.

  8. Deliberate change without hierarchical influence?

    DEFF Research Database (Denmark)

    Nørskov, Sladjana; Kesting, Peter; Ulhøi, John Parm

    2017-01-01

    Purpose This paper aims to present that deliberate change is strongly associated with formal structures and top-down influence. Hierarchical configurations have been used to structure processes, overcome resistance and get things done. But is deliberate change also possible without formal...... reveals that deliberate change is indeed achievable in a non-hierarchical collaborative OSS community context. However, it presupposes the presence and active involvement of informal change agents. The paper identifies and specifies four key drivers for change agents’ influence. Originality....../value The findings contribute to organisational analysis by providing a deeper understanding of the importance of leadership in making deliberate change possible in non-hierarchical settings. It points to the importance of “change-by-conviction”, essentially based on voluntary behaviour. This can open the door...

  9. Cluster analysis of dynamic contrast enhanced MRI reveals tumor subregions related to locoregional relapse for cervical cancer patients.

    Science.gov (United States)

    Torheim, Turid; Groendahl, Aurora R; Andersen, Erlend K F; Lyng, Heidi; Malinen, Eirik; Kvaal, Knut; Futsaether, Cecilia M

    2016-11-01

    Solid tumors are known to be spatially heterogeneous. Detection of treatment-resistant tumor regions can improve clinical outcome, by enabling implementation of strategies targeting such regions. In this study, K-means clustering was used to group voxels in dynamic contrast enhanced magnetic resonance images (DCE-MRI) of cervical cancers. The aim was to identify clusters reflecting treatment resistance that could be used for targeted radiotherapy with a dose-painting approach. Eighty-one patients with locally advanced cervical cancer underwent DCE-MRI prior to chemoradiotherapy. The resulting image time series were fitted to two pharmacokinetic models, the Tofts model (yielding parameters K(trans) and νe) and the Brix model (ABrix, kep and kel). K-means clustering was used to group similar voxels based on either the pharmacokinetic parameter maps or the relative signal increase (RSI) time series. The associations between voxel clusters and treatment outcome (measured as locoregional control) were evaluated using the volume fraction or the spatial distribution of each cluster. One voxel cluster based on the RSI time series was significantly related to locoregional control (adjusted p-value 0.048). This cluster consisted of low-enhancing voxels. We found that tumors with poor prognosis had this RSI-based cluster gathered into few patches, making this cluster a potential candidate for targeted radiotherapy. None of the voxels clusters based on Tofts or Brix parameter maps were significantly related to treatment outcome. We identified one group of tumor voxels significantly associated with locoregional relapse that could potentially be used for dose painting. This tumor voxel cluster was identified using the raw MRI time series rather than the pharmacokinetic maps.

  10. Deletion of a regulatory gene within the cpk gene cluster reveals novel antibacterial activity in Streptomyces coelicolor A3(2)

    NARCIS (Netherlands)

    Gottelt, Marco; Kol, Stefan; Gomez-Escribano, Juan Pablo; Bibb, Mervyn; Takano, Eriko; Herron, P.R.

    2010-01-01

    Genome sequencing of Streptomyces coelicolor A3(2) revealed an uncharacterized type I polyketide synthase gene cluster (cpk) Here we describe the discovery of a novel antibacterial activity (abCPK) and a yellow-pigmented secondary metabolite (yCPK) after deleting a presumed pathway-specific regulato

  11. Deletion of a regulatory gene within the cpk gene cluster reveals novel antibacterial activity in Streptomyces coelicolor A3(2)

    NARCIS (Netherlands)

    Gottelt, Marco; Kol, Stefan; Gomez-Escribano, Juan Pablo; Bibb, Mervyn; Takano, Eriko

    Genome sequencing of Streptomyces coelicolor A3(2) revealed an uncharacterized type I polyketide synthase gene cluster (cpk) Here we describe the discovery of a novel antibacterial activity (abCPK) and a yellow-pigmented secondary metabolite (yCPK) after deleting a presumed pathway-specific

  12. Revealing a Ring-like Cluster Complex in a Tidal Tail of the Starburst Galaxy NGC 2146

    CERN Document Server

    Adamo, A; Gallagher, J S; Bastian, N; Ryon, J; Westmoquette, M S; Konstantopoulos, I S; Zackrisson, E; Larsen, S S; Silva-Villa, E; Charlton, J C; Weisz, D R

    2012-01-01

    We report the discovery of a ring-like cluster complex in the starburst galaxy NGC 2146. The Ruby Ring, so named due to its appearance, shows a clear ring-like distribution of star clusters around a central object. It is located in one of the tidal streams which surround the galaxy. NGC 2146 is part of the Snapshot Hubble U-band Cluster Survey (SHUCS). The WFC3/F336W data has added critical information to the available archival Hubble Space Telescope imaging set of NGC 2146, allowing us to determine ages, masses, and extinctions of the clusters in the Ruby Ring. These properties have then been used to investigate the formation of this extraordinary system. We find evidence of a spatial and temporal correlation between the central cluster and the clusters in the ring. The latter are about 4 Myr younger than the central cluster, which has an age of 7 Myr. This result is supported by the H alpha emission which is strongly coincident with the ring, and weaker at the position of the central cluster. From the deriv...

  13. Hierarchical organization of brain functional network during visual task

    CERN Document Server

    Zhuo, Zhao; Fu, Zhong-Qian; Zhang, Jie

    2011-01-01

    In this paper, the brain functional networks derived from high-resolution synchronous EEG time series during visual task are generated by calculating the phase synchronization among the time series. The hierarchical modular organizations of these networks are systematically investigated by the fast Girvan-Newman algorithm. At the same time, the spatially adjacent electrodes (corresponding to EEG channels) are clustered into functional groups based on anatomical parcellation of brain cortex, and this clustering information are compared to that of the functional network. The results show that the modular architectures of brain functional network are in coincidence with that from the anatomical structures over different levels of hierarchy, which suggests that population of neurons performing the same function excite and inhibit in identical rhythms. The structure-function relationship further reveals that the correlations among EEG time series in the same functional group are much stronger than those in differe...

  14. Hierarchical Network Design

    DEFF Research Database (Denmark)

    Thomadsen, Tommy

    2005-01-01

    of different types of hierarchical networks. This is supplemented by a review of ring network design problems and a presentation of a model allowing for modeling most hierarchical networks. We use methods based on linear programming to design the hierarchical networks. Thus, a brief introduction to the various....... The thesis investigates models for hierarchical network design and methods used to design such networks. In addition, ring network design is considered, since ring networks commonly appear in the design of hierarchical networks. The thesis introduces hierarchical networks, including a classification scheme...... linear programming based methods is included. The thesis is thus suitable as a foundation for study of design of hierarchical networks. The major contribution of the thesis consists of seven papers which are included in the appendix. The papers address hierarchical network design and/or ring network...

  15. A genome-wide association study on androstenone levels in pigs reveals a cluster of candidate genes on chromosome 6

    Directory of Open Access Journals (Sweden)

    Groenen Martien AM

    2010-05-01

    Full Text Available Abstract Background In many countries, male piglets are castrated shortly after birth because a proportion of un-castrated male pigs produce meat with an unpleasant flavour and odour. Main compounds of boar taint are androstenone and skatole. The aim of this high-density genome-wide association study was to identify single nucleotide polymorphisms (SNPs associated with androstenone levels in a commercial sire line of pigs. The identification of major genetic effects causing boar taint would accelerate the reduction of boar taint through breeding to finally eliminate the need for castration. Results The Illumina Porcine 60K+SNP Beadchip was genotyped on 987 pigs divergent for androstenone concentration from a commercial Duroc-based sire line. The association analysis with 47,897 SNPs revealed that androstenone levels in fat tissue were significantly affected by 37 SNPs on pig chromosomes SSC1 and SSC6. Among them, the 5 most significant SNPs explained together 13.7% of the genetic variance in androstenone. On SSC6, a larger region of 10 Mb was shown to be associated with androstenone covering several candidate genes potentially involved in the synthesis and metabolism of androgens. Besides known candidate genes, such as cytochrome P450 A19 (CYP2A19, sulfotransferases SULT2A1, and SULT2B1, also new members of the cytochrome P450 CYP2 gene subfamilies and of the hydroxysteroid-dehydrogenases (HSD17B14 were found. In addition, the gene encoding the ß-chain of the luteinizing hormone (LHB which induces steroid synthesis in the Leydig cells of the testis at onset of puberty maps to this area on SSC6. Interestingly, the gene encoding the α-chain of LH is also located in one of the highly significant areas on SSC1. Conclusions This study reveals several areas of the genome at high resolution responsible for variation of androstenone levels in intact boars. Major genetic factors on SSC1 and SSC6 showing moderate to large effects on androstenone

  16. Optimizd Design of Power Scheduling in WSN Based on Sink Root Data Tree with Hierarchical Clustering%Sink根数据聚集树分层的WSN电力调度优化设计

    Institute of Scientific and Technical Information of China (English)

    朱文忠

    2014-01-01

    为提高电力数据调度效率,缩短电力数据调度延时,提出一种改进的无通信冲突的分布式电力数据聚集调度近似算法,采用Sink根数据聚集树对无线传感器网络中各个节点电力资源数据进行分层数据调度,根据分布式数据集对各个电力节点之间的控制信息进行不断融合处理,在最大独立集的基础上建立一棵根在Sink的数据聚集树。每个节点分配一个时间片,使该节点能在无通信冲突的情况下传输数据。仿真实验表明,采用改进算法得到的聚集延时明显减小,有效保证了电力调度控制的实时性,电力信息数据分层融合度能达到90%以上,而改进前的算法只有10%~50%之间。%In order to improve the power data scheduling efficiency, shorten the power data scheduling delay, and improve matching and integration degree, and improved power scheduling optimization design method based on Sink root data tree hierarchical clustering was proposed for improve the management efficiency. We established a tree root in the Sink data ag-gregation tree based on the maximum independent set. Each node was assigned a time slice, so that the node could transmit data in the absence of communication conflict situations. Simulation results show that the improved algorithm has signifi-cantly reduced aggregation delay, and it has effectively ensured the real-time dispatching control, and the data hierarchical fusion degree can reach more than 90%, while the former algorithm is only 10%~50%.

  17. Hierarchical Multiagent Reinforcement Learning

    Science.gov (United States)

    2004-01-25

    In this paper, we investigate the use of hierarchical reinforcement learning (HRL) to speed up the acquisition of cooperative multiagent tasks. We...introduce a hierarchical multiagent reinforcement learning (RL) framework and propose a hierarchical multiagent RL algorithm called Cooperative HRL. In

  18. Generation of hierarchically correlated multivariate symbolic sequences

    CERN Document Server

    Tumminello, Mi; Mantegna, R N

    2008-01-01

    We introduce an algorithm to generate multivariate series of symbols from a finite alphabet with a given hierarchical structure of similarities. The target hierarchical structure of similarities is arbitrary, for instance the one obtained by some hierarchical clustering procedure as applied to an empirical matrix of Hamming distances. The algorithm can be interpreted as the finite alphabet equivalent of the recently introduced hierarchically nested factor model (M. Tumminello et al. EPL 78 (3) 30006 (2007)). The algorithm is based on a generating mechanism that is different from the one used in the mutation rate approach. We apply the proposed methodology for investigating the relationship between the bootstrap value associated with a node of a phylogeny and the probability of finding that node in the true phylogeny.

  19. Cluster-cluster clustering

    Science.gov (United States)

    Barnes, J.; Dekel, A.; Efstathiou, G.; Frenk, C. S.

    1985-01-01

    The cluster correlation function xi sub c(r) is compared with the particle correlation function, xi(r) in cosmological N-body simulations with a wide range of initial conditions. The experiments include scale-free initial conditions, pancake models with a coherence length in the initial density field, and hybrid models. Three N-body techniques and two cluster-finding algorithms are used. In scale-free models with white noise initial conditions, xi sub c and xi are essentially identical. In scale-free models with more power on large scales, it is found that the amplitude of xi sub c increases with cluster richness; in this case the clusters give a biased estimate of the particle correlations. In the pancake and hybrid models (with n = 0 or 1), xi sub c is steeper than xi, but the cluster correlation length exceeds that of the points by less than a factor of 2, independent of cluster richness. Thus the high amplitude of xi sub c found in studies of rich clusters of galaxies is inconsistent with white noise and pancake models and may indicate a primordial fluctuation spectrum with substantial power on large scales.

  20. Cluster-cluster clustering

    Energy Technology Data Exchange (ETDEWEB)

    Barnes, J.; Dekel, A.; Efstathiou, G.; Frenk, C.S.

    1985-08-01

    The cluster correlation function xi sub c(r) is compared with the particle correlation function, xi(r) in cosmological N-body simulations with a wide range of initial conditions. The experiments include scale-free initial conditions, pancake models with a coherence length in the initial density field, and hybrid models. Three N-body techniques and two cluster-finding algorithms are used. In scale-free models with white noise initial conditions, xi sub c and xi are essentially identical. In scale-free models with more power on large scales, it is found that the amplitude of xi sub c increases with cluster richness; in this case the clusters give a biased estimate of the particle correlations. In the pancake and hybrid models (with n = 0 or 1), xi sub c is steeper than xi, but the cluster correlation length exceeds that of the points by less than a factor of 2, independent of cluster richness. Thus the high amplitude of xi sub c found in studies of rich clusters of galaxies is inconsistent with white noise and pancake models and may indicate a primordial fluctuation spectrum with substantial power on large scales. 30 references.

  1. 2 x 2 Achievement Goals and Achievement Emotions: A Cluster Analysis of Students' Motivation

    Science.gov (United States)

    Jang, Leong Yeok; Liu, Woon Chia

    2012-01-01

    This study sought to better understand the adoption of multiple achievement goals at an intra-individual level, and its links to emotional well-being, learning, and academic achievement. Participants were 480 Secondary Two students (aged between 13 and 14 years) from two coeducational government schools. Hierarchical cluster analysis revealed the…

  2. 2 x 2 Achievement Goals and Achievement Emotions: A Cluster Analysis of Students' Motivation

    Science.gov (United States)

    Jang, Leong Yeok; Liu, Woon Chia

    2012-01-01

    This study sought to better understand the adoption of multiple achievement goals at an intra-individual level, and its links to emotional well-being, learning, and academic achievement. Participants were 480 Secondary Two students (aged between 13 and 14 years) from two coeducational government schools. Hierarchical cluster analysis revealed the…

  3. Comparative 16S rRNA Analysis of Lake Bacterioplankton Reveals Globally Distributed Phylogenetic Clusters Including an Abundant Group of Actinobacteria

    Science.gov (United States)

    Glöckner, Frank Oliver; Zaichikov, Evgeny; Belkova, Natalia; Denissova, Ludmilla; Pernthaler, Jakob; Pernthaler, Annelie; Amann, Rudolf

    2000-01-01

    In a search for cosmopolitan phylogenetic clusters of freshwater bacteria, we recovered a total of 190 full and partial 16S ribosomal DNA (rDNA) sequences from three different lakes (Lake Gossenköllesee, Austria; Lake Fuchskuhle, Germany; and Lake Baikal, Russia). The phylogenetic comparison with the currently available rDNA data set showed that our sequences fall into 16 clusters, which otherwise include bacterial rDNA sequences of primarily freshwater and soil, but not marine, origin. Six of the clusters were affiliated with the α, four were affiliated with the β, and one was affiliated with the γ subclass of the Proteobacteria; four were affiliated with the Cytophaga-Flavobacterium-Bacteroides group; and one was affiliated with the class Actinobacteria (formerly known as the high-G+C gram-positive bacteria). The latter cluster (hgcI) is monophyletic and so far includes only sequences directly retrieved from aquatic environments. Fluorescence in situ hybridization (FISH) with probes specific for the hgcI cluster showed abundances of up to 1.7 × 105 cells ml−1 in Lake Gossenköllesee, with strong seasonal fluctuations, and high abundances in the two other lakes investigated. Cell size measurements revealed that Actinobacteria in Lake Gossenköllesee can account for up to 63% of the bacterioplankton biomass. A combination of phylogenetic analysis and FISH was used to reveal 16 globally distributed sequence clusters and to confirm the broad distribution, abundance, and high biomass of members of the class Actinobacteria in freshwater ecosystems. PMID:11055963

  4. Molecular epidemiology of Coxiella burnetii in French livestock reveals the existence of three main genotype clusters and suggests species-specific associations as well as regional stability.

    Science.gov (United States)

    Joulié, Aurelien; Sidi-Boumedine, Karim; Bailly, Xavier; Gasqui, Patrick; Barry, Séverine; Jaffrelo, Lydia; Poncet, Charles; Abrial, David; Yang, Elise; Leblond, Agnès; Rousset, Elodie; Jourdain, Elsa

    2017-03-01

    Q fever is a worldwide zoonosis caused by the bacterium Coxiella burnetii. In domestic ruminants, Q fever main clinical manifestations are abortions. Although the clinical signs may differ between ruminant species, C. burnetii's genetic diversity remains understudied in enzootic areas. Here, we focused on France, where Q fever is enzootic, with the aims to (a) identify potential associations between C. burnetii genotypes and ruminant host species; (b) assess the distribution of C. burnetii genotypes both within French farms and across France's major livestock-farming regions; and (c) suggest a subset of markers for future genotypic studies. We used DNA samples collected between 2006 and 2015 from 301 females (160 cows, 76 ewes, 65 goats) aborted of Q fever within 7 different farming regions. C. burnetii diversity was determined using a multiple-locus variable-number of tandem repeat analysis (MLVA) considering 17 markers. Using a phylogenetic approach, we identified 3 main genotypic clusters divided into 12 sub-clusters. These clusters were significantly associated with ruminant species: almost all the cattle genotypes were found in a "cattle-specific" cluster whereas small ruminants genotypes essentially grouped into the two other clusters. The clusters also proved stable over space and time, some genotypes being more specifically observed in certain farming regions. We also observed some within-farm diversity but this diversity was restricted to a same genotypic cluster. Finally, we identified 6 MLVA markers that maximized the representativeness of the diversity described. Overall, we highlighted that molecular epidemiology is a relevant approach to assess C. burnetii's genetic diversity and to reveal the existence of species-specific associations and regional stability. These results will be valuable in the field to trace genotype circulation among ruminants and from ruminants to humans. Ultimately, the potential links between genotypes and virulence traits need

  5. Comparative Genomic Analyses of Multiple Pseudomonas Strains Infecting Corylus avellana Trees Reveal the Occurrence of Two Genetic Clusters with Both Common and Distinctive Virulence and Fitness Traits.

    Directory of Open Access Journals (Sweden)

    Simone Marcelletti

    Full Text Available The European hazelnut (Corylus avellana is threatened in Europe by several pseudomonads which cause symptoms ranging from twig dieback to tree death. A comparison of the draft genomes of nine Pseudomonas strains isolated from symptomatic C. avellana trees was performed to identify common and distinctive genomic traits. The thorough assessment of genetic relationships among the strains revealed two clearly distinct clusters: P. avellanae and P. syringae. The latter including the pathovars avellanae, coryli and syringae. Between these two clusters, no recombination event was found. A genomic island of approximately 20 kb, containing the hrp/hrc type III secretion system gene cluster, was found to be present without any genomic difference in all nine pseudomonads. The type III secretion system effector repertoires were remarkably different in the two groups, with P. avellanae showing a higher number of effectors. Homologue genes of the antimetabolite mangotoxin and ice nucleation activity clusters were found solely in all P. syringae pathovar strains, whereas the siderophore yersiniabactin was only present in P. avellanae. All nine strains have genes coding for pectic enzymes and sucrose metabolism. By contrast, they do not have genes coding for indolacetic acid and anti-insect toxin. Collectively, this study reveals that genomically different Pseudomonas can converge on the same host plant by suppressing the host defence mechanisms with the use of different virulence weapons. The integration into their genomes of a horizontally acquired genomic island could play a fundamental role in their evolution, perhaps giving them the ability to exploit new ecological niches.

  6. Revealing the Chamaeleon: First detection of a low-mass stellar halo around the young open cluster Eta Chamaeleontis

    CERN Document Server

    Murphy, Simon J; Bessell, Michael S

    2010-01-01

    We have identified several lithium-rich low-mass (0.08cluster Eta Chamaeleontis, nearly four times the radius of previous search efforts. We propose 4 new probable cluster members and 3 possible members requiring further investigation. Candidates were selected on the basis of DENIS and 2MASS photometry, NOMAD astrometry and extensive follow-up spectroscopy. Several of these stars show substantial variation in their H-alpha emission line strengths on timescales of days to months, with at least one event attributable to accretion from a circumstellar disk. These findings are consistent with a dynamical origin for the current configuration of the cluster, without the need to invoke an abnormally top-heavy Initial Mass Function, as proposed by some authors.

  7. In-clustering effects in InAlN and InGaN revealed by high pressure studies

    DEFF Research Database (Denmark)

    Gorczyca, I.; Suski, T.; Kaminska, A.;

    2010-01-01

    results are compared with the results of photoluminescence measurements performed at high hydrostatic pressures on InAlN and InGaN quasi-bulk epilayers. We discuss the modification of the uppermost valence band due to formation of In clusters which, together with the related lattice relaxations, may...

  8. Whole-genome sequencing of Campylobacter jejuni isolated from Danish routine human stool samples reveals surprising degree of clustering

    DEFF Research Database (Denmark)

    Joensen, K G; Kuhn, K G; Müller, L

    2017-01-01

    ; one which had not been identified through the existing surveillance system. CONCLUSIONS: Using WGS, we show that Campylobacter case clustering and even outbreaks appear to occur more frequently than previously assumed, providing important new insight into the relatively poorly understood epidemiology...... of the most important cause of bacterial gastroenteritis in the industrialized world....

  9. Data Clustering

    Science.gov (United States)

    Wagstaff, Kiri L.

    2012-03-01

    clustering, in which some partial information about item assignments or other components of the resulting output are already known and must be accommodated by the solution. Some algorithms seek a partition of the data set into distinct clusters, while others build a hierarchy of nested clusters that can capture taxonomic relationships. Some produce a single optimal solution, while others construct a probabilistic model of cluster membership. More formally, clustering algorithms operate on a data set X composed of items represented by one or more features (dimensions). These could include physical location, such as right ascension and declination, as well as other properties such as brightness, color, temporal change, size, texture, and so on. Let D be the number of dimensions used to represent each item, xi ∈ RD. The clustering goal is to produce an organization P of the items in X that optimizes an objective function f : P -> R, which quantifies the quality of solution P. Often f is defined so as to maximize similarity within a cluster and minimize similarity between clusters. To that end, many algorithms make use of a measure d : X x X -> R of the distance between two items. A partitioning algorithm produces a set of clusters P = {c1, . . . , ck} such that the clusters are nonoverlapping (c_i intersected with c_j = empty set, i != j) subsets of the data set (Union_i c_i=X). Hierarchical algorithms produce a series of partitions P = {p1, . . . , pn }. For a complete hierarchy, the number of partitions n’= n, the number of items in the data set; the top partition is a single cluster containing all items, and the bottom partition contains n clusters, each containing a single item. For model-based clustering, each cluster c_j is represented by a model m_j , such as the cluster center or a Gaussian distribution. The wide array of available clustering algorithms may seem bewildering, and covering all of them is beyond the scope of this chapter. Choosing among them for a

  10. Automated tetraploid genotype calling by hierarchical clustering

    Science.gov (United States)

    SNP arrays are transforming breeding and genetics research for autotetraploids. To fully utilize these arrays, however, the relationship between signal intensity and allele dosage must be inferred independently for each marker. We developed an improved computational method to automate this process, ...

  11. The Grism Lens-amplified Survey from Space (Glass). IX. The Dual Origin of Low-mass Cluster Galaxies as Revealed by New Structural Analyses

    Science.gov (United States)

    Morishita, Takahiro; Abramson, Louis E.; Treu, Tommaso; Vulcani, Benedetta; Schmidt, Kasper B.; Dressler, Alan; Poggianti, Bianca M.; Malkan, Matthew A.; Wang, Xin; Huang, Kuang-Han; Trenti, Michele; Bradač, Maruša; Hoag, Austin

    2017-02-01

    Using deep Hubble Frontier Fields imaging and slitless spectroscopy from the Grism Survey from Space, we study 2200 cluster and 1748 field galaxies at 0.2≤slant z≤slant 0.7 to determine the impact of environment on galaxy size and structure at stellar masses {log}{M}* /{M}ȯ > 7.8, an unprecedented limit at these redshifts. Based on simple assumptions—{r}e=f({M}* )—we find no significant differences in half-light radii (r e ) between equal-mass cluster or field systems. More complex analyses—{r}e=f({M}* ,U-V,n,z,{{Σ }})—reveal local density (Σ) to induce only a 7% ± 3% (95% confidence) reduction in r e beyond what can be accounted for by U ‑ V color, Sérsic index (n), and redshift (z) effects. Almost any size difference between galaxies in high- and low-density regions is thus attributable to their different distributions in properties other than environment. Indeed, we find a clear color–r e correlation in low-mass passive cluster galaxies ({log}{M}* /{M}ȯ star-forming galaxies. We take this as evidence that large-r e low-mass passive cluster galaxies are recently acquired systems that have been environmentally quenched without significant structural transformation (e.g., by ram pressure stripping or starvation). Conversely, ∼20% of small-r e low-mass passive cluster galaxies appear to have been in place since z≳ 3. Given the consistency of the small-r e galaxies’ stellar surface densities (and even colors) with those of systems more than ten times as massive, our findings suggest that clusters mark places where galaxy evolution is accelerated for an ancient base population spanning most masses, with late-time additions quenched by environment-specific mechanisms mainly restricted to the lowest masses.

  12. Brain rhythms reveal a hierarchical network organization.

    Directory of Open Access Journals (Sweden)

    G Karl Steinke

    2011-10-01

    Full Text Available Recordings of ongoing neural activity with EEG and MEG exhibit oscillations of specific frequencies over a non-oscillatory background. The oscillations appear in the power spectrum as a collection of frequency bands that are evenly spaced on a logarithmic scale, thereby preventing mutual entrainment and cross-talk. Over the last few years, experimental, computational and theoretical studies have made substantial progress on our understanding of the biophysical mechanisms underlying the generation of network oscillations and their interactions, with emphasis on the role of neuronal synchronization. In this paper we ask a very different question. Rather than investigating how brain rhythms emerge, or whether they are necessary for neural function, we focus on what they tell us about functional brain connectivity. We hypothesized that if we were able to construct abstract networks, or "virtual brains", whose dynamics were similar to EEG/MEG recordings, those networks would share structural features among themselves, and also with real brains. Applying mathematical techniques for inverse problems, we have reverse-engineered network architectures that generate characteristic dynamics of actual brains, including spindles and sharp waves, which appear in the power spectrum as frequency bands superimposed on a non-oscillatory background dominated by low frequencies. We show that all reconstructed networks display similar topological features (e.g. structural motifs and dynamics. We have also reverse-engineered putative diseased brains (epileptic and schizophrenic, in which the oscillatory activity is altered in different ways, as reported in clinical studies. These reconstructed networks show consistent alterations of functional connectivity and dynamics. In particular, we show that the complexity of the network, quantified as proposed by Tononi, Sporns and Edelman, is a good indicator of brain fitness, since virtual brains modeling diseased states display lower complexity than virtual brains modeling normal neural function. We finally discuss the implications of our results for the neurobiology of health and disease.

  13. A general strategy to determine the congruence between a hierarchical and a non-hierarchical classification

    Directory of Open Access Journals (Sweden)

    Marín Ignacio

    2007-11-01

    Full Text Available Abstract Background Classification procedures are widely used in phylogenetic inference, the analysis of expression profiles, the study of biological networks, etc. Many algorithms have been proposed to establish the similarity between two different classifications of the same elements. However, methods to determine significant coincidences between hierarchical and non-hierarchical partitions are still poorly developed, in spite of the fact that the search for such coincidences is implicit in many analyses of massive data. Results We describe a novel strategy to compare a hierarchical and a dichotomic non-hierarchical classification of elements, in order to find clusters in a hierarchical tree in which elements of a given "flat" partition are overrepresented. The key improvement of our strategy respect to previous methods is using permutation analyses of ranked clusters to determine whether regions of the dendrograms present a significant enrichment. We show that this method is more sensitive than previously developed strategies and how it can be applied to several real cases, including microarray and interactome data. Particularly, we use it to compare a hierarchical representation of the yeast mitochondrial interactome and a catalogue of known mitochondrial protein complexes, demonstrating a high level of congruence between those two classifications. We also discuss extensions of this method to other cases which are conceptually related. Conclusion Our method is highly sensitive and outperforms previously described strategies. A PERL script that implements it is available at http://www.uv.es/~genomica/treetracker.

  14. Deep Chandra Observations of NGC 1404: Cluster Plasma Physics Revealed by an Infalling Early-type Galaxy

    Science.gov (United States)

    Su, Yuanyuan; Kraft, Ralph P.; Roediger, Elke; Nulsen, Paul; Forman, William R.; Churazov, Eugene; Randall, Scott W.; Jones, Christine; Machacek, Marie E.

    2017-01-01

    The intracluster medium (ICM), as a magnetized and highly ionized fluid, provides an ideal laboratory to study plasma physics under extreme conditions that cannot be achieved on Earth. NGC 1404 is a bright elliptical galaxy that is being gas stripped as it falls through the ICM of the Fornax Cluster. We use the new Chandra X-ray observations of NGC 1404 to study ICM microphysics. The interstellar medium of NGC 1404 is characterized by a sharp leading edge, 8 kpc from the Galaxy center, and a short downstream gaseous tail. Contact discontinuities are resolved on unprecedented spatial scales (0.″5 = 45 pc) due to the combination of the proximity of NGC 1404, the superb spatial resolution of Chandra, and the very deep (670 ks) exposure. At the leading edge, we observe sub-kiloparsec-scale eddies generated by Kelvin–Helmholtz instability (KHI) and put an upper limit of 5% Spitzer on the isotropic viscosity of the hot cluster plasma. We also observe mixing between the hot cluster gas and the cooler galaxy gas in the downstream stripped tail, which provides further evidence of a low viscosity plasma. The assumed ordered magnetic fields in the ICM ought to be smaller than 5 μG to allow KHI to develop. The lack of an evident magnetic draping layer just outside the contact edge is consistent with such an upper limit.

  15. 基于主成分与聚类分析的苹果加工品质评价%Evaluation of apple quality based on principal component and hierarchical cluster analysis

    Institute of Scientific and Technical Information of China (English)

    公丽艳; 孟宪军; 刘乃侨; 毕金峰

    2014-01-01

    The purpose of this study was to investigate the variations in physical and chemical characteristics of apple fruit from 30 varieties grown in the same place using pattern recognition tools. Twenty quality parameters of apple samples (e.g. weight,volume, density, color, hardness, sugar-acid ratio, Vitamin C, etc.) were analyzed. Interrelationships between the parameters and the apple variety were investigated by descriptive statistics, principal component analysis (PCA) and hierarchical cluster analysis (HCA). PCA is a mathematical tool which performs a reduction in data dimensionality and allows the visualisation of underlying structure in experimental data and relationships between data and samples.In hierarchical cluster analysis, samples are grouped on the basis of similarities, without taking into account the information about the class membership. The results obtained following HCA are shown as a dendrogram in which five well-defined clusters are visible. Samples will be grouped in clusters in terms of their nearness or similarity. Cluster analysis uses less information (distances only) than PCA. It is interesting to observe what kind of classification can be made on the basis of distances only. The results showed that density, fruit shape index and water content of 30 apple varieties were not significantly different. The remaining seventeen measurements were investigated by principal component analysis. The first six components represented 83.56% of the total variability on the base of the total variance explained and screen plot of principal component analysis. The first principal component was related to titratable acidity, sugar-acid ratio and solid-acid ratio attributes, which were the taste quality factor. The second principal component was related to L,a, andb attributes, which were the color factor. Following that were sweetness factor, texture factor, quality factor and size factor. The sample score plots visually displayed the relationship between

  16. Comparative Study of K-means and Robust Clustering

    Directory of Open Access Journals (Sweden)

    Shashi Sharma

    2013-09-01

    Full Text Available Data mining is the mechanism of implementing patterns in large amount of data sets involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems. Clustering is the very big area in which grouping of same type of objects in data mining. Clustering has divided into different categories – partitioned clustering and hierarchical clustering. In this paper we study two types of clustering first is Kmeans which is part of partitioned clustering. Kmeans clustering generates a specific number of disjoint, flat (non-hierarchical clusters. Second clustering is robust clustering which is part of hierarchical clustering. This clustering uses Jaccard coefficient instead of using the distance measures to find the similarity between the data or documents to classify the clusters. We show comparison between Kmeans clustering and robust clustering which is better for categorical data.

  17. Non-Trivial Feature Derivation for Intensifying Feature Detection Using LIDAR Datasets Through Allometric Aggregation Data Analysis Applying Diffused Hierarchical Clustering for Discriminating Agricultural Land Cover in Portions of Northern Mindanao, Philippines

    Science.gov (United States)

    Villar, Ricardo G.; Pelayo, Jigg L.; Mozo, Ray Mari N.; Salig, James B., Jr.; Bantugan, Jojemar

    2016-06-01

    Leaning on the derived results conducted by Central Mindanao University Phil-LiDAR 2.B.11 Image Processing Component, the paper attempts to provides the application of the Light Detection and Ranging (LiDAR) derived products in arriving quality Landcover classification considering the theoretical approach of data analysis principles to minimize the common problems in image classification. These are misclassification of objects and the non-distinguishable interpretation of pixelated features that results to confusion of class objects due to their closely-related spectral resemblance, unbalance saturation of RGB information is a challenged at the same time. Only low density LiDAR point cloud data is exploited in the research denotes as 2 pts/m2 of accuracy which bring forth essential derived information such as textures and matrices (number of returns, intensity textures, nDSM, etc.) in the intention of pursuing the conditions for selection characteristic. A novel approach that takes gain of the idea of object-based image analysis and the principle of allometric relation of two or more observables which are aggregated for each acquisition of datasets for establishing a proportionality function for data-partioning. In separating two or more data sets in distinct regions in a feature space of distributions, non-trivial computations for fitting distribution were employed to formulate the ideal hyperplane. Achieving the distribution computations, allometric relations were evaluated and match with the necessary rotation, scaling and transformation techniques to find applicable border conditions. Thus, a customized hybrid feature was developed and embedded in every object class feature to be used as classifier with employed hierarchical clustering strategy for cross-examining and filtering features. This features are boost using machine learning algorithms as trainable sets of information for a more competent feature detection. The product classification in this

  18. Multi-granularity reconstruction of 3D calamity emergency situations based on visual scale space hierarchical clustering%基于VSSHC算法的灾害应急多粒度三维态势重构

    Institute of Scientific and Technical Information of China (English)

    于海心; 陈杰; 张娟

    2012-01-01

    针对现有灾害应急态势系统的三维地貌实时更新和态势多粒度显示的技术瓶颈,研究并设计了应急三维态势重构系统(3D-ESRS),该系统可进行实时三维地貌更新和多粒度显示态势内容.分析了3D-ESRS的需求和功能,设计了3D-ESRS的基于多智能体(MAS)的系统框架结构,研究了3D-ESRS系统更新地貌和多粒度显示原理与工作流程,构建了基于视觉尺度空间分层聚类(VSSHC)算法的多尺度分类模型.以堰塞湖为例,多粒度显示了水面升高过程,该实验结果表明3D-ESRS与传统基于GIS平台的态势系统相比,可以实时进行三维地貌场景更新,并对场景进行多粒度显示.%A calamity-oriented 3D emergency situation reconstruction system (3D-ESRS) was studied, and its architecture was designed using the multi-agent technique.Moreover, an approach to multi-granularity reconstruction of 3D calamity emergency situations based on the visual scale space hierarchical clustering ( VSSHC) algorithm was proposed for calamity emergency-decision supporting systems to make them realize the real-time presentation of dynamic 3D calamity situations.A simulation platform based on high level architecture (HLA) was established to verify this approach.The simulation results illustrate that this approach is applicable to emergency-decision supporting systems, and compared to the traditional situation display system this 3D-ESRS has the superiority in reconstructing real-time 3D scenario models.

  19. 一种基于分层 AP 的视频关键帧提取方法研究%Research on video key-frame extraction based on hierarchical affinity propagation clustering

    Institute of Scientific and Technical Information of China (English)

    党宏社; 白梅

    2016-01-01

    为从大量的视频资源中高效准确地提取关键帧图像来表达视频的主要内容,针对传统AP聚类方法提取关键帧无法适应大规模图像集的问题,提出一种分层AP的关键帧提取方法。提取所有视频序列的颜色和纹理特征,将待聚类的图像集进行分层,用传统AP聚类方法求取每个图像子集的聚类中心;用得到的聚类中心进行自适应的AP聚类,根据Silhouette指标选取最优的聚类结果,即可得到视频序列的关键帧代表。实验表明,该方法能快速准确地提取视频最优关键帧,在保证保真度指标的同时能提高关键帧提取的压缩比,且适用于不同类型的视频资源。%In order to extract key frames from large‐scale different videos more effectively and accurately ,since traditional AP algorithm is inappropriate to the large‐scale pictures cluste‐ring ,a hierarchical AP method for key frame extraction is proposed .First get the color and texture features of all video sequences ,the pictures set is divided into several subsets ,the tra‐ditional AP is used to obtain the exemplars of each subset ;Then the adaptive AP is imple‐mented on the obtained exemplars ,the key frames of video sequences are extracted according to the index of Silhouette for the best clustering result .The experimental result shows that proposed method is efficient in key‐frame extraction and suitable for all types video re‐sources ,has a high fidelity w hile the compression ratio is improved greatly .

  20. The Nature and Nurture of Star Clusters

    CERN Document Server

    Elmegreen, Bruce G

    2009-01-01

    Star clusters have hierarchical patterns in space and time, suggesting formation processes in the densest regions of a turbulent interstellar medium. Clusters also have hierarchical substructure when they are young, which makes them all look like the inner mixed parts of a pervasive stellar hierarchy. Young field stars share this distribution, presumably because some of them came from dissolved clusters and others formed in a dispersed fashion in the same gas. The fraction of star formation that ends up in clusters is apparently not constant, but may increase with interstellar pressure. Hierarchical structure explains why stars form in clusters and why many of these clusters are self-bound. It also explains the cluster mass function. Halo globular clusters share many properties of disk clusters, including what appears to be an upper cluster cutoff mass. However, halo globulars are self-enriched and often connected with dwarf galaxy streams. The mass function of halo globulars could have initially been like th...

  1. Whole genome sequence of two Rathayibacter toxicus strains reveals a tunicamycin biosynthetic cluster similar to Streptomyces chartreusis

    Science.gov (United States)

    Sechler, Aaron J.; Tancos, Matthew A.; Schneider, David J.; King, Jonas G.; Fennessey, Christine M.; Schroeder, Brenda K.; Murray, Timothy D.; Luster, Douglas G.; Schneider, William L.

    2017-01-01

    Rathayibacter toxicus is a forage grass associated Gram-positive bacterium of major concern to food safety and agriculture. This species is listed by USDA-APHIS as a plant pathogen select agent because it produces a tunicamycin-like toxin that is lethal to livestock and may be vectored by nematode species native to the U.S. The complete genomes of two strains of R. toxicus, including the type strain FH-79, were sequenced and analyzed in comparison with all available, complete R. toxicus genomes. Genome sizes ranged from 2,343,780 to 2,394,755 nucleotides, with 2079 to 2137 predicted open reading frames; all four strains showed remarkable synteny over nearly the entire genome, with only a small transposed region. A cluster of genes with similarity to the tunicamycin biosynthetic cluster from Streptomyces chartreusis was identified. The tunicamycin gene cluster (TGC) in R. toxicus contained 14 genes in two transcriptional units, with all of the functional elements for tunicamycin biosynthesis present. The TGC had a significantly lower GC content (52%) than the rest of the genome (61.5%), suggesting that the TGC may have originated from a horizontal transfer event. Further analysis indicated numerous remnants of other potential horizontal transfer events are present in the genome. In addition to the TGC, genes potentially associated with carotenoid and exopolysaccharide production, bacteriocins and secondary metabolites were identified. A CRISPR array is evident. There were relatively few plant-associated cell-wall hydrolyzing enzymes, but there were numerous secreted serine proteases that share sequence homology to the pathogenicity-associated protein Pat-1 of Clavibacter michiganensis. Overall, the genome provides clear insight into the possible mechanisms for toxin production in R. toxicus, providing a basis for future genetic approaches. PMID:28796837

  2. What Do Deep Statistical Analysis on Gaming Motivation and Game Characteristics Clusters Reveal about Targeting Demographics when Designing Gamified Contents?

    Directory of Open Access Journals (Sweden)

    Alireza Tavakkoli

    2015-06-01

    Full Text Available This paper presents the comprehensive results of the study of a cohort of college graduate and undergraduate students who participated in playing a Massively Multiplayer Online Role Playing Game (MMORPG as a gameplay rich with social interaction as well as intellectual and aesthetic features. We present the full results of the study in the form of inferential statistics and a review of our descriptive statistics previously reported in [46]. Separate one-way independent-measures multivariate analysis of variance (MANOVA's were used to analyze the data from several instruments to determine if there were statistically significant differences first by gender, then by age group, and then by degree. Moreover, a one-way repeated-measures analysis of variance (ANOVA was used to determine if there was a statistically significant difference between the clusters in the 5 gaming clusters on the Game Characteristic Survey. Follow-up paired samples t-tests were used to see if there was a statistically significant difference between each of the 10 possible combinations of paired clusters. Our results support the hypotheses and outline the features that may need to be taken into account in support of tailoring gamified educational content targeting a certain demographic. Sections 1, 2, and 3 below from our pervious study [46] are included because this is the second part of the two-part study. [46] Tavakkoli, A., Loffredo, D., Ward, M., Sr. (2014. "Insights from Massively Multiplayer Online Role Playing Games to Enhance Gamification in Education", Journal of Systemics, Cybernetics, and Informatics, 12(4, 66-78.

  3. Characterization of streptokinases from group A Streptococci reveals a strong functional relationship that supports the coinheritance of plasminogen-binding M protein and cluster 2b streptokinase.

    Science.gov (United States)

    Zhang, Yueling; Liang, Zhong; Hsueh, Hsing-Tse; Ploplis, Victoria A; Castellino, Francis J

    2012-12-07

    Group A streptococcus (GAS) strains secrete the protein streptokinase (SK), which functions by activating host human plasminogen (hPg) to plasmin (hPm), thus providing a proteolytic framework for invasive GAS strains. The types of SK secreted by GAS have been grouped into two clusters (SK1 and SK2) and one subcluster (SK2a and SK2b). SKs from cluster 1 (SK1) and cluster 2b (SK2b) display significant evolutionary and functional differences, and attempts to relate these properties to GAS skin or pharynx tropism and invasiveness are of great interest. In this study, using four purified SKs from each cluster, new relationships between plasminogen-binding group A streptococcal M (PAM) protein and SK2b have been revealed. All SK1 proteins efficiently activated hPg, whereas all subclass SK2b proteins only weakly activated hPg in the absence of PAM. Surface plasmon resonance studies revealed that the lower affinity of SK2b to hPg served as the basis for the attenuated activation of hPg by SK2b. Binding of hPg to either human fibrinogen (hFg) or PAM greatly enhanced activation of hPg by SK2b but minimally influenced the already effective activation of hPg by SK1. Activation of hPg in the presence of GAS cells containing PAM demonstrated that PAM is the only factor on the surface of SK2b-expressing cells that enabled the direct activation of hPg by SK2b. As the binding of hPg to PAM is necessary for hPg activation by SK2b, this dependence explains the coinherant relationship between PAM and SK2b and the ability of these particular strains to generate the proteolytic activity that disrupts the innate barriers that limit invasiveness.

  4. A hierarchical linear model for tree height prediction.

    Science.gov (United States)

    Vicente J. Monleon

    2003-01-01

    Measuring tree height is a time-consuming process. Often, tree diameter is measured and height is estimated from a published regression model. Trees used to develop these models are clustered into stands, but this structure is ignored and independence is assumed. In this study, hierarchical linear models that account explicitly for the clustered structure of the data...

  5. New resampling method for evaluating stability of clusters

    Directory of Open Access Journals (Sweden)

    Neuhaeuser Markus

    2008-01-01

    Full Text Available Abstract Background Hierarchical clustering is a widely applied tool in the analysis of microarray gene expression data. The assessment of cluster stability is a major challenge in clustering procedures. Statistical methods are required to distinguish between real and random clusters. Several methods for assessing cluster stability have been published, including resampling methods such as the bootstrap. We propose a new resampling method based on continuous weights to assess the stability of clusters in hierarchical clustering. While in bootstrapping approximately one third of the original items is lost, continuous weights avoid zero elements and instead allow non integer diagonal elements, which leads to retention of the full dimensionality of space, i.e. each variable of the original data set is represented in the resampling sample. Results Comparison of continuous weights and bootstrapping using real datasets and simulation studies reveals the advantage of continuous weights especially when the dataset has only few observations, few differentially expressed genes and the fold change of differentially expressed genes is low. Conclusion We recommend the use of continuous weights in small as well as in large datasets, because according to our results they produce at least the same results as conventional bootstrapping and in some cases they surpass it.

  6. 3D Pharmacophore, hierarchical methods, and 5-HT4 receptor binding data.

    Science.gov (United States)

    Varin, Thibault; Saettel, Nicolas; Villain, Jonathan; Lesnard, Aurelien; Dauphin, François; Bureau, Ronan; Rault, Sylvain

    2008-10-01

    5-Hydroxytryptamine subtype-4 (5-HT(4)) receptors have stimulated considerable interest amongst scientists and clinicians owing to their importance in neurophysiology and potential as therapeutic targets. A comparative analysis of hierarchical methods applied to data from one thousand 5-HT(4) receptor-ligand binding interactions was carried out. The chemical structures were described as chemical and pharmacophore fingerprints. The definitions of indices, related to the quality of the hierarchies in being able to distinguish between active and inactive compounds, revealed two interesting hierarchies with the Unity (1 active cluster) and pharmacophore fingerprints (4 active clusters). The results of this study also showed the importance of correct choice of metrics as well as the effectiveness of a new alternative of the Ward clustering algorithm named Energy (Minimum E-Distance method). In parallel, the relationship between these classifications and a previously defined 3D 5-HT(4) antagonist pharmacophore was established.

  7. Extremely slow rate of evolution in the HOX cluster revealed by comparison between Tanzanian and Indonesian coelacanths.

    Science.gov (United States)

    Higasa, Koichiro; Nikaido, Masato; Saito, Taro L; Yoshimura, Jun; Suzuki, Yutaka; Suzuki, Hikoyu; Nishihara, Hidenori; Aibara, Mitsuto; Ngatunga, Benjamin P; Kalombo, Hassan W J; Sugano, Sumio; Morishita, Shinichi; Okada, Norihiro

    2012-09-01

    Coelacanths are known as "living fossils" because their morphology has changed very little from that in the fossil record. To elucidate why coelacanths have evolved so slowly is thus of primary importance in evolutionary biology. In the present study, we determined the entire sequence of the HOX cluster of the Tanzanian coelacanth (Latimeria chalumnae) and compared it with that of the Indonesian coelacanth (L. menadoensis), which was available in the literature. The most intriguing result was the extremely small genetic divergence between the two coelacanths. The synonymous divergence of the HOX coding region between the two coelacanths was estimated to be 0.07%, which is ~11-fold smaller than that of human-chimp. When we applied the estimated divergence time of the two coelacanths of 6 million years ago (MYA) and 30 MYA, which were proposed in independent mitochondrial DNA analyses, the synonymous substitution rate of the coelacanth HOX cluster was estimated to be ~11-fold and 56-fold smaller than that of human-chimp, respectively. Thus, the present study implies that the reduction of the nucleotide substitution rate in coelacanth HOX genes may account for the conservation of coelacanth morphology during evolution.

  8. Clustering of Subgingival Microbiota Reveals Microbial Disease Ecotypes Associated with Clinical Stages of Periodontitis in a Cross-Sectional Study

    Science.gov (United States)

    Boutin, Sébastien; Hagenfeld, Daniel; Zimmermann, Heiko; El Sayed, Nihad; Höpker, Tanja; Greiser, Halina K.; Becher, Heiko; Kim, Ti-Sun; Dalpke, Alexander H.

    2017-01-01

    Periodontitis is characterized by chronic inflammation associated with alteration of the oral microbiota. In contrast to previous microbiome studies focusing a priori on comparison between extreme phenotypes, our study analyzed a random sample of 85 people. The aim of this study was to link microbial differences to disease’s prevalence and severity. Using next generation sequencing of 16S rRNA amplicons and cluster analysis, we observed that the population can be divided into two major ecotypes: One mainly contained periodontal healthy/mild periodontitis individuals whereas the second ecotype showed a heterogeneous microbial distribution and clustered into three distinct sub-ecotypes. Those sub-ecotypes differed with respect to the frequency of diseased patients and displayed a gradual change in distinct subgingival microbiota that goes along with clinical disease symptoms. In ecotype 2, the subgroup with no clinical signs of disease was linked to an increase of F. nucleatum vincentii but also several other species, while only in “end-stage” dysbiosis classical red complex bacteria gained overweight. Therefore, the microbial disease ecotypes observed in our population can lead to an establishment of an early microbial risk profile for clinically healthy patients. PMID:28298910

  9. New redox states observed in [FeFe] hydrogenases reveal redox coupling within the H-cluster.

    Science.gov (United States)

    Adamska-Venkatesh, Agnieszka; Krawietz, Danuta; Siebel, Judith; Weber, Katharina; Happe, Thomas; Reijerse, Edward; Lubitz, Wolfgang

    2014-08-13

    Active [FeFe] hydrogenases can be obtained by expressing the unmaturated enzyme in Escherichia coli followed by incubation with a synthetic precursor of the binuclear [2Fe] subcluster, namely: [NEt4]2[Fe2(adt)(CO)4(CN)2] (adt = [S-CH2-NH-CH2-S](2-)). The binuclear subsite Fe2(adt)(CO)3(CN)2 is attached through a bridging cysteine side chain to a [4Fe-4S] subcluster already present in the unmaturated enzyme thus yielding the intact native "H-cluster". We present FTIR electrochemical studies of the [FeFe] hydrogenase from Chlamydomonas reinhardtii, CrHydA1, maturated with the precursor of the native cofactor [Fe2(adt)(CO)4(CN)2](2-) as well as a non-natural variant [Fe2(pdt)(CO)4(CN)2](2-) in which the bridging amine functionality is replaced by CH2. The obtained active enzyme CrHydA1(adt) shows the same redox states in the respective potential range as observed for the native system (E(ox/red) = -400 mV, E(red/sred) = -470 mV). For the Hox → Hred transition the reducing equivalent is stored on the binuclear part, ([4Fe-4S](2+)Fe(II)Fe(I) → [4Fe-4S](2+)Fe(I)Fe(I)), while the Hred → Hsred transition is characterized by a reduction of the [4Fe-4S] part of the H-cluster ([4Fe-4S](2+)Fe(I)Fe(I) → [4Fe-4S](+)Fe(I)Fe(I)). A similar transition is reported here for the CO inhibited state of the H-cluster: ([4Fe-4S](2+)Fe(I)Fe(II)CO → [4Fe-4S](+)Fe(I)Fe(II)CO). An FTIR electrochemical study of the inactive variant with the pdt ligand, CrHydA1(pdt), identified two redox states H(pdt)-ox and H(pdt)-"red". Both EPR and FTIR spectra of H(pdt)-ox are virtually identical to those of the H(adt)-ox and the native Hox state. The H(pdt)-"red" state is also characterized by a reduced [4Fe-4S] subcluster. In contrast to CrHydA1(adt), the H(pdt)-ox state of CrHydA1(pdt) is stable up to rather high potentials (+200 mV). This study demonstrates the distinct redox coupling between the two parts of the H-cluster and confirms that the [4Fe-4S]H subsite is also redox active and as

  10. Investigation on IMCP based clustering in LTE-M communication for smart metering applications

    National Research Council Canada - National Science Library

    Kartik Vishal Deshpande; A. Rajesh

    2017-01-01

    .... This paper investigates the proposed Improved M2M Clustering Process (IMCP) based clustering technique and it is compared with two well-known clustering algorithms, namely, Low Energy Adaptive Clustering Hierarchical (LEACH...

  11. caBIG™ VISDA: Modeling, visualization, and discovery for cluster analysis of genomic data

    Directory of Open Access Journals (Sweden)

    Xuan Jianhua

    2008-09-01

    Full Text Available Abstract Background The main limitations of most existing clustering methods used in genomic data analysis include heuristic or random algorithm initialization, the potential of finding poor local optima, the lack of cluster number detection, an inability to incorporate prior/expert knowledge, black-box and non-adaptive designs, in addition to the curse of dimensionality and the discernment of uninformative, uninteresting cluster structure associated with confounding variables. Results In an effort to partially address these limitations, we develop the VIsual Statistical Data Analyzer (VISDA for cluster modeling, visualization, and discovery in genomic data. VISDA performs progressive, coarse-to-fine (divisive hierarchical clustering and visualization, supported by hierarchical mixture modeling, supervised/unsupervised informative gene selection, supervised/unsupervised data visualization, and user/prior knowledge guidance, to discover hidden clusters within complex, high-dimensional genomic data. The hierarchical visualization and clustering scheme of VISDA uses multiple local visualization subspaces (one at each node of the hierarchy and consequent subspace data modeling to reveal both global and local cluster structures in a "divide and conquer" scenario. Multiple projection methods, each sensitive to a distinct type of clustering tendency, are used for data visualization, which increases the likelihood that cluster structures of interest are revealed. Initialization of the full dimensional model is based on first learning models with user/prior knowledge guidance on data projected into the low-dimensional visualization spaces. Model order selection for the high dimensional data is accomplished by Bayesian theoretic criteria and user justification applied via the hierarchy of low-dimensional visualization subspaces. Based on its complementary building blocks and flexible functionality, VISDA is generally applicable for gene clustering, sample

  12. caBIG VISDA: modeling, visualization, and discovery for cluster analysis of genomic data.

    Science.gov (United States)

    Zhu, Yitan; Li, Huai; Miller, David J; Wang, Zuyi; Xuan, Jianhua; Clarke, Robert; Hoffman, Eric P; Wang, Yue

    2008-09-18

    The main limitations of most existing clustering methods used in genomic data analysis include heuristic or random algorithm initialization, the potential of finding poor local optima, the lack of cluster number detection, an inability to incorporate prior/expert knowledge, black-box and non-adaptive designs, in addition to the curse of dimensionality and the discernment of uninformative, uninteresting cluster structure associated with confounding variables. In an effort to partially address these limitations, we develop the VIsual Statistical Data Analyzer (VISDA) for cluster modeling, visualization, and discovery in genomic data. VISDA performs progressive, coarse-to-fine (divisive) hierarchical clustering and visualization, supported by hierarchical mixture modeling, supervised/unsupervised informative gene selection, supervised/unsupervised data visualization, and user/prior knowledge guidance, to discover hidden clusters within complex, high-dimensional genomic data. The hierarchical visualization and clustering scheme of VISDA uses multiple local visualization subspaces (one at each node of the hierarchy) and consequent subspace data modeling to reveal both global and local cluster structures in a "divide and conquer" scenario. Multiple projection methods, each sensitive to a distinct type of clustering tendency, are used for data visualization, which increases the likelihood that cluster structures of interest are revealed. Initialization of the full dimensional model is based on first learning models with user/prior knowledge guidance on data projected into the low-dimensional visualization spaces. Model order selection for the high dimensional data is accomplished by Bayesian theoretic criteria and user justification applied via the hierarchy of low-dimensional visualization subspaces. Based on its complementary building blocks and flexible functionality, VISDA is generally applicable for gene clustering, sample clustering, and phenotype clustering

  13. Hierarchical self-organization of cytoskeletal active networks

    CERN Document Server

    Gordon, Daniel; Keasar, Chen; Farago, Oded

    2012-01-01

    The structural reorganization of the actin cytoskeleton is facilitated through the action of motor proteins that crosslink the actin filaments and transport them relative to each other. Here, we present a combined experimental-computational study that probes the dynamic evolution of mixtures of actin filaments and clusters of myosin motors. While on small spatial and temporal scales the system behaves in a very noisy manner, on larger scales it evolves into several well distinct patterns such as bundles, asters, and networks. These patterns are characterized by junctions with high connectivity, whose formation is possible due to the organization of the motors in "oligoclusters" (intermediate-size aggregates). The simulations reveal that the self-organization process proceeds through a series of hierarchical steps, starting from local microscopic moves and ranging up to the macroscopic large scales where the steady-state structures are formed. Our results shed light into the mechanisms involved in processes li...

  14. Live-cell FRET imaging reveals clustering of the prion protein at the cell surface induced by infectious prions.

    Science.gov (United States)

    Tavares, Evandro; Macedo, Joana A; Paulo, Pedro M R; Tavares, Catarina; Lopes, Carlos; Melo, Eduardo P

    2014-07-01

    Prion diseases are associated to the conversion of the prion protein into a misfolded pathological isoform. The mechanism of propagation of protein misfolding by protein templating remains largely unknown. Neuroblastoma cells were transfected with constructs of the prion protein fused to both CFP-GPI-anchored and to YFP-GPI-anchored and directed to its cell membrane location. Live-cell FRET imaging between the prion protein fused to CFP or YFP was measured giving consistent values of 10±2%. This result was confirmed by fluorescence lifetime imaging microscopy and indicates intermolecular interactions between neighbor prion proteins. In particular, considering that a maximum FRET efficiency of 17±2% was determined from a positive control consisting of a fusion CFP-YFP-GPI-anchored. A stable cell clone expressing the two fusions containing the prion protein was also selected to minimize cell-to-cell variability. In both, stable and transiently transfected cells, the FRET efficiency consistently increased in the presence of infectious prions - from 4±1% to 7±1% in the stable clone and from 10±2% to 16±1% in transiently transfected cells. These results clearly reflect an increased clustering of the prion protein on the membrane in the presence of infectious prions, which was not observed in negative control using constructs without the prion protein and upon addition of non-infected brain. Our data corroborates the recent view that the primary site for prion conversion is the cell membrane. Since our fluorescent cell clone is not susceptible to propagate infectivity, we hypothesize that the initial event of prion infectivity might be the clustering of the GPI-anchored prion protein.

  15. Cluster analysis of spontaneous preterm birth phenotypes identifies potential associations among preterm birth mechanisms

    Science.gov (United States)

    Esplin, M Sean; Manuck, Tracy A.; Varner, Michael W.; Christensen, Bryce; Biggio, Joseph; Bukowski, Radek; Parry, Samuel; Zhang, Heping; Huang, Hao; Andrews, William; Saade, George; Sadovsky, Yoel; Reddy, Uma M.; Ilekis, John

    2015-01-01

    Objective We sought to employ an innovative tool based on common biological pathways to identify specific phenotypes among women with spontaneous preterm birth (SPTB), in order to enhance investigators' ability to identify to highlight common mechanisms and underlying genetic factors responsible for SPTB. Study Design A secondary analysis of a prospective case-control multicenter study of SPTB. All cases delivered a preterm singleton at SPTB ≤34.0 weeks gestation. Each woman was assessed for the presence of underlying SPTB etiologies. A hierarchical cluster analysis was used to identify groups of women with homogeneous phenotypic profiles. One of the phenotypic clusters was selected for candidate gene association analysis using VEGAS software. Results 1028 women with SPTB were assigned phenotypes. Hierarchical clustering of the phenotypes revealed five major clusters. Cluster 1 (N=445) was characterized by maternal stress, cluster 2 (N=294) by premature membrane rupture, cluster 3 (N=120) by familial factors, and cluster 4 (N=63) by maternal comorbidities. Cluster 5 (N=106) was multifactorial, characterized by infection (INF), decidual hemorrhage (DH) and placental dysfunction (PD). These three phenotypes were highly correlated by Chi-square analysis [PD and DH (p<2.2e-6); PD and INF (p=6.2e-10); INF and DH (p=0.0036)]. Gene-based testing identified the INS (insulin) gene as significantly associated with cluster 3 of SPTB. Conclusion We identified 5 major clusters of SPTB based on a phenotype tool and hierarchal clustering. There was significant correlation between several of the phenotypes. The INS gene was associated with familial factors underlying SPTB. PMID:26070700

  16. Hierarchical models and chaotic spin glasses

    Science.gov (United States)

    Berker, A. Nihat; McKay, Susan R.

    1984-09-01

    Renormalization-group studies in position space have led to the discovery of hierarchical models which are exactly solvable, exhibiting nonclassical critical behavior at finite temperature. Position-space renormalization-group approximations that had been widely and successfully used are in fact alternatively applicable as exact solutions of hierarchical models, this realizability guaranteeing important physical requirements. For example, a hierarchized version of the Sierpiriski gasket is presented, corresponding to a renormalization-group approximation which has quantitatively yielded the multicritical phase diagrams of submonolayers on graphite. Hierarchical models are now being studied directly as a testing ground for new concepts. For example, with the introduction of frustration, chaotic renormalization-group trajectories were obtained for the first time. Thus, strong and weak correlations are randomly intermingled at successive length scales, and a new microscopic picture and mechanism for a spin glass emerges. An upper critical dimension occurs via a boundary crisis mechanism in cluster-hierarchical variants developed to have well-behaved susceptibilities.

  17. Clustering Patterns of Engagement in Massive Open Online Courses (MOOCs): The Use of Learning Analytics to Reveal Student Categories

    Science.gov (United States)

    Khalil, Mohammad; Ebner, Martin

    2017-01-01

    Massive Open Online Courses (MOOCs) are remote courses that excel in their students' heterogeneity and quantity. Due to the peculiarity of being massiveness, the large datasets generated by MOOC platforms require advanced tools and techniques to reveal hidden patterns for purposes of enhancing learning and educational behaviors. This publication…

  18. Hierarchical auxetic mechanical metamaterials.

    Science.gov (United States)

    Gatt, Ruben; Mizzi, Luke; Azzopardi, Joseph I; Azzopardi, Keith M; Attard, Daphne; Casha, Aaron; Briffa, Joseph; Grima, Joseph N

    2015-02-11

    Auxetic mechanical metamaterials are engineered systems that exhibit the unusual macroscopic property of a negative Poisson's ratio due to sub-unit structure rather than chemical composition. Although their unique behaviour makes them superior to conventional materials in many practical applications, they are limited in availability. Here, we propose a new class of hierarchical auxetics based on the rotating rigid units mechanism. These systems retain the enhanced properties from having a negative Poisson's ratio with the added benefits of being a hierarchical system. Using simulations on typical hierarchical multi-level rotating squares, we show that, through design, one can control the extent of auxeticity, degree of aperture and size of the different pores in the system. This makes the system more versatile than similar non-hierarchical ones, making them promising candidates for industrial and biomedical applications, such as stents and skin grafts.

  19. Introduction into Hierarchical Matrices

    KAUST Repository

    Litvinenko, Alexander

    2013-12-05

    Hierarchical matrices allow us to reduce computational storage and cost from cubic to almost linear. This technique can be applied for solving PDEs, integral equations, matrix equations and approximation of large covariance and precision matrices.

  20. Hierarchical Auxetic Mechanical Metamaterials

    Science.gov (United States)

    Gatt, Ruben; Mizzi, Luke; Azzopardi, Joseph I.; Azzopardi, Keith M.; Attard, Daphne; Casha, Aaron; Briffa, Joseph; Grima, Joseph N.

    2015-02-01

    Auxetic mechanical metamaterials are engineered systems that exhibit the unusual macroscopic property of a negative Poisson's ratio due to sub-unit structure rather than chemical composition. Although their unique behaviour makes them superior to conventional materials in many practical applications, they are limited in availability. Here, we propose a new class of hierarchical auxetics based on the rotating rigid units mechanism. These systems retain the enhanced properties from having a negative Poisson's ratio with the added benefits of being a hierarchical system. Using simulations on typical hierarchical multi-level rotating squares, we show that, through design, one can control the extent of auxeticity, degree of aperture and size of the different pores in the system. This makes the system more versatile than similar non-hierarchical ones, making them promising candidates for industrial and biomedical applications, such as stents and skin grafts.

  1. Applied Bayesian Hierarchical Methods

    CERN Document Server

    Congdon, Peter D

    2010-01-01

    Bayesian methods facilitate the analysis of complex models and data structures. Emphasizing data applications, alternative modeling specifications, and computer implementation, this book provides a practical overview of methods for Bayesian analysis of hierarchical models.

  2. Programming with Hierarchical Maps

    DEFF Research Database (Denmark)

    Ørbæk, Peter

    This report desribes the hierarchical maps used as a central data structure in the Corundum framework. We describe its most prominent features, ague for its usefulness and briefly describe some of the software prototypes implemented using the technology....

  3. Catalysis with hierarchical zeolites

    DEFF Research Database (Denmark)

    Holm, Martin Spangsberg; Taarning, Esben; Egeblad, Kresten

    2011-01-01

    Hierarchical (or mesoporous) zeolites have attracted significant attention during the first decade of the 21st century, and so far this interest continues to increase. There have already been several reviews giving detailed accounts of the developments emphasizing different aspects of this research...... topic. Until now, the main reason for developing hierarchical zeolites has been to achieve heterogeneous catalysts with improved performance but this particular facet has not yet been reviewed in detail. Thus, the present paper summaries and categorizes the catalytic studies utilizing hierarchical...... zeolites that have been reported hitherto. Prototypical examples from some of the different categories of catalytic reactions that have been studied using hierarchical zeolite catalysts are highlighted. This clearly illustrates the different ways that improved performance can be achieved with this family...

  4. Dynamic Organization of Hierarchical Memories.

    Science.gov (United States)

    Kurikawa, Tomoki; Kaneko, Kunihiko

    2016-01-01

    In the brain, external objects are categorized in a hierarchical way. Although it is widely accepted that objects are represented as static attractors in neural state space, this view does not take account interaction between intrinsic neural dynamics and external input, which is essential to understand how neural system responds to inputs. Indeed, structured spontaneous neural activity without external inputs is known to exist, and its relationship with evoked activities is discussed. Then, how categorical representation is embedded into the spontaneous and evoked activities has to be uncovered. To address this question, we studied bifurcation process with increasing input after hierarchically clustered associative memories are learned. We found a "dynamic categorization"; neural activity without input wanders globally over the state space including all memories. Then with the increase of input strength, diffuse representation of higher category exhibits transitions to focused ones specific to each object. The hierarchy of memories is embedded in the transition probability from one memory to another during the spontaneous dynamics. With increased input strength, neural activity wanders over a narrower state space including a smaller set of memories, showing more specific category or memory corresponding to the applied input. Moreover, such coarse-to-fine transitions are also observed temporally during transient process under constant input, which agrees with experimental findings in the temporal cortex. These results suggest the hierarchy emerging through interaction with an external input underlies hierarchy during transient process, as well as in the spontaneous activity.

  5. Revealing the effect of plasmon transmutation on charge transfer plasmons in substrate-mediated metallodielectric aluminum clusters

    Science.gov (United States)

    Nooshnab, Vida; Golmohammadi, Saeed

    2017-01-01

    Aluminum nanoparticle nanocomplexes have extensively been utilized for sustaining ultrastrong plasmonic bonding and antibonding resonant modes across the ultraviolet to visible spectrum. In this study, we analyze the plasmon response for two conventional symmetric heptamer and antisymmetric octamer antennas mediated by conductive film as a substrate to induce very sharp Fano-resonant mode at the high energy states. Besides, presence of an underlying conductive film in touching regime with the plasmonic nanoantennas leads to formation of charge transfer plasmons (CTPs) across the deep-UV band. It is also shown that presence of dielectric carbon nanospheres in the gap spots between proximal nanodisks gives rise to breaking the symmetry of the assemblies, while the new magnetic multipolar modes are induced and divided the Fano dip in two parts as well as formation of a couple of charge transfer plasmon resonant shoulders. The compactness and geometries of the clusters allow for inducing substantially strong resonant modes across the deep-UV domain. Our investigations provide new pathways and features for designing multifunctional molecular probes, biochemical sensors, and cathodoluminescence antennas across the UV spectrum. The proposed analysis were done using a blend of Finite-Difference Time-Domain (FDTD) calculations and transfer of plasmonic charges in nanoscale systems.

  6. Comparisons of clustered regularly interspaced short palindromic repeats and viromes in human saliva reveal bacterial adaptations to salivary viruses.

    Science.gov (United States)

    Pride, David T; Salzman, Julia; Relman, David A

    2012-09-01

    Explorations of human microbiota have provided substantial insight into microbial community composition; however, little is known about interactions between various microbial components in human ecosystems. In response to the powerful impact of viral predation, bacteria have acquired potent defences, including an adaptive immune response based on the clustered regularly interspaced short palindromic repeats (CRISPRs)/Cas system. To improve our understanding of the interactions between bacteria and their viruses in humans, we analysed 13 977 streptococcal CRISPR sequences and compared them with 2 588 172 virome reads in the saliva of four human subjects over 17 months. We found a diverse array of viruses and CRISPR spacers, many of which were specific to each subject and time point. There were numerous viral sequences matching CRISPR spacers; these matches were highly specific for salivary viruses. We determined that spacers and viruses coexist at the same time, which suggests that streptococcal CRISPR/Cas systems are under constant pressure from salivary viruses. CRISPRs in some subjects were just as likely to match viral sequences from other subjects as they were to match viruses from the same subject. Because interactions between bacteria and viruses help to determine the structure of bacterial communities, CRISPR-virus analyses are likely to provide insight into the forces shaping the human microbiome.

  7. Clustering of Infrared-bright Dust-obscured Galaxies Revealed by the Hyper Suprime-Cam and WISE

    Science.gov (United States)

    Toba, Yoshiki; Nagao, Tohru; Kajisawa, Masaru; Oogi, Taira; Akiyama, Masayuki; Ikeda, Hiroyuki; Coupon, Jean; Strauss, Michael A.; Wang, Wei-Hao; Tanaka, Masayuki; Niida, Mana; Imanishi, Masatoshi; Lee, Chien-Hsiu; Matsuhara, Hideo; Matsuoka, Yoshiki; Onoue, Masafusa; Terashima, Yuichi; Ueda, Yoshihiro; Harikane, Yuichi; Komiyama, Yutaka; Miyazaki, Satoshi; Noboriguchi, Akatoki; Usuda, Tomonori

    2017-01-01

    We present measurements of the clustering properties of a sample of infrared (IR) bright dust-obscured galaxies (DOGs). Combining 125 deg2 of wide and deep optical images obtained with the Hyper Suprime-Cam on the Subaru Telescope and all-sky mid-IR images taken with Wide-Field Infrared Survey Explorer, we have discovered 4367 IR-bright DOGs with {(i-[22])}{AB}> 7.0 and flux density at 22 μ {{m}}> 1.0 mJy. We calculate the angular autocorrelation function (ACF) for a uniform subsample of 1411 DOGs with 3.0 mJy r 0 = 12.0 ± 2.0 and 10.3 ± 1.7 {h}-1 Mpc, respectively. IR-bright DOGs reside in massive dark matter halos with a mass of {log}[ /({h}-1 {M}ȯ )]={13.57}-0.55+0.50 and {13.65}-0.52+0.45 in the two cases, respectively.

  8. A Scalable Clustering Algorithm in Dense Mobile Sensor Networks

    Directory of Open Access Journals (Sweden)

    Jianbo Li

    2011-03-01

    Full Text Available Clustering offers a kind of hierarchical organization to provide scalability and basic performance guarantee by partitioning the network into disjoint groups of nodes. In this paper a scalable and energy efficient clustering algorithm is proposed under dense mobile sensor networks scenario. In the initial cluster formation phase, our proposed scheme features a simple execution process with polynomial time complexity, and eliminates the “frozen time” requirement by introducing some GPS-capable mobile nodes to act as cluster heads. In the following cluster maintenance stage, the maintenance of clusters is asynchronously and event driven so as to thoroughly eliminate the “ripple effect” brought by node mobility. As a result local changes in a cluster need not be seen and updated by the entire network, thus bringing greatly reduced communication overheads and being well suitable for the high mobility environment. Extensive simulations have been conducted and the simulation results reveal that our proposed algorithm successfully achieves its target at incurring much less clustering overheads as well as maintaining much more stable cluster structure, as compared to HCC(High Connectivity Clustering  algorithm

  9. Uncovering hierarchical data structure in single molecule transport

    Science.gov (United States)

    Wu, Ben H.; Ivie, Jeffrey A.; Johnson, Tyler K.; Monti, Oliver L. A.

    2017-03-01

    Interpretation of single molecule transport data is complicated by the fact that all such data are inherently highly stochastic in nature. Features are often broad, seemingly unstructured and distributed over more than an order of magnitude. However, the distribution contains information necessary for capturing the full variety of processes relevant in nanoscale transport, and a better understanding of its hierarchical structure is needed to gain deeper insight into the physics and chemistry of single molecule electronics. Here, we describe a novel data analysis approach based on hierarchical clustering to aid in the interpretation of single molecule conductance-displacement histograms. The primary purpose of statistically partitioning transport data is to provide avenues for unbiased hypothesis generation in single molecule break junction experiments by revealing otherwise potentially hidden aspects in the conductance data. Our approach is generalizable to the analysis of a wide variety of other single molecule experiments in molecular electronics, as well as in single molecule fluorescence spectroscopy, force microscopy, and ion-channel conductance measurements.

  10. Disruption of six open reading frames on chromosome X of Saccharomyces cerevisiae reveals a cluster of four essential genes.

    Science.gov (United States)

    Esser, K; Scholle, B; Michaelis, G

    1999-07-01

    In this study we report the construction and basic phenotypic analysis of six Saccharomyces cerevisiae deletion mutants. The open reading frames (ORFs) YJL008C (gene symbol CCT8), YJL010C, YJL011C, YJL012C, YJL017W, and YJL020C from chromosome X have been disrupted by integration of deletion cassettes, comprising the bacterial KanMX4 marker gene and terminal long (LFH) or short (SFH) flanking sequences that are homologous to the 5' and 3' untranslated regions of the respective ORFs. For correct disruption of ORF YJL008C, it was necessary to construct a deletion cassette flanked by 300-350 bp long target guide sequences by LFH-PCR. Transformations using ORF YJL008C gene disruption cassettes synthesized by standard SFH-PCR exclusively resulted in false-positive or multiple integration events, probably because seven additional genes homologous to CCT8 exist in the yeast genome. The other five ORFs have been disrupted using cassettes generated by SFH-PCR, comprising terminal homologous regions of approximately 50 bp to each target site. Correct genomic integration of the reporter modules was verified by analytical PCR and Southern hybridization. Deletion of YJL008C, YJL010C, YJL011C, and YJL012C was found to be lethal, as shown by sporulation and tetrad analysis. This result is in contrast to the finding that only 16-20% of the genes in S. cerevisiae are estimated to be essential. The four essential genes described in this work are clustered, while the two other non-essential ORFs are separated by further ORFs. Although the two viable deletion mutants were tested against 60 different inhibitors, heavy metal ions and salts, no phenotype could be detected that co-segregated with the deletion during meiosis.

  11. Genome-Wide Analysis of Secondary Metabolite Gene Clusters in Ophiostoma ulmi and Ophiostoma novo-ulmi Reveals a Fujikurin-Like Gene Cluster with a Putative Role in Infection

    Directory of Open Access Journals (Sweden)

    Nicolau Sbaraini

    2017-06-01

    Full Text Available The emergence of new microbial pathogens can result in destructive outbreaks, since their hosts have limited resistance and pathogens may be excessively aggressive. Described as the major ecological incident of the twentieth century, Dutch elm disease, caused by ascomycete fungi from the Ophiostoma genus, has caused a significant decline in elm tree populations (Ulmus sp. in North America and Europe. Genome sequencing of the two main causative agents of Dutch elm disease (Ophiostoma ulmi and Ophiostoma novo-ulmi, along with closely related species with different lifestyles, allows for unique comparisons to be made to identify how pathogens and virulence determinants have emerged. Among several established virulence determinants, secondary metabolites (SMs have been suggested to play significant roles during phytopathogen infection. Interestingly, the secondary metabolism of Dutch elm pathogens remains almost unexplored, and little is known about how SM biosynthetic genes are organized in these species. To better understand the metabolic potential of O. ulmi and O. novo-ulmi, we performed a deep survey and description of SM biosynthetic gene clusters (BGCs in these species and assessed their conservation among eight species from the Ophiostomataceae family. Among 19 identified BGCs, a fujikurin-like gene cluster (OpPKS8 was unique to Dutch elm pathogens. Phylogenetic analysis revealed that orthologs for this gene cluster are widespread among phytopathogens and plant-associated fungi, suggesting that OpPKS8 may have been horizontally acquired by the Ophiostoma genus. Moreover, the detailed identification of several BGCs paves the way for future in-depth research and supports the potential impact of secondary metabolism on Ophiostoma genus’ lifestyle.

  12. The association between content of the elements S, Cl, K, Fe, Cu, Zn and Br in normal and cirrhotic liver tissue from Danes and Greenlandic Inuit examined by dual hierarchical clustering analysis

    DEFF Research Database (Denmark)

    Laursen, Jens; Milman, Nils; Pind, N.;

    2014-01-01

    contents according to calculated similarities, one clustering elements according to correlation coefficients between the element contents, both using Euclidian distance and Ward Procedure. RESULTS: One dendrogram separated subjects in 7 clusters showing no differences in ethnicity, gender or age....... The analysis discriminated between elements in normal and cirrhotic livers. The other dendrogram clustered elements in four clusters: sulphur and chlorine; copper and bromine; potassium and zinc; iron. There were significant correlations between the elements in normal liver samples: S was associated with Cl, K...

  13. Cooperative protein structural dynamics of homodimeric hemoglobin linked to water cluster at subunit interface revealed by time-resolved X-ray solution scattering.

    Science.gov (United States)

    Kim, Jong Goo; Muniyappan, Srinivasan; Oang, Key Young; Kim, Tae Wu; Yang, Cheolhee; Kim, Kyung Hwan; Kim, Jeongho; Ihee, Hyotcherl

    2016-03-01

    Homodimeric hemoglobin (HbI) consisting of two subunits is a good model system for investigating the allosteric structural transition as it exhibits cooperativity in ligand binding. In this work, as an effort to extend our previous study on wild-type and F97Y mutant HbI, we investigate structural dynamics of a mutant HbI in solution to examine the role of well-organized interfacial water cluster, which has been known to mediate intersubunit communication in HbI. In the T72V mutant of HbI, the interfacial water cluster in the T state is perturbed due to the lack of Thr72, resulting in two less interfacial water molecules than in wild-type HbI. By performing picosecond time-resolved X-ray solution scattering experiment and kinetic analysis on the T72V mutant, we identify three structurally distinct intermediates (I1, I2, and I3) and show that the kinetics of the T72V mutant are well described by the same kinetic model used for wild-type and F97Y HbI, which involves biphasic kinetics, geminate recombination, and bimolecular CO recombination. The optimized kinetic model shows that the R-T transition and bimolecular CO recombination are faster in the T72V mutant than in the wild type. From structural analysis using species-associated difference scattering curves for the intermediates, we find that the T-like deoxy I3 intermediate in solution has a different structure from deoxy HbI in crystal. In addition, we extract detailed structural parameters of the intermediates such as E-F distance, intersubunit rotation angle, and heme-heme distance. By comparing the structures of protein intermediates in wild-type HbI and the T72V mutant, we reveal how the perturbation in the interfacial water cluster affects the kinetics and structures of reaction intermediates of HbI.

  14. Parallel hierarchical radiosity rendering

    Energy Technology Data Exchange (ETDEWEB)

    Carter, M.

    1993-07-01

    In this dissertation, the step-by-step development of a scalable parallel hierarchical radiosity renderer is documented. First, a new look is taken at the traditional radiosity equation, and a new form is presented in which the matrix of linear system coefficients is transformed into a symmetric matrix, thereby simplifying the problem and enabling a new solution technique to be applied. Next, the state-of-the-art hierarchical radiosity methods are examined for their suitability to parallel implementation, and scalability. Significant enhancements are also discovered which both improve their theoretical foundations and improve the images they generate. The resultant hierarchical radiosity algorithm is then examined for sources of parallelism, and for an architectural mapping. Several architectural mappings are discussed. A few key algorithmic changes are suggested during the process of making the algorithm parallel. Next, the performance, efficiency, and scalability of the algorithm are analyzed. The dissertation closes with a discussion of several ideas which have the potential to further enhance the hierarchical radiosity method, or provide an entirely new forum for the application of hierarchical methods.

  15. Phosphopeptide analysis reveals two discrete clusters of phosphorylation in the N-terminus and the Roc domain of the Parkinson-disease associated protein kinase LRRK2.

    Science.gov (United States)

    Gloeckner, Christian Johannes; Boldt, Karsten; von Zweydorf, Felix; Helm, Sandra; Wiesent, Ludwig; Sarioglu, Hakan; Ueffing, Marius

    2010-04-05

    Mutations in leucine-rich repeat kinase 2 (LRRK2) that increase its kinase activity associate with familial forms of Parkinson disease (PD). As phosphorylation determines the functional state of most protein kinases, we systematically mapped LRRK2 phosphorylation sites by mass spectrometry. Our analysis revealed a high degree of constitutive phosphorylation in a narrow serine-rich region preceding the LRR-domain. Allowing de novo autophosphorylation of purified LRRK2 in an in vitro autokinase assay prior to mass spectrometric analysis, we discovered multiple sites of autophosphorylation. Solely serine and threonine residues were found phosphorylated suggesting LRRK2 as a true serine threonine kinase. Autophosphorylation mainly targets the ROC GTPase domain and its clustering around the GTP binding pocket of ROC suggests cross-regulatory activity between kinase and Roc domain. In conclusion, the phosphoprotein LRRK2 functions as an autocatalytically active serine threonine kinase. Clustering of phosphosites within two discrete domains suggest that phosphorylation may regulate its biological functions in a yet unknown fashion.

  16. Clustering and Functional Coupling of Diverse Ion Channels and Signaling Proteins Revealed by Super-resolution STORM Microscopy in Neurons.

    Science.gov (United States)

    Zhang, Jie; Carver, Chase M; Choveau, Frank S; Shapiro, Mark S

    2016-10-19

    The fidelity of neuronal signaling requires organization of signaling molecules into macromolecular complexes, whose components are in intimate proximity. The intrinsic diffraction limit of light makes visualization of individual signaling complexes using visible light extremely difficult. However, using super-resolution stochastic optical reconstruction microscopy (STORM), we observed intimate association of individual molecules within signaling complexes containing ion channels (M-type K(+), L-type Ca(2+), or TRPV1 channels) and G protein-coupled receptors coupled by the scaffolding protein A-kinase-anchoring protein (AKAP)79/150. Some channels assembled as multi-channel supercomplexes. Surprisingly, we identified novel layers of interplay within macromolecular complexes containing diverse channel types at the single-complex level in sensory neurons, dependent on AKAP79/150. Electrophysiological studies revealed that such ion channels are functionally coupled as well. Our findings illustrate the novel role of AKAP79/150 as a molecular coupler of different channels that conveys crosstalk between channel activities within single microdomains in tuning the physiological response of neurons.

  17. Faint Submillimeter Galaxies Revealed by Multifield Deep ALMA Observations: Number Counts, Spatial Clustering, and Dark Submillimeter Emitters

    CERN Document Server

    Ono, Yoshiaki; Kurono, Yasutaka; Momose, Rieko

    2014-01-01

    We present the statistics of faint submillimeter/millimeter galaxies (SMGs) and serendipitous detections of submillimeter/millimeter emitters (SMEs) with no multi-wavelength continuum counterpart revealed by the deep ALMA observations. We identify faint SMGs with flux densities of 0.1-1.0 mJy in the deep Band 6 and Band 7 maps of 10 independent fields that reduce cosmic variance effects. The differential number counts at 1.2 mm are found to increase with decreasing flux density down to 0.1 mJy. Our number counts indicate that the faint (0.1-1.0 mJy, or SFR_IR ~ 30-300 Msun/yr) SMGs contribute nearly a half of the extragalactic background light (EBL), while the remaining half of the EBL is mostly contributed by very faint sources with flux densities of 1 mJy) SMGs, but comparable with abundant high-z star-forming populations such as sBzKs, LBGs, and LAEs. Finally, we report the serendipitous detections of SMEs with continuum counterparts neither in our 1.2 mm-band nor multi-wavelength images including ultra de...

  18. Hydrogen-Mediated Electron Doping of Gold Clusters As Revealed by In Situ X-ray and UV-vis Absorption Spectroscopy.

    Science.gov (United States)

    Ishida, Ryo; Hayashi, Shun; Yamazoe, Seiji; Kato, Kazuo; Tsukuda, Tatsuya

    2017-06-01

    We previously reported that small (∼1.2 nm) gold clusters stabilized by poly(N-vinyl-2-pyrrolidone) (Au:PVP) exhibited a localized surface plasmon resonance (LSPR) band at ∼520 nm in the presence of NaBH4. To reveal the mechanism of this phenomenon, the electronic structure of Au:PVP during the reaction with NaBH4 in air was examined by means of in situ X-ray absorption spectroscopy at Au L3-edge and UV-vis spectroscopy. These measurements indicated that the appearance of the LSPR band is not associated with the growth in size but is ascribed to electron doping to the Au sp band by the adsorbed H atoms.

  19. Sample phenotype clusters in high-density oligonucleotide microarray data sets are revealed using Isomap, a nonlinear algorithm

    Directory of Open Access Journals (Sweden)

    Malyj Wasyl

    2005-08-01

    Full Text Available Abstract Background Life processes are determined by the organism's genetic profile and multiple environmental variables. However the interaction between these factors is inherently non-linear 1. Microarray data is one representation of the nonlinear interactions among genes and genes and environmental factors. Still most microarray studies use linear methods for the interpretation of nonlinear data. In this study, we apply Isomap, a nonlinear method of dimensionality reduction, to analyze three independent large Affymetrix high-density oligonucleotide microarray data sets. Results Isomap discovered low-dimensional structures embedded in the Affymetrix microarray data sets. These structures correspond to and help to interpret biological phenomena present in the data. This analysis provides examples of temporal, spatial, and functional processes revealed by the Isomap algorithm. In a spinal cord injury data set, Isomap discovers the three main modalities of the experiment – location and severity of the injury and the time elapsed after the injury. In a multiple tissue data set, Isomap discovers a low-dimensional structure that corresponds to anatomical locations of the source tissues. This model is capable of describing low- and high-resolution differences in the same model, such as kidney-vs.-brain and differences between the nuclei of the amygdala, respectively. In a high-throughput drug screening data set, Isomap discovers the monocytic and granulocytic differentiation of myeloid cells and maps several chemical compounds on the two-dimensional model. Conclusion Visualization of Isomap models provides useful tools for exploratory analysis of microarray data sets. In most instances, Isomap models explain more of the variance present in the microarray data than PCA or MDS. Finally, Isomap is a promising new algorithm for class discovery and class prediction in high-density oligonucleotide data sets.

  20. Clustering Approach to Stock Market Prediction

    Directory of Open Access Journals (Sweden)

    M.Suresh Babu

    2012-01-01

    Full Text Available Clustering is an adaptive procedure in which objects are clustered or grouped together, based on the principle of maximizing the intra-class similarity and minimizing the inter-class similarity. Various clustering algorithms have been developed which results to a good performance on datasets for cluster formation. This paper analyze the major clustering algorithms: K-Means, Hierarchical clustering algorithm and reverse K means and compare the performance of these three major clustering algorithms on the aspect of correctly class wise cluster building ability of algorithm. An effective clustering method, HRK (Hierarchical agglomerative and Recursive K-means clustering is proposed, to predict the short-term stock price movements after the release of financial reports. The proposed method consists of three phases. First, we convert each financial report into a feature vector and use the hierarchical agglomerative clustering method to divide the converted feature vectors into clusters. Second, for each cluster, we recursively apply the K-means clustering method to partition each cluster into sub-clusters so that most feature vectors in each subcluster belong to the same class. Then, for each sub cluster, we choose its centroid as the representative feature vector. Finally, we employ the representative feature vectors to predict the stock price movements. The experimental results show the proposed method outperforms SVM in terms of accuracy and average profits.

  1. Embedded Star Clusters in the W51 Giant Molecular Cloud

    CERN Document Server

    Kumar, M S N; Davis, C J

    2004-01-01

    We present sub-arcsecond (0.35"-0.9"), near-infrared J,H,K band photometric observations of six fields along the W51 Giant Molecular Cloud (W51 GMC). Our observations reveal four new, embedded clusters and provide a new high-resolution (0.35") view of the W51IRS2 (G49.5-0.4) region. The cluster associated with G48.9-0.3 is found to be a double cluster enclosed in a nest of near-infrared nebulosity. We construct stellar surface density maps for four major clusters in the W51 GMC. These unveil the underlying hierarchical structure. Color-color and color-magnitude diagrams for each of these clusters show clear differences in the embedded stellar populations and indicate the relative ages of these clusters. In particular, the clusters associated with the HII regions G48.9-0.3 and G49.0-0.3 are found to have a high fraction of YSOs and are therefore considered the youngest of all the near-infrared clusters in the W51 GMC. The estimated masses of the individual clusters, when summed, yield a total stellar mass of ~...

  2. Passive Clustering for Efficient Energy Conservation in Wireless Sensor Network

    Directory of Open Access Journals (Sweden)

    Abderrahim MAIZATE

    2013-02-01

    Full Text Available A wireless sensor network is a set of miniature nodes that consume little energy and route information to a base station. It will enable reliable monitoring of a wide variety of phenomena for civilian, military and medical applications. Almost any sensor network application requires some form of self-organisation to route information. Recent years many protocols for network self-organization and management have been proposed and being implemented. Hierarchical clustering algorithms are very important in increasing the network’s life time. The most important point in this algorithm is cluster head selection and cluster formation because a good clustering guarantees reliability, energy efficiency and load balancing in the network. In this paper, we will use the principles of passive clustering to propose a new mechanism for selecting clusterheads. This mechanism allows the election of an alternate for each cluster head and a dynamic balancing of the role of clusterhead to the alternate when leaving or failure. Thus, it provides several advantages network reliability, stability of clusters and reduces energy consumption among the sensor nodes. Comparison with the existing schemes such as Passive Clustering and GRIDS (Geographically Repulsive Insomnious Distributed Sensors reveals that the mechanism for selecting an alternate for clusterhead nodes, which is the most important factor influencing the clustering performance, can significantly improves the network lifetime.

  3. Faint submillimeter galaxies revealed by multifield deep ALMA observations: number counts, spatial clustering, and a dark submillimeter line emitter

    Energy Technology Data Exchange (ETDEWEB)

    Ono, Yoshiaki; Ouchi, Masami; Momose, Rieko [Institute for Cosmic Ray Research, The University of Tokyo, Kashiwa, Chiba 277-8582 (Japan); Kurono, Yasutaka, E-mail: ono@icrr.u-tokyo.ac.jp [Joint ALMA Observatory, Alonso de Cordova 3107, Vitacura, Santiago 763-0355 (Chile)

    2014-11-01

    We present the statistics of faint submillimeter/millimeter galaxies (SMGs) and serendipitous detections of a submillimeter/millimeter line emitter (SLE) with no multi-wavelength continuum counterpart revealed by the deep ALMA observations. We identify faint SMGs with flux densities of 0.1-1.0 mJy in the deep Band-6 and Band-7 maps of 10 independent fields that reduce cosmic variance effects. The differential number counts at 1.2 mm are found to increase with decreasing flux density down to 0.1 mJy. Our number counts indicate that the faint (0.1-1.0 mJy, or SFR{sub IR} ∼ 30-300 M {sub ☉} yr{sup –1}) SMGs contribute nearly a half of the extragalactic background light (EBL), while the remaining half of the EBL is mostly contributed by very faint sources with flux densities of <0.1 mJy (SFR{sub IR} ≲ 30 M {sub ☉} yr{sup –1}). We conduct counts-in-cells analysis with multifield ALMA data for the faint SMGs, and obtain a coarse estimate of galaxy bias, b {sub g} < 4. The galaxy bias suggests that the dark halo masses of the faint SMGs are ≲ 7 × 10{sup 12} M {sub ☉}, which is smaller than those of bright (>1 mJy) SMGs, but consistent with abundant high-z star-forming populations, such as sBzKs, LBGs, and LAEs. Finally, we report the serendipitous detection of SLE-1, which has no continuum counterparts in our 1.2 mm-band or multi-wavelength images, including ultra deep HST/WFC3 and Spitzer data. The SLE has a significant line at 249.9 GHz with a signal-to-noise ratio of 7.1. If the SLE is not a spurious source made by the unknown systematic noise of ALMA, the strong upper limits of our multi-wavelength data suggest that the SLE would be a faint galaxy at z ≳ 6.

  4. Strategic games on a hierarchical network model

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    Among complex network models, the hierarchical network model is the one most close to such real networks as world trade web, metabolic network, WWW, actor network, and so on. It has not only the property of power-law degree distribution, but growth based on growth and preferential attachment, showing the scale-free degree distribution property. In this paper, we study the evolution of cooperation on a hierarchical network model, adopting the prisoner's dilemma (PD) game and snowdrift game (SG) as metaphors of the interplay between connected nodes. BA model provides a unifying framework for the emergence of cooperation. But interestingly, we found that on hierarchical model, there is no sign of cooperation for PD game, while the frequency of cooperation decreases as the common benefit decreases for SG. By comparing the scaling clustering coefficient properties of the hierarchical network model with that of BA model, we found that the former amplifies the effect of hubs. Considering different performances of PD game and SG on complex network, we also found that common benefit leads to cooperation in the evolution. Thus our study may shed light on the emergence of cooperation in both natural and social environments.

  5. Clustering of resting state networks.

    Directory of Open Access Journals (Sweden)

    Megan H Lee

    Full Text Available BACKGROUND: The goal of the study was to demonstrate a hierarchical structure of resting state activity in the healthy brain using a data-driven clustering algorithm. METHODOLOGY/PRINCIPAL FINDINGS: The fuzzy-c-means clustering algorithm was applied to resting state fMRI data in cortical and subcortical gray matter from two groups acquired separately, one of 17 healthy individuals and the second of 21 healthy individuals. Different numbers of clusters and different starting conditions were used. A cluster dispersion measure determined the optimal numbers of clusters. An inner product metric provided a measure of similarity between different clusters. The two cluster result found the task-negative and task-positive systems. The cluster dispersion measure was minimized with seven and eleven clusters. Each of the clusters in the seven and eleven cluster result was associated with either the task-negative or task-positive system. Applying the algorithm to find seven clusters recovered previously described resting state networks, including the default mode network, frontoparietal control network, ventral and dorsal attention networks, somatomotor, visual, and language networks. The language and ventral attention networks had significant subcortical involvement. This parcellation was consistently found in a large majority of algorithm runs under different conditions and was robust to different methods of initialization. CONCLUSIONS/SIGNIFICANCE: The clustering of resting state activity using different optimal numbers of clusters identified resting state networks comparable to previously obtained results. This work reinforces the observation that resting state networks are hierarchically organized.

  6. Exploring the individual patterns of spiritual well-being in people newly diagnosed with advanced cancer: a cluster analysis.

    Science.gov (United States)

    Bai, Mei; Dixon, Jane; Williams, Anna-Leila; Jeon, Sangchoon; Lazenby, Mark; McCorkle, Ruth

    2016-11-01

    Research shows that spiritual well-being correlates positively with quality of life (QOL) for people with cancer, whereas contradictory findings are frequently reported with respect to the differentiated associations between dimensions of spiritual well-being, namely peace, meaning and faith, and QOL. This study aimed to examine individual patterns of spiritual well-being among patients newly diagnosed with advanced cancer. Cluster analysis was based on the twelve items of the 12-item Functional Assessment of Chronic Illness Therapy-Spiritual Well-Being Scale at Time 1. A combination of hierarchical and k-means (non-hierarchical) clustering methods was employed to jointly determine the number of clusters. Self-rated health, depressive symptoms, peace, meaning and faith, and overall QOL were compared at Time 1 and Time 2. Hierarchical and k-means clustering methods both suggested four clusters. Comparison of the four clusters supported statistically significant and clinically meaningful differences in QOL outcomes among clusters while revealing contrasting relations of faith with QOL. Cluster 1, Cluster 3, and Cluster 4 represented high, medium, and low levels of overall QOL, respectively, with correspondingly high, medium, and low levels of peace, meaning, and faith. Cluster 2 was distinguished from other clusters by its medium levels of overall QOL, peace, and meaning and low level of faith. This study provides empirical support for individual difference in response to a newly diagnosed cancer and brings into focus conceptual and methodological challenges associated with the measure of spiritual well-being, which may partly contribute to the attenuated relation between faith and QOL.

  7. Hierarchical self-organization of tectonic plates

    CERN Document Server

    Morra, Gabriele; Müller, R Dietmar

    2010-01-01

    The Earth's surface is subdivided into eight large tectonic plates and many smaller ones. We reconstruct the plate tessellation history and demonstrate that both large and small plates display two distinct hierarchical patterns, described by different power-law size-relationships. While small plates display little organisational change through time, the structure of the large plates oscillate between minimum and maximum hierarchical tessellations. The organization of large plates rapidly changes from a weak hierarchy at 120-100 million years ago (Ma) towards a strong hierarchy, which peaked at 65-50, Ma subsequently relaxing back towards a minimum hierarchical structure. We suggest that this fluctuation reflects an alternation between top and bottom driven plate tectonics, revealing a previously undiscovered tectonic cyclicity at a timescale of 100 million years.

  8. Hierarchical organisation of Britain through percolation theory

    CERN Document Server

    Arcaute, Elsa; Hatna, Erez; Murcio, Roberto; Vargas-Ruiz, Camilo; Masucci, Paolo; Wang, Jiaqiu; Batty, Michael

    2015-01-01

    Urban systems present hierarchical structures at many different scales. These are observed as administrative regional delimitations, which are the outcome of geographical, political and historical constraints. Using percolation theory on the street intersections and on the road network of Britain, we obtain hierarchies at different scales that are independent of administrative arrangements. Natural boundaries, such as islands and National Parks, consistently emerge at the largest/regional scales. Cities are devised through recursive percolations on each of the emerging clusters, but the system does not undergo a phase transition at the distance threshold at which cities can be defined. This specific distance is obtained by computing the fractal dimension of the clusters extracted at each distance threshold. We observe that the fractal dimension presents a maximum over all the different distance thresholds. The clusters obtained at this maximum are in very good correspondence to the morphological definition of...

  9. Hierarchical Porous Structures

    Energy Technology Data Exchange (ETDEWEB)

    Grote, Christopher John [Los Alamos National Lab. (LANL), Los Alamos, NM (United States)

    2016-06-07

    Materials Design is often at the forefront of technological innovation. While there has always been a push to generate increasingly low density materials, such as aero or hydrogels, more recently the idea of bicontinuous structures has gone more into play. This review will cover some of the methods and applications for generating both porous, and hierarchically porous structures.

  10. Cluster analysis

    CERN Document Server

    Everitt, Brian S; Leese, Morven; Stahl, Daniel

    2011-01-01

    Cluster analysis comprises a range of methods for classifying multivariate data into subgroups. By organizing multivariate data into such subgroups, clustering can help reveal the characteristics of any structure or patterns present. These techniques have proven useful in a wide range of areas such as medicine, psychology, market research and bioinformatics.This fifth edition of the highly successful Cluster Analysis includes coverage of the latest developments in the field and a new chapter dealing with finite mixture models for structured data.Real life examples are used throughout to demons

  11. Hierarchical Neural Regression Models for Customer Churn Prediction

    Directory of Open Access Journals (Sweden)

    Golshan Mohammadi

    2013-01-01

    Full Text Available As customers are the main assets of each industry, customer churn prediction is becoming a major task for companies to remain in competition with competitors. In the literature, the better applicability and efficiency of hierarchical data mining techniques has been reported. This paper considers three hierarchical models by combining four different data mining techniques for churn prediction, which are backpropagation artificial neural networks (ANN, self-organizing maps (SOM, alpha-cut fuzzy c-means (α-FCM, and Cox proportional hazards regression model. The hierarchical models are ANN + ANN + Cox, SOM + ANN + Cox, and α-FCM + ANN + Cox. In particular, the first component of the models aims to cluster data in two churner and nonchurner groups and also filter out unrepresentative data or outliers. Then, the clustered data as the outputs are used to assign customers to churner and nonchurner groups by the second technique. Finally, the correctly classified data are used to create Cox proportional hazards model. To evaluate the performance of the hierarchical models, an Iranian mobile dataset is considered. The experimental results show that the hierarchical models outperform the single Cox regression baseline model in terms of prediction accuracy, Types I and II errors, RMSE, and MAD metrics. In addition, the α-FCM + ANN + Cox model significantly performs better than the two other hierarchical models.

  12. The Application of Hierarchical Cluster Analysis to the Prediction of Grain Security of Small Research Areas-A Case Study of Kunshan%谱系聚类法在小区域粮食安全预测中的应用——以昆山市为例

    Institute of Scientific and Technical Information of China (English)

    姚鑫; 杨桂山; 万荣荣

    2011-01-01

    粮食安全对国民经济的可持续发展起着不可替代的基础性作用,小区域由于受政策因素的影响较大,粮食安全相关指标的变化呈一定阶段性,长时间序列的数学规律并不突出,不利于规划工作的展开.论文基于昆山市的研究,提出谱系聚类与数学模型相结合的基本思路,在此基础上推出了聚类结果有效性的量化判定标准并对聚类法运用准则做了深入的探讨.实际数据分析结果表明:昆山的粮食安全相关的社会经济指标变化确实呈明显阶段性;与利用全部时间序列数据建立的模型相比,运用谱系聚类的模型拟合和预测效果都有明显优势;至2015年,昆山市粮食自给率将下降至6%,最小人均耕地面积降低至0.022 hm2.通过进一步的分析、对比及讨论,文章认为,谱系聚类法运用于小区域粮食安全预测,方法可操作性强,结论科学性显著.%Grain security is fundamental to the sustainable development of our society and national economy. As research regions with small area are vulnerable to the impacts of policy changes, indexes related to grain security of these areas often change in the form of stages, which means that the mathematical regularity of long-term datasets is not significant. As a result, it is difficult to implement grain security programming for the future.We put forward a new method of combining hierarchical cluster analysis with traditional mathematical models, and established a quantification standard for the validity judgment of the clustering results. Meanwhile, a criterion for the using of hierarchical cluster analysis was also proposed, but we strongly recommended that mass data from other research areas are needed to calibrate and perfect it.Kunshan ( 1985 -2007 ) was chosen as a study region to prove the new method, because it is small in area but with rapid economic development. The results of analysis showed that: the indexes related to grain security did

  13. Cooperative protein structural dynamics of homodimeric hemoglobin linked to water cluster at subunit interface revealed by time-resolved X-ray solution scattering

    Directory of Open Access Journals (Sweden)

    Jong Goo Kim

    2016-03-01

    Full Text Available Homodimeric hemoglobin (HbI consisting of two subunits is a good model system for investigating the allosteric structural transition as it exhibits cooperativity in ligand binding. In this work, as an effort to extend our previous study on wild-type and F97Y mutant HbI, we investigate structural dynamics of a mutant HbI in solution to examine the role of well-organized interfacial water cluster, which has been known to mediate intersubunit communication in HbI. In the T72V mutant of HbI, the interfacial water cluster in the T state is perturbed due to the lack of Thr72, resulting in two less interfacial water molecules than in wild-type HbI. By performing picosecond time-resolved X-ray solution scattering experiment and kinetic analysis on the T72V mutant, we identify three structurally distinct intermediates (I1, I2, and I3 and show that the kinetics of the T72V mutant are well described by the same kinetic model used for wild-type and F97Y HbI, which involves biphasic kinetics, geminate recombination, and bimolecular CO recombination. The optimized kinetic model shows that the R-T transition and bimolecular CO recombination are faster in the T72V mutant than in the wild type. From structural analysis using species-associated difference scattering curves for the intermediates, we find that the T-like deoxy I3 intermediate in solution has a different structure from deoxy HbI in crystal. In addition, we extract detailed structural parameters of the intermediates such as E-F distance, intersubunit rotation angle, and heme-heme distance. By comparing the structures of protein intermediates in wild-type HbI and the T72V mutant, we reveal how the perturbation in the interfacial water cluster affects the kinetics and structures of reaction intermediates of HbI.

  14. Modeling the deformation behavior of nanocrystalline alloy with hierarchical microstructures

    Energy Technology Data Exchange (ETDEWEB)

    Liu, Hongxi; Zhou, Jianqiu, E-mail: zhouj@njtech.edu.cn [Nanjing Tech University, Department of Mechanical Engineering (China); Zhao, Yonghao, E-mail: yhzhao@njust.edu.cn [Nanjing University of Science and Technology, Nanostructural Materials Research Center, School of Materials Science and Engineering (China)

    2016-02-15

    A mechanism-based plasticity model based on dislocation theory is developed to describe the mechanical behavior of the hierarchical nanocrystalline alloys. The stress–strain relationship is derived by invoking the impeding effect of the intra-granular solute clusters and the inter-granular nanostructures on the dislocation movements along the sliding path. We found that the interaction between dislocations and the hierarchical microstructures contributes to the strain hardening property and greatly influence the ductility of nanocrystalline metals. The analysis indicates that the proposed model can successfully describe the enhanced strength of the nanocrystalline hierarchical alloy. Moreover, the strain hardening rate is sensitive to the volume fraction of the hierarchical microstructures. The present model provides a new perspective to design the microstructures for optimizing the mechanical properties in nanostructural metals.

  15. Classifying hospitals as mortality outliers: logistic versus hierarchical logistic models.

    Science.gov (United States)

    Alexandrescu, Roxana; Bottle, Alex; Jarman, Brian; Aylin, Paul

    2014-05-01

    The use of hierarchical logistic regression for provider profiling has been recommended due to the clustering of patients within hospitals, but has some associated difficulties. We assess changes in hospital outlier status based on standard logistic versus hierarchical logistic modelling of mortality. The study population consisted of all patients admitted to acute, non-specialist hospitals in England between 2007 and 2011 with a primary diagnosis of acute myocardial infarction, acute cerebrovascular disease or fracture of neck of femur or a primary procedure of coronary artery bypass graft or repair of abdominal aortic aneurysm. We compared standardised mortality ratios (SMRs) from non-hierarchical models with SMRs from hierarchical models, without and with shrinkage estimates of the predicted probabilities (Model 1 and Model 2). The SMRs from standard logistic and hierarchical models were highly statistically significantly correlated (r > 0.91, p = 0.01). More outliers were recorded in the standard logistic regression than hierarchical modelling only when using shrinkage estimates (Model 2): 21 hospitals (out of a cumulative number of 565 pairs of hospitals under study) changed from a low outlier and 8 hospitals changed from a high outlier based on the logistic regression to a not-an-outlier based on shrinkage estimates. Both standard logistic and hierarchical modelling have identified nearly the same hospitals as mortality outliers. The choice of methodological approach should, however, also consider whether the modelling aim is judgment or improvement, as shrinkage may be more appropriate for the former than the latter.

  16. Hierarchical star formation across the grand-design spiral NGC 1566

    Science.gov (United States)

    Gouliermis, Dimitrios A.; Elmegreen, Bruce G.; Elmegreen, Debra M.; Calzetti, Daniela; Cignoni, Michele; Gallagher, John S., III; Kennicutt, Robert C.; Klessen, Ralf S.; Sabbi, Elena; Thilker, David; Ubeda, Leonardo; Aloisi, Alessandra; Adamo, Angela; Cook, David O.; Dale, Daniel; Grasha, Kathryn; Grebel, Eva K.; Johnson, Kelsey E.; Sacchi, Elena; Shabani, Fayezeh; Smith, Linda J.; Wofford, Aida

    2017-06-01

    We investigate how star formation is spatially organized in the grand-design spiral NGC 1566 from deep Hubble Space Telescope photometry with the Legacy ExtraGalactic UV Survey. Our contour-based clustering analysis reveals 890 distinct stellar conglomerations at various levels of significance. These star-forming complexes are organized in a hierarchical fashion with the larger congregations consisting of smaller structures, which themselves fragment into even smaller and more compact stellar groupings. Their size distribution, covering a wide range in length-scales, shows a power law as expected from scale-free processes. We explain this shape with a simple 'fragmentation and enrichment' model. The hierarchical morphology of the complexes is confirmed by their mass-size relation that can be represented by a power law with a fractional exponent, analogous to that determined for fractal molecular clouds. The surface stellar density distribution of the complexes shows a lognormal shape similar to that for supersonic non-gravitating turbulent gas. Between 50 and 65 per cent of the recently formed stars, as well as about 90 per cent of the young star clusters, are found inside the stellar complexes, located along the spiral arms. We find an age difference between young stars inside the complexes and those in their direct vicinity in the arms of at least 10 Myr. This time-scale may relate to the minimum time for stellar evaporation, although we cannot exclude the in situ formation of stars. As expected, star formation preferentially occurs in spiral arms. Our findings reveal turbulent-driven hierarchical star formation along the arms of a grand-design galaxy.

  17. Collaborative Hierarchical Sparse Modeling

    CERN Document Server

    Sprechmann, Pablo; Sapiro, Guillermo; Eldar, Yonina C

    2010-01-01

    Sparse modeling is a powerful framework for data analysis and processing. Traditionally, encoding in this framework is done by solving an l_1-regularized linear regression problem, usually called Lasso. In this work we first combine the sparsity-inducing property of the Lasso model, at the individual feature level, with the block-sparsity property of the group Lasso model, where sparse groups of features are jointly encoded, obtaining a sparsity pattern hierarchically structured. This results in the hierarchical Lasso, which shows important practical modeling advantages. We then extend this approach to the collaborative case, where a set of simultaneously coded signals share the same sparsity pattern at the higher (group) level but not necessarily at the lower one. Signals then share the same active groups, or classes, but not necessarily the same active set. This is very well suited for applications such as source separation. An efficient optimization procedure, which guarantees convergence to the global opt...

  18. Heuristics for Hierarchical Partitioning with Application to Model Checking

    DEFF Research Database (Denmark)

    Möller, Michael Oliver; Alur, Rajeev

    2001-01-01

    Given a collection of connected components, it is often desired to cluster together parts of strong correspondence, yielding a hierarchical structure. We address the automation of this process and apply heuristics to battle the combinatorial and computational complexity. We define a cost function...

  19. Hierarchical manifold learning.

    Science.gov (United States)

    Bhatia, Kanwal K; Rao, Anil; Price, Anthony N; Wolz, Robin; Hajnal, Jo; Rueckert, Daniel

    2012-01-01

    We present a novel method of hierarchical manifold learning which aims to automatically discover regional variations within images. This involves constructing manifolds in a hierarchy of image patches of increasing granularity, while ensuring consistency between hierarchy levels. We demonstrate its utility in two very different settings: (1) to learn the regional correlations in motion within a sequence of time-resolved images of the thoracic cavity; (2) to find discriminative regions of 3D brain images in the classification of neurodegenerative disease,

  20. Hierarchically Structured Electrospun Fibers

    Directory of Open Access Journals (Sweden)

    Nicole E. Zander

    2013-01-01

    Full Text Available Traditional electrospun nanofibers have a myriad of applications ranging from scaffolds for tissue engineering to components of biosensors and energy harvesting devices. The generally smooth one-dimensional structure of the fibers has stood as a limitation to several interesting novel applications. Control of fiber diameter, porosity and collector geometry will be briefly discussed, as will more traditional methods for controlling fiber morphology and fiber mat architecture. The remainder of the review will focus on new techniques to prepare hierarchically structured fibers. Fibers with hierarchical primary structures—including helical, buckled, and beads-on-a-string fibers, as well as fibers with secondary structures, such as nanopores, nanopillars, nanorods, and internally structured fibers and their applications—will be discussed. These new materials with helical/buckled morphology are expected to possess unique optical and mechanical properties with possible applications for negative refractive index materials, highly stretchable/high-tensile-strength materials, and components in microelectromechanical devices. Core-shell type fibers enable a much wider variety of materials to be electrospun and are expected to be widely applied in the sensing, drug delivery/controlled release fields, and in the encapsulation of live cells for biological applications. Materials with a hierarchical secondary structure are expected to provide new superhydrophobic and self-cleaning materials.

  1. HDS: Hierarchical Data System

    Science.gov (United States)

    Pearce, Dave; Walter, Anton; Lupton, W. F.; Warren-Smith, Rodney F.; Lawden, Mike; McIlwrath, Brian; Peden, J. C. M.; Jenness, Tim; Draper, Peter W.

    2015-02-01

    The Hierarchical Data System (HDS) is a file-based hierarchical data system designed for the storage of a wide variety of information. It is particularly suited to the storage of large multi-dimensional arrays (with their ancillary data) where efficient access is needed. It is a key component of the Starlink software collection (ascl:1110.012) and is used by the Starlink N-Dimensional Data Format (NDF) library (ascl:1411.023). HDS organizes data into hierarchies, broadly similar to the directory structure of a hierarchical filing system, but contained within a single HDS container file. The structures stored in these files are self-describing and flexible; HDS supports modification and extension of structures previously created, as well as functions such as deletion, copying, and renaming. All information stored in HDS files is portable between the machines on which HDS is implemented. Thus, there are no format conversion problems when moving between machines. HDS can write files in a private binary format (version 4), or be layered on top of HDF5 (version 5).

  2. Swift reveals a ~5.7 day super-orbital period in the M31 globular cluster X-ray binary XB158

    CERN Document Server

    Barnard, R; Murrray, S S

    2015-01-01

    The M31 globular cluster X-ray binary XB158 (a.k.a. Bo 158) exhibits intensity dips on a 2.78 hr period in some observations, but not others. The short period suggests a low mass ratio, and an asymmetric, precessing disk due to additional tidal torques from the donor star since the disk crosses the 3:1 resonance. Previous theoretical 3D smoothed particle hydrodynamical modeling suggested a super-orbital disk precession period 29$\\pm$1 times the orbital period, i.e. $\\sim$81$\\pm$3 hr. We conducted a Swift monitoring campaign of 30 observations over ~1 month in order to search for evidence of such a super-orbital period. Fitting the 0.3--10 keV Swift XRT luminosity lightcurve with a sinusoid yielded a period of 5.65+/-0.05 days, and a >5$\\sigma$ improvement in $\\chi^2$ over the best fit constant intensity model. A Lomb-Scargle periodogram revealed that periods 5.4--5.8 days were detected at a >3$\\sigma$ level, with a peak at 5.6 days. We consider this strong evidence for a 5.65 day super-orbital period, ~70\\% l...

  3. Unsupervised Clustering of Subcellular Protein Expression Patterns in High-Throughput Microscopy Images Reveals Protein Complexes and Functional Relationships between Proteins

    Science.gov (United States)

    Handfield, Louis-François; Chong, Yolanda T.; Simmons, Jibril; Andrews, Brenda J.; Moses, Alan M.

    2013-01-01

    Protein subcellular localization has been systematically characterized in budding yeast using fluorescently tagged proteins. Based on the fluorescence microscopy images, subcellular localization of many proteins can be classified automatically using supervised machine learning approaches that have been trained to recognize predefined image classes based on statistical features. Here, we present an unsupervised analysis of protein expression patterns in a set of high-resolution, high-throughput microscope images. Our analysis is based on 7 biologically interpretable features which are evaluated on automatically identified cells, and whose cell-stage dependency is captured by a continuous model for cell growth. We show that it is possible to identify most previously identified localization patterns in a cluster analysis based on these features and that similarities between the inferred expression patterns contain more information about protein function than can be explained by a previous manual categorization of subcellular localization. Furthermore, the inferred cell-stage associated to each fluorescence measurement allows us to visualize large groups of proteins entering the bud at specific stages of bud growth. These correspond to proteins localized to organelles, revealing that the organelles must be entering the bud in a stereotypical order. We also identify and organize a smaller group of proteins that show subtle differences in the way they move around the bud during growth. Our results suggest that biologically interpretable features based on explicit models of cell morphology will yield unprecedented power for pattern discovery in high-resolution, high-throughput microscopy images. PMID:23785265

  4. Crystal Structure of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-associated Csn2 Protein Revealed Ca[superscript 2+]-dependent Double-stranded DNA Binding Activity

    Energy Technology Data Exchange (ETDEWEB)

    Nam, Ki Hyun; Kurinov, Igor; Ke, Ailong (Cornell); (NWU)

    2012-05-22

    Clustered regularly interspaced short palindromic repeats (CRISPR) and their associated protein genes (cas genes) are widespread in bacteria and archaea. They form a line of RNA-based immunity to eradicate invading bacteriophages and malicious plasmids. A key molecular event during this process is the acquisition of new spacers into the CRISPR loci to guide the selective degradation of the matching foreign genetic elements. Csn2 is a Nmeni subtype-specific cas gene required for new spacer acquisition. Here we characterize the Enterococcus faecalis Csn2 protein as a double-stranded (ds-) DNA-binding protein and report its 2.7 {angstrom} tetrameric ring structure. The inner circle of the Csn2 tetrameric ring is {approx}26 {angstrom} wide and populated with conserved lysine residues poised for nonspecific interactions with ds-DNA. Each Csn2 protomer contains an {alpha}/{beta} domain and an {alpha}-helical domain; significant hinge motion was observed between these two domains. Ca{sup 2+} was located at strategic positions in the oligomerization interface. We further showed that removal of Ca{sup 2+} ions altered the oligomerization state of Csn2, which in turn severely decreased its affinity for ds-DNA. In summary, our results provided the first insight into the function of the Csn2 protein in CRISPR adaptation by revealing that it is a ds-DNA-binding protein functioning at the quaternary structure level and regulated by Ca{sup 2+} ions.

  5. In silico clustering of Salmonella global gene expression data reveals novel genes co-regulated with the SPI-1 virulence genes through HilD

    Science.gov (United States)

    Martínez-Flores, Irma; Pérez-Morales, Deyanira; Sánchez-Pérez, Mishael; Paredes, Claudia C.; Collado-Vides, Julio; Salgado, Heladia; Bustamante, Víctor H.

    2016-01-01

    A wide variety of Salmonella enterica serovars cause intestinal and systemic infections to humans and animals. Salmonella Patogenicity Island 1 (SPI-1) is a chromosomal region containing 39 genes that have crucial virulence roles. The AraC-like transcriptional regulator HilD, encoded in SPI-1, positively controls the expression of the SPI-1 genes, as well as of several other virulence genes located outside SPI-1. In this study, we applied a clustering method to the global gene expression data of S. enterica serovar Typhimurium from the COLOMBOS database; thus genes that show an expression pattern similar to that of SPI-1 genes were selected. This analysis revealed nine novel genes that are co-expressed with SPI-1, which are located in different chromosomal regions. Expression analyses and protein-DNA interaction assays showed regulation by HilD for six of these genes: gtgE, phoH, sinR, SL1263 (lpxR) and SL4247 were regulated directly, whereas SL1896 was regulated indirectly. Interestingly, phoH is an ancestral gene conserved in most of bacteria, whereas the other genes show characteristics of genes acquired by Salmonella. A role in virulence has been previously demonstrated for gtgE, lpxR and sinR. Our results further expand the regulon of HilD and thus identify novel possible Salmonella virulence genes. PMID:27886269

  6. Unsupervised clustering of subcellular protein expression patterns in high-throughput microscopy images reveals protein complexes and functional relationships between proteins.

    Directory of Open Access Journals (Sweden)

    Louis-François Handfield

    Full Text Available Protein subcellular localization has been systematically characterized in budding yeast using fluorescently tagged proteins. Based on the fluorescence microscopy images, subcellular localization of many proteins can be classified automatically using supervised machine learning approaches that have been trained to recognize predefined image classes based on statistical features. Here, we present an unsupervised analysis of protein expression patterns in a set of high-resolution, high-throughput microscope images. Our analysis is based on 7 biologically interpretable features which are evaluated on automatically identified cells, and whose cell-stage dependency is captured by a continuous model for cell growth. We show that it is possible to identify most previously identified localization patterns in a cluster analysis based on these features and that similarities between the inferred expression patterns contain more information about protein function than can be explained by a previous manual categorization of subcellular localization. Furthermore, the inferred cell-stage associated to each fluorescence measurement allows us to visualize large groups of proteins entering the bud at specific stages of bud growth. These correspond to proteins localized to organelles, revealing that the organelles must be entering the bud in a stereotypical order. We also identify and organize a smaller group of proteins that show subtle differences in the way they move around the bud during growth. Our results suggest that biologically interpretable features based on explicit models of cell morphology will yield unprecedented power for pattern discovery in high-resolution, high-throughput microscopy images.

  7. Crystal structure of clustered regularly interspaced short palindromic repeats (CRISPR)-associated Csn2 protein revealed Ca2+-dependent double-stranded DNA binding activity.

    Science.gov (United States)

    Nam, Ki Hyun; Kurinov, Igor; Ke, Ailong

    2011-09-02

    Clustered regularly interspaced short palindromic repeats (CRISPR) and their associated protein genes (cas genes) are widespread in bacteria and archaea. They form a line of RNA-based immunity to eradicate invading bacteriophages and malicious plasmids. A key molecular event during this process is the acquisition of new spacers into the CRISPR loci to guide the selective degradation of the matching foreign genetic elements. Csn2 is a Nmeni subtype-specific cas gene required for new spacer acquisition. Here we characterize the Enterococcus faecalis Csn2 protein as a double-stranded (ds-) DNA-binding protein and report its 2.7 Å tetrameric ring structure. The inner circle of the Csn2 tetrameric ring is ∼26 Å wide and populated with conserved lysine residues poised for nonspecific interactions with ds-DNA. Each Csn2 protomer contains an α/β domain and an α-helical domain; significant hinge motion was observed between these two domains. Ca(2+) was located at strategic positions in the oligomerization interface. We further showed that removal of Ca(2+) ions altered the oligomerization state of Csn2, which in turn severely decreased its affinity for ds-DNA. In summary, our results provided the first insight into the function of the Csn2 protein in CRISPR adaptation by revealing that it is a ds-DNA-binding protein functioning at the quaternary structure level and regulated by Ca(2+) ions.

  8. Unsupervised clustering of subcellular protein expression patterns in high-throughput microscopy images reveals protein complexes and functional relationships between proteins.

    Directory of Open Access Journals (Sweden)

    Louis-François Handfield

    Full Text Available Protein subcellular localization has been systematically characterized in budding yeast using fluorescently tagged proteins. Based on the fluorescence microscopy images, subcellular localization of many proteins can be classified automatically using supervised machine learning approaches that have been trained to recognize predefined image classes based on statistical features. Here, we present an unsupervised analysis of protein expression patterns in a set of high-resolution, high-throughput microscope images. Our analysis is based on 7 biologically interpretable features which are evaluated on automatically identified cells, and whose cell-stage dependency is captured by a continuous model for cell growth. We show that it is possible to identify most previously identified localization patterns in a cluster analysis based on these features and that similarities between the inferred expression patterns contain more information about protein function than can be explained by a previous manual categorization of subcellular localization. Furthermore, the inferred cell-stage associated to each fluorescence measurement allows us to visualize large groups of proteins entering the bud at specific stages of bud growth. These correspond to proteins localized to organelles, revealing that the organelles must be entering the bud in a stereotypical order. We also identify and organize a smaller group of proteins that show subtle differences in the way they move around the bud during growth. Our results suggest that biologically interpretable features based on explicit models of cell morphology will yield unprecedented power for pattern discovery in high-resolution, high-throughput microscopy images.

  9. Unsupervised clustering of subcellular protein expression patterns in high-throughput microscopy images reveals protein complexes and functional relationships between proteins.

    Science.gov (United States)

    Handfield, Louis-François; Chong, Yolanda T; Simmons, Jibril; Andrews, Brenda J; Moses, Alan M

    2013-01-01

    Protein subcellular localization has been systematically characterized in budding yeast using fluorescently tagged proteins. Based on the fluorescence microscopy images, subcellular localization of many proteins can be classified automatically using supervised machine learning approaches that have been trained to recognize predefined image classes based on statistical features. Here, we present an unsupervised analysis of protein expression patterns in a set of high-resolution, high-throughput microscope images. Our analysis is based on 7 biologically interpretable features which are evaluated on automatically identified cells, and whose cell-stage dependency is captured by a continuous model for cell growth. We show that it is possible to identify most previously identified localization patterns in a cluster analysis based on these features and that similarities between the inferred expression patterns contain more information about protein function than can be explained by a previous manual categorization of subcellular localization. Furthermore, the inferred cell-stage associated to each fluorescence measurement allows us to visualize large groups of proteins entering the bud at specific stages of bud growth. These correspond to proteins localized to organelles, revealing that the organelles must be entering the bud in a stereotypical order. We also identify and organize a smaller group of proteins that show subtle differences in the way they move around the bud during growth. Our results suggest that biologically interpretable features based on explicit models of cell morphology will yield unprecedented power for pattern discovery in high-resolution, high-throughput microscopy images.

  10. A study of hierarchical structure on South China industrial electricity-consumption correlation

    Science.gov (United States)

    Yao, Can-Zhong; Lin, Ji-Nan; Liu, Xiao-Feng

    2016-02-01

    Based on industrial electricity-consumption data of five southern provinces of China from 2005 to 2013, we study the industrial correlation mechanism with MST (minimal spanning tree) and HT (hierarchical tree) models. First, we comparatively analyze the industrial electricity-consumption correlation structure in pre-crisis and after-crisis period using MST model and Bootstrap technique of statistical reliability test of links. Results exhibit that all industrial electricity-consumption trees of five southern provinces of China in pre-crisis and after-crisis time are in formation of chain, and the "center-periphery structure" of those chain-like trees is consistent with industrial specialization in classical industrial chain theory. Additionally, the industrial structure of some provinces is reorganized and transferred in pre-crisis and after-crisis time. Further, the comparative analysis with hierarchical tree and Bootstrap technique demonstrates that as for both observations of GD and overall NF, the industrial electricity-consumption correlation is non-significant clustered in pre-crisis period, whereas it turns significant clustered in after-crisis time. Therefore we propose that in perspective of electricity-consumption, their industrial structures are directed to optimized organization and global correlation. Finally, the analysis of distance of HTs verifies that industrial reorganization and development may strengthen market integration, coordination and correlation of industrial production. Except GZ, other four provinces have a shorter distance of industrial electricity-consumption correlation in after-crisis period, revealing a better performance of regional specialization and integration.

  11. The fragmentation and stability of hierarchical structure in Serpens South

    CERN Document Server

    Friesen, R K; Di Francesco, J; Gutermuth, R; Myers, P C

    2016-01-01

    Filamentary structures are ubiquitous in molecular clouds, and have been recently argued to play an important role in regulating the size and mass of embedded clumps through fragmentation and mass accretion. Here, we reveal the dynamical state and fragmentation of filamentary molecular gas associated with the Serpens South protocluster through analysis of wide (~4 x 4 pc) observations of NH3 (1,1) and (2,2) inversion transitions with the Green Bank Telescope. Detailed modeling of the NH3 lines reveals that the kinematics of the cluster and surrounding filaments are complex. We identify hierarchical structure using a dendrogram analysis of the NH3 emission. The distance between neighbour structures that are embedded within the same parent structure is generally greater than expected from a spherical Jeans analysis, and is in better agreement with cylindrical fragmentation models. The NH3 line width-size relation is flat, and average gas motions are sub- or trans-sonic over all physical scales observed. Subsoni...

  12. Hierarchical Scaling in Systems of Natural Cities

    CERN Document Server

    Chen, Yanguang

    2016-01-01

    Hierarchies can be modeled by a set of exponential functions, from which we can derive a set of power laws indicative of scaling. These scaling laws are followed by many natural and social phenomena such as cities, earthquakes, and rivers. This paper is devoted to revealing the scaling patterns in systems of natural cities by reconstructing the hierarchy with cascade structure. The cities of America, Britain, France, and Germany are taken as examples to make empirical analyses. The hierarchical scaling relations can be well fitted to the data points within the scaling ranges of the size and area of the natural cities. The size-number and area-number scaling exponents are close to 1, and the allometric scaling exponent is slightly less than 1. The results suggest that natural cities follow hierarchical scaling laws and hierarchical conservation law. Zipf's law proved to be one of the indications of the hierarchical scaling, and the primate law of city-size distribution represents a local pattern and can be mer...

  13. Hierarchical Ag mesostructures for single particle SERS substrate

    Science.gov (United States)

    Xu, Minwei; Zhang, Yin

    2017-01-01

    Hierarchical Ag mesostructures with highly rough surface morphology have been synthesized at room temperature through a simple seed-mediated approach. Electron microscopy characterizations indicate that the obtained Ag mesostructures exhibit a textured surface morphology with the flower-like architecture. Moreover, the particle size can be tailored easily in the range of 250-500 nm. For the growth process of the hierarchical Ag mesostructures, it is believed that the self-assembly mechanism is more reasonable rather than the epitaxial overgrowth of Ag seed. The oriented attachment of nanoparticles is revealed during the formation of Ag mesostructures. Single particle surface enhanced Raman spectra (sp-SERS) of crystal violet adsorbed on the hierarchical Ag mesostructures were measured. Results reveal that the hierarchical Ag mesostructures can be highly sensitive sp-SERS substrates with good reproducibility. The average enhancement factors for individual Ag mesostructures are estimated to be about 106.

  14. Detecting Hierarchical Structure in Networks

    DEFF Research Database (Denmark)

    Herlau, Tue; Mørup, Morten; Schmidt, Mikkel Nørgaard;

    2012-01-01

    a generative Bayesian model that is able to infer whether hierarchies are present or not from a hypothesis space encompassing all types of hierarchical tree structures. For efficient inference we propose a collapsed Gibbs sampling procedure that jointly infers a partition and its hierarchical structure......Many real-world networks exhibit hierarchical organization. Previous models of hierarchies within relational data has focused on binary trees; however, for many networks it is unknown whether there is hierarchical structure, and if there is, a binary tree might not account well for it. We propose....... On synthetic and real data we demonstrate that our model can detect hierarchical structure leading to better link-prediction than competing models. Our model can be used to detect if a network exhibits hierarchical structure, thereby leading to a better comprehension and statistical account the network....

  15. Context updates are hierarchical

    Directory of Open Access Journals (Sweden)

    Anton Karl Ingason

    2016-10-01

    Full Text Available This squib studies the order in which elements are added to the shared context of interlocutors in a conversation. It focuses on context updates within one hierarchical structure and argues that structurally higher elements are entered into the context before lower elements, even if the structurally higher elements are pronounced after the lower elements. The crucial data are drawn from a comparison of relative clauses in two head-initial languages, English and Icelandic, and two head-final languages, Korean and Japanese. The findings have consequences for any theory of a dynamic semantics.

  16. The structure of dust aggregates in hierarchical coagulation

    CERN Document Server

    Dominik, Carsten; Borel, Herman

    2016-01-01

    Dust coagulation in interstellar space and protoplanetary disks is usually treated as one of 2 extreme cases: Particle-Cluster Aggregation and Cluster-Cluster Aggregation. In this paper we study the process of hierarchical growth, where aggregates are built from significantly smaller aggregates (but not monomers). We show that this process can be understood as a modified, PCA-like process that produces porous, but non-fractal particles whose filling factor is chiefly determined by the porosity of the building blocks. We also show that in a coagulation environment where relative velocities are driven by turbulence, a logarithmically flat mass distribution (equal mass per mass decade) as it is typically found in environments where fragmentation replenishes small grains, leads to a situation where small particles and aggregates dominate the growth of large ones. Therefore, in such environments, hierarchical growth should be seen as the norm. Consequently, we predict that the aggregates in such environments are n...

  17. Sequence-based Screening for Rare Enzymes: New Insights into the World of AMDases Reveal a Conserved Motif and 58 Novel Enzymes Clustering in Eight Distinct Families.

    Directory of Open Access Journals (Sweden)

    Janine Maimanakos

    2016-08-01

    Full Text Available Arylmalonate-Decarboxylases (AMDases, EC 4.1.1.76 are very rare and mostly underexplored enzymes. Currently only four known and biochemically characterized representatives exist. However, their ability to decarboxylate α-disubstituted malonic acid derivatives to optically pure products without cofactors makes them attractive and promising candidates for the use as biocatalysts in industrial processes. Until now, AMDases could not be separated from other members of the aspartate/glutamate racemase superfamily based on their gene sequences. Within this work, a search algorithm was developed that enables a reliable prediction of AMDase activity for potential candidates. Based on specific sequence patterns and screening methods 58 novel AMDase candidate genes could be identified in this work. Thereby, AMDases with the conserved sequence pattern of Bordetella bronchiseptica’s prototype appeared to be limited to the classes of Alpha-, Beta- and Gammaproteobacteria. Amino acid homologies and comparison of gene surrounding sequences enabled the classification of eight enzyme clusters. Particularly striking is the accumulation of genes coding for different transporters of the TTT family, TRAP transporters and ABC transporters as well as genes coding for mandelate racemases/muconate lactonizing enzymes that might be involved in substrate uptake or degradation of AMDase products. Further, three novel AMDases were characterized which showed a high enantiomeric excess (>99% of the (R-enantiomer of flurbiprofen. These are the recombinant AmdA and AmdV from Variovorax sp. strains HH01 and HH02, originated from soil, and AmdP from Polymorphum gilvum found by a data base search. Altogether our findings give new insights into the class of AMDases and reveal many previously unknown enzyme candidates with high potential for bioindustrial processes.

  18. Embryonic neural inducing factor churchill is not a DNA-binding zinc finger protein: solution structure reveals a solvent-exposed beta-sheet and zinc binuclear cluster.

    Science.gov (United States)

    Lee, Brian M; Buck-Koehntop, Bethany A; Martinez-Yamout, Maria A; Dyson, H Jane; Wright, Peter E

    2007-08-31

    Churchill is a zinc-containing protein that is involved in neural induction during embryogenesis. At the time of its discovery, it was thought on the basis of sequence alignment to contain two zinc fingers of the C4 type. Further, binding of an N-terminal GST-Churchill fusion protein to a particular DNA sequence was demonstrated by immunoprecipitation selection assay, suggesting that Churchill may function as a transcriptional regulator by sequence-specific DNA binding. We show by NMR solution structure determination that, far from containing canonical C4 zinc fingers, the protein contains three bound zinc ions in novel coordination sites, including an unusual binuclear zinc cluster. The secondary structure of Churchill is also unusual, consisting of a highly solvent-exposed single-layer beta-sheet. Hydrogen-deuterium exchange and backbone relaxation measurements reveal that Churchill is unusually dynamic on a number of time scales, with the exception of regions surrounding the zinc coordinating sites, which serve to stabilize the otherwise unstructured N terminus and the single-layer beta-sheet. No binding of Churchill to the previously identified DNA sequence could be detected, and extensive searches using DNA sequence selection techniques could find no other DNA sequence that was bound by Churchill. Since the N-terminal amino acids of Churchill form part of the zinc-binding motif, the addition of a fusion protein at the N terminus causes loss of zinc and unfolding of Churchill. This observation most likely explains the published DNA-binding results, which would arise due to non-specific interaction of the unfolded protein in the immunoprecipitation selection assay. Since Churchill does not appear to bind DNA, we suggest that it may function in embryogenesis as a protein-interaction factor.

  19. Molecular analysis of three Ljungan virus isolates reveals a new, close-to-root lineage of the Picornaviridae with a cluster of two unrelated 2A proteins.

    Science.gov (United States)

    Johansson, Susanne; Niklasson, Bo; Maizel, Jacob; Gorbalenya, Alexander E; Lindberg, A Michael

    2002-09-01

    Ljungan virus (LV) is a suspected human pathogen recently isolated from bank voles (Clethrionomys glareolus). In the present study, it is revealed through comparative sequence analysis that three newly determined Swedish LV genomes are closely related and possess a deviant picornavirus-like organization: 5' untranslated region-VP0-VP3-VP1-2A1-2A2-2B-2C-3A-3B-3C-3D-3' untranslated region. The LV genomes and the polyproteins encoded by them exhibit several exceptional features, such as the absence of a predicted maturation cleavage of VP0, a conserved sequence determinant in VP0 that is typically found in VP1 of other picornaviruses, and a cluster of two unrelated 2A proteins. The 2A1 protein is related to the 2A protein of cardio-, erbo-, tescho-, and aphthoviruses, and the 2A2 protein is related to the 2A protein of parechoviruses, kobuviruses, and avian encephalomyelitis virus. The unprecedented association of two structurally different 2A proteins is a feature never previously observed among picornaviruses and implies that their functions are not mutually exclusive. Secondary polyprotein processing of the LV polyprotein is mediated by proteinase 3C (3C(pro)) possessing canonical affinity to Glu and Gln at the P1 position and small amino acid residues at the P1' position. In addition, LV 3C(pro) appears to have unique substrate specificity to Asn, Gln, and Asp and to bulky hydrophobic residues at the P2 and P4 positions, respectively. Phylogenetic analysis suggests that LVs form a separate division, which, together with the Parechovirus genus, has branched off the picornavirus tree most closely to its root. The presence of two 2A proteins indicates that some contemporary picornaviruses with a single 2A may have evolved from the ancestral multi-2A picornavirus.

  20. Multiple Genome Comparison within a Bacterial Species Reveals a Unit of Evolution Spanning Two Adjacent Genes in a Tandem Paralog Cluster

    Science.gov (United States)

    Tsuru, Takeshi

    2008-01-01

    It has been assumed that an open reading frame (ORF) represents a unit of gene evolution as well as a unit of gene expression and function. In the present work, we report a case in which a unit comprising the 3′ region of an ORF linked to a downstream intergenic region that is in turn linked to the 5′ region of a downstream ORF has been conserved, and has served as the unit of gene evolution. The genes are tandem paralogous genes from the bacterium Staphylococcus aureus, for which more than ten entire genomes have been sequenced. We compared these multiple genome sequences at a locus for the lpl (lipoprotein-like) cluster (encoding lipoprotein homologs presumably related to their host interaction) in the genomic island termed νSaα. A highly conserved nucleotide sequence found within every lpl ORF is likely to provide a site for homologous recombination. Comparison of phylogenies of the 5′-variable region and the 3′-variable region within the same ORF revealed significant incongruence. In contrast, pairs of the 3′-variable region of an ORF and the 5′-variable region of the next downstream ORF gave more congruent phylogenies, with distinct groups of conserved pairs. The intergenic region seemed to have coevolved with the flanking variable regions. Multiple recombination events at the central conserved region appear to have caused various types of rearrangements among strains, shuffling the two variable regions in one ORF, but maintaining a conserved unit comprising the 3′-variable region, the intergenic region, and the 5′-variable region spanning adjacent ORFs. This result has strong impact on our understanding of gene evolution because most gene lineages underwent tandem duplication and then diversified. This work also illustrates the use of multiple genome sequences for high-resolution evolutionary analysis within the same species. PMID:18765438

  1. Sequence-Based Screening for Rare Enzymes: New Insights into the World of AMDases Reveal a Conserved Motif and 58 Novel Enzymes Clustering in Eight Distinct Families

    Science.gov (United States)

    Maimanakos, Janine; Chow, Jennifer; Gaßmeyer, Sarah K.; Güllert, Simon; Busch, Florian; Kourist, Robert; Streit, Wolfgang R.

    2016-01-01

    Arylmalonate Decarboxylases (AMDases, EC 4.1.1.76) are very rare and mostly underexplored enzymes. Currently only four known and biochemically characterized representatives exist. However, their ability to decarboxylate α-disubstituted malonic acid derivatives to optically pure products without cofactors makes them attractive and promising candidates for the use as biocatalysts in industrial processes. Until now, AMDases could not be separated from other members of the aspartate/glutamate racemase superfamily based on their gene sequences. Within this work, a search algorithm was developed that enables a reliable prediction of AMDase activity for potential candidates. Based on specific sequence patterns and screening methods 58 novel AMDase candidate genes could be identified in this work. Thereby, AMDases with the conserved sequence pattern of Bordetella bronchiseptica’s prototype appeared to be limited to the classes of Alpha-, Beta-, and Gamma-proteobacteria. Amino acid homologies and comparison of gene surrounding sequences enabled the classification of eight enzyme clusters. Particularly striking is the accumulation of genes coding for different transporters of the tripartite tricarboxylate transporters family, TRAP transporters and ABC transporters as well as genes coding for mandelate racemases/muconate lactonizing enzymes that might be involved in substrate uptake or degradation of AMDase products. Further, three novel AMDases were characterized which showed a high enantiomeric excess (>99%) of the (R)-enantiomer of flurbiprofen. These are the recombinant AmdA and AmdV from Variovorax sp. strains HH01 and HH02, originated from soil, and AmdP from Polymorphum gilvum found by a data base search. Altogether our findings give new insights into the class of AMDases and reveal many previously unknown enzyme candidates with high potential for bioindustrial processes. PMID:27610105

  2. Molecular analysis of the beta-globin gene cluster in the Niokholo Mandenka population reveals a recent origin of the beta(S) Senegal mutation.

    Science.gov (United States)

    Currat, Mathias; Trabuchet, Guy; Rees, David; Perrin, Pascale; Harding, Rosalind M; Clegg, John B; Langaney, André; Excoffier, Laurent

    2002-01-01

    A large and ethnically well-defined Mandenka sample from eastern Senegal was analyzed for the polymorphism of the beta-globin gene cluster on chromosome 11. Five RFLP sites of the 5' region were investigated in 193 individuals revealing the presence of 10 different haplotypes. The frequency of the sickle-cell anemia causing mutation (beta(S)) in the Mandenka estimated from this sample is 11.7%. This mutation was found strictly associated with the single Senegal haplotype. Approximately 600 bp of the upstream region of the beta-globin gene were sequenced for a subset of 94 chromosomes, showing the presence of four transversions, five transitions, and a composite microsatellite polymorphism. The sequence of 22 beta(S) chromosomes was also identical to the previously defined Senegal haplotype, suggesting that this mutation is very recent. Monte Carlo simulations (allowing for a specific balancing selection model, a logistic growth of the population, and variable initial frequencies of the Senegal haplotype) were used to estimate the age of the beta(S) mutation. Resulting maximum-likelihood estimates are 45-70 generations (1,350-2,100 years) for very different demographic scenarios. Smallest confidence intervals (25-690 generations) are obtained under the hypothesis that the Mandenka population is large (N(e) >5,000) and stationary or that it has undergone a rapid demographic expansion to a current size of >5,000 reproducing individuals, which is quite likely in view of the great diversity found on beta(A) chromosomes.

  3. Cluster Physics with Merging Galaxy Clusters

    Directory of Open Access Journals (Sweden)

    Sandor M. Molnar

    2016-02-01

    Full Text Available Collisions between galaxy clusters provide a unique opportunity to study matter in a parameter space which cannot be explored in our laboratories on Earth. In the standard LCDM model, where the total density is dominated by the cosmological constant ($Lambda$ and the matter density by cold dark matter (CDM, structure formation is hierarchical, and clusters grow mostly by merging.Mergers of two massive clusters are the most energetic events in the universe after the Big Bang,hence they provide a unique laboratory to study cluster physics.The two main mass components in clusters behave differently during collisions:the dark matter is nearly collisionless, responding only to gravity, while the gas is subject to pressure forces and dissipation, and shocks and turbulenceare developed during collisions. In the present contribution we review the different methods used to derive the physical properties of merging clusters. Different physical processes leave their signatures on different wavelengths, thusour review is based on a multifrequency analysis. In principle, the best way to analyze multifrequency observations of merging clustersis to model them using N-body/HYDRO numerical simulations. We discuss the results of such detailed analyses.New high spatial and spectral resolution ground and space based telescopeswill come online in the near future. Motivated by these new opportunities,we briefly discuss methods which will be feasible in the near future in studying merging clusters.

  4. A hierarchical approach to forest landscape pattern characterization.

    Science.gov (United States)

    Wang, Jialing; Yang, Xiaojun

    2012-01-01

    Landscape spatial patterns have increasingly been considered to be essential for environmental planning and resources management. In this study, we proposed a hierarchical approach for landscape classification and evaluation by characterizing landscape spatial patterns across different hierarchical levels. The case study site is the Red Hills region of northern Florida and southwestern Georgia, well known for its biodiversity, historic resources, and scenic beauty. We used one Landsat Enhanced Thematic Mapper image to extract land-use/-cover information. Then, we employed principal-component analysis to help identify key class-level landscape metrics for forests at different hierarchical levels, namely, open pine, upland pine, and forest as a whole. We found that the key class-level landscape metrics varied across different hierarchical levels. Compared with forest as a whole, open pine forest is much more fragmented. The landscape metric, such as CONTIG_MN, which measures whether pine patches are contiguous or not, is more important to characterize the spatial pattern of pine forest than to forest as a whole. This suggests that different metric sets should be used to characterize landscape patterns at different hierarchical levels. We further used these key metrics, along with the total class area, to classify and evaluate subwatersheds through cluster analysis. This study demonstrates a promising approach that can be used to integrate spatial patterns and processes for hierarchical forest landscape planning and management.

  5. Hierarchical Dragonfly Wing: Microstructure-Biomechanical Behavior Relations

    Institute of Scientific and Technical Information of China (English)

    Yinglong Chen; Xishu Wang; Huaihui Ren; Hang Yin; Su Jia

    2012-01-01

    The dragonfly wing,which consists of veins and membrane,is of biological hierarchical material.We observed the cross-sections of longitudinal veins and membrane using Environmental Scanning Electron Microscopy (ESEM).Based on the experiments and previous studies,we described the longitudinal vein and the membrane in terms of two hierarchical levels of organization of composite materials at the micro- and nano-scales.The longitudinal vein of dragonfly wing has a complex sandwich structure with two chitinous shells and a protein layer,and it is considered as the first hierarchical level of the vein.Moreover,the chitinous shells are concentric multilayered structures.Clusters of nano-fibrils grow along the circumferential orientation embedded into the protein layer.It is considered as the second level of the hierarchy.Similarly,the upper and lower epidermises of membrane constitute the first hierarchical level of organization in micro scale.Similar to the vein shell,the membrane epidermises were found to be a paralleled multilayered structure,defined as the second hierarchical level of the membrane.Combining with the mechanical behavior analysis of the dragonfly wing,we concluded that the growth orientation of the hierarchical structure of the longitudinal vein and membrane is relevant to its biomechanical behavior.

  6. Hierarchical Star Formation in Nearby LEGUS Galaxies

    CERN Document Server

    Elmegreen, Debra Meloy; Adamo, Angela; Aloisi, Alessandra; Andrews, Jennifer; Annibali, Francesca; Bright, Stacey N; Calzetti, Daniela; Cignoni, Michele; Evans, Aaron S; Gallagher, John S; Gouliermis, Dimitrios A; Grebel, Eva K; Hunter, Deidre A; Johnson, Kelsey; Kim, Hwi; Lee, Janice; Sabbi, Elena; Smith, Linda; Thilker, David; Tosi, Monica; Ubeda, Leonardo

    2014-01-01

    Hierarchical structure in ultraviolet images of 12 late-type LEGUS galaxies is studied by determining the numbers and fluxes of nested regions as a function of size from ~1 to ~200 pc, and the number as a function of flux. Two starburst dwarfs, NGC 1705 and NGC 5253, have steeper number-size and flux-size distributions than the others, indicating high fractions of the projected areas filled with star formation. Nine subregions in 7 galaxies have similarly steep number-size slopes, even when the whole galaxies have shallower slopes. The results suggest that hierarchically structured star-forming regions several hundred parsecs or larger represent common unit structures. Small galaxies dominated by only a few of these units tend to be starbursts. The self-similarity of young stellar structures down to parsec scales suggests that star clusters form in the densest parts of a turbulent medium that also forms loose stellar groupings on larger scales. The presence of super star clusters in two of our starburst dwarf...

  7. Detect overlapping and hierarchical community structure in networks

    CERN Document Server

    Shen, Huawei; Cai, Kai; Hu, Mao-Bin

    2008-01-01

    Clustering and community structure is crucial for many network systems and the related dynamic processes. It has been shown that communities are usually overlapping and hierarchical. However, previous methods investigate these two properties of community structure separately. This paper propose an algorithm (EAGLE) to detect both the overlapping and hierarchical properties of complex community structure together. This algorithm deals with the set of maximal cliques and adopts an agglomerative framework. The quality function of modularity is extended to evaluate the goodness of a cover. The examples of application to real world networks give excellent results.

  8. Semi-supervised consensus clustering for gene expression data analysis

    OpenAIRE

    Wang, Yunli; Pan, Youlian

    2014-01-01

    Background Simple clustering methods such as hierarchical clustering and k-means are widely used for gene expression data analysis; but they are unable to deal with noise and high dimensionality associated with the microarray gene expression data. Consensus clustering appears to improve the robustness and quality of clustering results. Incorporating prior knowledge in clustering process (semi-supervised clustering) has been shown to improve the consistency between the data partitioning and do...

  9. [Prognostic differences of phenotypes in pT1-2N0 invasive breast cancer: a large cohort study with cluster analysis].

    Science.gov (United States)

    Wang, Z; Wang, W H; Wang, S L; Jin, J; Song, Y W; Liu, Y P; Ren, H; Fang, H; Tang, Y; Chen, B; Qi, S N; Lu, N N; Li, N; Tang, Y; Liu, X F; Yu, Z H; Li, Y X

    2016-06-23

    To find phenotypic subgroups of patients with pT1-2N0 invasive breast cancer by means of cluster analysis and estimate the prognosis and clinicopathological features of these subgroups. From 1999 to 2013, 4979 patients with pT1-2N0 invasive breast cancer were recruited for hierarchical clustering analysis. Age (≤40, 41-70, 70+ years), size of primary tumor, pathological type, grade of differentiation, microvascular invasion, estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor 2 (HER-2) were chosen as distance metric between patients. Hierarchical cluster analysis was performed using Ward's method. Cophenetic correlation coefficient (CPCC) and Spearman correlation coefficient were used to validate clustering structures. The CPCC was 0.603. The Spearman correlation coefficient was 0.617 (Pcluster model seemed to best illustrate our patient cohort. Patients in cluster 5, 9 and 12 had best prognosis and were characterized by age >40 years, smaller primary tumor, lower histologic grade, positive ER and PR status, and mainly negative HER-2. Patients in the cluster 1 and 11 had the worst prognosis, The cluster 1 was characterized by a larger tumor, higher grade and negative ER and PR status, while the cluster 11 was characterized by positive microvascular invasion. Patients in other 7 clusters had a moderate prognosis, and patients in each cluster had distinctive clinicopathological features and recurrent patterns. This study identified distinctive clinicopathologic phenotypes in a large cohort of patients with pT1-2N0 breast cancer through hierarchical clustering and revealed different prognosis. This integrative model may help physicians to make more personalized decisions regarding adjuvant therapy.

  10. Simultaneous analysis of T helper subsets (Th1, Th2, Th9, Th17, Th22, Tfh, Tr1 and Tregs markers expression in periapical lesions reveals multiple cytokine clusters accountable for lesions activity and inactivity status

    Directory of Open Access Journals (Sweden)

    Ana Claudia ARAUJO-PIRES

    2014-07-01

    lesions, hierarchical cluster analysis demonstrates the association of TNF-α, IL-21, IL-17 and IFN-γ with lesions activity, and the association of FOXP3, IL-10, IL-9, IL-4 and IL-22 with lesions inactivity.

  11. Hierarchical partial order ranking.

    Science.gov (United States)

    Carlsen, Lars

    2008-09-01

    Assessing the potential impact on environmental and human health from the production and use of chemicals or from polluted sites involves a multi-criteria evaluation scheme. A priori several parameters are to address, e.g., production tonnage, specific release scenarios, geographical and site-specific factors in addition to various substance dependent parameters. Further socio-economic factors may be taken into consideration. The number of parameters to be included may well appear to be prohibitive for developing a sensible model. The study introduces hierarchical partial order ranking (HPOR) that remedies this problem. By HPOR the original parameters are initially grouped based on their mutual connection and a set of meta-descriptors is derived representing the ranking corresponding to the single groups of descriptors, respectively. A second partial order ranking is carried out based on the meta-descriptors, the final ranking being disclosed though average ranks. An illustrative example on the prioritization of polluted sites is given.

  12. Trees and Hierarchical Structures

    CERN Document Server

    Haeseler, Arndt

    1990-01-01

    The "raison d'etre" of hierarchical dustering theory stems from one basic phe­ nomenon: This is the notorious non-transitivity of similarity relations. In spite of the fact that very often two objects may be quite similar to a third without being that similar to each other, one still wants to dassify objects according to their similarity. This should be achieved by grouping them into a hierarchy of non-overlapping dusters such that any two objects in ~ne duster appear to be more related to each other than they are to objects outside this duster. In everyday life, as well as in essentially every field of scientific investigation, there is an urge to reduce complexity by recognizing and establishing reasonable das­ sification schemes. Unfortunately, this is counterbalanced by the experience of seemingly unavoidable deadlocks caused by the existence of sequences of objects, each comparatively similar to the next, but the last rather different from the first.

  13. Optimisation by hierarchical search

    Science.gov (United States)

    Zintchenko, Ilia; Hastings, Matthew; Troyer, Matthias

    2015-03-01

    Finding optimal values for a set of variables relative to a cost function gives rise to some of the hardest problems in physics, computer science and applied mathematics. Although often very simple in their formulation, these problems have a complex cost function landscape which prevents currently known algorithms from efficiently finding the global optimum. Countless techniques have been proposed to partially circumvent this problem, but an efficient method is yet to be found. We present a heuristic, general purpose approach to potentially improve the performance of conventional algorithms or special purpose hardware devices by optimising groups of variables in a hierarchical way. We apply this approach to problems in combinatorial optimisation, machine learning and other fields.

  14. How hierarchical is language use?

    Science.gov (United States)

    Frank, Stefan L.; Bod, Rens; Christiansen, Morten H.

    2012-01-01

    It is generally assumed that hierarchical phrase structure plays a central role in human language. However, considerations of simplicity and evolutionary continuity suggest that hierarchical structure should not be invoked too hastily. Indeed, recent neurophysiological, behavioural and computational studies show that sequential sentence structure has considerable explanatory power and that hierarchical processing is often not involved. In this paper, we review evidence from the recent literature supporting the hypothesis that sequential structure may be fundamental to the comprehension, production and acquisition of human language. Moreover, we provide a preliminary sketch outlining a non-hierarchical model of language use and discuss its implications and testable predictions. If linguistic phenomena can be explained by sequential rather than hierarchical structure, this will have considerable impact in a wide range of fields, such as linguistics, ethology, cognitive neuroscience, psychology and computer science. PMID:22977157

  15. How hierarchical is language use?

    Science.gov (United States)

    Frank, Stefan L; Bod, Rens; Christiansen, Morten H

    2012-11-22

    It is generally assumed that hierarchical phrase structure plays a central role in human language. However, considerations of simplicity and evolutionary continuity suggest that hierarchical structure should not be invoked too hastily. Indeed, recent neurophysiological, behavioural and computational studies show that sequential sentence structure has considerable explanatory power and that hierarchical processing is often not involved. In this paper, we review evidence from the recent literature supporting the hypothesis that sequential structure may be fundamental to the comprehension, production and acquisition of human language. Moreover, we provide a preliminary sketch outlining a non-hierarchical model of language use and discuss its implications and testable predictions. If linguistic phenomena can be explained by sequential rather than hierarchical structure, this will have considerable impact in a wide range of fields, such as linguistics, ethology, cognitive neuroscience, psychology and computer science.

  16. Associative Hierarchical Random Fields.

    Science.gov (United States)

    Ladický, L'ubor; Russell, Chris; Kohli, Pushmeet; Torr, Philip H S

    2014-06-01

    This paper makes two contributions: the first is the proposal of a new model-The associative hierarchical random field (AHRF), and a novel algorithm for its optimization; the second is the application of this model to the problem of semantic segmentation. Most methods for semantic segmentation are formulated as a labeling problem for variables that might correspond to either pixels or segments such as super-pixels. It is well known that the generation of super pixel segmentations is not unique. This has motivated many researchers to use multiple super pixel segmentations for problems such as semantic segmentation or single view reconstruction. These super-pixels have not yet been combined in a principled manner, this is a difficult problem, as they may overlap, or be nested in such a way that the segmentations form a segmentation tree. Our new hierarchical random field model allows information from all of the multiple segmentations to contribute to a global energy. MAP inference in this model can be performed efficiently using powerful graph cut based move making algorithms. Our framework generalizes much of the previous work based on pixels or segments, and the resulting labelings can be viewed both as a detailed segmentation at the pixel level, or at the other extreme, as a segment selector that pieces together a solution like a jigsaw, selecting the best segments from different segmentations as pieces. We evaluate its performance on some of the most challenging data sets for object class segmentation, and show that this ability to perform inference using multiple overlapping segmentations leads to state-of-the-art results.

  17. What Makes Clusters Decline?

    DEFF Research Database (Denmark)

    Østergaard, Christian Richter; Park, Eun Kyung

    2015-01-01

    Most studies on regional clusters focus on identifying factors and processes that make clusters grow. However, sometimes technologies and market conditions suddenly shift, and clusters decline. This paper analyses the process of decline of the wireless communication cluster in Denmark....... The longitudinal study on the high-tech cluster reveals that technological lock-in and exit of key firms have contributed to decline. Entrepreneurship has a positive effect on the cluster’s adaptive capabilities, while multinational companies have contradicting effects by bringing in new resources to the cluster...

  18. Adaptive color visualization for dichromats using a customized hierarchical palette

    Science.gov (United States)

    Rodríguez-Pardo, Carlos E.; Sharma, Gaurav

    2011-01-01

    We propose a user-centric methodology for displaying digital color documents, that optimizes color representations in an observer specific and adaptive fashion. We apply our framework to situations involving viewers with common dichromatic color vision deficiencies, who face challenges in perceiving information presented in color images and graphics designed for color normal individuals. For situations involving qualitative data visualization, we present a computationally efficient solution that combines a customized observer-specific hierarchical palette with "display time" selection of the number of colors to generate renderings with colors that are easily discriminated by the intended viewer. The palette design is accomplished via a clustering algorithm, that arranges colors in a hierarchical tree based on their perceived differences for the intended viewer. A desired number of highly discriminable colors are readily obtained from the hierarchical palette via a simple truncation. As an illustration, we demonstrate the application of the methodology to Ishihara style images.

  19. Modelling hierarchical and modular complex networks: division and independence

    Science.gov (United States)

    Kim, D.-H.; Rodgers, G. J.; Kahng, B.; Kim, D.

    2005-06-01

    We introduce a growing network model which generates both modular and hierarchical structure in a self-organized way. To this end, we modify the Barabási-Albert model into the one evolving under the principles of division and independence as well as growth and preferential attachment (PA). A newly added vertex chooses one of the modules composed of existing vertices, and attaches edges to vertices belonging to that module following the PA rule. When the module size reaches a proper size, the module is divided into two, and a new module is created. The karate club network studied by Zachary is a simple version of the current model. We find that the model can reproduce both modular and hierarchical properties, characterized by the hierarchical clustering function of a vertex with degree k, C(k), being in good agreement with empirical measurements for real-world networks.

  20. Modeling hierarchical structures - Hierarchical Linear Modeling using MPlus

    CERN Document Server

    Jelonek, M

    2006-01-01

    The aim of this paper is to present the technique (and its linkage with physics) of overcoming problems connected to modeling social structures, which are typically hierarchical. Hierarchical Linear Models provide a conceptual and statistical mechanism for drawing conclusions regarding the influence of phenomena at different levels of analysis. In the social sciences it is used to analyze many problems such as educational, organizational or market dilemma. This paper introduces the logic of modeling hierarchical linear equations and estimation based on MPlus software. I present my own model to illustrate the impact of different factors on school acceptation level.

  1. The Large-Scale Environment of Dynamical Young Clusters of Galaxies

    OpenAIRE

    Plionis, M.; Basilakos, S.

    2001-01-01

    We investigate whether the dynamical status of clusters is related to the large-scale structure of the Universe. We find that cluster substructure is strongly correlated with the tendency of clusters to be aligned with their nearest neighbour and in general with the nearby clusters that belong to the same supercluster. Furthermore, dynamically young clusters are more clustered than the overall cluster population. These are strong indications that clusters develop in a hierarchical fashion by ...

  2. Two genetic clusters in swine hemoplasmas revealed by analyses of the 16S rRNA and RNase P RNA genes.

    Science.gov (United States)

    Watanabe, Yusaku; Fujihara, Masatoshi; Obara, Hisato; Nagai, Kazuya; Harasawa, Ryô

    2011-12-01

    Only two hemoplasma species, Eperythrozoon parvum and Mycoplasma suis, have been recognized in pigs. Here we demonstrate the genetic variations among six hemoplasma strains detected from pigs, by analyzing the 16S rRNA and RNase P RNA (rnpB) genes, and propose a novel hemoplasma taxon that has not been described previously. Phylogenetic trees based on the nucleotide sequence of the 16S rRNA gene indicated that these six hemoplasmas were divided into two clusters representing M. suis and a novel taxon. We further examined the primary and secondary structures of the nucleotide sequences of the rnpB gene of the novel taxon, and found it distinct from that of M. suis. In conclusion, we unveiled a genetic cluster distinct from M. suis, suggesting a new swine hemoplasma species or E. parvum. Our findings also suggest that this novel cluster should be included in the genus Mycoplasma.

  3. Revealing isomerism in sodium-water clusters: Photoionization spectra of Na(H2O)n (n = 2-90)

    Science.gov (United States)

    Dierking, Christoph W.; Zurheide, Florian; Zeuch, Thomas; Med, Jakub; Parez, Stanislav; Slavíček, Petr

    2017-06-01

    Soft ionization of sodium tagged polar clusters is increasingly used as a powerful technique for sizing and characterization of small aerosols with possible application, e.g., in atmospheric chemistry or combustion science. Understanding the structure and photoionization of the sodium doped clusters is critical for such applications. In this work, we report on measurements of photoionization spectra for sodium doped water clusters containing 2-90 water molecules. While most of the previous studies focused on the ionization threshold of the Na(H2O)n clusters, we provide for the first time full photoionization spectra, including the high-energy region, which are used as reference for a comparison with theory. As reported in previous work, we have seen an initial drop of the appearance ionization energy with cluster size to values of about 3.2 eV for n electron cloud. The appearance ionization energy is determined by isomers with fully solvated sodium and a highly delocalized electron cloud, while both fully and incompletely solvated isomers with localized electron clouds can contribute to the high energy part of the photoionization spectrum. Simulations at elevated temperatures show an increased abundance of isomers with low ionization energies, an entropic effect enabling size selective infrared action spectroscopy, based on near threshold photoionization of Na(H2O)n clusters. In addition, simulations of the sodium pick-up process were carried out to study the gradual formation of the hydrated electron which is the basis of the sodium-tagging sizing.

  4. In silico sampling reveals the effect of clustering and shows that the log-normal rank abundance curve is an artefact

    NARCIS (Netherlands)

    Neuteboom, J.H.; Struik, P.C.

    2005-01-01

    The impact of clustering on rank abundance, species-individual (S-N)and species-area curves was investigated using a computer programme for in silico sampling. In a rank abundance curve the abundances of species are plotted on log-scale against species sequence. In an S-N curve the number of species

  5. Resilient 3D hierarchical architected metamaterials.

    Science.gov (United States)

    Meza, Lucas R; Zelhofer, Alex J; Clarke, Nigel; Mateos, Arturo J; Kochmann, Dennis M; Greer, Julia R

    2015-09-15

    Hierarchically designed structures with architectural features that span across multiple length scales are found in numerous hard biomaterials, like bone, wood, and glass sponge skeletons, as well as manmade structures, like the Eiffel Tower. It has been hypothesized that their mechanical robustness and damage tolerance stem from sophisticated ordering within the constituents, but the specific role of hierarchy remains to be fully described and understood. We apply the principles of hierarchical design to create structural metamaterials from three material systems: (i) polymer, (ii) hollow ceramic, and (iii) ceramic-polymer composites that are patterned into self-similar unit cells in a fractal-like geometry. In situ nanomechanical experiments revealed (i) a nearly theoretical scaling of structural strength and stiffness with relative density, which outperforms existing nonhierarchical nanolattices; (ii) recoverability, with hollow alumina samples recovering up to 98% of their original height after compression to ≥ 50% strain; (iii) suppression of brittle failure and structural instabilities in hollow ceramic hierarchical nanolattices; and (iv) a range of deformation mechanisms that can be tuned by changing the slenderness ratios of the beams. Additional levels of hierarchy beyond a second order did not increase the strength or stiffness, which suggests the existence of an optimal degree of hierarchy to amplify resilience. We developed a computational model that captures local stress distributions within the nanolattices under compression and explains some of the underlying deformation mechanisms as well as validates the measured effective stiffness to be interpreted as a metamaterial property.

  6. Modeling hierarchical structures - Hierarchical Linear Modeling using MPlus

    OpenAIRE

    Jelonek, Magdalena

    2006-01-01

    The aim of this paper is to present the technique (and its linkage with physics) of overcoming problems connected to modeling social structures, which are typically hierarchical. Hierarchical Linear Models provide a conceptual and statistical mechanism for drawing conclusions regarding the influence of phenomena at different levels of analysis. In the social sciences it is used to analyze many problems such as educational, organizational or market dilemma. This paper introduces the logic of m...

  7. Genome-scale cluster analysis of replicated microarrays using shrinkage correlation coefficient.

    Science.gov (United States)

    Yao, Jianchao; Chang, Chunqi; Salmi, Mari L; Hung, Yeung Sam; Loraine, Ann; Roux, Stanley J

    2008-06-18

    Currently, clustering with some form of correlation coefficient as the gene similarity metric has become a popular method for profiling genomic data. The Pearson correlation coefficient and the standard deviation (SD)-weighted correlation coefficient are the two most widely-used correlations as the similarity metrics in clustering microarray data. However, these two correlations are not optimal for analyzing replicated microarray data generated by most laboratories. An effective correlation coefficient is needed to provide statistically sufficient analysis of replicated microarray data. In this study, we describe a novel correlation coefficient, shrinkage correlation coefficient (SCC), that fully exploits the similarity between the replicated microarray experimental samples. The methodology considers both the number of replicates and the variance within each experimental group in clustering expression data, and provides a robust statistical estimation of the error of replicated microarray data. The value of SCC is revealed by its comparison with two other correlation coefficients that are currently the most widely-used (Pearson correlation coefficient and SD-weighted correlation coefficient) using statistical measures on both synthetic expression data as well as real gene expression data from Saccharomyces cerevisiae. Two leading clustering methods, hierarchical and k-means clustering were applied for the comparison. The comparison indicated that using SCC achieves better clustering performance. Applying SCC-based hierarchical clustering to the replicated microarray data obtained from germinating spores of the fern Ceratopteris richardii, we discovered two clusters of genes with shared expression patterns during spore germination. Functional analysis suggested that some of the genetic mechanisms that control germination in such diverse plant lineages as mosses and angiosperms are also conserved among ferns. This study shows that SCC is an alternative to the Pearson

  8. Genome-scale cluster analysis of replicated microarrays using shrinkage correlation coefficient

    Directory of Open Access Journals (Sweden)

    Loraine Ann

    2008-06-01

    Full Text Available Abstract Background Currently, clustering with some form of correlation coefficient as the gene similarity metric has become a popular method for profiling genomic data. The Pearson correlation coefficient and the standard deviation (SD-weighted correlation coefficient are the two most widely-used correlations as the similarity metrics in clustering microarray data. However, these two correlations are not optimal for analyzing replicated microarray data generated by most laboratories. An effective correlation coefficient is needed to provide statistically sufficient analysis of replicated microarray data. Results In this study, we describe a novel correlation coefficient, shrinkage correlation coefficient (SCC, that fully exploits the similarity between the replicated microarray experimental samples. The methodology considers both the number of replicates and the variance within each experimental group in clustering expression data, and provides a robust statistical estimation of the error of replicated microarray data. The value of SCC is revealed by its comparison with two other correlation coefficients that are currently the most widely-used (Pearson correlation coefficient and SD-weighted correlation coefficient using statistical measures on both synthetic expression data as well as real gene expression data from Saccharomyces cerevisiae. Two leading clustering methods, hierarchical and k-means clustering were applied for the comparison. The comparison indicated that using SCC achieves better clustering performance. Applying SCC-based hierarchical clustering to the replicated microarray data obtained from germinating spores of the fern Ceratopteris richardii, we discovered two clusters of genes with shared expression patterns during spore germination. Functional analysis suggested that some of the genetic mechanisms that control germination in such diverse plant lineages as mosses and angiosperms are also conserved among ferns. Conclusion

  9. The K2 M67 Study: Revisiting Old Friends with K2 Reveals Oscillating Red Giants in the Open Cluster M67

    Science.gov (United States)

    Stello, Dennis; Vanderburg, Andrew; Casagrande, Luca; Gilliland, Ron; Silva Aguirre, Victor; Sandquist, Eric; Leiner, Emily; Mathieu, Robert; Soderblom, David R.

    2016-12-01

    Observations of stellar clusters have had a tremendous impact in forming our understanding of stellar evolution. The open cluster M67 has a particularly important role as a calibration benchmark for stellar evolution theory due to its near-solar composition and age. As a result, it has been observed extensively, including attempts to detect solar-like oscillations in its main sequence and red giant stars. However, any asteroseismic inference has so far remained elusive due to the difficulty in measuring these extremely low-amplitude oscillations. Here we report the first unambiguous detection of solar-like oscillations in the red giants of M67. We use data from the Kepler ecliptic mission, K2, to measure the global asteroseismic properties. We find a model-independent seismic-informed distance of 816 ± 11 pc, or {(m-M)}0 = 9.57+/- 0.03 mag, an average red giant mass of 1.36+/- 0.01 {M}⊙ , in agreement with the dynamical mass from an eclipsing binary near the cluster turn-off, and ages of individual stars compatible with isochrone fitting. We see no evidence of strong mass loss on the red giant branch. We also determine seismic {log}g of all the cluster giants with a typical precision of ˜ 0.01 dex. Our results generally show good agreement with independent methods and support the use of seismic scaling relations to determine global properties of red giant stars with near-solar metallicity. We further illustrate that the data are of such high quality that future work on individual mode frequencies should be possible, which would extend the scope of seismic analysis of this cluster.

  10. Hierarchical fringe tracking

    CERN Document Server

    Petrov, Romain G; Boskri, Abdelkarim; Folcher, Jean-Pierre; Lagarde, Stephane; Bresson, Yves; Benkhaldoum, Zouhair; Lazrek, Mohamed; Rakshit, Suvendu

    2014-01-01

    The limiting magnitude is a key issue for optical interferometry. Pairwise fringe trackers based on the integrated optics concepts used for example in GRAVITY seem limited to about K=10.5 with the 8m Unit Telescopes of the VLTI, and there is a general "common sense" statement that the efficiency of fringe tracking, and hence the sensitivity of optical interferometry, must decrease as the number of apertures increases, at least in the near infrared where we are still limited by detector readout noise. Here we present a Hierarchical Fringe Tracking (HFT) concept with sensitivity at least equal to this of a two apertures fringe trackers. HFT is based of the combination of the apertures in pairs, then in pairs of pairs then in pairs of groups. The key HFT module is a device that behaves like a spatial filter for two telescopes (2TSF) and transmits all or most of the flux of a cophased pair in a single mode beam. We give an example of such an achromatic 2TSF, based on very broadband dispersed fringes analyzed by g...

  11. Onboard hierarchical network

    Science.gov (United States)

    Tunesi, Luca; Armbruster, Philippe

    2004-02-01

    The objective of this paper is to demonstrate a suitable hierarchical networking solution to improve capabilities and performances of space systems, with significant recurrent costs saving and more efficient design & manufacturing flows. Classically, a satellite can be split in two functional sub-systems: the platform and the payload complement. The platform is in charge of providing power, attitude & orbit control and up/down-link services, whereas the payload represents the scientific and/or operational instruments/transponders and embodies the objectives of the mission. One major possibility to improve the performance of payloads, by limiting the data return to pertinent information, is to process data on board thanks to a proper implementation of the payload data system. In this way, it is possible to share non-recurring development costs by exploiting a system that can be adopted by the majority of space missions. It is believed that the Modular and Scalable Payload Data System, under development by ESA, provides a suitable solution to fulfil a large range of future mission requirements. The backbone of the system is the standardised high data rate SpaceWire network http://www.ecss.nl/. As complement, a lower speed command and control bus connecting peripherals is required. For instance, at instrument level, there is a need for a "local" low complexity bus, which gives the possibility to command and control sensors and actuators. Moreover, most of the connections at sub-system level are related to discrete signals management or simple telemetry acquisitions, which can easily and efficiently be handled by a local bus. An on-board hierarchical network can therefore be defined by interconnecting high-speed links and local buses. Additionally, it is worth stressing another important aspect of the design process: Agencies and ESA in particular are frequently confronted with a big consortium of geographically spread companies located in different countries, each one

  12. Hierarchical Reverberation Mapping

    CERN Document Server

    Brewer, Brendon J

    2013-01-01

    Reverberation mapping (RM) is an important technique in studies of active galactic nuclei (AGN). The key idea of RM is to measure the time lag $\\tau$ between variations in the continuum emission from the accretion disc and subsequent response of the broad line region (BLR). The measurement of $\\tau$ is typically used to estimate the physical size of the BLR and is combined with other measurements to estimate the black hole mass $M_{\\rm BH}$. A major difficulty with RM campaigns is the large amount of data needed to measure $\\tau$. Recently, Fine et al (2012) introduced a new approach to RM where the BLR light curve is sparsely sampled, but this is counteracted by observing a large sample of AGN, rather than a single system. The results are combined to infer properties of the sample of AGN. In this letter we implement this method using a hierarchical Bayesian model and contrast this with the results from the previous stacked cross-correlation technique. We find that our inferences are more precise and allow fo...

  13. Hierarchical Parallelization of Gene Differential Association Analysis

    Directory of Open Access Journals (Sweden)

    Dwarkadas Sandhya

    2011-09-01

    Full Text Available Abstract Background Microarray gene differential expression analysis is a widely used technique that deals with high dimensional data and is computationally intensive for permutation-based procedures. Microarray gene differential association analysis is even more computationally demanding and must take advantage of multicore computing technology, which is the driving force behind increasing compute power in recent years. In this paper, we present a two-layer hierarchical parallel implementation of gene differential association analysis. It takes advantage of both fine- and coarse-grain (with granularity defined by the frequency of communication parallelism in order to effectively leverage the non-uniform nature of parallel processing available in the cutting-edge systems of today. Results Our results show that this hierarchical strategy matches data sharing behavior to the properties of the underlying hardware, thereby reducing the memory and bandwidth needs of the application. The resulting improved efficiency reduces computation time and allows the gene differential association analysis code to scale its execution with the number of processors. The code and biological data used in this study are downloadable from http://www.urmc.rochester.edu/biostat/people/faculty/hu.cfm. Conclusions The performance sweet spot occurs when using a number of threads per MPI process that allows the working sets of the corresponding MPI processes running on the multicore to fit within the machine cache. Hence, we suggest that practitioners follow this principle in selecting the appropriate number of MPI processes and threads within each MPI process for their cluster configurations. We believe that the principles of this hierarchical approach to parallelization can be utilized in the parallelization of other computationally demanding kernels.

  14. Topology of the correlation networks among major currencies using hierarchical structure methods

    Science.gov (United States)

    Keskin, Mustafa; Deviren, Bayram; Kocakaplan, Yusuf

    2011-02-01

    We studied the topology of correlation networks among 34 major currencies using the concept of a minimal spanning tree and hierarchical tree for the full years of 2007-2008 when major economic turbulence occurred. We used the USD (US Dollar) and the TL (Turkish Lira) as numeraires in which the USD was the major currency and the TL was the minor currency. We derived a hierarchical organization and constructed minimal spanning trees (MSTs) and hierarchical trees (HTs) for the full years of 2007, 2008 and for the 2007-2008 period. We performed a technique to associate a value of reliability to the links of MSTs and HTs by using bootstrap replicas of data. We also used the average linkage cluster analysis for obtaining the hierarchical trees in the case of the TL as the numeraire. These trees are useful tools for understanding and detecting the global structure, taxonomy and hierarchy in financial data. We illustrated how the minimal spanning trees and their related hierarchical trees developed over a period of time. From these trees we identified different clusters of currencies according to their proximity and economic ties. The clustered structure of the currencies and the key currency in each cluster were obtained and we found that the clusters matched nicely with the geographical regions of corresponding countries in the world such as Asia or Europe. As expected the key currencies were generally those showing major economic activity.

  15. Self-organized Criticality in Hierarchical Brain Network

    Institute of Scientific and Technical Information of China (English)

    YANG Qiu-Ying; ZHANG Ying-Yue; CHEN Tian-Lun

    2008-01-01

    It is shown that the cortical brain network of the macaque displays a hierarchically clustered organization and the neuron network shows small-world properties. Now the two factors will be considered in our model and the dynamical behavior of the model will be studied. We study the characters of the model and find that the distribution of avalanche size of the model follows power-law behavior.

  16. Hierarchical materials: Background and perspectives

    DEFF Research Database (Denmark)

    2016-01-01

    Hierarchical design draws inspiration from analysis of biological materials and has opened new possibilities for enhancing performance and enabling new functionalities and extraordinary properties. With the development of nanotechnology, the necessary technological requirements for the manufactur...

  17. Direct hierarchical assembly of nanoparticles

    Science.gov (United States)

    Xu, Ting; Zhao, Yue; Thorkelsson, Kari

    2014-07-22

    The present invention provides hierarchical assemblies of a block copolymer, a bifunctional linking compound and a nanoparticle. The block copolymers form one micro-domain and the nanoparticles another micro-domain.

  18. Bimodal Color Distribution in Hierarchical Galaxy Formation

    CERN Document Server

    Menci, N; Giallongo, E; Salimbeni, S

    2005-01-01

    We show how the observed bimodality in the color distribution of galaxies can be explained in the framework of the hierarchical clustering picture in terms of the interplay between the properties of the merging histories and the feedback/star-formation processes in the progenitors of local galaxies. Using a semi-analytic model of hierarchical galaxy formation, we compute the color distributions of galaxies with different luminosities and compare them with the observations. Our fiducial model matches the fundamental properties of the observed distributions, namely: 1) the distribution of objects brighter than M_r = -18 is clearly bimodal, with a fraction of red objects increasing with luminosity; 2) for objects brighter than M_r = -21 the color distribution is dominated by red objects with color u-r = 2.2-2.4; 3) the spread on the distribution of the red population is smaller than that of the blue population; 4) the fraction of red galaxies is larger in denser environments, even for low-luminosity objects; 5) ...

  19. Hierarchical video summarization for medical data

    Science.gov (United States)

    Zhu, Xingquan; Fan, Jianping; Elmagarmid, Ahmed K.; Aref, Walid G.

    2001-12-01

    To provide users with an overview of medical video content at various levels of abstraction which can be used for more efficient database browsing and access, a hierarchical video summarization strategy has been developed and is presented in this paper. To generate an overview, the key frames of a video are preprocessed to extract special frames (black frames, slides, clip art, sketch drawings) and special regions (faces, skin or blood-red areas). A shot grouping method is then applied to merge the spatially or temporally related shots into groups. The visual features and knowledge from the video shots are integrated to assign the groups into predefined semantic categories. Based on the video groups and their semantic categories, video summaries for different levels are constructed by group merging, hierarchical group clustering and semantic category selection. Based on this strategy, a user can select the layer of the summary to access. The higher the layer, the more concise the video summary; the lower the layer, the greater the detail contained in the summary.

  20. Multiple dynamical time-scales in networks with hierarchically nested modular organization

    Indian Academy of Sciences (India)

    Sitabhra Sinha; Swarup Poria

    2011-11-01

    Many natural and engineered complex networks have intricate mesoscopic organization, e.g., the clustering of the constituent nodes into several communities or modules. Often, such modularity is manifested at several different hierarchical levels, where the clusters defined at one level appear as elementary entities at the next higher level. Using a simple model of a hierarchical modular network, we show that such a topological structure gives rise to characteristic time-scale separation between dynamics occurring at different levels of the hierarchy. This generalizes our earlier result for simple modular networks, where fast intramodular and slow intermodular processes were clearly distinguished. Investigating the process of synchronization of oscillators in a hierarchical modular network, we show the existence of as many distinct time-scales as there are hierarchical levels in the system. This suggests a possible functional role of such mesoscopic organization principle in natural systems, viz., in the dynamical separation of events occurring at different spatial scales.

  1. PROPOSED A HETEROGENEOUS CLUSTERING ALGORITHM TO IMPROVE QOS IN WSN

    Directory of Open Access Journals (Sweden)

    Mehran Mokhtari

    2016-07-01

    Full Text Available In this article it has presented leach extended hierarchical 3-level clustered heterogeneous and dynamics algorithm. On suggested protocol (LEH3LA with planning of selected auction cluster head, and alternative cluster head node, problem of delay on processing, processing of selecting members, decrease of expenses, and energy consumption, decrease of sending message, and receiving messages inside the clusters, selecting of cluster heads in large sensor networks were solved. This algorithm uses hierarchical heterogeneous network (3-levels, collective intelligence, and intra-cluster interaction for communications. Also it will solve the problems of sending data in Multi-BS mobile networks, expanding inter-cluster networks, overlap cluster, genesis orphan nodes, boundary change dynamically clusters, using backbone networks, cloud sensor. Using sleep/wake scheduling algorithm or TDMA-schedule alternative cluster head node provides redundancy, and fault tolerance. Local processing in cluster head nodes, and alternative cluster head, intra-cluster and inter-cluster communications such as Multi-HOP cause increase on processing speed, and sending data intra-cluster and inter-cluster. Decrease of overhead network, and increase the load balancing among cluster heads. Using encapsulation of data method, by cluster head nodes, energy consumption decrease during sending data. Also by improving quality of service (QoS in CBRP, LEACH, 802.15.4, decrease of energy consumption in sensors, cluster heads and alternative cluster head nodes, cause increase on lift time of sensor networks

  2. Hierarchical method of task assignment for multiple cooperating UAV teams

    Institute of Scientific and Technical Information of China (English)

    Xiaoxuan Hu; Huawei Ma; Qingsong Ye; He Luo

    2015-01-01

    The problem of task assignment for multiple cooperat-ing unmanned aerial vehicle (UAV) teams is considered. Multiple UAVs forming several smal teams are needed to perform attack tasks on a set of predetermined ground targets. A hierarchical task assignment method is presented to address the problem. It breaks the original problem down to three levels of sub-problems: tar-get clustering, cluster al ocation and target assignment. The first two sub-problems are central y solved by using clustering algo-rithms and integer linear programming, respectively, and the third sub-problem is solved in a distributed and paral el manner, using a mixed integer linear programming model and an improved ant colony algorithm. The proposed hierarchical method can reduce the computational complexity of the task assignment problem con-siderably, especial y when the number of tasks or the number of UAVs is large. Experimental results show that this method is feasi-ble and more efficient than non-hierarchical methods.

  3. A Novel Cluster Head Selection Algorithm Based on Fuzzy Clustering and Particle Swarm Optimization.

    Science.gov (United States)

    Ni, Qingjian; Pan, Qianqi