WorldWideScience

Sample records for profile similarity search

  1. Protein structural similarity search by Ramachandran codes

    Directory of Open Access Journals (Sweden)

    Chang Chih-Hung

    2007-08-01

    Full Text Available Abstract Background Protein structural data has increased exponentially, such that fast and accurate tools are necessary to access structure similarity search. To improve the search speed, several methods have been designed to reduce three-dimensional protein structures to one-dimensional text strings that are then analyzed by traditional sequence alignment methods; however, the accuracy is usually sacrificed and the speed is still unable to match sequence similarity search tools. Here, we aimed to improve the linear encoding methodology and develop efficient search tools that can rapidly retrieve structural homologs from large protein databases. Results We propose a new linear encoding method, SARST (Structural similarity search Aided by Ramachandran Sequential Transformation. SARST transforms protein structures into text strings through a Ramachandran map organized by nearest-neighbor clustering and uses a regenerative approach to produce substitution matrices. Then, classical sequence similarity search methods can be applied to the structural similarity search. Its accuracy is similar to Combinatorial Extension (CE and works over 243,000 times faster, searching 34,000 proteins in 0.34 sec with a 3.2-GHz CPU. SARST provides statistically meaningful expectation values to assess the retrieved information. It has been implemented into a web service and a stand-alone Java program that is able to run on many different platforms. Conclusion As a database search method, SARST can rapidly distinguish high from low similarities and efficiently retrieve homologous structures. It demonstrates that the easily accessible linear encoding methodology has the potential to serve as a foundation for efficient protein structural similarity search tools. These search tools are supposed applicable to automated and high-throughput functional annotations or predictions for the ever increasing number of published protein structures in this post-genomic era.

  2. Hashing for Similarity Search: A Survey

    OpenAIRE

    Wang, Jingdong; Shen, Heng Tao; Song, Jingkuan; Ji, Jianqiu

    2014-01-01

    Similarity search (nearest neighbor search) is a problem of pursuing the data items whose distances to a query item are the smallest from a large database. Various methods have been developed to address this problem, and recently a lot of efforts have been devoted to approximate search. In this paper, we present a survey on one of the main solutions, hashing, which has been widely studied since the pioneering work locality sensitive hashing. We divide the hashing algorithms two main categorie...

  3. Similarity search processing. Paralelization and indexing technologies.

    Directory of Open Access Journals (Sweden)

    Eder Dos Santos

    2015-08-01

    The next Scientific-Technical Report addresses the similarity search and the implementation of metric structures on parallel environments. It also presents the state of the art related to similarity search on metric structures and parallelism technologies. Comparative analysis are also proposed, seeking to identify the behavior of a set of metric spaces and metric structures over processing platforms multicore-based and GPU-based.

  4. Biosequence Similarity Search on the Mercury System

    Science.gov (United States)

    Krishnamurthy, Praveen; Buhler, Jeremy; Chamberlain, Roger; Franklin, Mark; Gyang, Kwame; Jacob, Arpith; Lancaster, Joseph

    2007-01-01

    Biosequence similarity search is an important application in modern molecular biology. Search algorithms aim to identify sets of sequences whose extensional similarity suggests a common evolutionary origin or function. The most widely used similarity search tool for biosequences is BLAST, a program designed to compare query sequences to a database. Here, we present the design of BLASTN, the version of BLAST that searches DNA sequences, on the Mercury system, an architecture that supports high-volume, high-throughput data movement off a data store and into reconfigurable hardware. An important component of application deployment on the Mercury system is the functional decomposition of the application onto both the reconfigurable hardware and the traditional processor. Both the Mercury BLASTN application design and its performance analysis are described. PMID:18846267

  5. Fast similarity search for learned metrics.

    Science.gov (United States)

    Kulis, Brian; Jain, Prateek; Grauman, Kristen

    2009-12-01

    We introduce a method that enables scalable similarity search for learned metrics. Given pairwise similarity and dissimilarity constraints between some examples, we learn a Mahalanobis distance function that captures the examples' underlying relationships well. To allow sublinear time similarity search under the learned metric, we show how to encode the learned metric parameterization into randomized locality-sensitive hash functions. We further formulate an indirect solution that enables metric learning and hashing for vector spaces whose high dimensionality makes it infeasible to learn an explicit transformation over the feature dimensions. We demonstrate the approach applied to a variety of image data sets, as well as a systems data set. The learned metrics improve accuracy relative to commonly used metric baselines, while our hashing construction enables efficient indexing with learned distances and very large databases.

  6. New similarity search based glioma grading

    Energy Technology Data Exchange (ETDEWEB)

    Haegler, Katrin; Brueckmann, Hartmut; Linn, Jennifer [Ludwig-Maximilians-University of Munich, Department of Neuroradiology, Munich (Germany); Wiesmann, Martin; Freiherr, Jessica [RWTH Aachen University, Department of Neuroradiology, Aachen (Germany); Boehm, Christian [Ludwig-Maximilians-University of Munich, Department of Computer Science, Munich (Germany); Schnell, Oliver; Tonn, Joerg-Christian [Ludwig-Maximilians-University of Munich, Department of Neurosurgery, Munich (Germany)

    2012-08-15

    MR-based differentiation between low- and high-grade gliomas is predominately based on contrast-enhanced T1-weighted images (CE-T1w). However, functional MR sequences as perfusion- and diffusion-weighted sequences can provide additional information on tumor grade. Here, we tested the potential of a recently developed similarity search based method that integrates information of CE-T1w and perfusion maps for non-invasive MR-based glioma grading. We prospectively included 37 untreated glioma patients (23 grade I/II, 14 grade III gliomas), in whom 3T MRI with FLAIR, pre- and post-contrast T1-weighted, and perfusion sequences was performed. Cerebral blood volume, cerebral blood flow, and mean transit time maps as well as CE-T1w images were used as input for the similarity search. Data sets were preprocessed and converted to four-dimensional Gaussian Mixture Models that considered correlations between the different MR sequences. For each patient, a so-called tumor feature vector (= probability-based classifier) was defined and used for grading. Biopsy was used as gold standard, and similarity based grading was compared to grading solely based on CE-T1w. Accuracy, sensitivity, and specificity of pure CE-T1w based glioma grading were 64.9%, 78.6%, and 56.5%, respectively. Similarity search based tumor grading allowed differentiation between low-grade (I or II) and high-grade (III) gliomas with an accuracy, sensitivity, and specificity of 83.8%, 78.6%, and 87.0%. Our findings indicate that integration of perfusion parameters and CE-T1w information in a semi-automatic similarity search based analysis improves the potential of MR-based glioma grading compared to CE-T1w data alone. (orig.)

  7. Similarity Search for Continuous Seismic Data

    Science.gov (United States)

    OReilly, O. J.; Yoon, C. E.; Beroza, G. C.

    2013-12-01

    Cross-correlation of seismic data streams with a priori known waveform templates has emerged as an effective tool to identify the occurrence of similar seismic signals; yet, this approach is difficult if the form of the templates is unknown. This challenge has been partially met by constructing waveform templates using reoccurring seismic signals sharing similar waveforms. This waveform similarity arises because the Earth's structure is essentially time invariant at the temporal scales considered in seismology. The problem of finding similar waveforms without known templates has been approached previously by segmenting incoming seismic data streams into multiple overlapping windows, each with a fixed length and lag, followed by matched filtering using each window as a template. An immediate shortcoming of this strategy is that it is computationally expensive; it scales quadratically with the number of lags needed, which limits the analysis to time series of short duration. The principal concept behind our approach, which enables scalable similarity search is to use a hierarchical approach to investigate only a small, near-constant sized subset of all possible waveform pairs for each query. In computer science and related fields, there are efficient techniques to solve this problem, which appears in numerous applications. Here, we bring these techniques into detection seismology. As a first step, we present a prototype database application that relies on a fingerprinting scheme that produces numerous high-dimensional sparse binary data representations of the windowed data streams (each fingerprint is significantly compressed compared to the actual window). These fingerprints encode key features of the actual window, enabling comparison among fingerprints rather than cross-correlating windows for comparison. Further dimensionality reduction is then applied to each fingerprint and similar fingerprints are grouped together using locality-sensitive hashing. Developing

  8. Earthquake detection through computationally efficient similarity search.

    Science.gov (United States)

    Yoon, Clara E; O'Reilly, Ossian; Bergen, Karianne J; Beroza, Gregory C

    2015-12-01

    Seismology is experiencing rapid growth in the quantity of data, which has outpaced the development of processing algorithms. Earthquake detection-identification of seismic events in continuous data-is a fundamental operation for observational seismology. We developed an efficient method to detect earthquakes using waveform similarity that overcomes the disadvantages of existing detection methods. Our method, called Fingerprint And Similarity Thresholding (FAST), can analyze a week of continuous seismic waveform data in less than 2 hours, or 140 times faster than autocorrelation. FAST adapts a data mining algorithm, originally designed to identify similar audio clips within large databases; it first creates compact "fingerprints" of waveforms by extracting key discriminative features, then groups similar fingerprints together within a database to facilitate fast, scalable search for similar fingerprint pairs, and finally generates a list of earthquake detections. FAST detected most (21 of 24) cataloged earthquakes and 68 uncataloged earthquakes in 1 week of continuous data from a station located near the Calaveras Fault in central California, achieving detection performance comparable to that of autocorrelation, with some additional false detections. FAST is expected to realize its full potential when applied to extremely long duration data sets over a distributed network of seismic stations. The widespread application of FAST has the potential to aid in the discovery of unexpected seismic signals, improve seismic monitoring, and promote a greater understanding of a variety of earthquake processes.

  9. Mining patents using molecular similarity search.

    Science.gov (United States)

    Rhodes, James; Boyer, Stephen; Kreulen, Jeffrey; Chen, Ying; Ordonez, Patricia

    2007-01-01

    Text analytics is becoming an increasingly important tool used in biomedical research. While advances continue to be made in the core algorithms for entity identification and relation extraction, a need for practical applications of these technologies arises. We developed a system that allows users to explore the US Patent corpus using molecular information. The core of our system contains three main technologies: A high performing chemical annotator which identifies chemical terms and converts them to structures, a similarity search engine based on the emerging IUPAC International Chemical Identifier (InChI) standard, and a set of on demand data mining tools. By leveraging this technology we were able to rapidly identify and index 3,623,248 unique chemical structures from 4,375,036 US Patents and Patent Applications. Using this system a user may go to a web page, draw a molecule, search for related Intellectual Property (IP) and analyze the results. Our results prove that this is a far more effective way for identifying IP than traditional keyword based approaches.

  10. Optimal neighborhood indexing for protein similarity search.

    Science.gov (United States)

    Peterlongo, Pierre; Noé, Laurent; Lavenier, Dominique; Nguyen, Van Hoa; Kucherov, Gregory; Giraud, Mathieu

    2008-12-16

    Similarity inference, one of the main bioinformatics tasks, has to face an exponential growth of the biological data. A classical approach used to cope with this data flow involves heuristics with large seed indexes. In order to speed up this technique, the index can be enhanced by storing additional information to limit the number of random memory accesses. However, this improvement leads to a larger index that may become a bottleneck. In the case of protein similarity search, we propose to decrease the index size by reducing the amino acid alphabet. The paper presents two main contributions. First, we show that an optimal neighborhood indexing combining an alphabet reduction and a longer neighborhood leads to a reduction of 35% of memory involved into the process, without sacrificing the quality of results nor the computational time. Second, our approach led us to develop a new kind of substitution score matrices and their associated e-value parameters. In contrast to usual matrices, these matrices are rectangular since they compare amino acid groups from different alphabets. We describe the method used for computing those matrices and we provide some typical examples that can be used in such comparisons. Supplementary data can be found on the website http://bioinfo.lifl.fr/reblosum. We propose a practical index size reduction of the neighborhood data, that does not negatively affect the performance of large-scale search in protein sequences. Such an index can be used in any study involving large protein data. Moreover, rectangular substitution score matrices and their associated statistical parameters can have applications in any study involving an alphabet reduction.

  11. Omokage search: shape similarity search service for biomolecular structures in both the PDB and EMDB.

    Science.gov (United States)

    Suzuki, Hirofumi; Kawabata, Takeshi; Nakamura, Haruki

    2016-02-15

    Omokage search is a service to search the global shape similarity of biological macromolecules and their assemblies, in both the Protein Data Bank (PDB) and Electron Microscopy Data Bank (EMDB). The server compares global shapes of assemblies independent of sequence order and number of subunits. As a search query, the user inputs a structure ID (PDB ID or EMDB ID) or uploads an atomic model or 3D density map to the server. The search is performed usually within 1 min, using one-dimensional profiles (incremental distance rank profiles) to characterize the shapes. Using the gmfit (Gaussian mixture model fitting) program, the found structures are fitted onto the query structure and their superimposed structures are displayed on the Web browser. Our service provides new structural perspectives to life science researchers. Omokage search is freely accessible at http://pdbj.org/omokage/. © The Author 2015. Published by Oxford University Press.

  12. Outsourced similarity search on metric data assets

    DEFF Research Database (Denmark)

    Yiu, Man Lung; Assent, Ira; Jensen, Christian Søndergaard

    2012-01-01

    This paper considers a cloud computing setting in which similarity querying of metric data is outsourced to a service provider. The data is to be revealed only to trusted users, not to the service provider or anyone else. Users query the server for the most similar data objects to a query example...

  13. A Similarity Search Using Molecular Topological Graphs

    Directory of Open Access Journals (Sweden)

    Yoshifumi Fukunishi

    2009-01-01

    Full Text Available A molecular similarity measure has been developed using molecular topological graphs and atomic partial charges. Two kinds of topological graphs were used. One is the ordinary adjacency matrix and the other is a matrix which represents the minimum path length between two atoms of the molecule. The ordinary adjacency matrix is suitable to compare the local structures of molecules such as functional groups, and the other matrix is suitable to compare the global structures of molecules. The combination of these two matrices gave a similarity measure. This method was applied to in silico drug screening, and the results showed that it was effective as a similarity measure.

  14. Outsourced similarity search on metric data assets

    KAUST Repository

    Yiu, Man Lung

    2012-02-01

    This paper considers a cloud computing setting in which similarity querying of metric data is outsourced to a service provider. The data is to be revealed only to trusted users, not to the service provider or anyone else. Users query the server for the most similar data objects to a query example. Outsourcing offers the data owner scalability and a low-initial investment. The need for privacy may be due to the data being sensitive (e.g., in medicine), valuable (e.g., in astronomy), or otherwise confidential. Given this setting, the paper presents techniques that transform the data prior to supplying it to the service provider for similarity queries on the transformed data. Our techniques provide interesting trade-offs between query cost and accuracy. They are then further extended to offer an intuitive privacy guarantee. Empirical studies with real data demonstrate that the techniques are capable of offering privacy while enabling efficient and accurate processing of similarity queries.

  15. Adaptive and Dynamic Pivot Selection for Similarity Search

    OpenAIRE

    Salvetti, Mariano; Deco, Claudia; Reyes, Nora; Bender, Cristina

    2011-01-01

    In this paper, a new indexing and similarity search method based on dynamic selection of pivots is presented. It uses Sparse Spatial Selection (SSS) for the initial selection of pivots. Two new selection policies of pivots are added, in order to the index suits itself to searches when it adapts to the metric space. The proposed structure automatically adjusts to the region where most of searches are made. In this way, the amount of distance computations during searches is reduced. The adjustm...

  16. Modeling Tanimoto Similarity Value Distributions and Predicting Search Results.

    Science.gov (United States)

    Vogt, Martin; Bajorath, Jürgen

    2017-07-01

    Similarity searching using molecular fingerprints has a long history in chemoinformatics and continues to be a popular approach for virtual screening. Typically, known active reference molecules are used to search databases for new active compounds. However, this search has black box character because similarity value distributions are dependent on fingerprints and compound classes. Consequently, no generally applicable similarity threshold values are available as reliable indicators of activity relationships between reference and database compounds. Therefore, it is generally uncertain where new active compounds might appear in database rankings, if at all. In this contribution, methods are discussed for modeling similarity value distributions of fingerprint search calculations using Tanimoto coefficients and estimating rank positions of active compounds. To our knowledge, these are the first approaches for predicting the results of fingerprint-based similarity searching. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  17. Applying ligands profiling using multiple extended electron distribution based field templates and feature trees similarity searching in the discovery of new generation of urea-based antineoplastic kinase inhibitors.

    Directory of Open Access Journals (Sweden)

    Eman M Dokla

    Full Text Available This study provides a comprehensive computational procedure for the discovery of novel urea-based antineoplastic kinase inhibitors while focusing on diversification of both chemotype and selectivity pattern. It presents a systematic structural analysis of the different binding motifs of urea-based kinase inhibitors and the corresponding configurations of the kinase enzymes. The computational model depends on simultaneous application of two protocols. The first protocol applies multiple consecutive validated virtual screening filters including SMARTS, support vector-machine model (ROC = 0.98, Bayesian model (ROC = 0.86 and structure-based pharmacophore filters based on urea-based kinase inhibitors complexes retrieved from literature. This is followed by hits profiling against different extended electron distribution (XED based field templates representing different kinase targets. The second protocol enables cancericidal activity verification by using the algorithm of feature trees (Ftrees similarity searching against NCI database. Being a proof-of-concept study, this combined procedure was experimentally validated by its utilization in developing a novel series of urea-based derivatives of strong anticancer activity. This new series is based on 3-benzylbenzo[d]thiazol-2(3H-one scaffold which has interesting chemical feasibility and wide diversification capability. Antineoplastic activity of this series was assayed in vitro against NCI 60 tumor-cell lines showing very strong inhibition of GI(50 as low as 0.9 uM. Additionally, its mechanism was unleashed using KINEX™ protein kinase microarray-based small molecule inhibitor profiling platform and cell cycle analysis showing a peculiar selectivity pattern against Zap70, c-src, Mink1, csk and MeKK2 kinases. Interestingly, it showed activity on syk kinase confirming the recent studies finding of the high activity of diphenyl urea containing compounds against this kinase. Allover, the new series

  18. Exact Score Distribution Computation for Similarity Searches in Ontologies

    Science.gov (United States)

    Schulz, Marcel H.; Köhler, Sebastian; Bauer, Sebastian; Vingron, Martin; Robinson, Peter N.

    Semantic similarity searches in ontologies are an important component of many bioinformatic algorithms, e.g., protein function prediction with the Gene Ontology. In this paper we consider the exact computation of score distributions for similarity searches in ontologies, and introduce a simple null hypothesis which can be used to compute a P-value for the statistical significance of similarity scores. We concentrate on measures based on Resnik’s definition of ontological similarity. A new algorithm is proposed that collapses subgraphs of the ontology graph and thereby allows fast score distribution computation. The new algorithm is several orders of magnitude faster than the naive approach, as we demonstrate by computing score distributions for similarity searches in the Human Phenotype Ontology.

  19. Margin-Based Pivot Selection for Similarity Search Indexes

    Science.gov (United States)

    Kurasawa, Hisashi; Fukagawa, Daiji; Takasu, Atsuhiro; Adachi, Jun

    When developing an index for a similarity search in metric spaces, how to divide the space for effective search pruning is a fundamental issue. We present Maximal Metric Margin Partitioning (MMMP), a partitioning scheme for similarity search indexes. MMMP divides the data based on its distribution pattern, especially for the boundaries of clusters. A partitioning boundary created by MMMP is likely to be located in a sparse area between clusters. Moreover, the partitioning boundary is at maximum distances from the two cluster edges. We also present an indexing scheme, named the MMMP-Index, which uses MMMP and pivot filtering. The MMMP-Index can prune many objects that are not relevant to a query, and it reduces the query execution cost. Our experimental results show that MMMP effectively indexes clustered data and reduces the search cost. For clustered data in a vector space, the MMMP-Index reduces the computational cost to less than two thirds that of comparable schemes.

  20. Image similarity search using a negative selection algorithm

    NARCIS (Netherlands)

    Keijzers, S.; Maandag, P.; Marchiori, E.; Sprinkhuizen-Kuyper, I.G.

    2013-01-01

    The Negative Selection Algorithm is an immune inspired algorithm that can be used for different purposes such as fault detection, data integrity protection and virus detection. In this paper we show how the Negative Selection Algorithm can be adapted to tackle the similar image search problem: given

  1. How Google Web Search copes with very similar documents

    NARCIS (Netherlands)

    W. Mettrop (Wouter); P. Nieuwenhuysen; H. Smulders

    2006-01-01

    textabstractA significant portion of the computer files that carry documents, multimedia, programs etc. on the Web are identical or very similar to other files on the Web. How do search engines cope with this? Do they perform some kind of “deduplication”? How should users take into account that

  2. RAPSearch: a fast protein similarity search tool for short reads

    Directory of Open Access Journals (Sweden)

    Choi Jeong-Hyeon

    2011-05-01

    Full Text Available Abstract Background Next Generation Sequencing (NGS is producing enormous corpuses of short DNA reads, affecting emerging fields like metagenomics. Protein similarity search--a key step to achieve annotation of protein-coding genes in these short reads, and identification of their biological functions--faces daunting challenges because of the very sizes of the short read datasets. Results We developed a fast protein similarity search tool RAPSearch that utilizes a reduced amino acid alphabet and suffix array to detect seeds of flexible length. For short reads (translated in 6 frames we tested, RAPSearch achieved ~20-90 times speedup as compared to BLASTX. RAPSearch missed only a small fraction (~1.3-3.2% of BLASTX similarity hits, but it also discovered additional homologous proteins (~0.3-2.1% that BLASTX missed. By contrast, BLAT, a tool that is even slightly faster than RAPSearch, had significant loss of sensitivity as compared to RAPSearch and BLAST. Conclusions RAPSearch is implemented as open-source software and is accessible at http://omics.informatics.indiana.edu/mg/RAPSearch. It enables faster protein similarity search. The application of RAPSearch in metageomics has also been demonstrated.

  3. Robust hashing with local models for approximate similarity search.

    Science.gov (United States)

    Song, Jingkuan; Yang, Yi; Li, Xuelong; Huang, Zi; Yang, Yang

    2014-07-01

    Similarity search plays an important role in many applications involving high-dimensional data. Due to the known dimensionality curse, the performance of most existing indexing structures degrades quickly as the feature dimensionality increases. Hashing methods, such as locality sensitive hashing (LSH) and its variants, have been widely used to achieve fast approximate similarity search by trading search quality for efficiency. However, most existing hashing methods make use of randomized algorithms to generate hash codes without considering the specific structural information in the data. In this paper, we propose a novel hashing method, namely, robust hashing with local models (RHLM), which learns a set of robust hash functions to map the high-dimensional data points into binary hash codes by effectively utilizing local structural information. In RHLM, for each individual data point in the training dataset, a local hashing model is learned and used to predict the hash codes of its neighboring data points. The local models from all the data points are globally aligned so that an optimal hash code can be assigned to each data point. After obtaining the hash codes of all the training data points, we design a robust method by employing l2,1 -norm minimization on the loss function to learn effective hash functions, which are then used to map each database point into its hash code. Given a query data point, the search process first maps it into the query hash code by the hash functions and then explores the buckets, which have similar hash codes to the query hash code. Extensive experimental results conducted on real-life datasets show that the proposed RHLM outperforms the state-of-the-art methods in terms of search quality and efficiency.

  4. Semantic similarity measure in biomedical domain leverage web search engine.

    Science.gov (United States)

    Chen, Chi-Huang; Hsieh, Sheau-Ling; Weng, Yung-Ching; Chang, Wen-Yung; Lai, Feipei

    2010-01-01

    Semantic similarity measure plays an essential role in Information Retrieval and Natural Language Processing. In this paper we propose a page-count-based semantic similarity measure and apply it in biomedical domains. Previous researches in semantic web related applications have deployed various semantic similarity measures. Despite the usefulness of the measurements in those applications, measuring semantic similarity between two terms remains a challenge task. The proposed method exploits page counts returned by the Web Search Engine. We define various similarity scores for two given terms P and Q, using the page counts for querying P, Q and P AND Q. Moreover, we propose a novel approach to compute semantic similarity using lexico-syntactic patterns with page counts. These different similarity scores are integrated adapting support vector machines, to leverage the robustness of semantic similarity measures. Experimental results on two datasets achieve correlation coefficients of 0.798 on the dataset provided by A. Hliaoutakis, 0.705 on the dataset provide by T. Pedersen with physician scores and 0.496 on the dataset provided by T. Pedersen et al. with expert scores.

  5. Target-nontarget similarity decreases search efficiency and increases stimulus-driven control in visual search.

    Science.gov (United States)

    Barras, Caroline; Kerzel, Dirk

    2017-10-01

    Some points of criticism against the idea that attentional selection is controlled by bottom-up processing were dispelled by the attentional window account. The attentional window account claims that saliency computations during visual search are only performed for stimuli inside the attentional window. Therefore, a small attentional window may avoid attentional capture by salient distractors because it is likely that the salient distractor is located outside the window. In contrast, a large attentional window increases the chances of attentional capture by a salient distractor. Large and small attentional windows have been associated with efficient (parallel) and inefficient (serial) search, respectively. We compared the effect of a salient color singleton on visual search for a shape singleton during efficient and inefficient search. To vary search efficiency, the nontarget shapes were either similar or dissimilar with respect to the shape singleton. We found that interference from the color singleton was larger with inefficient than efficient search, which contradicts the attentional window account. While inconsistent with the attentional window account, our results are predicted by computational models of visual search. Because of target-nontarget similarity, the target was less salient with inefficient than efficient search. Consequently, the relative saliency of the color distractor was higher with inefficient than with efficient search. Accordingly, stronger attentional capture resulted. Overall, the present results show that bottom-up control by stimulus saliency is stronger when search is difficult, which is inconsistent with the attentional window account.

  6. Multiple search methods for similarity-based virtual screening: analysis of search overlap and precision

    Directory of Open Access Journals (Sweden)

    Holliday John D

    2011-08-01

    Full Text Available Abstract Background Data fusion methods are widely used in virtual screening, and make the implicit assumption that the more often a molecule is retrieved in multiple similarity searches, the more likely it is to be active. This paper tests the correctness of this assumption. Results Sets of 25 searches using either the same reference structure and 25 different similarity measures (similarity fusion or 25 different reference structures and the same similarity measure (group fusion show that large numbers of unique molecules are retrieved by just a single search, but that the numbers of unique molecules decrease very rapidly as more searches are considered. This rapid decrease is accompanied by a rapid increase in the fraction of those retrieved molecules that are active. There is an approximately log-log relationship between the numbers of different molecules retrieved and the number of searches carried out, and a rationale for this power-law behaviour is provided. Conclusions Using multiple searches provides a simple way of increasing the precision of a similarity search, and thus provides a justification for the use of data fusion methods in virtual screening.

  7. Long-Term Mean Wind Profiles Based on Similarity Theory

    DEFF Research Database (Denmark)

    Kelly, Mark C.; Gryning, Sven-Erik

    2010-01-01

    in turn facilitates the derivation of a long-term mean wind profile based on Monin–Obukhov similarity theory. The modelled stability distributions exhibit good agreement with measurements from sites having different local conditions. The long-term wind profile formulation is further extended to include...... the influence of the depth of the atmospheric boundary layer (h), which becomes relevant for heights above h/3, and the resultant long-term ‘tall’ profile form also matches observations....

  8. Query-dependent banding (QDB for faster RNA similarity searches.

    Directory of Open Access Journals (Sweden)

    Eric P Nawrocki

    2007-03-01

    Full Text Available When searching sequence databases for RNAs, it is desirable to score both primary sequence and RNA secondary structure similarity. Covariance models (CMs are probabilistic models well-suited for RNA similarity search applications. However, the computational complexity of CM dynamic programming alignment algorithms has limited their practical application. Here we describe an acceleration method called query-dependent banding (QDB, which uses the probabilistic query CM to precalculate regions of the dynamic programming lattice that have negligible probability, independently of the target database. We have implemented QDB in the freely available Infernal software package. QDB reduces the average case time complexity of CM alignment from LN(2.4 to LN(1.3 for a query RNA of N residues and a target database of L residues, resulting in a 4-fold speedup for typical RNA queries. Combined with other improvements to Infernal, including informative mixture Dirichlet priors on model parameters, benchmarks also show increased sensitivity and specificity resulting from improved parameterization.

  9. Similarity-based search of model organism, disease and drug effect phenotypes

    KAUST Repository

    Hoehndorf, Robert

    2015-02-19

    Background: Semantic similarity measures over phenotype ontologies have been demonstrated to provide a powerful approach for the analysis of model organism phenotypes, the discovery of animal models of human disease, novel pathways, gene functions, druggable therapeutic targets, and determination of pathogenicity. Results: We have developed PhenomeNET 2, a system that enables similarity-based searches over a large repository of phenotypes in real-time. It can be used to identify strains of model organisms that are phenotypically similar to human patients, diseases that are phenotypically similar to model organism phenotypes, or drug effect profiles that are similar to the phenotypes observed in a patient or model organism. PhenomeNET 2 is available at http://aber-owl.net/phenomenet. Conclusions: Phenotype-similarity searches can provide a powerful tool for the discovery and investigation of molecular mechanisms underlying an observed phenotypic manifestation. PhenomeNET 2 facilitates user-defined similarity searches and allows researchers to analyze their data within a large repository of human, mouse and rat phenotypes.

  10. POSTER: Privacy-Preserving Profile Similarity Computation in Online Social Networks

    NARCIS (Netherlands)

    Jeckmans, Arjan; Tang, Qiang; Hartel, Pieter H.

    2011-01-01

    Currently, none of the existing online social networks (OSNs) enables its users to make new friends without revealing their private information. This leaves the users in a vulnerable position when searching for new friends. We propose a solution which enables a user to compute her profile similarity

  11. PHOG-BLAST – a new generation tool for fast similarity search of protein families

    Directory of Open Access Journals (Sweden)

    Mironov Andrey A

    2006-06-01

    Full Text Available Abstract Background The need to compare protein profiles frequently arises in various protein research areas: comparison of protein families, domain searches, resolution of orthology and paralogy. The existing fast algorithms can only compare a protein sequence with a protein sequence and a profile with a sequence. Algorithms to compare profiles use dynamic programming and complex scoring functions. Results We developed a new algorithm called PHOG-BLAST for fast similarity search of profiles. This algorithm uses profile discretization to convert a profile to a finite alphabet and utilizes hashing for fast search. To determine the optimal alphabet, we analyzed columns in reliable multiple alignments and obtained column clusters in the 20-dimensional profile space by applying a special clustering procedure. We show that the clustering procedure works best if its parameters are chosen so that 20 profile clusters are obtained which can be interpreted as ancestral amino acid residues. With these clusters, only less than 2% of columns in multiple alignments are out of clusters. We tested the performance of PHOG-BLAST vs. PSI-BLAST on three well-known databases of multiple alignments: COG, PFAM and BALIBASE. On the COG database both algorithms showed the same performance, on PFAM and BALIBASE PHOG-BLAST was much superior to PSI-BLAST. PHOG-BLAST required 10–20 times less computer memory and computation time than PSI-BLAST. Conclusion Since PHOG-BLAST can compare multiple alignments of protein families, it can be used in different areas of comparative proteomics and protein evolution. For example, PHOG-BLAST helped to build the PHOG database of phylogenetic orthologous groups. An essential step in building this database was comparing protein complements of different species and orthologous groups of different taxons on a personal computer in reasonable time. When it is applied to detect weak similarity between protein families, PHOG-BLAST is less

  12. Profile Similarity Metrics Increase Personality Scale Validity (Briefing Charts)

    Science.gov (United States)

    2016-04-15

    unless so designated by other documentation. Personality Tests • Modest scale validity & minor adverse impact • Scale scores often computed as the mean ...Profile Similarity Metrics Increase Personality Scale Validity Peter J. Legree, Robert N. Kilcullen U.S. Army Research Institute for the Behavioral... Personality Tests & Distance Metrics • Conventional and distance scores are redundant: r = -1.00 3 Fitness Motivation Rating Conventional Score Key Distance

  13. Similarity between neonatal profile and socioeconomic index: a spatial approach

    Directory of Open Access Journals (Sweden)

    d'Orsi Eleonora

    2005-01-01

    Full Text Available This study aims to compare neonatal characteristics and socioeconomic conditions in Rio de Janeiro city neighborhoods in order to identify priority areas for intervention. The study design was ecological. Two databases were used: the Brazilian Population Census and the Live Birth Information System, aggregated by neighborhoods. Spatial analysis, multivariate cluster classification, and Moran's I statistics for detection of spatial clustering were used. A similarity index was created to compare socioeconomic clusters with the neonatal profile in each neighborhood. The proportions of Apgar score above 8 and cesarean sections showed positive spatial correlation and high similarity with the socioeconomic index. The proportion of low birth weight infants showed a random spatial distribution, indicating that at this scale of analysis, birth weight is not sufficiently sensitive to discriminate subtler differences among population groups. The observed relationship between the neighborhoods' neonatal profile (particularly Apgar score and mode of delivery and socioeconomic conditions shows evidence of a change in infant health profile, where the possibility for intervention shifts to medical services and the Apgar score assumes growing significance as a risk indicator.

  14. Density-based similarity measures for content based search

    Energy Technology Data Exchange (ETDEWEB)

    Hush, Don R [Los Alamos National Laboratory; Porter, Reid B [Los Alamos National Laboratory; Ruggiero, Christy E [Los Alamos National Laboratory

    2009-01-01

    We consider the query by multiple example problem where the goal is to identify database samples whose content is similar to a coUection of query samples. To assess the similarity we use a relative content density which quantifies the relative concentration of the query distribution to the database distribution. If the database distribution is a mixture of the query distribution and a background distribution then it can be shown that database samples whose relative content density is greater than a particular threshold {rho} are more likely to have been generated by the query distribution than the background distribution. We describe an algorithm for predicting samples with relative content density greater than {rho} that is computationally efficient and possesses strong performance guarantees. We also show empirical results for applications in computer network monitoring and image segmentation.

  15. SHOP: scaffold hopping by GRID-based similarity searches

    DEFF Research Database (Denmark)

    Bergmann, Rikke; Linusson, Anna; Zamora, Ismael

    2007-01-01

    scaffolds were in the 31 top-ranked scaffolds. SHOP also identified new scaffolds with substantially different chemotypes from the queries. Docking analysis indicated that the new scaffolds would have similar binding modes to those of the respective query scaffolds observed in X-ray structures...... ligands of three different protein targets relevant for drug discovery using a rational approach based on statistical experimental design. Five out of eight and seven out of eight thrombin scaffolds and all seven HIV protease scaffolds were recovered within the top 10 and 31 out of 31 neuraminidase...

  16. Content-Based Search on a Database of Geometric Models: Identifying Objects of Similar Shape

    Energy Technology Data Exchange (ETDEWEB)

    XAVIER, PATRICK G.; HENRY, TYSON R.; LAFARGE, ROBERT A.; MEIRANS, LILITA; RAY, LAWRENCE P.

    2001-11-01

    The Geometric Search Engine is a software system for storing and searching a database of geometric models. The database maybe searched for modeled objects similar in shape to a target model supplied by the user. The database models are generally from CAD models while the target model may be either a CAD model or a model generated from range data collected from a physical object. This document describes key generation, database layout, and search of the database.

  17. Perceptual Grouping in Haptic Search: The Influence of Proximity, Similarity, and Good Continuation

    Science.gov (United States)

    Overvliet, Krista E.; Krampe, Ralf Th.; Wagemans, Johan

    2012-01-01

    We conducted a haptic search experiment to investigate the influence of the Gestalt principles of proximity, similarity, and good continuation. We expected faster search when the distractors could be grouped. We chose edges at different orientations as stimuli because they are processed similarly in the haptic and visual modality. We therefore…

  18. OS2: Oblivious similarity based searching for encrypted data outsourced to an untrusted domain

    Science.gov (United States)

    Pervez, Zeeshan; Ahmad, Mahmood; Khattak, Asad Masood; Ramzan, Naeem

    2017-01-01

    Public cloud storage services are becoming prevalent and myriad data sharing, archiving and collaborative services have emerged which harness the pay-as-you-go business model of public cloud. To ensure privacy and confidentiality often encrypted data is outsourced to such services, which further complicates the process of accessing relevant data by using search queries. Search over encrypted data schemes solve this problem by exploiting cryptographic primitives and secure indexing to identify outsourced data that satisfy the search criteria. Almost all of these schemes rely on exact matching between the encrypted data and search criteria. A few schemes which extend the notion of exact matching to similarity based search, lack realism as those schemes rely on trusted third parties or due to increase storage and computational complexity. In this paper we propose Oblivious Similarity based Search (OS2) for encrypted data. It enables authorized users to model their own encrypted search queries which are resilient to typographical errors. Unlike conventional methodologies, OS2 ranks the search results by using similarity measure offering a better search experience than exact matching. It utilizes encrypted bloom filter and probabilistic homomorphic encryption to enable authorized users to access relevant data without revealing results of search query evaluation process to the untrusted cloud service provider. Encrypted bloom filter based search enables OS2 to reduce search space to potentially relevant encrypted data avoiding unnecessary computation on public cloud. The efficacy of OS2 is evaluated on Google App Engine for various bloom filter lengths on different cloud configurations. PMID:28692697

  19. Personalization in E-commerce using profiles similarity

    Directory of Open Access Journals (Sweden)

    Radu LIXANDROIU

    2015-06-01

    Full Text Available Understanding the use needs is one of the key factors of an online project. If these needs are quickly identified, the customer can be offered the best products immediately. Creating profiles allows the identification and communication of needs efficiently and effectively. Basically if these profiles are well established, it remains to identify just which is the closest profile to the online client. Profiling is a useful tool in marketing, increasing the functionality of sales application. These tools fall into customer-oriented tools, together with the analytical techniques ones and those suggesting the desired products.

  20. In search of the entrepreneurial profile(s) in Luxembourg

    OpenAIRE

    Dimaria, Charles-Henri; Ries, Jean

    2006-01-01

    This article tries to characterize the profiles of entrepreneurs in Luxembourg. First, theoretical benchmark definitions of entrepreneur and entrepreneurship are surveyed and descriptive statistics are computed to define an average profile of the entrepreneur using a new and original dataset for Luxembourg. Then, using the Factors of Business Success survey (FoBS), clustering techniques are used to determine potential entrepreneurial profiles in Luxembourg.

  1. Accelerated similarity searching and clustering of large compound sets by geometric embedding and locality sensitive hashing.

    Science.gov (United States)

    Cao, Yiqun; Jiang, Tao; Girke, Thomas

    2010-04-01

    Similarity searching and clustering of chemical compounds by structural similarities are important computational approaches for identifying drug-like small molecules. Most algorithms available for these tasks are limited by their speed and scalability, and cannot handle today's large compound databases with several million entries. In this article, we introduce a new algorithm for accelerated similarity searching and clustering of very large compound sets using embedding and indexing (EI) techniques. First, we present EI-Search as a general purpose similarity search method for finding objects with similar features in large databases and apply it here to searching and clustering of large compound sets. The method embeds the compounds in a high-dimensional Euclidean space and searches this space using an efficient index-aware nearest neighbor search method based on locality sensitive hashing (LSH). Second, to cluster large compound sets, we introduce the EI-Clustering algorithm that combines the EI-Search method with Jarvis-Patrick clustering. Both methods were tested on three large datasets with sizes ranging from about 260 000 to over 19 million compounds. In comparison to sequential search methods, the EI-Search method was 40-200 times faster, while maintaining comparable recall rates. The EI-Clustering method allowed us to significantly reduce the CPU time required to cluster these large compound libraries from several months to only a few days. Software implementations and online services have been developed based on the methods introduced in this study. The online services provide access to the generated clustering results and ultra-fast similarity searching of the PubChem Compound database with subsecond response time.

  2. Content Based Retrieval Database Management System with Support for Similarity Searching and Query Refinement

    National Research Council Canada - National Science Library

    Ortega-Binderberger, Michael

    2002-01-01

    ... as a critical area of research. This thesis explores how to enhance database systems with content based search over arbitrary abstract data types in a similarity based framework with query refinement...

  3. [Formula: see text]: Oblivious similarity based searching for encrypted data outsourced to an untrusted domain.

    Directory of Open Access Journals (Sweden)

    Zeeshan Pervez

    Full Text Available Public cloud storage services are becoming prevalent and myriad data sharing, archiving and collaborative services have emerged which harness the pay-as-you-go business model of public cloud. To ensure privacy and confidentiality often encrypted data is outsourced to such services, which further complicates the process of accessing relevant data by using search queries. Search over encrypted data schemes solve this problem by exploiting cryptographic primitives and secure indexing to identify outsourced data that satisfy the search criteria. Almost all of these schemes rely on exact matching between the encrypted data and search criteria. A few schemes which extend the notion of exact matching to similarity based search, lack realism as those schemes rely on trusted third parties or due to increase storage and computational complexity. In this paper we propose Oblivious Similarity based Search ([Formula: see text] for encrypted data. It enables authorized users to model their own encrypted search queries which are resilient to typographical errors. Unlike conventional methodologies, [Formula: see text] ranks the search results by using similarity measure offering a better search experience than exact matching. It utilizes encrypted bloom filter and probabilistic homomorphic encryption to enable authorized users to access relevant data without revealing results of search query evaluation process to the untrusted cloud service provider. Encrypted bloom filter based search enables [Formula: see text] to reduce search space to potentially relevant encrypted data avoiding unnecessary computation on public cloud. The efficacy of [Formula: see text] is evaluated on Google App Engine for various bloom filter lengths on different cloud configurations.

  4. [Formula: see text]: Oblivious similarity based searching for encrypted data outsourced to an untrusted domain.

    Science.gov (United States)

    Pervez, Zeeshan; Ahmad, Mahmood; Khattak, Asad Masood; Ramzan, Naeem; Khan, Wajahat Ali

    2017-01-01

    Public cloud storage services are becoming prevalent and myriad data sharing, archiving and collaborative services have emerged which harness the pay-as-you-go business model of public cloud. To ensure privacy and confidentiality often encrypted data is outsourced to such services, which further complicates the process of accessing relevant data by using search queries. Search over encrypted data schemes solve this problem by exploiting cryptographic primitives and secure indexing to identify outsourced data that satisfy the search criteria. Almost all of these schemes rely on exact matching between the encrypted data and search criteria. A few schemes which extend the notion of exact matching to similarity based search, lack realism as those schemes rely on trusted third parties or due to increase storage and computational complexity. In this paper we propose Oblivious Similarity based Search ([Formula: see text]) for encrypted data. It enables authorized users to model their own encrypted search queries which are resilient to typographical errors. Unlike conventional methodologies, [Formula: see text] ranks the search results by using similarity measure offering a better search experience than exact matching. It utilizes encrypted bloom filter and probabilistic homomorphic encryption to enable authorized users to access relevant data without revealing results of search query evaluation process to the untrusted cloud service provider. Encrypted bloom filter based search enables [Formula: see text] to reduce search space to potentially relevant encrypted data avoiding unnecessary computation on public cloud. The efficacy of [Formula: see text] is evaluated on Google App Engine for various bloom filter lengths on different cloud configurations.

  5. SS-Wrapper: a package of wrapper applications for similarity searches on Linux clusters

    Directory of Open Access Journals (Sweden)

    Lefkowitz Elliot J

    2004-10-01

    Full Text Available Abstract Background Large-scale sequence comparison is a powerful tool for biological inference in modern molecular biology. Comparing new sequences to those in annotated databases is a useful source of functional and structural information about these sequences. Using software such as the basic local alignment search tool (BLAST or HMMPFAM to identify statistically significant matches between newly sequenced segments of genetic material and those in databases is an important task for most molecular biologists. Searching algorithms are intrinsically slow and data-intensive, especially in light of the rapid growth of biological sequence databases due to the emergence of high throughput DNA sequencing techniques. Thus, traditional bioinformatics tools are impractical on PCs and even on dedicated UNIX servers. To take advantage of larger databases and more reliable methods, high performance computation becomes necessary. Results We describe the implementation of SS-Wrapper (Similarity Search Wrapper, a package of wrapper applications that can parallelize similarity search applications on a Linux cluster. Our wrapper utilizes a query segmentation-search (QS-search approach to parallelize sequence database search applications. It takes into consideration load balancing between each node on the cluster to maximize resource usage. QS-search is designed to wrap many different search tools, such as BLAST and HMMPFAM using the same interface. This implementation does not alter the original program, so newly obtained programs and program updates should be accommodated easily. Benchmark experiments using QS-search to optimize BLAST and HMMPFAM showed that QS-search accelerated the performance of these programs almost linearly in proportion to the number of CPUs used. We have also implemented a wrapper that utilizes a database segmentation approach (DS-BLAST that provides a complementary solution for BLAST searches when the database is too large to fit into

  6. SS-Wrapper: a package of wrapper applications for similarity searches on Linux clusters.

    Science.gov (United States)

    Wang, Chunlin; Lefkowitz, Elliot J

    2004-10-28

    Large-scale sequence comparison is a powerful tool for biological inference in modern molecular biology. Comparing new sequences to those in annotated databases is a useful source of functional and structural information about these sequences. Using software such as the basic local alignment search tool (BLAST) or HMMPFAM to identify statistically significant matches between newly sequenced segments of genetic material and those in databases is an important task for most molecular biologists. Searching algorithms are intrinsically slow and data-intensive, especially in light of the rapid growth of biological sequence databases due to the emergence of high throughput DNA sequencing techniques. Thus, traditional bioinformatics tools are impractical on PCs and even on dedicated UNIX servers. To take advantage of larger databases and more reliable methods, high performance computation becomes necessary. We describe the implementation of SS-Wrapper (Similarity Search Wrapper), a package of wrapper applications that can parallelize similarity search applications on a Linux cluster. Our wrapper utilizes a query segmentation-search (QS-search) approach to parallelize sequence database search applications. It takes into consideration load balancing between each node on the cluster to maximize resource usage. QS-search is designed to wrap many different search tools, such as BLAST and HMMPFAM using the same interface. This implementation does not alter the original program, so newly obtained programs and program updates should be accommodated easily. Benchmark experiments using QS-search to optimize BLAST and HMMPFAM showed that QS-search accelerated the performance of these programs almost linearly in proportion to the number of CPUs used. We have also implemented a wrapper that utilizes a database segmentation approach (DS-BLAST) that provides a complementary solution for BLAST searches when the database is too large to fit into the memory of a single node. Used together

  7. MEASURING THE PERFORMANCE OF SIMILARITY PROPAGATION IN AN SEMANTIC SEARCH ENGINE

    Directory of Open Access Journals (Sweden)

    S. K. Jayanthi

    2013-10-01

    Full Text Available In the current scenario, web page result personalization is playing a vital role. Nearly 80 % of the users expect the best results in the first page itself without having any persistence to browse longer in URL mode. This research work focuses on two main themes: Semantic web search through online and Domain based search through offline. The first part is to find an effective method which allows grouping similar results together using BookShelf Data Structure and organizing the various clusters. The second one is focused on the academic domain based search through offline. This paper focuses on finding documents which are similar and how Vector space can be used to solve it. So more weightage is given for the principles and working methodology of similarity propagation. Cosine similarity measure is used for finding the relevancy among the documents.

  8. RxnFinder: biochemical reaction search engines using molecular structures, molecular fragments and reaction similarity.

    Science.gov (United States)

    Hu, Qian-Nan; Deng, Zhe; Hu, Huanan; Cao, Dong-Sheng; Liang, Yi-Zeng

    2011-09-01

    Biochemical reactions play a key role to help sustain life and allow cells to grow. RxnFinder was developed to search biochemical reactions from KEGG reaction database using three search criteria: molecular structures, molecular fragments and reaction similarity. RxnFinder is helpful to get reference reactions for biosynthesis and xenobiotics metabolism. RxnFinder is freely available via: http://sdd.whu.edu.cn/rxnfinder. qnhu@whu.edu.cn.

  9. Searching for Unknown Earthquakes in the Guy-Greenbrier, Arkansas, Earthquake Sequence using Efficient Waveform Similarity Search

    Science.gov (United States)

    Yoon, C. E.; OReilly, O. J.; Bergen, K.; Huang, Y.; Beroza, G. C.

    2015-12-01

    Recent seismicity rate increases in the central United States have been attributed to injection of wastewater from oil and gas production. One example is the Guy-Greenbrier, Arkansas, earthquake sequence, which occurred from July 2010 through October 2011, and was potentially induced by fluid injection into nearby disposal wells (Horton, 2012). Although the Arkansas seismic network is sparse, a single 3-component station WHAR recorded continuous data before, during, and after this earthquake sequence at distances ranging from 2-9 km. Huang and Beroza (2015) used template matching to detect over 100 times the number of cataloged earthquakes by cross-correlating the continuous data with waveform templates based on known earthquakes to search for additional low-magnitude events. Because known waveform templates do not necessarily fully represent all seismic signals in the continuous data, small earthquakes from unknown sources could have escaped detection. We use a method called Fingerprint And Similarity Thresholding (FAST) to detect additional low-magnitude earthquakes that were missed by template matching. FAST enables fast, scalable search for earthquakes with similar waveforms without making prior assumptions about the seismic signal of interest. FAST, based on a data mining technique, first creates compact "fingerprints" of waveforms by extracting discriminative features, then uses locality-sensitive hashing to organize and efficiently search for similar fingerprints (and therefore similar earthquake waveforms) in a probabilistic manner. With FAST, each search query is processed in near-constant time, independent of the dataset size; this computational efficiency is gained at the expense of an approximate, rather than exact, search. During one week of continuous data at station WHAR, from 2010-07-01 to 2010-07-08, FAST detected over 200 uncataloged earthquakes that were not found through template matching. These newly detected earthquakes have the potential to

  10. Wikipedia Chemical Structure Explorer: substructure and similarity searching of molecules from Wikipedia.

    Science.gov (United States)

    Ertl, Peter; Patiny, Luc; Sander, Thomas; Rufener, Christian; Zasso, Michaël

    2015-01-01

    Wikipedia, the world's largest and most popular encyclopedia is an indispensable source of chemistry information. It contains among others also entries for over 15,000 chemicals including metabolites, drugs, agrochemicals and industrial chemicals. To provide an easy access to this wealth of information we decided to develop a substructure and similarity search tool for chemical structures referenced in Wikipedia. We extracted chemical structures from entries in Wikipedia and implemented a web system allowing structure and similarity searching on these data. The whole search as well as visualization system is written in JavaScript and therefore can run locally within a web page and does not require a central server. The Wikipedia Chemical Structure Explorer is accessible on-line at www.cheminfo.org/wikipedia and is available also as an open source project from GitHub for local installation. The web-based Wikipedia Chemical Structure Explorer provides a useful resource for research as well as for chemical education enabling both researchers and students easy and user friendly chemistry searching and identification of relevant information in Wikipedia. The tool can also help to improve quality of chemical entries in Wikipedia by providing potential contributors regularly updated list of entries with problematic structures. And last but not least this search system is a nice example of how the modern web technology can be applied in the field of cheminformatics. Graphical abstractWikipedia Chemical Structure Explorer allows substructure and similarity searches on molecules referenced in Wikipedia.

  11. Determinants of dwell time in visual search: similarity or perceptual difficulty?

    Science.gov (United States)

    Becker, Stefanie I

    2011-03-08

    The present study examined the factors that determine the dwell times in a visual search task, that is, the duration the gaze remains fixated on an object. It has been suggested that an item's similarity to the search target should be an important determiner of dwell times, because dwell times are taken to reflect the time needed to reject the item as a distractor, and such discriminations are supposed to be harder the more similar an item is to the search target. In line with this similarity view, a previous study shows that, in search for a target ring of thin line-width, dwell times on thin linewidth Landolt C's distractors were longer than dwell times on Landolt C's with thick or medium linewidth. However, dwell times may have been longer on thin Landolt C's because the thin line-width made it harder to detect whether the stimuli had a gap or not. Thus, it is an open question whether dwell times on thin line-width distractors were longer because they were similar to the target or because the perceptual decision was more difficult. The present study de-coupled similarity from perceptual difficulty, by measuring dwell times on thin, medium and thick line-width distractors when the target had thin, medium or thick line-width. The results showed that dwell times were longer on target-similar than target-dissimilar stimuli across all target conditions and regardless of the line-width. It is concluded that prior findings of longer dwell times on thin linewidth-distractors can clearly be attributed to target similarity. As will be discussed towards the end, the finding of similarity effects on dwell times has important implications for current theories of visual search and eye movement control.

  12. Ligand scaffold hopping combining 3D maximal substructure search and molecular similarity

    Directory of Open Access Journals (Sweden)

    Petitjean Michel

    2009-08-01

    Full Text Available Abstract Background Virtual screening methods are now well established as effective to identify hit and lead candidates and are fully integrated in most drug discovery programs. Ligand-based approaches make use of physico-chemical, structural and energetics properties of known active compounds to search large chemical libraries for related and novel chemotypes. While 2D-similarity search tools are known to be fast and efficient, the use of 3D-similarity search methods can be very valuable to many research projects as integration of "3D knowledge" can facilitate the identification of not only related molecules but also of chemicals possessing distant scaffolds as compared to the query and therefore be more inclined to scaffolds hopping. To date, very few methods performing this task are easily available to the scientific community. Results We introduce a new approach (LigCSRre to the 3D ligand similarity search of drug candidates. It combines a 3D maximum common substructure search algorithm independent on atom order with a tunable description of atomic compatibilities to prune the search and increase its physico-chemical relevance. We show, on 47 experimentally validated active compounds across five protein targets having different specificities, that for single compound search, the approach is able to recover on average 52% of the co-actives in the top 1% of the ranked list which is better than gold standards of the field. Moreover, the combination of several runs on a single protein target using different query active compounds shows a remarkable improvement in enrichment. Such Results demonstrate LigCSRre as a valuable tool for ligand-based screening. Conclusion LigCSRre constitutes a new efficient and generic approach to the 3D similarity screening of small compounds, whose flexible design opens the door to many enhancements. The program is freely available to the academics for non-profit research at: http://bioserv.rpbs.univ-paris-diderot.fr/LigCSRre.html.

  13. On the similarity between exchangeable profiles: A psychometric model, analytic strategy, and empirical illustration.

    Science.gov (United States)

    Furr, R Michael; Wood, Dustin

    2013-06-01

    Analyses of profile similarity are widespread in personality psychology, but their apparent simplicity masks difficult psychometric and statistical issues. We present a psychometric framework that addresses an important challenge (i.e., profile normativeness) in examinations of dyadic exchangeable profiles. In addition, we present an analytic strategy accounting for non-independence that often arises in analyses of profile similarity, facilitating integrated examinations of variables at dyadic and individual levels. An empirical analysis of personality similarity and relationship quality demonstrates that the model and analytic strategy can reveal novel psychological insights. These are important advances, as previous work has ignored exchangeable profiles and has failed to present an integrated psychometric and statistical framework for profile similarity.

  14. RScan: fast searching structural similarities for structured RNAs in large databases

    Directory of Open Access Journals (Sweden)

    Liu Guo-Ping

    2007-07-01

    Full Text Available Abstract Background Many RNAs have evolutionarily conserved secondary structures instead of primary sequences. Recently, there are an increasing number of methods being developed with focus on the structural alignments for finding conserved secondary structures as well as common structural motifs in pair-wise or multiple sequences. A challenging task is to search similar structures quickly for structured RNA sequences in large genomic databases since existing methods are too slow to be used in large databases. Results An implementation of a fast structural alignment algorithm, RScan, is proposed to fulfill the task. RScan is developed by levering the advantages of both hashing algorithms and local alignment algorithms. In our experiment, on the average, the times for searching a tRNA and an rRNA in the randomized A. pernix genome are only 256 seconds and 832 seconds respectively by using RScan, but need 3,178 seconds and 8,951 seconds respectively by using an existing method RSEARCH. Remarkably, RScan can handle large database queries, taking less than 4 minutes for searching similar structures for a microRNA precursor in human chromosome 21. Conclusion These results indicate that RScan is a preferable choice for real-life application of searching structural similarities for structured RNAs in large databases. RScan software is freely available at http://bioinfo.au.tsinghua.edu.cn/member/cxue/rscan/RScan.htm.

  15. A search profile for dwellings with elevated radon levels

    DEFF Research Database (Denmark)

    Damkjær, A.; Andersen, C.E.; Majborn, B.

    1996-01-01

    A search profile for dwellings with elevated radon levels has been employed to investigate possibly radon-prone areas in Denmark and to find houses suitable for radon mitigation studies. The profile is defined as dwellings which are single-family houses with slab-on-grade foundation or partly...... basement/slab-on-grade foundation built on either fractured granitic basement rocks, or fractured limestone. Clayey till areas were also included in the profile in order to confirm earlier findings. Three areas representing these surface geologies were selected for indoor radon measurements with CR-39...... track detectors, and a total of 200 houses matching the profile underwent radon measurements during the winter 1994-95. The distribution of the measured radon concentrations were found in most cases to comply with log-normal distributions. Measurements in the living rooms of houses in each of the three...

  16. Software Suite for Gene and Protein Annotation Prediction and Similarity Search.

    Science.gov (United States)

    Chicco, Davide; Masseroli, Marco

    2015-01-01

    In the computational biology community, machine learning algorithms are key instruments for many applications, including the prediction of gene-functions based upon the available biomolecular annotations. Additionally, they may also be employed to compute similarity between genes or proteins. Here, we describe and discuss a software suite we developed to implement and make publicly available some of such prediction methods and a computational technique based upon Latent Semantic Indexing (LSI), which leverages both inferred and available annotations to search for semantically similar genes. The suite consists of three components. BioAnnotationPredictor is a computational software module to predict new gene-functions based upon Singular Value Decomposition of available annotations. SimilBio is a Web module that leverages annotations available or predicted by BioAnnotationPredictor to discover similarities between genes via LSI. The suite includes also SemSim, a new Web service built upon these modules to allow accessing them programmatically. We integrated SemSim in the Bio Search Computing framework (http://www.bioinformatics.deib. polimi.it/bio-seco/seco/), where users can exploit the Search Computing technology to run multi-topic complex queries on multiple integrated Web services. Accordingly, researchers may obtain ranked answers involving the computation of the functional similarity between genes in support of biomedical knowledge discovery.

  17. BSSF: a fingerprint based ultrafast binding site similarity search and function analysis server

    Directory of Open Access Journals (Sweden)

    Jiang Hualiang

    2010-01-01

    Full Text Available Abstract Background Genome sequencing and post-genomics projects such as structural genomics are extending the frontier of the study of sequence-structure-function relationship of genes and their products. Although many sequence/structure-based methods have been devised with the aim of deciphering this delicate relationship, there still remain large gaps in this fundamental problem, which continuously drives researchers to develop novel methods to extract relevant information from sequences and structures and to infer the functions of newly identified genes by genomics technology. Results Here we present an ultrafast method, named BSSF(Binding Site Similarity & Function, which enables researchers to conduct similarity searches in a comprehensive three-dimensional binding site database extracted from PDB structures. This method utilizes a fingerprint representation of the binding site and a validated statistical Z-score function scheme to judge the similarity between the query and database items, even if their similarities are only constrained in a sub-pocket. This fingerprint based similarity measurement was also validated on a known binding site dataset by comparing with geometric hashing, which is a standard 3D similarity method. The comparison clearly demonstrated the utility of this ultrafast method. After conducting the database searching, the hit list is further analyzed to provide basic statistical information about the occurrences of Gene Ontology terms and Enzyme Commission numbers, which may benefit researchers by helping them to design further experiments to study the query proteins. Conclusions This ultrafast web-based system will not only help researchers interested in drug design and structural genomics to identify similar binding sites, but also assist them by providing further analysis of hit list from database searching.

  18. SW#db: GPU-Accelerated Exact Sequence Similarity Database Search.

    Directory of Open Access Journals (Sweden)

    Matija Korpar

    Full Text Available In recent years we have witnessed a growth in sequencing yield, the number of samples sequenced, and as a result-the growth of publicly maintained sequence databases. The increase of data present all around has put high requirements on protein similarity search algorithms with two ever-opposite goals: how to keep the running times acceptable while maintaining a high-enough level of sensitivity. The most time consuming step of similarity search are the local alignments between query and database sequences. This step is usually performed using exact local alignment algorithms such as Smith-Waterman. Due to its quadratic time complexity, alignments of a query to the whole database are usually too slow. Therefore, the majority of the protein similarity search methods prior to doing the exact local alignment apply heuristics to reduce the number of possible candidate sequences in the database. However, there is still a need for the alignment of a query sequence to a reduced database. In this paper we present the SW#db tool and a library for fast exact similarity search. Although its running times, as a standalone tool, are comparable to the running times of BLAST, it is primarily intended to be used for exact local alignment phase in which the database of sequences has already been reduced. It uses both GPU and CPU parallelization and was 4-5 times faster than SSEARCH, 6-25 times faster than CUDASW++ and more than 20 times faster than SSW at the time of writing, using multiple queries on Swiss-prot and Uniref90 databases.

  19. Efficient Similarity Search Using the Earth Mover's Distance for Large Multimedia Databases

    DEFF Research Database (Denmark)

    Assent, Ira; Wichterich, Marc; Meisen, Tobias

    2008-01-01

    Multimedia similarity search in large databases requires efficient query processing. The Earth mover's distance, introduced in computer vision, is successfully used as a similarity model in a number of small-scale applications. Its computational complexity hindered its adoption in large multimedia...... databases. We enable directly indexing the Earth mover's distance in structures such as the R-tree and the VA-file by providing the accurate 'MinDist' function to any bounding rectangle in the index. We exploit the computational structure of the new MinDist to derive a new lower bound for the EMD Min...

  20. Applying Statistical Models and Parametric Distance Measures for Music Similarity Search

    Science.gov (United States)

    Lukashevich, Hanna; Dittmar, Christian; Bastuck, Christoph

    Automatic deriving of similarity relations between music pieces is an inherent field of music information retrieval research. Due to the nearly unrestricted amount of musical data, the real-world similarity search algorithms have to be highly efficient and scalable. The possible solution is to represent each music excerpt with a statistical model (ex. Gaussian mixture model) and thus to reduce the computational costs by applying the parametric distance measures between the models. In this paper we discuss the combinations of applying different parametric modelling techniques and distance measures and weigh the benefits of each one against the others.

  1. Parallel implementation of 3D protein structure similarity searches using a GPU and the CUDA.

    Science.gov (United States)

    Mrozek, Dariusz; Brożek, Miłosz; Małysiak-Mrozek, Bożena

    2014-02-01

    Searching for similar 3D protein structures is one of the primary processes employed in the field of structural bioinformatics. However, the computational complexity of this process means that it is constantly necessary to search for new methods that can perform such a process faster and more efficiently. Finding molecular substructures that complex protein structures have in common is still a challenging task, especially when entire databases containing tens or even hundreds of thousands of protein structures must be scanned. Graphics processing units (GPUs) and general purpose graphics processing units (GPGPUs) can perform many time-consuming and computationally demanding processes much more quickly than a classical CPU can. In this paper, we describe the GPU-based implementation of the CASSERT algorithm for 3D protein structure similarity searching. This algorithm is based on the two-phase alignment of protein structures when matching fragments of the compared proteins. The GPU (GeForce GTX 560Ti: 384 cores, 2GB RAM) implementation of CASSERT ("GPU-CASSERT") parallelizes both alignment phases and yields an average 180-fold increase in speed over its CPU-based, single-core implementation on an Intel Xeon E5620 (2.40GHz, 4 cores). In this paper, we show that massive parallelization of the 3D structure similarity search process on many-core GPU devices can reduce the execution time of the process, allowing it to be performed in real time. GPU-CASSERT is available at: http://zti.polsl.pl/dmrozek/science/gpucassert/cassert.htm.

  2. Query-seeded iterative sequence similarity searching improves selectivity 5-20-fold.

    Science.gov (United States)

    Pearson, William R; Li, Weizhong; Lopez, Rodrigo

    2017-04-20

    Iterative similarity search programs, like psiblast, jackhmmer, and psisearch, are much more sensitive than pairwise similarity search methods like blast and ssearch because they build a position specific scoring model (a PSSM or HMM) that captures the pattern of sequence conservation characteristic to a protein family. But models are subject to contamination; once an unrelated sequence has been added to the model, homologs of the unrelated sequence will also produce high scores, and the model can diverge from the original protein family. Examination of alignment errors during psiblast PSSM contamination suggested a simple strategy for dramatically reducing PSSM contamination. psiblast PSSMs are built from the query-based multiple sequence alignment (MSA) implied by the pairwise alignments between the query model (PSSM, HMM) and the subject sequences in the library. When the original query sequence residues are inserted into gapped positions in the aligned subject sequence, the resulting PSSM rarely produces alignment over-extensions or alignments to unrelated sequences. This simple step, which tends to anchor the PSSM to the original query sequence and slightly increase target percent identity, can reduce the frequency of false-positive alignments more than 20-fold compared with psiblast and jackhmmer, with little loss in search sensitivity. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. Query-seeded iterative sequence similarity searching improves selectivity 5–20-fold

    Science.gov (United States)

    Li, Weizhong; Lopez, Rodrigo

    2017-01-01

    Abstract Iterative similarity search programs, like psiblast, jackhmmer, and psisearch, are much more sensitive than pairwise similarity search methods like blast and ssearch because they build a position specific scoring model (a PSSM or HMM) that captures the pattern of sequence conservation characteristic to a protein family. But models are subject to contamination; once an unrelated sequence has been added to the model, homologs of the unrelated sequence will also produce high scores, and the model can diverge from the original protein family. Examination of alignment errors during psiblast PSSM contamination suggested a simple strategy for dramatically reducing PSSM contamination. psiblast PSSMs are built from the query-based multiple sequence alignment (MSA) implied by the pairwise alignments between the query model (PSSM, HMM) and the subject sequences in the library. When the original query sequence residues are inserted into gapped positions in the aligned subject sequence, the resulting PSSM rarely produces alignment over-extensions or alignments to unrelated sequences. This simple step, which tends to anchor the PSSM to the original query sequence and slightly increase target percent identity, can reduce the frequency of false-positive alignments more than 20-fold compared with psiblast and jackhmmer, with little loss in search sensitivity. PMID:27923999

  4. Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis

    Directory of Open Access Journals (Sweden)

    Kim Sunghwan

    2012-11-01

    Full Text Available Abstract Background To improve the utility of PubChem, a public repository containing biological activities of small molecules, the PubChem3D project adds computationally-derived three-dimensional (3-D descriptions to the small-molecule records contained in the PubChem Compound database and provides various search and analysis tools that exploit 3-D molecular similarity. Therefore, the efficient use of PubChem3D resources requires an understanding of the statistical and biological meaning of computed 3-D molecular similarity scores between molecules. Results The present study investigated effects of employing multiple conformers per compound upon the 3-D similarity scores between ten thousand randomly selected biologically-tested compounds (10-K set and between non-inactive compounds in a given biological assay (156-K set. When the “best-conformer-pair” approach, in which a 3-D similarity score between two compounds is represented by the greatest similarity score among all possible conformer pairs arising from a compound pair, was employed with ten diverse conformers per compound, the average 3-D similarity scores for the 10-K set increased by 0.11, 0.09, 0.15, 0.16, 0.07, and 0.18 for STST-opt, CTST-opt, ComboTST-opt, STCT-opt, CTCT-opt, and ComboTCT-opt, respectively, relative to the corresponding averages computed using a single conformer per compound. Interestingly, the best-conformer-pair approach also increased the average 3-D similarity scores for the non-inactive–non-inactive (NN pairs for a given assay, by comparable amounts to those for the random compound pairs, although some assays showed a pronounced increase in the per-assay NN-pair 3-D similarity scores, compared to the average increase for the random compound pairs. Conclusion These results suggest that the use of ten diverse conformers per compound in PubChem bioassay data analysis using 3-D molecular similarity is not expected to increase the separation of non

  5. Semantic similarity measures in the biomedical domain by leveraging a web search engine.

    Science.gov (United States)

    Hsieh, Sheau-Ling; Chang, Wen-Yung; Chen, Chi-Huang; Weng, Yung-Ching

    2013-07-01

    Various researches in web related semantic similarity measures have been deployed. However, measuring semantic similarity between two terms remains a challenging task. The traditional ontology-based methodologies have a limitation that both concepts must be resided in the same ontology tree(s). Unfortunately, in practice, the assumption is not always applicable. On the other hand, if the corpus is sufficiently adequate, the corpus-based methodologies can overcome the limitation. Now, the web is a continuous and enormous growth corpus. Therefore, a method of estimating semantic similarity is proposed via exploiting the page counts of two biomedical concepts returned by Google AJAX web search engine. The features are extracted as the co-occurrence patterns of two given terms P and Q, by querying P, Q, as well as P AND Q, and the web search hit counts of the defined lexico-syntactic patterns. These similarity scores of different patterns are evaluated, by adapting support vector machines for classification, to leverage the robustness of semantic similarity measures. Experimental results validating against two datasets: dataset 1 provided by A. Hliaoutakis; dataset 2 provided by T. Pedersen, are presented and discussed. In dataset 1, the proposed approach achieves the best correlation coefficient (0.802) under SNOMED-CT. In dataset 2, the proposed method obtains the best correlation coefficient (SNOMED-CT: 0.705; MeSH: 0.723) with physician scores comparing with measures of other methods. However, the correlation coefficients (SNOMED-CT: 0.496; MeSH: 0.539) with coder scores received opposite outcomes. In conclusion, the semantic similarity findings of the proposed method are close to those of physicians' ratings. Furthermore, the study provides a cornerstone investigation for extracting fully relevant information from digitizing, free-text medical records in the National Taiwan University Hospital database.

  6. Quantifying the Association between Personality Similarity and Marital Adjustment Using Profile Correlations: A Cautionary Tale.

    Science.gov (United States)

    Humbad, Mikhila N; Donnellan, M Brent; Iacono, William G; McGue, Matthew; Burt, S Alexandra

    2013-02-01

    Profile correlations are sometimes used to quantify personality trait similarity between relationship partners. These coefficients are then used to test whether similar couples are happier couples. The current paper describes several different methods of calculating profile correlations and outlines procedures for testing whether these coefficients are related to marital adjustment in a sample of 1,643 couples. There was little evidence that profile correlations were related to marital adjustment after accounting for normativeness (i.e., the degree to which individual's matched the typical personality profile) and when accounting for each individual's personality attributes. Results suggest that researchers using profile correlations should be cautious given that the interpretation and psychological meaning of results often depend on how the coefficients are calculated.

  7. SHOP: receptor-based scaffold hopping by GRID-based similarity searches

    DEFF Research Database (Denmark)

    Bergmann, Rikke; Liljefors, Tommy; Sørensen, Morten D

    2009-01-01

    A new field-derived 3D method for receptor-based scaffold hopping, implemented in the software SHOP, is presented. Information from a protein-ligand complex is utilized to substitute a fragment of the ligand with another fragment from a database of synthetically accessible scaffolds. A GRID......-based interaction profile of the receptor and geometrical descriptions of a ligand scaffold are used to obtain new scaffolds with different structural features and are able to replace the original scaffold in the protein-ligand complex. An enrichment study was successfully performed verifying the ability of SHOP...... to find known active CDK2 scaffolds in a database. Additionally, SHOP was used for suggesting new inhibitors of p38 MAP kinase. Four p38 complexes were used to perform six scaffold searches. Several new scaffolds were suggested, and the resulting compounds were successfully docked into the query proteins....

  8. Detecting and classifying method based on similarity matching of Android malware behavior with profile.

    Science.gov (United States)

    Jang, Jae-Wook; Yun, Jaesung; Mohaisen, Aziz; Woo, Jiyoung; Kim, Huy Kang

    2016-01-01

    Mass-market mobile security threats have increased recently due to the growth of mobile technologies and the popularity of mobile devices. Accordingly, techniques have been introduced for identifying, classifying, and defending against mobile threats utilizing static, dynamic, on-device, and off-device techniques. Static techniques are easy to evade, while dynamic techniques are expensive. On-device techniques are evasion, while off-device techniques need being always online. To address some of those shortcomings, we introduce Andro-profiler, a hybrid behavior based analysis and classification system for mobile malware. Andro-profiler main goals are efficiency, scalability, and accuracy. For that, Andro-profiler classifies malware by exploiting the behavior profiling extracted from the integrated system logs including system calls. Andro-profiler executes a malicious application on an emulator in order to generate the integrated system logs, and creates human-readable behavior profiles by analyzing the integrated system logs. By comparing the behavior profile of malicious application with representative behavior profile for each malware family using a weighted similarity matching technique, Andro-profiler detects and classifies it into malware families. The experiment results demonstrate that Andro-profiler is scalable, performs well in detecting and classifying malware with accuracy greater than 98 %, outperforms the existing state-of-the-art work, and is capable of identifying 0-day mobile malware samples.

  9. Searching the protein structure database for ligand-binding site similarities using CPASS v.2

    Directory of Open Access Journals (Sweden)

    Caprez Adam

    2011-01-01

    Full Text Available Abstract Background A recent analysis of protein sequences deposited in the NCBI RefSeq database indicates that ~8.5 million protein sequences are encoded in prokaryotic and eukaryotic genomes, where ~30% are explicitly annotated as "hypothetical" or "uncharacterized" protein. Our Comparison of Protein Active-Site Structures (CPASS v.2 database and software compares the sequence and structural characteristics of experimentally determined ligand binding sites to infer a functional relationship in the absence of global sequence or structure similarity. CPASS is an important component of our Functional Annotation Screening Technology by NMR (FAST-NMR protocol and has been successfully applied to aid the annotation of a number of proteins of unknown function. Findings We report a major upgrade to our CPASS software and database that significantly improves its broad utility. CPASS v.2 is designed with a layered architecture to increase flexibility and portability that also enables job distribution over the Open Science Grid (OSG to increase speed. Similarly, the CPASS interface was enhanced to provide more user flexibility in submitting a CPASS query. CPASS v.2 now allows for both automatic and manual definition of ligand-binding sites and permits pair-wise, one versus all, one versus list, or list versus list comparisons. Solvent accessible surface area, ligand root-mean square difference, and Cβ distances have been incorporated into the CPASS similarity function to improve the quality of the results. The CPASS database has also been updated. Conclusions CPASS v.2 is more than an order of magnitude faster than the original implementation, and allows for multiple simultaneous job submissions. Similarly, the CPASS database of ligand-defined binding sites has increased in size by ~ 38%, dramatically increasing the likelihood of a positive search result. The modification to the CPASS similarity function is effective in reducing CPASS similarity scores

  10. Efficient Retrieval of Images for Search Engine by Visual Similarity and Re Ranking

    OpenAIRE

    Viswa S S

    2013-01-01

    Nowadays, web scale image search engines (e.g. Google Image Search, Microsoft Live Image Search) rely almost purely on surrounding text features. Users type keywords in hope of finding a certain type of images. The search engine returns thousands of images ranked by the text keywords extracted from the surrounding text. However, many of returned images are noisy, disorganized, or irrelevant. Even Google and Microsoft have no Visual Information for searching of images. Using visual information...

  11. Fast association-rule-based similarity search in 3D models

    Science.gov (United States)

    Dua, Sumeet; Jain, Vineet

    2004-11-01

    Advances in automated data collection tools in design and manufacturing have far exceeded our capacity to analyze this data for novel information. Techniques of data mining and knowledge discovery in large databases promise computationally efficient and accurate means to analyze such data for patterns and similar structures. In this paper, we present a unique data mining approach for finding similarities in classes of 3D models, using discovery of association rules. PCA is first performed on the 3D model to transform it along first principal axis. Transformed 3D model is then sliced and segmented along multiple principal axes, such that each slice can be interpreted as a transaction in a transaction database. Association-rule discovery is performed on this transaction space for multiple models and common association rules among those transactions are stored as a representative of a class of models. We have evaluated the performance of association rules for efficient representation of classes of shape models. The method is time and space efficient, besides presenting a novel paradigm for searching content dependencies in a database of 3D models.

  12. Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles.

    Science.gov (United States)

    Zhu, Jie; Qin, Yufang; Liu, Taigang; Wang, Jun; Zheng, Xiaoqi

    2013-01-01

    Identification of gene-phenotype relationships is a fundamental challenge in human health clinic. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, a lot of network-based approaches were proposed based on different underlying models. A recent comparative study showed that diffusion-based methods achieve the state-of-the-art predictive performance. In this paper, a new diffusion-based method was proposed to prioritize candidate disease genes. Diffusion profile of a disease was defined as the stationary distribution of candidate genes given a random walk with restart where similarities between phenotypes are incorporated. Then, candidate disease genes are prioritized by comparing their diffusion profiles with that of the disease. Finally, the effectiveness of our method was demonstrated through the leave-one-out cross-validation against control genes from artificial linkage intervals and randomly chosen genes. Comparative study showed that our method achieves improved performance compared to some classical diffusion-based methods. To further illustrate our method, we used our algorithm to predict new causing genes of 16 multifactorial diseases including Prostate cancer and Alzheimer's disease, and the top predictions were in good consistent with literature reports. Our study indicates that integration of multiple information sources, especially the phenotype similarity profile data, and introduction of global similarity measure between disease and gene diffusion profiles are helpful for prioritizing candidate disease genes. Programs and data are available upon request.

  13. The Use of Cattell's Profile Similarity Coefficient in the Classification of Football Athletes.

    Science.gov (United States)

    Evans, Virden; Johnson, DeWayne

    Using Cattell's Profile Similarity Coefficient, 154 high school football players from 21 different public high schools were classified as being successful or unsuccessful. Seventeen physical and motor ability variables relating to athletic ability were administered to the football players. The variables included: (1) standard height; (2) body…

  14. PSimScan: algorithm and utility for fast protein similarity search.

    Directory of Open Access Journals (Sweden)

    Anna Kaznadzey

    Full Text Available In the era of metagenomics and diagnostics sequencing, the importance of protein comparison methods of boosted performance cannot be overstated. Here we present PSimScan (Protein Similarity Scanner, a flexible open source protein similarity search tool which provides a significant gain in speed compared to BLASTP at the price of controlled sensitivity loss. The PSimScan algorithm introduces a number of novel performance optimization methods that can be further used by the community to improve the speed and lower hardware requirements of bioinformatics software. The optimization starts at the lookup table construction, then the initial lookup table-based hits are passed through a pipeline of filtering and aggregation routines of increasing computational complexity. The first step in this pipeline is a novel algorithm that builds and selects 'similarity zones' aggregated from neighboring matches on small arrays of adjacent diagonals. PSimScan performs 5 to 100 times faster than the standard NCBI BLASTP, depending on chosen parameters, and runs on commodity hardware. Its sensitivity and selectivity at the slowest settings are comparable to the NCBI BLASTP's and decrease with the increase of speed, yet stay at the levels reasonable for many tasks. PSimScan is most advantageous when used on large collections of query sequences. Comparing the entire proteome of Streptocuccus pneumoniae (2,042 proteins to the NCBI's non-redundant protein database of 16,971,855 records takes 6.5 hours on a moderately powerful PC, while the same task with the NCBI BLASTP takes over 66 hours. We describe innovations in the PSimScan algorithm in considerable detail to encourage bioinformaticians to improve on the tool and to use the innovations in their own software development.

  15. Similar compounds searching system by using the gene expression microarray database.

    Science.gov (United States)

    Toyoshiba, Hiroyoshi; Sawada, Hiroshi; Naeshiro, Ichiro; Horinouchi, Akira

    2009-04-10

    Numbers of microarrays have been examined and several public and commercial databases have been developed. However, it is not easy to compare in-house microarray data with those in a database because of insufficient reproducibility due to differences in the experimental conditions. As one of the approach to use these databases, we developed the similar compounds searching system (SCSS) on a toxicogenomics database. The datasets of 55 compounds administered to rats in the Toxicogenomics Project (TGP) database in Japan were used in this study. Using the fold-change ranking method developed by Lamb et al. [Lamb, J., Crawford, E.D., Peck, D., Modell, J.W., Blat, I.C., Wrobel, M.J., Lerner, J., Brunet, J.P., Subramanian, A., Ross, K.N., Reich, M., Hieronymus, H., Wei, G., Armstrong, S.A., Haggarty, S.J., Clemons, P.A., Wei, R., Carr, S.A., Lander, E.S., Golub, T.R., 2006. The connectivity map: using gene-expression signatures to connect small molecules, genes, and disease. Science 313, 1929-1935] and criteria called hit ratio, the system let us compare in-house microarray data and those in the database. In-house generated data for clofibrate, phenobarbital, and a proprietary compound were tested to evaluate the performance of the SCSS method. Phenobarbital and clofibrate, which were included in the TGP database, scored highest by the SCSS method. Other high scoring compounds had effects similar to either phenobarbital (a cytochrome P450s inducer) or clofibrate (a peroxisome proliferator). Some of high scoring compounds identified using the proprietary compound-administered rats have been known to cause similar toxicological changes in different species. Our results suggest that the SCSS method could be used in drug discovery and development. Moreover, this method may be a powerful tool to understand the mechanisms by which biological systems respond to various chemical compounds and may also predict adverse effects of new compounds.

  16. FitSearch: a robust way to interpret a yeast fitness profile in terms of drug's mode-of-action.

    Science.gov (United States)

    Lee, Minho; Han, Sangjo; Chang, Hyeshik; Kwak, Youn-Sig; Weller, David M; Kim, Dongsup

    2013-01-01

    Yeast deletion-mutant collections have been successfully used to infer the mode-of-action of drugs especially by profiling chemical-genetic and genetic-genetic interactions on a genome-wide scale. Although tens of thousands of those profiles are publicly available, a lack of an accurate method for mining such data has been a major bottleneck for more widespread use of these useful resources. For general usage of those public resources, we designed FitRankDB as a general repository of fitness profiles, and developed a new search algorithm, FitSearch, for identifying the profiles that have a high similarity score with statistical significance for a given fitness profile. We demonstrated that our new repository and algorithm are highly beneficial to researchers who attempting to make hypotheses based on unknown modes-of-action of bioactive compounds, regardless of the types of experiments that have been performed using yeast deletion-mutant collection in various types of different measurement platforms, especially non-chip-based platforms. We showed that our new database and algorithm are useful when attempting to construct a hypothesis regarding the unknown function of a bioactive compound through small-scale experiments with a yeast deletion collection in a platform independent manner. The FitRankDB and FitSearch enhance the ease of searching public yeast fitness profiles and obtaining insights into unknown mechanisms of action of drugs. FitSearch is freely available at http://fitsearch.kaist.ac.kr.

  17. The collaborative search by tag-based user profile in social media.

    Science.gov (United States)

    Xie, Haoran; Li, Xiaodong; Wang, Jiantao; Li, Qing; Cai, Yi

    2014-01-01

    Recently, we have witnessed the popularity and proliferation of social media applications (e.g., Delicious, Flickr, and YouTube) in the web 2.0 era. The rapid growth of user-generated data results in the problem of information overload to users. Facing such a tremendous volume of data, it is a big challenge to assist the users to find their desired data. To attack this critical problem, we propose the collaborative search approach in this paper. The core idea is that similar users may have common interests so as to help users to find their demanded data. Similar research has been conducted on the user log analysis in web search. However, the rapid growth and change of user-generated data in social media require us to discover a brand-new approach to address the unsolved issues (e.g., how to profile users, how to measure the similar users, and how to depict user-generated resources) rather than adopting existing method from web search. Therefore, we investigate various metrics to identify the similar users (user community). Moreover, we conduct the experiment on two real-life data sets by comparing the Collaborative method with the latest baselines. The empirical results show the effectiveness of the proposed approach and validate our observations.

  18. Managing biomedical image metadata for search and retrieval of similar images.

    Science.gov (United States)

    Korenblum, Daniel; Rubin, Daniel; Napel, Sandy; Rodriguez, Cesar; Beaulieu, Chris

    2011-08-01

    Radiology images are generally disconnected from the metadata describing their contents, such as imaging observations ("semantic" metadata), which are usually described in text reports that are not directly linked to the images. We developed a system, the Biomedical Image Metadata Manager (BIMM) to (1) address the problem of managing biomedical image metadata and (2) facilitate the retrieval of similar images using semantic feature metadata. Our approach allows radiologists, researchers, and students to take advantage of the vast and growing repositories of medical image data by explicitly linking images to their associated metadata in a relational database that is globally accessible through a Web application. BIMM receives input in the form of standard-based metadata files using Web service and parses and stores the metadata in a relational database allowing efficient data query and maintenance capabilities. Upon querying BIMM for images, 2D regions of interest (ROIs) stored as metadata are automatically rendered onto preview images included in search results. The system's "match observations" function retrieves images with similar ROIs based on specific semantic features describing imaging observation characteristics (IOCs). We demonstrate that the system, using IOCs alone, can accurately retrieve images with diagnoses matching the query images, and we evaluate its performance on a set of annotated liver lesion images. BIMM has several potential applications, e.g., computer-aided detection and diagnosis, content-based image retrieval, automating medical analysis protocols, and gathering population statistics like disease prevalences. The system provides a framework for decision support systems, potentially improving their diagnostic accuracy and selection of appropriate therapies.

  19. Application of 3D Zernike descriptors to shape-based ligand similarity searching

    Directory of Open Access Journals (Sweden)

    Venkatraman Vishwesh

    2009-12-01

    Full Text Available Abstract Background The identification of promising drug leads from a large database of compounds is an important step in the preliminary stages of drug design. Although shape is known to play a key role in the molecular recognition process, its application to virtual screening poses significant hurdles both in terms of the encoding scheme and speed. Results In this study, we have examined the efficacy of the alignment independent three-dimensional Zernike descriptor (3DZD for fast shape based similarity searching. Performance of this approach was compared with several other methods including the statistical moments based ultrafast shape recognition scheme (USR and SIMCOMP, a graph matching algorithm that compares atom environments. Three benchmark datasets are used to thoroughly test the methods in terms of their ability for molecular classification, retrieval rate, and performance under the situation that simulates actual virtual screening tasks over a large pharmaceutical database. The 3DZD performed better than or comparable to the other methods examined, depending on the datasets and evaluation metrics used. Reasons for the success and the failure of the shape based methods for specific cases are investigated. Based on the results for the three datasets, general conclusions are drawn with regard to their efficiency and applicability. Conclusion The 3DZD has unique ability for fast comparison of three-dimensional shape of compounds. Examples analyzed illustrate the advantages and the room for improvements for the 3DZD.

  20. Genetics of muscle strength and power: polygenic profile similarity limits skeletal muscle performance.

    Science.gov (United States)

    Hughes, David C; Day, Stephen H; Ahmetov, Ildus I; Williams, Alun G

    2011-10-01

    Environmental and genetic factors influence muscle function, resulting in large variations in phenotype between individuals. Multiple genetic variants (polygenic in nature) are thought to influence exercise-related phenotypes, yet how the relevant polymorphisms combine to influence muscular strength in individuals and populations is unclear. In this analysis, 22 genetic polymorphisms were identified in the literature that have been associated with muscular strength and power phenotypes. Using typical genotype frequencies, the probability of any given individual possessing an "optimal" polygenic profile was calculated as 0.0003% for the world population. Future identification of additional polymorphisms associated with muscular strength phenotypes would most likely reduce that probability even further. To examine the genetic potential for muscular strength within a human population, a "total genotype score" was generated for each individual within a hypothetical population of one million. The population expressed high similarity in polygenic profile with no individual differing by more than seven genotypes from a typical profile. Therefore, skeletal muscle strength potential within humans appears to be limited by polygenic profile similarity. Future research should aim to replicate more genotype-phenotype associations for muscular strength, because only five common genetic polymorphisms identified to date have positive replicated findings.

  1. The meaning profiles of anxiety and depression: similarities and differences in two age groups.

    Science.gov (United States)

    Kreitler, Shulamith

    2017-04-25

    The distinctiveness of anxiety and depression is discussed concerning their nature, definitions, uses, manifestations and determinants. The objective was to examine the difference and similarity of anxiety and depression by applying the psychosemantic approach, which is a theory and methodology based on analysing the cognitive processes applied in communicating meanings. In Study 1, there were 760 participants of both genders, 23-31 years old. They were administered the Meanings Test, which yields the respondent's meaning profile, and one of seven anxiety scales or one of three depression scales. Significant correlations between the meaning profiles and the anxiety or depression scales were summarised and compared. In Study 2, there were 78 individuals over 65 years old who were administered the Meanings Test plus an anxiety or depression scale. The findings for anxiety and depression were compared within and across age groups. The results yielded two distinct meaning profiles for anxiety and depression in the two age groups. The anxiety profile indicated more focusing on one's internal world. whereas the depression profile indicated focusing both on the personal and the interpersonally shared reality. The conclusion was that anxiety and depression are different constructs that need to be considered as separate theoretically and practically.

  2. O S 2 : Oblivious similarity based searching for encrypted data outsourced to an untrusted domain

    OpenAIRE

    Pervez, Zeeshan; AHMAD, Mahmood; Khattak, Asad Masood; Ramzan, Naeem; Khan, Wajahat Ali

    2017-01-01

    Public cloud storage services are becoming prevalent and myriad data sharing, archiving and collaborative services have emerged which harness the pay-as-you-go business model of public cloud. To ensure privacy and confidentiality often encrypted data is outsourced to such services, which further complicates the process of accessing relevant data by using search queries. Search over encrypted data schemes solve this problem by exploiting cryptographic primitives and secure indexing to identify...

  3. Personalized Profile Based Search Interface With Ranked and Clustered Display

    National Research Council Canada - National Science Library

    Kumar, Sachin; Oztekin, B. U; Ertoz, Levent; Singhal, Saurabh; Han, Euihong; Kumar, Vipin

    2001-01-01

    We have developed an experimental meta-search engine, which takes the snippets from traditional search engines and presents them to the user either in the form of clusters, indices or re-ranked list...

  4. Genome-Wide Expression Profiling of Five Mouse Models Identifies Similarities and Differences with Human Psoriasis

    Science.gov (United States)

    Swindell, William R.; Johnston, Andrew; Carbajal, Steve; Han, Gangwen; Wohn, Christian; Lu, Jun; Xing, Xianying; Nair, Rajan P.; Voorhees, John J.; Elder, James T.; Wang, Xiao-Jing; Sano, Shigetoshi; Prens, Errol P.; DiGiovanni, John; Pittelkow, Mark R.; Ward, Nicole L.; Gudjonsson, Johann E.

    2011-01-01

    Development of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features of the human disease, and standardized validation criteria for psoriasis mouse models have not been widely applied. In this study, whole-genome transcriptional profiling is used to compare gene expression patterns manifested by human psoriatic skin lesions with those that occur in five psoriasis mouse models (K5-Tie2, imiquimod, K14-AREG, K5-Stat3C and K5-TGFbeta1). While the cutaneous gene expression profiles associated with each mouse phenotype exhibited statistically significant similarity to the expression profile of psoriasis in humans, each model displayed distinctive sets of similarities and differences in comparison to human psoriasis. For all five models, correspondence to the human disease was strong with respect to genes involved in epidermal development and keratinization. Immune and inflammation-associated gene expression, in contrast, was more variable between models as compared to the human disease. These findings support the value of all five models as research tools, each with identifiable areas of convergence to and divergence from the human disease. Additionally, the approach used in this paper provides an objective and quantitative method for evaluation of proposed mouse models of psoriasis, which can be strategically applied in future studies to score strengths of mouse phenotypes relative to specific aspects of human psoriasis. PMID:21483750

  5. Chemical similarity among domesticated and wild genotypes of peanut based on n-alkanes profiles

    Directory of Open Access Journals (Sweden)

    Renata Janaína Carvalho de Souza

    2010-11-01

    Full Text Available The objective of this work was to analyze the epicuticular n-alkane profile of domesticated and wild peanut genotypes. Foliar epicuticular n-alkanes of four Arachis hypogaea genotypes and two wild species - A. monticola and A. stenosperma - were analyzed by gas chromatography. Chemical relationships between them were evaluated using the Dice coefficient and UPGMA method. Two clusters were formed: one with four A. hypogaea genotypes and the other with the two wild species. There is more similarity between the BR1 and LIGO-PE06 genotypes and between the BRS 151 L-7 and BRS Havana genotypes.

  6. A comparison of mean partialing and dual-hypothesis testing to evaluate stereotype effects when assessing profile similarity.

    Science.gov (United States)

    Conroy, David E; Pincus, Aaron L

    2006-04-01

    Assessing profile similarity is an important task in research and clinical practice, but conclusions about profile similarity may be confounded by stereotype effects that create artificially similar profiles. In this article, we review the impact of stereotype effects on profile similarity and a conventional approach to addressing this confound (i.e., mean partialing). We argue that a dual-hypothesis testing approach distinguishing the no-effect null hypothesis (i.e., is the observed similarity different from zero?) from the chance-effect null hypothesis (i.e., is the observed similarity different from chance given the distribution of profile elements?) can provide a more nuanced understanding of profile similarity. To compare results from these 2 perspectives, we analyzed data from 2 samples using within-persons correlations as indexes of similar profile shapes. Results indicated that a dual-hypothesis testing approach led to more conservative conclusions about profile similarity (i.e., fewer Type 1 errors) than mean partialing and may be especially valuable when dealing with moderate-sized stereotype effects. Both approaches led to identical conclusions when stereotype effects were largest. Conclusions emphasize the relative merits and limitations of the dual-hypothesis testing approach as well as potential future applications in the personality assessment domain.

  7. Self-similar solutions with compactly supported profile of some nonlinear Schrodinger equations

    Directory of Open Access Journals (Sweden)

    Pascal Begout

    2014-04-01

    Full Text Available ``Sharp localized'' solutions (i.e. with compact support for each given time t of a singular nonlinear type Schr\\"odinger equation in the whole space $\\mathbb{R}^N$ are constructed here under the assumption that they have a self-similar structure. It requires the assumption that the external forcing term satisfies that $\\mathbf{f}(t,x=t^{-(\\mathbf{p}-2/2}\\mathbf{F}(t^{-1/2}x$ for some complex exponent $\\mathbf{p}$ and for some profile function $\\mathbf{F}$ which is assumed to be with compact support in $\\mathbb{R}^N$. We show the existence of solutions of the form $\\mathbf{u}(t,x=t^{\\mathbf{p}/2}\\mathbf{U}(t^{-1/2}x$, with a profile $\\mathbf{U}$, which also has compact support in $\\mathbb{R}^N$. The proof of the localization of the support of the profile $\\mathbf{U}$ uses some suitable energy method applied to the stationary problem satisfied by $\\mathbf{U}$ after some unknown transformation.

  8. Wake structure and similar behavior of wake profiles downstream of a plunging airfoil

    Directory of Open Access Journals (Sweden)

    Ali R. DAVARI

    2017-08-01

    Full Text Available Very limited attention has already been paid to the velocity behavior in the wake region in unsteady aerodynamic problems. A series of tests has been performed on a flapping airfoil in a subsonic wind tunnel to study the wake structure for different sets of mean angle of attack, plunging amplitude and reduced frequency. In this study, the velocity profiles in the wake for various oscillation parameters have been measured using a wide shoulder rake, especially designed for the present experiments. The airfoil under consideration was a critical section of a 660 kW wind turbine. The results show that for a flapping airfoil the wake structure can be of drag producing type, thrust producing or neutral, depending on the mean angle of attack, oscillation amplitude and reduced frequency. In a thrust producing wake, a high-momentum high-velocity jet flow is formed in the core region of the wake instead of the conventional low-momentum flow. As a result, the drag force normally experienced by the body due to the momentum deficit would be replaced by a thrust force. According to the results, the momentum loss in the wake decreases as the reduced frequency increases. The thrust producing wake pattern for the flapping airfoil has been observed for sufficiently low angles of attack in the absence of the viscous effects. This phenomenon has also been observed for either high oscillation amplitudes or high reduced frequencies. According to the results, for different reduced frequencies and plunging amplitudes, such that the product of them be a constant, the velocity profiles exhibit similar behavior and coalesce on each other. This similarity parameter works excellently at small angles of attack. However, at near stall boundaries, the similarity is not as evident as before.

  9. Similarity Digest Search: A Survey and Comparative Analysis of Strategies to Perform Known File Filtering Using Approximate Matching

    Directory of Open Access Journals (Sweden)

    Vitor Hugo Galhardo Moia

    2017-01-01

    Full Text Available Digital forensics is a branch of Computer Science aiming at investigating and analyzing electronic devices in the search for crime evidence. There are several ways to perform this search. Known File Filter (KFF is one of them, where a list of interest objects is used to reduce/separate data for analysis. Holding a database of hashes of such objects, the examiner performs lookups for matches against the target device. However, due to limitations over hash functions (inability to detect similar objects, new methods have been designed, called approximate matching. This sort of function has interesting characteristics for KFF investigations but suffers mainly from high costs when dealing with huge data sets, as the search is usually done by brute force. To mitigate this problem, strategies have been developed to better perform lookups. In this paper, we present the state of the art of similarity digest search strategies, along with a detailed comparison involving several aspects, as time complexity, memory requirement, and search precision. Our results show that none of the approaches address at least these main aspects. Finally, we discuss future directions and present requirements for a new strategy aiming to fulfill current limitations.

  10. Primary EBV infection induces an expression profile distinct from other viruses but similar to hemophagocytic syndromes.

    Directory of Open Access Journals (Sweden)

    Samantha K Dunmire

    Full Text Available Epstein-Barr Virus (EBV causes infectious mononucleosis and establishes lifelong infection associated with cancer and autoimmune disease. To better understand immunity to EBV, we performed a prospective study of natural infection in healthy humans. Transcriptome analysis defined a striking and reproducible expression profile during acute infection but no lasting gene changes were apparent during latent infection. Comparing the EBV response profile to multiple other acute viral infections, including influenza A (influenza, respiratory syncytial virus (RSV, human rhinovirus (HRV, attenuated yellow fever virus (YFV, and Dengue fever virus (DENV, revealed similarity only to DENV. The signature shared by EBV and DENV was also present in patients with hemophagocytic syndromes, suggesting these two viruses cause uncontrolled inflammatory responses. Interestingly, while EBV induced a strong type I interferon response, a subset of interferon induced genes, including MX1, HERC5, and OAS1, were not upregulated, suggesting a mechanism by which viral antagonism of immunity results in a profound inflammatory response. These data provide an important first description of the response to a natural herpesvirus infection in humans.

  11. Proposal for a Similar Question Search System on a Q&A Site

    Directory of Open Access Journals (Sweden)

    Katsutoshi Kanamori

    2014-06-01

    Full Text Available There is a service to help Internet users obtain answers to specific questions when they visit a Q&A site. A Q&A site is very useful for the Internet user, but posted questions are often not answered immediately. This delay in answering occurs because in most cases another site user is answering the question manually. In this study, we propose a system that can present a question that is similar to a question posted by a user. An advantage of this system is that a user can refer to an answer to a similar question. This research measures the similarity of a candidate question based on word and dependency parsing. In an experiment, we examined the effectiveness of the proposed system for questions actually posted on the Q&A site. The result indicates that the system can show the questioner the answer to a similar question. However, the system still has a number of aspects that should be improved.

  12. Clinical Diagnostics in Human Genetics with Semantic Similarity Searches in Ontologies

    OpenAIRE

    Köhler, Sebastian; Schulz, Marcel H.; Krawitz, Peter; Bauer, Sebastian; Dölken, Sandra; Ott, Claus E.; Mundlos, Christine; Horn, Denise; Mundlos, Stefan; Robinson, Peter N

    2009-01-01

    The differential diagnostic process attempts to identify candidate diseases that best explain a set of clinical features. This process can be complicated by the fact that the features can have varying degrees of specificity, as well as by the presence of features unrelated to the disease itself. Depending on the experience of the physician and the availability of laboratory tests, clinical abnormalities may be described in greater or lesser detail. We have adapted semantic similarity metrics ...

  13. HBLAST: Parallelised sequence similarity--A Hadoop MapReducable basic local alignment search tool.

    Science.gov (United States)

    O'Driscoll, Aisling; Belogrudov, Vladislav; Carroll, John; Kropp, Kai; Walsh, Paul; Ghazal, Peter; Sleator, Roy D

    2015-04-01

    The recent exponential growth of genomic databases has resulted in the common task of sequence alignment becoming one of the major bottlenecks in the field of computational biology. It is typical for these large datasets and complex computations to require cost prohibitive High Performance Computing (HPC) to function. As such, parallelised solutions have been proposed but many exhibit scalability limitations and are incapable of effectively processing "Big Data" - the name attributed to datasets that are extremely large, complex and require rapid processing. The Hadoop framework, comprised of distributed storage and a parallelised programming framework known as MapReduce, is specifically designed to work with such datasets but it is not trivial to efficiently redesign and implement bioinformatics algorithms according to this paradigm. The parallelisation strategy of "divide and conquer" for alignment algorithms can be applied to both data sets and input query sequences. However, scalability is still an issue due to memory constraints or large databases, with very large database segmentation leading to additional performance decline. Herein, we present Hadoop Blast (HBlast), a parallelised BLAST algorithm that proposes a flexible method to partition both databases and input query sequences using "virtual partitioning". HBlast presents improved scalability over existing solutions and well balanced computational work load while keeping database segmentation and recompilation to a minimum. Enhanced BLAST search performance on cheap memory constrained hardware has significant implications for in field clinical diagnostic testing; enabling faster and more accurate identification of pathogenic DNA in human blood or tissue samples. Copyright © 2015 Elsevier Inc. All rights reserved.

  14. Big-Five Personality Profile of Executive Search Recruiters.

    Science.gov (United States)

    Dykeman, Cass; Dykeman, James J.

    1996-01-01

    This study surveyed a sample of nationally certified executive search recruiters (N=114), with the use of the NEO Five-Factor Inventory (NEO-FFI). Chi-square test analyses revealed that this sample differed significantly from adult norms on four of the five NEO-FFI scales: Neuroticism, Extraversion, Openness, and Conscientiousness. Implications of…

  15. Health Effects Profiles for Searching Selected Lockheed DIALOG Data Bases.

    Science.gov (United States)

    Clement, Linda Lee

    This preliminary study attempted to determine the most effective search strategies for the topic "health effects" in relation to specific chemicals and/or pollutants--in this case, asbestos--for each of five selected Lockheed DIALOG data bases: BIOSIS Previews, Chemical Abstracts Condensates (Chemcon), NTIS, Enviroline, and Pollution…

  16. Maternal and perinatal outcomes in 143 Danish women with gestational diabetes mellitus and 143 controls with a similar risk profile

    DEFF Research Database (Denmark)

    Jensen, Dorte Møller; Sørensen, B; Feilberg-Jørgensen, N

    2000-01-01

    To assess maternal and fetal outcomes in pregnancies complicated by gestational diabetes mellitus (GDM) compared to non-diabetic pregnancies with an otherwise similar risk profile and to study the association between different anti-diabetic treatments and fetal outcomes....

  17. Proventil HFA and ventolin have similar safety profiles during regular use.

    Science.gov (United States)

    Tinkelman, D G; Bleecker, E R; Ramsdell, J; Ekholm, B P; Klinger, N M; Colice, G L; Slade, H B

    1998-02-01

    . No clinically meaningful changes in results of standard laboratory tests were found in any treatment group during this study. Proventil HFA had a similar safety profile as Ventolin during regular use. A dosage of 16 puffs per day of propellant HFA-134a was well tolerated by asthmatics. Regular use of either Proventil HFA or Ventolin did not cause asthma control to deteriorate.

  18. Autoantibody profiles in autoimmune hepatitis and chronic hepatitis C identifies similarities in patients with severe disease.

    Science.gov (United States)

    Amin, Kawa; Rasool, Aram H; Hattem, Ali; Al-Karboly, Taha Am; Taher, Taher E; Bystrom, Jonas

    2017-02-28

    To determine how the auto-antibodies (Abs) profiles overlap in chronic hepatitis C infection (CHC) and autoimmune hepatitis (AIH) and correlate to liver disease. Levels of antinuclear Ab, smooth muscle antibody (SMA) and liver/kidney microsomal-1 (LKM-1) Ab and markers of liver damage were determined in the sera of 50 patients with CHC infection, 20 AIH patients and 20 healthy controls using enzyme linked immunosorbent assay and other immune assays. We found that AIH patients had more severe liver disease as determined by elevation of total IgG, alkaline phosphatase, total serum bilirubin and serum transaminases and significantly higher prevalence of the three non-organ-specific autoantibodies (auto-Abs) than CHC patients. Antinuclear Ab, SMA and LKM-1 Ab were also present in 36% of CHC patients and related to disease severity. CHC cases positive for auto-Abs were directly comparable to AIH in respect of most markers of liver damage and total IgG. These cases had longer disease duration compared with auto-Ab negative cases, but there was no difference in gender, age or viral load. KLM-1+ Ab CHC cases showed best overlap with AIH. Auto-Ab levels in CHC may be important markers of disease severity and positive cases have a disease similar to AIH. Auto-Abs might have a pathogenic role as indicated by elevated markers of liver damage. Future studies will unravel any novel associations between these two diseases, whether genetic or other.

  19. Analyses of sublittoral macrobenthic community change in a marine nature reserve using similarity profiles (SIMPROF).

    Science.gov (United States)

    Somerfield, P J; Burton, M; Sanderson, W G

    2014-12-01

    Sublittoral macrobenthic communities in the Skomer Marine Nature Reserve (SMNR), Pembrokeshire, Wales, were sampled at 10 stations in 1993, 1996, 1998, 2003, 2007 and 2009 using a Day grab and a 0.5 mm mesh. The time series is analysed using Similarities Profiles (SIMPROF) tests and associated methods. Q-mode analysis using clustering with Type 1 SIMPROF addresses multivariate structure among samples, showing that there is clear structure associated with differences among years. Inverse (r-mode) analysis using Type 2 SIMPROF decisively rejects a hypothesis that species are not associated with each other. Clustering of the variables (species) with Type 3 SIMPROF identifies groups of species which covary coherently through the time-series. The time-series is characterised by a dramatic decline in abundances and diversity between the 1993 and 1996 surveys. By 1998 there had been a shift in community composition from the 1993 situation, with different species dominating. Communities had recovered in terms of abundance and species richness, but different species dominated the community. No single factor could be identified which unequivocally explained the dramatic changes observed in the SMNR. Possible causes were the effects of dispersed oil and dispersants from the Sea Empress oil spill in February 1996 and the cessation of dredge-spoil disposal off St Annes Head in 1995, but the most likely cause was severe weather. With many species, and a demonstrable recovery from an impact, communities within the SMNR appear to be diverse and resilient. If attributable to natural storms, the changes observed here indicate that natural variability may be much more important than is generally taken into account in the design of monitoring programmes. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. SymDex: increasing the efficiency of chemical fingerprint similarity searches for comparing large chemical libraries by using query set indexing.

    Science.gov (United States)

    Tai, David; Fang, Jianwen

    2012-08-27

    The large sizes of today's chemical databases require efficient algorithms to perform similarity searches. It can be very time consuming to compare two large chemical databases. This paper seeks to build upon existing research efforts by describing a novel strategy for accelerating existing search algorithms for comparing large chemical collections. The quest for efficiency has focused on developing better indexing algorithms by creating heuristics for searching individual chemical against a chemical library by detecting and eliminating needless similarity calculations. For comparing two chemical collections, these algorithms simply execute searches for each chemical in the query set sequentially. The strategy presented in this paper achieves a speedup upon these algorithms by indexing the set of all query chemicals so redundant calculations that arise in the case of sequential searches are eliminated. We implement this novel algorithm by developing a similarity search program called Symmetric inDexing or SymDex. SymDex shows over a 232% maximum speedup compared to the state-of-the-art single query search algorithm over real data for various fingerprint lengths. Considerable speedup is even seen for batch searches where query set sizes are relatively small compared to typical database sizes. To the best of our knowledge, SymDex is the first search algorithm designed specifically for comparing chemical libraries. It can be adapted to most, if not all, existing indexing algorithms and shows potential for accelerating future similarity search algorithms for comparing chemical databases.

  1. Profiles of secreted neuropeptides and catecholamines illustrate similarities and differences in response to stimulation by distinct secretagogues.

    Science.gov (United States)

    Podvin, Sonia; Bundey, Richard; Toneff, Thomas; Ziegler, Michael; Hook, Vivian

    2015-09-01

    The goal of this study was to define profiles of secreted neuropeptide and catecholamine neurotransmitters that undergo co-release from sympathoadrenal chromaffin cells upon stimulation by distinct secretagogues. Chromaffin cells of the adrenal medulla participate in the dynamic responses to stress, especially that of 'fight and flight', and, thus, analyses of the co-release of multiple neurotransmitters is necessary to gain knowledge of how the stress response regulates cell-cell communication among physiological systems. Results of this study demonstrated that six different secretagogues stimulated the co-release of the neuropeptides Met-enkephalin, galanin, NPY, and VIP with the catecholamines dopamine, norepinephrine, and epinephrine. Importantly, the quantitative profiles of the secreted neurotransmitters showed similarities and differences upon stimulation by the different secretagogues evaluated, composed of KCl depolarization, nicotine, carbachol, PACAP, bradykinin, and histamine. The rank-orders of the secreted profiles of the neurotransmitters were generally similar among these secretagogues, but differences in the secreted amounts of each neurotransmitter occurred with different secretagogues. Epinephrine among the catecholamines showed the highest level of secretion. (Met)enkephalin showed the largest levels of secretion compared to the other neuropeptides examined. Levels of secreted catecholamines were greater than that of the neuropeptides. These data support the hypothesis that profiles of secreted neuropeptide and catecholamine neurotransmitters show similarities and differences upon stimulation by distinct secretagogues. These results illustrate the co-release of concerted neurotransmitter profiles that participate in the stress response of the sympathoadrenal nervous system. Copyright © 2015 Elsevier Inc. All rights reserved.

  2. Similarity in depressive symptom profile in a population-based study of migrants in the Netherlands

    NARCIS (Netherlands)

    Schrier, A.C.; de Wit, M.A.S.; Rijmen, F.; Tuinebreijer, W.C.; Verhoeff, A.P.; Kupka, R.W.; Dekker, J.; Beekman, A.T.F.

    2010-01-01

    Objective: Depression is a clinical syndrome developed in Western Europe and North-America. The expression of symptoms and the impact of symptoms on functioning may therefore be expected to vary across cultures and languages. Our first aim was to study differences in depressive symptom profile

  3. Similarity in depressive symptom profile in a population-based study of migrants in the Netherlands

    NARCIS (Netherlands)

    Schrier, A.C.; de Wit, M.A.S.; Rijmen, F.; Tuinebreijer, W.C.; Verhoeff, A.P.; Kupka, R.W.; Dekker, J.; Beekman, A.T.F.

    2010-01-01

    Objective Depression is a clinical syndrome developed in Western Europe and North-America. The expression of symptoms and the impact of symptoms on functioning may therefore be expected to vary across cultures and languages. Our first aim was to study differences in depressive symptom profile

  4. Ensemble attribute profile clustering: discovering and characterizing groups of genes with similar patterns of biological features

    Directory of Open Access Journals (Sweden)

    Bissell MJ

    2006-03-01

    Full Text Available Abstract Background Ensemble attribute profile clustering is a novel, text-based strategy for analyzing a user-defined list of genes and/or proteins. The strategy exploits annotation data present in gene-centered corpora and utilizes ideas from statistical information retrieval to discover and characterize properties shared by subsets of the list. The practical utility of this method is demonstrated by employing it in a retrospective study of two non-overlapping sets of genes defined by a published investigation as markers for normal human breast luminal epithelial cells and myoepithelial cells. Results Each genetic locus was characterized using a finite set of biological properties and represented as a vector of features indicating attributes associated with the locus (a gene attribute profile. In this study, the vector space models for a pre-defined list of genes were constructed from the Gene Ontology (GO terms and the Conserved Domain Database (CDD protein domain terms assigned to the loci by the gene-centered corpus LocusLink. This data set of GO- and CDD-based gene attribute profiles, vectors of binary random variables, was used to estimate multiple finite mixture models and each ensuing model utilized to partition the profiles into clusters. The resultant partitionings were combined using a unanimous voting scheme to produce consensus clusters, sets of profiles that co-occured consistently in the same cluster. Attributes that were important in defining the genes assigned to a consensus cluster were identified. The clusters and their attributes were inspected to ascertain the GO and CDD terms most associated with subsets of genes and in conjunction with external knowledge such as chromosomal location, used to gain functional insights into human breast biology. The 52 luminal epithelial cell markers and 89 myoepithelial cell markers are disjoint sets of genes. Ensemble attribute profile clustering-based analysis indicated that both lists

  5. MASS ACCRETION AND ITS EFFECTS ON THE SELF-SIMILARITY OF GAS PROFILES IN THE OUTSKIRTS OF GALAXY CLUSTERS

    Energy Technology Data Exchange (ETDEWEB)

    Lau, Erwin T.; Nagai, Daisuke; Avestruz, Camille [Department of Physics, Yale University, New Haven, CT 06520 (United States); Nelson, Kaylea [Yale Center for Astronomy and Astrophysics, Yale University, New Haven, CT 06520 (United States); Vikhlinin, Alexey, E-mail: erwin.lau@yale.edu [Harvard-Smithsonian Center for Astrophysics, 60 Garden Street, Cambridge, MA 02138 (United States)

    2015-06-10

    Galaxy clusters exhibit remarkable self-similar behavior which allows us to establish simple scaling relationships between observable quantities and cluster masses, making galaxy clusters useful cosmological probes. Recent X-ray observations suggested that self-similarity may be broken in the outskirts of galaxy clusters. In this work, we analyze a mass-limited sample of massive galaxy clusters from the Omega500 cosmological hydrodynamic simulation to investigate the self-similarity of the diffuse X-ray emitting intracluster medium (ICM) in the outskirts of galaxy clusters. We find that the self-similarity of the outer ICM profiles is better preserved if they are normalized with respect to the mean density of the universe, while the inner profiles are more self-similar when normalized using the critical density. However, the outer ICM profiles as well as the location of accretion shock around clusters are sensitive to their mass accretion rate, which causes the apparent breaking of self-similarity in cluster outskirts. We also find that the collisional gas does not follow the distribution of collisionless dark matter (DM) perfectly in the infall regions of galaxy clusters, leading to 10% departures in the gas-to-DM density ratio from the cosmic mean value. Our results have a number implications for interpreting observations of galaxy clusters in X-ray and through the Sunyaev–Zel’dovich effect, and their applications to cosmology.

  6. Behavioral Profiles of Children with Williams Syndrome from Spain and the United States: Cross-Cultural Similarities and Differences

    Science.gov (United States)

    Pérez-García, Débora; Brun-Gasca, Carme; Pérez-Jurado, Luis A.; Mervis, Carolyn B.

    2017-01-01

    To identify similarities and differences in the behavioral profile of children with Williams syndrome from Spain (n = 53) and the United States (n = 145), we asked parents of 6- to 14-year-olds with Williams syndrome to complete the Child Behavior Checklist 6-18. The distribution of raw scores was significantly higher for the Spanish sample than…

  7. Chemical similarity among domesticated and wild genotypes of peanut based on n-alkanes profiles

    OpenAIRE

    Renata Janaína Carvalho de Souza; Suzene Izídio da Silva; Antonio Fernando Morais de Oliveira

    2010-01-01

    The objective of this work was to analyze the epicuticular n-alkane profile of domesticated and wild peanut genotypes. Foliar epicuticular n-alkanes of four Arachis hypogaea genotypes and two wild species - A. monticola and A. stenosperma - were analyzed by gas chromatography. Chemical relationships between them were evaluated using the Dice coefficient and UPGMA method. Two clusters were formed: one with four A. hypogaea genotypes and the other with the two wild species. There is more simila...

  8. Insect Adhesion Secretions: Similarities and Dissimilarities in Hydrocarbon Profiles of Tarsi and Corresponding Tibiae.

    Science.gov (United States)

    Gerhardt, Heike; Betz, Oliver; Albert, Klaus; Lämmerhofer, Michael

    2016-08-01

    Spatially controlled in vivo sampling by contact solid phase microextraction with a non-coated silica fiber combined with gas chromatography-mass spectrometry (GC-MS) was utilized for hydrocarbon profiling in tarsal adhesion secretions of four insect species (Nicrophorus vespilloides, Nicrophorus nepalensis, Sagra femorata, and Gromphadorhina portentosa) by using distinct adhesion systems, viz. hairy or smooth tarsi. For comparison, corresponding samples from tibiae, representing the general cuticular hydrocarbon profile, were analyzed to enable the statistical inference of active molecular adhesion principles in tarsal secretions possibly contributed by specific hydrocarbons. n-Alkanes, monomethyl and dimethyl alkanes, alkenes, alkadienes, and one aldehyde were detected. Multivariate statistical analysis (principal component and orthogonal partial least square discriminant analyses) gave insights into distinctive molecular features among the various insect species and between tarsus and tibia samples. In general, corresponding hydrocarbon profiles in tarsus and tibia samples largely resembled each other, both qualitatively and in relative abundances as well. However, several specific hydrocarbons showed significantly different relative abundances between corresponding tarsus and tibia samples, thus indicating that such differences of specific hydrocarbons in the complex mixtures might constitute a delicate mechanism for fine-tuning the reversible attachment performances in tarsal adhesive fluids that are composed of substances originating from the same pool as cuticular hydrocarbons. Caused by melting point depression, the multicomponent tarsal adhesion secretion, made up of straight chain alkanes, methyl alkanes, and alkenes will have a semi-solid, grease-like consistency, which might provide the basis for a good reversible attachment performance.

  9. Formulation and evaluation of a montelukast sodium orally disintegrating tablet with a similar dissolution profile as the marketed product.

    Science.gov (United States)

    Chen, Yong; Feng, Tingting; Li, Yong; Du, Bin; Weng, Weiyu

    2017-03-01

    A major challenge of orally disintegrating tablet (ODT) development is predicting its bioequivalence to its corresponding marketed product. Therefore, comparing ODT dissolution profiles to those of the corresponding marketed product is very important. The objective of this study was to develop a 5.2-mg montelukast sodium (MS) ODT with a similar dissolution profile to that of the marketed chewable tablet. Dissolution profiles were examined in different media to screen each formulation. We found that MS dissolution from ODTs in acidic medium heavily depended on manufacturing methods. All MS ODTs prepared using direct compression rapidly disintegrated in acidic medium. However, dispersed MS powders aggregated into sticky masses, resulting in slow dissolution. In contrast, MS ODTs prepared using wet granulation had much faster dissolution rates in acidic medium with no obvious aggregation. Additionally, the optimized formulation, prepared using wet granulation, displayed similar dissolution profiles to the marketed reference in all four types of media examined (f2 > 50). The in vitro disintegration time of the optimized ODT was 9.5 ± 2.4 s, which meets FDA requirements. In conclusion, the wet granulation preparation method of MS ODTs resulted in a product with equivalent dissolution profiles as those of the marketed product.

  10. MS/MS similarity networking accelerated target profiling of triterpene saponins in Eleutherococcus senticosus leaves.

    Science.gov (United States)

    Ge, Yue-Wei; Zhu, Shu; Yoshimatsu, Kayo; Komatsu, Katsuko

    2017-07-15

    The targeted mass information of compounds accelerated their discovery in a large volume of untargeted MS data. An MS/MS similarity networking is advanced in clustering the structural analogues, which benefits the collection of mass information of similar compounds. The triterpene saponins extracted from Eleutherococcus senticosus leaves (ESL), a kind of functional tea, have shown promise in the relief of Alzheimer's disease. In this work, a target-precursor list (TPL) generated using MS/MS similarity networking was employed to rapidly trace 106 triterpene saponins from the aqueous extracts of ESL, of which 49 were tentatively identified as potentially new triterpene saponins. Moreover, a compound database of triterpene saponins was established and successfully applied to uncover their distribution features in ESL samples collected from different areas. Copyright © 2017 Elsevier Ltd. All rights reserved.

  11. Unveiled electric profiles within hydrogen bonds suggest DNA base pairs with similar bond strengths.

    Directory of Open Access Journals (Sweden)

    Y B Ruiz-Blanco

    Full Text Available Electrical forces are the background of all the interactions occurring in biochemical systems. From here and by using a combination of ab-initio and ad-hoc models, we introduce the first description of electric field profiles with intrabond resolution to support a characterization of single bond forces attending to its electrical origin. This fundamental issue has eluded a physical description so far. Our method is applied to describe hydrogen bonds (HB in DNA base pairs. Numerical results reveal that base pairs in DNA could be equivalent considering HB strength contributions, which challenges previous interpretations of thermodynamic properties of DNA based on the assumption that Adenine/Thymine pairs are weaker than Guanine/Cytosine pairs due to the sole difference in the number of HB. Thus, our methodology provides solid foundations to support the development of extended models intended to go deeper into the molecular mechanisms of DNA functioning.

  12. Similarity in depressive symptom profile in a population-based study of migrants in the Netherlands.

    Science.gov (United States)

    Schrier, Agnes C; de Wit, Matty A S; Rijmen, Frank; Tuinebreijer, Wilco C; Verhoeff, Arnoud P; Kupka, Ralph W; Dekker, Jack; Beekman, Aartjan T F

    2010-10-01

    Depression is a clinical syndrome developed in Western Europe and North-America. The expression of symptoms and the impact of symptoms on functioning may therefore be expected to vary across cultures and languages. Our first aim was to study differences in depressive symptom profile between indigenous and non-Western immigrant populations in the Netherlands. We hypothesized that differences in expression of depressive symptoms would be more likely in the domains of mood and cognitions, and less likely in the domains of psychomotor and vegetative symptoms. Our second aim was to study ethnic differences in the association of depressive symptoms and general functioning. In a random community sample stratified for ethnicity in Amsterdam, the Netherlands, depressive symptoms were assessed by bilingual interviewers using the Composite International Diagnostic Interview (CIDI 2.1) and the Symptom Checklist-90-Revised (SCL-90-R). Impairments in functioning were measured by the World Health Organization Disability Assessment Schedule II (WHODAS II). Results were obtained from 812 subjects: N = 321 native Dutch, N = 213 Turkish-Dutch, N = 191 Moroccan-Dutch, N = 87 Surinamese-Dutch. Differences in depressive symptom expression were tested by differential item functioning. The prevalence of DSM-IV depressive disorder and the overall level of depressive symptoms were higher in the Turkish and Moroccan immigrant groups compared to native Dutch subjects. Ethnic differences in item functioning of depressive symptoms were rare, and equally unlikely in all four symptom domains. Depression was equally associated with functional impairment across ethnic groups. Although depressive symptoms were more common among migrants than in the indigenous population, both the depressive symptom profile and the associated functional impairments were comparable. These findings may help diminishing concerns about the validity of using existing diagnostic procedures among ethnic minority

  13. Discordant clinical presentations of preeclampsia and intrauterine fetal growth restriction with similar pro- and anti-angiogenic profiles.

    Science.gov (United States)

    Alahakoon, Thushari I; Zhang, Weiyi; Trudinger, Brian J; Lee, Vincent W

    2014-12-01

    To evaluate the plasma levels of angiogenic factors in preeclampsia (PE) and intrauterine fetal growth restriction (IUGR) and their potential as biomarkers to distinguish normal from pathologic pregnancies. Case control study included singleton pregnancies in four categories: (i) normal (n = 29), (ii) PE (n = 15), (iii) PE and IUGR (n = 16) and (iv) IUGR (n = 24). The classification of IUGR included umbilical artery Doppler resistance. Maternal plasma placental growth factor (PlGF), soluble fms-like tyrosine kinase-1 (sFlt-1), soluble kinase domain receptor (sKDR) and soluble endoglin (sEng) as well as fetal umbilical artery sFlt-1 levels were determined. Each individual marker and their ratios were assessed for their potential to distinguish normal pregnancy from pregnancies affected by PE and/or IUGR. We found (i) elevated plasma sFlt-1, sEng and reduced PlGF, sKDR in PE and IUGR; (ii) similar angiogenic profiles in PE and IUGR and (iii) sEng and sFlt-1*sEng/PlGF performed best as biomarkers in identifying pathologic pregnancies. PE and IUGR have similar angiogenic profiles, suggesting that angiogenic marker profiles lack specificity in identifying PE and that other factors are required for the development of PE instead of IUGR. sEng should be included in a biomarker profile for predicting PE or IUGR.

  14. Polar solar panels: Arctic and Antarctic microbiomes display similar taxonomic profiles.

    Science.gov (United States)

    Tanner, Kristie; Manuel Martí, José; Belliure, Josabel; Fernández-Méndez, Mar; Molina-Menor, Esther; Peretó, Juli; Porcar, Manuel

    2017-12-01

    Solar panels located on high (Arctic and Antarctic) latitudes combine the harshness of the climate with that of the solar exposure. We report here that these polar solar panels are inhabited by similar microbial communities in taxonomic terms, dominated by Hymenobacter spp., Sphingomonas spp., and Ascomycota. Our results suggest that solar panels, even on high latitudes, can shape a microbial ecosystem adapted to irradiation and desiccation. This article is protected by copyright. All rights reserved. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.

  15. Three-level prediction of protein function by combining profile-sequence search, profile-profile search, and domain co-occurrence networks.

    Science.gov (United States)

    Wang, Zheng; Cao, Renzhi; Cheng, Jianlin

    2013-01-01

    Predicting protein function from sequence is useful for biochemical experiment design, mutagenesis analysis, protein engineering, protein design, biological pathway analysis, drug design, disease diagnosis, and genome annotation as a vast number of protein sequences with unknown function are routinely being generated by DNA, RNA and protein sequencing in the genomic era. However, despite significant progresses in the last several years, the accuracy of protein function prediction still needs to be improved in order to be used effectively in practice, particularly when little or no homology exists between a target protein and proteins with annotated function. Here, we developed a method that integrated profile-sequence alignment, profile-profile alignment, and Domain Co-Occurrence Networks (DCN) to predict protein function at different levels of complexity, ranging from obvious homology, to remote homology, to no homology. We tested the method blindingly in the 2011 Critical Assessment of Function Annotation (CAFA). Our experiments demonstrated that our three-level prediction method effectively increased the recall of function prediction while maintaining a reasonable precision. Particularly, our method can predict function terms defined by the Gene Ontology more accurately than three standard baseline methods in most situations, handle multi-domain proteins naturally, and make ab initio function prediction when no homology exists. These results show that our approach can combine complementary strengths of most widely used BLAST-based function prediction methods, rarely used in function prediction but more sensitive profile-profile comparison-based homology detection methods, and non-homology-based domain co-occurrence networks, to effectively extend the power of function prediction from high homology, to low homology, to no homology (ab initio cases).

  16. Early Trichinella spiralis and Trichinella nativa infections induce similar gene expression profiles in rat jejunal mucosa.

    Science.gov (United States)

    Airas, Niina; Näreaho, Anu; Lindén, Jere; Valo, Erkka; Hautaniemi, Sampsa; Jokelainen, Pikka; Sukura, Antti

    2013-10-01

    Trichinella spiralis causes a significantly higher parasite burden in rat muscle than Trichinella nativa. To assess whether the difference in infectivity is due to the early intestinal response, we analyzed gene expression changes in the rat jejunum during Trichinella infection with a whole-genome microarray. The rats were euthanized on day five of infection, and their jejunal mucosa was sampled for microarray analysis. In addition, intestinal histology and hematology were examined. Against our expectations, the gene expression changes were similar in both T.nativa- and T. spiralis-infected groups. The two groups were hence pooled, and in the combined Trichinella-infected group, 551 genes were overexpressed and 427 underexpressed when compared to controls (false discovery rate ≤ 0.001 and fold change at least 2 in either direction). Pathway analysis identified seven pathways significantly associated with Trichinella infection (p Trichinella infection caused complex gene expression changes that indicate a host response to tissue damage in the mucosa of the jejunum, but the changes were not notably dependent on the studied species of Trichinella. Copyright © 2013 Elsevier Inc. All rights reserved.

  17. Molecular and Cellular Profiling of Scalp Psoriasis Reveals Differences and Similarities Compared to Skin Psoriasis.

    Directory of Open Access Journals (Sweden)

    Juan Ruano

    Full Text Available Scalp psoriasis shows a variable clinical spectrum and in many cases poses a great therapeutic challenge. However, it remains unknown whether the immune response of scalp psoriasis differs from understood pathomechanisms of psoriasis in other skin areas. We sought to determine the cellular and molecular phenotype of scalp psoriasis by performing a comparative analysis of scalp and skin using lesional and nonlesional samples from 20 Caucasian subjects with untreated moderate to severe psoriasis and significant scalp involvement and 10 control subjects without psoriasis. Our results suggest that even in the scalp, psoriasis is a disease of the inter-follicular skin. The immune mechanisms that mediate scalp psoriasis were found to be similar to those involved in skin psoriasis. However, the magnitude of dysregulation, number of differentially expressed genes, and enrichment of the psoriatic genomic fingerprint were more prominent in skin lesions. Furthermore, the scalp transcriptome showed increased modulation of several gene-sets, particularly those induced by interferon-gamma, compared with that of skin psoriasis, which was mainly associated with activation of TNFα/L-17/IL-22-induced keratinocyte response genes. We also detected differences in expression of gene-sets involving negative regulation, epigenetic regulation, epidermal differentiation, and dendritic cell or Th1/Th17/Th22-related T-cell processes.

  18. Molecular and Cellular Profiling of Scalp Psoriasis Reveals Differences and Similarities Compared to Skin Psoriasis.

    Science.gov (United States)

    Ruano, Juan; Suárez-Fariñas, Mayte; Shemer, Avner; Oliva, Margeaux; Guttman-Yassky, Emma; Krueger, James G

    2016-01-01

    Scalp psoriasis shows a variable clinical spectrum and in many cases poses a great therapeutic challenge. However, it remains unknown whether the immune response of scalp psoriasis differs from understood pathomechanisms of psoriasis in other skin areas. We sought to determine the cellular and molecular phenotype of scalp psoriasis by performing a comparative analysis of scalp and skin using lesional and nonlesional samples from 20 Caucasian subjects with untreated moderate to severe psoriasis and significant scalp involvement and 10 control subjects without psoriasis. Our results suggest that even in the scalp, psoriasis is a disease of the inter-follicular skin. The immune mechanisms that mediate scalp psoriasis were found to be similar to those involved in skin psoriasis. However, the magnitude of dysregulation, number of differentially expressed genes, and enrichment of the psoriatic genomic fingerprint were more prominent in skin lesions. Furthermore, the scalp transcriptome showed increased modulation of several gene-sets, particularly those induced by interferon-gamma, compared with that of skin psoriasis, which was mainly associated with activation of TNFα/L-17/IL-22-induced keratinocyte response genes. We also detected differences in expression of gene-sets involving negative regulation, epigenetic regulation, epidermal differentiation, and dendritic cell or Th1/Th17/Th22-related T-cell processes.

  19. Molecular and Cellular Profiling of Scalp Psoriasis Reveals Differences and Similarities Compared to Skin Psoriasis

    Science.gov (United States)

    Ruano, Juan; Suárez-Fariñas, Mayte; Shemer, Avner; Oliva, Margeaux

    2016-01-01

    Scalp psoriasis shows a variable clinical spectrum and in many cases poses a great therapeutic challenge. However, it remains unknown whether the immune response of scalp psoriasis differs from understood pathomechanisms of psoriasis in other skin areas. We sought to determine the cellular and molecular phenotype of scalp psoriasis by performing a comparative analysis of scalp and skin using lesional and nonlesional samples from 20 Caucasian subjects with untreated moderate to severe psoriasis and significant scalp involvement and 10 control subjects without psoriasis. Our results suggest that even in the scalp, psoriasis is a disease of the inter-follicular skin. The immune mechanisms that mediate scalp psoriasis were found to be similar to those involved in skin psoriasis. However, the magnitude of dysregulation, number of differentially expressed genes, and enrichment of the psoriatic genomic fingerprint were more prominent in skin lesions. Furthermore, the scalp transcriptome showed increased modulation of several gene-sets, particularly those induced by interferon-gamma, compared with that of skin psoriasis, which was mainly associated with activation of TNFα/L-17/IL-22-induced keratinocyte response genes. We also detected differences in expression of gene-sets involving negative regulation, epigenetic regulation, epidermal differentiation, and dendritic cell or Th1/Th17/Th22-related T-cell processes. PMID:26849645

  20. Encoding of low-quality DNA profiles as genotype probability matrices for improved profile comparisons, relatedness evaluation and database searches.

    Science.gov (United States)

    Ryan, K; Williams, D Gareth; Balding, David J

    2016-11-01

    Many DNA profiles recovered from crime scene samples are of a quality that does not allow them to be searched against, nor entered into, databases. We propose a method for the comparison of profiles arising from two DNA samples, one or both of which can have multiple donors and be affected by low DNA template or degraded DNA. We compute likelihood ratios to evaluate the hypothesis that the two samples have a common DNA donor, and hypotheses specifying the relatedness of two donors. Our method uses a probability distribution for the genotype of the donor of interest in each sample. This distribution can be obtained from a statistical model, or we can exploit the ability of trained human experts to assess genotype probabilities, thus extracting much information that would be discarded by standard interpretation rules. Our method is compatible with established methods in simple settings, but is more widely applicable and can make better use of information than many current methods for the analysis of mixed-source, low-template DNA profiles. It can accommodate uncertainty arising from relatedness instead of or in addition to uncertainty arising from noisy genotyping. We describe a computer program GPMDNA, available under an open source licence, to calculate LRs using the method presented in this paper. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  1. Evaluation of dissolution profile similarity - Comparison between the f2, the multivariate statistical distance and the f2 bootstrapping methods.

    Science.gov (United States)

    Paixão, Paulo; Gouveia, Luís F; Silva, Nuno; Morais, José A G

    2017-03-01

    A simulation study is presented, evaluating the performance of the f2, the model-independent multivariate statistical distance and the f2 bootstrap methods in the ability to conclude similarity between two dissolution profiles. Different dissolution profiles, based on the Noyes-Whitney equation and ranging from theoretical f2 values between 100 and 40, were simulated. Variability was introduced in the dissolution model parameters in an increasing order, ranging from a situation complying with the European guidelines requirements for the use of the f2 metric to several situations where the f2 metric could not be used anymore. Results have shown that the f2 is an acceptable metric when used according to the regulatory requirements, but loses its applicability when variability increases. The multivariate statistical distance presented contradictory results in several of the simulation scenarios, which makes it an unreliable metric for dissolution profile comparisons. The bootstrap f2, although conservative in its conclusions is an alternative suitable method. Overall, as variability increases, all of the discussed methods reveal problems that can only be solved by increasing the number of dosage form units used in the comparison, which is usually not practical or feasible. Additionally, experimental corrective measures may be undertaken in order to reduce the overall variability, particularly when it is shown that it is mainly due to the dissolution assessment instead of being intrinsic to the dosage form. Copyright © 2016. Published by Elsevier B.V.

  2. Fibroblasts from phenotypically normal palmar fascia exhibit molecular profiles highly similar to fibroblasts from active disease in Dupuytren's Contracture

    Directory of Open Access Journals (Sweden)

    Satish Latha

    2012-05-01

    Full Text Available Abstract Background Dupuytren's contracture (DC is a fibroproliferative disorder characterized by the progressive development of a scar-like collagen-rich cord that affects the palmar fascia of the hand and leads to digital flexion contractures. DC is most commonly treated by surgical resection of the diseased tissue, but has a high reported recurrence rate ranging from 27% to 80%. We sought to determine if the transcriptomic profiles of fibroblasts derived from DC-affected palmar fascia, adjacent phenotypically normal palmar fascia, and non-DC palmar fascial tissues might provide mechanistic clues to understanding the puzzle of disease predisposition and recurrence in DC. Methods To achieve this, total RNA was obtained from fibroblasts derived from primary DC-affected palmar fascia, patient-matched unaffected palmar fascia, and palmar fascia from non-DC patients undergoing carpal tunnel release (6 patients in each group. These cells were grown on a type-1 collagen substrate (to better mimic their in vivo environments. Microarray analyses were subsequently performed using Illumina BeadChip arrays to compare the transcriptomic profiles of these three cell populations. Data were analyzed using Significance Analysis of Microarrays (SAM v3.02, hierarchical clustering, concordance mapping and Venn diagram. Results We found that the transcriptomic profiles of DC-disease fibroblasts and fibroblasts from unaffected fascia of DC patients exhibited a much greater overlap than fibroblasts derived from the palmar fascia of patients undergoing carpal tunnel release. Quantitative real time RT-PCR confirmed the differential expression of select genes validating the microarray data analyses. These data are consistent with the hypothesis that predisposition and recurrence in DC may stem, at least in part, from intrinsic similarities in the basal gene expression of diseased and phenotypically unaffected palmar fascia fibroblasts. These data also demonstrate that

  3. Fibroblasts from phenotypically normal palmar fascia exhibit molecular profiles highly similar to fibroblasts from active disease in Dupuytren's Contracture

    Science.gov (United States)

    2012-01-01

    Background Dupuytren's contracture (DC) is a fibroproliferative disorder characterized by the progressive development of a scar-like collagen-rich cord that affects the palmar fascia of the hand and leads to digital flexion contractures. DC is most commonly treated by surgical resection of the diseased tissue, but has a high reported recurrence rate ranging from 27% to 80%. We sought to determine if the transcriptomic profiles of fibroblasts derived from DC-affected palmar fascia, adjacent phenotypically normal palmar fascia, and non-DC palmar fascial tissues might provide mechanistic clues to understanding the puzzle of disease predisposition and recurrence in DC. Methods To achieve this, total RNA was obtained from fibroblasts derived from primary DC-affected palmar fascia, patient-matched unaffected palmar fascia, and palmar fascia from non-DC patients undergoing carpal tunnel release (6 patients in each group). These cells were grown on a type-1 collagen substrate (to better mimic their in vivo environments). Microarray analyses were subsequently performed using Illumina BeadChip arrays to compare the transcriptomic profiles of these three cell populations. Data were analyzed using Significance Analysis of Microarrays (SAM v3.02), hierarchical clustering, concordance mapping and Venn diagram. Results We found that the transcriptomic profiles of DC-disease fibroblasts and fibroblasts from unaffected fascia of DC patients exhibited a much greater overlap than fibroblasts derived from the palmar fascia of patients undergoing carpal tunnel release. Quantitative real time RT-PCR confirmed the differential expression of select genes validating the microarray data analyses. These data are consistent with the hypothesis that predisposition and recurrence in DC may stem, at least in part, from intrinsic similarities in the basal gene expression of diseased and phenotypically unaffected palmar fascia fibroblasts. These data also demonstrate that a collagen

  4. Global transcriptional profiles of beating clusters derived from human induced pluripotent stem cells and embryonic stem cells are highly similar

    Directory of Open Access Journals (Sweden)

    Gupta Manoj K

    2010-09-01

    Full Text Available Abstract Background Functional and molecular integrity of cardiomyocytes (CMs derived from induced pluripotent stem (iPS cells is essential for their use in tissue repair, disease modelling and drug screening. In this study we compared global transcriptomes of beating clusters (BCs microdissected from differentiating human iPS cells and embryonic stem (ES cells. Results Hierarchical clustering and principal component analysis revealed that iPS-BCs and ES-BCs cluster together, are similarly enriched for cardiospecific genes and differ in expression of only 1.9% of present transcripts. Similarly, sarcomeric organization, electrophysiological properties and calcium handling of iPS-CMs were indistinguishable from those of ES-CMs. Gene ontology analysis revealed that among 204 genes that were upregulated in iPS-BCs vs ES-BCs the processes related to extracellular matrix, cell adhesion and tissue development were overrepresented. Interestingly, 47 of 106 genes that were upregulated in undifferentiated iPS vs ES cells remained enriched in iPS-BCs vs ES-BCs. Most of these genes were found to be highly expressed in fibroblasts used for reprogramming and 34% overlapped with the recently reported iPS cell-enriched genes. Conclusions These data suggest that iPS-BCs are transcriptionally highly similar to ES-BCs. However, iPS-BCs appear to share some somatic cell signature with undifferentiated iPS cells. Thus, iPS-BCs may not be perfectly identical to ES-BCs. These minor differences in the expression profiles may occur due to differential cellular composition of iPS-BCs and ES-BCs, due to retention of some genetic profile of somatic cells in differentiated iPS cell-derivatives, or both.

  5. Behavioral Profiles of Children With Williams Syndrome From Spain and the United States: Cross-Cultural Similarities and Differences.

    Science.gov (United States)

    Pérez-García, Débora; Brun-Gasca, Carme; Pérez-Jurado, Luis A; Mervis, Carolyn B

    2017-03-01

    To identify similarities and differences in the behavioral profile of children with Williams syndrome from Spain (n = 53) and the United States (n = 145), we asked parents of 6- to 14-year-olds with Williams syndrome to complete the Child Behavior Checklist 6-18. The distribution of raw scores was significantly higher for the Spanish sample than the American sample for all of the higher-order factors and half of both the empirically based and Diagnostic and Statistical Manual of Mental Disorders (DSM)-oriented scales. In contrast, analyses based on country-specific T-scores indicated that the distribution for the Spanish sample was significantly higher than for the American sample only on the Social Problems scale. No gender differences were found. Genetic and cultural influences on children's behavior and cultural influences on parental ratings of behavior are discussed.

  6. Diagnostic Power of Broad Emission Line Profiles in Searches for Binary Supermassive Black Holes.

    Science.gov (United States)

    Nguyen, Khai; Bogdanovic, Tamara

    2015-01-01

    Motivated by advances in observational searches for sub-parsec supermassive black hole binaries (SBHBs) made in the past few years we develop a semi-analytic model to describe the spectral emission line signatures of these systems. The goal of this work is to test one of the leading models of binary accretion flows in the literature: SBHB in a circumbinary disk. In this context, we model SBHB accretion flows as a set of three accretion disks: two mini-disks that are gravitationally bound to the individual black holes and a circumbinary disk that forms a common envelope about a gravitationally bound binary. We find that emission line profiles tend to have different statistical properties depending on the semi-major axis, mass ratio, eccentricity of the binary, and the alignment of the triple-disk system, and can in principle be used to infer the statistical distribution of these parameters.

  7. eF-seek: prediction of the functional sites of proteins by searching for similar electrostatic potential and molecular surface shape

    Science.gov (United States)

    Kinoshita, Kengo; Murakami, Yoichi; Nakamura, Haruki

    2007-01-01

    We have developed a method to predict ligand-binding sites in a new protein structure by searching for similar binding sites in the Protein Data Bank (PDB). The similarities are measured according to the shapes of the molecular surfaces and their electrostatic potentials. A new web server, eF-seek, provides an interface to our search method. It simply requires a coordinate file in the PDB format, and generates a prediction result as a virtual complex structure, with the putative ligands in a PDB format file as the output. In addition, the predicted interacting interface is displayed to facilitate the examination of the virtual complex structure on our own applet viewer with the web browser (URL: http://eF-site.hgc.jp/eF-seek). PMID:17567616

  8. Modes of variability of the vertical temperature profile of the middle atmosphere at mid-latitude: Similarities with solar forcing

    Science.gov (United States)

    Keckhut, Philippe; Hauchecorne, Alain; Kerzenmacher, Tobias; Angot, Guillaume

    2012-02-01

    A long and continuous temperature data set from ground to mesopause was obtained in merging lidar and radiosonde data at mid-latitude over south of France (44°N). The analyses using Empirical Orthogonal Functions has been applied on vertical temperature profiles to investigate the variability differently than it has been done in previous investigations. This study reveals as the first mode in winter, a strong anti-correlation between upper stratosphere and mesosphere that is most probably link with planetary waves propagation and associated stratospheric warmings. While in summer the variability is located in the mesosphere and associated with mesospheric inversions that are probably generated by gravity waves breaking. This study shows that even if the daily temperature variability appears to be complex, a large part (30%) can be modeled, each season, using the first EOF. These vertical patterns exhibit some similarities with solar-atmospheric responses, suggesting a potential feedback of the dynamic. This is already observed for winter response, but during summer the contribution of gravity waves on the mesospheric solar response suggests future investigations to explore the role of this potential mechanism in solar-atmospheric connections.

  9. Similarities in Gene Expression Profiles during In Vitro Aging of Primary Human Embryonic Lung and Foreskin Fibroblasts

    Directory of Open Access Journals (Sweden)

    Shiva Marthandan

    2015-01-01

    Full Text Available Replicative senescence is of fundamental importance for the process of cellular aging, since it is a property of most of our somatic cells. Here, we elucidated this process by comparing gene expression changes, measured by RNA-seq, in fibroblasts originating from two different tissues, embryonic lung (MRC-5 and foreskin (HFF, at five different time points during their transition into senescence. Although the expression patterns of both fibroblast cell lines can be clearly distinguished, the similar differential expression of an ensemble of genes was found to correlate well with their transition into senescence, with only a minority of genes being cell line specific. Clustering-based approaches further revealed common signatures between the cell lines. Investigation of the mRNA expression levels at various time points during the lifespan of either of the fibroblasts resulted in a number of monotonically up- and downregulated genes which clearly showed a novel strong link to aging and senescence related processes which might be functional. In terms of expression profiles of differentially expressed genes with age, common genes identified here have the potential to rule the transition into senescence of embryonic lung and foreskin fibroblasts irrespective of their different cellular origin.

  10. Validation of SmartRank: A likelihood ratio software for searching national DNA databases with complex DNA profiles.

    Science.gov (United States)

    Benschop, Corina C G; van de Merwe, Linda; de Jong, Jeroen; Vanvooren, Vanessa; Kempenaers, Morgane; Kees van der Beek, C P; Barni, Filippo; Reyes, Eusebio López; Moulin, Léa; Pene, Laurent; Haned, Hinda; Sijen, Titia

    2017-07-01

    Searching a national DNA database with complex and incomplete profiles usually yields very large numbers of possible matches that can present many candidate suspects to be further investigated by the forensic scientist and/or police. Current practice in most forensic laboratories consists of ordering these 'hits' based on the number of matching alleles with the searched profile. Thus, candidate profiles that share the same number of matching alleles are not differentiated and due to the lack of other ranking criteria for the candidate list it may be difficult to discern a true match from the false positives or notice that all candidates are in fact false positives. SmartRank was developed to put forward only relevant candidates and rank them accordingly. The SmartRank software computes a likelihood ratio (LR) for the searched profile and each profile in the DNA database and ranks database entries above a defined LR threshold according to the calculated LR. In this study, we examined for mixed DNA profiles of variable complexity whether the true donors are retrieved, what the number of false positives above an LR threshold is and the ranking position of the true donors. Using 343 mixed DNA profiles over 750 SmartRank searches were performed. In addition, the performance of SmartRank and CODIS were compared regarding DNA database searches and SmartRank was found complementary to CODIS. We also describe the applicable domain of SmartRank and provide guidelines. The SmartRank software is open-source and freely available. Using the best practice guidelines, SmartRank enables obtaining investigative leads in criminal cases lacking a suspect. Copyright © 2017 Elsevier B.V. All rights reserved.

  11. SysFinder: A customized platform for search, comparison and assisted design of appropriate animal models based on systematic similarity.

    Science.gov (United States)

    Yang, Shuang; Zhang, Guoqing; Liu, Wan; Wang, Zhen; Zhang, Jifeng; Yang, Dongshan; Chen, Y Eugene; Sun, Hong; Li, Yixue

    2017-05-20

    Animal models are increasingly gaining values by cross-comparisons of response or resistance to clinical agents used for patients. However, many disease mechanisms and drug effects generated from animal models are not transferable to human. To address these issues, we developed SysFinder (http://lifecenter.sgst.cn/SysFinder), a platform for scientists to find appropriate animal models for translational research. SysFinder offers a "topic-centered" approach for systematic comparisons of human genes, whose functions are involved in a specific scientific topic, to the corresponding homologous genes of animal models. Scientific topic can be a certain disease, drug, gene function or biological pathway. SysFinder calculates multi-level similarity indexes to evaluate the similarities between human and animal models in specified scientific topics. Meanwhile, SysFinder offers species-specific information to investigate the differences in molecular mechanisms between humans and animal models. Furthermore, SysFinder provides a user-friendly platform for determination of short guide RNAs (sgRNAs) and homology arms to design a new animal model. Case studies illustrate the ability of SysFinder in helping experimental scientists. SysFinder is a useful platform for experimental scientists to carry out their research in the human molecular mechanisms. Copyright © 2017 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. Published by Elsevier Ltd. All rights reserved.

  12. Protein profile of Lupinus texensis phloem sap exudates: searching for Fe- and Zn-containing proteins.

    Science.gov (United States)

    Lattanzio, Giuseppe; Andaluz, Sofía; Matros, Andrea; Calvete, Juan José; Kehr, Julia; Abadía, Anunciación; Abadía, Javier; López-Millán, Ana-Flor

    2013-08-01

    The aim of this study was to obtain a comprehensive overview of the phloem sap protein profile of Lupinus texensis, with a special focus on proteins binding Fe and Zn. L. texensis was chosen as model plant given the simplicity to obtain exudates from sieve elements. Protein profiling by 2DE revealed 249 spots, and 54 of them were unambiguously identified by MALDI-MS and ESI-MS/MS. The largest number of identified protein species belongs to protein modification/turnover and general metabolism (19-21%), followed by redox homeostasis (9%) and defense and cell structural components (7%). This protein profile is similar to that reported in other plant species, suggesting that the phloem sap proteome is quite conserved. Staining of 2DE gels for Fe-containing proteins and affinity chromatography experiments revealed the presence of two low molecular weight Fe-binding proteins in phloem sap: a metallothionein-like protein type 2B identified in the Fe-affinity chromatography, and a second protein identified with both Fe staining methods. This protein species had a molecular weight of 13.5 kDa, a pI of 5.6 and 51% homology to a phloem-specific protein from Medicago truncatula. Zinc affinity chromatography revealed four Zn-binding proteins in phloem sap, one belonging to the dehydrin family and three Zn finger proteins. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  13. A systematic in silico search for target similarity identifies several approved drugs with potential activity against the Plasmodium falciparum apicoplast.

    Directory of Open Access Journals (Sweden)

    Nadlla Alves Bispo

    Full Text Available Most of the drugs in use against Plasmodium falciparum share similar modes of action and, consequently, there is a need to identify alternative potential drug targets. Here, we focus on the apicoplast, a malarial plastid-like organelle of algal source which evolved through secondary endosymbiosis. We undertake a systematic in silico target-based identification approach for detecting drugs already approved for clinical use in humans that may be able to interfere with the P. falciparum apicoplast. The P. falciparum genome database GeneDB was used to compile a list of ≈600 proteins containing apicoplast signal peptides. Each of these proteins was treated as a potential drug target and its predicted sequence was used to interrogate three different freely available databases (Therapeutic Target Database, DrugBank and STITCH3.1 that provide synoptic data on drugs and their primary or putative drug targets. We were able to identify several drugs that are expected to interact with forty-seven (47 peptides predicted to be involved in the biology of the P. falciparum apicoplast. Fifteen (15 of these putative targets are predicted to have affinity to drugs that are already approved for clinical use but have never been evaluated against malaria parasites. We suggest that some of these drugs should be experimentally tested and/or serve as leads for engineering new antimalarials.

  14. Realm of PD-(D/EXK nuclease superfamily revisited: detection of novel families with modified transitive meta profile searches

    Directory of Open Access Journals (Sweden)

    Knizewski Lukasz

    2007-06-01

    Full Text Available Abstract Background PD-(D/EXK nucleases constitute a large and highly diverse superfamily of enzymes that display little sequence similarity despite retaining a common core fold and a few critical active site residues. This makes identification of new PD-(D/EXK nuclease families a challenging task as they usually escape detection with standard sequence-based methods. We developed a modified transitive meta profile search approach and to consider the structural diversity of PD-(D/EXK nuclease fold more thoroughly we analyzed also lower than threshold Meta-BASIC hits to select potentially correct predictions placed among unreliable or incorrect ones. Results Application of a modified transitive Meta-BASIC searches on updated PFAM families and PDB structures resulted in detection of five new PD-(D/EXK nuclease families encompassing hundreds of so far uncharacterized and poorly annotated proteins. These include four families catalogued in PFAM database as domains of unknown function (DUF506, DUF524, DUF1626 and DUF1703 and YhgA-like family of putative transposases. Three of these families represent extremely distant homologs (DUF506, DUF524, and YhgA-like, while two are newly defined in updated database (DUF1626 and DUF1703. In addition, we also confidently identified an extended AAA-ATPase domain in the N-terminal region of DUF1703 family proteins. Conclusion Obtained results suggest that detailed analysis of below threshold Meta-BASIC hits may push limits further for distant homology detection in the 'midnight zone' of homology. All identified families conserve the core evolutionary fold, secondary structure and hydrophobic patterns common to existing PD-(D/EXK nucleases and maintain critical active site motifs that contribute to nucleic acid cleavage. Further experimental investigations should address the predicted activity and clarify potential substrates providing further insight into detailed biological role of these newly detected nucleases.

  15. Using Profile Elevation to Increase the Usefulness of the Self-Directed Search and Other Inventories

    Science.gov (United States)

    Bullock, Emily E.; Reardon, Robert C.

    2005-01-01

    The authors explore the construct of profile elevation on interest inventories results. They speak to the need for career counselors to maximize their use of the interest inventory and how they can do that by properly interpreting profile elevation. Profile elevation is defined, and correlates of high and low profile elevation are presented. The…

  16. A summary report on the search for current technologies and developers to develop depth profiling/physical parameter end effectors

    Energy Technology Data Exchange (ETDEWEB)

    Nguyen, Q.H.

    1994-09-12

    This report documents the search strategies and results for available technologies and developers to develop tank waste depth profiling/physical parameter sensors. Sources searched include worldwide research reports, technical papers, journals, private industries, and work at Westinghouse Hanford Company (WHC) at Richland site. Tank waste physical parameters of interest are: abrasiveness, compressive strength, corrosiveness, density, pH, particle size/shape, porosity, radiation, settling velocity, shear strength, shear wave velocity, tensile strength, temperature, viscosity, and viscoelasticity. A list of related articles or sources for each physical parameters is provided.

  17. Concurrent and aerobic exercise training promote similar benefits in body composition and metabolic profiles in obese adolescents.

    Science.gov (United States)

    Monteiro, Paula Alves; Chen, Kong Y; Lira, Fabio Santos; Saraiva, Bruna Thamyres Cicotti; Antunes, Barbara Moura Mello; Campos, Eduardo Zapaterra; Freitas, Ismael Forte

    2015-11-26

    The prevalence of obesity in pediatric population is increasing at an accelerated rate in many countries, and has become a major public health concern. Physical activity, particularly exercise training, remains to be a cornerstone of pediatric obesity interventions. The purpose of our current randomized intervention trial was to compare the effects of two types of training matched for training volume, aerobic and concurrent, on body composition and metabolic profile in obese adolescents. Thus the aim of the study was compare the effects of two types of training matched for training volume, aerobic and concurrent, on body composition and metabolic profile in obese adolescents. 32 obese adolescents participated in two randomized training groups, concurrent or aerobic, for 20 weeks (50 mins x 3 per week, supervised), and were compared to a 16-subject control group. We measured the percentage body fat (%BF, primary outcome), fat-free mass, percentage of android fat by dual energy x-ray absorptiometry, and others metabolic profiles at baseline and after interventions, and compared them between groups using the Intent-to-treat design. In 20 weeks, both exercise training groups significantly reduced %BF by 2.9-3.6% as compare to no change in the control group (p = 0.042). There were also positive changes in lipid levels in exercise groups. No noticeable changes were found between aerobic and concurrent training groups. The benefits of exercise in reducing body fat and metabolic risk profiles can be achieved by performing either type of training in obese adolescents. RBR-4HN597.

  18. Hydropathy profile alignment : a tool to search for structural homologues of membrane proteins

    NARCIS (Netherlands)

    Lolkema, JS; Slotboom, DJ

    1998-01-01

    Hydropathy profile alignment is introduced as a tool in functional genomics. The architecture of membrane proteins is reflected in the hydropathy profile of the amino acid sequence. Both secondary and tertiary structural elements determine the profile which provides enough sensitivity to detect

  19. Design of a bioactive small molecule that targets the myotonic dystrophy type 1 RNA via an RNA motif-ligand database and chemical similarity searching.

    Science.gov (United States)

    Parkesh, Raman; Childs-Disney, Jessica L; Nakamori, Masayuki; Kumar, Amit; Wang, Eric; Wang, Thomas; Hoskins, Jason; Tran, Tuan; Housman, David; Thornton, Charles A; Disney, Matthew D

    2012-03-14

    Myotonic dystrophy type 1 (DM1) is a triplet repeating disorder caused by expanded CTG repeats in the 3'-untranslated region of the dystrophia myotonica protein kinase (DMPK) gene. The transcribed repeats fold into an RNA hairpin with multiple copies of a 5'CUG/3'GUC motif that binds the RNA splicing regulator muscleblind-like 1 protein (MBNL1). Sequestration of MBNL1 by expanded r(CUG) repeats causes splicing defects in a subset of pre-mRNAs including the insulin receptor, the muscle-specific chloride ion channel, sarco(endo)plasmic reticulum Ca(2+) ATPase 1, and cardiac troponin T. Based on these observations, the development of small-molecule ligands that target specifically expanded DM1 repeats could be of use as therapeutics. In the present study, chemical similarity searching was employed to improve the efficacy of pentamidine and Hoechst 33258 ligands that have been shown previously to target the DM1 triplet repeat. A series of in vitro inhibitors of the RNA-protein complex were identified with low micromolar IC(50)'s, which are >20-fold more potent than the query compounds. Importantly, a bis-benzimidazole identified from the Hoechst query improves DM1-associated pre-mRNA splicing defects in cell and mouse models of DM1 (when dosed with 1 mM and 100 mg/kg, respectively). Since Hoechst 33258 was identified as a DM1 binder through analysis of an RNA motif-ligand database, these studies suggest that lead ligands targeting RNA with improved biological activity can be identified by using a synergistic approach that combines analysis of known RNA-ligand interactions with chemical similarity searching.

  20. Expression Profiles of Bovine Adeno-Associated Virus and Avian Adeno-Associated Virus Display Significant Similarity to That of Adeno-Associated Virus Type 5

    OpenAIRE

    Qiu, Jianming; Cheng, Fang; Pintel, David J.

    2006-01-01

    We present the first detailed expression profiles of nonprimate-derived adeno-associated viruses, namely, bovine adeno-associated virus (B-AAV) and avian adeno-associated virus (A-AAV), which were obtained after the infection of cell lines derived from their natural hosts. In general, the profiles of B-AAV and A-AAV were quite similar to that of AAV5; however, both exhibited features found for AAV2 as well. Like adeno-associated virus type 5 (AAV5), B-AAV and A-AAV utilized an internal polyad...

  1. Diagnostic Power of Broad Emission Line Profiles in Searches for Binary Supermassive Black Holes: Comparison of Models with Observations

    Science.gov (United States)

    Nguyen, Khai; Bogdanovic, Tamara; Eracleous, Michael; Runnoe, Jessie C.; Sigurdsson, Steinn

    2017-01-01

    Motivated by observational searches for sub-parsec supermassive black hole binaries (SBHBs) we develop a semi-analytic model to describe the spectral emission line signatures of these systems. We are particularly interested in modeling the profiles of the broad emission lines, which have been used as a tool to search for SBHBs. The goal of this work is to test one of the leading models of binary accretion flows in the literature: SBHB in a circumbinary disk. In this context, we model SBHB accretion flows as a set of three accretion disks: two mini-disks that are gravitationally bound to the individual black holes and a circumbinary disk that forms a common envelope about a gravitationally bound binary. Our first generation model shows that emission line profiles tend to have different statistical properties depending on the semi-major axis, mass ratio, eccentricity of the binary, and the alignment of the triple-disk system, and can in principle be used to constrain the statistical distribution of these parameters. We present the results of a second generation model, which improves upon the treatment of radiative transfer by taking into account the effect of line-driven winds on the properties of the model emission line profiles. This improvement allows a preliminary comparison of the model profiles with the observed SBHB candidates and AGN population in general.

  2. Genome-wide transcriptome profiling revealed cotton fuzz fiber development having a similar molecular model as Arabidopsis trichome.

    Directory of Open Access Journals (Sweden)

    Qun Wan

    Full Text Available The cotton fiber, as a single-celled trichome, is a biological model system for studying cell differentiation and elongation. However, the complexity of gene expression and regulation in the fiber complicates genetic research. In this study, we investigated the genome-wide transcriptome profiling in Texas Marker-1 (TM-1 and five naked seed or fuzzless mutants (three dominant and two recessive during the fuzz initial development stage. More than three million clean tags were generated from each sample representing the expression data for 27,325 genes, which account for 72.8% of the annotated Gossypium raimondii primary transcript genes. Thousands of differentially expressed genes (DEGs were identified between TM-1 and the mutants. Based on functional enrichment analysis, the DEGs downregulated in the mutants were enriched in protein synthesis-related genes and transcription factors, while DEGs upregulated in the mutants were enriched in DNA/chromatin structure-related genes and transcription factors. Pathway analysis showed that ATP synthesis, and sugar and lipid metabolism-related pathways play important roles in fuzz initial development. Also, we identified a large number of transcription factors such as MYB, bHLH, HB, WRKY, AP2/EREBP, bZIP and C2H2 zinc finger families that were differently expressed between TM-1 and the mutants, and were also related to trichome development in Arabidopsis.

  3. Genome-wide transcriptome profiling revealed cotton fuzz fiber development having a similar molecular model as Arabidopsis trichome.

    Science.gov (United States)

    Wan, Qun; Zhang, Hua; Ye, Wenxue; Wu, Huaitong; Zhang, Tianzhen

    2014-01-01

    The cotton fiber, as a single-celled trichome, is a biological model system for studying cell differentiation and elongation. However, the complexity of gene expression and regulation in the fiber complicates genetic research. In this study, we investigated the genome-wide transcriptome profiling in Texas Marker-1 (TM-1) and five naked seed or fuzzless mutants (three dominant and two recessive) during the fuzz initial development stage. More than three million clean tags were generated from each sample representing the expression data for 27,325 genes, which account for 72.8% of the annotated Gossypium raimondii primary transcript genes. Thousands of differentially expressed genes (DEGs) were identified between TM-1 and the mutants. Based on functional enrichment analysis, the DEGs downregulated in the mutants were enriched in protein synthesis-related genes and transcription factors, while DEGs upregulated in the mutants were enriched in DNA/chromatin structure-related genes and transcription factors. Pathway analysis showed that ATP synthesis, and sugar and lipid metabolism-related pathways play important roles in fuzz initial development. Also, we identified a large number of transcription factors such as MYB, bHLH, HB, WRKY, AP2/EREBP, bZIP and C2H2 zinc finger families that were differently expressed between TM-1 and the mutants, and were also related to trichome development in Arabidopsis.

  4. Discovery and electrophysiological characterization of SKF-32802: A novel hERG agonist found through a large-scale structural similarity search.

    Science.gov (United States)

    Donovan, Brian T; Bandyopadhyay, Deepak; Duraiswami, Chaya; Nixon, Christopher J; Townsend, Claire Y; Martens, Stan F

    2017-10-16

    Despite the importance of the hERG channel in drug discovery and the sizable number of antagonist molecules discovered, only a few hERG agonists have been discovered. Here we report a novel hERG agonist; SKF-32802 and a structural analog of the agonist NS3623, SB-335573. These were discovered through a similarity search of published hERG agonists. SKF-32802 incorporates an amide linker rather than NS3623's urea, resulting in a compound with a different mechanism of action. We find that both compounds decrease the time constant of open channel kinetics, increase the amplitude of the envelope of tails assay, mildly increased the amplitude of the IV curve, bind the hERG channel in either open or closed states, increase the plateau of the voltage dependence of activation and modulate the effects of the hERG antagonist, quinidine. Neither compound affects inactivation nor deactivation kinetics, a property unique among hERG agonists. Additionally, SKF-32802 induces a leftward shift in the voltage dependence of activation. Our structural models show that both compounds make strong bridging interactions with multiple channel subunits and are stabilized by internal hydrogen bonding similar to NS3623, PD-307243 and RPR26024. While SB-335573 binds in a nearly identical fashion as NS3623, SKF-32802 makes an additional hydrogen bond with neighboring threonine 623. In summary, SB-335573 is a type 4 agonist which increases open channel probability while SKF-32802 is a type 3 agonist which induces a leftward shift in the voltage dependence of activation. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. High similarity of phylogenetic profiles of rate-limiting enzymes with inhibitory relation in Human, Mouse, Rat, budding Yeast and E. coli

    Science.gov (United States)

    2011-01-01

    Background The phylogenetic profile is widely used to characterize functional linkage and conservation between proteins without amino acid sequence similarity. To survey the conservative regulatory properties of rate-limiting enzymes (RLEs) in metabolic inhibitory network across different species, we define the enzyme inhibiting pair as: where the first enzyme in a pair is the inhibitor provider and the second is the target of the inhibitor. Phylogenetic profiles of enzymes in the inhibiting pairs are further generated to measure the functional linkage of these enzymes during evolutionary history. Results We find that the RLEs generate, on average, over half of all in vivo inhibitors in each surveyed model organism. And these inhibitors inhibit on average over 85% targets in metabolic inhibitory network and cover the majority of targets of cross-pathway inhibiting relations. Furthermore, we demonstrate that the phylogenetic profiles of the enzymes in inhibiting pairs in which at least one enzyme is rate-limiting often show higher similarities than those in common inhibiting enzyme pairs. In addition, RLEs, compared to common metabolic enzymes, often tend to produce ADP instead of AMP in conservative inhibitory networks. Conclusions Combined with the conservative roles of RLEs in their efficiency in sensing metabolic signals and transmitting regulatory signals to the rest of the metabolic system, the RLEs may be important molecules in balancing energy homeostasis via maintaining the ratio of ATP to ADP in living cells. Furthermore, our results indicate that similarities of phylogenetic profiles of enzymes in the inhibiting enzyme pairs are not only correlated with enzyme topological importance, but also related with roles of the enzymes in metabolic inhibitory network. PMID:22369203

  6. Mouse Obox and Crxos modulate preimplantation transcriptional profiles revealing similarity between paralogous mouse and human homeobox genes

    Directory of Open Access Journals (Sweden)

    Amy H. Royall

    2018-01-01

    Full Text Available Abstract Background ETCHbox genes are eutherian-specific homeobox genes expressed during preimplantation development at a time when the first cell lineage decisions are being made. The mouse has an unusual repertoire of ETCHbox genes with several gene families lost in evolution and the remaining two, Crxos and Obox, greatly divergent in sequence and number. Each has undergone duplication to give a double homeodomain Crxos locus and a large cluster of over 60 Obox loci. The gene content differences between species raise important questions about how evolution can tolerate loss of genes implicated in key developmental events. Results We find that Crxos internal duplication occurred in the mouse lineage, while Obox duplication was stepwise, generating subgroups with distinct sequence and expression. Ectopic expression of three Obox genes and a Crxos transcript in primary mouse embryonic cells followed by transcriptome sequencing allowed investigation into their functional roles. We find distinct transcriptomic influences for different Obox subgroups and Crxos, including modulation of genes related to zygotic genome activation and preparation for blastocyst formation. Comparison with similar experiments performed using human homeobox genes reveals striking overlap between genes downstream of mouse Crxos and genes downstream of human ARGFX. Conclusions Mouse Crxos and human ARGFX homeobox genes are paralogous rather than orthologous, yet they have evolved to regulate a common set of genes. This suggests there was compensation of function alongside gene loss through co-option of a different locus. Functional compensation by non-orthologous genes with dissimilar sequences is unusual but may indicate underlying distributed robustness. Compensation may be driven by the strong evolutionary pressure for successful early embryo development.

  7. Transcriptional profile of breast muscle in heat stressed layers is similar to that of broiler chickens at control temperature.

    Science.gov (United States)

    Zahoor, Imran; de Koning, Dirk-Jan; Hocking, Paul M

    2017-09-20

    In recent years, the commercial importance of changes in muscle function of broiler chickens and of the corresponding effects on meat quality has increased. Furthermore, broilers are more sensitive to heat stress during transport and at high ambient temperatures than smaller egg-laying chickens. We hypothesised that heat stress would amplify muscle damage and expression of genes that are involved in such changes and, thus, lead to the identification of pathways and networks associated with broiler muscle and meat quality traits. Broiler and layer chickens were exposed to control or high ambient temperatures to characterise differences in gene expression between the two genotypes and the two environments. Whole-genome expression studies in breast muscles of broiler and layer chickens were conducted before and after heat stress; 2213 differentially-expressed genes were detected based on a significant (P broilers compared with control and heat-stressed layers. Expression of these genes was further increased in heat-stressed broilers. Differences in gene expression between broiler and layer chickens under control and heat stress conditions suggest that damage of breast muscles in broilers at normal ambient temperatures is similar to that in heat-stressed layers and is amplified when broilers are exposed to heat stress. The patterns of gene expression of the two genotypes under heat stress were almost the polar opposite of each other, which is consistent with the conclusion that broiler chickens were not able to cope with heat stress by dissipating their body heat. The differentially expressed gene networks and pathways were consistent with the pathological changes that are observed in the breast muscle of heat-stressed broilers.

  8. A spin transfer torque magnetoresistance random access memory-based high-density and ultralow-power associative memory for fully data-adaptive nearest neighbor search with current-mode similarity evaluation and time-domain minimum searching

    Science.gov (United States)

    Ma, Yitao; Miura, Sadahiko; Honjo, Hiroaki; Ikeda, Shoji; Hanyu, Takahiro; Ohno, Hideo; Endoh, Tetsuo

    2017-04-01

    A high-density nonvolatile associative memory (NV-AM) based on spin transfer torque magnetoresistive random access memory (STT-MRAM), which achieves highly concurrent and ultralow-power nearest neighbor search with full adaptivity of the template data format, has been proposed and fabricated using the 90 nm CMOS/70 nm perpendicular-magnetic-tunnel-junction hybrid process. A truly compact current-mode circuitry is developed to realize flexibly controllable and high-parallel similarity evaluation, which makes the NV-AM adaptable to any dimensionality and component-bit of template data. A compact dual-stage time-domain minimum searching circuit is also developed, which can freely extend the system for more template data by connecting multiple NM-AM cores without additional circuits for integrated processing. Both the embedded STT-MRAM module and the computing circuit modules in this NV-AM chip are synchronously power-gated to completely eliminate standby power and maximally reduce operation power by only activating the currently accessed circuit blocks. The operations of a prototype chip at 40 MHz are demonstrated by measurement. The average operation power is only 130 µW, and the circuit density is less than 11 µm2/bit. Compared with the latest conventional works in both volatile and nonvolatile approaches, more than 31.3% circuit area reductions and 99.2% power improvements are achieved, respectively. Further power performance analyses are discussed, which verify the special superiority of the proposed NV-AM in low-power and large-memory-based VLSIs.

  9. Appropriately differentiated ARPE-19 cells regain phenotype and gene expression profiles similar to those of native RPE cells.

    Science.gov (United States)

    Samuel, William; Jaworski, Cynthia; Postnikova, Olga A; Kutty, R Krishnan; Duncan, Todd; Tan, Li Xuan; Poliakov, Eugenia; Lakkaraju, Aparna; Redmond, T Michael

    2017-01-01

    of 2.5 or more differences between 4 months and 4 days of culture. Gene Ontology analysis showed that the upregulated genes were associated with visual cycle, phagocytosis, pigment synthesis, cell differentiation, and RPE-related transcription factors. The majority of the downregulated genes play a role in cell cycle and proliferation. The ARPE-19 cells cultured for 4 months developed a phenotype characteristic of native RPE and expressed proteins, mRNAs, and miRNAs characteristic of the RPE. Comparison of the ARPE-19 RNA-Seq data set with that of primary human fetal RPE, embryonic stem cell-derived RPE, and native RPE revealed an important overall similar expression ratio among all the models and native tissue. However, none of the cultured models reached the absolute values in the native tissue. The results of this study demonstrate that low-passage ARPE-19 cells can express genes specific to native human RPE cells when appropriately cultured and differentiated.

  10. Transcript profiling of cytokinin action in Arabidopsis roots and shoots discovers largely similar but also organ-specific responses

    Directory of Open Access Journals (Sweden)

    Brenner Wolfram G

    2012-07-01

    the growth response of roots and shoots to the hormone, the vast majority of the cytokinin-regulated transcriptome showed similar response patterns in roots and shoots. Conclusions The shift of the root and shoot transcriptomes towards the respective other organ depending on the cytokinin status indicated that the hormone determines part of the organ-specific transcriptome pattern independent of morphological organ identity. Numerous novel cytokinin-regulated genes were discovered which had escaped earlier discovery, most probably due to unspecific sampling. These offer novel insights into the diverse activities of cytokinin, including crosstalk with other hormones and different environmental cues, identify the AP2/ERF class of transcriptions factors as particularly cytokinin sensitive, and also suggest translational control of cytokinin-induced changes.

  11. Transcript profiling of cytokinin action in Arabidopsis roots and shoots discovers largely similar but also organ-specific responses

    Science.gov (United States)

    2012-01-01

    roots and shoots to the hormone, the vast majority of the cytokinin-regulated transcriptome showed similar response patterns in roots and shoots. Conclusions The shift of the root and shoot transcriptomes towards the respective other organ depending on the cytokinin status indicated that the hormone determines part of the organ-specific transcriptome pattern independent of morphological organ identity. Numerous novel cytokinin-regulated genes were discovered which had escaped earlier discovery, most probably due to unspecific sampling. These offer novel insights into the diverse activities of cytokinin, including crosstalk with other hormones and different environmental cues, identify the AP2/ERF class of transcriptions factors as particularly cytokinin sensitive, and also suggest translational control of cytokinin-induced changes. PMID:22824128

  12. Mouse gastric tumor models with prostaglandin E2 pathway activation show similar gene expression profiles to intestinal-type human gastric cancer

    Directory of Open Access Journals (Sweden)

    Oshima Masanobu

    2009-12-01

    Full Text Available Abstract Background Gastric cancers are generally classified into better differentiated intestinal-type tumor and poorly differentiated diffuse-type one according to Lauren's histological categorization. Although induction of prostaglandin E2 pathway promotes gastric tumors in mice in cooperation with deregulated Wnt or BMP signalings, it has remained unresolved whether the gastric tumor mouse models recapitulate either of human gastric cancer type. This study assessed the similarity in expression profiling between gastric tumors of transgenic mice and various tissues of human cancers to find best-fit human tumors for the transgenic mice models. Results Global expression profiling initially found gastric tumors from COX-2/mPGES-1 (C2mE-related transgenic mice (K19-C2mE, K19-Wnt1/C2mE, and K19-Nog/C2mE resembled gastric cancers among the several tissues of human cancers including colon, breast, lung and gastric tumors. Next, classification of the C2mE-related transgenic mice by a gene signature to distinguish human intestinal- and diffuse-type tumors showed C2mE-related transgenic mice were more similar to intestinal-type compared with diffuse one. We finally revealed that induction of Wnt pathway cooperating with the prostaglandin E2 pathway in mice (K19-Wnt1/C2mE mice further reproduce features of human gastric intestinal-type tumors. Conclusion We demonstrated that C2mE-related transgenic mice show significant similarity to intestinal-type gastric cancer when analyzed by global expression profiling. These results suggest that the C2mE-related transgenic mice, especially K19-Wnt1/C2mE mice, serve as a best-fit model to study molecular mechanism underlying the tumorigenesis of human gastric intestinal-type cancers.

  13. [Ablation profiles in refractive surgery. Part 1: in search of excellence].

    Science.gov (United States)

    Smadja, D; Reggiani-Mello, G; Touboul, D; Colin, J

    2012-02-01

    To provide an overview of the clinical results of various ablation profiles and discuss their technical characteristics and limitations. Literature review. With the emergence of wavefront technology, new photoablation profiles have been developed, allowing for customization of refractive treatments and reduction of nocturnal visual symptoms, which adversely affect the reputation of refractive surgery. Over the past decade, several comparative studies have been published in the literature aiming to demonstrate either the superiority of wavefront-guided correction over conventional, or one laser platform over another. However, has an ideal treatment algorithm really emerged from these studies? Does one ablation profile clearly demonstrate superiority over another, in terms of visual performance? Despite technological advances as well as improved visual results for custom versus conventional photoablation, the promise of excellence in visual performance has not been achieved with these various technologies. The concept of an individualized eye model has emerged recently, based on an optical ray tracing algorithm, and could theoretically provide an ideal ablation profile, thus fulfilling the promise of "supernormal vision". Copyright © 2011 Elsevier Masson SAS. All rights reserved.

  14. Stress and Schooling: A Search for Stress Profiles of Adolescent Students.

    Science.gov (United States)

    Thomas, Elwyn

    This study examined how a sample of pre-university Singaporean adolescents (N=212) with an average age of 17 years 5 months attending a variety of junior colleges perceived and managed different sources of stress. Measures of stress included Behavior Profile, Life Events, School Stress Factors, Home-School Stress Conditions and Stress Symptoms.…

  15. Collaborative Ranking and Profiling: Exploiting the Wisdom of Crowds in Tailored Web Search

    OpenAIRE

    Felber, Pascal; Kropf, Peter; Leonini, Lorenzo; Luu, Toan; Rajman, Martin; Rivière, Etienne

    2010-01-01

    International audience; Popular search engines essentially rely on information about the structure of the graph of linked elements to find the most relevant results for a given query. While this approach is satisfactory for popular interest domains or when the user expectations follow the main trend, it is very sensitive to the case of ambiguous queries, where queries can have answers over several different domains. Elements pertaining to an implicitly targeted interest domain with low popula...

  16. General expression profiles of human native odontoblasts and pulp-derived cultured odontoblast-like cells are similar but reveal differential neuropeptide expression levels.

    Science.gov (United States)

    Pääkkönen, Virve; Bleicher, Françoise; Carrouel, Florence; Vuoristo, Jussi T; Salo, Tuula; Wappler, Ilka; Couble, Marie-Lise; Magloire, Henry; Peters, Heiko; Tjäderhane, Leo

    2009-01-01

    Odontoblasts play a central role during the dentin formation by organic matrix production and mineralisation. Recently, suitable in vitro techniques for studying mature primary odontoblasts and the newly differentiated odontoblasts have been developed. Firstly, the gene expression profiles of native and cultured odontoblasts were compared at large-scale to investigate the similarities and differences between the samples. Secondly, differential expression levels of the genes encoding neuronal proteins were analyzed to study odontoblasts sensory function. Microarray analysis was performed to mature native and cultured pulp-derived odontoblast-like cells to compare their transcriptome. Then, the probes positive only in one sample were divided into gene ontology categories. Expression levels of selected neuronal proteins were further studied with quantitative PCR, and at the protein level by immunofluorescence of mature and newly differentiated odontoblasts in developing tooth. Remarkable similarities between the general and neuronal protein gene expression profiles were observed. Higher cortistatin, galanin, somatostatin receptor 1 (SSTR1) and tyrosine phosphatase receptor type Z1 (PTPRZ1) expression was detected in native than in cultured odontoblast at the mRNA level. Pronociceptin was more abundantly expressed in cultured than in native odontoblasts. Immunofluorescence of mature and newly differentiated odontoblasts on human tooth germs confirmed the results. Cultured odontoblasts used in this study have similar general gene expression pattern to native odontoblasts, and therefore offer a valuable tool for the in vitro odontoblast studies. The expression of PTPRZ1 and galanin, which participate in sensory signal transduction, supports the previously suggested role of odontoblasts as sensory cells.

  17. Untargeted metabolomic profiling plasma samples of patients with lung cancer for searching significant metabolites by HPLC-MS method

    Science.gov (United States)

    Dementeva, N.; Ivanova, K.; Kokova, D.; Kurzina, I.; Ponomaryova, A.; Kzhyshkowska, J.

    2017-09-01

    Lung cancer is one of the most common types of cancer leading to death. Consequently, the search and the identification of the metabolites associated with the risk of developing cancer are very valuable. For the purpose, untargeted metabolic profiling of the plasma samples collected from the patients with lung cancer (n = 100) and the control group (n = 100) was conducted. After sample preparation, the plasma samples were analyzed using LC-MS method. Biostatistics methods were applied to pre-process the data for elicitation of dominating metabolites which responded to the difference between the case and the control groups. At least seven significant metabolites were evaluated and annotated. The most part of identified metabolites are connected with lipid metabolism and their combination could be useful for follow-up studies of lung cancer pathogenesis.

  18. In Search of the Right Questions: Language Background Profiling at Ontario Public Schools

    Directory of Open Access Journals (Sweden)

    Nikolay Slavkov

    2016-06-01

    Full Text Available Abstract This article examines some of the challenges that the notion of a (monolingual native speaker faces in a global context of increasing awareness that bilingualism and multilingualism are the norm rather than the exception. It also discusses the distinction between two child language acquisition environments, bilingual first language acquisition and early second language acquisition, which can lead to bilingual or multilingual outcomes early on in life. This serves as a backdrop for a study of language profiling practices in public schools across the province of Ontario. Student registration forms from 44 district school boards were analysed with regard to the number, type and combination patterns of language background questions. The findings indicate that school boards are aware of the potentially diverse linguistic backgrounds of incoming students, but may not be conceptually or methodologically equipped to recognize the full spectrum of linguistic complexity involved. Some degree of standardization of language background profiling across different districts is recommended as a measure that may benefit the province. Résumé Cet article examine certains défis auxquels la notion d’un locuteur natif (monolingue fait face dans un contexte mondial où il existe une reconnaissance croissante que le bilinguisme et le multilinguisme représentent la norme plutôt que l’exception. L’article aborde également la distinction entre deux contextes d’acquisition de la langue chez les enfants, ceux-ci étant l'acquisition bilingue de la langue première et l’acquisition précoce de la langue seconde, qui peuvent mener à des résultats bilingues ou multilingues tôt dans la vie. Ceci sert de toile de fond pour une étude des pratiques de profilage linguistique dans des écoles publiques à travers la province de l’Ontario. Des formulaires d’inscription d’élèves provenant de 44 conseils scolaires ont été analysés en termes du

  19. Searching for degenerate Higgs bosons using a profile likelihood ratio method

    CERN Document Server

    Heikkilä, Jaana

    ATLAS and CMS collaborations at the Large Hadron Collider have observed a new resonance con- sistent with the standard model Higgs boson. However, it has been suggested that the observed signal could also be produced by multiple nearly mass-degenerate states that couple differently to the standard model particles. In this work, a method to discriminate between the hypothesis of a single Higgs boson and that of multiple mass-degenerate Higgs bosons was developed. Using the matrix of measured signal strengths in different production and decay modes, parametrizations for the two hypotheses were constructed as a general rank 1 matrix and the most general $5 \\times 4$ matrix, respectively. The test statistic was defined as a ratio of profile likelihoods for the two hypotheses. The method was applied to the CMS measurements. The expected test statistic distribution was estimated twice by generating pseudo-experiments according to both the standard model hypothesis and the single Higgs boson hypothesis best fitting...

  20. Genome-wide mapping indicates that p73 and p63 co-occupy target sites and have similar dna-binding profiles in vivo.

    Directory of Open Access Journals (Sweden)

    Annie Yang

    2010-07-01

    Full Text Available The p53 homologs, p63 and p73, share approximately 85% amino acid identity in their DNA-binding domains, but they have distinct biological functions.Using chromatin immunoprecipitation and high-resolution tiling arrays covering the human genome, we identify p73 DNA binding sites on a genome-wide level in ME180 human cervical carcinoma cells. Strikingly, the p73 binding profile is indistinguishable from the previously described binding profile for p63 in the same cells. Moreover, the p73:p63 binding ratio is similar at all genomic loci tested, suggesting that there are few, if any, targets that are specific for one of these factors. As assayed by sequential chromatin immunoprecipitation, p63 and p73 co-occupy DNA target sites in vivo, suggesting that p63 and p73 bind primarily as heterotetrameric complexes in ME180 cells.The observation that p63 and p73 associate with the same genomic targets suggest that their distinct biological functions are due to cell-type specific expression and/or protein domains that involve functions other than DNA binding.

  1. Similar Metabolic, Innate Immunity, and Adipokine Profiles in Adult and Pediatric Sepsis Versus Systemic Inflammatory Response Syndrome-A Pilot Study.

    Science.gov (United States)

    Tavladaki, Theonymfi; Spanaki, Anna Maria; Dimitriou, Helen; Kondili, Efmorfia; Choulaki, Christianna; Georgopoulos, Dimitris; Briassoulis, George

    2017-11-01

    To examine whether the septic profiles of heat shock protein 72, heat shock protein 90α, resistin, adiponectin, oxygen consumption, CO2 production, energy expenditure, and metabolic pattern, along with illness severity, nutritional, and inflammatory indices, differ between adult and pediatric patients compared with systemic inflammatory response syndrome and healthy controls. To evaluate whether these biomolecules may discriminate sepsis from systemic inflammatory response syndrome in adult and pediatric patients. Prospective cohort study. University ICU and PICU. Seventy-eight adults (sepsis/23; systemic inflammatory response syndrome/23; healthy controls/33), 67 children (sepsis/18; systemic inflammatory response syndrome/23; controls/27), mechanically ventilated. None. Flow cytometry determined mean fluorescence intensity for monocyte or neutrophil heat shock protein expression. Resistin, adiponectin, and extracellular heat shock proteins were measured using enzyme-linked immunosorbent assay; energy expenditure by E-COVX (GE Healthcare). Genomic DNA was extracted with PureLink Genomic DNA kit (Invitrogen, Carlsbad, CA) to detect heat shock protein 72 single nucleotide polymorphisms. Similarly, in adult and pediatric patients, Acute Physiology and Chronic Evaluation-II/Acute Physiology and Pediatric Risk of Mortality-III, Simplified Acute Physiology Score-III, C-reactive protein, lactate, and resistin were higher and myocardial contractility, monocyte heat shock protein 72, oxygen consumption, CO2 production, energy expenditure, metabolic pattern, glucose, and albumin lower in sepsis compared with systemic inflammatory response syndrome or controls (p shock protein 90α, and lactate achieved a receiver operating characteristic curve greater than 0.80 in children and greater than 0.75 in adults (p shock protein 72 analysis did not disclose any diagnosis or mortality group differences regarding either rs6457452 or rs1061581 haplotypes. Sepsis presents with similar

  2. Similar Spectral Power Densities Within the Schumann Resonance and a Large Population of Quantitative Electroencephalographic Profiles: Supportive Evidence for Koenig and Pobachenko

    Science.gov (United States)

    Saroka, Kevin S.; Vares, David E.; Persinger, Michael A.

    2016-01-01

    In 1954 and 1960 Koenig and his colleagues described the remarkable similarities of spectral power density profiles and patterns between the earth-ionosphere resonance and human brain activity which also share magnitudes for both electric field (mV/m) and magnetic field (pT) components. In 2006 Pobachenko and colleagues reported real time coherence between variations in the Schumann and brain activity spectra within the 6–16 Hz band for a small sample. We examined the ratios of the average potential differences (~3 μV) obtained by whole brain quantitative electroencephalography (QEEG) between rostral-caudal and left-right (hemispheric) comparisons of 238 measurements from 184 individuals over a 3.5 year period. Spectral densities for the rostral-caudal axis revealed a powerful peak at 10.25 Hz while the left-right peak was 1.95 Hz with beat-differences of ~7.5 to 8 Hz. When global cerebral measures were employed, the first (7–8 Hz), second (13–14 Hz) and third (19–20 Hz) harmonics of the Schumann resonances were discernable in averaged QEEG profiles in some but not all participants. The intensity of the endogenous Schumann resonance was related to the ‘best-of-fitness’ of the traditional 4-class microstate model. Additional measurements demonstrated real-time coherence for durations approximating microstates in spectral power density variations between Schumann frequencies measured in Sudbury, Canada and Cumiana, Italy with the QEEGs of local subjects. Our results confirm the measurements reported by earlier researchers that demonstrated unexpected similarities in the spectral patterns and strengths of electromagnetic fields generated by the human brain and the earth-ionospheric cavity. PMID:26785376

  3. Similar Spectral Power Densities Within the Schumann Resonance and a Large Population of Quantitative Electroencephalographic Profiles: Supportive Evidence for Koenig and Pobachenko.

    Directory of Open Access Journals (Sweden)

    Kevin S Saroka

    Full Text Available In 1954 and 1960 Koenig and his colleagues described the remarkable similarities of spectral power density profiles and patterns between the earth-ionosphere resonance and human brain activity which also share magnitudes for both electric field (mV/m and magnetic field (pT components. In 2006 Pobachenko and colleagues reported real time coherence between variations in the Schumann and brain activity spectra within the 6-16 Hz band for a small sample. We examined the ratios of the average potential differences (~3 μV obtained by whole brain quantitative electroencephalography (QEEG between rostral-caudal and left-right (hemispheric comparisons of 238 measurements from 184 individuals over a 3.5 year period. Spectral densities for the rostral-caudal axis revealed a powerful peak at 10.25 Hz while the left-right peak was 1.95 Hz with beat-differences of ~7.5 to 8 Hz. When global cerebral measures were employed, the first (7-8 Hz, second (13-14 Hz and third (19-20 Hz harmonics of the Schumann resonances were discernable in averaged QEEG profiles in some but not all participants. The intensity of the endogenous Schumann resonance was related to the 'best-of-fitness' of the traditional 4-class microstate model. Additional measurements demonstrated real-time coherence for durations approximating microstates in spectral power density variations between Schumann frequencies measured in Sudbury, Canada and Cumiana, Italy with the QEEGs of local subjects. Our results confirm the measurements reported by earlier researchers that demonstrated unexpected similarities in the spectral patterns and strengths of electromagnetic fields generated by the human brain and the earth-ionospheric cavity.

  4. Prion Strain Differences in Accumulation of PrPSc on Neurons and Glia Are Associated with Similar Expression Profiles of Neuroinflammatory Genes: Comparison of Three Prion Strains.

    Science.gov (United States)

    Carroll, James A; Striebel, James F; Rangel, Alejandra; Woods, Tyson; Phillips, Katie; Peterson, Karin E; Race, Brent; Chesebro, Bruce

    2016-04-01

    Misfolding and aggregation of host proteins are important features of the pathogenesis of neurodegenerative diseases including Alzheimer's disease, Parkinson's disease, frontotemporal dementia and prion diseases. In all these diseases, the misfolded protein increases in amount by a mechanism involving seeded polymerization. In prion diseases, host prion protein is misfolded to form a pathogenic protease-resistant form, PrPSc, which accumulates in neurons, astroglia and microglia in the CNS. Here using dual-staining immunohistochemistry, we compared the cell specificity of PrPSc accumulation at early preclinical times post-infection using three mouse scrapie strains that differ in brain regional pathology. PrPSc from each strain had a different pattern of cell specificity. Strain 22L was mainly associated with astroglia, whereas strain ME7 was mainly associated with neurons and neuropil. In thalamus and cortex, strain RML was similar to 22L, but in substantia nigra, RML was similar to ME7. Expression of 90 genes involved in neuroinflammation was studied quantitatively using mRNA from thalamus at preclinical times. Surprisingly, despite the cellular differences in PrPSc accumulation, the pattern of upregulated genes was similar for all three strains, and the small differences observed correlated with variations in the early disease tempo. Gene upregulation correlated with activation of both astroglia and microglia detected in early disease prior to vacuolar pathology or clinical signs. Interestingly, the profile of upregulated genes in scrapie differed markedly from that seen in two acute viral CNS diseases (LaCrosse virus and BE polytropic Friend retrovirus) that had reactive gliosis at levels similar to our prion-infected mice.

  5. Avaliação do perfil de dissolução de comprimidos de ciprofloxacino 250 mg comercializados como similares no Brasil Dissolution profile evaluation of ciprofloxacin 250 mg tablets marketed as similar drugs in Brazil

    Directory of Open Access Journals (Sweden)

    Nilson Oliveira Gonçalves Pita

    2004-09-01

    Full Text Available O presente trabalho avaliou, in vitro, oito especialidades farmacêuticas contendo ciprofloxacino (250 mg, sendo o produto C, o medicamento referência (Cipro® fabricado pela Bayer® S.A. e A, B, D, E, F, G e H medicamentos considerados similares. Além do teste de dissolução preconizado pela Farmacopéia americana, foi traçado o perfil de dissolução de cada amostra a partir do qual foram calculados parâmetros de eficiência de dissolução (ED. O preço máximo ao consumidor também foi levantado, para que fosse possível compará-lo com o desempenho dos produtos. Os resultados obtidos demonstram que todas as amostras atendem aos critérios especificados pelo teste de dissolução descrito na USP 24 ed., mas somente os produtos B, C, E e H foram aprovados no primeiro estágio (S1. Após tratamento estatístico empregando-se ANOVA e teste de Tukey, para os valores de eficiência de dissolução, notou-se que os produtos A e D, B e C, B e E e C e H são semelhantes quanto à liberação do fármaco in vitro. Entretanto, os produtos que mostraram melhor desempenho quanto à liberação de ciprofloxacino in vitro não são os mesmos que se apresentam como melhor opção de custo ao consumidor.The present study has assessed, in vitro, eight pharmaceutical brands containing ciprofloxacin (250 mg. The product C is the reference drug (Cipro® - Bayer® S/A and A, B, D, E, F, G and H are similar drugs. The following physicochemical tests have been carried out: dissolution test and dissolution profile, from which dissolution efficiency (DE have been calculate. The maximum price for consumer has also been obtained, in order that it was possible to compare it with the performance of the products. The results show that all the produts comply with USP specifications, but only the products, B, C, E and H have been approved on the first stage (S1. After statistic treatment, by applying ANOVA and Tukey test for DE values, it has been noted that the

  6. Compensating for literature annotation bias when predicting novel drug-disease relationships through Medical Subject Heading Over-representation Profile (MeSHOP) similarity.

    Science.gov (United States)

    Cheung, Warren A; Ouellette, B F Francis; Wasserman, Wyeth W

    2013-01-01

    Using annotations to the articles in MEDLINE®/PubMed®, over six thousand chemical compounds with pharmacological actions have been tracked since 1996. Medical Subject Heading Over-representation Profiles (MeSHOPs) quantitatively leverage the literature associated with biological entities such as diseases or drugs, providing the opportunity to reposition known compounds towards novel disease applications. A MeSHOP is constructed by counting the number of times each medical subject term is assigned to an entity-related research publication in the MEDLINE database and calculating the significance of the count by comparing against the count of the term in a background set of publications. Based on the expectation that drugs suitable for treatment of a disease (or disease symptom) will have similar annotation properties to the disease, we successfully predict drug-disease associations by comparing MeSHOPs of diseases and drugs. The MeSHOP comparison approach delivers an 11% improvement over bibliometric baselines. However, novel drug-disease associations are observed to be biased towards drugs and diseases with more publications. To account for the annotation biases, a correction procedure is introduced and evaluated. By explicitly accounting for the annotation bias, unexpectedly similar drug-disease pairs are highlighted as candidates for drug repositioning research. MeSHOPs are shown to provide a literature-supported perspective for discovery of new links between drugs and diseases based on pre-existing knowledge.

  7. Proteomic profiling of the planarian Schmidtea mediterranea and its mucous reveals similarities with human secretions and those predicted for parasitic flatworms.

    Science.gov (United States)

    Bocchinfuso, Donald G; Taylor, Paul; Ross, Eric; Ignatchenko, Alex; Ignatchenko, Vladimir; Kislinger, Thomas; Pearson, Bret J; Moran, Michael F

    2012-09-01

    The freshwater planarian Schmidtea mediterranea has been used in research for over 100 years, and is an emerging stem cell model because of its capability of regenerating large portions of missing body parts. Exteriorly, planarians are covered in mucous secretions of unknown composition, implicated in locomotion, predation, innate immunity, and substrate adhesion. Although the planarian genome has been sequenced, it remains mostly unannotated, challenging both genomic and proteomic analyses. The goal of the current study was to annotate the proteome of the whole planarian and its mucous fraction. The S. mediterranea proteome was analyzed via mass spectrometry by using multidimensional protein identification technology with whole-worm tryptic digests. By using a proteogenomics approach, MS data were searched against an in silico translated planarian transcript database, and by using the Swiss-Prot BLAST algorithm to identify proteins similar to planarian queries. A total of 1604 proteins were identified. The mucous subproteome was defined through analysis of a mucous trail fraction and an extract obtained by treating whole worms with the mucolytic agent N-acetylcysteine. Gene Ontology analysis confirmed that the mucous fractions were enriched with secreted proteins. The S. mediterranea proteome is highly similar to that predicted for the trematode Schistosoma mansoni associated with intestinal schistosomiasis, with the mucous subproteome particularly highly conserved. Remarkably, orthologs of 119 planarian mucous proteins are present in human mucosal secretions and tear fluid. We suggest planarians have potential to be a model system for the characterization of mucous protein function and relevant to parasitic flatworm infections and diseases underlined by mucous aberrancies, such as cystic fibrosis, asthma, and other lung diseases.

  8. Similarity and modeling of axial CO and NO concentration profiles in a fluidized-bed combustor (co-)firing biomass fuels

    Energy Technology Data Exchange (ETDEWEB)

    Kasama Janvijitsakul; Vladimir I. Kuprianov [Thammasat University, Pathum Thani (Thailand). School of Manufacturing Systems and Mechanical Engineering

    2008-07-15

    Some common features and trends of the behavior of CO and NO along the height in a conical fluidized-bed combustor (FBC) firing rice husk and pre-dried sugarcane bagasse, and co-firing 'as-received' rice husk with 'as-received' sugarcane bagasse, for wide ranges of fuel properties and operating conditions, were summarized. Empirical models for predicting axial CO and NO concentration profiles in this fluidized-bed combustor were developed based on analysis and treatment of experimental results. Supporting correlations, including the effects of excess air, the bed temperature and fuel properties, were proposed for estimating the peak of the CO concentration (CO{sub max}) and NO concentration (NO{sub max}) located in the reactor at a level of XCOmax and XNOmax, respectively, above the air distributor. These models secured the computational error of {+-}25% for COmax and {+-}20% for NOmax. Experimental dependencies of the relative gas concentration on the relative axial distance, CO/COmax=f(X/XCOmax) and NO/NOmax=f(X/XNOmax), were found to show an apparent similarity for all the fuels and firing options analyzed in this work, and could be, therefore, approximated by the fitting equations at R{sup 2} = 0.74-0.90. With the proposed equations, CO and NO concentrations can be estimated for any arbitrary level (X) above the air distributor in the conical FBC, for the particular (selected) fuel properties and operating conditions. 19 refs., 11 figs., 2 tabs.

  9. Combination of 2D/3D Ligand-Based Similarity Search in Rapid Virtual Screening from Multimillion Compound Repositories. Selection and Biological Evaluation of Potential PDE4 and PDE5 Inhibitors

    Directory of Open Access Journals (Sweden)

    Krisztina Dobi

    2014-05-01

    Full Text Available Rapid in silico selection of target focused libraries from commercial repositories is an attractive and cost effective approach. If structures of active compounds are available rapid 2D similarity search can be performed on multimillion compound databases but the generated library requires further focusing by various 2D/3D chemoinformatics tools. We report here a combination of the 2D approach with a ligand-based 3D method (Screen3D which applies flexible matching to align reference and target compounds in a dynamic manner and thus to assess their structural and conformational similarity. In the first case study we compared the 2D and 3D similarity scores on an existing dataset derived from the biological evaluation of a PDE5 focused library. Based on the obtained similarity metrices a fusion score was proposed. The fusion score was applied to refine the 2D similarity search in a second case study where we aimed at selecting and evaluating a PDE4B focused library. The application of this fused 2D/3D similarity measure led to an increase of the hit rate from 8.5% (1st round, 47% inhibition at 10 µM to 28.5% (2nd round at 50% inhibition at 10 µM and the best two hits had 53 nM inhibitory activities.

  10. Charting the cellular and extracellular proteome analysis of Brevibacterium linens DSM 20158 with unsequenced genome by mass spectrometry-driven sequence similarity searches.

    Science.gov (United States)

    Shabbiri, Khadija; Botting, Catherine H; Adnan, Ahmad; Fuszard, Matthew

    2013-05-27

    Brevibacterium linens DSM 20158 is an industrially important actinobacterium which is well-known for the production of amino acids and enzymes. However, as this strain has an unsequenced genome, there is no detailed information regarding its proteome although another strain of this microbe, BL2, has a shotgun genome sequence. However, this still does not cover the entire scope of its proteome. The present study is carried out by first identifying proteins by homology matches using the Mascot search algorithm followed by an advanced approach using de novo sequencing and MS BLAST to expand the B. linens proteome. The proteins identified in the secretome and cellular portion appear to be involved in various metabolic and physiological processes of this unsequenced organism. This study will help to enhance the usability of this strain of B. linens in different areas of research in the future rather than mainly in the food industries. The present study describes the construction of the first detailed proteomic reference map of B. linens DSM 20158 with unsequenced genome by comparative proteome research analysis. This opens new horizons in proteomics to understand the role of proteins involved in the metabolism and physiology of other organisms with unsequenced genomes. Copyright © 2013 Elsevier B.V. All rights reserved.

  11. Use of DNA profiles for investigation using a simulated national DNA database: Part II. Statistical and ethical considerations on familial searching.

    Science.gov (United States)

    Hicks, T; Taroni, F; Curran, J; Buckleton, J; Castella, V; Ribaux, O

    2010-10-01

    Familial searching consists of searching for a full profile left at a crime scene in a National DNA Database (NDNAD). In this paper we are interested in the circumstance where no full match is returned, but a partial match is found between a database member's profile and the crime stain. Because close relatives share more of their DNA than unrelated persons, this partial match may indicate that the crime stain was left by a close relative of the person with whom the partial match was found. This approach has successfully solved important crimes in the UK and the USA. In a previous paper, a model, which takes into account substructure and siblings, was used to simulate a NDNAD. In this paper, we have used this model to test the usefulness of familial searching and offer guidelines for pre-assessment of the cases based on the likelihood ratio. Siblings of "persons" present in the simulated Swiss NDNAD were created. These profiles (N=10,000) were used as traces and were then compared to the whole database (N=100,000). The statistical results obtained show that the technique has great potential confirming the findings of previous studies. However, effectiveness of the technique is only one part of the story. Familial searching has juridical and ethical aspects that should not be ignored. In Switzerland for example, there are no specific guidelines to the legality or otherwise of familial searching. This article both presents statistical results, and addresses criminological and civil liberties aspects to take into account risks and benefits of familial searching. Copyright © 2009 Elsevier Ireland Ltd. All rights reserved.

  12. The Transcription Profile of Tax-3 Is More Similar to Tax-1 than Tax-2: Insights into HTLV-3 Potential Leukemogenic Properties

    Science.gov (United States)

    Chevalier, Sébastien A.; Durand, Stéphanie; Dasgupta, Arindam; Radonovich, Michael; Cimarelli, Andrea; Brady, John N.

    2012-01-01

    Human T-cell Lymphotropic Viruses type 1 (HTLV-1) is the etiological agent of Adult T-cell Leukemia/Lymphoma. Although associated with lymphocytosis, HTLV-2 infection is not associated with any malignant hematological disease. Similarly, no infection-related symptom has been detected in HTLV-3-infected individuals studied so far. Differences in individual Tax transcriptional activity might account for these distinct physiopathological outcomes. Tax-1 and Tax-3 possess a PDZ binding motif in their sequence. Interestingly, this motif, which is critical for Tax-1 transforming activity, is absent from Tax-2. We used the DNA microarray technology to analyze and compare the global gene expression profiles of different T- and non T-cell types expressing Tax-1, Tax-2 or Tax-3 viral transactivators. In a T-cell line, this analysis allowed us to identify 48 genes whose expression is commonly affected by all Tax proteins and are hence characteristic of the HTLV infection, independently of the virus type. Importantly, we also identified a subset of genes (n = 70) which are specifically up-regulated by Tax-1 and Tax-3, while Tax-1 and Tax-2 shared only 1 gene and Tax-2 and Tax-3 shared 8 genes. These results demonstrate that Tax-3 and Tax-1 are closely related in terms of cellular gene deregulation. Analysis of the molecular interactions existing between those Tax-1/Tax-3 deregulated genes then allowed us to highlight biological networks of genes characteristic of HTLV-1 and HTLV-3 infection. The majority of those up-regulated genes are functionally linked in biological processes characteristic of HTLV-1-infected T-cells expressing Tax such as regulation of transcription and apoptosis, activation of the NF-κB cascade, T-cell mediated immunity and induction of cell proliferation and differentiation. In conclusion, our results demonstrate for the first time that, in T- and non T-cells types, Tax-3 is a functional analogue of Tax-1 in terms of transcriptional activation and

  13. Personalized Search

    CERN Document Server

    AUTHOR|(SzGeCERN)749939

    2015-01-01

    As the volume of electronically available information grows, relevant items become harder to find. This work presents an approach to personalizing search results in scientific publication databases. This work focuses on re-ranking search results from existing search engines like Solr or ElasticSearch. This work also includes the development of Obelix, a new recommendation system used to re-rank search results. The project was proposed and performed at CERN, using the scientific publications available on the CERN Document Server (CDS). This work experiments with re-ranking using offline and online evaluation of users and documents in CDS. The experiments conclude that the personalized search result outperform both latest first and word similarity in terms of click position in the search result for global search in CDS.

  14. Familial searching: a specialist forensic DNA profiling service utilising the National DNA Database to identify unknown offenders via their relatives--the UK experience.

    Science.gov (United States)

    Maguire, C N; McCallum, L A; Storey, C; Whitaker, J P

    2014-01-01

    The National DNA Database (NDNAD) of England and Wales was established on April 10th 1995. The NDNAD is governed by a variety of legislative instruments that mean that DNA samples can be taken if an individual is arrested and detained in a police station. The biological samples and the DNA profiles derived from them can be used for purposes related to the prevention and detection of crime, the investigation of an offence and for the conduct of a prosecution. Following the South East Asian Tsunami of December 2004, the legislation was amended to allow the use of the NDNAD to assist in the identification of a deceased person or of a body part where death has occurred from natural causes or from a natural disaster. The UK NDNAD now contains the DNA profiles of approximately 6 million individuals representing 9.6% of the UK population. As the science of DNA profiling advanced, the National DNA Database provided a potential resource for increased intelligence beyond the direct matching for which it was originally created. The familial searching service offered to the police by several UK forensic science providers exploits the size and geographic coverage of the NDNAD and the fact that close relatives of an offender may share a significant proportion of that offender's DNA profile and will often reside in close geographic proximity to him or her. Between 2002 and 2011 Forensic Science Service Ltd. (FSS) provided familial search services to support 188 police investigations, 70 of which are still active cases. This technique, which may be used in serious crime cases or in 'cold case' reviews when there are few or no investigative leads, has led to the identification of 41 perpetrators or suspects. In this paper we discuss the processes, utility, and governance of the familial search service in which the NDNAD is searched for close genetic relatives of an offender who has left DNA evidence at a crime scene, but whose DNA profile is not represented within the NDNAD. We

  15. Adenoid cystic carcinomas of the salivary gland, lacrimal gland, and breast are morphologically and genetically similar but have distinct microRNA expression profiles

    DEFF Research Database (Denmark)

    Andreasen, Simon; Tan, Qihua; Agander, Tina Klitmøller

    2018-01-01

    Adenoid cystic carcinoma is among the most frequent malignancies in the salivary and lacrimal glands and has a grave prognosis characterized by frequent local recurrences, distant metastases, and tumor-related mortality. Conversely, adenoid cystic carcinoma of the breast is a rare type of triple......-negative (estrogen and progesterone receptor, HER2) and basal-like carcinoma, which in contrast to other triple-negative and basal-like breast carcinomas has a very favorable prognosis. Irrespective of site, adenoid cystic carcinoma is characterized by gene fusions involving MYB, MYBL1, and NFIB, and the reason...... for the different clinical outcomes is unknown. In order to identify the molecular mechanisms underlying the discrepancy in clinical outcome, we characterized the phenotypic profiles, pattern of gene rearrangements, and global microRNA expression profiles of 64 salivary gland, 9 lacrimal gland, and 11 breast...

  16. Echinococcus multilocularis in Kyrgyzstan: similarity in the Asian EmsB genotypic profiles from village populations of Eastern mole voles (Ellobius tancrei) and dogs in the Alay valley.

    Science.gov (United States)

    Afonso, E; Knapp, J; Tête, N; Umhang, G; Rieffel, D; van Kesteren, F; Ziadinov, I; Craig, P S; Torgerson, P R; Giraudoux, P

    2015-11-01

    Echinococcus multilocularis is a cestode that causes human alveolar echinococcosis, a lethal zoonosis of public health concern in central Asia and western China. In the present study, one of 42 Eastern mole voles (Ellobius tancrei) caught in Sary Mogol (Alay valley, southern Kyrgyzstan) presented liver lesions with E. multilocularis from which the EmsB target was amplified. The Asian profile obtained was almost identical to one amplified from domestic dog faeces collected in a nearby village. This observation adds additional information to the potential role of E. tancrei in the transmission of E. multilocularis, and to the known distribution range of E. multilocularis (Asian strain) in central Asia.

  17. High similarity of Trypanosoma cruzi kDNA genetic profiles detected by LSSP-PCR within family groups in an endemic area of Chagas disease in Brazil

    Directory of Open Access Journals (Sweden)

    Sandra Maria Alkmim-Oliveira

    2014-10-01

    Full Text Available Introduction Determining the genetic similarities among Trypanosoma cruzi populations isolated from different hosts and vectors is very important to clarify the epidemiology of Chagas disease. Methods An epidemiological study was conducted in a Brazilian endemic area for Chagas disease, including 76 chronic chagasic individuals (96.1% with an indeterminate form; 46.1% with positive hemoculture. Results T. cruzi I (TcI was isolated from one child and TcII was found in the remaining (97.1% subjects. Low-stringency single-specific-primer-polymerase chain reaction (LSSP-PCR showed high heterogeneity among TcII populations (46% of shared bands; however, high similarities (80-100% among pairs of mothers/children, siblings, or cousins were detected. Conclusions LSSP-PCR showed potential for identifying similar parasite populations among individuals with close kinship in epidemiological studies of Chagas disease.

  18. Contig sequences and their annotation (amino acid sequence and results of homology search), and expression profile - Dicty_cDB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Dicty_cDB Contig sequences and their annotation (amino acid sequence and results of homology... search), and expression profile Data detail Data name Contig sequences and their annotation (amino acid seq...f data contents Contig sequences of cDNA sequences of Dictyostelium discoideum and the...EST-3'EST-ligated sequence and full-length cDNA sequence by the assembly program ...Phrap ( http://www.phrap.org/index.html ). Link to the list of clones constituting the contig, the information on its mapping to the

  19. Comparison of Dissolution Similarity Assessment Methods for Products with Large Variations: f2 Statistics and Model-Independent Multivariate Confidence Region Procedure for Dissolution Profiles of Multiple Oral Products.

    Science.gov (United States)

    Yoshida, Hiroyuki; Shibata, Hiroko; Izutsu, Ken-Ichi; Goda, Yukihiro

    2017-01-01

    The current Japanese Ministry of Health Labour and Welfare (MHLW)'s Guideline for Bioequivalence Studies of Generic Products uses averaged dissolution rates for the assessment of dissolution similarity between test and reference formulations. This study clarifies how the application of model-independent multivariate confidence region procedure (Method B), described in the European Medical Agency and U.S. Food and Drug Administration guidelines, affects similarity outcomes obtained empirically from dissolution profiles with large variations in individual dissolution rates. Sixty-one datasets of dissolution profiles for immediate release, oral generic, and corresponding innovator products that showed large variation in individual dissolution rates in generic products were assessed on their similarity by using the f2 statistics defined in the MHLW guidelines (MHLW f2 method) and two different Method B procedures, including a bootstrap method applied with f2 statistics (BS method) and a multivariate analysis method using the Mahalanobis distance (MV method). The MHLW f2 and BS methods provided similar dissolution similarities between reference and generic products. Although a small difference in the similarity assessment may be due to the decrease in the lower confidence interval for expected f2 values derived from the large variation in individual dissolution rates, the MV method provided results different from those obtained through MHLW f2 and BS methods. Analysis of actual dissolution data for products with large individual variations would provide valuable information towards an enhanced understanding of these methods and their possible incorporation in the MHLW guidelines.

  20. Livestock associated epidemiological information profiling in New Sandwip Island (Jahajerchar of the Meghna estuary, Noakhali using participatory disease searching tool

    Directory of Open Access Journals (Sweden)

    SK Shaheenur Islam

    2017-09-01

    Conclusion: This place is potential for sheep and buffalo raising rather than cattle. The study has validated the significance of accepting participatory disease searching tool in order to capture voluntarily submitted epidemiological data towards establishing a cost effective, unique national disease surveillance system in Bangladesh. [J Adv Vet Anim Res 2017; 4(3.000: 267-273

  1. Proteomic analysis of cellular soluble proteins from human bronchial smooth muscle cells by combining nondenaturing micro 2DE and quantitative LC-MS/MS. 2. Similarity search between protein maps for the analysis of protein complexes.

    Science.gov (United States)

    Jin, Ya; Yuan, Qi; Zhang, Jun; Manabe, Takashi; Tan, Wen

    2015-09-01

    Human bronchial smooth muscle cell soluble proteins were analyzed by a combined method of nondenaturing micro 2DE, grid gel-cutting, and quantitative LC-MS/MS and a native protein map was prepared for each of the identified 4323 proteins [1]. A method to evaluate the degree of similarity between the protein maps was developed since we expected the proteins comprising a protein complex would be separated together under nondenaturing conditions. The following procedure was employed using Excel macros; (i) maps that have three or more squares with protein quantity data were selected (2328 maps), (ii) within each map, the quantity values of the squares were normalized setting the highest value to be 1.0, (iii) in comparing a map with another map, the smaller normalized quantity in two corresponding squares was taken and summed throughout the map to give an "overlap score," (iv) each map was compared against all the 2328 maps and the largest overlap score, obtained when a map was compared with itself, was set to be 1.0 thus providing 2328 "overlap factors," (v) step (iv) was repeated for all maps providing 2328 × 2328 matrix of overlap factors. From the matrix, protein pairs that showed overlap factors above 0.65 from both protein sides were selected (431 protein pairs). Each protein pair was searched in a database (UniProtKB) on complex formation and 301 protein pairs, which comprise 35 protein complexes, were found to be documented. These results demonstrated that native protein maps and their similarity search would enable simultaneous analysis of multiple protein complexes in cells. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  2. Proteomic analysis of cellular soluble proteins from human bronchial smooth muscle cells by combining nondenaturing micro 2DE and quantitative LC‐MS/MS. 2. Similarity search between protein maps for the analysis of protein complexes

    Science.gov (United States)

    Jin, Ya; Yuan, Qi; Zhang, Jun; Manabe, Takashi

    2015-01-01

    Human bronchial smooth muscle cell soluble proteins were analyzed by a combined method of nondenaturing micro 2DE, grid gel‐cutting, and quantitative LC‐MS/MS and a native protein map was prepared for each of the identified 4323 proteins [1]. A method to evaluate the degree of similarity between the protein maps was developed since we expected the proteins comprising a protein complex would be separated together under nondenaturing conditions. The following procedure was employed using Excel macros; (i) maps that have three or more squares with protein quantity data were selected (2328 maps), (ii) within each map, the quantity values of the squares were normalized setting the highest value to be 1.0, (iii) in comparing a map with another map, the smaller normalized quantity in two corresponding squares was taken and summed throughout the map to give an “overlap score,” (iv) each map was compared against all the 2328 maps and the largest overlap score, obtained when a map was compared with itself, was set to be 1.0 thus providing 2328 “overlap factors,” (v) step (iv) was repeated for all maps providing 2328 × 2328 matrix of overlap factors. From the matrix, protein pairs that showed overlap factors above 0.65 from both protein sides were selected (431 protein pairs). Each protein pair was searched in a database (UniProtKB) on complex formation and 301 protein pairs, which comprise 35 protein complexes, were found to be documented. These results demonstrated that native protein maps and their similarity search would enable simultaneous analysis of multiple protein complexes in cells. PMID:26031785

  3. Multicomponent LBSap vaccine displays immunological and parasitological profiles similar to those of Leish-Tec® and Leishmune® vaccines against visceral leishmaniasis.

    Science.gov (United States)

    de Mendonça, Ludmila Zanandreis; Resende, Lucilene Aparecida; Lanna, Mariana Ferreira; Aguiar-Soares, Rodrigo Dian de Oliveira; Roatt, Bruno Mendes; Castro, Renata Alves de Oliveira E; Batista, Maurício Azevedo; Silveira-Lemos, Denise; Gomes, Juliana de Assis Silva; Fujiwara, Ricardo Toshio; Rezende, Simone Aparecida; Martins-Filho, Olindo Assis; Corrêa-Oliveira, Rodrigo; Dutra, Walderez Ornelas; Reis, Alexandre Barbosa; Giunchetti, Rodolfo Cordeiro

    2016-08-30

    In past years, many researchers have sought canine visceral leishmaniasis (CVL) prevention through the characterization of Leishmania antigens as vaccine candidates. Despite these efforts, there is still no efficient vaccine for CVL control. In the present study, we performed a pre-clinical vaccine trial using BALB/c mice to compare the effects of the multicomponent LBSap vaccine with those of Leish-Tec® and Leishmune®. Blood was collected to determine the frequency of peripheral blood cells and to evaluate hematologic and immunophenotypic parameters. Liver and spleen samples were collected for parasitological quantification, and spleen samples were used to access the cytokine profile. When measuring total IgG and IgG1 anti-Leishmania levels after the third vaccination and L. infantum challenge, it was evident that all vaccines were able to induce humoral immune response. Regarding the innate immune response, increased levels of NK CD3(-)CD49(+) cells were the hallmark of all vaccinated groups, whereas only the Leish-Tec® group displayed a high frequency of CD14(+) monocytes after L. infantum challenge. Moreover, CD3(+)CD4(+) T cells were the main circulating lymphocytes induced after L. infantum challenge with all evaluated vaccines. Importantly, after L. infantum challenge, splenocytes from the Leishmune® vaccine produced high levels of IL-2, whereas a prominent type 1 immune response was the hallmark of the LBSap vaccine, which presented high levels of IL-2, IL-6, TNF-α, and IFN-γ. The efficacy analysis using real-time polymerase chain reaction demonstrated a reduction in the parasitism in the spleen (Leishmune®: 64 %; LBSap: 42 %; and Leish-Tec®: 36 %) and liver (Leishmune®: 71 %; LBSap: 62 %; and Leish-Tec®: 48 %). The dataset led to the conclusion that the LBSap vaccination was able to induce immune and efficacy profiles comparable with those of commercial vaccines, thus demonstrating its potential as a promising vaccine candidate for visceral

  4. A search for thermal excursions from ancient extraterrestrial impacts using Hadean zircon Ti-U-Th-Pb depth profiles.

    Science.gov (United States)

    Abbott, Sunshine S; Harrison, T Mark; Schmitt, Axel K; Mojzsis, Stephen J

    2012-08-21

    Few terrestrial localities preserve more than a trace lithic record prior to ca. 3.8 Ga greatly limiting our understanding of the first 700 Ma of Earth history, a period inferred to have included a spike in the bolide flux to the inner solar system at ca. 3.85-3.95 Ga (the Late Heavy Bombardment, LHB). An accessible record of this era may be found in Hadean detrital zircons from the Jack Hills, Western Australia, in the form of μm-scale epitaxial overgrowths. By comparing crystallization temperatures of pre-3.8 Ga zircon overgrowths to the archive of zircon temperature spectra, it should, in principle, be possible to identify a distinctive impact signature. We have developed Ti-U-Th-Pb ion microprobe depth profiling to obtain age and temperature information within these zircon overgrowths and undertaken a feasibility study of its possible use in identifying impact events. Of eight grains profiled in this fashion, four have overgrowths of LHB-era age. Age vs. temperature profiles reveal a period between ca. 3.85-3.95 Ga (i.e., LHB era) characterized by significantly higher temperatures (approximately 840-875 °C) than do older or younger zircons or zircon domains (approximately 630-750 °C). However, temperatures approaching 900 °C can result in Pb isotopic exchange rendering interpretation of these profiles nonunique. Coupled age-temperature depth profiling shows promise in this role, and the preliminary data we report could represent the first terrestrial evidence for impact-related heating during the LHB.

  5. Similarities and Improvements of GPM Dual-Frequency Precipitation Radar (DPR upon TRMM Precipitation Radar (PR in Global Precipitation Rate Estimation, Type Classification and Vertical Profiling

    Directory of Open Access Journals (Sweden)

    Jinyu Gao

    2017-11-01

    Full Text Available Spaceborne precipitation radars are powerful tools used to acquire adequate and high-quality precipitation estimates with high spatial resolution for a variety of applications in hydrological research. The Global Precipitation Measurement (GPM mission, which deployed the first spaceborne Ka- and Ku-dual frequency radar (DPR, was launched in February 2014 as the upgraded successor of the Tropical Rainfall Measuring Mission (TRMM. This study matches the swath data of TRMM PR and GPM DPR Level 2 products during their overlapping periods at the global scale to investigate their similarities and DPR’s improvements concerning precipitation amount estimation and type classification of GPM DPR over TRMM PR. Results show that PR and DPR agree very well with each other in the global distribution of precipitation, while DPR improves the detectability of precipitation events significantly, particularly for light precipitation. The occurrences of total precipitation and the light precipitation (rain rates < 1 mm/h detected by GPM DPR are ~1.7 and ~2.53 times more than that of PR. With regard to type classification, the dual-frequency (Ka/Ku and single frequency (Ku methods performed similarly. In both inner (the central 25 beams and outer swaths (1–12 beams and 38–49 beams of DPR, the results are consistent. GPM DPR improves precipitation type classification remarkably, reducing the misclassification of clouds and noise signals as precipitation type “other” from 10.14% of TRMM PR to 0.5%. Generally, GPM DPR exhibits the same type division for around 82.89% (71.02% of stratiform (convective precipitation events recognized by TRMM PR. With regard to the freezing level height and bright band (BB height, both radars correspond with each other very well, contributing to the consistency in stratiform precipitation classification. Both heights show clear latitudinal dependence. Results in this study shall contribute to future development of spaceborne

  6. Cancer incidence profile in sub-Saharan African-born blacks in the United States: Similarities and differences with US-born non-Hispanic blacks.

    Science.gov (United States)

    Medhanie, Genet A; Fedewa, Stacey A; Adissu, Hibret; DeSantis, Carol E; Siegel, Rebecca L; Jemal, Ahmedin

    2017-08-15

    Sub-Saharan African-born blacks (ABs) are one of the fastest-growing populations in the United States. However, to the authors' knowledge, data regarding the cancer burden in this group are lacking, which would inform targeted cancer prevention and control. The authors calculated age-standardized proportional incidence ratios (PIRs) comparing the frequency of the top 15 cancers in ABs with that of US-born non-Hispanic blacks (USBs) by sex and region of birth using incidence data for 2000 through 2012 from the Surveillance, Epidemiology, and End Results (SEER 17) program. Compared with USBs, ABs had significantly higher PIRs of infection-related cancers (liver, stomach, and Kaposi sarcoma), blood cancers (leukemia and non-Hodgkin lymphoma), prostate cancer, and thyroid cancers (females only). For example, the PIR for Kaposi sarcoma in AB versus USB women was 12.06 (95% confidence interval [95% CI], 5.23-18.90). In contrast, ABs had lower PIRs for smoking-related and colorectal cancers (eg, for lung cancer among men, the PIR was 0.30 [95% CI, 0.27-0.34]). Furthermore, cancer occurrence in ABs versus USBs varied by region of birth. For example, the higher PIRs for liver cancer noted among male ABs (PIR, 3.57; 95% CI, 1.79-5.35) and for thyroid cancer in female ABs (PIR, 3.03; 95% CI, 2.03-4.02) were confined to Eastern African-born blacks, whereas the higher PIR for prostate cancer (PIR, 1.90; 95% CI, 1.78, 2.02) was confined to Western African-born blacks. The cancer incidence profile of ABs is different from that of USBs and varies by region of birth, suggesting differences in environmental, cultural, social, and genetic factors. The findings of the current study could stimulate etiologic research and help to inform targeted interventions. Cancer 2017;123:3116-24. © 2017 American Cancer Society. © 2017 American Cancer Society.

  7. Adeno-associated virus (AAV) serotypes 2, 4 and 5 display similar transduction profiles and penetrate solid tumor tissue in models of human glioma.

    Science.gov (United States)

    Thorsen, Frits; Afione, Sandra; Huszthy, Peter C; Tysnes, Berit B; Svendsen, Agnete; Bjerkvig, Rolf; Kotin, Robert M; Lønning, Per Eystein; Hoover, Frank

    2006-09-01

    Adeno-associated viral (AAV) vectors are potent delivery vehicles for gene transfer strategies directed at the central nervous system (CNS), muscle and liver. However, comparatively few studies have described AAV-mediated gene transfer to tumor tissues. We have previously demonstrated that while AAV2 and Adenoviral (Ad) 5 vectors have similar broad host ranges in tumor-derived cell lines, AAV2 was able to penetrate human glioblastoma biopsy spheroids and xenografts more efficiently than Ad 5 vectors. These results suggested that AAV vectors could be suitable for therapeutic gene delivery to solid tumor tissue. In the present work, the transduction efficacy of AAV serotypes 4 and 5 were compared to AAV2, both in vitro and in intracranial GBM xenografts derived from patient biopsies implanted into nude rats. AAV vector serotypes 2, 4, and 5 containing either the green fluorescent protein (GFP) or the bacterial beta-galactosidase (lacZ) reporter gene were added to five different human glioma cell lines, to multicellular spheroids generated from glioblastoma patient biopsies, and to spheroids xenografted intracranially in nude rats. Transduction efficiency was assessed by fluorescence imaging, histochemistry, immunohistochemistry and flow cytometry. While all three AAV serotypes were able to transduce the glioma cell lines when added individually or when they were administered in concert, AAV2 transduced the glioma cells most effectively compared to AAV4 or AAV5. Upon infecting glioblastoma spheroids in vitro, all three AAV serotypes efficiently transduced cells located at the surface as well as within deeper layers of the spheroids. In addition, similarly to what was observed for AAV2 16, both AAV4 and AAV5 were able to transduce human glioblastoma xenografts implanted intracranially. In addition to the widely used AAV2 serotype, AAV4 and AAV5 serotypes may also be used to transduce biologically diverse glioma cell lines. They also penetrate and transduce solid human

  8. GEMINI: a computationally-efficient search engine for large gene expression datasets.

    Science.gov (United States)

    DeFreitas, Timothy; Saddiki, Hachem; Flaherty, Patrick

    2016-02-24

    Low-cost DNA sequencing allows organizations to accumulate massive amounts of genomic data and use that data to answer a diverse range of research questions. Presently, users must search for relevant genomic data using a keyword, accession number of meta-data tag. However, in this search paradigm the form of the query - a text-based string - is mismatched with the form of the target - a genomic profile. To improve access to massive genomic data resources, we have developed a fast search engine, GEMINI, that uses a genomic profile as a query to search for similar genomic profiles. GEMINI implements a nearest-neighbor search algorithm using a vantage-point tree to store a database of n profiles and in certain circumstances achieves an [Formula: see text] expected query time in the limit. We tested GEMINI on breast and ovarian cancer gene expression data from The Cancer Genome Atlas project and show that it achieves a query time that scales as the logarithm of the number of records in practice on genomic data. In a database with 10(5) samples, GEMINI identifies the nearest neighbor in 0.05 sec compared to a brute force search time of 0.6 sec. GEMINI is a fast search engine that uses a query genomic profile to search for similar profiles in a very large genomic database. It enables users to identify similar profiles independent of sample label, data origin or other meta-data information.

  9. Allospecific CD8 T suppressor cells induced by multiple MLC stimulation or priming in the presence of ILT3.Fc have similar gene expression profiles.

    Science.gov (United States)

    Chen, Ling; Xu, Zheng; Chang, Chris; Ho, Sophey; Liu, Zhuoru; Vlad, George; Cortesini, Raffaello; Clynes, Raphael A; Luo, Yun; Suciu-Foca, Nicole

    2014-02-01

    Alloantigen specific CD8 T suppressor cells can be generated in vitro either by multiple stimulations of CD3 T cells with allogeneic APC or by single stimulation in primary MLC containing recombinant ILT3.Fc protein. The aim of the present study was to determine whether multiple MLC stimulation induced in CD8(+) CD28(-) T suppressor cells molecular changes that are similar to those observed in CD8 T suppressor cells from primary MLC containing ILT3.Fc protein. Our study demonstrates that the characteristic signatures of CD8 T suppressor cells, generated by either of these methods are the same consisting of up-regulation of the BCL6 transcriptional repressor and down-regulation of inflammatory microRNAs, miR-21, miR-30b, miR-146a, and miR-155 expression. In conclusion microRNAs which are increased under inflammatory conditions in activated CD4 and CD8 T cells with helper or cytotoxic function show low levels of expression in CD8 T cells which have acquired antigen-specific suppressor activity. Copyright © 2014. Published by Elsevier Inc.

  10. BRCA1-mutated and basal-like breast cancers have similar aCGH profiles and a high incidence of protein truncating TP53 mutations

    Directory of Open Access Journals (Sweden)

    van de Vijver Marc J

    2010-11-01

    Full Text Available Abstract Background Basal-like breast cancers (BLBC are aggressive breast cancers for which, so far, no targeted therapy is available because they typically lack expression of hormone receptors and HER2. Phenotypic features of BLBCs, such as clinical presentation and early age of onset, resemble those of breast tumors from BRCA1-mutation carriers. The genomic instability of BRCA1-mutated tumors can be effectively targeted with DNA-damaging agents and poly-(ADP-ribose polymerase 1 (PARP1 inhibitors. Molecular similarities between BLBCs and BRCA1-mutated tumors may therefore provide predictive markers for therapeutic response of BLBCs. Methods There are several known molecular features characteristic for BRCA1-mutated breast tumors: 1 increased numbers of genomic aberrations, 2 a distinct pattern of genomic aberrations, 3 a high frequency of TP53 mutations and 4 a high incidence of complex, protein-truncating TP53 mutations. We compared the frequency of TP53 mutations and the pattern and amount of genomic aberrations between BRCA1-mutated breast tumors, BLBCs and luminal breast tumors by TP53 gene sequencing and array-based comparative genomics hybridization (aCGH analysis. Results We found that the high incidence of protein truncating TP53 mutations and the pattern and amount of genomic aberrations specific for BRCA1-mutated breast tumors are also characteristic for BLBCs and different from luminal breast tumors. Conclusions Complex, protein truncating TP53 mutations in BRCA1-mutated tumors may be a direct consequence of genomic instability caused by BRCA1 loss, therefore, the presence of these types of TP53 mutations in sporadic BLBCs might be a hallmark of BRCAness and a potential biomarker for sensitivity to PARP inhibition. Also, our data suggest that a small subset of genomic regions may be used to identify BRCA1-like BLBCs. BLBCs share molecular features that were previously found to be specific for BRCA1-mutated breast tumors. These

  11. BRCA1-mutated and basal-like breast cancers have similar aCGH profiles and a high incidence of protein truncating TP53 mutations.

    Science.gov (United States)

    Holstege, Henne; Horlings, Hugo M; Velds, Arno; Langerød, Anita; Børresen-Dale, Anne-Lise; van de Vijver, Marc J; Nederlof, Petra M; Jonkers, Jos

    2010-11-30

    Basal-like breast cancers (BLBC) are aggressive breast cancers for which, so far, no targeted therapy is available because they typically lack expression of hormone receptors and HER2. Phenotypic features of BLBCs, such as clinical presentation and early age of onset, resemble those of breast tumors from BRCA1-mutation carriers. The genomic instability of BRCA1-mutated tumors can be effectively targeted with DNA-damaging agents and poly-(ADP-ribose) polymerase 1 (PARP1) inhibitors. Molecular similarities between BLBCs and BRCA1-mutated tumors may therefore provide predictive markers for therapeutic response of BLBCs. There are several known molecular features characteristic for BRCA1-mutated breast tumors: 1) increased numbers of genomic aberrations, 2) a distinct pattern of genomic aberrations, 3) a high frequency of TP53 mutations and 4) a high incidence of complex, protein-truncating TP53 mutations. We compared the frequency of TP53 mutations and the pattern and amount of genomic aberrations between BRCA1-mutated breast tumors, BLBCs and luminal breast tumors by TP53 gene sequencing and array-based comparative genomics hybridization (aCGH) analysis. We found that the high incidence of protein truncating TP53 mutations and the pattern and amount of genomic aberrations specific for BRCA1-mutated breast tumors are also characteristic for BLBCs and different from luminal breast tumors. Complex, protein truncating TP53 mutations in BRCA1-mutated tumors may be a direct consequence of genomic instability caused by BRCA1 loss, therefore, the presence of these types of TP53 mutations in sporadic BLBCs might be a hallmark of BRCAness and a potential biomarker for sensitivity to PARP inhibition. Also, our data suggest that a small subset of genomic regions may be used to identify BRCA1-like BLBCs. BLBCs share molecular features that were previously found to be specific for BRCA1-mutated breast tumors. These features might be useful for the identification of tumors with

  12. Keep Searching and You’ll Find

    DEFF Research Database (Denmark)

    Laursen, Keld

    2012-01-01

    This article critically reviews and synthesizes the contributions found in theoretical and empirical studies of firm-level innovation search processes. It explores the advantages and disadvantages of local and non-local search, discusses organizational responses, and identifies potential exogenous...... different search strategies, but end up with very similar technological profiles in fast-growing technologies. The article concludes by highlighting what we have learnt from the literature and suggesting some new avenues for research....... triggers for different kinds of search. It argues that the initial focus on local search was a consequence, in part, of the attention in evolutionary economics to path-dependent behavior, but that as localized behavior was increasingly accepted as the standard mode, studies began to question whether local...

  13. Effects of dietary palmitoleic acid on plasma lipoprotein profile and aortic cholesterol accumulation are similar to those of other unsaturated fatty acids in the F1B golden Syrian hamster.

    Science.gov (United States)

    Matthan, Nirupa R; Dillard, Alice; Lecker, Jaime L; Ip, Blanche; Lichtenstein, Alice H

    2009-02-01

    The lower susceptibility of palmitoleic acid (16:1) to oxidation compared to PUFA may confer functional advantages with respect to finding acceptable alternatives to partially hydrogenated fats, but limited data are available on its effect on cardiovascular risk factors. This study investigated the effect of diets (10% fat, 0.1% cholesterol, wt:wt) enriched with macadamia [monounsaturated fatty acid (MUFA)16:1], palm (SFA,16:0), canola (MUFA,18:1), or safflower (PUFA,18:2) oils on lipoprotein profiles and aortic cholesterol accumulation in F1B Golden Syrian hamsters (n = 16/group). After 12 wk, 8 hamsters in each group were killed (phase 1). The remaining hamsters fed palm oil were changed to a diet containing coconut oil, while hamsters in the other diet groups continued on their original diets for an additional 6 wk (phase 2). With minor exceptions, the time course and dietary SFA source did not alter the study outcomes. Macadamia oil-fed hamsters had lower non-HDL cholesterol and triglyceride concentrations compared with the palm and coconut oil-fed hamsters and higher HDL-cholesterol compared with the coconut, canola, and safflower oil-fed hamsters. The aortic cholesterol concentration was not affected by dietary fat type. The hepatic cholesterol concentration was higher in the unsaturated compared with the saturated oil-fed hamsters. RBC membrane and aortic cholesteryl ester, triglyceride, and phospholipid fatty acid profiles reflected that of the dietary oil. These data suggest that an oil relatively high in palmitoleic acid does not adversely affect plasma lipoprotein profiles or aortic cholesterol accumulation and was similar to other unsaturated fatty acid-rich oils.

  14. Effects of Dietary Palmitoleic Acid on Plasma Lipoprotein Profile and Aortic Cholesterol Accumulation Are Similar to Those of Other Unsaturated Fatty Acids in the F1B Golden Syrian Hamster 1–3

    Science.gov (United States)

    Matthan, Nirupa R.; Dillard, Alice; Lecker, Jaime L.; Ip, Blanche; Lichtenstein, Alice H.

    2008-01-01

    The lower susceptibility of palmitoleic acid (16:1) to oxidation compared to PUFA may confer functional advantages with respect to finding acceptable alternatives to partially hydrogenated fats, but limited data are available on its effect on cardiovascular risk factors. This study investigated the effect of diets (10% fat, 0.1% cholesterol, wt:wt) enriched with macadamia [monounsaturated fatty acid (MUFA)16:1], palm (SFA,16:0), canola (MUFA,18:1), or safflower (PUFA,18:2) oils on lipoprotein profiles and aortic cholesterol accumulation in F1B Golden Syrian hamsters (n = 16/group). After 12 wk, 8 hamsters in each group were killed (phase 1). The remaining hamsters fed palm oil were changed to a diet containing coconut oil, while hamsters in the other diet groups continued on their original diets for an additional 6 wk (phase 2). With minor exceptions, the time course and dietary SFA source did not alter the study outcomes. Macadamia oil-fed hamsters had lower non-HDL cholesterol and triglyceride concentrations compared with the palm and coconut oil-fed hamsters and higher HDL-cholesterol compared with the coconut, canola, and safflower oil-fed hamsters. The aortic cholesterol concentration was not affected by dietary fat type. The hepatic cholesterol concentration was higher in the unsaturated compared with the saturated oil-fed hamsters. RBC membrane and aortic cholesteryl ester, triglyceride, and phospholipid fatty acid profiles reflected that of the dietary oil. These data suggest that an oil relatively high in palmitoleic acid does not adversely affect plasma lipoprotein profiles or aortic cholesterol accumulation and was similar to other unsaturated fatty acid-rich oils. PMID:19106316

  15. Rapid profiling of polymeric phenolic acids in Salvia miltiorrhiza by hybrid data-dependent/targeted multistage mass spectrometry acquisition based on expected compounds prediction and fragment ion searching.

    Science.gov (United States)

    Shen, Yao; Feng, Zijin; Yang, Min; Zhou, Zhe; Han, Sumei; Hou, Jinjun; Li, Zhenwei; Wu, Wanying; Guo, De-An

    2018-01-15

    Phenolic acids are the major water-soluble components in Salvia miltiorrhiza (> 5%). According to previous studies, many of them contribute to the cardiovascular effects and antioxidant effects of S. miltiorrhiza. Polymeric phenolic acids can be considered as the tanshinol derived metabolites, e.g., dimmers, trimers and tetramers. A strategy combined with tanshinol-based expected compounds prediction, total ion chromatogram filtering, Fragment Ion Searching and parent list-based multistage mass spectrometry acquisition by Linear Trap Quadropole-orbitrap Velos Mass Spectrometry was proposed to rapid profile polymeric phenolic acids in S. miltiorrhiza. More than 480 potential polymeric phenolic acids could be screened out by this strategy. Based on the fragment information obtained by parent list-activated data dependent multistage mass spectrometry acquisition, 190 polymeric phenolic acids were characterized by comparing their mass information with literature data, and 18 of them were firstly detected from S. miltiorrhiza. Seven potential compounds were tentatively characterized as new polymeric phenolic acids from S. miltiorrhiza. This strategy facilitates identification of polymeric phenolic acids in complex matrix with both selectivity and sensitivity, which could be expanded for rapid discovery and identification of compounds from complex matrix. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  16. Exploration of polar lipid accumulation profiles in Euglena gracilis using LipidBlast, an MS/MS spectral library constructed in silico.

    Science.gov (United States)

    Ogawa, Takumi; Furuhashi, Takeshi; Okazawa, Atsushi; Nakai, Rai; Nakazawa, Masami; Kind, Tobias; Fiehn, Oliver; Kanaya, Shigehiko; Arita, Masanori; Ohta, Daisaku

    2014-01-01

    A rapid protocol for polar lipid profiling was applied to Euglena gracilis lipid metabolism by LipidBlast, an MS/MS spectral similarity search tool. The similarity search results suggested anoxia-induced polar lipid metabolism in Euglena characterized by the accumulation of differential lipid classes, carbon chain lengths, and unsaturated bond numbers. The informatics-supported MS spectral search provides an alternative option for global lipid profiling studies.

  17. Custom Search Engines: Tools & Tips

    Science.gov (United States)

    Notess, Greg R.

    2008-01-01

    Few have the resources to build a Google or Yahoo! from scratch. Yet anyone can build a search engine based on a subset of the large search engines' databases. Use Google Custom Search Engine or Yahoo! Search Builder or any of the other similar programs to create a vertical search engine targeting sites of interest to users. The basic steps to…

  18. Species spectrum, diversity profile and infection indices of helminth parasite fauna of Chirruh snowtrout, Schizothorax esocinus (Heckel) in lake ecosystems of Kashmir Himalayas-Do similarity and host-parasite associations arise?

    Science.gov (United States)

    Zargar, U R; Chishti, M Z; Yousuf, A R; Ahmad, Fayaz

    2013-09-01

    In order to assess the species richness and diversity profile of helminth parasite fauna in an endemic fish, an investigation was carried out in two urban and two rural lakes of Kashmir. Overall nine species of helminth parasites were observed in four lakes. Of these three were autogenic and six were allogenic. Heteroxenous parasite species were more in number than monoxenous species. Results showed significant differences in heteroxenous / monoxenous ratio between different lakes. Core species (Prevalence > 20) were only found in hypertrophic lake (Anchar Lake). Overall, majority of helminth species were either secondary or satellite species. Prevalence of some helminth parasites showed significant differences in different lakes. In addition mean intensity showed significant differences between autogenic and allogenic parasites (P Diversity indices showed significant variation between different lakes. Maximum helminth species per host was in Anchar Lake. Finally we concluded that helminth parasite fauna showed significant differences in species richness and infection indices between different lakes. Diversity profile was higher in Anchar Lake in comparison to other three lakes. The results clearly show that environmental features of lake ecosystems have got an impact on distribution pattern of helminth parasites in S. esocinus. We suggest comparative parasitological study should be taken between different species of fish in order to have a clear picture regarding the species composition of helminth species in this region. Also we need to characterize the species spectrum of parasitic worms in fish of freshwater bodies of this region as well as other similar type of climatic zones because parasite fauna is an integral part of the inventory of biodiversity and as possible regulators of host populations in aquatic ecosystems.

  19. Accelerating Information Retrieval from Profile Hidden Markov Model Databases.

    Directory of Open Access Journals (Sweden)

    Ahmad Tamimi

    Full Text Available Profile Hidden Markov Model (Profile-HMM is an efficient statistical approach to represent protein families. Currently, several databases maintain valuable protein sequence information as profile-HMMs. There is an increasing interest to improve the efficiency of searching Profile-HMM databases to detect sequence-profile or profile-profile homology. However, most efforts to enhance searching efficiency have been focusing on improving the alignment algorithms. Although the performance of these algorithms is fairly acceptable, the growing size of these databases, as well as the increasing demand for using batch query searching approach, are strong motivations that call for further enhancement of information retrieval from profile-HMM databases. This work presents a heuristic method to accelerate the current profile-HMM homology searching approaches. The method works by cluster-based remodeling of the database to reduce the search space, rather than focusing on the alignment algorithms. Using different clustering techniques, 4284 TIGRFAMs profiles were clustered based on their similarities. A representative for each cluster was assigned. To enhance sensitivity, we proposed an extended step that allows overlapping among clusters. A validation benchmark of 6000 randomly selected protein sequences was used to query the clustered profiles. To evaluate the efficiency of our approach, speed and recall values were measured and compared with the sequential search approach. Using hierarchical, k-means, and connected component clustering techniques followed by the extended overlapping step, we obtained an average reduction in time of 41%, and an average recall of 96%. Our results demonstrate that representation of profile-HMMs using a clustering-based approach can significantly accelerate data retrieval from profile-HMM databases.

  20. Ellenore Flood's Kuder Occupational Interest Survey and Career Search Schedule.

    Science.gov (United States)

    Zytowski, Donald G.

    1998-01-01

    The Kuder Occupational Interest Survey (KOIS) and Kuder Career Search Survey (KCSS) are interpreted for a 29-year-old female client in the form of a letter addressed to her. Technique, the author's reasoning, and suggestions for interventions are discussed. Includes job profiles of people whose scores were similar to the client's. (MKA)

  1. SETI The Search for Extraterrestrial Intelligence.

    Science.gov (United States)

    Jones, Barrie W.

    1991-01-01

    Discussed is the search for life on other planets similar to Earth based on the Drake equation. Described are search strategies and microwave searches. The reasons why people are searching are also discussed. (KR)

  2. Similarity of TIMSS Math and Science Achievement of Nations

    Directory of Open Access Journals (Sweden)

    Algirdas Zabulionis

    2001-09-01

    Full Text Available In 1991-97, the International Association for the Evaluation of Educational Achievement (IEA undertook a Third International Mathematics and Science Study (TIMSS in which data about the mathematics and science achievement of the thirteen year-old students in more than 40 countries were collected. These data provided the opportunity to search for patterns of students' answers to the test items: which group of items was relatively more difficult (or more easy for the students from a particular country (or group of countries. Using this massive data set an attempt was made to measure the similarities among country profiles of how students responded to the test items.

  3. Domain similarity based orthology detection.

    Science.gov (United States)

    Bitard-Feildel, Tristan; Kemena, Carsten; Greenwood, Jenny M; Bornberg-Bauer, Erich

    2015-05-13

    Orthologous protein detection software mostly uses pairwise comparisons of amino-acid sequences to assert whether two proteins are orthologous or not. Accordingly, when the number of sequences for comparison increases, the number of comparisons to compute grows in a quadratic order. A current challenge of bioinformatic research, especially when taking into account the increasing number of sequenced organisms available, is to make this ever-growing number of comparisons computationally feasible in a reasonable amount of time. We propose to speed up the detection of orthologous proteins by using strings of domains to characterize the proteins. We present two new protein similarity measures, a cosine and a maximal weight matching score based on domain content similarity, and new software, named porthoDom. The qualities of the cosine and the maximal weight matching similarity measures are compared against curated datasets. The measures show that domain content similarities are able to correctly group proteins into their families. Accordingly, the cosine similarity measure is used inside porthoDom, the wrapper developed for proteinortho. porthoDom makes use of domain content similarity measures to group proteins together before searching for orthologs. By using domains instead of amino acid sequences, the reduction of the search space decreases the computational complexity of an all-against-all sequence comparison. We demonstrate that representing and comparing proteins as strings of discrete domains, i.e. as a concatenation of their unique identifiers, allows a drastic simplification of search space. porthoDom has the advantage of speeding up orthology detection while maintaining a degree of accuracy similar to proteinortho. The implementation of porthoDom is released using python and C++ languages and is available under the GNU GPL licence 3 at http://www.bornberglab.org/pages/porthoda .

  4. Searching the Sinus Amoris: Using profiles of geological units, impact and volcanic features to characterize a major terrane interface on the Moon

    Science.gov (United States)

    Clark, P.; Joerg, S.; Dehon, R.

    1994-02-01

    Geochemical profiles of surface units, impact, and volcanic features are studied in detail to determine the underlying structure in an area of extensive mare/highland interface, Sinus Amoris. This study region includes and surrounds the northeastern embayment of Mare Tranquillitatis. The concentrations of two major rock-forming elements (Mg and Al), which were derived from the Apollo 15 orbital geochemical measurements, were used in this study. Mapped units and deposits associated with craters in the northwestern part of the region tend to have correlated low Mg and Al concentrations, indicating the presence of Potassium (K)-Rare Earth Elements (REE)-Phosphorus (P) (KREEP)-enriched basalt. Found along the northeastern rim of Tranquillitatis were areas with correlated high Mg and Al concentration, indicating the presence of troctolite. Distinctive west/east and north/south trends were observed in the concentrations of Mg and Al, and, by implication, in the distribution of major rock components on the surface. Evidence for a systematic geochemical transition in highland or basin-forming units may be observed here in the form of distinctive differences in chemistry in otherwise similar units in the western and eastern portions of the study region.

  5. THE MASS PROFILE AND SHAPE OF BARS IN THE SPITZER SURVEY OF STELLAR STRUCTURE IN GALAXIES (S{sup 4}G): SEARCH FOR AN AGE INDICATOR FOR BARS

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Taehyun; Lee, Myung Gyoon [Astronomy Program, Department of Physics and Astronomy, Seoul National University, Seoul 151-742 (Korea, Republic of); Sheth, Kartik; Muñoz-Mateos, Juan-Carlos [National Radio Astronomy Observatory/NAASC, 520 Edgemont Road, Charlottesville, VA 22903 (United States); Gadotti, Dimitri A. [European Southern Observatory, Casilla 19001, Santiago 19 (Chile); Zaritsky, Dennis [University of Arizona, 933 N. Cherry Avenue, Tucson, AZ 85721 (United States); Elmegreen, Bruce G. [IBM Research Division, T. J. Watson Research Center, Yorktown Heights, NY 10598 (United States); Athanassoula, E.; Bosma, Albert [Aix Marseille Université, CNRS, LAM (Laboratoire d' Astrophysique de Marseille) UMR 7326, 13388 Marseille (France); Holwerda, Benne [European Space Agency, ESTEC, Keplerlaan 1, 2200-AG, Noordwijk (Netherlands); Ho, Luis C. [Kavli Institute for Astronomy and Astrophysics, Peking University, Beijing 100871 (China); Comerón, Sébastien; Laurikainen, Eija; Salo, Heikki [Division of Astronomy, Department of Physical Sciences, University of Oulu, Oulu, FIN-90014 (Finland); Knapen, Johan H.; Erroz-Ferrer, Santiago [Instituto de Astrofísica de Canarias, E-38200 La Laguna, Tenerife (Spain); Hinz, Joannah L. [MMTO, University of Arizona, 933 N. Cherry Avenue, Tucson, AZ 85721 (United States); Buta, Ronald J. [Department of Physics and Astronomy, University of Alabama, Box 870324, Tuscaloosa, AL 35487 (United States); Kim, Minjin [Korea Astronomy and Space Science Institute, Daejeon 305-348 (Korea, Republic of); Madore, Barry F. [The Observatories of the Carnegie Institution of Washington, 813 Santa Barbara Street, Pasadena, CA 91101 (United States); and others

    2015-01-20

    We have measured the radial light profiles and global shapes of bars using two-dimensional 3.6 μm image decompositions for 144 face-on barred galaxies from the Spitzer Survey of Stellar Structure in Galaxies. The bar surface brightness profile is correlated with the stellar mass and bulge-to-total (B/T) ratio of their host galaxies. Bars in massive and bulge-dominated galaxies (B/T > 0.2) show a flat profile, while bars in less massive, disk-dominated galaxies (B/T ∼ 0) show an exponential, disk-like profile with a wider spread in the radial profile than in the bulge-dominated galaxies. The global two-dimensional shapes of bars, however, are rectangular/boxy, independent of the bulge or disk properties. We speculate that because bars are formed out of disks, bars initially have an exponential (disk-like) profile that evolves over time, trapping more disk stars to boxy bar orbits. This leads bars to become stronger and have flatter profiles. The narrow spread of bar radial profiles in more massive disks suggests that these bars formed earlier (z > 1), while the disk-like profiles and a larger spread in the radial profile in less massive systems imply a later and more gradual evolution, consistent with the cosmological evolution of bars inferred from observational studies. Therefore, we expect that the flatness of the bar profile can be used as a dynamical age indicator of the bar to measure the time elapsed since the bar formation. We argue that cosmic gas accretion is required to explain our results on bar profile and the presence of gas within the bar region.

  6. In search of late time evolution self-similar scaling laws of Rayleigh-Taylor and Richtmyer-Meshkov hydrodynamic instabilities - recent theorical advance and NIF Discovery-Science experiments

    Science.gov (United States)

    Shvarts, Dov

    2017-10-01

    Hydrodynamic instabilities, and the mixing that they cause, are of crucial importance in describing many phenomena, from very large scales such as stellar explosions (supernovae) to very small scales, such as inertial confinement fusion (ICF) implosions. Such mixing causes the ejection of stellar core material in supernovae, and impedes attempts at ICF ignition. The Rayleigh-Taylor instability (RTI) occurs at an accelerated interface between two fluids with the lower density accelerating the higher density fluid. The Richtmyer-Meshkov (RM) instability occurs when a shock wave passes an interface between the two fluids of different density. In the RTI, buoyancy causes ``bubbles'' of the light fluid to rise through (penetrate) the denser fluid, while ``spikes'' of the heavy fluid sink through (penetrate) the lighter fluid. With realistic multi-mode initial conditions, in the deep nonlinear regime, the mixing zone width, H, and its internal structure, progress through an inverse cascade of spatial scales, reaching an asymptotic self-similar evolution: hRT =αRT Agt2 for RT and hRM =αRM tθ for RM. While this characteristic behavior has been known for years, the self-similar parameters αRT and θRM and their dependence on dimensionality and density ratio have continued to be intensively studied and a relatively wide distribution of those values have emerged. This talk will describe recent theoretical advances in the description of this turbulent mixing evolution that sheds light on the spread in αRT and θRM. Results of new and specially designed experiments, done by scientists from several laboratories, were performed recently using NIF, the only facility that is powerful enough to reach the self-similar regime, for quantitative testing of this theoretical advance, will be presented.

  7. Cube search, revisited

    Science.gov (United States)

    Zhang, Xuetao; Huang, Jie; Yigit-Elliott, Serap; Rosenholtz, Ruth

    2015-01-01

    Observers can quickly search among shaded cubes for one lit from a unique direction. However, replace the cubes with similar 2-D patterns that do not appear to have a 3-D shape, and search difficulty increases. These results have challenged models of visual search and attention. We demonstrate that cube search displays differ from those with “equivalent” 2-D search items in terms of the informativeness of fairly low-level image statistics. This informativeness predicts peripheral discriminability of target-present from target-absent patches, which in turn predicts visual search performance, across a wide range of conditions. Comparing model performance on a number of classic search tasks, cube search does not appear unexpectedly easy. Easy cube search, per se, does not provide evidence for preattentive computation of 3-D scene properties. However, search asymmetries derived from rotating and/or flipping the cube search displays cannot be explained by the information in our current set of image statistics. This may merely suggest a need to modify the model's set of 2-D image statistics. Alternatively, it may be difficult cube search that provides evidence for preattentive computation of 3-D scene properties. By attributing 2-D luminance variations to a shaded 3-D shape, 3-D scene understanding may slow search for 2-D features of the target. PMID:25780063

  8. Search Patterns

    CERN Document Server

    Morville, Peter

    2010-01-01

    What people are saying about Search Patterns "Search Patterns is a delight to read -- very thoughtful and thought provoking. It's the most comprehensive survey of designing effective search experiences I've seen." --Irene Au, Director of User Experience, Google "I love this book! Thanks to Peter and Jeffery, I now know that search (yes, boring old yucky who cares search) is one of the coolest ways around of looking at the world." --Dan Roam, author, The Back of the Napkin (Portfolio Hardcover) "Search Patterns is a playful guide to the practical concerns of search interface design. It cont

  9. Similarity transformations of MAPs

    Directory of Open Access Journals (Sweden)

    Andersen Allan T.

    1999-01-01

    Full Text Available We introduce the notion of similar Markovian Arrival Processes (MAPs and show that the event stationary point processes related to two similar MAPs are stochastically equivalent. This holds true for the time stationary point processes too. We show that several well known stochastical equivalences as e.g. that between the H 2 renewal process and the Interrupted Poisson Process (IPP can be expressed by the similarity transformations of MAPs. In the appendix the valid region of similarity transformations for two-state MAPs is characterized.

  10. Similarity after Goodman

    NARCIS (Netherlands)

    Decock, L.B.; Douven, I.

    2011-01-01

    In a famous critique, Goodman dismissed similarity as a slippery and both philosophically and scientifically useless notion. We revisit his critique in the light of important recent work on similarity in psychology and cognitive science. Specifically, we use Tversky's influential set-theoretic

  11. Judgments of brand similarity

    NARCIS (Netherlands)

    Bijmolt, THA; Wedel, M; Pieters, RGM; DeSarbo, WS

    This paper provides empirical insight into the way consumers make pairwise similarity judgments between brands, and how familiarity with the brands, serial position of the pair in a sequence, and the presentation format affect these judgments. Within the similarity judgment process both the

  12. Similarity Measure of Graphs

    Directory of Open Access Journals (Sweden)

    Amine Labriji

    2017-07-01

    Full Text Available The topic of identifying the similarity of graphs was considered as highly recommended research field in the Web semantic, artificial intelligence, the shape recognition and information research. One of the fundamental problems of graph databases is finding similar graphs to a graph query. Existing approaches dealing with this problem are usually based on the nodes and arcs of the two graphs, regardless of parental semantic links. For instance, a common connection is not identified as being part of the similarity of two graphs in cases like two graphs without common concepts, the measure of similarity based on the union of two graphs, or the one based on the notion of maximum common sub-graph (SCM, or the distance of edition of graphs. This leads to an inadequate situation in the context of information research. To overcome this problem, we suggest a new measure of similarity between graphs, based on the similarity measure of Wu and Palmer. We have shown that this new measure satisfies the properties of a measure of similarities and we applied this new measure on examples. The results show that our measure provides a run time with a gain of time compared to existing approaches. In addition, we compared the relevance of the similarity values obtained, it appears that this new graphs measure is advantageous and  offers a contribution to solving the problem mentioned above.

  13. Antigen-specific CD4 T cells are induced after intravesical BCG-instillation therapy in patients with bladder cancer and show similar cytokine profiles as in active tuberculosis.

    Directory of Open Access Journals (Sweden)

    Julia Elsäßer

    Full Text Available Specific T cell immunity in patients with active tuberculosis is associated with a decrease in multifunctionality. However, it is unknown whether cytokine profiles differ in patients with primary infection and those with prior contact. We therefore used intravesical immunotherapy with attenuated live Bacille Calmette-Guérin (BCG in patients with urothelial carcinoma as a model to characterise the induction of systemic immunity towards purified protein derivate (PPD and to study whether cytokine profiles differ depending on pre-existing immunity. Eighteen patients with non-muscle invasive bladder cancer were recruited during the BCG-induction course. Fifty-four healthy individuals served as controls. Interferon (IFN-γ and interleukin (IL-2 producing PPD-specific CD4 T cells were analysed longitudinally before each instillation using a rapid flow-cytometric whole blood immunoassay. Baseline levels of IFN-γ producing PPD-specific T cells were comparable to controls. T cells showed a 5-fold increase to 0.23% by week 2/3, and further increased 8-fold by week 4/5 (to 0.42%, p=0.0007. Systemic immunity was induced in all patients, although the increase was less pronounced in patients with pre-existing immunity. As in active TB, cytokine profiling during therapy revealed a lower percentage of multifunctional IFN-γ/IL-2 double-positive T cells compared to controls (60.2% vs. 71.9%, p=0.0003. Of note, when comparing patients with and without pre-existing immunity, cytokine profiles in patients with primary immunity were shifted towards IL-2 single producing T cells (p=0.02, whereas those in patients with pre-existing immunity were shifted towards IFN-γ single-positivity (p=0.01. In conclusion, systemic T cell responses were induced after BCG-therapy, and their kinetics and cytokine profile depended on pre-existing immunity. Decreased functionality is a typical feature of specific immunity in both patients with active tuberculosis and BCG

  14. New Similarity Functions

    DEFF Research Database (Denmark)

    Yazdani, Hossein; Ortiz-Arroyo, Daniel; Kwasnicka, Halina

    2016-01-01

    spaces, in addition to their similarity in the vector space. Prioritized Weighted Feature Distance (PWFD) works similarly as WFD, but provides the ability to give priorities to desirable features. The accuracy of the proposed functions are compared with other similarity functions on several data sets......In data science, there are important parameters that affect the accuracy of the algorithms used. Some of these parameters are: the type of data objects, the membership assignments, and distance or similarity functions. This paper discusses similarity functions as fundamental elements in membership...... assignments. The paper introduces Weighted Feature Distance (WFD), and Prioritized Weighted Feature Distance (PWFD), two new distance functions that take into account the diversity in feature spaces. WFD functions perform better in supervised and unsupervised methods by comparing data objects on their feature...

  15. Optimization of interactive visual-similarity-based search

    NARCIS (Netherlands)

    Nguyen, G.P.; Worring, M.

    2008-01-01

    At one end of the spectrum, research in interactive content-based retrieval concentrates on machine learning methods for effective use of relevance feedback. On the other end, the information visualization community focuses on effective methods for conveying information to the user. What is lacking

  16. The semantic similarity ensemble

    Directory of Open Access Journals (Sweden)

    Andrea Ballatore

    2013-12-01

    Full Text Available Computational measures of semantic similarity between geographic terms provide valuable support across geographic information retrieval, data mining, and information integration. To date, a wide variety of approaches to geo-semantic similarity have been devised. A judgment of similarity is not intrinsically right or wrong, but obtains a certain degree of cognitive plausibility, depending on how closely it mimics human behavior. Thus selecting the most appropriate measure for a specific task is a significant challenge. To address this issue, we make an analogy between computational similarity measures and soliciting domain expert opinions, which incorporate a subjective set of beliefs, perceptions, hypotheses, and epistemic biases. Following this analogy, we define the semantic similarity ensemble (SSE as a composition of different similarity measures, acting as a panel of experts having to reach a decision on the semantic similarity of a set of geographic terms. The approach is evaluated in comparison to human judgments, and results indicate that an SSE performs better than the average of its parts. Although the best member tends to outperform the ensemble, all ensembles outperform the average performance of each ensemble's member. Hence, in contexts where the best measure is unknown, the ensemble provides a more cognitively plausible approach.

  17. Additive Similarity Trees

    Science.gov (United States)

    Sattath, Shmuel; Tversky, Amos

    1977-01-01

    Tree representations of similarity data are investigated. Hierarchical clustering is critically examined, and a more general procedure, called the additive tree, is presented. The additive tree representation is then compared to multidimensional scaling. (Author/JKS)

  18. Gender similarities and differences.

    Science.gov (United States)

    Hyde, Janet Shibley

    2014-01-01

    Whether men and women are fundamentally different or similar has been debated for more than a century. This review summarizes major theories designed to explain gender differences: evolutionary theories, cognitive social learning theory, sociocultural theory, and expectancy-value theory. The gender similarities hypothesis raises the possibility of theorizing gender similarities. Statistical methods for the analysis of gender differences and similarities are reviewed, including effect sizes, meta-analysis, taxometric analysis, and equivalence testing. Then, relying mainly on evidence from meta-analyses, gender differences are reviewed in cognitive performance (e.g., math performance), personality and social behaviors (e.g., temperament, emotions, aggression, and leadership), and psychological well-being. The evidence on gender differences in variance is summarized. The final sections explore applications of intersectionality and directions for future research.

  19. Visual search

    NARCIS (Netherlands)

    Toet, A.; Bijl, P.

    2003-01-01

    Visual search, with or without the aid of optical or electro-optical instruments, plays a significant role in various types of military and civilian operations (e.g., reconnaissance, surveillance, and search and rescue). Advance knowledge of human visual search and target acquisition performance is

  20. Gene expression profile of high-fat diet-fed C57BL/6J mice: in search of potential role of azelaic acid.

    Science.gov (United States)

    Muthulakshmi, Shanmugam; Chakrabarti, Alok K; Mukherjee, Sanjay

    2015-03-01

    High-fat diet (HFD) elevates circulatory fatty acids and influences glucose and fat metabolism. Azelaic acid (AzA), a naturally occurring α,ω-dicarboxylic acid in wheat, rye, barley, oat seeds and sorghum, has been reported to exert antidiabetic effects in HFD-induced type 2 diabetes mellitus (T2DM) C57BL/6J mice. The present study was undertaken to identify the genes that are differentially modulated by treatment with AzA in HFD-fed mice. Mice were fed HFD for 10 weeks and subjected to intragastric administration of 80 mg/kg body weight (BW) of AzA daily along with HFD from 11 to 15 weeks. Lipid profile, adipokines and cytokines were examined in the plasma/liver of mice. Whole genome profiling was performed in the liver of mice using microarray and validated by qRT-PCR, Western blot and immunohistochemical analyses. HFD intake resulted in significantly elevated lipids (except high-density lipoproteins), resistin, tumour necrosis factor alpha and interleukin-6 with marked reduction in adiponectin. Administration of AzA to HFD-fed mice significantly restored the lipids, adipokines and cytokines to near normal. Transcript profiling revealed that HFD intake activated the genes involved in stress response, cell cycle regulation and apoptosis. Treatment with AzA caused increased expression of genes involved in reactive oxygen species (ROS) scavenging, receptor-mediated signalling, transcription, protein modification and insulin signal transduction. AzA activates insulin signal molecules leading to insulin sensitivity. The ability of AzA to modulate the expression of these genes supports the notion that AzA is a promising drug candidate for the treatment of insulin resistance associated with T2DM.

  1. Trajectory similarity join in spatial networks

    KAUST Repository

    Shang, Shuo

    2017-09-07

    The matching of similar pairs of objects, called similarity join, is fundamental functionality in data management. We consider the case of trajectory similarity join (TS-Join), where the objects are trajectories of vehicles moving in road networks. Thus, given two sets of trajectories and a threshold θ, the TS-Join returns all pairs of trajectories from the two sets with similarity above θ. This join targets applications such as trajectory near-duplicate detection, data cleaning, ridesharing recommendation, and traffic congestion prediction. With these applications in mind, we provide a purposeful definition of similarity. To enable efficient TS-Join processing on large sets of trajectories, we develop search space pruning techniques and take into account the parallel processing capabilities of modern processors. Specifically, we present a two-phase divide-and-conquer algorithm. For each trajectory, the algorithm first finds similar trajectories. Then it merges the results to achieve a final result. The algorithm exploits an upper bound on the spatiotemporal similarity and a heuristic scheduling strategy for search space pruning. The algorithm\\'s per-trajectory searches are independent of each other and can be performed in parallel, and the merging has constant cost. An empirical study with real data offers insight in the performance of the algorithm and demonstrates that is capable of outperforming a well-designed baseline algorithm by an order of magnitude.

  2. Similarity or difference?

    DEFF Research Database (Denmark)

    Villadsen, Anders Ryom

    2013-01-01

    While the organizational structures and strategies of public organizations have attracted substantial research attention among public management scholars, little research has explored how these organizational core dimensions are interconnected and influenced by pressures for similarity....... In this paper I address this topic by exploring the relation between expenditure strategy isomorphism and structure isomorphism in Danish municipalities. Different literatures suggest that organizations exist in concurrent pressures for being similar to and different from other organizations in their field...... of action. It is theorized that to meet this challenge organizations may substitute increased similarity on one core dimension for increased idiosyncrasy on another, but only after a certain level of isomorphism is reached. Results of quantitative analyses support this theory and show that an inverse U...

  3. Incremental Similarity and Turbulence

    DEFF Research Database (Denmark)

    Barndorff-Nielsen, Ole E.; Hedevang, Emil; Schmiegel, Jürgen

    This paper discusses the mathematical representation of an empirically observed phenomenon, referred to as Incremental Similarity. We discuss this feature from the viewpoint of stochastic processes and present a variety of non-trivial examples, including those that are of relevance for turbulence...

  4. Efficient Similarity Retrieval in Music Databases

    DEFF Research Database (Denmark)

    Ruxanda, Maria Magdalena; Jensen, Christian Søndergaard

    2006-01-01

    Audio music is increasingly becoming available in digital form, and the digital music collections of individuals continue to grow. Addressing the need for effective means of retrieving music from such collections, this paper proposes new techniques for content-based similarity search. Each music...

  5. Music Retrieval based on Melodic Similarity

    NARCIS (Netherlands)

    Typke, R.

    2007-01-01

    This thesis introduces a method for measuring melodic similarity for notated music such as MIDI files. This music search algorithm views music as sets of notes that are represented as weighted points in the two-dimensional space of time and pitch. Two point sets can be compared by calculating how

  6. Faceted Search

    CERN Document Server

    Tunkelang, Daniel

    2009-01-01

    We live in an information age that requires us, more than ever, to represent, access, and use information. Over the last several decades, we have developed a modern science and technology for information retrieval, relentlessly pursuing the vision of a "memex" that Vannevar Bush proposed in his seminal article, "As We May Think." Faceted search plays a key role in this program. Faceted search addresses weaknesses of conventional search approaches and has emerged as a foundation for interactive information retrieval. User studies demonstrate that faceted search provides more

  7. More Similar Than Different

    DEFF Research Database (Denmark)

    Pedersen, Mogens Jin

    2015-01-01

    What role do employee features play into the success of different personnel management practices for serving high performance? Using data from a randomized survey experiment among 5,982 individuals of all ages, this article examines how gender conditions the compliance effects of different...... incentive treatments—each relating to the basic content of distinct types of personnel management practices. The findings show that males and females are more similar than different in terms of the incentive treatments’ effects: Significant average effects are found for three out of five incentive...

  8. External Differences and Internal Similarities

    DEFF Research Database (Denmark)

    Katz-Gerro, Tally; Jæger, Mads Meier

    This paper analyzes change in cultural consumption profiles in Denmark over the period 1975-2004 along three dimensions: number of profiles; between-profile socioeconomic heterogeneity; and within-profile socioeconomic heterogeneity. Previous studies have considered the first and second aforement...

  9. Advanced Search

    African Journals Online (AJOL)

    Items 151 - 200 of 500 ... Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more ...

  10. Search Results

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  11. Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  12. The application of similar image retrieval in electronic commerce.

    Science.gov (United States)

    Hu, YuPing; Yin, Hua; Han, Dezhi; Yu, Fei

    2014-01-01

    Traditional online shopping platform (OSP), which searches product information by keywords, faces three problems: indirect search mode, large search space, and inaccuracy in search results. For solving these problems, we discuss and research the application of similar image retrieval in electronic commerce. Aiming at improving the network customers' experience and providing merchants with the accuracy of advertising, we design a reasonable and extensive electronic commerce application system, which includes three subsystems: image search display subsystem, image search subsystem, and product information collecting subsystem. This system can provide seamless connection between information platform and OSP, on which consumers can automatically and directly search similar images according to the pictures from information platform. At the same time, it can be used to provide accuracy of internet marketing for enterprises. The experiment shows the efficiency of constructing the system.

  13. The Application of Similar Image Retrieval in Electronic Commerce

    Science.gov (United States)

    Hu, YuPing; Yin, Hua; Han, Dezhi; Yu, Fei

    2014-01-01

    Traditional online shopping platform (OSP), which searches product information by keywords, faces three problems: indirect search mode, large search space, and inaccuracy in search results. For solving these problems, we discuss and research the application of similar image retrieval in electronic commerce. Aiming at improving the network customers' experience and providing merchants with the accuracy of advertising, we design a reasonable and extensive electronic commerce application system, which includes three subsystems: image search display subsystem, image search subsystem, and product information collecting subsystem. This system can provide seamless connection between information platform and OSP, on which consumers can automatically and directly search similar images according to the pictures from information platform. At the same time, it can be used to provide accuracy of internet marketing for enterprises. The experiment shows the efficiency of constructing the system. PMID:24883411

  14. The Application of Similar Image Retrieval in Electronic Commerce

    Directory of Open Access Journals (Sweden)

    YuPing Hu

    2014-01-01

    Full Text Available Traditional online shopping platform (OSP, which searches product information by keywords, faces three problems: indirect search mode, large search space, and inaccuracy in search results. For solving these problems, we discuss and research the application of similar image retrieval in electronic commerce. Aiming at improving the network customers’ experience and providing merchants with the accuracy of advertising, we design a reasonable and extensive electronic commerce application system, which includes three subsystems: image search display subsystem, image search subsystem, and product information collecting subsystem. This system can provide seamless connection between information platform and OSP, on which consumers can automatically and directly search similar images according to the pictures from information platform. At the same time, it can be used to provide accuracy of internet marketing for enterprises. The experiment shows the efficiency of constructing the system.

  15. Similar or different?

    DEFF Research Database (Denmark)

    Cornér, Solveig; Pyhältö, Kirsi; Peltonen, Jouni

    2018-01-01

    experienced by PhD students within the same discipline. This study explores the support experiences of 381 PhD students within the humanities and social sciences from three research-intensive universities in Denmark (n=145) and Finland (n=236). The mixed methods design was utilized. The data were collected...... counter partners, whereas the Finnish students perceived lower levels of instrumental support than the Danish students. The findings imply that seemingly similar contexts hold valid differences in experienced social support and educational strategies at the PhD level.......Previous research has identified researcher community and supervisory support as key determinants of the doctoral journey contributing to students’ persistence and robustness. However, we still know little about cross-cultural variation in the researcher community and supervisory support...

  16. Visual search: a retrospective.

    Science.gov (United States)

    Eckstein, Miguel P

    2011-12-30

    Visual search, a vital task for humans and animals, has also become a common and important tool for studying many topics central to active vision and cognition ranging from spatial vision, attention, and oculomotor control to memory, decision making, and rewards. While visual search often seems effortless to humans, trying to recreate human visual search abilities in machines has represented an incredible challenge for computer scientists and engineers. What are the brain computations that ensure successful search? This review article draws on efforts from various subfields and discusses the mechanisms and strategies the brain uses to optimize visual search: the psychophysical evidence, their neural correlates, and if unknown, possible loci of the neural computations. Mechanisms and strategies include use of knowledge about the target, distractor, background statistical properties, location probabilities, contextual cues, scene context, rewards, target prevalence, and also the role of saliency, center-surround organization of search templates, and eye movement plans. I provide overviews of classic and contemporary theories of covert attention and eye movements during search explaining their differences and similarities. To allow the reader to anchor some of the laboratory findings to real-world tasks, the article includes interviews with three expert searchers: a radiologist, a fisherman, and a satellite image analyst.

  17. Identification of specific markers for amphetamine synthesised from the pre-precursor APAAN following the Leuckart route and retrospective search for APAAN markers in profiling databases from Germany and the Netherlands.

    Science.gov (United States)

    Hauser, Frank M; Rößler, Thorsten; Hulshof, Janneke W; Weigel, Diana; Zimmermann, Ralf; Pütz, Michael

    2017-08-30

    α-Phenylacetoacetonitrile (APAAN) is one of the most important pre-precursors for amphetamine production in recent years. This assumption is based on seizure data but there is little analytical data available showing how much amphetamine really originated from APAAN. In this study, several syntheses of amphetamine following the Leuckart route were performed starting from different organic compounds including APAAN. The organic phases were analysed using gas chromatography-mass spectrometry (GC-MS) to search for signals caused by possible APAAN markers. Three compounds were discovered, isolated, and based on the performed syntheses it was found that they are highly specific for the use of APAAN. Using mass spectra, high resolution MS and nuclear magnetic resonance (NMR) data the compounds were characterised and identified as 2-phenyl-2-butenenitrile, 3-amino-2-phenyl-2-butenenitrile, and 4-amino-6-methyl-5-phenylpyrimidine. To investigate their significance, they were searched in data from seized amphetamine samples to determine to what extent they were present in illicitly produced amphetamine. Data of more than 580 cases from amphetamine profiling databases in Germany and the Netherlands were used for this purpose. These databases allowed analysis of the yearly occurrence of the markers going back to 2009. The markers revealed a trend that was in agreement with seizure reports and reflected an increasing use of APAAN from 2010 on. This paper presents experimental proof that APAAN is indeed the most important pre-precursor of amphetamine in recent years. It also illustrates how important it is to look for new ways to identify current trends in drug production since such trends can change within a few years. Copyright © 2017 John Wiley & Sons, Ltd.

  18. Autonomous search

    CERN Document Server

    Hamadi, Youssef; Saubion, Frédéric

    2012-01-01

    Autonomous combinatorial search (AS) represents a new field in combinatorial problem solving. Its major standpoint and originality is that it considers that problem solvers must be capable of self-improvement operations. This is the first book dedicated to AS.

  19. Measuring Source Code Similarity Using Reference Vectors

    Science.gov (United States)

    Ohno, Asako; Murao, Hajime

    In this paper, we propose a novel method to measure similarity between program source codes. Different to others, our method doues not compare two source codes directly but compares two reference vectors, where a reference vector is calculated from one source code and a set of reference source codes. This means that our method requires no original source code when considering an application open to public such as a search engine for the program source code on the internet. We have built a simple search system and have evaluated with Java source codes made in the university course of basic programming. Results show that the system can achieve quite high average precision rate in very short time which means the proposed method can measure correct similarity very fast.

  20. The physical therapy profile questionnaire (PTPQ): development, validation and pilot testing

    OpenAIRE

    Grimmer-Somers Karen; Dizon Janine Margarita R; Kumar Saravana

    2011-01-01

    Abstract Background Country by country similarities and differences in physical therapy practice exists. Therefore, before updates in practice can be provided, such as trainings in evidence-based practice, it is necessary to identify the profile and nature of practice in a given country or setting. Following a search of the international literature, no appropriate tool was identified to collect and establish data to create the profile of physical therapy practice in the Philippines. We theref...

  1. $A$ searches

    CERN Document Server

    Beacham, James

    The Standard Model of particle physics encompasses three of the four known fundamental forces of nature, and has remarkably withstood all of the experimental tests of its predictions, a fact solidified by the discovery, in 2012, of the Higgs boson. However, it cannot be the complete picture. Many measurements have been made that hint at physics beyond the Standard Model, and the main task of the high- energy experimental physics community is to conduct searches for new physics in as many di↵erent places and regimes as possible. I present three searches for new phenomena in three di↵erent high-energy collider experiments, namely, a search for events with at least three photons in the final state, which is sensitive to an exotic decay of a Higgs boson into four photons via intermediate pseudoscalar particles, a, with ATLAS, at the Large Hadron Collider; a search for a dark photon, also known as an A0 , with APEX, at Thomas Je↵erson National Accelerator Facility; and a search for a Higgs decaying into four...

  2. Cross Validated Temperament Scale Validities Computed Using Profile Similarity Metrics

    Science.gov (United States)

    2017-04-27

    The Challenge and Opportunity of the Inverted U. Perspectives on Psychological Science, 6, 61-76. Hogan, R. (2005). In defense of personality ...27 April 2017 at the 32nd Annual Conference of the Society for Industrial and Organizational Psychology , Orlando, FL Disclaimer: All...14. ABSTRACT Personality and temperament scales are used in employment settings to predict performance because they are valid and have

  3. Face Search at Scale.

    Science.gov (United States)

    Wang, Dayong; Otto, Charles; Jain, Anil K

    2017-06-01

    Given the prevalence of social media websites, one challenge facing computer vision researchers is to devise methods to search for persons of interest among the billions of shared photos on these websites. Despite significant progress in face recognition, searching a large collection of unconstrained face images remains a difficult problem. To address this challenge, we propose a face search system which combines a fast search procedure, coupled with a state-of-the-art commercial off the shelf (COTS) matcher, in a cascaded framework. Given a probe face, we first filter the large gallery of photos to find the top- k most similar faces using features learned by a convolutional neural network. The k retrieved candidates are re-ranked by combining similarities based on deep features and those output by the COTS matcher. We evaluate the proposed face search system on a gallery containing 80 million web-downloaded face images. Experimental results demonstrate that while the deep features perform worse than the COTS matcher on a mugshot dataset (93.7 percent versus 98.6 percent TAR@FAR of 0.01 percent), fusing the deep features with the COTS matcher improves the overall performance ( 99.5 percent TAR@FAR of 0.01 percent). This shows that the learned deep features provide complementary information over representations used in state-of-the-art face matchers. On the unconstrained face image benchmarks, the performance of the learned deep features is competitive with reported accuracies. LFW database: 98.20 percent accuracy under the standard protocol and 88.03 percent TAR@FAR of 0.1 percent under the BLUFR protocol; IJB-A benchmark: 51.0 percent TAR@FAR of 0.1 percent (verification), rank 1 retrieval of 82.2 percent (closed-set search), 61.5 percent FNIR@FAR of 1 percent (open-set search). The proposed face search system offers an excellent trade-off between accuracy and scalability on galleries with millions of images. Additionally, in a face search experiment involving

  4. A class of self-similar hydrodynamics test problems

    Energy Technology Data Exchange (ETDEWEB)

    Ramsey, Scott D [Los Alamos National Laboratory; Brown, Lowell S [Los Alamos National Laboratory; Nelson, Eric M [Los Alamos National Laboratory; Alme, Marv L [Los Alamos National Laboratory

    2010-12-08

    We consider self-similar solutions to the gas dynamics equations. One such solution - a spherical geometry Gaussian density profile - has been analyzed in the existing literature, and a connection between it, a linear velocity profile, and a uniform specific internal energy profile has been identified. In this work, we assume the linear velocity profile to construct an entire class of self-similar sol utions in both cylindrical and spherical geometry, of which the Gaussian form is one possible member. After completing the derivation, we present some results in the context of a test problem for compressible flow codes.

  5. Notions of similarity for computational biology models

    KAUST Repository

    Waltemath, Dagmar

    2016-03-21

    Computational models used in biology are rapidly increasing in complexity, size, and numbers. To build such large models, researchers need to rely on software tools for model retrieval, model combination, and version control. These tools need to be able to quantify the differences and similarities between computational models. However, depending on the specific application, the notion of similarity may greatly vary. A general notion of model similarity, applicable to various types of models, is still missing. Here, we introduce a general notion of quantitative model similarities, survey the use of existing model comparison methods in model building and management, and discuss potential applications of model comparison. To frame model comparison as a general problem, we describe a theoretical approach to defining and computing similarities based on different model aspects. Potentially relevant aspects of a model comprise its references to biological entities, network structure, mathematical equations and parameters, and dynamic behaviour. Future similarity measures could combine these model aspects in flexible, problem-specific ways in order to mimic users\\' intuition about model similarity, and to support complex model searches in databases.

  6. A rapid profiling of gallotannins and flavonoids of the aqueous extract of Rhus coriaria L. by flow injection analysis with high-resolution mass spectrometry assisted with database searching.

    Science.gov (United States)

    Regazzoni, Luca; Arlandini, Emanuele; Garzon, Davide; Santagati, Natale Alfredo; Beretta, Giangiacomo; Maffei Facino, Roberto

    2013-01-01

    Hydrolysable tannins appear to have some extremely important biological roles including antioxidant, antibacterial, anti-inflammatory, anti-hypoglycemic, anti-angiogenic, and anticancer activities. The aim of this work was to set up a flow injection high-resolution mass spectrometric approach combined with database searching to obtain rapidly a profiling of gallotannins and other phenolics in a crude extract from plant tissue. The flow injection analysis (FIA) takes place in an electrospray ionization source of an hybrid orbitrap high resolution mass spectrometry system (ESI-HR-MS/MS(2), resolution 100,000, negative ion mode) and polyphenols are tentatively identified by matching the monoisotopic masses of the spectra with those of polyphenols databases. This leads to the most probable molecular formulas and to the possible structures among those reported in the database. The structures confirmation occurs by the compliance of MS(2) fragments with those of a prediction fragment commercial database. With this method we identified in the aqueous extract of sumac leaves, with a maximum error of 1.7 ppm, a group of ten gallotannins from mono- to deca-galloyl glycosides of the class of hydrolysable tannins and a set of coextracted flavonoid derivatives including myricetin, quercetin-3-O-rhamnoside, myricetin-3-O-glucoside, myricetin-3-O-glucuronide, and myricetin-3-O-rhamnoglucoside. The separation of isomers of gallotannins and flavonoids present in the same extract occurred by high-performance liquid chromatography with electrospray ionization tandem mass spectrometry (HPLC/ESI-HR-MS(2)); this approach allowed the structure resolution of the isobaric flavonoids quercetin-3-O-glucoside and myricetin-3-O-rhamnoside. Copyright © 2012 Elsevier B.V. All rights reserved.

  7. Internet Search Engines

    OpenAIRE

    Fatmaa El Zahraa Mohamed Abdou

    2004-01-01

    A general study about the internet search engines, the study deals main 7 points; the differance between search engines and search directories, components of search engines, the percentage of sites covered by search engines, cataloging of sites, the needed time for sites appearance in search engines, search capabilities, and types of search engines.

  8. Methods of Profile Cloning Detection in Online Social Networks

    Directory of Open Access Journals (Sweden)

    Zabielski Michał

    2016-01-01

    Full Text Available With the arrival of online social networks, the importance of privacy on the Internet has increased dramatically. Thus, it is important to develop mechanisms that will prevent our hidden personal data from unauthorized access and use. In this paper an attempt was made to present a concept of profile cloning detection in Online Social Networks (OSN using Graph and Networks Theory. By analysing structural similarity of network and value of attributes of user personal profile, we will be able to search for attackers which steal our identity.

  9. Textual and chemical information processing: different domains but similar algorithms

    Directory of Open Access Journals (Sweden)

    Peter Willett

    2000-01-01

    Full Text Available This paper discusses the extent to which algorithms developed for the processing of textual databases are also applicable to the processing of chemical structure databases, and vice versa. Applications discussed include: an algorithm for distribution sorting that has been applied to the design of screening systems for rapid chemical substructure searching; the use of measures of inter-molecular structural similarity for the analysis of hypertext graphs; a genetic algorithm for calculating term weights for relevance feedback searching for determining whether a molecule is likely to exhibit biological activity; and the use of data fusion to combine the results of different chemical similarity searches.

  10. Resolving Person Names in Web People Search

    Science.gov (United States)

    Balog, Krisztian; Azzopardi, Leif; de Rijke, Maarten

    Disambiguating person names in a set of documents (such as a set of web pages returned in response to a person name) is a key task for the presentation of results and the automatic profiling of experts. With largely unstructured documents and an unknown number of people with the same name the problem presents many difficulties and challenges. This chapter treats the task of person name disambiguation as a document clustering problem, where it is assumed that the documents represent particular people. This leads to the person cluster hypothesis, which states that similar documents tend to represent the same person. Single Pass Clustering, k-Means Clustering, Agglomerative Clustering and Probabilistic Latent Semantic Analysis are employed and empirically evaluated in this context. On the SemEval 2007 Web People Search it is shown that the person cluster hypothesis holds reasonably well and that the Single Pass Clustering and Agglomerative Clustering methods provide the best performance.

  11. Clinician search behaviors may be influenced by search engine design.

    Science.gov (United States)

    Lau, Annie Y S; Coiera, Enrico; Zrimec, Tatjana; Compton, Paul

    2010-06-30

    Searching the Web for documents using information retrieval systems plays an important part in clinicians' practice of evidence-based medicine. While much research focuses on the design of methods to retrieve documents, there has been little examination of the way different search engine capabilities influence clinician search behaviors. Previous studies have shown that use of task-based search engines allows for faster searches with no loss of decision accuracy compared with resource-based engines. We hypothesized that changes in search behaviors may explain these differences. In all, 75 clinicians (44 doctors and 31 clinical nurse consultants) were randomized to use either a resource-based or a task-based version of a clinical information retrieval system to answer questions about 8 clinical scenarios in a controlled setting in a university computer laboratory. Clinicians using the resource-based system could select 1 of 6 resources, such as PubMed; clinicians using the task-based system could select 1 of 6 clinical tasks, such as diagnosis. Clinicians in both systems could reformulate search queries. System logs unobtrusively capturing clinicians' interactions with the systems were coded and analyzed for clinicians' search actions and query reformulation strategies. The most frequent search action of clinicians using the resource-based system was to explore a new resource with the same query, that is, these clinicians exhibited a "breadth-first" search behaviour. Of 1398 search actions, clinicians using the resource-based system conducted 401 (28.7%, 95% confidence interval [CI] 26.37-31.11) in this way. In contrast, the majority of clinicians using the task-based system exhibited a "depth-first" search behavior in which they reformulated query keywords while keeping to the same task profiles. Of 585 search actions conducted by clinicians using the task-based system, 379 (64.8%, 95% CI 60.83-68.55) were conducted in this way. This study provides evidence that

  12. Inferring Trust Based on Similarity with TILLIT

    Science.gov (United States)

    Tavakolifard, Mozhgan; Herrmann, Peter; Knapskog, Svein J.

    A network of people having established trust relations and a model for propagation of related trust scores are fundamental building blocks in many of today’s most successful e-commerce and recommendation systems. However, the web of trust is often too sparse to predict trust values between non-familiar people with high accuracy. Trust inferences are transitive associations among users in the context of an underlying social network and may provide additional information to alleviate the consequences of the sparsity and possible cold-start problems. Such approaches are helpful, provided that a complete trust path exists between the two users. An alternative approach to the problem is advocated in this paper. Based on collaborative filtering one can exploit the like-mindedness resp. similarity of individuals to infer trust to yet unknown parties which increases the trust relations in the web. For instance, if one knows that with respect to a specific property, two parties are trusted alike by a large number of different trusters, one can assume that they are similar. Thus, if one has a certain degree of trust to the one party, one can safely assume a very similar trustworthiness of the other one. In an attempt to provide high quality recommendations and proper initial trust values even when no complete trust propagation path or user profile exists, we propose TILLIT — a model based on combination of trust inferences and user similarity. The similarity is derived from the structure of the trust graph and users’ trust behavior as opposed to other collaborative-filtering based approaches which use ratings of items or user’s profile. We describe an algorithm realizing the approach based on a combination of trust inferences and user similarity, and validate the algorithm using a real large-scale data-set.

  13. Visibiome: an efficient microbiome search engine based on a scalable, distributed architecture.

    Science.gov (United States)

    Azman, Syafiq Kamarul; Anwar, Muhammad Zohaib; Henschel, Andreas

    2017-07-24

    Given the current influx of 16S rRNA profiles of microbiota samples, it is conceivable that large amounts of them eventually are available for search, comparison and contextualization with respect to novel samples. This process facilitates the identification of similar compositional features in microbiota elsewhere and therefore can help to understand driving factors for microbial community assembly. We present Visibiome, a microbiome search engine that can perform exhaustive, phylogeny based similarity search and contextualization of user-provided samples against a comprehensive dataset of 16S rRNA profiles environments, while tackling several computational challenges. In order to scale to high demands, we developed a distributed system that combines web framework technology, task queueing and scheduling, cloud computing and a dedicated database server. To further ensure speed and efficiency, we have deployed Nearest Neighbor search algorithms, capable of sublinear searches in high-dimensional metric spaces in combination with an optimized Earth Mover Distance based implementation of weighted UniFrac. The search also incorporates pairwise (adaptive) rarefaction and optionally, 16S rRNA copy number correction. The result of a query microbiome sample is the contextualization against a comprehensive database of microbiome samples from a diverse range of environments, visualized through a rich set of interactive figures and diagrams, including barchart-based compositional comparisons and ranking of the closest matches in the database. Visibiome is a convenient, scalable and efficient framework to search microbiomes against a comprehensive database of environmental samples. The search engine leverages a popular but computationally expensive, phylogeny based distance metric, while providing numerous advantages over the current state of the art tool.

  14. Geometrical and frequential watermarking scheme using similarities

    Science.gov (United States)

    Bas, Patrick; Chassery, Jean-Marc; Davoine, Franck

    1999-04-01

    Watermarking schemes are more and more robust to classical degradations. The NEC system developed by Cox, using both original and marked images, can detect the mark with a JPEG compression ratio of 30. Nevertheless a very simple geometric attack done by the program Stirmark can remove the watermark. Most of the present watermarking schemes only map a mark on the image without geometric reference and therefore are not robust to geometric transformation. We present a scheme based on the modification of a collage map (issued from a fractal code used in fractal compression). We add a mark introducing similarities in the image. The embedding of the mark is done by selection of points of interest supporting blocks on which similarities are hided. This selection is done by the Stephens-Harris detector. The similarity is embedded locally to be robust to cropping. Contrary to many schemes, the reference mark used for the detection comes from the marked image and thus undergoes geometrical distortions. The detection of the mark is done by searching interest blocks and their similarities. It does not use the original image and the robustness is guaranteed by a key. Our first results show that the similarities-based watermarking is quite robust to geometric transformation such as translations, rotations and cropping.

  15. Pentaquark searches with ALICE

    CERN Document Server

    Bobulska, Dana

    2016-01-01

    In this report we present the results of the data analysis for searching for possible invariant mass signals from pentaquarks in the ALICE data. Analysis was based on filtered data from real p-Pb events at psNN=5.02 TeV collected in 2013. The motivation for this project was the recent discovery of pentaquark states by the LHCb collaboration (c ¯ cuud resonance P+ c ) [1]. The search for similar not yet observed pentaquarks is an interesting research topic [2]. In this analysis we searched for a s ¯ suud pentaquark resonance P+ s and its possible decay channel to f meson and proton. The ALICE detector is well suited for the search of certain candidates thanks to its low material budget and strong PID capabilities. Additionally we might expect the production of such particles in ALICE as in heavy-ion and proton-ion collisions the thermal models describes well the particle yields and ratios [3]. Therefore it is reasonable to expect other species of hadrons, including also possible pentaquarks, to be produced w...

  16. Earthquake Fingerprints: Representing Earthquake Waveforms for Similarity-Based Detection

    Science.gov (United States)

    Bergen, K.; Beroza, G. C.

    2016-12-01

    New earthquake detection methods, such as Fingerprint and Similarity Thresholding (FAST), use fast approximate similarity search to identify similar waveforms in long-duration data without templates (Yoon et al. 2015). These methods have two key components: fingerprint extraction and an efficient search algorithm. Fingerprint extraction converts waveforms into fingerprints, compact signatures that represent short-duration waveforms for identification and search. Earthquakes are detected using an efficient indexing and search scheme, such as locality-sensitive hashing, that identifies similar waveforms in a fingerprint database. The quality of the search results, and thus the earthquake detection results, is strongly dependent on the fingerprinting scheme. Fingerprint extraction should map similar earthquake waveforms to similar waveform fingerprints to ensure a high detection rate, even under additive noise and small distortions. Additionally, fingerprints corresponding to noise intervals should have mutually dissimilar fingerprints to minimize false detections. In this work, we compare the performance of multiple fingerprint extraction approaches for the earthquake waveform similarity search problem. We apply existing audio fingerprinting (used in content-based audio identification systems) and time series indexing techniques and present modified versions that are specifically adapted for seismic data. We also explore data-driven fingerprinting approaches that can take advantage of labeled or unlabeled waveform data. For each fingerprinting approach we measure its ability to identify similar waveforms in a low signal-to-noise setting, and quantify the trade-off between true and false detection rates in the presence of persistent noise sources. We compare the performance using known event waveforms from eight independent stations in the Northern California Seismic Network.

  17. Is visual search really like foraging?

    Science.gov (United States)

    Gilchrist, I D; North, A; Hood, B

    2001-01-01

    The visual-search paradigm provides a controlled and easy to implement experimental situation in which to study the search process. However, little work has been carried out in humans to investigate the extent to which traditional visual-search tasks are similar to more general search or foraging. Here we report results from a task in which search involves walking around a room and leaning down to inspect individual locations. Consistent with more traditional search tasks, search time increases linearly with display size, and the target-present to target-absent search slope is 1:2. However, although rechecking of locations did occur, compared to more traditional search it was relatively rare, suggesting an increased role for memory.

  18. A COMPARISON OF SEMANTIC SIMILARITY MODELS IN EVALUATING CONCEPT SIMILARITY

    Directory of Open Access Journals (Sweden)

    Q. X. Xu

    2012-08-01

    Full Text Available The semantic similarities are important in concept definition, recognition, categorization, interpretation, and integration. Many semantic similarity models have been established to evaluate semantic similarities of objects or/and concepts. To find out the suitability and performance of different models in evaluating concept similarities, we make a comparison of four main types of models in this paper: the geometric model, the feature model, the network model, and the transformational model. Fundamental principles and main characteristics of these models are introduced and compared firstly. Land use and land cover concepts of NLCD92 are employed as examples in the case study. The results demonstrate that correlations between these models are very high for a possible reason that all these models are designed to simulate the similarity judgement of human mind.

  19. Similarity in romantic couples' drinking motivations and drinking behaviors.

    Science.gov (United States)

    Kehayes, Ivy-Lee L; Mackinnon, Sean P; Sherry, Simon B; Leonard, Kenneth E; Stewart, Sherry H

    2017-01-01

    Research suggests that enhancement, conformity, social, coping-with-anxiety, and coping-with-depression drinking motives are linked to specific drinking outcomes in a theoretically expected manner. Social learning theory suggests that people who spend more time together emulate each other's behavior to acquire reinforcing outcomes. The present study sought to integrate drinking motives theory and social learning theory to investigate similarity in drinking behaviors and drinking motives in romantic couples. We hypothesized that couples would be more similar than chance in their drinking behaviors and motives. We also hypothesized that demographics reflecting time around and interactions with romantic partners (e.g., days spent drinking together) would positively correlate with similarity in drinking behaviors and motivations. The present study tested hypotheses in 203 romantic couples. Participants completed a Timeline Follow-Back measure and the Modified Drinking Motives Questionnaire-Revised to track their alcohol use and drinking motives. Similarity profiles were calculated using McCrae's (J Pers Assess. 2008;90:105-109) coefficient of profile agreement, rpa. Couples were more similar in their drinking behavioral and motivational profiles than could be explained by chance. Days spent drinking together and days with face-to-face contact predicted increased similarity in drinking behavior profiles, but not similarity in drinking motives profiles. Results are partially consistent with social learning theory and suggest that social influences within couples could be important intervention targets to prevent escalations in drinking.

  20. Renewing the Respect for Similarity

    Directory of Open Access Journals (Sweden)

    Shimon eEdelman

    2012-07-01

    Full Text Available In psychology, the concept of similarity has traditionally evoked a mixture of respect, stemmingfrom its ubiquity and intuitive appeal, and concern, due to its dependence on the framing of the problemat hand and on its context. We argue for a renewed focus on similarity as an explanatory concept, bysurveying established results and new developments in the theory and methods of similarity-preservingassociative lookup and dimensionality reduction — critical components of many cognitive functions, aswell as of intelligent data management in computer vision. We focus in particular on the growing familyof algorithms that support associative memory by performing hashing that respects local similarity, andon the uses of similarity in representing structured objects and scenes. Insofar as these similarity-basedideas and methods are useful in cognitive modeling and in AI applications, they should be included inthe core conceptual toolkit of computational neuroscience.

  1. Design of transonic airfoil sections using a similarity theory

    Science.gov (United States)

    Nixon, D.

    1978-01-01

    A study of the available methods for transonic airfoil and wing design indicates that the most powerful technique is the numerical optimization procedure. However, the computer time for this method is relatively large because of the amount of computation required in the searches during optimization. The optimization method requires that base and calibration solutions be computed to determine a minimum drag direction. The design space is then computationally searched in this direction; it is these searches that dominate the computation time. A recent similarity theory allows certain transonic flows to be calculated rapidly from the base and calibration solutions. In this paper the application of the similarity theory to design problems is examined with the object of at least partially eliminating the costly searches of the design optimization method. An example of an airfoil design is presented.

  2. Meta Search Engines.

    Science.gov (United States)

    Garman, Nancy

    1999-01-01

    Describes common options and features to consider in evaluating which meta search engine will best meet a searcher's needs. Discusses number and names of engines searched; other sources and specialty engines; search queries; other search options; and results options. (AEF)

  3. Dynamic similarity in erosional processes

    Science.gov (United States)

    Scheidegger, A.E.

    1963-01-01

    A study is made of the dynamic similarity conditions obtaining in a variety of erosional processes. The pertinent equations for each type of process are written in dimensionless form; the similarity conditions can then easily be deduced. The processes treated are: raindrop action, slope evolution and river erosion. ?? 1963 Istituto Geofisico Italiano.

  4. Similarities in functional attributes and nutritional effects of magadi ...

    African Journals Online (AJOL)

    Similarities in functional attributes and nutritional effects of magadi soda and bean debris-ash used in cooking African traditional dishes. ... profile, alkalinity, decreased cooking time and lowered mineral uptake by magadi soda and bean debris-ash explain similar functionality in foods they are added to during cooking.

  5. Renewing the respect for similarity.

    Science.gov (United States)

    Edelman, Shimon; Shahbazi, Reza

    2012-01-01

    In psychology, the concept of similarity has traditionally evoked a mixture of respect, stemming from its ubiquity and intuitive appeal, and concern, due to its dependence on the framing of the problem at hand and on its context. We argue for a renewed focus on similarity as an explanatory concept, by surveying established results and new developments in the theory and methods of similarity-preserving associative lookup and dimensionality reduction-critical components of many cognitive functions, as well as of intelligent data management in computer vision. We focus in particular on the growing family of algorithms that support associative memory by performing hashing that respects local similarity, and on the uses of similarity in representing structured objects and scenes. Insofar as these similarity-based ideas and methods are useful in cognitive modeling and in AI applications, they should be included in the core conceptual toolkit of computational neuroscience. In support of this stance, the present paper (1) offers a discussion of conceptual, mathematical, computational, and empirical aspects of similarity, as applied to the problems of visual object and scene representation, recognition, and interpretation, (2) mentions some key computational problems arising in attempts to put similarity to use, along with their possible solutions, (3) briefly states a previously developed similarity-based framework for visual object representation, the Chorus of Prototypes, along with the empirical support it enjoys, (4) presents new mathematical insights into the effectiveness of this framework, derived from its relationship to locality-sensitive hashing (LSH) and to concomitant statistics, (5) introduces a new model, the Chorus of Relational Descriptors (ChoRD), that extends this framework to scene representation and interpretation, (6) describes its implementation and testing, and finally (7) suggests possible directions in which the present research program can be

  6. Self-similar aftershock rates

    CERN Document Server

    Davidsen, Jörn

    2016-01-01

    In many important systems exhibiting crackling noise --- intermittent avalanche-like relaxation response with power-law and, thus, self-similar distributed event sizes --- the "laws" for the rate of activity after large events are not consistent with the overall self-similar behavior expected on theoretical grounds. This is in particular true for the case of seismicity and a satisfying solution to this paradox has remained outstanding. Here, we propose a generalized description of the aftershock rates which is both self-similar and consistent with all other known self-similar features. Comparing our theoretical predictions with high resolution earthquake data from Southern California we find excellent agreement, providing in particular clear evidence for a unified description of aftershocks and foreshocks. This may offer an improved way of time-dependent seismic hazard assessment and earthquake forecasting.

  7. Method of similarity for cavitation

    Energy Technology Data Exchange (ETDEWEB)

    Espanet, L.; Tekatlian, A.; Barbier, D. [CEA/Cadarache, Dept. d' Etudes des Combustibles (DEC), 13 - Saint-Paul-lez-Durance (France); Gouin, H. [Aix-Marseille-3 Univ., 13 - Marseille (France). Laboratoire de Modelisation en Mecanique et Thermodynamique

    1998-07-01

    The knowledge of possible cavitation in subassembly nozzles of the fast reactor core implies the realization of a fluid dynamic model test. We propose a method of similarity based on the non-dimensionalization of the equation of motion for viscous capillarity fluid issued from the Cahn and Hilliard model. Taking into account the dissolved gas effect, a condition of compatibility is determined. This condition must be respected by the fluid in experiment, along with the scaling between the two similar flows. (author)

  8. Self-Similar Isentropic Implosions

    Energy Technology Data Exchange (ETDEWEB)

    Rodriguez, M.; Amable, L.

    1980-07-01

    The self-similar compression of an isentropic spherical gas pellet Is analyzed for large values of the ratio of the final to initial densities. An asymptotic analysis provides the solution corresponding to a prescribed value of the final density when it is high. In addition an approximate solution is given when the specific heat ratio is not constant. The time evolution of the pressure on the outer surface leading to the self-similar solutions, is calculated for large density ratios. (Author)

  9. Refining search terms for nanotechnology

    Energy Technology Data Exchange (ETDEWEB)

    Porter, Alan L. [Georgia Institute of Technology (United States); Youtie, Jan [Georgia Institute of Technology, Enterprise Innovation Institute (United States)], E-mail: jan.youtie@innovate.gatech.edu; Shapira, Philip [Georgia Institute of Technology (United States); Schoeneck, David J. [Search Technology, Inc. (United States)

    2008-05-15

    The ability to delineate the boundaries of an emerging technology is central to obtaining an understanding of the technology's research paths and commercialization prospects. Nowhere is this more relevant than in the case of nanotechnology (hereafter identified as 'nano') given its current rapid growth and multidisciplinary nature. (Under the rubric of nanotechnology, we also include nanoscience and nanoengineering.) Past efforts have utilized several strategies, including simple term search for the prefix nano, complex lexical and citation-based approaches, and bootstrapping techniques. This research introduces a modularized Boolean approach to defining nanotechnology which has been applied to several research and patenting databases. We explain our approach to downloading and cleaning data, and report initial results. Comparisons of this approach with other nanotechnology search formulations are presented. Implications for search strategy development and profiling of the nanotechnology field are discussed.

  10. Refining search terms for nanotechnology

    Science.gov (United States)

    Porter, Alan L.; Youtie, Jan; Shapira, Philip; Schoeneck, David J.

    2008-05-01

    The ability to delineate the boundaries of an emerging technology is central to obtaining an understanding of the technology's research paths and commercialization prospects. Nowhere is this more relevant than in the case of nanotechnology (hereafter identified as "nano") given its current rapid growth and multidisciplinary nature. (Under the rubric of nanotechnology, we also include nanoscience and nanoengineering.) Past efforts have utilized several strategies, including simple term search for the prefix nano, complex lexical and citation-based approaches, and bootstrapping techniques. This research introduces a modularized Boolean approach to defining nanotechnology which has been applied to several research and patenting databases. We explain our approach to downloading and cleaning data, and report initial results. Comparisons of this approach with other nanotechnology search formulations are presented. Implications for search strategy development and profiling of the nanotechnology field are discussed.

  11. Conversation level syntax similarity metric.

    Science.gov (United States)

    Boghrati, Reihane; Hoover, Joe; Johnson, Kate M; Garten, Justin; Dehghani, Morteza

    2017-07-11

    The syntax and semantics of human language can illuminate many individual psychological differences and important dimensions of social interaction. Accordingly, psychological and psycholinguistic research has begun incorporating sophisticated representations of semantic content to better understand the connection between word choice and psychological processes. In this work we introduce ConversAtion level Syntax SImilarity Metric (CASSIM), a novel method for calculating conversation-level syntax similarity. CASSIM estimates the syntax similarity between conversations by automatically generating syntactical representations of the sentences in conversation, estimating the structural differences between them, and calculating an optimized estimate of the conversation-level syntax similarity. After introducing and explaining this method, we report results from two method validation experiments (Study 1) and conduct a series of analyses with CASSIM to investigate syntax accommodation in social media discourse (Study 2). We run the same experiments using two well-known existing syntactic metrics, LSM and Coh-Metrix, and compare their results to CASSIM. Overall, our results indicate that CASSIM is able to reliably measure syntax similarity and to provide robust evidence of syntax accommodation within social media discourse.

  12. Visualizing multiple word similarity measures.

    Science.gov (United States)

    Kievit-Kylar, Brent; Jones, Michael N

    2012-09-01

    Although many recent advances have taken place in corpus-based tools, the techniques used to guide exploration and evaluation of these systems have advanced little. Typically, the plausibility of a semantic space is explored by sampling the nearest neighbors to a target word and evaluating the neighborhood on the basis of the modeler's intuition. Tools for visualization of these large-scale similarity spaces are nearly nonexistent. We present a new open-source tool to plot and visualize semantic spaces, thereby allowing researchers to rapidly explore patterns in visual data that describe the statistical relations between words. Words are visualized as nodes, and word similarities are shown as directed edges of varying strengths. The "Word-2-Word" visualization environment allows for easy manipulation of graph data to test word similarity measures on their own or in comparisons between multiple similarity metrics. The system contains a large library of statistical relationship models, along with an interface to teach them from various language sources. The modularity of the visualization environment allows for quick insertion of new similarity measures so as to compare new corpus-based metrics against the current state of the art. The software is available at www.indiana.edu/~semantic/word2word/.

  13. ESG: extended similarity group method for automated protein function prediction.

    Science.gov (United States)

    Chitale, Meghana; Hawkins, Troy; Park, Changsoon; Kihara, Daisuke

    2009-07-15

    Importance of accurate automatic protein function prediction is ever increasing in the face of a large number of newly sequenced genomes and proteomics data that are awaiting biological interpretation. Conventional methods have focused on high sequence similarity-based annotation transfer which relies on the concept of homology. However, many cases have been reported that simple transfer of function from top hits of a homology search causes erroneous annotation. New methods are required to handle the sequence similarity in a more robust way to combine together signals from strongly and weakly similar proteins for effectively predicting function for unknown proteins with high reliability. We present the extended similarity group (ESG) method, which performs iterative sequence database searches and annotates a query sequence with Gene Ontology terms. Each annotation is assigned with probability based on its relative similarity score with the multiple-level neighbors in the protein similarity graph. We will depict how the statistical framework of ESG improves the prediction accuracy by iteratively taking into account the neighborhood of query protein in the sequence similarity space. ESG outperforms conventional PSI-BLAST and the protein function prediction (PFP) algorithm. It is found that the iterative search is effective in capturing multiple-domains in a query protein, enabling accurately predicting several functions which originate from different domains. ESG web server is available for automated protein function prediction at http://dragon.bio.purdue.edu/ESG/.

  14. Similarity measures for face recognition

    CERN Document Server

    Vezzetti, Enrico

    2015-01-01

    Face recognition has several applications, including security, such as (authentication and identification of device users and criminal suspects), and in medicine (corrective surgery and diagnosis). Facial recognition programs rely on algorithms that can compare and compute the similarity between two sets of images. This eBook explains some of the similarity measures used in facial recognition systems in a single volume. Readers will learn about various measures including Minkowski distances, Mahalanobis distances, Hansdorff distances, cosine-based distances, among other methods. The book also summarizes errors that may occur in face recognition methods. Computer scientists "facing face" and looking to select and test different methods of computing similarities will benefit from this book. The book is also useful tool for students undertaking computer vision courses.

  15. Kolmogorov's Lagrangian similarity law revisited

    Science.gov (United States)

    Barjona, Manuel; da Silva, Carlos B.

    2017-10-01

    Kolmogorov's similarity turbulence theory in a Lagrangian frame is assessed with new direct numerical simulations of isotropic turbulence with and without hyperviscosity, which attain higher Reynolds numbers than previously available. It is demonstrated that hyperviscous simulations can be used to accurately predict the second order Lagrangian velocity structure function (LVSF-2) in the inertial range, by using an original new procedure. The results strongly support Kolmogorov's Lagrangian similarity assumption and allow the universal constant of LVSF-2 to be computed with a new level of confidence with C0=7.4 ±0.2 .

  16. Revisiting Inter-Genre Similarity

    DEFF Research Database (Denmark)

    Sturm, Bob L.; Gouyon, Fabien

    2013-01-01

    We revisit the idea of ``inter-genre similarity'' (IGS) for machine learning in general, and music genre recognition in particular. We show analytically that the probability of error for IGS is higher than naive Bayes classification with zero-one loss (NB). We show empirically that IGS does...

  17. A study of Consistency in the Selection of Search Terms and Search Concepts: A Case Study in National Taiwan University

    Directory of Open Access Journals (Sweden)

    Mu-hsuan Huang

    2001-12-01

    Full Text Available This article analyzes the consistency in the selection of search terms and search contents of college and graduate students in National Taiwan University when they are using PsycLIT CD-ROM database. 31 students conducted pre-assigned searches, doing 59 searches generating 609 search terms. The study finds the consistency in selection of search terms of first level is 22.14% and second level is 35%. These results are similar with others’ researches. About the consistency in search concepts, no matter the overlaps of searched articles or judge relevant articles are lower than other researches. [Article content in Chinese

  18. Gene expression profiling during murine tooth development

    Directory of Open Access Journals (Sweden)

    Maria A dos Santos silva Landin

    2012-07-01

    Full Text Available The aim of this study was to describe the expression of genes, including ameloblastin (Ambn, amelogenin X chromosome (Amelx and enamelin (Enam during early (pre-secretory tooth development. The expression of these genes has predominantly been studied at post-secretory stages. Deoxyoligonucleotide microarrays were used to study gene expression during development of the murine first molar tooth germ at 24h intervals, starting at the eleventh embryonic day (E11.5 and up to the seventh day after birth (P7. The profile search function of Spotfire software was used to select genes with similar expression profile as the enamel genes (Ambn, Amelx and Enam. Microarray results where validated using real-time Reverse Transcription-Polymerase Chain Reaction (real-time RT-PCR, and translated proteins identified by Western blotting. In situ localisation of the Ambn, Amelx and Enam mRNAs were monitored from E12.5 to E17.5 using deoxyoligonucleotide probes. Bioinformatics analysis was used to associate biological functions with differentially (p ≤0.05 expressed (DE genes.Microarray results showed a total of 4362 genes including Ambn, Amelx and Enam to be significant differentially expressed throughout the time-course. The expression of the three enamel genes was low at pre-natal stages (E11.5-P0 increasing after birth (P1-P7. Profile search lead to isolation of 87 genes with significantly similar expression to the three enamel proteins. The mRNAs expressed in dental epithelium and epithelium derived cells. Although expression of Ambn, Amelx and Enam were lower during early tooth development compared to secretory stages enamel proteins were detectable by Western blotting. Bioinformatic analysis associated the 87 genes with multiple biological functions. Around thirty-five genes were associated with fifteen transcription factors.

  19. Searching for Dark Sectors

    Science.gov (United States)

    Zhong, Yi-Ming

    The existence of Dark Matter suggests the presence of a dark sector, consisting of particles neutral under all Standard Model forces. Various "portals" can connect the dark sector to the Standard Model sector. Two popular examples are the vector portal, which gives rise to a "dark photon" ( A'), and the Higgs portal, which gives rise to a "dark Higgs". Such dark forces appear in many well-motivated extensions of the Standard Model. In some cases, they may resolve discrepancies between experimental data and theoretical predictions, such as the muon anomalous magnetic moment. We show that dark sectors and forces can be constrained from several novel probes in current and future experiments, including mono- photon searches in low-energy positron-electron colliders, rare muon decays in the Mu3e, and exotic Higgs decays at the Large Hadron Collider (LHC). We first investigate the power of low-energy, high-luminosity electron-positron colliders to probe dark sectors with a mass below 10 GeV, which couple to Standard Model particles through a low-mass dark mediator. Dark matter candidates in this mass range are well- motivated and can give rise to distinctive mono-photon signals at B-factories and similar experiments. We use data from an existing mono-photon search by BABAR to place new constraints on this class of models, and give projections for the sensitivity of a similar search at a future B-factory such as Belle II. We find that the sensitivity of such searches are more powerful than searches at other collider or fixed-target facilities for dark-sector mediators and particles with masses between a few hundred MeV and 10 GeV. We compare our results to existing and future direct detection experiments and show that low-energy colliders provide an indispensable and complementary avenue to search for light dark matter. We also find that dark photons with masses 10 MeV - 80 MeV can probed in the rare muon decay process mu+→ e+nue numuA', A '→ e+ e-, in the upcoming Mu

  20. A Semantics-Based Measure of Emoji Similarity

    OpenAIRE

    Wijeratne, Sanjaya; Balasuriya, Lakshika; Sheth, Amit; Doran, Derek

    2017-01-01

    Emoji have grown to become one of the most important forms of communication on the web. With its widespread use, measuring the similarity of emoji has become an important problem for contemporary text processing since it lies at the heart of sentiment analysis, search, and interface design tasks. This paper presents a comprehensive analysis of the semantic similarity of emoji through embedding models that are learned over machine-readable emoji meanings in the EmojiNet knowledge base. Using e...

  1. Efficient k-NN search on vertically decomposed data

    NARCIS (Netherlands)

    A.P. de Vries (Arjen); N. Mamoulis; N.J. Nes (Niels); M.L. Kersten (Martin)

    2002-01-01

    textabstractApplications like multimedia retrieval require efficient support for similarity search on large data collections. Yet, nearest neighbor search is a difficult problem in high dimensional spaces, rendering efficient applications hard to realize: index structures degrade rapidly with

  2. Scaling, similarity, and the fourth paradigm for hydrology

    NARCIS (Netherlands)

    Peters-Lidard, Christa D.; Clark, Martyn; Samaniego, Luis; Verhoest, Niko E.C.; Emmerik, Van Tim; Uijlenhoet, Remko; Achieng, Kevin; Franz, Trenton E.; Woods, Ross A.

    2017-01-01

    In this synthesis paper addressing hydrologic scaling and similarity, we posit that roadblocks in the search for universal laws of hydrology are hindered by our focus on computational simulation (the third paradigm) and assert that it is time for hydrology to embrace a fourth paradigm of

  3. A ligand's view of target similarity

    DEFF Research Database (Denmark)

    Garland, Stephen L; Gloriam, David E

    2011-01-01

    shows with several examples how focusing on the binding site(s) has a clear advantage when it comes to establishing sequence-correlated pharmacological profiles. By organizing and comparing sequence and structural data it is possible to "borrow" SAR from similar targets to increase the speed of lead......-finding and, potentially, to produce ligands for previously intractable receptors. Sequence motifs correlated with ligands can be applied in the design of target-specific focused libraries that are both efficient and cost-effective and should provide increased hit-rates over diversity screening. Furthermore......, in the optimization phase, the binding motif approach offers the possibility to identify quickly the most likely off-target candidates to be chosen for selectivity screening, as well as potentially characterizing those pockets which may best be exploited for improved selectivity....

  4. Similarity measures for protein ensembles

    DEFF Research Database (Denmark)

    Lindorff-Larsen, Kresten; Ferkinghoff-Borg, Jesper

    2009-01-01

    a synthetic example from molecular dynamics simulations. We then apply the algorithms to revisit the problem of ensemble averaging during structure determination of proteins, and find that an ensemble refinement method is able to recover the correct distribution of conformations better than standard single....... However, instead of examining individual conformations it is in many cases more relevant to analyse ensembles of conformations that have been obtained either through experiments or from methods such as molecular dynamics simulations. We here present three approaches that can be used to compare......Analyses of similarities and changes in protein conformation can provide important information regarding protein function and evolution. Many scores, including the commonly used root mean square deviation, have therefore been developed to quantify the similarities of different protein conformations...

  5. Google Ajax Search API

    CERN Document Server

    Fitzgerald, Michael

    2007-01-01

    Use the Google Ajax Search API to integrateweb search, image search, localsearch, and other types of search intoyour web site by embedding a simple, dynamicsearch box to display search resultsin your own web pages using a fewlines of JavaScript. For those who do not want to write code,the search wizards and solutions builtwith the Google Ajax Search API generatecode to accomplish common taskslike adding local search results to a GoogleMaps API mashup, adding videosearch thumbnails to your web site, oradding a news reel with the latest up todate stories to your blog. More advanced users can

  6. Profiles and Context for Structured Text Retrieval

    DEFF Research Database (Denmark)

    Koolen, Marijn; Bogers, Toine

    2017-01-01

    The combination of structured information retrieval with user profile information represents the scenario where systems search with an explicit statement of the information need—a search query—as well as a profile of a user, which can contain information about previous interactions, search history......, user demographics, or other relevant information about the user’s preferences. The relation between the profile and the information need is implicit and may contain many irrelevant signals. The task of the system then is to model both the current information need and the background user preferences...

  7. Haptic Influence on Visual Search

    Directory of Open Access Journals (Sweden)

    Marcia Grabowecky

    2011-10-01

    Full Text Available Information from different sensory modalities influences perception and attention in tasks such as visual search. We have previously reported identity-based crossmodal influences of audition on visual search (Iordanescu, Guzman-Martinez, Grabowecky, & Suzuki, 2008; Iordanescu, Grabowecky, Franconeri, Theeuwes, and Suzuki, 2010; Iordanescu, Grabowecky and Suzuki, 2011. Here, we extend those results and demonstrate a novel crossmodal interaction between haptic shape information and visual attention. Manually-explored, but unseen, shapes facilitated visual search for similarly-shaped objects. This effect manifests as a reduction in both overall search times and initial saccade latencies when the haptic shape (eg, a sphere is consistent with a visual target (eg, an orange compared to when it is inconsistent with a visual target (eg, a hockey puck]. This haptic-visual interaction occurred even though the manually held shapes were not predictive of the visual target's shape or location, suggesting that the interaction occurs automatically. Furthermore, when the haptic shape was consistent with a distracter in the visual search array (instead of with the target, initial saccades toward the target were disrupted. Together, these results demonstrate a robust shape-specific haptic influence on visual search.

  8. OpenSearch technology for geospatial resources discovery

    Science.gov (United States)

    Papeschi, Fabrizio; Enrico, Boldrini; Mazzetti, Paolo

    2010-05-01

    In 2005, the term Web 2.0 has been coined by Tim O'Reilly to describe a quickly growing set of Web-based applications that share a common philosophy of "mutually maximizing collective intelligence and added value for each participant by formalized and dynamic information sharing". Around this same period, OpenSearch a new Web 2.0 technology, was developed. More properly, OpenSearch is a collection of technologies that allow publishing of search results in a format suitable for syndication and aggregation. It is a way for websites and search engines to publish search results in a standard and accessible format. Due to its strong impact on the way the Web is perceived by users and also due its relevance for businesses, Web 2.0 has attracted the attention of both mass media and the scientific community. This explosive growth in popularity of Web 2.0 technologies like OpenSearch, and practical applications of Service Oriented Architecture (SOA) resulted in an increased interest in similarities, convergence, and a potential synergy of these two concepts. SOA is considered as the philosophy of encapsulating application logic in services with a uniformly defined interface and making these publicly available via discovery mechanisms. Service consumers may then retrieve these services, compose and use them according to their current needs. A great degree of similarity between SOA and Web 2.0 may be leading to a convergence between the two paradigms. They also expose divergent elements, such as the Web 2.0 support to the human interaction in opposition to the typical SOA machine-to-machine interaction. According to these considerations, the Geospatial Information (GI) domain, is also moving first steps towards a new approach of data publishing and discovering, in particular taking advantage of the OpenSearch technology. A specific GI niche is represented by the OGC Catalog Service for Web (CSW) that is part of the OGC Web Services (OWS) specifications suite, which provides a

  9. Search Engine For Ebook Portal

    Directory of Open Access Journals (Sweden)

    Prashant Kanade

    2017-05-01

    Full Text Available The purpose of this paper is to establish the textual analytics involved in developing a search engine for an ebook portal. We have extracted our dataset from Project Gutenberg using a robot harvester. Textual Analytics is used for efficient search retrieval. The entire dataset is represented using Vector Space Model where each document is a vector in the vector space. Further for computational purposes we represent our dataset in the form of a Term Frequency- Inverse Document Frequency tf-idf matrix. The first step involves obtaining the most coherent sequence of words of the search query entered. The entered query is processed using Front End algorithms this includes-Spell Checker Text Segmentation and Language Modeling. Back End processing includes Similarity Modeling Clustering Indexing and Retrieval. The relationship between documents and words is established using cosine similarity measured between the documents and words in Vector Space. Clustering performed is used to suggest books that are similar to the search query entered by the user. Lastly the Lucene Based Elasticsearch engine is used for indexing on the documents. This allows faster retrieval of data. Elasticsearch returns a dictionary and creates a tf-idf matrix. The processed query is compared with the dictionary obtained and tf-idf matrix is used to calculate the score for each match to give most relevant result.

  10. How doctors search

    DEFF Research Database (Denmark)

    Lykke, Marianne; Price, Susan; Delcambre, Lois

    2012-01-01

    Professional, workplace searching is different from general searching, because it is typically limited to specific facets and targeted to a single answer. We have developed the semantic component (SC) model, which is a search feature that allows searchers to structure and specify the search...... to context-specific aspects of the main topic of the documents. We have tested the model in an interactive searching study with family doctors with the purpose to explore doctors’ querying behaviour, how they applied the means for specifying a search, and how these features contributed to the search outcome....... In general, the doctors were capable of exploiting system features and search tactics during the searching. Most searchers produced well-structured queries that contained appropriate search facets. When searches failed it was not due to query structure or query length. Failures were mostly caused by the well...

  11. Landscape similarity, retrieval, and machine mapping of physiographic units

    Science.gov (United States)

    Jasiewicz, Jaroslaw; Netzel, Pawel; Stepinski, Tomasz F.

    2014-09-01

    We introduce landscape similarity - a numerical measure that assesses affinity between two landscapes on the basis of similarity between the patterns of their constituent landform elements. Such a similarity function provides core technology for a landscape search engine - an algorithm that parses the topography of a study area and finds all places with landscapes broadly similar to a landscape template. A landscape search can yield answers to a query in real time, enabling a highly effective means to explore large topographic datasets. In turn, a landscape search facilitates auto-mapping of physiographic units within a study area. The country of Poland serves as a test bed for these novel concepts. The topography of Poland is given by a 30 m resolution DEM. The geomorphons method is applied to this DEM to classify the topography into ten common types of landform elements. A local landscape is represented by a square tile cut out of a map of landform elements. A histogram of cell-pair features is used to succinctly encode the composition and texture of a pattern within a local landscape. The affinity between two local landscapes is assessed using the Wave-Hedges similarity function applied to the two corresponding histograms. For a landscape search the study area is organized into a lattice of local landscapes. During the search the algorithm calculates the similarity between each local landscape and a given query. Our landscape search for Poland is implemented as a GeoWeb application called TerraEx-Pl and is available at http://sil.uc.edu/. Given a sample, or a number of samples, from a target physiographic unit the landscape search delineates this unit using the principles of supervised machine learning. Repeating this procedure for all units yields a complete physiographic map. The application of this methodology to topographic data of Poland results in the delineation of nine physiographic units. The resultant map bears a close resemblance to a conventional

  12. Data Profiling

    OpenAIRE

    Hladíková, Radka

    2010-01-01

    Title: Data Profiling Author: Radka Hladíková Department: Department of Software Engineering Supervisor: Ing. Vladimír Kyjonka Supervisor's e-mail address: Abstract: This thesis puts mind on problems with data quality and data profiling. This Work analyses and summarizes problems of data quality, data defects, process of data quality, data quality assessment and data profiling. The main topic is data profiling as a process of researching data available in existing...

  13. Code Tagging and Similarity-based Retrieval with myCBR

    Science.gov (United States)

    Roth-Berghofer, Thomas R.; Bahls, Daniel

    This paper describes the code tagging plug-in coTag, which allows annotating code snippets in the integrated development environment eclipse. coTag offers an easy-to-use interface for tagging and searching. Using the similarity-based search engine of the open-source tool myCBR, the user can search not only for exactly the same tags as offered by other code tagging extensions, but also for similar tags and, thus, for similar code snippets. coTag provides means for context-based adding of new as well as changing of existing similarity links between tags, supported by myCBR's explanation component.

  14. Dark matter searches

    Science.gov (United States)

    Bettini, Alessandro

    These lectures begin with a brief survey of the astrophysical and cosmological evidence for dark matter. We then consider the three principal theoretically motivated types of dark matter, sterile neutrinos, axions and SUSY WIMPs. In chapter 4 we discuss the motivations for the so-called neutrino minimal standard model, nuMSM, an extension of the SM with three sterile neutrinos with masses similar to the charged fermions. In chapter 5 we briefly recall the strong CP problem of the SM and the solution proposed by Peccei and Quinn leading to the prediction of axions and of their characteristics. We then discuss the experimental status and perspectives. In chapter 6 we assume that the reader to be acquainted with the theoretical motivations for SUSY and move directly to the direct search for dark matter and the description of the principal detector techniques: scintillators, noble fluids and bolometers. We conclude with an outlook on the future perspectives.

  15. submitter Searches at LEP

    CERN Document Server

    Kawagoe, Kiyotomo

    2001-01-01

    Searches for new particles and new physics were extensively performed at LEP. Although no evidence for new particle/physics was discovered, the null results set very stringent limits to theories beyond the standard model. In this paper, searches at LEP and anomalies observed in the searches are presented. Future prospect of searches at the new energy frontier machines is also discussed.

  16. Web Search Engines

    OpenAIRE

    Rajashekar, TB

    1998-01-01

    The World Wide Web is emerging as an all-in-one information source. Tools for searching Web-based information include search engines, subject directories and meta search tools. We take a look at key features of these tools and suggest practical hints for effective Web searching.

  17. Sound Search Engine Concept

    DEFF Research Database (Denmark)

    2006-01-01

    Sound search is provided by the major search engines, however, indexing is text based, not sound based. We will establish a dedicated sound search services with based on sound feature indexing. The current demo shows the concept of the sound search engine. The first engine will be realased June...

  18. Search for Pentaquarks at BABAR

    CERN Document Server

    Goetzen, K

    2007-01-01

    The results of searches for the strange pentaquark states, Theta(1540)+, Xi(1860)-- and Xi(1860)0 in data recorded by the BABAR experiment are presented. We search for these three states inclusively in 123.4 fb^-1 of e+ e- annihilation data produced at the PEP-II asymmetric storage rings; we find no evidence for their production in any physics process, and set limits on their production rates that are well below the measured rates for conventional baryons. We also search for Theta(1540)+ produced in interactions of electrons or hadrons in the material of the inner part of the detector. No evidence for this state is found in a sample with much higher statistics than similar electroproduction experiments that claim a signal.

  19. In Silico target fishing: addressing a “Big Data” problem by ligand-based similarity rankings with data fusion

    Science.gov (United States)

    2014-01-01

    Background Ligand-based in silico target fishing can be used to identify the potential interacting target of bioactive ligands, which is useful for understanding the polypharmacology and safety profile of existing drugs. The underlying principle of the approach is that known bioactive ligands can be used as reference to predict the targets for a new compound. Results We tested a pipeline enabling large-scale target fishing and drug repositioning, based on simple fingerprint similarity rankings with data fusion. A large library containing 533 drug relevant targets with 179,807 active ligands was compiled, where each target was defined by its ligand set. For a given query molecule, its target profile is generated by similarity searching against the ligand sets assigned to each target, for which individual searches utilizing multiple reference structures are then fused into a single ranking list representing the potential target interaction profile of the query compound. The proposed approach was validated by 10-fold cross validation and two external tests using data from DrugBank and Therapeutic Target Database (TTD). The use of the approach was further demonstrated with some examples concerning the drug repositioning and drug side-effects prediction. The promising results suggest that the proposed method is useful for not only finding promiscuous drugs for their new usages, but also predicting some important toxic liabilities. Conclusions With the rapid increasing volume and diversity of data concerning drug related targets and their ligands, the simple ligand-based target fishing approach would play an important role in assisting future drug design and discovery. PMID:24976868

  20. Pedogenesis of two lithologically similar soils under vegetation of ...

    African Journals Online (AJOL)

    Pedogenesis of two lithologically similar soils under vegetation of contrasting features in Ohaji, south-eastern Nigeria. ... The results were then subjected to statistical analysis using the Statistical Analytical System. The results showed that soil profiles were deep (200 cm) and well drained. The soils were darker and browner ...

  1. The safety profile of the fluoroquinolones.

    Science.gov (United States)

    Bertino, J; Fish, D

    2000-07-01

    Premarketing trials showed the fluoroquinolone agents to have a favorable side-effect profile, with treatment-related adverse events comprising gastrointestinal, central nervous system, and dermatologic effects that were generally mild and reversible on cessation of treatment. However, postmarketing surveillance studies have identified severe adverse events, including severe anaphylaxis, QTc-interval prolongation, and potential cardiotoxicity, associated with 3 quinolone agents that either resulted in the removal of the agent from the market (temafloxacin and grepafloxacin) or significantly restricted its use due to substantial mortality and morbidity associated with liver toxicity (trovafloxacin). To date, there have been no such significant adverse events associated with the older fluoroquinolone agents, including ciprofloxacin, ofloxacin, norfloxacin, and levofloxacin. However, there are fewer data from postmarketing surveillance studies on the most recently approved agents, such as moxifloxacin and gatifloxacin, or agents awaiting approval, such as gemifloxacin. This paper examines safety data from the premarketing trials and postmarketing surveillance studies of fluoroquinolones available in the United States. A MEDLINE search was performed to identify all English-language studies published since 1980 concerning the safety profiles of the fluoroquinolones. The fluoroquinolone antibacterial agents offer broad-spectrum therapy in patients with a variety of infections. Given similar spectra of activity, the choice between quinolones may be based on differences in efficacy and safety or tolerability profiles. Most drug reactions involving these agents are minor and reversible on discontinuing treatment, but adverse effects can be associated with significant mortality and morbidity, as was seen in the case of trovafloxacin and temafloxacin.

  2. Interpreting metabolomic profiles using unbiased pathway models.

    Directory of Open Access Journals (Sweden)

    Rahul C Deo

    2010-02-01

    Full Text Available Human disease is heterogeneous, with similar disease phenotypes resulting from distinct combinations of genetic and environmental factors. Small-molecule profiling can address disease heterogeneity by evaluating the underlying biologic state of individuals through non-invasive interrogation of plasma metabolite levels. We analyzed metabolite profiles from an oral glucose tolerance test (OGTT in 50 individuals, 25 with normal (NGT and 25 with impaired glucose tolerance (IGT. Our focus was to elucidate underlying biologic processes. Although we initially found little overlap between changed metabolites and preconceived definitions of metabolic pathways, the use of unbiased network approaches identified significant concerted changes. Specifically, we derived a metabolic network with edges drawn between reactant and product nodes in individual reactions and between all substrates of individual enzymes and transporters. We searched for "active modules"--regions of the metabolic network enriched for changes in metabolite levels. Active modules identified relationships among changed metabolites and highlighted the importance of specific solute carriers in metabolite profiles. Furthermore, hierarchical clustering and principal component analysis demonstrated that changed metabolites in OGTT naturally grouped according to the activities of the System A and L amino acid transporters, the osmolyte carrier SLC6A12, and the mitochondrial aspartate-glutamate transporter SLC25A13. Comparison between NGT and IGT groups supported blunted glucose- and/or insulin-stimulated activities in the IGT group. Using unbiased pathway models, we offer evidence supporting the important role of solute carriers in the physiologic response to glucose challenge and conclude that carrier activities are reflected in individual metabolite profiles of perturbation experiments. Given the involvement of transporters in human disease, metabolite profiling may contribute to improved

  3. HangOut: generating clean PSI-BLAST profiles for domains with long insertions.

    Science.gov (United States)

    Kim, Bong-Hyun; Cong, Qian; Grishin, Nick V

    2010-06-15

    Profile-based similarity search is an essential step in structure-function studies of proteins. However, inclusion of non-homologous sequence segments into a profile causes its corruption and results in false positives. Profile corruption is common in multidomain proteins, and single domains with long insertions are a significant source of errors. We developed a procedure (HangOut) that, for a single domain with specified insertion position, cleans erroneously extended PSI-BLAST alignments to generate better profiles. HangOut is implemented in Python 2.3 and runs on all Unix-compatible platforms. The source code is available under the GNU GPL license at http://prodata.swmed.edu/HangOut/. Supplementary data are available at Bioinformatics online.

  4. Application of shape-based similarity query for aerodynamic optimization of wind tunnel primary nozzle

    Directory of Open Access Journals (Sweden)

    Kolář Jan

    2012-04-01

    Full Text Available The aerodynamic shape optimization of the supersonic flat nozzle is the aim of proposed paper. The nozzle discussed, is applied as a primary nozzle of the inlet part of supersonic wind tunnel. Supersonic nozzles of the measure area inlet parts need to guarantee several requirements of flow properties and quality. Mach number and minimal differences between real and required velocity and turbulence profiles at the nozzle exit are the most important parameters to meet. The aerodynamic shape optimization of the flat 2D nozzle in CFD is employed to reach as uniform exit velocity profile as possible, with the mean Mach number 1.4. Optimization process does not use any of standard routines of global or local optimum searching. Instead, newly formed routine, which exploits shape-based oriented sequence of nozzles, is used to research within whole discretized parametric space. The movement within optimization process is not driven by gradient or evolutionary too, instead, the Path of Minimal Shape Deformation is followed. Dynamic mesh approach is used to deform the shape and mesh from the actual nozzle to the subsequent one. Dynamic deformation of mesh allows to speed up whole converging process as an initialization of flow at the newly formed mesh is based on afore-computed shape. Shape-based similarity query in field of supersonic nozzles is discussed and applied. Evolutionary technique with genetic algorithm is used to search for minimal deformational path. As a result, the best variant from the set of solved shapes is analyzed at the base of momentum coefficient and desired Mach number at the nozzle exit.

  5. Application of shape-based similarity query for aerodynamic optimization of wind tunnel primary nozzle

    Science.gov (United States)

    Kolář, Jan

    2012-04-01

    The aerodynamic shape optimization of the supersonic flat nozzle is the aim of proposed paper. The nozzle discussed, is applied as a primary nozzle of the inlet part of supersonic wind tunnel. Supersonic nozzles of the measure area inlet parts need to guarantee several requirements of flow properties and quality. Mach number and minimal differences between real and required velocity and turbulence profiles at the nozzle exit are the most important parameters to meet. The aerodynamic shape optimization of the flat 2D nozzle in CFD is employed to reach as uniform exit velocity profile as possible, with the mean Mach number 1.4. Optimization process does not use any of standard routines of global or local optimum searching. Instead, newly formed routine, which exploits shape-based oriented sequence of nozzles, is used to research within whole discretized parametric space. The movement within optimization process is not driven by gradient or evolutionary too, instead, the Path of Minimal Shape Deformation is followed. Dynamic mesh approach is used to deform the shape and mesh from the actual nozzle to the subsequent one. Dynamic deformation of mesh allows to speed up whole converging process as an initialization of flow at the newly formed mesh is based on afore-computed shape. Shape-based similarity query in field of supersonic nozzles is discussed and applied. Evolutionary technique with genetic algorithm is used to search for minimal deformational path. As a result, the best variant from the set of solved shapes is analyzed at the base of momentum coefficient and desired Mach number at the nozzle exit.

  6. [Differences and similarities between the competencies of a nursing supervisor and an advanced clinical nurse specialist].

    Science.gov (United States)

    del Barrio-Linares, M; Pumar-Méndez, M J

    2015-01-01

    With the aim of contributing to the development of a more specific professional regulation, the present study was to identify differences and similarities between the competencies of the nursing supervisor and clinical nurse specialist in an intensive care unit. A critical analysis of the literature published between 2003 and 2013 was conducted, identified through systematic searches in electronic databases, health management and practitioner journals and reference lists of the 17 items included. «Management and administration» and «direct clinical practice» were identified as specific competencies of nursing supervisor and clinical nurse specialist respectively. «Collaboration», «leadership» and «research» emerged as competencies shared by both profiles, but with different a operationalization way of conducting it. These findings imply that regulation, education and implementation of these profiles must address their specific skills as the distinctive approach taken in operationalizing shared. Copyright © 2014 Elsevier España, S.L.U. y SEEIUC. All rights reserved.

  7. Alaska, Gulf spills share similarities

    Energy Technology Data Exchange (ETDEWEB)

    Usher, D. (Marine Pollution Control Co., Detroit, MI (United States))

    The accidental Exxon Valdez oil spill in Alaska and the deliberate dumping of crude oil into the Persian Gulf as a tactic of war contain both glaring differences and surprising similarities. Public reaction and public response was much greater to the Exxon Valdez spill in pristine Prince William Sound than to the war-related tragedy in the Persian Gulf. More than 12,000 workers helped in the Alaskan cleanup; only 350 have been involved in Kuwait. But in both instances, environmental damages appear to be less than anticipated. Natures highly effective self-cleansing action is primarily responsible for minimizing the damages. One positive action growing out of the two incidents is increased international cooperation and participation in oil-spill clean-up efforts. In 1990, in the aftermath of the Exxon Valdez spill, 94 nations signed an international accord on cooperation in future spills. The spills can be historic environmental landmarks leading to creation of more sophisticated response systems worldwide.

  8. To Search or Not to Search!

    Science.gov (United States)

    Taylor, J. Rodney

    Constitutional guarantees, as provided by the Bill of Rights, are enjoyed by all citizens. This principle applies no less to students with respect to their college or university domicile. Case law on this subject suggests that three questions must be answered to determine the reasonableness of residence searching: (1) by whom the search is…

  9. Visual search of illusory contours: an attempt of automatization

    Directory of Open Access Journals (Sweden)

    Gvozdenović Vasilije

    2008-01-01

    Full Text Available Laboratory of Experimental Psychology, University of Belgrade Recent research, which was mostly focused on assessing the types of visual search of illusory contours, showed that visual search is dependent on factors like target configuration and task type. Some experimental research supports the theory of parallel search while other research supports the theory of serial search of illusory contours. The inconsistency is most likely due to the fact that various types of illusory contour configurations were used in set creation. Up to this point, our research indicated that the serial search is used in most cases. Some exceptions of search type have been proven in some modification of task type but nevertheless the search profile remained serial. In this article, we are reporting on two visual search experiments. The first experiment was an investigation of a specific feature of a Kanisza type illusory triangle, orientation. The validity of the profile defined in the first experiment was tested in our second experiment with an attempt to automatize the visual search by the multiplication of the initial experimental trials. Our results confirmed that, regardless of the number of experimental trials, the visual search profile remains serial.

  10. Short guide to SDI profiling at ORNL

    Energy Technology Data Exchange (ETDEWEB)

    Pomerance, H.S.

    1976-06-01

    ORNL has machine-searchable data bases that correspond to printed indexes and abstracts. This guide describes the peculiarities of those several data bases and the conventions of the ORNL search system so that users can write their own queries or search profiles and can interpret the part of the output that is encoded.

  11. Unsupervised User Similarity Mining in GSM Sensor Networks

    Directory of Open Access Journals (Sweden)

    Shafqat Ali Shad

    2013-01-01

    Full Text Available Mobility data has attracted the researchers for the past few years because of its rich context and spatiotemporal nature, where this information can be used for potential applications like early warning system, route prediction, traffic management, advertisement, social networking, and community finding. All the mentioned applications are based on mobility profile building and user trend analysis, where mobility profile building is done through significant places extraction, user’s actual movement prediction, and context awareness. However, significant places extraction and user’s actual movement prediction for mobility profile building are a trivial task. In this paper, we present the user similarity mining-based methodology through user mobility profile building by using the semantic tagging information provided by user and basic GSM network architecture properties based on unsupervised clustering approach. As the mobility information is in low-level raw form, our proposed methodology successfully converts it to a high-level meaningful information by using the cell-Id location information rather than previously used location capturing methods like GPS, Infrared, and Wifi for profile mining and user similarity mining.

  12. Unsupervised user similarity mining in GSM sensor networks.

    Science.gov (United States)

    Shad, Shafqat Ali; Chen, Enhong

    2013-01-01

    Mobility data has attracted the researchers for the past few years because of its rich context and spatiotemporal nature, where this information can be used for potential applications like early warning system, route prediction, traffic management, advertisement, social networking, and community finding. All the mentioned applications are based on mobility profile building and user trend analysis, where mobility profile building is done through significant places extraction, user's actual movement prediction, and context awareness. However, significant places extraction and user's actual movement prediction for mobility profile building are a trivial task. In this paper, we present the user similarity mining-based methodology through user mobility profile building by using the semantic tagging information provided by user and basic GSM network architecture properties based on unsupervised clustering approach. As the mobility information is in low-level raw form, our proposed methodology successfully converts it to a high-level meaningful information by using the cell-Id location information rather than previously used location capturing methods like GPS, Infrared, and Wifi for profile mining and user similarity mining.

  13. Dynamic Search and Working Memory in Social Recall

    Science.gov (United States)

    Hills, Thomas T.; Pachur, Thorsten

    2012-01-01

    What are the mechanisms underlying search in social memory (e.g., remembering the people one knows)? Do the search mechanisms involve dynamic local-to-global transitions similar to semantic search, and are these transitions governed by the general control of attention, associated with working memory span? To find out, we asked participants to…

  14. Prevalence of stress references on college freshmen Facebook profiles.

    Science.gov (United States)

    Egan, Katie G; Moreno, Megan A

    2011-10-01

    Stress is common among college students and associated with adverse health outcomes. This study used the social networking Web site Facebook to identify self-reported stress and associated conditions among college students. Public Facebook profiles of undergraduate freshman at a large Midwestern State University (n = 300) were identified using a Facebook search. Content analysis of Facebook profiles included demographic information and displayed references to stress, weight concerns, depressive symptoms, and alcohol. The mean reported age was 18.4 years, and the majority of profile owners were female (62%). Stress references were displayed on 37% of the profiles, weight concerns on 6%, depressive symptoms on 24%, and alcohol on 73%. The display of stress references was associated with female sex (odds ratio [OR], 2.81; confidence interval [CI], 1.7-4.7), weight concerns (OR, 5.36; CI, 1.87-15.34), and depressive symptoms (OR, 2.7; CI, 1.57-4.63). No associations were found between stress and alcohol references. College freshmen frequently display references to stress on Facebook profiles with prevalence rates similar to self-reported national survey data. Findings suggest a positive association between referencing stress and both weight concerns and depressive symptoms. Facebook may be a useful venue to identify students at risk for stress-related conditions and to disseminate information about campus resources to these students.

  15. Large Neighborhood Search

    DEFF Research Database (Denmark)

    Pisinger, David; Røpke, Stefan

    2010-01-01

    Heuristics based on large neighborhood search have recently shown outstanding results in solving various transportation and scheduling problems. Large neighborhood search methods explore a complex neighborhood by use of heuristics. Using large neighborhoods makes it possible to find better candid...

  16. New particle searches

    CERN Document Server

    Ruhlmann-Kleider, V

    2000-01-01

    This review presents recent results on new particle searches achieved at Tevatron, Hera and LEP. After a brief outline of the searches on exotic particles, results on supersymmetric particles and Higgs bosons are detailed. Near future prospects are also given.

  17. Risk Profile May Affect Search Process but Not Results. A review of: McKibbon, K. Ann, Douglas B. Fridsma, and Rebecca S. Crowley. “How Primary Care Physicians’ Attitudes Toward Risk and Uncertainty Affect Their Use of Electronic Information Resources.” Journal of the Medical Library Association 95.2 (2007: 138‐46, e49‐50. 10 Nov. 2007 .

    Directory of Open Access Journals (Sweden)

    Gale G. Hannigan

    2007-12-01

    Full Text Available Objective – To compare the use, in terms of process and outcomes, of electronic information resources by primary care physicians with different risk profiles and comfort with uncertainty.Design – Survey, and observation using “think‐aloud” method.Setting – Physicians’ offices.Subjects – Canadian and U.S. primary carephysicians who report seeing patients in clinic settings.Methods – Volunteers were recruited from personal contacts and the list of physicianswho rate current studies for the McMaster Online Rating of Evidence (MORE project. Physicians completed the Pearson scale to measure attitude toward risk and the Gerrity scale to measure comfort with uncertainty, and those who scored at the extremes of each of these two scales were included in the study (n=25, resulting in four groups (risk‐seeking, risk‐avoiding, uncertainty‐stressed, uncertainty unstressed. One researcher observed each of these physicians in their offices for an hour during which they completed questionnaires about their computer skills and familiarity with resources, answered multiple‐choice clinical questions, and indicated level of certainty with regard to those answers (scale of 0 to 100%. Physicians also chose two of the clinical questions to answer using their own resources. The think‐aloud method was employed, and transcripts were coded and analyzed.Main results – The study analysis included two comparisons: risk‐seeking (11 subjects versus risk‐avoiding (11 subjects physicians,and uncertainty‐stressed (11 subjects versus uncertainty‐unstressed (10 subjects physicians. Most physicians were included in both sets of analyses. The researchers found no association of risk attitude and uncertainty stress with computer skills nor with familiarity and use of specific information resources (Internet, MEDLINE, PIER, Clinical Evidence, and UpToDate. No differences were found for the following outcomes: time spent searching, answers correct

  18. Analysis of newly established EST databases reveals similarities between heart regeneration in newt and fish

    Directory of Open Access Journals (Sweden)

    Weis Patrick

    2010-01-01

    Full Text Available Abstract Background The newt Notophthalmus viridescens possesses the remarkable ability to respond to cardiac damage by formation of new myocardial tissue. Surprisingly little is known about changes in gene activities that occur during the course of regeneration. To begin to decipher the molecular processes, that underlie restoration of functional cardiac tissue, we generated an EST database from regenerating newt hearts and compared the transcriptional profile of selected candidates with genes deregulated during zebrafish heart regeneration. Results A cDNA library of 100,000 cDNA clones was generated from newt hearts 14 days after ventricular injury. Sequencing of 11520 cDNA clones resulted in 2894 assembled contigs. BLAST searches revealed 1695 sequences with potential homology to sequences from the NCBI database. BLAST searches to TrEMBL and Swiss-Prot databases assigned 1116 proteins to Gene Ontology terms. We also identified a relatively large set of 174 ORFs, which are likely to be unique for urodele amphibians. Expression analysis of newt-zebrafish homologues confirmed the deregulation of selected genes during heart regeneration. Sequences, BLAST results and GO annotations were visualized in a relational web based database followed by grouping of identified proteins into clusters of GO Terms. Comparison of data from regenerating zebrafish hearts identified biological processes, which were uniformly overrepresented during cardiac regeneration in newt and zebrafish. Conclusion We concluded that heart regeneration in newts and zebrafish led to the activation of similar sets of genes, which suggests that heart regeneration in both species might follow similar principles. The design of the newly established newt EST database allows identification of molecular pathways important for heart regeneration.

  19. Integrated vs. Federated Search

    DEFF Research Database (Denmark)

    Løvschall, Kasper

    2009-01-01

    Oplæg om forskelle og ligheder mellem integrated og federated search i bibliotekskontekst. Holdt ved temadag om "Integrated Search - samsøgning i alle kilder" på Danmarks Biblioteksskole den 22. januar 2009.......Oplæg om forskelle og ligheder mellem integrated og federated search i bibliotekskontekst. Holdt ved temadag om "Integrated Search - samsøgning i alle kilder" på Danmarks Biblioteksskole den 22. januar 2009....

  20. Searching with Kids.

    Science.gov (United States)

    Valenza, Joyce Kasman

    2000-01-01

    Discusses student search tools for the World Wide Web and describes and compares seven that are appropriate for pre-kindergarten through secondary school. Highlights include access; age appropriateness; directories versus full search engines; filters; limits of some tools; and adult-oriented search tools that can be limited for students' access.…

  1. Measurement and Improvement of Subject Specialists Performance Searching Chemical Abstracts Online as Available on SDC Search Systems.

    Science.gov (United States)

    Copeland, Richard F.; And Others

    The first phase of a project to design a prompting system to help semi-experienced end users to search Chemical Abstracts online, this study focused on the differences and similarities in the search approaches used by experienced users and those with less expertise. Four online searches on topics solicited from chemistry professors in small…

  2. Using concept similarity in cross ontology for adaptive e-Learning systems

    Directory of Open Access Journals (Sweden)

    B. Saleena

    2015-01-01

    Full Text Available e-Learning is one of the most preferred media of learning by the learners. The learners search the web to gather knowledge about a particular topic from the information in the repositories. Retrieval of relevant materials from a domain can be easily implemented if the information is organized and related in some way. Ontologies are a key concept that helps us to relate information for providing the more relevant lessons to the learner. This paper proposes an adaptive e-Learning system, which generates a user specific e-Learning content by comparing the concepts with more than one system using similarity measures. A cross ontology measure is defined, which consists of fuzzy domain ontology as the primary ontology and the domain expert’s ontology as the secondary ontology, for the comparison process. A personalized document is provided to the user with a user profile, which includes the data obtained from the processing of the proposed method under a User score, which is obtained through the user evaluation. The results of the proposed e-Learning system under the designed cross ontology similarity measure show a significant increase in performance and accuracy under different conditions. The assessment of the comparative analysis, showed the difference in performance of our proposed method over other methods. Based on the assessment results it is proved that the proposed approach is effective over other methods.

  3. FACT: Functional annotation transfer between proteins with similar feature architectures

    Directory of Open Access Journals (Sweden)

    Koestler Tina

    2010-08-01

    Full Text Available Abstract Background The increasing number of sequenced genomes provides the basis for exploring the genetic and functional diversity within the tree of life. Only a tiny fraction of the encoded proteins undergoes a thorough experimental characterization. For the remainder, bioinformatics annotation tools are the only means to infer their function. Exploiting significant sequence similarities to already characterized proteins, commonly taken as evidence for homology, is the prevalent method to deduce functional equivalence. Such methods fail when homologs are too diverged, or when they have assumed a different function. Finally, due to convergent evolution, functional equivalence is not necessarily linked to common ancestry. Therefore complementary approaches are required to identify functional equivalents. Results We present the Feature Architecture Comparison Tool http://www.cibiv.at/FACT to search for functionally equivalent proteins. FACT uses the similarity between feature architectures of two proteins, i.e., the arrangements of functional domains, secondary structure elements and compositional properties, as a proxy for their functional equivalence. A scoring function measures feature architecture similarities, which enables searching for functional equivalents in entire proteomes. Our evaluation of 9,570 EC classified enzymes revealed that FACT, using the full feature, set outperformed the existing architecture-based approaches by identifying significantly more functional equivalents as highest scoring proteins. We show that FACT can identify functional equivalents that share no significant sequence similarity. However, when the highest scoring protein of FACT is also the protein with the highest local sequence similarity, it is in 99% of the cases functionally equivalent to the query. We demonstrate the versatility of FACT by identifying a missing link in the yeast glutathione metabolism and also by searching for the human GolgA5

  4. Search engines and the production of academic knowledge

    NARCIS (Netherlands)

    van Dijck, J.

    2010-01-01

    This article argues that search engines in general, and Google Scholar in particular, have become significant co-producers of academic knowledge. Knowledge is not simply conveyed to users, but is co-produced by search engines’ ranking systems and profiling systems, none of which are open to the

  5. Visual search in barn owls: Task difficulty and saccadic behavior.

    Science.gov (United States)

    Orlowski, Julius; Ben-Shahar, Ohad; Wagner, Hermann

    2018-01-01

    How do we find what we are looking for? A target can be in plain view, but it may be detected only after extensive search. During a search we make directed attentional deployments like saccades to segment the scene until we detect the target. Depending on difficulty, the search may be fast with few attentional deployments or slow with many, shorter deployments. Here we study visual search in barn owls by tracking their overt attentional deployments-that is, their head movements-with a camera. We conducted a low-contrast feature search, a high-contrast orientation conjunction search, and a low-contrast orientation conjunction search, each with set sizes varying from 16 to 64 items. The barn owls were able to learn all of these tasks and showed serial search behavior. In a subsequent step, we analyzed how search behavior of owls changes with search complexity. We compared the search mechanisms in these three serial searches with results from pop-out searches our group had reported earlier. Saccade amplitude shortened and fixation duration increased in difficult searches. Also, in conjunction search saccades were guided toward items with shared target features. These data suggest that during visual search, barn owls utilize mechanisms similar to those that humans use.

  6. Scaling, Similarity, and the Fourth Paradigm for Hydrology

    Science.gov (United States)

    Peters-Lidard, Christa D.; Clark, Martyn; Samaniego, Luis; Verhoest, Niko E. C.; van Emmerik, Tim; Uijlenhoet, Remko; Achieng, Kevin; Franz, Trenton E.; Woods, Ross

    2017-01-01

    In this synthesis paper addressing hydrologic scaling and similarity, we posit that roadblocks in the search for universal laws of hydrology are hindered by our focus on computational simulation (the third paradigm), and assert that it is time for hydrology to embrace a fourth paradigm of data-intensive science. Advances in information-based hydrologic science, coupled with an explosion of hydrologic data and advances in parameter estimation and modelling, have laid the foundation for a data-driven framework for scrutinizing hydrological scaling and similarity hypotheses. We summarize important scaling and similarity concepts (hypotheses) that require testing, describe a mutual information framework for testing these hypotheses, describe boundary condition, state flux, and parameter data requirements across scales to support testing these hypotheses, and discuss some challenges to overcome while pursuing the fourth hydrological paradigm. We call upon the hydrologic sciences community to develop a focused effort towards adopting the fourth paradigm and apply this to outstanding challenges in scaling and similarity.

  7. Scaling, similarity, and the fourth paradigm for hydrology

    Science.gov (United States)

    Peters-Lidard, Christa D.; Clark, Martyn; Samaniego, Luis; Verhoest, Niko E. C.; van Emmerik, Tim; Uijlenhoet, Remko; Achieng, Kevin; Franz, Trenton E.; Woods, Ross

    2017-07-01

    In this synthesis paper addressing hydrologic scaling and similarity, we posit that roadblocks in the search for universal laws of hydrology are hindered by our focus on computational simulation (the third paradigm) and assert that it is time for hydrology to embrace a fourth paradigm of data-intensive science. Advances in information-based hydrologic science, coupled with an explosion of hydrologic data and advances in parameter estimation and modeling, have laid the foundation for a data-driven framework for scrutinizing hydrological scaling and similarity hypotheses. We summarize important scaling and similarity concepts (hypotheses) that require testing; describe a mutual information framework for testing these hypotheses; describe boundary condition, state, flux, and parameter data requirements across scales to support testing these hypotheses; and discuss some challenges to overcome while pursuing the fourth hydrological paradigm. We call upon the hydrologic sciences community to develop a focused effort towards adopting the fourth paradigm and apply this to outstanding challenges in scaling and similarity.

  8. Log analysis to understand medical professionals' image searching behaviour.

    Science.gov (United States)

    Tsikrika, Theodora; Müller, Henning; Kahn, Charles E

    2012-01-01

    This paper reports on the analysis of the query logs of a visual medical information retrieval system that provides access to radiology resources. Our analysis shows that, despite sharing similarities with general Web search and also with biomedical text search, query formulation and query modification when searching for visual biomedical information have unique characteristics that need to be taken into account in order to enhance the effectiveness of the search support offered by such systems. Typical information needs of medical professionals searching radiology resources are also identified with the goal to create realistic search tasks for a medical image retrieval evaluation benchmark.

  9. Adaptive Large Neighbourhood Search

    DEFF Research Database (Denmark)

    Røpke, Stefan

    Large neighborhood search is a metaheuristic that has gained popularity in recent years. The heuristic repeatedly moves from solution to solution by first partially destroying the solution and then repairing it. The best solution observed during this search is presented as the final solution....... This tutorial introduces the large neighborhood search metaheuristic and the variant adaptive large neighborhood search that dynamically tunes parameters of the heuristic while it is running. Both heuristics belong to a broader class of heuristics that are searching a solution space using very large...

  10. Keyword Search in Databases

    CERN Document Server

    Yu, Jeffrey Xu; Chang, Lijun

    2009-01-01

    It has become highly desirable to provide users with flexible ways to query/search information over databases as simple as keyword search like Google search. This book surveys the recent developments on keyword search over databases, and focuses on finding structural information among objects in a database using a set of keywords. Such structural information to be returned can be either trees or subgraphs representing how the objects, that contain the required keywords, are interconnected in a relational database or in an XML database. The structural keyword search is completely different from

  11. A toolbox for representational similarity analysis.

    Directory of Open Access Journals (Sweden)

    Hamed Nili

    2014-04-01

    Full Text Available Neuronal population codes are increasingly being investigated with multivariate pattern-information analyses. A key challenge is to use measured brain-activity patterns to test computational models of brain information processing. One approach to this problem is representational similarity analysis (RSA, which characterizes a representation in a brain or computational model by the distance matrix of the response patterns elicited by a set of stimuli. The representational distance matrix encapsulates what distinctions between stimuli are emphasized and what distinctions are de-emphasized in the representation. A model is tested by comparing the representational distance matrix it predicts to that of a measured brain region. RSA also enables us to compare representations between stages of processing within a given brain or model, between brain and behavioral data, and between individuals and species. Here, we introduce a Matlab toolbox for RSA. The toolbox supports an analysis approach that is simultaneously data- and hypothesis-driven. It is designed to help integrate a wide range of computational models into the analysis of multichannel brain-activity measurements as provided by modern functional imaging and neuronal recording techniques. Tools for visualization and inference enable the user to relate sets of models to sets of brain regions and to statistically test and compare the models using nonparametric inference methods. The toolbox supports searchlight-based RSA, to continuously map a measured brain volume in search of a neuronal population code with a specific geometry. Finally, we introduce the linear-discriminant t value as a measure of representational discriminability that bridges the gap between linear decoding analyses and RSA. In order to demonstrate the capabilities of the toolbox, we apply it to both simulated and real fMRI data. The key functions are equally applicable to other modalities of brain-activity measurement. The

  12. Quantum Search Algorithms

    Science.gov (United States)

    Korepin, Vladimir E.; Xu, Ying

    This article reviews recent progress in quantum database search algorithms. The subject is presented in a self-contained and pedagogical way. The problem of searching a large database (a Hilbert space) for a target item is performed by the famous Grover algorithm which locates the target item with high probability and a quadratic speed-up compared with the corresponding classical algorithm. If the database is partitioned into blocks and one is searching for the block containing the target item instead of the target item itself, then the problem is referred to as partial search. Partial search trades accuracy for speed and the most efficient version is the Grover-Radhakrishnan-Korepin (GRK) algorithm. The target block can be further partitioned into sub-blocks so that GRK's can be performed in a sequence called a hierarchy. We study the Grover search and GRK partial search in detail and prove that a GRK hierarchy is less efficient than a direct GRK partial search. Both the Grover search and the GRK partial search can be generalized to the case with several target items (or target blocks for a GRK). The GRK partial search algorithm can also be represented in terms of group theory.

  13. Searching and Indexing Genomic Databases via Kernelization

    Directory of Open Access Journals (Sweden)

    Travis eGagie

    2015-02-01

    Full Text Available The rapid advance of DNA sequencing technologies has yielded databases of thousands of genomes. To search and index these databases effectively, it is important that we take advantage of the similarity between those genomes. Several authors have recently suggested searching or indexing only one reference genome and the parts of the other genomes where they differ. In this paper we survey the twenty-year history of this idea and discuss its relation to kernelization in parameterized complexity.

  14. Biophysical Profile

    Science.gov (United States)

    ... and pregnancy High-risk pregnancy Biophysical profile About Advertisement Mayo Clinic does not endorse companies or products. ... a Job Site Map About This Site Twitter Facebook Google YouTube Pinterest Mayo Clinic is a not- ...

  15. Book Review: Web Search-Public Searching of the Web

    OpenAIRE

    Yazdan Mansourian

    2004-01-01

    The book consists of four sections including (1) the context of web search, (2) how people search the web, (3) subjects of web search and (4) conclusion: trends and future directions. The first section includes three chapters addressing a brief but informative introduction about the main involved elements of web search process and web search research including search engines mechanism, human computer interaction in web searching and research design in web search studies.

  16. Automatic Planning of External Search Engine Optimization

    Directory of Open Access Journals (Sweden)

    Vita Jasevičiūtė

    2015-07-01

    Full Text Available This paper describes an investigation of the external search engine optimization (SEO action planning tool, dedicated to automatically extract a small set of most important keywords for each month during whole year period. The keywords in the set are extracted accordingly to external measured parameters, such as average number of searches during the year and for every month individually. Additionally the position of the optimized web site for each keyword is taken into account. The generated optimization plan is similar to the optimization plans prepared manually by the SEO professionals and can be successfully used as a support tool for web site search engine optimization.

  17. SCANPS: a web server for iterative protein sequence database searching by dynamic programing, with display in a hierarchical SCOP browser.

    Science.gov (United States)

    Walsh, Thomas P; Webber, Caleb; Searle, Stephen; Sturrock, Shane S; Barton, Geoffrey J

    2008-07-01

    SCANPS performs iterative profile searching similar to PSI-BLAST but with full dynamic programing on each cycle and on-the-fly estimation of significance. This combination gives good sensitivity and selectivity that outperforms PSI-BLAST in domain-searching benchmarks. Although computationally expensive, SCANPS exploits onchip parallelism (MMX and SSE2 instructions on Intel chips) as well as MPI parallelism to give acceptable turnround times even for large databases. A web server developed to run SCANPS searches is now available at http://www.compbio.dundee.ac.uk/www-scanps. The server interface allows a range of different protein sequence databases to be searched including the SCOP database of protein domains. The server provides the user with regularly updated versions of the main protein sequence databases and is backed up by significant computing resources which ensure that searches are performed rapidly. For SCOP searches, the results may be viewed in a new tree-based representation that reflects the structure of the SCOP hierarchy; this aids the user in placing each hit in the context of its SCOP classification and understanding its relationship to other domains in SCOP.

  18. Profiling cancer

    DEFF Research Database (Denmark)

    Ciro, Marco; Bracken, Adrian P; Helin, Kristian

    2003-01-01

    In the past couple of years, several very exciting studies have demonstrated the enormous power of gene-expression profiling for cancer classification and prediction of patient survival. In addition to promising a more accurate classification of cancer and therefore better treatment of patients......, gene-expression profiling can result in the identification of novel potential targets for cancer therapy and a better understanding of the molecular mechanisms leading to cancer....

  19. [Advanced online search techniques and dedicated search engines for physicians].

    Science.gov (United States)

    Nahum, Yoav

    2008-02-01

    In recent years search engines have become an essential tool in the work of physicians. This article will review advanced search techniques from the world of information specialists, as well as some advanced search engine operators that may help physicians improve their online search capabilities, and maximize the yield of their searches. This article also reviews popular dedicated scientific and biomedical literature search engines.

  20. University Students' Online Information Searching Strategies in Different Search Contexts

    Science.gov (United States)

    Tsai, Meng-Jung; Liang, Jyh-Chong; Hou, Huei-Tse; Tsai, Chin-Chung

    2012-01-01

    This study investigates the role of search context played in university students' online information searching strategies. A total of 304 university students in Taiwan were surveyed with questionnaires in which two search contexts were defined as searching for learning, and searching for daily life information. Students' online search strategies…

  1. Genetic algorithms as global random search methods

    Science.gov (United States)

    Peck, Charles C.; Dhawan, Atam P.

    1995-01-01

    Genetic algorithm behavior is described in terms of the construction and evolution of the sampling distributions over the space of candidate solutions. This novel perspective is motivated by analysis indicating that the schema theory is inadequate for completely and properly explaining genetic algorithm behavior. Based on the proposed theory, it is argued that the similarities of candidate solutions should be exploited directly, rather than encoding candidate solutions and then exploiting their similarities. Proportional selection is characterized as a global search operator, and recombination is characterized as the search process that exploits similarities. Sequential algorithms and many deletion methods are also analyzed. It is shown that by properly constraining the search breadth of recombination operators, convergence of genetic algorithms to a global optimum can be ensured.

  2. Search on Rugged Landscapes

    DEFF Research Database (Denmark)

    Billinger, Stephan; Stieglitz, Nils; Schumacher, Terry

    2014-01-01

    This paper presents findings from a laboratory experiment on human decision-making in a complex combinatorial task. We find strong evidence for a behavioral model of adaptive search. Success narrows down search to the neighborhood of the status quo, while failure promotes gradually more explorative...... search. Task complexity does not have a direct effect on behavior, but systematically affects the feedback conditions that guide success-induced exploitation and failure-induced exploration. The analysis also shows that human participants were prone to over-exploration, since they broke off the search...... for local improvements too early. We derive stylized decision rules that generate the search behavior observed in the experiment and discuss the implications of our findings for individual decision-making and organizational search....

  3. Supporting complex search tasks

    DEFF Research Database (Denmark)

    Gäde, Maria; Hall, Mark; Huurdeman, Hugo

    2015-01-01

    There is broad consensus in the field of IR that search is complex in many use cases and applications, both on the Web and in domain specific collections, and both professionally and in our daily life. Yet our understanding of complex search tasks, in comparison to simple look up tasks......, is fragmented at best. The workshop addressed the many open research questions: What are the obvious use cases and applications of complex search? What are essential features of work tasks and search tasks to take into account? And how do these evolve over time? With a multitude of information, varying from...... introductory to specialized, and from authoritative to speculative or opinionated, when to show what sources of information? How does the information seeking process evolve and what are relevant differences between different stages? With complex task and search process management, blending searching, browsing...

  4. Efficient protein structure search using indexing methods.

    Science.gov (United States)

    Kim, Sungchul; Sael, Lee; Yu, Hwanjo

    2013-01-01

    Understanding functions of proteins is one of the most important challenges in many studies of biological processes. The function of a protein can be predicted by analyzing the functions of structurally similar proteins, thus finding structurally similar proteins accurately and efficiently from a large set of proteins is crucial. A protein structure can be represented as a vector by 3D-Zernike Descriptor (3DZD) which compactly represents the surface shape of the protein tertiary structure. This simplified representation accelerates the searching process. However, computing the similarity of two protein structures is still computationally expensive, thus it is hard to efficiently process many simultaneous requests of structurally similar protein search. This paper proposes indexing techniques which substantially reduce the search time to find structurally similar proteins. In particular, we first exploit two indexing techniques, i.e., iDistance and iKernel, on the 3DZDs. After that, we extend the techniques to further improve the search speed for protein structures. The extended indexing techniques build and utilize an reduced index constructed from the first few attributes of 3DZDs of protein structures. To retrieve top-k similar structures, top-10 × k similar structures are first found using the reduced index, and top-k structures are selected among them. We also modify the indexing techniques to support θ-based nearest neighbor search, which returns data points less than θ to the query point. The results show that both iDistance and iKernel significantly enhance the searching speed. In top-k nearest neighbor search, the searching time is reduced 69.6%, 77%, 77.4% and 87.9%, respectively using iDistance, iKernel, the extended iDistance, and the extended iKernel. In θ-based nearest neighbor serach, the searching time is reduced 80%, 81%, 95.6% and 95.6% using iDistance, iKernel, the extended iDistance, and the extended iKernel, respectively.

  5. Using interpolation techniques to determine the optimal profile interval in ground-penetrating radar applications

    Science.gov (United States)

    Samet, Refik; Çelik, Ertuğ; Tural, Serhat; Şengönül, Erkan; Özkan, Merve; Damcı, Emre

    2017-05-01

    Ground-penetrating radar (GPR) is a geophysical method that is seeing increasing use for near-surface underground research and applications. To achieve the required depth and accuracy for the GPR measurement of underground structures, the effects of certain data acquisition and interpolation parameters should be considered. These parameters include antenna frequency, sample length, profile interval, trace interval, sampling interval and resolution of the mesh of grid. For example, when the profile interval is large, underground structures may not be identified with sufficient accuracy. By contrast, selecting a narrower profile interval will increase the accuracy but will also increase the acquisition cost and processing time. The main objective of this study is to determine the optimal profile and trace intervals and the optimal mesh of grid by analyzing the interpolation-induced effects of different profile and trace intervals and different meshes of grid on the measured geometry of underground structures. Within the defined objective, an analysis procedure is proposed. This procedure determines the optimal profile and trace intervals to scan the search area by GPR at the data acquisition phase and the optimal mesh of grid to visualize the underground structures with high accuracy at the data processing phase. Proposed procedure was tested on the synthetic and real data. According to the main findings, the users are suggested to use the optimal profile interval 0.25 m and trace interval 0.04 m to acquire the data from the search area and the optimal mesh of grid (0.025 × 0.01) m2 to visualize the underground structures with high accuracy and with optimum acquisition cost and processing time. Obtained results showed that the determined optimal profile and trace intervals and mesh of grid exert positive effects about 5% for similarity and 21% for error when determining and visualizing the geometry of underground structures.

  6. ElasticSearch cookbook

    CERN Document Server

    Paro, Alberto

    2013-01-01

    Written in an engaging, easy-to-follow style, the recipes will help you to extend the capabilities of ElasticSearch to manage your data effectively.If you are a developer who implements ElasticSearch in your web applications, manage data, or have decided to start using ElasticSearch, this book is ideal for you. This book assumes that you've got working knowledge of JSON and Java

  7. Mastering ElasticSearch

    CERN Document Server

    Kuc, Rafal

    2013-01-01

    A practical tutorial that covers the difficult design, implementation, and management of search solutions.Mastering ElasticSearch is aimed at to intermediate users who want to extend their knowledge about ElasticSearch. The topics that are described in the book are detailed, but we assume that you already know the basics, like the query DSL or data indexing. Advanced users will also find this book useful, as the examples are getting deep into the internals where it is needed.

  8. Demystifying the Search Button

    Science.gov (United States)

    McKeever, Liam; Nguyen, Van; Peterson, Sarah J.; Gomez-Perez, Sandra

    2015-01-01

    A thorough review of the literature is the basis of all research and evidence-based practice. A gold-standard efficient and exhaustive search strategy is needed to ensure all relevant citations have been captured and that the search performed is reproducible. The PubMed database comprises both the MEDLINE and non-MEDLINE databases. MEDLINE-based search strategies are robust but capture only 89% of the total available citations in PubMed. The remaining 11% include the most recent and possibly relevant citations but are only searchable through less efficient techniques. An effective search strategy must employ both the MEDLINE and the non-MEDLINE portion of PubMed to ensure all studies have been identified. The robust MEDLINE search strategies are used for the MEDLINE portion of the search. Usage of the less robust strategies is then efficiently confined to search only the remaining 11% of PubMed citations that have not been indexed for MEDLINE. The current article offers step-by-step instructions for building such a search exploring methods for the discovery of medical subject heading (MeSH) terms to search MEDLINE, text-based methods for exploring the non-MEDLINE database, information on the limitations of convenience algorithms such as the “related citations feature,” the strengths and pitfalls associated with commonly used filters, the proper usage of Boolean operators to organize a master search strategy, and instructions for automating that search through “MyNCBI” to receive search query updates by email as new citations become available. PMID:26129895

  9. Interest in Anesthesia as Reflected by Keyword Searches using Common Search Engines.

    Science.gov (United States)

    Liu, Renyu; García, Paul S; Fleisher, Lee A

    2012-01-23

    Since current general interest in anesthesia is unknown, we analyzed internet keyword searches to gauge general interest in anesthesia in comparison with surgery and pain. The trend of keyword searches from 2004 to 2010 related to anesthesia and anaesthesia was investigated using Google Insights for Search. The trend of number of peer reviewed articles on anesthesia cited on PubMed and Medline from 2004 to 2010 was investigated. The average cost on advertising on anesthesia, surgery and pain was estimated using Google AdWords. Searching results in other common search engines were also analyzed. Correlation between year and relative number of searches was determined with psearch engines may provide different total number of searching results (available posts), the ratios of searching results between some common keywords related to perioperative care are comparable, indicating similar trend. The peer reviewed manuscripts on "anesthesia" and the proportion of papers on "anesthesia and outcome" are trending up. Estimates for spending of advertising dollars are less for anesthesia-related terms when compared to that for pain or surgery due to relative smaller number of searching traffic. General interest in anesthesia (anaesthesia) as measured by internet searches appears to be decreasing. Pain, preanesthesia evaluation, anesthesia and outcome and side effects of anesthesia are the critical areas that anesthesiologists should focus on to address the increasing concerns.

  10. Interest in Anesthesia as Reflected by Keyword Searches using Common Search Engines

    Science.gov (United States)

    Liu, Renyu; García, Paul S.; Fleisher, Lee A.

    2012-01-01

    Background Since current general interest in anesthesia is unknown, we analyzed internet keyword searches to gauge general interest in anesthesia in comparison with surgery and pain. Methods The trend of keyword searches from 2004 to 2010 related to anesthesia and anaesthesia was investigated using Google Insights for Search. The trend of number of peer reviewed articles on anesthesia cited on PubMed and Medline from 2004 to 2010 was investigated. The average cost on advertising on anesthesia, surgery and pain was estimated using Google AdWords. Searching results in other common search engines were also analyzed. Correlation between year and relative number of searches was determined with pengines may provide different total number of searching results (available posts), the ratios of searching results between some common keywords related to perioperative care are comparable, indicating similar trend. The peer reviewed manuscripts on “anesthesia” and the proportion of papers on “anesthesia and outcome” are trending up. Estimates for spending of advertising dollars are less for anesthesia-related terms when compared to that for pain or surgery due to relative smaller number of searching traffic. Conclusions General interest in anesthesia (anaesthesia) as measured by internet searches appears to be decreasing. Pain, preanesthesia evaluation, anesthesia and outcome and side effects of anesthesia are the critical areas that anesthesiologists should focus on to address the increasing concerns. PMID:23853739

  11. When Gravity Fails: Local Search Topology

    Science.gov (United States)

    Frank, Jeremy; Cheeseman, Peter; Stutz, John; Lau, Sonie (Technical Monitor)

    1997-01-01

    Local search algorithms for combinatorial search problems frequently encounter a sequence of states in which it is impossible to improve the value of the objective function; moves through these regions, called {\\em plateau moves), dominate the time spent in local search. We analyze and characterize {\\em plateaus) for three different classes of randomly generated Boolean Satisfiability problems. We identify several interesting features of plateaus that impact the performance of local search algorithms. We show that local minima tend to be small but occasionally may be very large. We also show that local minima can be escaped without unsatisfying a large number of clauses, but that systematically searching for an escape route may be computationally expensive if the local minimum is large. We show that plateaus with exits, called benches, tend to be much larger than minima, and that some benches have very few exit states which local search can use to escape. We show that the solutions (i.e. global minima) of randomly generated problem instances form clusters, which behave similarly to local minima. We revisit several enhancements of local search algorithms and explain their performance in light of our results. Finally we discuss strategies for creating the next generation of local search algorithms.

  12. Automatization and training in visual search.

    Science.gov (United States)

    Czerwinski, M; Lightfoot, N; Shiffrin, R M

    1992-01-01

    In several search tasks, the amount of practice on particular combinations of targets and distractors was equated in varied-mapping (VM) and consistent-mapping (CM) conditions. The results indicate the importance of distinguishing between memory and visual search tasks, and implicate a number of factors that play important roles in visual search and its learning. Visual search was studied in Experiment 1. VM and CM performance were almost equal, and slope reductions occurred during practice for both, suggesting the learning of efficient attentive search based on features, and no important role for automatic attention attraction. However, positive transfer effects occurred when previous CM targets were re-paired with previous CM distractors, even though these targets and distractors had not been trained together. Also, the introduction of a demanding simultaneous task produced advantages of CM over VM. These latter two results demonstrated the operation of automatic attention attraction. Visual search was further studied in Experiment 2, using novel characters for which feature overlap and similarity were controlled. The design and many of the findings paralleled Experiment 1. In addition, enormous search improvement was seen over 35 sessions of training, suggesting the operation of perceptual unitization for the novel characters. Experiment 3 showed a large, persistent advantage for CM over VM performance in memory search, even when practice on particular combinations of targets and distractors was equated in the two training conditions. A multifactor theory of automatization and attention is put forth to account for these findings and others in the literature.

  13. Intrinsic position uncertainty impairs overt search performance.

    Science.gov (United States)

    Semizer, Yelda; Michel, Melchi M

    2017-08-01

    Uncertainty regarding the position of the search target is a fundamental component of visual search. However, due to perceptual limitations of the human visual system, this uncertainty can arise from intrinsic, as well as extrinsic, sources. The current study sought to characterize the role of intrinsic position uncertainty (IPU) in overt visual search and to determine whether it significantly limits human search performance. After completing a preliminary detection experiment to characterize sensitivity as a function of visual field position, observers completed a search task that required localizing a Gabor target within a field of synthetic luminance noise. The search experiment included two clutter conditions designed to modulate the effect of IPU across search displays of varying set size. In the Cluttered condition, the display was tiled uniformly with feature clutter to maximize the effects of IPU. In the Uncluttered condition, the clutter at irrelevant locations was removed to attenuate the effects of IPU. Finally, we derived an IPU-constrained ideal searcher model, limited by the IPU measured in human observers. Ideal searchers were simulated based on the detection sensitivity and fixation sequences measured for individual human observers. The IPU-constrained ideal searcher predicted performance trends similar to those exhibited by the human observers. In the Uncluttered condition, performance decreased steeply as a function of increasing set size. However, in the Cluttered condition, the effect of IPU dominated and performance was approximately constant as a function of set size. Our findings suggest that IPU substantially limits overt search performance, especially in crowded displays.

  14. Exotic Higgs searches

    CERN Document Server

    Pelliccioni, Mario

    2016-01-01

    Exotic Higgs searches cover a wide range of signatures, thus leading to indications to new physics beyond Standard Model. We report a review on exotic Higgs searches for lepton flavour violating Higgs decays, for "mono-Higgs" searches, for Higgs decays to invisible and for high mass Higgs searches. Both ATLAS and CMS results will be shown, for Run-1 data statistics collected at the energy of $\\sqrt s$ = 7,8 TeV and for the first data collected during Run-2 phase at the energy of $\\sqrt s$ = 13 TeV.

  15. Google Power Search

    CERN Document Server

    Spencer, Stephan

    2011-01-01

    Behind Google's deceptively simple interface is immense power for both market and competitive research-if you know how to use it well. Sure, basic searches are easy, but complex searches require specialized skills. This concise book takes you through the full range of Google's powerful search-refinement features, so you can quickly find the specific information you need. Learn techniques ranging from simple Boolean logic to URL parameters and other advanced tools, and see how they're applied to real-world market research examples. Incorporate advanced search operators such as filetype:, intit

  16. Delaying information search

    Directory of Open Access Journals (Sweden)

    Yaniv Shani

    2012-11-01

    Full Text Available In three studies, we examined factors that may temporarily attenuate information search. People are generally curious and dislike uncertainty, which typically encourages them to look for relevant information. Despite these strong forces that promote information search, people sometimes deliberately delay obtaining valuable information. We find they may do so when they are concerned that the information might interfere with future pleasurable activities. Interestingly, the decision to search or to postpone searching for information is influenced not only by the value and importance of the information itself but also by well-being maintenance goals related to possible detrimental effects that negative knowledge may have on unrelated future plans.

  17. Target similarity effects: support for the parallel distributed processing assumptions.

    Science.gov (United States)

    Humphreys, M S; Tehan, G; O'Shea, A; Bolland, S W

    2000-07-01

    Recent research has begun to provide support for the assumptions that memories are stored as a composite and are accessed in parallel (Tehan & Humphreys, 1998). New predictions derived from these assumptions and from the Chappell and Humphreys (1994) implementation of these assumptions were tested. In three experiments, subjects studied relatively short lists of words. Some of the lists contained two similar targets (thief and theft) or two dissimilar targets (thief and steal) associated with the same cue (robbery). As predicted, target similarity affected performance in cued recall but not free association. Contrary to predictions, two spaced presentations of a target did not improve performance in free association. Two additional experiments confirmed and extended this finding. Several alternative explanations for the target similarity effect, which incorporate assumptions about separate representations and sequential search, are rejected. The importance of the finding that, in at least one implicit memory paradigm, repetition does not improve performance is also discussed.

  18. The Hofmethode: Computing Semantic Similarities between E-Learning Products

    Directory of Open Access Journals (Sweden)

    Oliver Michel

    2009-11-01

    Full Text Available The key task in building useful e-learning repositories is to develop a system with an algorithm allowing users to retrieve information that corresponds to their specific requirements. To achieve this, products (or their verbal descriptions, i.e. presented in metadata need to be compared and structured according to the results of this comparison. Such structuring is crucial insofar as there are many search results that correspond to the entered keyword. The Hofmethode is an algorithm (based on psychological considerations to compute semantic similarities between texts and therefore offer a way to compare e-learning products. The computed similarity values are used to build semantic maps in which the products are visually arranged according to their similarities. The paper describes how the Hofmethode is implemented in the online database edulap, and how it contributes to help the user to explore the data in which he is interested.

  19. Selective scanpath repetition during memory-guided visual search.

    Science.gov (United States)

    Wynn, Jordana S; Bone, Michael B; Dragan, Michelle C; Hoffman, Kari L; Buchsbaum, Bradley R; Ryan, Jennifer D

    2016-01-02

    Visual search efficiency improves with repetition of a search display, yet the mechanisms behind these processing gains remain unclear. According to Scanpath Theory, memory retrieval is mediated by repetition of the pattern of eye movements or "scanpath" elicited during stimulus encoding. Using this framework, we tested the prediction that scanpath recapitulation reflects relational memory guidance during repeated search events. Younger and older subjects were instructed to find changing targets within flickering naturalistic scenes. Search efficiency (search time, number of fixations, fixation duration) and scanpath similarity (repetition) were compared across age groups for novel (V1) and repeated (V2) search events. Younger adults outperformed older adults on all efficiency measures at both V1 and V2, while the search time benefit for repeated viewing (V1-V2) did not differ by age. Fixation-binned scanpath similarity analyses revealed repetition of initial and final (but not middle) V1 fixations at V2, with older adults repeating more initial V1 fixations than young adults. In young adults only, early scanpath similarity correlated negatively with search time at test, indicating increased efficiency, whereas the similarity of V2 fixations to middle V1 fixations predicted poor search performance. We conclude that scanpath compression mediates increased search efficiency by selectively recapitulating encoding fixations that provide goal-relevant input. Extending Scanpath Theory, results suggest that scanpath repetition varies as a function of time and memory integrity.

  20. A similarity-based data warehousing environment for medical images.

    Science.gov (United States)

    Teixeira, Jefferson William; Annibal, Luana Peixoto; Felipe, Joaquim Cezar; Ciferri, Ricardo Rodrigues; Ciferri, Cristina Dutra de Aguiar

    2015-11-01

    A core issue of the decision-making process in the medical field is to support the execution of analytical (OLAP) similarity queries over images in data warehousing environments. In this paper, we focus on this issue. We propose imageDWE, a non-conventional data warehousing environment that enables the storage of intrinsic features taken from medical images in a data warehouse and supports OLAP similarity queries over them. To comply with this goal, we introduce the concept of perceptual layer, which is an abstraction used to represent an image dataset according to a given feature descriptor in order to enable similarity search. Based on this concept, we propose the imageDW, an extended data warehouse with dimension tables specifically designed to support one or more perceptual layers. We also detail how to build an imageDW and how to load image data into it. Furthermore, we show how to process OLAP similarity queries composed of a conventional predicate and a similarity search predicate that encompasses the specification of one or more perceptual layers. Moreover, we introduce an index technique to improve the OLAP query processing over images. We carried out performance tests over a data warehouse environment that consolidated medical images from exams of several modalities. The results demonstrated the feasibility and efficiency of our proposed imageDWE to manage images and to process OLAP similarity queries. The results also demonstrated that the use of the proposed index technique guaranteed a great improvement in query processing. Copyright © 2015 Elsevier Ltd. All rights reserved.

  1. Efficient EMD-based Similarity Search in Multimedia Databases via Flexible Dimensionality Reduction

    DEFF Research Database (Denmark)

    Wichterich, Marc; Assent, Ira; Philipp, Kranen

    2008-01-01

    to medium dimensionality. However, multimedia applications commonly exhibit high-dimensional feature representations for which the computational complexity of the EMD hinders its adoption. An efficient query processing approach that mitigates and overcomes this effect is crucial. We propose novel...

  2. HBLAST: Parallelised sequence similarity--A Hadoop MapReducable basic local alignment search tool

    National Research Council Canada - National Science Library

    O'Driscoll, Aisling; Belogrudov, Vladislav; Carroll, John; Kropp, Kai; Walsh, Paul; Ghazal, Peter; Sleator, Roy D

    2015-01-01

    ... large, complex and require rapid processing. The Hadoop framework, comprised of distributed storage and a parallelised programming framework known as MapReduce, is specifically designed to work with such datasets but it is...

  3. HBLAST: Parallelised sequence similarity--A Hadoop MapReducable basic local alignment search tool

    National Research Council Canada - National Science Library

    O'Driscoll, Aisling; Belogrudov, Vladislav; Carroll, John; Kropp, Kai; Walsh, Paul; Ghazal, Peter; Sleator, Roy D

    2015-01-01

    ... large, complex and require rapid processing. The Hadoop framework, comprised of distributed storage and a parallelised programming framework known as MapReduce, is specifically designed to work with such datasets...

  4. Personalized online information search and visualization

    Directory of Open Access Journals (Sweden)

    Orthner Helmuth F

    2005-03-01

    Full Text Available Abstract Background The rapid growth of online publications such as the Medline and other sources raises the questions how to get the relevant information efficiently. It is important, for a bench scientist, e.g., to monitor related publications constantly. It is also important, for a clinician, e.g., to access the patient records anywhere and anytime. Although time-consuming, this kind of searching procedure is usually similar and simple. Likely, it involves a search engine and a visualization interface. Different words or combination reflects different research topics. The objective of this study is to automate this tedious procedure by recording those words/terms in a database and online sources, and use the information for an automated search and retrieval. The retrieved information will be available anytime and anywhere through a secure web server. Results We developed such a database that stored searching terms, journals and et al., and implement a piece of software for searching the medical subject heading-indexed sources such as the Medline and other online sources automatically. The returned information were stored locally, as is, on a server and visible through a Web-based interface. The search was performed daily or otherwise scheduled and the users logon to the website anytime without typing any words. The system has potentials to retrieve similarly from non-medical subject heading-indexed literature or a privileged information source such as a clinical information system. The issues such as security, presentation and visualization of the retrieved information were thus addressed. One of the presentation issues such as wireless access was also experimented. A user survey showed that the personalized online searches saved time and increased and relevancy. Handheld devices could also be used to access the stored information but less satisfactory. Conclusion The Web-searching software or similar system has potential to be an efficient

  5. Strain Elastography of Breast and Prostata Cancer: Similarities and Differences.

    Science.gov (United States)

    Daniaux, M; Auer, T; De Zordo, T; Junker, D; Santner, W; Hubalek, M; Jaschke, W; Aigner, F

    2016-03-01

    Typically both breast and prostate cancer present as tissue with decreased elasticity. Palpation is the oldest technique of tumor detection in both organs and is based on this principle. Thus an operator can grade a palpable mass as suspicious for cancer. Strain elastography as modern ultrasound technique allows the visualization of tissue elasticity in a color coded elastogram and can be understood as technical finger. The following article shows similarities and differences of ultrasound strain elastography in the diagnosis of breast and prostate cancer. • In prostata cancer elastography, in breast cancer B-mode is the primary sonographic search modality. • The diagnostic value of the search modalities change with increasing age.• A cut-off value for a strain ratio is hard to obtain in the elastography of the prostata, because there is no stabile reference tissue in the prostata. © Georg Thieme Verlag KG Stuttgart · New York.

  6. Narcissists of a Feather Flock Together: Narcissism and the Similarity of Friends.

    Science.gov (United States)

    Maaß, Ulrike; Lämmle, Lena; Bensch, Doreen; Ziegler, Matthias

    2016-03-01

    Who is willing to expose himself or herself to narcissists on a long-term basis? Studies that address the interactions of narcissists focus mainly on their interactions with strangers. Hence, the aim of the present study was to investigate the extent to which two best friends' similarity in narcissism would influence their similarities in other personality profiles. A total of 290 best friends' dyads filled out measurements of the whole Dark Triad as well as the Big Five. For each personality domain, profile similarity and its dependence on the similarity in the Dark Triad were determined. Results showed that the friends' similarity in narcissism significantly predicted similarity in all Big Five domains. For the general Big Five similarity as well as extraversion, the effect of narcissism similarity was stronger for male than female or mixed friends. Similarity in psychopathy and Machiavellianism significantly predicted all domains except for openness and extraversion, respectively. © 2016 by the Society for Personality and Social Psychology, Inc.

  7. IMPROVING PERSONALIZED WEB SEARCH USING BOOKSHELF DATA STRUCTURE

    Directory of Open Access Journals (Sweden)

    S.K. Jayanthi

    2012-10-01

    Full Text Available Search engines are playing a vital role in retrieving relevant information for the web user. In this research work a user profile based web search is proposed. So the web user from different domain may receive different set of results. The main challenging work is to provide relevant results at the right level of reading difficulty. Estimating user expertise and re-ranking the results are the main aspects of this paper. The retrieved results are arranged in Bookshelf Data Structure for easy access. Better presentation of search results hence increases the usability of web search engines significantly in visual mode.

  8. Spying on Search Strategies

    Science.gov (United States)

    Tenopir, Carol

    2004-01-01

    Only the most dedicated super-searchers are motivated to learn and control command systems, like DialogClassic, that rely on the user to input complex search strategies. Infrequent searchers and most end users choose interfaces that do some of the work for them and make the search process appear easy. However, the easier a good interface seems to…

  9. Agronomie Africaine: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  10. Scientia Africana: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  11. With News Search Engines

    Science.gov (United States)

    Gunn, Holly

    2005-01-01

    Although there are many news search engines on the Web, finding the news items one wants can be challenging. Choosing appropriate search terms is one of the biggest challenges. Unless one has seen the article that one is seeking, it is often difficult to select words that were used in the headline or text of the article. The limited archives of…

  12. Biokemistri: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  13. Searches for Supersymmetry

    CERN Document Server

    Ventura, Andrea; The ATLAS collaboration

    2017-01-01

    New and recent results on Supersymmetry searches are shown for the ATLAS and the CMS experiments. Analyses with about 36 fb$^{-1}$ are considered for searches concerning light squarks and gluinos, direct pair production of 3$^{rd}$ generation squarks, electroweak production of charginos, neutralinos, sleptons, R-parity violating scenarios and long-lived particles.

  14. Search and Recommendation

    DEFF Research Database (Denmark)

    Bogers, Toine

    2014-01-01

    -scale application by companies like Amazon, Facebook, and Netflix. But are search and recommendation really two different fields of research that address different problems with different sets of algorithms in papers published at distinct conferences? In my talk, I want to argue that search and recommendation...

  15. Search and switching costs

    NARCIS (Netherlands)

    Siekman, Wilhelm Henricus

    2016-01-01

    This thesis analyses markets with search and with switching costs. It provides insights in several important issues in search markets, including how loss aversion may affect consumer behavior and firm conduct, and how prices, welfare, and profits may change when an intermediating platform orders

  16. Sciences & Nature: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  17. Afrika Statistika: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  18. Quaestiones Mathematicae: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  19. Acta Theologica: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  20. Nigerian Libraries: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  1. Water SA: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  2. Lexikos: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  3. Philosophical Papers: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  4. Kiswahili: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  5. Afrimedic Journal: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  6. Kenya Veterinarian: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  7. Building Internet Search Engines

    Directory of Open Access Journals (Sweden)

    Mustafa Akgül

    1996-09-01

    Full Text Available Internet search engines are powerful tools to find electronics objects such as addresses of individuals and institutions, documents, statistics of all kinds, dictionaries, cata­logs, product information etc. This paper explains how to build and run some very common search engines on Unix platforms, so as to serve documents through the Web.

  8. ElasticSearch cookbook

    CERN Document Server

    Paro, Alberto

    2015-01-01

    If you are a developer who implements ElasticSearch in your web applications and want to sharpen your understanding of the core elements and applications, this is the book for you. It is assumed that you've got working knowledge of JSON and, if you want to extend ElasticSearch, of Java and related technologies.

  9. Searches for Supersymmetry

    CERN Document Server

    Ventura, Andrea; The ATLAS collaboration

    2017-01-01

    New and recents results on Supersymmetry searches are shown for the ATLAS and the CMS experiments. Analyses with about 36 fb^-1 are considered for searches concerning light squarks and gluinos, direct pair production of 3rd generation squarks, electroweak production of charginos, neutralinos, sleptons, R-parity violating scenarios and long-lived particles.

  10. African Environment: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  11. Human memory search

    NARCIS (Netherlands)

    Davelaar, E.J.; Raaijmakers, J.G.W.; Hills, T.T.; Robbins, T.W.; Todd, P.M.

    2012-01-01

    The importance of understanding human memory search is hard to exaggerate: we build and live our lives based on what whe remember. This chapter explores the characteristics of memory search, with special emphasis on the use of retrieval cues. We introduce the dependent measures that are obtained

  12. Fixing Dataset Search

    Science.gov (United States)

    Lynnes, Chris

    2014-01-01

    Three current search engines are queried for ozone data at the GES DISC. The results range from sub-optimal to counter-intuitive. We propose a method to fix dataset search by implementing a robust relevancy ranking scheme. The relevancy ranking scheme is based on several heuristics culled from more than 20 years of helping users select datasets.

  13. Vulture News: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  14. Ergonomics SA: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  15. Zede Journal: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  16. African Anthropologist: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  17. Africa Development: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  18. Innovation: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  19. Acta Structilia: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  20. Agrosearch: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  1. African Zoology: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  2. Rwanda Journal: Advanced Search

    African Journals Online (AJOL)

    Search tips: Search terms are case-insensitive; Common words are ignored; By default only articles containing all terms in the query are returned (i.e., AND is implied); Combine multiple words with OR to find articles containing either term; e.g., education OR research; Use parentheses to create more complex queries; e.g., ...

  3. Distributed Deep Web Search

    NARCIS (Netherlands)

    Tjin-Kam-Jet, Kien

    2013-01-01

    The World Wide Web contains billions of documents (and counting); hence, it is likely that some document will contain the answer or content you are searching for. While major search engines like Bing and Google often manage to return relevant results to your query, there are plenty of situations in

  4. Hit expansion approaches using multiple similarity methods and virtualized query structures.

    Science.gov (United States)

    Bergner, Andreas; Parel, Serge P

    2013-05-24

    Ligand-based virtual screening and computational hit expansion methods undoubtedly facilitate the finding of novel active chemical entities, utilizing already existing knowledge of active compounds. It has been demonstrated that the parallel execution of complementary similarity search methods enhances the performance of such virtual screening campaigns. In this article, we examine the use of virtualized template (query, seed) structures as an extension to common search methods, such as fingerprint and pharmacophore graph-based similarity searches. We demonstrate that template virtualization by bioisosteric enumeration and other rule-based methods, in combination with standard similarity search techniques, represents a powerful approach for hit expansion following high-throughput screening campaigns. The reliability of the methods is demonstrated by four different test data sets representing different target classes and two hit finding case studies on the epigenetic targets G9a and LSD1.

  5. Search and Disrupt

    DEFF Research Database (Denmark)

    Ørding Olsen, Anders

    This paper analyzes how external search is affected by strategic interest alignment among knowledge sources. I focus on misalignment arising from the heterogeneous effects of disruptive technologies by analyzing the influence of incumbents on 2,855 non-incumbents? external knowledge search efforts....... The efforts most likely to solve innovation problems obtained funding from the European Commission?s 7th Framework Program (2007-2013). The results show that involving incumbents improves search in complementary technologies, while demoting it when strategic interests are misaligned in disruptive technologies....... However, incumbent sources engaged in capability reconfiguration to accommodate disruption improve search efforts in disruptive technologies. The paper concludes that the value of external sources is contingent on more than their knowledge. Specifically, interdependence of sources in search gives rise...

  6. Semantic search among heterogeneous biological databases based on gene ontology.

    Science.gov (United States)

    Cao, Shun-Liang; Qin, Lei; He, Wei-Zhong; Zhong, Yang; Zhu, Yang-Yong; Li, Yi-Xue

    2004-05-01

    Semantic search is a key issue in integration of heterogeneous biological databases. In this paper, we present a methodology for implementing semantic search in BioDW, an integrated biological data warehouse. Two tables are presented: the DB2GO table to correlate Gene Ontology (GO) annotated entries from BioDW data sources with GO, and the semantic similarity table to record similarity scores derived from any pair of GO terms. Based on the two tables, multifarious ways for semantic search are provided and the corresponding entries in heterogeneous biological databases in semantic terms can be expediently searched.

  7. Fast Structural Search in Phylogenetic Databases

    Directory of Open Access Journals (Sweden)

    William H. Piel

    2005-01-01

    Full Text Available As the size of phylogenetic databases grows, the need for efficiently searching these databases arises. Thanks to previous and ongoing research, searching by attribute value and by text has become commonplace in these databases. However, searching by topological or physical structure, especially for large databases and especially for approximate matches, is still an art. We propose structural search techniques that, given a query or pattern tree P and a database of phylogenies D, find trees in D that are sufficiently close to P . The “closeness” is a measure of the topological relationships in P that are found to be the same or similar in a tree D in D. We develop a filtering technique that accelerates searches and present algorithms for rooted and unrooted trees where the trees can be weighted or unweighted. Experimental results on comparing the similarity measure with existing tree metrics and on evaluating the efficiency of the search techniques demonstrate that the proposed approach is promising

  8. Enhancing Media Personalization by Extracting Similarity Knowledge from Metadata

    DEFF Research Database (Denmark)

    Butkus, Andrius

    with many interrelated parts – recommendation engines, content metadata, contextual information and user profiles. In the center of any type of recommendation lies the notion of similarity. The most popular way to approach similarity is to look for the feature overlaps. This results often in recommending......, especially when we talk about the contents of music. Therefore the main focus in the thesis is how to extract such emotional information from media, and how to use it to enhance media personalization. This dissertation proposes a novel method to extract emotional information from text (unstructured metadata......) using Latent Semantic Analysis (one of the unsupervised machine learning techniques). It presents three separate cases to illustrate the similarity knowledge extraction from the metadata, where the emotional components in each case represents different abstraction levels – genres, synopsis and lyrics...

  9. Quantifying the Search Behaviour of Different Demographics Using Google Correlate

    Science.gov (United States)

    Letchford, Adrian; Preis, Tobias; Moat, Helen Susannah

    2016-01-01

    Vast records of our everyday interests and concerns are being generated by our frequent interactions with the Internet. Here, we investigate how the searches of Google users vary across U.S. states with different birth rates and infant mortality rates. We find that users in states with higher birth rates search for more information about pregnancy, while those in states with lower birth rates search for more information about cats. Similarly, we find that users in states with higher infant mortality rates search for more information about credit, loans and diseases. Our results provide evidence that Internet search data could offer new insight into the concerns of different demographics. PMID:26910464

  10. Quantifying the Search Behaviour of Different Demographics Using Google Correlate.

    Directory of Open Access Journals (Sweden)

    Adrian Letchford

    Full Text Available Vast records of our everyday interests and concerns are being generated by our frequent interactions with the Internet. Here, we investigate how the searches of Google users vary across U.S. states with different birth rates and infant mortality rates. We find that users in states with higher birth rates search for more information about pregnancy, while those in states with lower birth rates search for more information about cats. Similarly, we find that users in states with higher infant mortality rates search for more information about credit, loans and diseases. Our results provide evidence that Internet search data could offer new insight into the concerns of different demographics.

  11. Fellow Profile

    Indian Academy of Sciences (India)

    Fellow Profile. Elected: 1971 Section: Chemistry. Narasimhan, Prof. Palliakaranai Thirumalai Ph.D. (Madras), FNA, FNASc. Date of birth: 28 July 1928. Date of death: 3 May 2013. Specialization: Theoretical Chemistry and Magnetic Resonance Last known address: 1013, Lupine Drive, Sunnyvale, CA 94086, USA. YouTube ...

  12. Similarities and differences in borderline and organic personality disorder.

    Science.gov (United States)

    Mathiesen, Birgit B; Simonsen, Erik; Soegaard, Ulf; Kvist, Kajsa

    2014-01-01

    Previous research has shown that brain injury patients with Organic Personality Disorder (OPD) may display "borderline" traits due to prefrontal damage, and their personality structure may be unstable and close to a borderline personality organisation. They may have few general neuropsychological dysfunctions but specific executive deficits. Similar deficits have been found in patients with Borderline Personality Disorder (BPD). The objective of this study was to identify differences and similarities between the neuropsychological and personality profiles of BPD and OPD patients. Twenty BPD patients and 24 OPD patients were assessed with the Structured Clinical Interview for DSM-IV Axis II Disorders (SCID-II), the Karolinska Psychodynamic Profile (KAPP), and a comprehensive neuropsychological test battery. Very few neuropsychological differences were found between the two patient groups. However, the verbal fluency, verbal intelligence, verbal memory, and immediate auditory memory/attention of the BPD patients were significantly poorer than the OPD patients'. The KAPP profiles of the BPD patients showed significantly poorer functioning in three areas: frustration tolerance, the body as a factor of self-esteem, and overall personality organisation. These results support our clinical experience and expectations concerning the severity of symptoms of both patient groups. We suggest considering in depth assessments of both neuropsychological and personality-related problems for each of these patients in order to inform treatment.

  13. Asthma and COPD: Differences and Similarities

    Science.gov (United States)

    ... and COPD: differences and similarities Share | Asthma and COPD: Differences and Similarities This article has been reviewed ... or you could have Chronic Obstructive Pulmonary Disease (COPD) , such as emphysema or chronic bronchitis. Because asthma ...

  14. How Users Search the Library from a Single Search Box

    Science.gov (United States)

    Lown, Cory; Sierra, Tito; Boyer, Josh

    2013-01-01

    Academic libraries are turning increasingly to unified search solutions to simplify search and discovery of library resources. Unfortunately, very little research has been published on library user search behavior in single search box environments. This study examines how users search a large public university library using a prominent, single…

  15. Skewed Binary Search Trees

    DEFF Research Database (Denmark)

    Brodal, Gerth Stølting; Moruz, Gabriel

    2006-01-01

    It is well-known that to minimize the number of comparisons a binary search tree should be perfectly balanced. Previous work has shown that a dominating factor over the running time for a search is the number of cache faults performed, and that an appropriate memory layout of a binary search tree...... can reduce the number of cache faults by several hundred percent. Motivated by the fact that during a search branching to the left or right at a node does not necessarily have the same cost, e.g. because of branch prediction schemes, we in this paper study the class of skewed binary search trees....... For all nodes in a skewed binary search tree the ratio between the size of the left subtree and the size of the tree is a fixed constant (a ratio of 1/2 gives perfect balanced trees). In this paper we present an experimental study of various memory layouts of static skewed binary search trees, where each...

  16. JSC Search System Usability Case Study

    Science.gov (United States)

    Meza, David; Berndt, Sarah

    2014-01-01

    The advanced nature of "search" has facilitated the movement from keyword match to the delivery of every conceivable information topic from career, commerce, entertainment, learning... the list is infinite. At NASA Johnson Space Center (JSC ) the Search interface is an important means of knowledge transfer. By indexing multiple sources between directorates and organizations, the system's potential is culture changing in that through search, knowledge of the unique accomplishments in engineering and science can be seamlessly passed between generations. This paper reports the findings of an initial survey, the first of a four part study to help determine user sentiment on the intranet, or local (JSC) enterprise search environment as well as the larger NASA enterprise. The survey is a means through which end users provide direction on the development and transfer of knowledge by way of the search experience. The ideal is to identify what is working and what needs to be improved from the users' vantage point by documenting: (1) Where users are satisfied/dissatisfied (2) Perceived value of interface components (3) Gaps which cause any disappointment in search experience. The near term goal is it to inform JSC search in order to improve users' ability to utilize existing services and infrastructure to perform tasks with a shortened life cycle. Continuing steps include an agency based focus with modified questions to accomplish a similar purpose

  17. Interactive searching of facial image databases

    Science.gov (United States)

    Nicholls, Robert A.; Shepherd, John W.; Shepherd, Jean

    1995-09-01

    A set of psychological facial descriptors has been devised to enable computerized searching of criminal photograph albums. The descriptors have been used to encode image databased of up to twelve thousand images. Using a system called FACES, the databases are searched by translating a witness' verbal description into corresponding facial descriptors. Trials of FACES have shown that this coding scheme is more productive and efficient than searching traditional photograph albums. An alternative method of searching the encoded database using a genetic algorithm is currenly being tested. The genetic search method does not require the witness to verbalize a description of the target but merely to indicate a degree of similarity between the target and a limited selection of images from the database. The major drawback of FACES is that is requires a manual encoding of images. Research is being undertaken to automate the process, however, it will require an algorithm which can predict human descriptive values. Alternatives to human derived coding schemes exist using statistical classifications of images. Since databases encoded using statistical classifiers do not have an obvious direct mapping to human derived descriptors, a search method which does not require the entry of human descriptors is required. A genetic search algorithm is being tested for such a purpose.

  18. 7 CFR 51.1997 - Similar type.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar type. 51.1997 Section 51.1997 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards, Inspections, Marketing... Standards for Grades of Filberts in the Shell 1 Definitions § 51.1997 Similar type. Similar type means that...

  19. Molecular quantum similarity using conceptual DFT descriptors

    Indian Academy of Sciences (India)

    This paper reports a Molecular Quantum Similarity study for a set of congeneric steroid molecules, using as basic similarity descriptors electron density ρ (r), shape function (r), the Fukui functions +(r) and -(r) and local softness +(r) and -(r). Correlations are investigated between similarity indices for each couple of ...

  20. Towards improving searches on the NASA's Distributed Active Archive Centers (DAACs)

    Science.gov (United States)

    McGibbney, L. J.; Whitehall, K. D.; Ramapriyan, H.; Khalsa, S. J. S.; Lynnes, C.; Armstrong, E. M.

    2016-12-01

    NASA supports numerous observing missions to study the Earth, its interactions, and understand its changes. These missions generate heterogeneous data from a variety of sources including satellites and airborne platforms. NASA's Earth Observing System Data and Information System (EOSDIS) is the capability in NASA's Earth Science Data Systems Program responsible for the end-to-end management of these science data. More specifically, the EOSDIS Distributed Active Archive Centers (DAACs) are the key entities that maintain, distribute these data and provide related data services for the mission data associated with a given property of the Earth System e.g. PO.DAAC for physical oceanographic data, NSIDC DAAC for snow and ice data. As the volume, variety and velocity of Earth science data grow, users are focused on high veracity (i.e., data quality), and as their needs become more diverse, they find it more difficult to readily find the data that best suits their purposes. For instance, simple keyword searches on most DAAC holdings return many datasets of potential interest but which are unranked either based on the content of the query or the historical data usage. The Earth Science Data System Working Group (ESDSWG) on Search Relevance WG started in May 2015 to address these concerns. The mandates of the WG are: to characterize the term "search relevance" as it relates to EOSDIS; to assess the current implementations towards search relevance; and to determine how practices and standards in industry and other domains can be applied to DAACs - in a federated-sense - in order to effectively serve the Earth Science data consumers. This poster will present the WG's insights into user profiles and behaviors accessing the DAACs, identify the core areas essential to improve search relevance across the DAACs (individually and collectively), and highlight ongoing efforts within NASA and similar organizations towards search relevance.

  1. Data mining technique for fast retrieval of similar waveforms in Fusion massive databases

    Energy Technology Data Exchange (ETDEWEB)

    Vega, J. [Asociacion EURATOM/CIEMAT Para Fusion, Madrid (Spain)], E-mail: jesus.vega@ciemat.es; Pereira, A.; Portas, A. [Asociacion EURATOM/CIEMAT Para Fusion, Madrid (Spain); Dormido-Canto, S.; Farias, G.; Dormido, R.; Sanchez, J.; Duro, N. [Departamento de Informatica y Automatica, UNED, Madrid (Spain); Santos, M. [Departamento de Arquitectura de Computadores y Automatica, UCM, Madrid (Spain); Sanchez, E. [Asociacion EURATOM/CIEMAT Para Fusion, Madrid (Spain); Pajares, G. [Departamento de Arquitectura de Computadores y Automatica, UCM, Madrid (Spain)

    2008-01-15

    Fusion measurement systems generate similar waveforms for reproducible behavior. A major difficulty related to data analysis is the identification, in a rapid and automated way, of a set of discharges with comparable behaviour, i.e. discharges with 'similar' waveforms. Here we introduce a new technique for rapid searching and retrieval of 'similar' signals. The approach consists of building a classification system that avoids traversing the whole database looking for similarities. The classification system diminishes the problem dimensionality (by means of waveform feature extraction) and reduces the searching space to just the most probable 'similar' waveforms (clustering techniques). In the searching procedure, the input waveform is classified in any of the existing clusters. Then, a similarity measure is computed between the input signal and all cluster elements in order to identify the most similar waveforms. The inner product of normalized vectors is used as the similarity measure as it allows the searching process to be independent of signal gain and polarity. This development has been applied recently to TJ-II stellarator databases and has been integrated into its remote participation system.

  2. A New Method for Measuring Text Similarity in Learning Management Systems Using WordNet

    Science.gov (United States)

    Alkhatib, Bassel; Alnahhas, Ammar; Albadawi, Firas

    2014-01-01

    As text sources are getting broader, measuring text similarity is becoming more compelling. Automatic text classification, search engines and auto answering systems are samples of applications that rely on text similarity. Learning management systems (LMS) are becoming more important since electronic media is getting more publicly available. As…

  3. Using the Dual-Target Cost to Explore the Nature of Search Target Representations

    Science.gov (United States)

    Stroud, Michael J.; Menneer, Tamaryn; Cave, Kyle R.; Donnelly, Nick

    2012-01-01

    Eye movements were monitored to examine search efficiency and infer how color is mentally represented to guide search for multiple targets. Observers located a single color target very efficiently by fixating colors similar to the target. However, simultaneous search for 2 colors produced a dual-target cost. In addition, as the similarity between…

  4. Dual Target Search is Neither Purely Simultaneous nor Purely Successive.

    Science.gov (United States)

    Cave, Kyle R; Menneer, Tamaryn; Nomani, Mohammad S; Stroud, Michael J; Donnelly, Nick

    2017-08-31

    Previous research shows that visual search for two different targets is less efficient than search for a single target. Stroud, Menneer, Cave and Donnelly (2012) concluded that two target colours are represented separately based on modeling the fixation patterns. Although those analyses provide evidence for two separate target representations, they do not show whether participants search simultaneously for both targets, or first search for one target and then the other. Some studies suggest that multiple target representations are simultaneously active, while others indicate that search can be voluntarily simultaneous, or switching, or a mixture of both. Stroud et al.'s participants were not explicitly instructed to use any particular strategy. These data were revisited to determine which strategy was employed. Each fixated item was categorised according to whether its colour was more similar to one target or the other. Once an item similar to one target is fixated, the next fixated item is more likely to be similar to that target than the other, showing that at a given moment during search, one target is generally favoured. However, the search for one target is not completed before search for the other begins. Instead, there are often short runs of one or two fixations to distractors similar to one target, with each run followed by a switch to the other target. Thus, the results suggest that one target is more highly weighted than the other at any given time, but not to the extent that search is purely successive.

  5. Modified harmony search

    Science.gov (United States)

    Mohamed, Najihah; Lutfi Amri Ramli, Ahmad; Majid, Ahmad Abd; Piah, Abd Rahni Mt

    2017-09-01

    A metaheuristic algorithm, called Harmony Search is quite highly applied in optimizing parameters in many areas. HS is a derivative-free real parameter optimization algorithm, and draws an inspiration from the musical improvisation process of searching for a perfect state of harmony. Propose in this paper Modified Harmony Search for solving optimization problems, which employs a concept from genetic algorithm method and particle swarm optimization for generating new solution vectors that enhances the performance of HS algorithm. The performances of MHS and HS are investigated on ten benchmark optimization problems in order to make a comparison to reflect the efficiency of the MHS in terms of final accuracy, convergence speed and robustness.

  6. ElasticSearch server

    CERN Document Server

    Rogozinski, Marek

    2014-01-01

    This book is a detailed, practical, hands-on guide packed with real-life scenarios and examples which will show you how to implement an ElasticSearch search engine on your own websites.If you are a web developer or a user who wants to learn more about ElasticSearch, then this is the book for you. You do not need to know anything about ElastiSeach, Java, or Apache Lucene in order to use this book, though basic knowledge about databases and queries is required.

  7. Entropy, Search, Complexity

    CERN Document Server

    Katona, Gyula O H; Tardos, Gábor

    2007-01-01

    The present volume is a collection of survey papers in the fields of entropy, search and complexity. They summarize the latest developments in their respective areas. More than half of the papers belong to search theory which lies on the borderline of mathematics and computer science, information theory and combinatorics, respectively. Search theory has variegated applications, among others in bioinformatics. Some of these papers also have links to linear statistics and communicational complexity. Further works survey the fundamentals of information theory and quantum source coding. The volume is recommended to experienced researchers as well as young scientists and students both in mathematics and computer science

  8. Efficient Set Similarity Joins Using Min-prefixes

    Science.gov (United States)

    Ribeiro, Leonardo A.; Härder, Theo

    Identification of all objects in a dataset whose similarity is not less than a specified threshold is of major importance for management, search, and analysis of data. Set similarity joins are commonly used to implement this operation; they scale to large datasets and are versatile to represent a variety of similarity notions. Most set similarity join methods proposed so far present two main phases at a high level of abstraction: candidate generation producing a set of candidate pairs and verification applying the actual similarity measure to the candidates and returning the correct answer. Previous work has primarily focused on the reduction of candidates, where candidate generation presented the major effort to obtain better pruning results. Here, we propose an opposite approach. We drastically decrease the computational cost of candidate generation by dynamically reducing the number of indexed objects at the expense of increasing the workload of the verification phase. Our experimental findings show that this trade-off is advantageous: we consistently achieve substantial speed-ups as compared to previous algorithms.

  9. A Quantum-Based Similarity Method in Virtual Screening

    Directory of Open Access Journals (Sweden)

    Mohammed Mumtaz Al-Dabbagh

    2015-10-01

    Full Text Available One of the most widely-used techniques for ligand-based virtual screening is similarity searching. This study adopted the concepts of quantum mechanics to present as state-of-the-art similarity method of molecules inspired from quantum theory. The representation of molecular compounds in mathematical quantum space plays a vital role in the development of quantum-based similarity approach. One of the key concepts of quantum theory is the use of complex numbers. Hence, this study proposed three various techniques to embed and to re-represent the molecular compounds to correspond with complex numbers format. The quantum-based similarity method that developed in this study depending on complex pure Hilbert space of molecules called Standard Quantum-Based (SQB. The recall of retrieved active molecules were at top 1% and top 5%, and significant test is used to evaluate our proposed methods. The MDL drug data report (MDDR, maximum unbiased validation (MUV and Directory of Useful Decoys (DUD data sets were used for experiments and were represented by 2D fingerprints. Simulated virtual screening experiment show that the effectiveness of SQB method was significantly increased due to the role of representational power of molecular compounds in complex numbers forms compared to Tanimoto benchmark similarity measure.

  10. Application II: Search Arts and Sports Photography

    Indian Academy of Sciences (India)

    First page Back Continue Last page Overview Graphics. Application II: Search Arts and Sports Photography. Depth variation induces defocus. Defocus blur is severe in nature and sports photography. Given a low DOF image, segment the focused region and retrieve the images based similarities in the focused region.

  11. Web Search Personalization Via Social Bookmarking and Tagging

    Science.gov (United States)

    Noll, Michael G.; Meinel, Christoph

    In this paper, we present a new approach to web search personalization based on user collaboration and sharing of information about web documents. The proposed personalization technique separates data collection and user profiling from the information system whose contents and indexed documents are being searched for, i.e. the search engines, and uses social bookmarking and tagging to re-rank web search results. It is independent of the search engine being used, so users are free to choose the one they prefer, even if their favorite search engine does not natively support personalization. We show how to design and implement such a system in practice and investigate its feasibility and usefulness with large sets of real-word data and a user study.

  12. Search Engine Bias and the Demise of Search Engine Utopianism

    Science.gov (United States)

    Goldman, E.

    Due to search engines' automated operations, people often assume that search engines display search results neutrally and without bias. However, this perception is mistaken. Like any other media company, search engines affirmatively control their users' experiences, which has the consequence of skewing search results (a phenomenon called "search engine bias"). Some commentators believe that search engine bias is a defect requiring legislative correction. Instead, this chapter argues that search engine bias is the beneficial consequence of search engines optimizing content for their users. The chapter further argues that the most problematic aspect of search engine bias, the "winner-take-all" effect caused by top placement in search results, will be mooted by emerging personalized search technology.

  13. Contextual factors for finding similar experts

    NARCIS (Netherlands)

    Hofmann, K.; Balog, K.; Bogers, T.; de Rijke, M.

    2010-01-01

    Expertise-seeking research studies how people search for expertise and choose whom to contact in the context of a specific task. An important outcome are models that identify factors that influence expert finding. Expertise retrieval addresses the same problem, expert finding, but from a

  14. A fingerprint based metric for measuring similarities of crystalline structures.

    Science.gov (United States)

    Zhu, Li; Amsler, Maximilian; Fuhrer, Tobias; Schaefer, Bastian; Faraji, Somayeh; Rostami, Samare; Ghasemi, S Alireza; Sadeghi, Ali; Grauzinyte, Migle; Wolverton, Chris; Goedecker, Stefan

    2016-01-21

    Measuring similarities/dissimilarities between atomic structures is important for the exploration of potential energy landscapes. However, the cell vectors together with the coordinates of the atoms, which are generally used to describe periodic systems, are quantities not directly suitable as fingerprints to distinguish structures. Based on a characterization of the local environment of all atoms in a cell, we introduce crystal fingerprints that can be calculated easily and define configurational distances between crystalline structures that satisfy the mathematical properties of a metric. This distance between two configurations is a measure of their similarity/dissimilarity and it allows in particular to distinguish structures. The new method can be a useful tool within various energy landscape exploration schemes, such as minima hopping, random search, swarm intelligence algorithms, and high-throughput screenings.

  15. A fingerprint based metric for measuring similarities of crystalline structures

    Energy Technology Data Exchange (ETDEWEB)

    Zhu, Li; Fuhrer, Tobias; Schaefer, Bastian; Grauzinyte, Migle; Goedecker, Stefan, E-mail: stefan.goedecker@unibas.ch [Department of Physics, Universität Basel, Klingelbergstr. 82, 4056 Basel (Switzerland); Amsler, Maximilian [Department of Physics, Universität Basel, Klingelbergstr. 82, 4056 Basel (Switzerland); Department of Materials Science and Engineering, Northwestern University, Evanston, Illinois 60208 (United States); Faraji, Somayeh; Rostami, Samare; Ghasemi, S. Alireza [Institute for Advanced Studies in Basic Sciences, P.O. Box 45195-1159, Zanjan (Iran, Islamic Republic of); Sadeghi, Ali [Physics Department, Shahid Beheshti University, G. C., Evin, 19839 Tehran (Iran, Islamic Republic of); Wolverton, Chris [Department of Materials Science and Engineering, Northwestern University, Evanston, Illinois 60208 (United States)

    2016-01-21

    Measuring similarities/dissimilarities between atomic structures is important for the exploration of potential energy landscapes. However, the cell vectors together with the coordinates of the atoms, which are generally used to describe periodic systems, are quantities not directly suitable as fingerprints to distinguish structures. Based on a characterization of the local environment of all atoms in a cell, we introduce crystal fingerprints that can be calculated easily and define configurational distances between crystalline structures that satisfy the mathematical properties of a metric. This distance between two configurations is a measure of their similarity/dissimilarity and it allows in particular to distinguish structures. The new method can be a useful tool within various energy landscape exploration schemes, such as minima hopping, random search, swarm intelligence algorithms, and high-throughput screenings.

  16. Infrared image super-resolution via transformed self-similarity

    Science.gov (United States)

    Qi, Wei; Han, Jing; Zhang, Yi; Bai, Lian-fa

    2017-03-01

    Single image super-resolution is of great importance in computer vision. Various methods (e.g. learning methods) have been successfully developed in recent years. Despite the demonstrated success in the natural images, less research focuses on the infrared images. In this paper, we present a transformed self-similarity based super-resolution method without any learning priors, restore high-resolution infrared images from low-resolution ones. We exploit appearance similarity, dense error, and region covariances, and use the detected cues to guide the patch search process. We also add scale cue to consider local scale variations. We then present a compositional framework to simultaneously accommodate the four different cues. Experimental results demonstrate that our method performs better than previous methods, restores pleasant results, and high evaluate scores further show the effectiveness and robustness of our method for the infrared images.

  17. A fingerprint based metric for measuring similarities of crystalline structures

    Science.gov (United States)

    Zhu, Li; Amsler, Maximilian; Fuhrer, Tobias; Schaefer, Bastian; Faraji, Somayeh; Rostami, Samare; Ghasemi, S. Alireza; Sadeghi, Ali; Grauzinyte, Migle; Wolverton, Chris; Goedecker, Stefan

    2016-01-01

    Measuring similarities/dissimilarities between atomic structures is important for the exploration of potential energy landscapes. However, the cell vectors together with the coordinates of the atoms, which are generally used to describe periodic systems, are quantities not directly suitable as fingerprints to distinguish structures. Based on a characterization of the local environment of all atoms in a cell, we introduce crystal fingerprints that can be calculated easily and define configurational distances between crystalline structures that satisfy the mathematical properties of a metric. This distance between two configurations is a measure of their similarity/dissimilarity and it allows in particular to distinguish structures. The new method can be a useful tool within various energy landscape exploration schemes, such as minima hopping, random search, swarm intelligence algorithms, and high-throughput screenings.

  18. Chemical Search Web Utility

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Chemical Search Web Utility is an intuitive web application that allows the public to easily find the chemical that they are interested in using, and which...

  19. Searches at LEP

    CERN Document Server

    Junk, Tom

    2000-01-01

    Searches have been conducted for a broad range of new phenomena by the four experiments ALEPH, DELPHI, L3, and OPAL, at LEP2. Each experiment contributes approximately 150 pb-1 of e+e- annihilation data with a mean sqrt(s) of 205.9 GeV in 2000 to these searches (data prepared for the September 5 LEPC meeting). The statistical procedure for setting limits and evaluating the significance of excesses observed in the data is reviewed. Search results are presented for the Standard Model Higgs boson, the neutral Higgs bosons in the MSSM, charged Higgs bosons, invisibly decaying Higgs bosons produced by Higgs-strahlung, and fermiophobic Higgs bosons. Search results are briefly summarized for gauginos, stops, and staus. The photon recoil spectrum is checked for hints of new physics.

  20. Beyond the search process

    DEFF Research Database (Denmark)

    Hyldegård, Jette

    2009-01-01

    This paper reports on the findings from a longitudinal case study exploring Kuhlthau's information search process (ISP)-model in a group based academic setting. The research focus is on group members' activities and cognitive and emotional experiences during the task process of writing an assignm......This paper reports on the findings from a longitudinal case study exploring Kuhlthau's information search process (ISP)-model in a group based academic setting. The research focus is on group members' activities and cognitive and emotional experiences during the task process of writing...... seeking, the cognitive pattern associated with focus formulation and the tendency towards an increase in writing activities while searching activities decreased. Differences in behavior were also found, which were associated with contextual and social factors beyond the mere search process...

  1. Supersymmetry searches in ATLAS

    CERN Document Server

    Torro Pastor, Emma; The ATLAS collaboration

    2016-01-01

    Weak scale supersymmetry remains one of the best motivated and studied Standard Model extensions. This talk summarises recent ATLAS results for searches for supersymmetric (SUSY) particles. Weak and strong production in both R-Parity conserving and R-Parity violating SUSY scenarios are considered. The searches involved final states including jets, missing transverse momentum, light leptons, taus or photons, as well as long-lived particle signatures.

  2. Accelerated Evidence Search Report

    Science.gov (United States)

    2014-07-01

    Accelerated Evidence Search Report IMPORTANT INFORMATIVE STATEMENTS Accelerated Multi-Camera Evidence Search and Retrieval CSSP Project #: CSSP -2013...CD-1063 was supported by the Canadian Safety and Security Program ( CSSP ) which is led by Defence Research and Development Canada’s Centre for...Border Technology Division The CSSP is a federally-funded program to strengthen Canada’s ability to anticipate, prevent/mitigate, prepare for, respond to

  3. Testing Self-Similarity Through Lamperti Transformations

    KAUST Repository

    Lee, Myoungji

    2016-07-14

    Self-similar processes have been widely used in modeling real-world phenomena occurring in environmetrics, network traffic, image processing, and stock pricing, to name but a few. The estimation of the degree of self-similarity has been studied extensively, while statistical tests for self-similarity are scarce and limited to processes indexed in one dimension. This paper proposes a statistical hypothesis test procedure for self-similarity of a stochastic process indexed in one dimension and multi-self-similarity for a random field indexed in higher dimensions. If self-similarity is not rejected, our test provides a set of estimated self-similarity indexes. The key is to test stationarity of the inverse Lamperti transformations of the process. The inverse Lamperti transformation of a self-similar process is a strongly stationary process, revealing a theoretical connection between the two processes. To demonstrate the capability of our test, we test self-similarity of fractional Brownian motions and sheets, their time deformations and mixtures with Gaussian white noise, and the generalized Cauchy family. We also apply the self-similarity test to real data: annual minimum water levels of the Nile River, network traffic records, and surface heights of food wrappings. © 2016, International Biometric Society.

  4. Similarity increases altruistic punishment in humans.

    Science.gov (United States)

    Mussweiler, Thomas; Ockenfels, Axel

    2013-11-26

    Humans are attracted to similar others. As a consequence, social networks are homogeneous in sociodemographic, intrapersonal, and other characteristics--a principle called homophily. Despite abundant evidence showing the importance of interpersonal similarity and homophily for human relationships, their behavioral correlates and cognitive foundations are poorly understood. Here, we show that perceived similarity substantially increases altruistic punishment, a key mechanism underlying human cooperation. We induced (dis)similarity perception by manipulating basic cognitive mechanisms in an economic cooperation game that included a punishment phase. We found that similarity-focused participants were more willing to punish others' uncooperative behavior. This influence of similarity is not explained by group identity, which has the opposite effect on altruistic punishment. Our findings demonstrate that pure similarity promotes reciprocity in ways known to encourage cooperation. At the same time, the increased willingness to punish norm violations among similarity-focused participants provides a rationale for why similar people are more likely to build stable social relationships. Finally, our findings show that altruistic punishment is differentially involved in encouraging cooperation under pure similarity vs. in-group conditions.

  5. Supporting Book Search

    DEFF Research Database (Denmark)

    Bogers, Toine; Petras, Vivien

    2017-01-01

    Book search is far from a solved problem. Complex information needs often go beyond bibliographic facts and cover a combination of different aspects, such as specific genres or plot elements, engagement or novelty. Conventional book metadata may not be sufficient to address these kinds of informa......Book search is far from a solved problem. Complex information needs often go beyond bibliographic facts and cover a combination of different aspects, such as specific genres or plot elements, engagement or novelty. Conventional book metadata may not be sufficient to address these kinds......-depth analysis of topical metadata, comparing controlled vocabularies with social tags. Tags perform better overall in this setting, but controlled vocabulary terms provide complementary information, which will improve a search. We analyze potential underlying factors that contribute to search performance......, such as the relevance aspect(s) mentioned in a request or the type of book. In addition, we investigate the possible causes of search failure. We conclude that neither tags nor controlled vocabularies are wholly suited to handling the complex information needs in book search, which means that different approaches...

  6. Similarity boosted quantitative structure-activity relationship--a systematic study of enhancing structural descriptors by molecular similarity.

    Science.gov (United States)

    Girschick, Tobias; Almeida, Pedro R; Kramer, Stefan; Stålring, Jonna

    2013-05-24

    The concept of molecular similarity is one of the most central in the fields of predictive toxicology and quantitative structure-activity relationship (QSAR) research. Many toxicological responses result from a multimechanistic process and, consequently, structural diversity among the active compounds is likely. Combining this knowledge, we introduce similarity boosted QSAR modeling, where we calculate molecular descriptors using similarities with respect to representative reference compounds to aid a statistical learning algorithm in distinguishing between different structural classes. We present three approaches for the selection of reference compounds, one by literature search and two by clustering. Our experimental evaluation on seven publicly available data sets shows that the similarity descriptors used on their own perform quite well compared to structural descriptors. We show that the combination of similarity and structural descriptors enhances the performance and that a simple stacking approach is able to use the complementary information encoded by the different descriptor sets to further improve predictive results. All software necessary for our experiments is available within the cheminformatics software framework AZOrange.

  7. User Oriented Trajectory Search for Trip Recommendation

    KAUST Repository

    Ding, Ruogu

    2012-07-08

    Trajectory sharing and searching have received significant attention in recent years. In this thesis, we propose and investigate the methods to find and recommend the best trajectory to the traveler, and mainly focus on a novel technique named User Oriented Trajectory Search (UOTS) query processing. In contrast to conventional trajectory search by locations (spatial domain only), we consider both spatial and textual domains in the new UOTS query. Given a trajectory data set, the query input contains a set of intended places given by the traveler and a set of textual attributes describing the traveler’s preference. If a trajectory is connecting/close to the specified query locations, and the textual attributes of the trajectory are similar to the traveler’s preference, it will be recommended to the traveler. This type of queries can enable many popular applications such as trip planning and recommendation. There are two challenges in UOTS query processing, (i) how to constrain the searching range in two domains and (ii) how to schedule multiple query sources effectively. To overcome the challenges and answer the UOTS query efficiently, a novel collaborative searching approach is developed. Conceptually, the UOTS query processing is conducted in the spatial and textual domains alternately. A pair of upper and lower bounds are devised to constrain the searching range in two domains. In the meantime, a heuristic searching strategy based on priority ranking is adopted for scheduling the multiple query sources, which can further reduce the searching range and enhance the query efficiency notably. Furthermore, the devised collaborative searching approach can be extended to situations where the query locations are ordered. Extensive experiments are conducted on both real and synthetic trajectory data in road networks. Our approach is verified to be effective in reducing both CPU time and disk I/O time.

  8. User oriented trajectory search for trip recommendation

    KAUST Repository

    Shang, Shuo

    2012-01-01

    Trajectory sharing and searching have received significant attentions in recent years. In this paper, we propose and investigate a novel problem called User Oriented Trajectory Search (UOTS) for trip recommendation. In contrast to conventional trajectory search by locations (spatial domain only), we consider both spatial and textual domains in the new UOTS query. Given a trajectory data set, the query input contains a set of intended places given by the traveler and a set of textual attributes describing the traveler\\'s preference. If a trajectory is connecting/close to the specified query locations, and the textual attributes of the trajectory are similar to the traveler\\'e preference, it will be recommended to the traveler for reference. This type of queries can bring significant benefits to travelers in many popular applications such as trip planning and recommendation. There are two challenges in the UOTS problem, (i) how to constrain the searching range in two domains and (ii) how to schedule multiple query sources effectively. To overcome the challenges and answer the UOTS query efficiently, a novel collaborative searching approach is developed. Conceptually, the UOTS query processing is conducted in the spatial and textual domains alternately. A pair of upper and lower bounds are devised to constrain the searching range in two domains. In the meantime, a heuristic searching strategy based on priority ranking is adopted for scheduling the multiple query sources, which can further reduce the searching range and enhance the query efficiency notably. Furthermore, the devised collaborative searching approach can be extended to situations where the query locations are ordered. The performance of the proposed UOTS query is verified by extensive experiments based on real and synthetic trajectory data in road networks. © 2012 ACM.

  9. Routine development of objectively derived search strategies

    Directory of Open Access Journals (Sweden)

    Hausner Elke

    2012-02-01

    Full Text Available Abstract Background Over the past few years, information retrieval has become more and more professionalized, and information specialists are considered full members of a research team conducting systematic reviews. Research groups preparing systematic reviews and clinical practice guidelines have been the driving force in the development of search strategies, but open questions remain regarding the transparency of the development process and the available resources. An empirically guided approach to the development of a search strategy provides a way to increase transparency and efficiency. Methods Our aim in this paper is to describe the empirically guided development process for search strategies as applied by the German Institute for Quality and Efficiency in Health Care (Institut für Qualität und Wirtschaftlichkeit im Gesundheitswesen, or "IQWiG". This strategy consists of the following steps: generation of a test set, as well as the development, validation and standardized documentation of the search strategy. Results We illustrate our approach by means of an example, that is, a search for literature on brachytherapy in patients with prostate cancer. For this purpose, a test set was generated, including a total of 38 references from 3 systematic reviews. The development set for the generation of the strategy included 25 references. After application of textual analytic procedures, a strategy was developed that included all references in the development set. To test the search strategy on an independent set of references, the remaining 13 references in the test set (the validation set were used. The validation set was also completely identified. Discussion Our conclusion is that an objectively derived approach similar to that used in search filter development is a feasible way to develop and validate reliable search strategies. Besides creating high-quality strategies, the widespread application of this approach will result in a

  10. Forensic utilization of familial searches in DNA databases.

    Science.gov (United States)

    Gershaw, Cassandra J; Schweighardt, Andrew J; Rourke, Linda C; Wallace, Margaret M

    2011-01-01

    DNA evidence is widely recognized as an invaluable tool in the process of investigation and identification, as well as one of the most sought after types of evidence for presentation to a jury. In the United States, the development of state and federal DNA databases has greatly impacted the forensic community by creating an efficient, searchable system that can be used to eliminate or include suspects in an investigation based on matching DNA profiles - the profile already in the database to the profile of the unknown sample in evidence. Recent changes in legislation have begun to allow for the possibility to expand the parameters of DNA database searches, taking into account the possibility of familial searches. This article discusses prospective positive outcomes of utilizing familial DNA searches and acknowledges potential negative outcomes, thereby presenting both sides of this very complicated, rapidly evolving situation. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.

  11. Search Tips: MedlinePlus

    Science.gov (United States)

    ... of this page: https://medlineplus.gov/searchtips.html Search Tips To use the sharing features on this page, please enable JavaScript. How do I search MedlinePlus? The search box appears at the top ...

  12. Axion Searches, Old and New

    CERN Multimedia

    CERN. Geneva

    2005-01-01

    Outline of the lecture: Constraints from laboratory searches and astrophysics, axion cosmology, the cavity detector of dark matter axions, solar axion searches, laser experiments, a telescope search, macroscopic forces mediated by axions.

  13. MOST: most-similar ligand based approach to target prediction.

    Science.gov (United States)

    Huang, Tao; Mi, Hong; Lin, Cheng-Yuan; Zhao, Ling; Zhong, Linda L D; Liu, Feng-Bin; Zhang, Ge; Lu, Ai-Ping; Bian, Zhao-Xiang

    2017-03-11

    Many computational approaches have been used for target prediction, including machine learning, reverse docking, bioactivity spectra analysis, and chemical similarity searching. Recent studies have suggested that chemical similarity searching may be driven by the most-similar ligand. However, the extent of bioactivity of most-similar ligands has been oversimplified or even neglected in these studies, and this has impaired the prediction power. Here we propose the MOst-Similar ligand-based Target inference approach, namely MOST, which uses fingerprint similarity and explicit bioactivity of the most-similar ligands to predict targets of the query compound. Performance of MOST was evaluated by using combinations of different fingerprint schemes, machine learning methods, and bioactivity representations. In sevenfold cross-validation with a benchmark Ki dataset from CHEMBL release 19 containing 61,937 bioactivity data of 173 human targets, MOST achieved high average prediction accuracy (0.95 for pKi ≥ 5, and 0.87 for pKi ≥ 6). Morgan fingerprint was shown to be slightly better than FP2. Logistic Regression and Random Forest methods performed better than Naïve Bayes. In a temporal validation, the Ki dataset from CHEMBL19 were used to train models and predict the bioactivity of newly deposited ligands in CHEMBL20. MOST also performed well with high accuracy (0.90 for pKi ≥ 5, and 0.76 for pKi ≥ 6), when Logistic Regression and Morgan fingerprint were employed. Furthermore, the p values associated with explicit bioactivity were found be a robust index for removing false positive predictions. Implicit bioactivity did not offer this capability. Finally, p values generated with Logistic Regression, Morgan fingerprint and explicit activity were integrated with a false discovery rate (FDR) control procedure to reduce false positives in multiple-target prediction scenario, and the success of this strategy it was demonstrated with a case of fluanisone

  14. Bridging Database Applications and Declarative Similarity Matching

    OpenAIRE

    Andrade Ribeiro, Leonardo; Schneider, Natália Cristina; de Souza Inácio, Andrei; Wagner, Harley Michel; von Wangenheim, Aldo

    2017-01-01

    Effective manipulation of string data is of fundamental importance to modern database applications. Very often, textual inconsistencies render equality comparisons meaningless and strings have to be matched in terms of their similarity. Previous work has proposed techniques to express similarity operations using declarative SQL statements. However, the non-trivial issue of embedding similarity support into object-oriented applications has received little attention. Particularly, declarative s...

  15. Integrating Triangle and Jaccard similarities for recommendation

    Science.gov (United States)

    Sun, Shuang-Bo; Zhang, Zhi-Heng; Dong, Xin-Ling; Zhang, Heng-Ru; Li, Tong-Jun; Zhang, Lin

    2017-01-01

    This paper proposes a new measure for recommendation through integrating Triangle and Jaccard similarities. The Triangle similarity considers both the length and the angle of rating vectors between them, while the Jaccard similarity considers non co-rating users. We compare the new similarity measure with eight state-of-the-art ones on four popular datasets under the leave-one-out scenario. Results show that the new measure outperforms all the counterparts in terms of the mean absolute error and the root mean square error. PMID:28817692

  16. Genome-wide search for eliminylating domains reveals novel function for BLES03-like proteins.

    Science.gov (United States)

    Khater, Shradha; Mohanty, Debasisa

    2014-07-24

    Bacterial phosphothreonine lyases catalyze a novel posttranslational modification involving formation of dehydrobutyrine/dehyroalanine by β elimination of the phosphate group of phosphothreonine or phosphoserine residues in their substrate proteins. Though there is experimental evidence for presence of dehydro amino acids in human proteins, no eukaryotic homologs of these lyases have been identified as of today. A comprehensive genome-wide search for identifying phosphothreonine lyase homologs in eukaryotes was carried out. Our fold-based search revealed structural and catalytic site similarity between bacterial phosphothreonine lyases and BLES03 (basophilic leukemia-expressed protein 03), a human protein with unknown function. Ligand induced conformational changes similar to bacterial phosphothreonine lyases, and movement of crucial arginines in the loop region to the catalytic pocket upon binding of phosphothreonine-containing peptides was seen during docking and molecular dynamics studies. Genome-wide search for BLES03 homologs using sensitive profile-based methods revealed their presence not only in eukaryotic classes such as chordata and fungi but also in bacterial and archaebacterial classes. The synteny of these archaebacterial BLES03-like proteins was remarkably similar to that of type IV lantibiotic synthetases which harbor LanL-like phosphothreonine lyase domains. Hence, context-based analysis reinforced our earlier sequence/structure-based prediction of phosphothreonine lyase catalytic function for BLES03. Our in silico analysis has revealed that BLES03-like proteins with previously unknown function are novel eukaryotic phosphothreonine lyases involved in biosynthesis of dehydro amino acids, whereas their bacterial and archaebacterial counterparts might be involved in biosynthesis of natural products similar to lantibiotics. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  17. An advanced search engine for patent analytics in medicinal chemistry.

    Science.gov (United States)

    Pasche, Emilie; Gobeill, Julien; Teodoro, Douglas; Gaudinat, Arnaud; Vishnykova, Dina; Lovis, Christian; Ruch, Patrick

    2012-01-01

    Patent collections contain an important amount of medical-related knowledge, but existing tools were reported to lack of useful functionalities. We present here the development of TWINC, an advanced search engine dedicated to patent retrieval in the domain of health and life sciences. Our tool embeds two search modes: an ad hoc search to retrieve relevant patents given a short query and a related patent search to retrieve similar patents given a patent. Both search modes rely on tuning experiments performed during several patent retrieval competitions. Moreover, TWINC is enhanced with interactive modules, such as chemical query expansion, which is of prior importance to cope with various ways of naming biomedical entities. While the related patent search showed promising performances, the ad-hoc search resulted in fairly contrasted results. Nonetheless, TWINC performed well during the Chemathlon task of the PatOlympics competition and experts appreciated its usability.

  18. SEARCHES FOR SUPERSYMMETRY IN ATLAS

    CERN Document Server

    Xu, Da; The ATLAS collaboration

    2017-01-01

    A wide range of supersymmetric searches are presented. All searches are based on the proton- proton collision dataset collected by the ATLAS experiment during the 2015 and 2016 (before summer) run with a center-of-mass energy of 13 TeV, corresponding to an integrated lumi- nosity of 36.1 (36.7) fb-1. The searches are categorized into inclusive gluino and squark search, third generation search, electroweak search, prompt RPV search and long-lived par- ticle search. No evidence of new physics is observed. The results are intepreted in various models and expressed in terms of limits on the masses of new particles.

  19. An artificial functional family filter in homolog searching in next-generation sequencing metagenomics.

    Directory of Open Access Journals (Sweden)

    Ruofei Du

    Full Text Available In functional metagenomics, BLAST homology search is a common method to classify metagenomic reads into protein/domain sequence families such as Clusters of Orthologous Groups of proteins (COGs in order to quantify the abundance of each COG in the community. The resulting functional profile of the community is then used in downstream analysis to correlate the change in abundance to environmental perturbation, clinical variation, and so on. However, the short read length coupled with next-generation sequencing technologies poses a barrier in this approach, essentially because similarity significance cannot be discerned by searching with short reads. Consequently, artificial functional families are produced, in which those with a large number of reads assigned decreases the accuracy of functional profile dramatically. There is no method available to address this problem. We intended to fill this gap in this paper. We revealed that BLAST similarity scores of homologues for short reads from COG protein members coding sequences are distributed differently from the scores of those derived elsewhere. We showed that, by choosing an appropriate score cut-off, we are able to filter out most artificial families and simultaneously to preserve sufficient information in order to build the functional profile. We also showed that, by incorporated application of BLAST and RPS-BLAST, some artificial families with large read counts can be further identified after the score cutoff filtration. Evaluated on three experimental metagenomic datasets with different coverages, we found that the proposed method is robust against read coverage and consistently outperforms the other E-value cutoff methods currently used in literatures.

  20. Scaling, similarity, and the fourth paradigm for hydrology

    Directory of Open Access Journals (Sweden)

    C. D. Peters-Lidard

    2017-07-01

    Full Text Available In this synthesis paper addressing hydrologic scaling and similarity, we posit that roadblocks in the search for universal laws of hydrology are hindered by our focus on computational simulation (the third paradigm and assert that it is time for hydrology to embrace a fourth paradigm of data-intensive science. Advances in information-based hydrologic science, coupled with an explosion of hydrologic data and advances in parameter estimation and modeling, have laid the foundation for a data-driven framework for scrutinizing hydrological scaling and similarity hypotheses. We summarize important scaling and similarity concepts (hypotheses that require testing; describe a mutual information framework for testing these hypotheses; describe boundary condition, state, flux, and parameter data requirements across scales to support testing these hypotheses; and discuss some challenges to overcome while pursuing the fourth hydrological paradigm. We call upon the hydrologic sciences community to develop a focused effort towards adopting the fourth paradigm and apply this to outstanding challenges in scaling and similarity.

  1. Monte-Carlo Tree Search for Poly-Y

    NARCIS (Netherlands)

    Wevers, L.; te Brinke, Steven

    2014-01-01

    Monte-Carlo tree search (MCTS) is a heuristic search algorithm that has recently been very successful in the games of Go and Hex. In this paper, we describe an MCTS player for the game of Poly-Y, which is a connection game similar to Hex. Our player won the CodeCup 2014 AI programming competition.

  2. MPI Profiling

    Energy Technology Data Exchange (ETDEWEB)

    Han, D K; Jones, T R

    2005-02-11

    The Message Passing Interface (MPI) is the de facto message-passing standard for massively parallel programs. It is often the case that application performance is a crucial factor, especially for solving grand challenge problems. While there have been many studies on the scalability of applications, there have not been many focusing on the specific types of MPI calls being made and their impact on application performance. Using a profiling tool called mpiP, a large spectrum of parallel scientific applications were surveyed and their performance results analyzed.

  3. Learning deep similarity in fundus photography

    Science.gov (United States)

    Chudzik, Piotr; Al-Diri, Bashir; Caliva, Francesco; Ometto, Giovanni; Hunter, Andrew

    2017-02-01

    Similarity learning is one of the most fundamental tasks in image analysis. The ability to extract similar images in the medical domain as part of content-based image retrieval (CBIR) systems has been researched for many years. The vast majority of methods used in CBIR systems are based on hand-crafted feature descriptors. The approximation of a similarity mapping for medical images is difficult due to the big variety of pixel-level structures of interest. In fundus photography (FP) analysis, a subtle difference in e.g. lesions and vessels shape and size can result in a different diagnosis. In this work, we demonstrated how to learn a similarity function for image patches derived directly from FP image data without the need of manually designed feature descriptors. We used a convolutional neural network (CNN) with a novel architecture adapted for similarity learning to accomplish this task. Furthermore, we explored and studied multiple CNN architectures. We show that our method can approximate the similarity between FP patches more efficiently and accurately than the state-of- the-art feature descriptors, including SIFT and SURF using a publicly available dataset. Finally, we observe that our approach, which is purely data-driven, learns that features such as vessels calibre and orientation are important discriminative factors, which resembles the way how humans reason about similarity. To the best of authors knowledge, this is the first attempt to approximate a visual similarity mapping in FP.

  4. Mining Diagnostic Assessment Data for Concept Similarity

    Science.gov (United States)

    Madhyastha, Tara; Hunt, Earl

    2009-01-01

    This paper introduces a method for mining multiple-choice assessment data for similarity of the concepts represented by the multiple choice responses. The resulting similarity matrix can be used to visualize the distance between concepts in a lower-dimensional space. This gives an instructor a visualization of the relative difficulty of concepts…

  5. Similarity Structure of Wave-Collapse

    DEFF Research Database (Denmark)

    Rypdal, Kristoffer; Juul Rasmussen, Jens; Thomsen, Kenneth

    1985-01-01

    Similarity transformations of the cubic Schrödinger equation (CSE) are investigated. The transformations are used to remove the explicit time variation in the CSE and reduce it to differential equations in the spatial variables only. Two different methods for similarity reduction are employed and...

  6. Similarity indices I: what do they measure.

    Energy Technology Data Exchange (ETDEWEB)

    Johnston, J.W.

    1976-11-01

    A method for estimating the effects of environmental effusions on ecosystems is described. The characteristics of 25 similarity indices used in studies of ecological communities were investigated. The type of data structure, to which these indices are frequently applied, was described as consisting of vectors of measurements on attributes (species) observed in a set of samples. A general similarity index was characterized as the result of a two-step process defined on a pair of vectors. In the first step an attribute similarity score is obtained for each attribute by comparing the attribute values observed in the pair of vectors. The result is a vector of attribute similarity scores. These are combined in the second step to arrive at the similarity index. The operation in the first step was characterized as a function, g, defined on pairs of attribute values. The second operation was characterized as a function, F, defined on the vector of attribute similarity scores from the first step. Usually, F was a simple sum or weighted sum of the attribute similarity scores. It is concluded that similarity indices should not be used as the test statistic to discriminate between two ecological communities.

  7. Some Effects of Similarity Self-Disclosure

    Science.gov (United States)

    Murphy, Kevin C.; Strong, Stanley R.

    1972-01-01

    College males were interviewed about how college had altered their friendships, values, and plans. The interviewers diclosed experiences and feelings similar to those revealed by the students. Results support Byrne's Law of Similarity in generating interpersonal attraction in the interview and suggest that the timing of self-disclosures is…

  8. An Estimation of Turbulent Kinetic Energy and Energy Dissipation Rate Based on Atmospheric Boundary Layer Similarity Theory

    Science.gov (United States)

    Han, Jongil; Arya, S. Pal; Shaohua, Shen; Lin, Yuh-Lang; Proctor, Fred H. (Technical Monitor)

    2000-01-01

    Algorithms are developed to extract atmospheric boundary layer profiles for turbulence kinetic energy (TKE) and energy dissipation rate (EDR), with data from a meteorological tower as input. The profiles are based on similarity theory and scalings for the atmospheric boundary layer. The calculated profiles of EDR and TKE are required to match the observed values at 5 and 40 m. The algorithms are coded for operational use and yield plausible profiles over the diurnal variation of the atmospheric boundary layer.

  9. Branch length similarity entropy-based descriptors for shape representation

    Science.gov (United States)

    Kwon, Ohsung; Lee, Sang-Hee

    2017-11-01

    In previous studies, we showed that the branch length similarity (BLS) entropy profile could be successfully used for the shape recognition such as battle tanks, facial expressions, and butterflies. In the present study, we proposed new descriptors, roundness, symmetry, and surface roughness, for the recognition, which are more accurate and fast in the computation than the previous descriptors. The roundness represents how closely a shape resembles to a circle, the symmetry characterizes how much one shape is similar with another when the shape is moved in flip, and the surface roughness quantifies the degree of vertical deviations of a shape boundary. To evaluate the performance of the descriptors, we used the database of leaf images with 12 species. Each species consisted of 10 - 20 leaf images and the total number of images were 160. The evaluation showed that the new descriptors successfully discriminated the leaf species. We believe that the descriptors can be a useful tool in the field of pattern recognition.

  10. Correlation between social proximity and mobility similarity.

    Science.gov (United States)

    Fan, Chao; Liu, Yiding; Huang, Junming; Rong, Zhihai; Zhou, Tao

    2017-09-20

    Human behaviors exhibit ubiquitous correlations in many aspects, such as individual and collective levels, temporal and spatial dimensions, content, social and geographical layers. With rich Internet data of online behaviors becoming available, it attracts academic interests to explore human mobility similarity from the perspective of social network proximity. Existent analysis shows a strong correlation between online social proximity and offline mobility similarity, namely, mobile records between friends are significantly more similar than between strangers, and those between friends with common neighbors are even more similar. We argue the importance of the number and diversity of common friends, with a counter intuitive finding that the number of common friends has no positive impact on mobility similarity while the diversity plays a key role, disagreeing with previous studies. Our analysis provides a novel view for better understanding the coupling between human online and offline behaviors, and will help model and predict human behaviors based on social proximity.

  11. Similarity and a Duality for Fullerenes

    Directory of Open Access Journals (Sweden)

    Jennifer J. Edmond

    2015-11-01

    Full Text Available Fullerenes are molecules of carbon that are modeled by trivalent plane graphs with only pentagonal and hexagonal faces. Scaling up a fullerene gives a notion of similarity, and fullerenes are partitioned into similarity classes. In this expository article, we illustrate how the values of two important fullerene parameters can be deduced for all fullerenes in a similarity class by computing the values of these parameters for just the three smallest representatives of that class. In addition, it turns out that there is a natural duality theory for similarity classes of fullerenes based on one of the most important fullerene construction techniques: leapfrog construction. The literature on fullerenes is very extensive, and since this is a general interest journal, we will summarize and illustrate the fundamental results that we will need to develop similarity and this duality.

  12. SearchResultFinder: federated search made easy

    NARCIS (Netherlands)

    Trieschnigg, Rudolf Berend; Tjin-Kam-Jet, Kien; Hiemstra, Djoerd

    Building a federated search engine based on a large number existing web search engines is a challenge: implementing the programming interface (API) for each search engine is an exacting and time-consuming job. In this demonstration we present SearchResultFinder, a browser plugin which speeds up

  13. Market Dominance and Search Quality in the Search Engine Market

    NARCIS (Netherlands)

    Lianos, I.; Motchenkova, E.I.

    2013-01-01

    We analyze a search engine market from a law and economics perspective and incorporate the choice of quality-improving innovations by a search engine platform in a two-sided model of Internet search engine. In the proposed framework, we first discuss the legal issues the search engine market raises

  14. Profiling application potential for alkali treated sisal fiber ...

    African Journals Online (AJOL)

    Profiling application potential for alkali treated sisal fiber-polypropylene composite in Ashby material-property plots. ... International Journal of Engineering, Science and Technology. Journal Home · ABOUT · Advanced Search · Current Issue ...

  15. Picasso: generating a covering set of protein family profiles.

    Science.gov (United States)

    Heger, A; Holm, L

    2001-03-01

    Evolutionary classification leads to an economical description of protein sequence data because attributes of function and structure are inherited in protein families. This paper presents Picasso, a procedure for deriving a minimal set of protein family profiles that cover all known protein sequences. Picasso starts from highly overlapping sequence neighbourhoods revealed by all-on-all pairwise Blast alignment. Overlaps are reduced by merging sequences or parts of sequences into multiple alignments. For maximum unification, the multiple alignments must reach into the twilight zone of sequence similarity. Sensitive and selective profile-profile comparison allows unification down to about 15% pairwise sequence identity. Families unified through a short conserved sequence motif are associated with multiple full-length alignments describing different subfamilies. Domains that are mobile modules are identified based on their association with different sets of neighbours. The result is 10000 unified domain families (excluding singletons) representing functionally related proteins and recovering classical prolific domain types in high numbers. The classification is useful, for example, in developing strategies for efficient database searching and for selecting targets to complete the map of all 3-D structures.

  16. Mastering Search Analytics Measuring SEO, SEM and Site Search

    CERN Document Server

    Chaters, Brent

    2011-01-01

    Many companies still approach Search Engine Optimization (SEO) and paid search as separate initiatives. This in-depth guide shows you how to use these programs as part of a comprehensive strategy-not just to improve your site's search rankings, but to attract the right people and increase your conversion rate. Learn how to measure, test, analyze, and interpret all of your search data with a wide array of analytic tools. Gain the knowledge you need to determine the strategy's return on investment. Ideal for search specialists, webmasters, and search marketing managers, Mastering Search Analyt

  17. Efficient protein alignment algorithm for protein search.

    Science.gov (United States)

    Lu, Zaixin; Zhao, Zhiyu; Fu, Bin

    2010-01-18

    Proteins show a great variety of 3D conformations, which can be used to infer their evolutionary relationship and to classify them into more general groups; therefore protein structure alignment algorithms are very helpful for protein biologists. However, an accurate alignment algorithm itself may be insufficient for effective discovering of structural relationships among tens of thousands of proteins. Due to the exponentially increasing amount of protein structural data, a fast and accurate structure alignment tool is necessary to access protein classification and protein similarity search; however, the complexity of current alignment algorithms are usually too high to make a fully alignment-based classification and search practical. We have developed an efficient protein pairwise alignment algorithm and applied it to our protein search tool, which aligns a query protein structure in the pairwise manner with all protein structures in the Protein Data Bank (PDB) to output similar protein structures. The algorithm can align hundreds of pairs of protein structures in one second. Given a protein structure, the tool efficiently discovers similar structures from tens of thousands of structures stored in the PDB always in 2 minutes in a single machine and 20 seconds in our cluster of 6 machines. The algorithm has been fully implemented and is accessible online at our webserver, which is supported by a cluster of computers. Our algorithm can work out hundreds of pairs of protein alignments in one second. Therefore, it is very suitable for protein search. Our experimental results show that it is more accurate than other well known protein search systems in finding proteins which are structurally similar at SCOP family and superfamily levels, and its speed is also competitive with those systems. In terms of the pairwise alignment performance, it is as good as some well known alignment algorithms.

  18. Two perspectives on similarity between words

    Science.gov (United States)

    Frisch, Stefan A.

    2003-10-01

    This presentation examines the similarity between words from both bottom up (phonetic) and top down (phonological/psycholinguistic) perspectives. From the phonological perspective, the influence of structure on similarity is explored using metalinguistic acceptability judgments for multisyllabic nonwords. Results from an experiment suggest that subjects try to align novel words with known words in order to maximize similarities while minimizing dissimilarities. This finding parallels results from psychology on similarity judgments for visual scenes. From the phonetic perspective, the influence of similar gestures on speech error rates is examined using ultrasound measurement of tongue position. In a pilot experiment, subjects, produced tongue twisters containing words where onset and vowel phonemes had similar gestures (e.g., tip, comb) and where the onset and vowel had dissimilar gestures (e.g., tube, keep). Preliminary results suggest that misarticulations are more frequent in the context of dissimilar gestures (e.g., in the tongue twister tip cape keep tape, error rates are higher for /k/ than /t/). These errors appear to be gestural interactions rather than errors at the phonemic or featural level of phonological spellout. Together, these two experiments indicate that similarity relations between words are found at multiple levels, any which are potentially relevant to the structure of phonological systems.

  19. Electrostatic Similarity Determination Using Multiresolution Analysis.

    Science.gov (United States)

    Hakkoymaz, Huseyin; Kieslich, Chris A; Jr, Ronald D Gorham; Gunopulos, Dimitrios; Morikis, Dimitrios

    2011-08-01

    Molecular similarity is an important tool in protein and drug design for analyzing the quantitative relationships between physicochemical properties of two molecules. We present a family of similarity measures which exploits the ability of wavelet transformation to analyze the spectral components of physicochemical properties and suggests a sensitive way for measuring similarities of biological molecules. In order to investigate how effective wavelet-based similarity measures were against conventional measures, we defined several patterns which involve scalar or topological changes in the distribution of electrostatic properties. The wavelet-based measures were more successful in discriminating these patterns in contrast to the current state-of-art similarity measures. We also present the validity of wavelet-based similarity measures through the hierarchical clustering of two protein datasets consisting of families of homologous domains and alanine scan mutants. This type of similarity analysis is useful for protein structure-function studies and protein design. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  20. Measure of Node Similarity in Multilayer Networks.

    Directory of Open Access Journals (Sweden)

    Anders Mollgaard

    Full Text Available The weight of links in a network is often related to the similarity of the nodes. Here, we introduce a simple tunable measure for analysing the similarity of nodes across different link weights. In particular, we use the measure to analyze homophily in a group of 659 freshman students at a large university. Our analysis is based on data obtained using smartphones equipped with custom data collection software, complemented by questionnaire-based data. The network of social contacts is represented as a weighted multilayer network constructed from different channels of telecommunication as well as data on face-to-face contacts. We find that even strongly connected individuals are not more similar with respect to basic personality traits than randomly chosen pairs of individuals. In contrast, several socio-demographics variables have a significant degree of similarity. We further observe that similarity might be present in one layer of the multilayer network and simultaneously be absent in the other layers. For a variable such as gender, our measure reveals a transition from similarity between nodes connected with links of relatively low weight to dis-similarity for the nodes connected by the strongest links. We finally analyze the overlap between layers in the network for different levels of acquaintanceships.

  1. Person Attribute Search For Large-Area Video Surveillance

    OpenAIRE

    Thornton, Jason; Butler, Daniel; Baran-Gale, Jeanette; Chan, Michael; Zwahlen, Heather

    2012-01-01

    This article appeared in Homeland Security Affairs (May 2012), supplement 5, article 1 "This paper describes novel video analytics technology which allows an operator to search through large volumes of surveillance video data to find persons that match a particular attribute profile. Since the proposed technique is geared for surveillance of large areas, this profile consists of attributes that are observable at a distance (including clothing information, hair color, gender, etc.) rather t...

  2. SUSY Searches at ATLAS

    CERN Document Server

    Mamuzic, Judita; The ATLAS collaboration

    2017-01-01

    Supersymmetry (SUSY) is considered one of the best motivated extensions of the Standard Model. It postulates a fundamental symmetry between fermions and bosons, and introduces a set of new supersymmetric particles at the electroweak scale. It addresses the hierarchy and naturalness problem, gives a solution to the gauge coupling unification, and offers a cold dark matter candidate. Different aspects of SUSY searches, using strong, electroweak, third generation production, and R-parity violation and long lived particles are being studied at the LHC. An overview of most recent SUSY searches results using the 13 TeV ATLAS RUN2 data will be presented.

  3. Searching with iterated maps

    Science.gov (United States)

    Elser, V.; Rankenburg, I.; Thibault, P.

    2007-01-01

    In many problems that require extensive searching, the solution can be described as satisfying two competing constraints, where satisfying each independently does not pose a challenge. As an alternative to tree-based and stochastic searching, for these problems we propose using an iterated map built from the projections to the two constraint sets. Algorithms of this kind have been the method of choice in a large variety of signal-processing applications; we show here that the scope of these algorithms is surprisingly broad, with applications as diverse as protein folding and Sudoku. PMID:17202267

  4. Upgrading Enterprise Search

    Energy Technology Data Exchange (ETDEWEB)

    McDunn, R

    2005-04-28

    This presentation will describe the process we went through this past year to upgrade our enterprise search tool from a very old version of Inktomi to the latest version of Verity Ultraseek. We started with requirements gathering and then compared requirements against several available products to determine which product to choose. After purchasing the product, we worked through several defined phases of implementation and customization, with initial rollout late January 2004. Finally, we will show you where we are today and describe future search plans.

  5. Search for $2\

    CERN Document Server

    :,; Auty, D J; Barbeau, P S; Beck, D; Belov, V; Breidenbach, M; Brunner, T; Burenkov, A; Cao, G F; Chambers, C; Chaves, J; Cleveland, B; Coon, M; Craycraft, A; Daniels, T; Danilov, M; Daugherty, S J; Davis, J; Delaquis, S; Der Mesrobian-Kabakian, A; DeVoe, R; Didberidze, T; Dilling, J; Dolgolenko, A; Dolinski, M J; Dunford, M; Fairbank, W; Farine, J; Feldmeier, W; Feyzbakhsh, S; Fierlinger, P; Fudenberg, D; Gornea, R; Graham, K; Gratta, G; Hall, C; Hughes, M; Jewell, M J; Johnson, A; Johnson, T N; Johnston, S; Karelin, A; Kaufman, L J; Killick, R; King, J; Koffas, T; Kravitz, S; Krücken, R; Kuchenkov, A; Kumar, K S; Leonard, D S; Licciardi, C; Lin, Y H; Ling, J; MacLellan, R; Marino, M G; Mong, B; Moore, D; Njoya, O; Nelson, R; Odian, A; Ostrovskiy, I; Piepke, A; Pocar, A; Prescott, C Y; Retière, F; Rowson, P C; Russell, J J; Schubert, A; Sinclair, D; Smith, E; Stekhanov, V; Tarka, M; Tolba, T; Tsang, R; Twelker, K; Vogel, P; Vuilleumier, J -L; Waite, A; Walton, J; Walton, T; Weber, M; Wen, L J; Wichoski, U; Winick, T A; Wood, J; Xu, Q Y; Yang, L; Yen, Y -R; Zeldovich, O Ya

    2015-01-01

    EXO-200 is a single phase liquid xenon detector designed to search for neutrinoless double-beta decay of $^{136}$Xe to the ground state of $^{136}$Ba. We report here on a search for the two-neutrino double-beta decay of $^{136}$Xe to the first $0^+$ excited state, $0^+_1$, of $^{136}$Ba based on a 100 kg$\\cdot$yr exposure of $^{136}$Xe. Using a specialized analysis employing a machine learning algorithm, we obtain a 90% CL half-life sensitivity of $1.7 \\times 10^{24}$ yr. We find no statistically significant evidence for the $2\

  6. Search for glueballs

    Energy Technology Data Exchange (ETDEWEB)

    Toki, W. [Colorado State Univ., Ft. Collins, CO (United States). Dept. of Physics

    1997-06-01

    In these Summer School lectures, the author reviews the results of recent glueball searches. He begins with a brief review of glueball phenomenology and meson spectroscopy, including a discussion of resonance behavior. The results on the f{sub o}(1500) and f{sub J}(1700) resonances from proton-antiproton experiments and radiative J/{Psi} decays are discussed. Finally, {pi}{pi} and {eta}{pi} studies from D{sub s} decays and exotic meson searches are reviewed. 46 refs., 40 figs.

  7. Searching for excellence

    Science.gov (United States)

    Yager, Robert E.

    Visits to six school districts which were identified by the National Science Teachers Association's Search for Excellence program were made during 1983 by teams of 17 researchers. The reports were analyzed in search for common characteristics that can explain the requirements necessary for excellent science programs. The results indicate that creative ideas, administrative and community involvement, local ownership and pride, and well-developed in-service programs and implementation strategies are vital. Exceptional teachers with boundless energies also seem to exist where exemplary science programs are found.

  8. The Search for Extraterrestrial Life

    Science.gov (United States)

    Peter, Ulmschneider

    Looking at the nature, origin, and evolution of life on Earth is one way of assessing whether extraterrestrial life exists on Earth-like planets elsewhere (see Chaps. 5 and 6). A more direct approach is to search for favorable conditions and traces of life on other celestial bodies, both in the solar system and beyond. Clearly, there is little chance of encountering nonhuman intelligent beings in the solar system. But there could well be primitive life on Mars, particularly as in the early history of the solar system the conditions on Mars were quite similar to those on Earth. In addition, surprisingly favorable conditions for life once existed on the moons of Jupiter. Yet even if extraterrestrial life is not encountered in forthcoming space missions, it would be of utmost importance to recover fossils of past organisms as such traces would greatly contribute to our basic understanding of the formation of life. In addition to the planned missions to Mars and Europa, there are extensive efforts to search for life outside the solar system. Rapid advances in the detection of extrasolar planets, outlined in Chap. 3, are expected to lead to the discovery of Earth-like planets in the near future. But how can we detect life on these distant bodies?

  9. Similarity and Decay Laws of Momentumless Wakes.

    Science.gov (United States)

    1977-05-26

    Naval Sea Systems Command , Code NSEA—03133. -‘ F 9 —2— 26 May 1977 SN : j ep SIMILARITY AND DECAY LAWS OF MOMENTUI’ILESS WAKE S by Samuel Ilassid...and this neg lect will have to be justified a posteriori. Now, one seeks self-similar solutions of the type: k = Xh(ri) c = Ee(n) Ud = UDf (n) ~ = where...0 9x r 3r L6 c Again, seeking self-similar solutions of the type: k = Kh(ti) c = Ee(ri) Ud = UDf (n) r~ = (6.4) and letting 2 2 = 1 (6.5)E L one

  10. Similarity-based pattern analysis and recognition

    CERN Document Server

    Pelillo, Marcello

    2013-01-01

    This accessible text/reference presents a coherent overview of the emerging field of non-Euclidean similarity learning. The book presents a broad range of perspectives on similarity-based pattern analysis and recognition methods, from purely theoretical challenges to practical, real-world applications. The coverage includes both supervised and unsupervised learning paradigms, as well as generative and discriminative models. Topics and features: explores the origination and causes of non-Euclidean (dis)similarity measures, and how they influence the performance of traditional classification alg

  11. Three journal similarity metrics and their application to biomedical journals.

    Science.gov (United States)

    D'Souza, Jennifer L; Smalheiser, Neil R

    2014-01-01

    In the present paper, we have created several novel journal similarity metrics. The MeSH odds ratio measures the topical similarity of any pair of journals, based on the major MeSH headings assigned to articles in MEDLINE. The second metric employed the 2009 Author-ity author name disambiguation dataset as a gold standard for estimating the author odds ratio. This gives a straightforward, intuitive answer to the question: Given two articles in PubMed that share the same author name (lastname, first initial), how does knowing only the identity of the journals (in which the articles were published) predict the relative likelihood that they are written by the same person vs. different persons? The article pair odds ratio detects the tendency of authors to publish repeatedly in the same journal, as well as in specific pairs of journals. The metrics can be applied not only to estimate the similarity of a pair of journals, but to provide novel profiles of individual journals as well. For example, for each journal, one can define the MeSH cloud as the number of other journals that are topically more similar to it than expected by chance, and the author cloud as the number of other journals that share more authors than expected by chance. These metrics for journal pairs and individual journals have been provided in the form of public datasets that can be readily studied and utilized by others.

  12. Overview of the INEX 2014 Social Book Search Track

    DEFF Research Database (Denmark)

    Koolen, Marijn; Bogers, Toine; Kazai, Gabriella

    2014-01-01

    The goal of the INEX 2014 Social Book Search Track is to evaluate approaches for supporting users in searching collections of books based on book metadata and associated user-generated content. The track investigates the complex nature of relevance in book search and the role of traditional...... and user-generated book metadata in retrieval. We extended last year’s investigation into the nature of book suggestions from the LibraryThing forums and how they compare to book relevance judgements. Participants were encouraged to incorporate rich user profiles of both topic creators and other Library...

  13. SUSY Searches at the Tevatron

    Energy Technology Data Exchange (ETDEWEB)

    Zivkovic, L.

    2011-07-01

    In this article results from supersymmetry searches at D0 and CDF are reported. Searches for third generation squarks, searches for gauginos, and searches for models with R-parity violation are described. As no signs of supersymmetry for these models are observed, the most stringent limits to date are presented.

  14. Complete local search with memory

    NARCIS (Netherlands)

    Ghosh, D.; Sierksma, G.

    2000-01-01

    Neighborhood search heuristics like local search and its variants are some of the most popular approaches to solve discrete optimization problems of moderate to large size. Apart from tabu search, most of these heuristics are memoryless. In this paper we introduce a new neighborhood search heuristic

  15. Complete local search with memory

    NARCIS (Netherlands)

    Ghosh, D.; Sierksma, G.

    2002-01-01

    Neighborhood search heuristics like local search and its variants are some of the most popular approaches to solve discrete optimization problems of moderate to large size. Apart from tabu search, most of these heuristics are memoryless. In this paper we introduce a new neighborhood search heuristic

  16. The Evolution of Web Searching.

    Science.gov (United States)

    Green, David

    2000-01-01

    Explores the interrelation between Web publishing and information retrieval technologies and lists new approaches to Web indexing and searching. Highlights include Web directories; search engines; portalisation; Internet service providers; browser providers; meta search engines; popularity based analysis; natural language searching; links-based…

  17. Prices and heterogeneous search costs

    NARCIS (Netherlands)

    Moraga Gonzalez, J.L.; Sandor, Z.; Wildenbeest, M.R.

    2017-01-01

    We study price formation in a model of consumer search for differentiated products in which consumers have heterogeneous search costs. We provide conditions under which a pure-strategy symmetric Nash equilibrium exists and is unique. Search costs affect two margins—the intensive search margin (or

  18. Self-learning search engines

    NARCIS (Netherlands)

    Schuth, A.

    How does a search engine such as Google know which search results to display? There are many competing algorithms that generate search results, but which one works best? We developed a new probabilistic method for quickly comparing large numbers of search algorithms by examining the results users

  19. Standardization of Keyword Search Mode

    Science.gov (United States)

    Su, Di

    2010-01-01

    In spite of its popularity, keyword search mode has not been standardized. Though information professionals are quick to adapt to various presentations of keyword search mode, novice end-users may find keyword search confusing. This article compares keyword search mode in some major reference databases and calls for standardization. (Contains 3…

  20. Distances and similarities in intuitionistic fuzzy sets

    CERN Document Server

    Szmidt, Eulalia

    2014-01-01

    This book presents the state-of-the-art in theory and practice regarding similarity and distance measures for intuitionistic fuzzy sets. Quantifying similarity and distances is crucial for many applications, e.g. data mining, machine learning, decision making, and control. The work provides readers with a comprehensive set of theoretical concepts and practical tools for both defining and determining similarity between intuitionistic fuzzy sets. It describes an automatic algorithm for deriving intuitionistic fuzzy sets from data, which can aid in the analysis of information in large databases. The book also discusses other important applications, e.g. the use of similarity measures to evaluate the extent of agreement between experts in the context of decision making.