WorldWideScience

Sample records for hemolysin-like proteins similar

  1. Protein structural similarity search by Ramachandran codes

    Directory of Open Access Journals (Sweden)

    Chang Chih-Hung

    2007-08-01

    Full Text Available Abstract Background Protein structural data has increased exponentially, such that fast and accurate tools are necessary to access structure similarity search. To improve the search speed, several methods have been designed to reduce three-dimensional protein structures to one-dimensional text strings that are then analyzed by traditional sequence alignment methods; however, the accuracy is usually sacrificed and the speed is still unable to match sequence similarity search tools. Here, we aimed to improve the linear encoding methodology and develop efficient search tools that can rapidly retrieve structural homologs from large protein databases. Results We propose a new linear encoding method, SARST (Structural similarity search Aided by Ramachandran Sequential Transformation. SARST transforms protein structures into text strings through a Ramachandran map organized by nearest-neighbor clustering and uses a regenerative approach to produce substitution matrices. Then, classical sequence similarity search methods can be applied to the structural similarity search. Its accuracy is similar to Combinatorial Extension (CE and works over 243,000 times faster, searching 34,000 proteins in 0.34 sec with a 3.2-GHz CPU. SARST provides statistically meaningful expectation values to assess the retrieved information. It has been implemented into a web service and a stand-alone Java program that is able to run on many different platforms. Conclusion As a database search method, SARST can rapidly distinguish high from low similarities and efficiently retrieve homologous structures. It demonstrates that the easily accessible linear encoding methodology has the potential to serve as a foundation for efficient protein structural similarity search tools. These search tools are supposed applicable to automated and high-throughput functional annotations or predictions for the ever increasing number of published protein structures in this post-genomic era.

  2. Optimal neighborhood indexing for protein similarity search.

    Science.gov (United States)

    Peterlongo, Pierre; Noé, Laurent; Lavenier, Dominique; Nguyen, Van Hoa; Kucherov, Gregory; Giraud, Mathieu

    2008-12-16

    Similarity inference, one of the main bioinformatics tasks, has to face an exponential growth of the biological data. A classical approach used to cope with this data flow involves heuristics with large seed indexes. In order to speed up this technique, the index can be enhanced by storing additional information to limit the number of random memory accesses. However, this improvement leads to a larger index that may become a bottleneck. In the case of protein similarity search, we propose to decrease the index size by reducing the amino acid alphabet. The paper presents two main contributions. First, we show that an optimal neighborhood indexing combining an alphabet reduction and a longer neighborhood leads to a reduction of 35% of memory involved into the process, without sacrificing the quality of results nor the computational time. Second, our approach led us to develop a new kind of substitution score matrices and their associated e-value parameters. In contrast to usual matrices, these matrices are rectangular since they compare amino acid groups from different alphabets. We describe the method used for computing those matrices and we provide some typical examples that can be used in such comparisons. Supplementary data can be found on the website http://bioinfo.lifl.fr/reblosum. We propose a practical index size reduction of the neighborhood data, that does not negatively affect the performance of large-scale search in protein sequences. Such an index can be used in any study involving large protein data. Moreover, rectangular substitution score matrices and their associated statistical parameters can have applications in any study involving an alphabet reduction.

  3. Optimal neighborhood indexing for protein similarity search

    Directory of Open Access Journals (Sweden)

    Nguyen Van

    2008-12-01

    Full Text Available Abstract Background Similarity inference, one of the main bioinformatics tasks, has to face an exponential growth of the biological data. A classical approach used to cope with this data flow involves heuristics with large seed indexes. In order to speed up this technique, the index can be enhanced by storing additional information to limit the number of random memory accesses. However, this improvement leads to a larger index that may become a bottleneck. In the case of protein similarity search, we propose to decrease the index size by reducing the amino acid alphabet. Results The paper presents two main contributions. First, we show that an optimal neighborhood indexing combining an alphabet reduction and a longer neighborhood leads to a reduction of 35% of memory involved into the process, without sacrificing the quality of results nor the computational time. Second, our approach led us to develop a new kind of substitution score matrices and their associated e-value parameters. In contrast to usual matrices, these matrices are rectangular since they compare amino acid groups from different alphabets. We describe the method used for computing those matrices and we provide some typical examples that can be used in such comparisons. Supplementary data can be found on the website http://bioinfo.lifl.fr/reblosum. Conclusion We propose a practical index size reduction of the neighborhood data, that does not negatively affect the performance of large-scale search in protein sequences. Such an index can be used in any study involving large protein data. Moreover, rectangular substitution score matrices and their associated statistical parameters can have applications in any study involving an alphabet reduction.

  4. Retinoid-binding proteins: similar protein architectures bind similar ligands via completely different ways.

    Directory of Open Access Journals (Sweden)

    Yu-Ru Zhang

    Full Text Available BACKGROUND: Retinoids are a class of compounds that are chemically related to vitamin A, which is an essential nutrient that plays a key role in vision, cell growth and differentiation. In vivo, retinoids must bind with specific proteins to perform their necessary functions. Plasma retinol-binding protein (RBP and epididymal retinoic acid binding protein (ERABP carry retinoids in bodily fluids, while cellular retinol-binding proteins (CRBPs and cellular retinoic acid-binding proteins (CRABPs carry retinoids within cells. Interestingly, although all of these transport proteins possess similar structures, the modes of binding for the different retinoid ligands with their carrier proteins are different. METHODOLOGY/PRINCIPAL FINDINGS: In this work, we analyzed the various retinoid transport mechanisms using structure and sequence comparisons, binding site analyses and molecular dynamics simulations. Our results show that in the same family of proteins and subcellular location, the orientation of a retinoid molecule within a binding protein is same, whereas when different families of proteins are considered, the orientation of the bound retinoid is completely different. In addition, none of the amino acid residues involved in ligand binding is conserved between the transport proteins. However, for each specific binding protein, the amino acids involved in the ligand binding are conserved. The results of this study allow us to propose a possible transport model for retinoids. CONCLUSIONS/SIGNIFICANCE: Our results reveal the differences in the binding modes between the different retinoid-binding proteins.

  5. Pythoscape: A framework for generation of large protein similarity networks

    OpenAIRE

    Babbitt, Patricia; Barber, AE; Babbitt, PC

    2012-01-01

    Pythoscape is a framework implemented in Python for processing large protein similarity networks for visualization in other software packages. Protein similarity networks are graphical representations of sequence, structural and other similarities among pr

  6. A Signal Processing Method to Explore Similarity in Protein Flexibility

    Directory of Open Access Journals (Sweden)

    Simina Vasilache

    2010-01-01

    Full Text Available Understanding mechanisms of protein flexibility is of great importance to structural biology. The ability to detect similarities between proteins and their patterns is vital in discovering new information about unknown protein functions. A Distance Constraint Model (DCM provides a means to generate a variety of flexibility measures based on a given protein structure. Although information about mechanical properties of flexibility is critical for understanding protein function for a given protein, the question of whether certain characteristics are shared across homologous proteins is difficult to assess. For a proper assessment, a quantified measure of similarity is necessary. This paper begins to explore image processing techniques to quantify similarities in signals and images that characterize protein flexibility. The dataset considered here consists of three different families of proteins, with three proteins in each family. The similarities and differences found within flexibility measures across homologous proteins do not align with sequence-based evolutionary methods.

  7. Investigating Correlation between Protein Sequence Similarity and Semantic Similarity Using Gene Ontology Annotations.

    Science.gov (United States)

    Ikram, Najmul; Qadir, Muhammad Abdul; Afzal, Muhammad Tanvir

    2018-01-01

    Sequence similarity is a commonly used measure to compare proteins. With the increasing use of ontologies, semantic (function) similarity is getting importance. The correlation between these measures has been applied in the evaluation of new semantic similarity methods, and in protein function prediction. In this research, we investigate the relationship between the two similarity methods. The results suggest absence of a strong correlation between sequence and semantic similarities. There is a large number of proteins with low sequence similarity and high semantic similarity. We observe that Pearson's correlation coefficient is not sufficient to explain the nature of this relationship. Interestingly, the term semantic similarity values above 0 and below 1 do not seem to play a role in improving the correlation. That is, the correlation coefficient depends only on the number of common GO terms in proteins under comparison, and the semantic similarity measurement method does not influence it. Semantic similarity and sequence similarity have a distinct behavior. These findings are of significant effect for future works on protein comparison, and will help understand the semantic similarity between proteins in a better way.

  8. Protein-protein interaction network-based detection of functionally similar proteins within species.

    Science.gov (United States)

    Song, Baoxing; Wang, Fen; Guo, Yang; Sang, Qing; Liu, Min; Li, Dengyun; Fang, Wei; Zhang, Deli

    2012-07-01

    Although functionally similar proteins across species have been widely studied, functionally similar proteins within species showing low sequence similarity have not been examined in detail. Identification of these proteins is of significant importance for understanding biological functions, evolution of protein families, progression of co-evolution, and convergent evolution and others which cannot be obtained by detection of functionally similar proteins across species. Here, we explored a method of detecting functionally similar proteins within species based on graph theory. After denoting protein-protein interaction networks using graphs, we split the graphs into subgraphs using the 1-hop method. Proteins with functional similarities in a species were detected using a method of modified shortest path to compare these subgraphs and to find the eligible optimal results. Using seven protein-protein interaction networks and this method, some functionally similar proteins with low sequence similarity that cannot detected by sequence alignment were identified. By analyzing the results, we found that, sometimes, it is difficult to separate homologous from convergent evolution. Evaluation of the performance of our method by gene ontology term overlap showed that the precision of our method was excellent. Copyright © 2012 Wiley Periodicals, Inc.

  9. Protein structure similarity from principle component correlation analysis

    Directory of Open Access Journals (Sweden)

    Chou James

    2006-01-01

    Full Text Available Abstract Background Owing to rapid expansion of protein structure databases in recent years, methods of structure comparison are becoming increasingly effective and important in revealing novel information on functional properties of proteins and their roles in the grand scheme of evolutionary biology. Currently, the structural similarity between two proteins is measured by the root-mean-square-deviation (RMSD in their best-superimposed atomic coordinates. RMSD is the golden rule of measuring structural similarity when the structures are nearly identical; it, however, fails to detect the higher order topological similarities in proteins evolved into different shapes. We propose new algorithms for extracting geometrical invariants of proteins that can be effectively used to identify homologous protein structures or topologies in order to quantify both close and remote structural similarities. Results We measure structural similarity between proteins by correlating the principle components of their secondary structure interaction matrix. In our approach, the Principle Component Correlation (PCC analysis, a symmetric interaction matrix for a protein structure is constructed with relationship parameters between secondary elements that can take the form of distance, orientation, or other relevant structural invariants. When using a distance-based construction in the presence or absence of encoded N to C terminal sense, there are strong correlations between the principle components of interaction matrices of structurally or topologically similar proteins. Conclusion The PCC method is extensively tested for protein structures that belong to the same topological class but are significantly different by RMSD measure. The PCC analysis can also differentiate proteins having similar shapes but different topological arrangements. Additionally, we demonstrate that when using two independently defined interaction matrices, comparison of their maximum

  10. Pythoscape: a framework for generation of large protein similarity networks.

    Science.gov (United States)

    Barber, Alan E; Babbitt, Patricia C

    2012-11-01

    Pythoscape is a framework implemented in Python for processing large protein similarity networks for visualization in other software packages. Protein similarity networks are graphical representations of sequence, structural and other similarities among proteins for which pairwise all-by-all similarity connections have been calculated. Mapping of biological and other information to network nodes or edges enables hypothesis creation about sequence-structure-function relationships across sets of related proteins. Pythoscape provides several options to calculate pairwise similarities for input sequences or structures, applies filters to network edges and defines sets of similar nodes and their associated data as single nodes (termed representative nodes) for compression of network information and output data or formatted files for visualization.

  11. GIS: a comprehensive source for protein structure similarities.

    Science.gov (United States)

    Guerler, Aysam; Knapp, Ernst-Walter

    2010-07-01

    A web service for analysis of protein structures that are sequentially or non-sequentially similar was generated. Recently, the non-sequential structure alignment algorithm GANGSTA+ was introduced. GANGSTA+ can detect non-sequential structural analogs for proteins stated to possess novel folds. Since GANGSTA+ ignores the polypeptide chain connectivity of secondary structure elements (i.e. alpha-helices and beta-strands), it is able to detect structural similarities also between proteins whose sequences were reshuffled during evolution. GANGSTA+ was applied in an all-against-all comparison on the ASTRAL40 database (SCOP version 1.75), which consists of >10,000 protein domains yielding about 55 x 10(6) possible protein structure alignments. Here, we provide the resulting protein structure alignments as a public web-based service, named GANGSTA+ Internet Services (GIS). We also allow to browse the ASTRAL40 database of protein structures with GANGSTA+ relative to an externally given protein structure using different constraints to select specific results. GIS allows us to analyze protein structure families according to the SCOP classification scheme. Additionally, users can upload their own protein structures for pairwise protein structure comparison, alignment against all protein structures of the ASTRAL40 database (SCOP version 1.75) or symmetry analysis. GIS is publicly available at http://agknapp.chemie.fu-berlin.de/gplus.

  12. Genetic similarity of soybean genotypes revealed by seed protein

    Directory of Open Access Journals (Sweden)

    Nikolić Ana

    2005-01-01

    Full Text Available More accurate and complete descriptions of genotypes could help determinate future breeding strategies and facilitate introgression of new genotypes in current soybean genetic pool. The objective of this study was to characterize 20 soybean genotypes from the Maize Research Institute "Zemun Polje" collection, which have good agronomic performances, high yield, lodging and drought resistance, and low shuttering by seed proteins as biochemical markers. Seed proteins were isolated and separated by PAA electrophoresis. On the basis of the presence/absence of protein fractions coefficients of similarity were calculated as Dice and Roger and Tanamoto coefficient between pairs of genotypes. The similarity matrix was submitted for hierarchical cluster analysis of un weighted pair group using arithmetic average (UPGMA method and necessary computation were performed using NTSYS-pc program. Protein seed analysis confirmed low level of genetic diversity in soybean. The highest genetic similarity was between genotypes P9272 and Kador. According to obtained results, soybean genotypes were assigned in two larger groups and coefficients of similarity showed similar results. Because of the lack of pedigree data for analyzed genotypes, correspondence with marker data could not be determined. In plant with a narrow genetic base in their gene pool, such as soybean, protein markers may not be sufficient for characterization and study of genetic diversity.

  13. Identification of proteins similar to AvrE type III effector proteins from ...

    African Journals Online (AJOL)

    Stephen Opiyo

    GSE22274), and AraCyc databases, we highlighted 16 protein candidates from Arabidopsidis genome .... projection method similar to principal component analysis (PCA) .... RIN4 RIN4 (RPM1 INTERACTING PROTEIN 4); protein binding.

  14. Identification of proteins similar to AvrE type III effector proteins from ...

    African Journals Online (AJOL)

    Type III effector proteins are injected into host cells through type III secretion systems. Some effectors are similar to host proteins to promote pathogenicity, while others lead to the activation of disease resistance. We used partial least squares alignment-free bioinformatics methods to identify proteins similar to AvrE proteins ...

  15. Contrasting HIV phylogenetic relationships and V3 loop protein similarities

    Energy Technology Data Exchange (ETDEWEB)

    Korber, B. (Los Alamos National Lab., NM (United States) Santa Fe Inst., NM (United States)); Myers, G. (Los Alamos National Lab., NM (United States))

    1992-01-01

    At least five distinct sequence subtypes of HIV-I can be identified from the major centers of the AMS pandemic. While it is too early to tell whether these subtypes are serologically or phenotypically similar or distinct in terms of properties such as pathogenicity and transmissibility, we can begin to investigate their potential for phenotypic divergence at the protein sequence level. Phylogenetic analysis of HIV DNA sequences is being widely used to examine lineages of different viral strains as they evolve and spread throughout the globe. We have identified five distinct HIV-1 subtypes (designated A-E), or clades, based on phylogenetic clustering patterns generated from genetic information from both the gag and envelope (env) genes from a spectrum of international isolates. Our initial observations concerning both HIV-1 and HIV-2 sequences indicate that conserved patterns in protein chemistry may indeed exist across distant lineages. Such patterns in V3 loop amino acid chemistry may be indicative of stable lineages or convergence within this highly variable, though functionally and immunologically critical, region. We think that there may be parallels between the apparently stable HIV-2 V3 lineage and the previously mentioned HIV-1 V3 loops which are very similar at the protein level despite being distant by cladistic analysis, and which do not possess the distinctive positively charged residues. Highly conserved V3 loop protein sequences are also encountered in SIVAGMs and CIVs (chimpanzee viral strains), which do not appear to be pathogenic in their wild-caught natural hosts.

  16. Contrasting HIV phylogenetic relationships and V3 loop protein similarities

    Energy Technology Data Exchange (ETDEWEB)

    Korber, B. [Los Alamos National Lab., NM (United States)]|[Santa Fe Inst., NM (United States); Myers, G. [Los Alamos National Lab., NM (United States)

    1992-12-31

    At least five distinct sequence subtypes of HIV-I can be identified from the major centers of the AMS pandemic. While it is too early to tell whether these subtypes are serologically or phenotypically similar or distinct in terms of properties such as pathogenicity and transmissibility, we can begin to investigate their potential for phenotypic divergence at the protein sequence level. Phylogenetic analysis of HIV DNA sequences is being widely used to examine lineages of different viral strains as they evolve and spread throughout the globe. We have identified five distinct HIV-1 subtypes (designated A-E), or clades, based on phylogenetic clustering patterns generated from genetic information from both the gag and envelope (env) genes from a spectrum of international isolates. Our initial observations concerning both HIV-1 and HIV-2 sequences indicate that conserved patterns in protein chemistry may indeed exist across distant lineages. Such patterns in V3 loop amino acid chemistry may be indicative of stable lineages or convergence within this highly variable, though functionally and immunologically critical, region. We think that there may be parallels between the apparently stable HIV-2 V3 lineage and the previously mentioned HIV-1 V3 loops which are very similar at the protein level despite being distant by cladistic analysis, and which do not possess the distinctive positively charged residues. Highly conserved V3 loop protein sequences are also encountered in SIVAGMs and CIVs (chimpanzee viral strains), which do not appear to be pathogenic in their wild-caught natural hosts.

  17. Clustering and visualizing similarity networks of membrane proteins.

    Science.gov (United States)

    Hu, Geng-Ming; Mai, Te-Lun; Chen, Chi-Ming

    2015-08-01

    We proposed a fast and unsupervised clustering method, minimum span clustering (MSC), for analyzing the sequence-structure-function relationship of biological networks, and demonstrated its validity in clustering the sequence/structure similarity networks (SSN) of 682 membrane protein (MP) chains. The MSC clustering of MPs based on their sequence information was found to be consistent with their tertiary structures and functions. For the largest seven clusters predicted by MSC, the consistency in chain function within the same cluster is found to be 100%. From analyzing the edge distribution of SSN for MPs, we found a characteristic threshold distance for the boundary between clusters, over which SSN of MPs could be properly clustered by an unsupervised sparsification of the network distance matrix. The clustering results of MPs from both MSC and the unsupervised sparsification methods are consistent with each other, and have high intracluster similarity and low intercluster similarity in sequence, structure, and function. Our study showed a strong sequence-structure-function relationship of MPs. We discussed evidence of convergent evolution of MPs and suggested applications in finding structural similarities and predicting biological functions of MP chains based on their sequence information. © 2015 Wiley Periodicals, Inc.

  18. The HMMER Web Server for Protein Sequence Similarity Search.

    Science.gov (United States)

    Prakash, Ananth; Jeffryes, Matt; Bateman, Alex; Finn, Robert D

    2017-12-08

    Protein sequence similarity search is one of the most commonly used bioinformatics methods for identifying evolutionarily related proteins. In general, sequences that are evolutionarily related share some degree of similarity, and sequence-search algorithms use this principle to identify homologs. The requirement for a fast and sensitive sequence search method led to the development of the HMMER software, which in the latest version (v3.1) uses a combination of sophisticated acceleration heuristics and mathematical and computational optimizations to enable the use of profile hidden Markov models (HMMs) for sequence analysis. The HMMER Web server provides a common platform by linking the HMMER algorithms to databases, thereby enabling the search for homologs, as well as providing sequence and functional annotation by linking external databases. This unit describes three basic protocols and two alternate protocols that explain how to use the HMMER Web server using various input formats and user defined parameters. © 2017 by John Wiley & Sons, Inc. Copyright © 2017 John Wiley & Sons, Inc.

  19. Protein-protein interaction inference based on semantic similarity of Gene Ontology terms.

    Science.gov (United States)

    Zhang, Shu-Bo; Tang, Qiang-Rong

    2016-07-21

    Identifying protein-protein interactions is important in molecular biology. Experimental methods to this issue have their limitations, and computational approaches have attracted more and more attentions from the biological community. The semantic similarity derived from the Gene Ontology (GO) annotation has been regarded as one of the most powerful indicators for protein interaction. However, conventional methods based on GO similarity fail to take advantage of the specificity of GO terms in the ontology graph. We proposed a GO-based method to predict protein-protein interaction by integrating different kinds of similarity measures derived from the intrinsic structure of GO graph. We extended five existing methods to derive the semantic similarity measures from the descending part of two GO terms in the GO graph, then adopted a feature integration strategy to combines both the ascending and the descending similarity scores derived from the three sub-ontologies to construct various kinds of features to characterize each protein pair. Support vector machines (SVM) were employed as discriminate classifiers, and five-fold cross validation experiments were conducted on both human and yeast protein-protein interaction datasets to evaluate the performance of different kinds of integrated features, the experimental results suggest the best performance of the feature that combines information from both the ascending and the descending parts of the three ontologies. Our method is appealing for effective prediction of protein-protein interaction. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. Correlation between protein sequence similarity and x-ray diffraction quality in the protein data bank.

    Science.gov (United States)

    Lu, Hui-Meng; Yin, Da-Chuan; Ye, Ya-Jing; Luo, Hui-Min; Geng, Li-Qiang; Li, Hai-Sheng; Guo, Wei-Hong; Shang, Peng

    2009-01-01

    As the most widely utilized technique to determine the 3-dimensional structure of protein molecules, X-ray crystallography can provide structure of the highest resolution among the developed techniques. The resolution obtained via X-ray crystallography is known to be influenced by many factors, such as the crystal quality, diffraction techniques, and X-ray sources, etc. In this paper, the authors found that the protein sequence could also be one of the factors. We extracted information of the resolution and the sequence of proteins from the Protein Data Bank (PDB), classified the proteins into different clusters according to the sequence similarity, and statistically analyzed the relationship between the sequence similarity and the best resolution obtained. The results showed that there was a pronounced correlation between the sequence similarity and the obtained resolution. These results indicate that protein structure itself is one variable that may affect resolution when X-ray crystallography is used.

  1. Rift Valley fever virus NSs protein functions and the similarity to other bunyavirus NSs proteins.

    Science.gov (United States)

    Ly, Hoai J; Ikegami, Tetsuro

    2016-07-02

    Rift Valley fever is a mosquito-borne zoonotic disease that affects both ruminants and humans. The nonstructural (NS) protein, which is a major virulence factor for Rift Valley fever virus (RVFV), is encoded on the S-segment. Through the cullin 1-Skp1-Fbox E3 ligase complex, the NSs protein promotes the degradation of at least two host proteins, the TFIIH p62 and the PKR proteins. NSs protein bridges the Fbox protein with subsequent substrates, and facilitates the transfer of ubiquitin. The SAP30-YY1 complex also bridges the NSs protein with chromatin DNA, affecting cohesion and segregation of chromatin DNA as well as the activation of interferon-β promoter. The presence of NSs filaments in the nucleus induces DNA damage responses and causes cell-cycle arrest, p53 activation, and apoptosis. Despite the fact that NSs proteins have poor amino acid similarity among bunyaviruses, the strategy utilized to hijack host cells are similar. This review will provide and summarize an update of recent findings pertaining to the biological functions of the NSs protein of RVFV as well as the differences from those of other bunyaviruses.

  2. Immunochemical similarity of GTP-binding proteins from different systems

    International Nuclear Information System (INIS)

    Kalinina, S.N.

    1986-01-01

    It was found that antibodies against the GTP-binding proteins of bovine retinal photoreceptor membranes blocked the inhibitory effect of estradiol on phosphodiesterase from rat and human uterine cytosol and prevented the cumulative effect of catecholamines and guanylyl-5'-imidodiphosphate on rat skeletal muscle adenylate cyclase. It was established by means of double radial immunodiffusion that these antibodies form a precipitating complex with purified bovine brain tubulin as well as with retinal preparations obtained from eyes of the bull, pig, rat, frog, some species of fish, and one reptile species. Bands of precipitation were not observed with these antibodies when retinal preparations from invertebrates (squid and octopus) were used as the antigens. The antibodies obtained interacted with the α- and β-subunits of GTP-binding proteins from bovine retinal photoreceptor membranes

  3. Proteins in similarity relationship with the cluster - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Gclust Server Proteins in similarity relationship with the cluster Data detail Data name Pro...teins in similarity relationship with the cluster DOI 10.18908/lsdba.nbdc00464-003 Description of data conte...s Proteins in similarity relationship with the cluster - Gclust Server | LSDB Archive ...

  4. Electrostatic similarities between protein and small molecule ligands facilitate the design of protein-protein interaction inhibitors.

    Directory of Open Access Journals (Sweden)

    Arnout Voet

    Full Text Available One of the underlying principles in drug discovery is that a biologically active compound is complimentary in shape and molecular recognition features to its receptor. This principle infers that molecules binding to the same receptor may share some common features. Here, we have investigated whether the electrostatic similarity can be used for the discovery of small molecule protein-protein interaction inhibitors (SMPPIIs. We have developed a method that can be used to evaluate the similarity of electrostatic potentials between small molecules and known protein ligands. This method was implemented in a software called EleKit. Analyses of all available (at the time of research SMPPII structures indicate that SMPPIIs bear some similarities of electrostatic potential with the ligand proteins of the same receptor. This is especially true for the more polar SMPPIIs. Retrospective analysis of several successful SMPPIIs has shown the applicability of EleKit in the design of new SMPPIIs.

  5. Consumption of Milk Protein or Whey Protein Results in a Similar Increase in Muscle Protein Synthesis in Middle Aged Men.

    Science.gov (United States)

    Mitchell, Cameron J; McGregor, Robin A; D'Souza, Randall F; Thorstensen, Eric B; Markworth, James F; Fanning, Aaron C; Poppitt, Sally D; Cameron-Smith, David

    2015-10-21

    The differential ability of various milk protein fractions to stimulate muscle protein synthesis (MPS) has been previously described, with whey protein generally considered to be superior to other fractions. However, the relative ability of a whole milk protein to stimulate MPS has not been compared to whey. Sixteen healthy middle-aged males ingested either 20 g of milk protein (n = 8) or whey protein (n = 8) while undergoing a primed constant infusion of ring (13)C₆ phenylalanine. Muscle biopsies were obtained 120 min prior to consumption of the protein and 90 and 210 min afterwards. Resting myofibrillar fractional synthetic rates (FSR) were 0.019% ± 0.009% and 0.021% ± 0.018% h(-1) in the milk and whey groups respectively. For the first 90 min after protein ingestion the FSR increased (p whey groups respectively with no difference between groups (p = 0.810). FSR returned to baseline in both groups between 90 and 210 min after protein ingestion. Despite evidence of increased rate of digestion and leucine availability following the ingestion of whey protein, there was similar activation of MPS in middle-aged men with either 20 g of milk protein or whey protein.

  6. Similar Pathogen Targets in Arabidopsis thaliana and Homo sapiens Protein Networks

    Science.gov (United States)

    2012-09-21

    Similar Pathogen Targets in Arabidopsis thaliana and Homo sapiens Protein Networks Paulo Shakarian1*, J. Kenneth Wickiser2 1 Paulo Shakarian...significantly attacked. Citation: Shakarian P, Wickiser JK (2012) Similar Pathogen Targets in Arabidopsis thaliana and Homo sapiens Protein Networks...to 00-00-2012 4. TITLE AND SUBTITLE Similar Pathogen Targets in Arabidopsis thaliana and Homo sapiens Protein Networks 5a. CONTRACT NUMBER 5b

  7. An approach to large scale identification of non-obvious structural similarities between proteins

    Science.gov (United States)

    Cherkasov, Artem; Jones, Steven JM

    2004-01-01

    Background A new sequence independent bioinformatics approach allowing genome-wide search for proteins with similar three dimensional structures has been developed. By utilizing the numerical output of the sequence threading it establishes putative non-obvious structural similarities between proteins. When applied to the testing set of proteins with known three dimensional structures the developed approach was able to recognize structurally similar proteins with high accuracy. Results The method has been developed to identify pathogenic proteins with low sequence identity and high structural similarity to host analogues. Such protein structure relationships would be hypothesized to arise through convergent evolution or through ancient horizontal gene transfer events, now undetectable using current sequence alignment techniques. The pathogen proteins, which could mimic or interfere with host activities, would represent candidate virulence factors. The developed approach utilizes the numerical outputs from the sequence-structure threading. It identifies the potential structural similarity between a pair of proteins by correlating the threading scores of the corresponding two primary sequences against the library of the standard folds. This approach allowed up to 64% sensitivity and 99.9% specificity in distinguishing protein pairs with high structural similarity. Conclusion Preliminary results obtained by comparison of the genomes of Homo sapiens and several strains of Chlamydia trachomatis have demonstrated the potential usefulness of the method in the identification of bacterial proteins with known or potential roles in virulence. PMID:15147578

  8. An approach to large scale identification of non-obvious structural similarities between proteins

    Directory of Open Access Journals (Sweden)

    Cherkasov Artem

    2004-05-01

    Full Text Available Abstract Background A new sequence independent bioinformatics approach allowing genome-wide search for proteins with similar three dimensional structures has been developed. By utilizing the numerical output of the sequence threading it establishes putative non-obvious structural similarities between proteins. When applied to the testing set of proteins with known three dimensional structures the developed approach was able to recognize structurally similar proteins with high accuracy. Results The method has been developed to identify pathogenic proteins with low sequence identity and high structural similarity to host analogues. Such protein structure relationships would be hypothesized to arise through convergent evolution or through ancient horizontal gene transfer events, now undetectable using current sequence alignment techniques. The pathogen proteins, which could mimic or interfere with host activities, would represent candidate virulence factors. The developed approach utilizes the numerical outputs from the sequence-structure threading. It identifies the potential structural similarity between a pair of proteins by correlating the threading scores of the corresponding two primary sequences against the library of the standard folds. This approach allowed up to 64% sensitivity and 99.9% specificity in distinguishing protein pairs with high structural similarity. Conclusion Preliminary results obtained by comparison of the genomes of Homo sapiens and several strains of Chlamydia trachomatis have demonstrated the potential usefulness of the method in the identification of bacterial proteins with known or potential roles in virulence.

  9. Identification of structural similarities between putative transmission proteins of Polymyxa and Spongospora transmitted bymoviruses and furoviruses.

    Science.gov (United States)

    Dessens, J T; Meyer, M

    1996-01-01

    Comparison of amino acid sequence and hydropathy profiles shows conserved, structural similarities between the capsid readthrough protein of potato mop top virus (transmitted by Spongospora subterranea) and furovirus and bymovirus proteins implicated in transmission by Polymyxa spp. This suggests that these proteins have a common ancestry and are involved in a common biological process: virus transmission by plasmodiophorid fungi.

  10. Functional similarities between the dictyostelium protein AprA and the human protein dipeptidyl-peptidase IV.

    Science.gov (United States)

    Herlihy, Sarah E; Tang, Yu; Phillips, Jonathan E; Gomer, Richard H

    2017-03-01

    Autocrine proliferation repressor protein A (AprA) is a protein secreted by Dictyostelium discoideum cells. Although there is very little sequence similarity between AprA and any human protein, AprA has a predicted structural similarity to the human protein dipeptidyl peptidase IV (DPPIV). AprA is a chemorepellent for Dictyostelium cells, and DPPIV is a chemorepellent for neutrophils. This led us to investigate if AprA and DPPIV have additional functional similarities. We find that like AprA, DPPIV is a chemorepellent for, and inhibits the proliferation of, D. discoideum cells, and that AprA binds some DPPIV binding partners such as fibronectin. Conversely, rAprA has DPPIV-like protease activity. These results indicate a functional similarity between two eukaryotic chemorepellent proteins with very little sequence similarity, and emphasize the usefulness of using a predicted protein structure to search a protein structure database, in addition to searching for proteins with similar sequences. © 2016 The Protein Society.

  11. Functional similarities between the dictyostelium protein AprA and the human protein dipeptidyl‐peptidase IV

    Science.gov (United States)

    Herlihy, Sarah E.; Tang, Yu; Phillips, Jonathan E.

    2017-01-01

    Abstract Autocrine proliferation repressor protein A (AprA) is a protein secreted by Dictyostelium discoideum cells. Although there is very little sequence similarity between AprA and any human protein, AprA has a predicted structural similarity to the human protein dipeptidyl peptidase IV (DPPIV). AprA is a chemorepellent for Dictyostelium cells, and DPPIV is a chemorepellent for neutrophils. This led us to investigate if AprA and DPPIV have additional functional similarities. We find that like AprA, DPPIV is a chemorepellent for, and inhibits the proliferation of, D. discoideum cells, and that AprA binds some DPPIV binding partners such as fibronectin. Conversely, rAprA has DPPIV‐like protease activity. These results indicate a functional similarity between two eukaryotic chemorepellent proteins with very little sequence similarity, and emphasize the usefulness of using a predicted protein structure to search a protein structure database, in addition to searching for proteins with similar sequences. PMID:28028841

  12. Integration of Phenotypic Metadata and Protein Similarity in Archaea Using a Spectral Bipartitioning Approach

    Energy Technology Data Exchange (ETDEWEB)

    Hooper, Sean D.; Anderson, Iain J; Pati, Amrita; Dalevi, Daniel; Mavromatis, Konstantinos; Kyrpides, Nikos C

    2009-01-01

    In order to simplify and meaningfully categorize large sets of protein sequence data, it is commonplace to cluster proteins based on the similarity of those sequences. However, it quickly becomes clear that the sequence flexibility allowed a given protein varies significantly among different protein families. The degree to which sequences are conserved not only differs for each protein family, but also is affected by the phylogenetic divergence of the source organisms. Clustering techniques that use similarity thresholds for protein families do not always allow for these variations and thus cannot be confidently used for applications such as automated annotation and phylogenetic profiling. In this work, we applied a spectral bipartitioning technique to all proteins from 53 archaeal genomes. Comparisons between different taxonomic levels allowed us to study the effects of phylogenetic distances on cluster structure. Likewise, by associating functional annotations and phenotypic metadata with each protein, we could compare our protein similarity clusters with both protein function and associated phenotype. Our clusters can be analyzed graphically and interactively online.

  13. Identification of polycystic ovary syndrome potential drug targets based on pathobiological similarity in the protein-protein interaction network

    OpenAIRE

    Huang, Hao; He, Yuehan; Li, Wan; Wei, Wenqing; Li, Yiran; Xie, Ruiqiang; Guo, Shanshan; Wang, Yahui; Jiang, Jing; Chen, Binbin; Lv, Junjie; Zhang, Nana; Chen, Lina; He, Weiming

    2016-01-01

    Polycystic ovary syndrome (PCOS) is one of the most common endocrinological disorders in reproductive aged women. PCOS and Type 2 Diabetes (T2D) are closely linked in multiple levels and possess high pathobiological similarity. Here, we put forward a new computational approach based on the pathobiological similarity to identify PCOS potential drug target modules (PPDT-Modules) and PCOS potential drug targets in the protein-protein interaction network (PPIN). From the systems level and biologi...

  14. Protein profiling reveals inter-individual protein homogeneity of arachnoid cyst fluid and high qualitative similarity to cerebrospinal fluid

    Directory of Open Access Journals (Sweden)

    Berle Magnus

    2011-05-01

    the majority of abundant proteins in AC fluid also can be found in CSF. Compared to plasma, as many as 104 proteins in AC were not found in the list of 3017 plasma proteins. Conclusions Based on the protein content of AC fluid, our data indicate that temporal AC is a homogenous condition, pointing towards a similar AC filling mechanism for the 14 patients examined. Most of the proteins identified in AC fluid have been identified in CSF, indicating high similarity in the qualitative protein content of AC to CSF, whereas this was not the case between AC and plasma. This indicates that AC is filled with a liquid similar to CSF. As far as we know, this is the first proteomics study that explores the AC fluid proteome.

  15. ProCKSI: a decision support system for Protein (Structure Comparison, Knowledge, Similarity and Information

    Directory of Open Access Journals (Sweden)

    Błażewicz Jacek

    2007-10-01

    Full Text Available Abstract Background We introduce the decision support system for Protein (Structure Comparison, Knowledge, Similarity and Information (ProCKSI. ProCKSI integrates various protein similarity measures through an easy to use interface that allows the comparison of multiple proteins simultaneously. It employs the Universal Similarity Metric (USM, the Maximum Contact Map Overlap (MaxCMO of protein structures and other external methods such as the DaliLite and the TM-align methods, the Combinatorial Extension (CE of the optimal path, and the FAST Align and Search Tool (FAST. Additionally, ProCKSI allows the user to upload a user-defined similarity matrix supplementing the methods mentioned, and computes a similarity consensus in order to provide a rich, integrated, multicriteria view of large datasets of protein structures. Results We present ProCKSI's architecture and workflow describing its intuitive user interface, and show its potential on three distinct test-cases. In the first case, ProCKSI is used to evaluate the results of a previous CASP competition, assessing the similarity of proposed models for given targets where the structures could have a large deviation from one another. To perform this type of comparison reliably, we introduce a new consensus method. The second study deals with the verification of a classification scheme for protein kinases, originally derived by sequence comparison by Hanks and Hunter, but here we use a consensus similarity measure based on structures. In the third experiment using the Rost and Sander dataset (RS126, we investigate how a combination of different sets of similarity measures influences the quality and performance of ProCKSI's new consensus measure. ProCKSI performs well with all three datasets, showing its potential for complex, simultaneous multi-method assessment of structural similarity in large protein datasets. Furthermore, combining different similarity measures is usually more robust than

  16. Dynamics based alignment of proteins: an alternative approach to quantify dynamic similarity

    Directory of Open Access Journals (Sweden)

    Lyngsø Rune

    2010-04-01

    Full Text Available Abstract Background The dynamic motions of many proteins are central to their function. It therefore follows that the dynamic requirements of a protein are evolutionary constrained. In order to assess and quantify this, one needs to compare the dynamic motions of different proteins. Comparing the dynamics of distinct proteins may also provide insight into how protein motions are modified by variations in sequence and, consequently, by structure. The optimal way of comparing complex molecular motions is, however, far from trivial. The majority of comparative molecular dynamics studies performed to date relied upon prior sequence or structural alignment to define which residues were equivalent in 3-dimensional space. Results Here we discuss an alternative methodology for comparative molecular dynamics that does not require any prior alignment information. We show it is possible to align proteins based solely on their dynamics and that we can use these dynamics-based alignments to quantify the dynamic similarity of proteins. Our method was tested on 10 representative members of the PDZ domain family. Conclusions As a result of creating pair-wise dynamics-based alignments of PDZ domains, we have found evolutionarily conserved patterns in their backbone dynamics. The dynamic similarity of PDZ domains is highly correlated with their structural similarity as calculated with Dali. However, significant differences in their dynamics can be detected indicating that sequence has a more refined role to play in protein dynamics than just dictating the overall fold. We suggest that the method should be generally applicable.

  17. The prion protein has DNA strand transfer properties similar to retroviral nucleocapsid protein.

    Science.gov (United States)

    Gabus, C; Auxilien, S; Péchoux, C; Dormont, D; Swietnicki, W; Morillas, M; Surewicz, W; Nandi, P; Darlix, J L

    2001-04-06

    The transmissible spongiform encephalopathies are fatal neurodegenerative diseases that are associated with the accumulation of a protease-resistant form of the cellular prion protein (PrP). Although PrP is highly conserved and widely expressed in vertebrates, its function remains a matter of speculation. Indeed PrP null mice develop normally and are healthy. Recent results show that PrP binds to nucleic acids in vitro and is found associated with retroviral particles. Furthermore, in mice the scrapie infectious process appears to be accelerated by MuLV replication. These observations prompted us to further investigate the interaction between PrP and nucleic acids, and compare it with that of the retroviral nucleocapsid protein (NC). As the major nucleic acid-binding protein of the retroviral particle, NC protein is tightly associated with the genomic RNA in the virion nucleocapsid, where it chaperones proviral DNA synthesis by reverse transcriptase. Our results show that the human prion protein (huPrP) functionally resembles NCp7 of HIV-1. Both proteins form large nucleoprotein complexes upon binding to DNA. They accelerate the hybridization of complementary DNA strands and chaperone viral DNA synthesis during the minus and plus DNA strand transfers necessary to generate the long terminal repeats. The DNA-binding and strand transfer properties of huPrP appear to map to the N-terminal fragment comprising residues 23 to 144, whereas the C-terminal domain is inactive. These findings suggest that PrP could be involved in nucleic acid metabolism in vivo. Copyright 2001 Academic Press.

  18. Quality assessment of protein model-structures based on structural and functional similarities.

    Science.gov (United States)

    Konopka, Bogumil M; Nebel, Jean-Christophe; Kotulska, Malgorzata

    2012-09-21

    Experimental determination of protein 3D structures is expensive, time consuming and sometimes impossible. A gap between number of protein structures deposited in the World Wide Protein Data Bank and the number of sequenced proteins constantly broadens. Computational modeling is deemed to be one of the ways to deal with the problem. Although protein 3D structure prediction is a difficult task, many tools are available. These tools can model it from a sequence or partial structural information, e.g. contact maps. Consequently, biologists have the ability to generate automatically a putative 3D structure model of any protein. However, the main issue becomes evaluation of the model quality, which is one of the most important challenges of structural biology. GOBA--Gene Ontology-Based Assessment is a novel Protein Model Quality Assessment Program. It estimates the compatibility between a model-structure and its expected function. GOBA is based on the assumption that a high quality model is expected to be structurally similar to proteins functionally similar to the prediction target. Whereas DALI is used to measure structure similarity, protein functional similarity is quantified using standardized and hierarchical description of proteins provided by Gene Ontology combined with Wang's algorithm for calculating semantic similarity. Two approaches are proposed to express the quality of protein model-structures. One is a single model quality assessment method, the other is its modification, which provides a relative measure of model quality. Exhaustive evaluation is performed on data sets of model-structures submitted to the CASP8 and CASP9 contests. The validation shows that the method is able to discriminate between good and bad model-structures. The best of tested GOBA scores achieved 0.74 and 0.8 as a mean Pearson correlation to the observed quality of models in our CASP8 and CASP9-based validation sets. GOBA also obtained the best result for two targets of CASP8, and

  19. Adhesive proteins of stalked and acorn barnacles display homology with low sequence similarities.

    Directory of Open Access Journals (Sweden)

    Jaimie-Leigh Jonker

    Full Text Available Barnacle adhesion underwater is an important phenomenon to understand for the prevention of biofouling and potential biotechnological innovations, yet so far, identifying what makes barnacle glue proteins 'sticky' has proved elusive. Examination of a broad range of species within the barnacles may be instructive to identify conserved adhesive domains. We add to extensive information from the acorn barnacles (order Sessilia by providing the first protein analysis of a stalked barnacle adhesive, Lepas anatifera (order Lepadiformes. It was possible to separate the L. anatifera adhesive into at least 10 protein bands using SDS-PAGE. Intense bands were present at approximately 30, 70, 90 and 110 kilodaltons (kDa. Mass spectrometry for protein identification was followed by de novo sequencing which detected 52 peptides of 7-16 amino acids in length. None of the peptides matched published or unpublished transcriptome sequences, but some amino acid sequence similarity was apparent between L. anatifera and closely-related Dosima fascicularis. Antibodies against two acorn barnacle proteins (ab-cp-52k and ab-cp-68k showed cross-reactivity in the adhesive glands of L. anatifera. We also analysed the similarity of adhesive proteins across several barnacle taxa, including Pollicipes pollicipes (a stalked barnacle in the order Scalpelliformes. Sequence alignment of published expressed sequence tags clearly indicated that P. pollicipes possesses homologues for the 19 kDa and 100 kDa proteins in acorn barnacles. Homology aside, sequence similarity in amino acid and gene sequences tended to decline as taxonomic distance increased, with minimum similarities of 18-26%, depending on the gene. The results indicate that some adhesive proteins (e.g. 100 kDa are more conserved within barnacles than others (20 kDa.

  20. Improving Classification of Protein Interaction Articles Using Context Similarity-Based Feature Selection.

    Science.gov (United States)

    Chen, Yifei; Sun, Yuxing; Han, Bing-Qing

    2015-01-01

    Protein interaction article classification is a text classification task in the biological domain to determine which articles describe protein-protein interactions. Since the feature space in text classification is high-dimensional, feature selection is widely used for reducing the dimensionality of features to speed up computation without sacrificing classification performance. Many existing feature selection methods are based on the statistical measure of document frequency and term frequency. One potential drawback of these methods is that they treat features separately. Hence, first we design a similarity measure between the context information to take word cooccurrences and phrase chunks around the features into account. Then we introduce the similarity of context information to the importance measure of the features to substitute the document and term frequency. Hence we propose new context similarity-based feature selection methods. Their performance is evaluated on two protein interaction article collections and compared against the frequency-based methods. The experimental results reveal that the context similarity-based methods perform better in terms of the F1 measure and the dimension reduction rate. Benefiting from the context information surrounding the features, the proposed methods can select distinctive features effectively for protein interaction article classification.

  1. Local-global alignment for finding 3D similarities in protein structures

    Science.gov (United States)

    Zemla, Adam T [Brentwood, CA

    2011-09-20

    A method of finding 3D similarities in protein structures of a first molecule and a second molecule. The method comprises providing preselected information regarding the first molecule and the second molecule. Comparing the first molecule and the second molecule using Longest Continuous Segments (LCS) analysis. Comparing the first molecule and the second molecule using Global Distance Test (GDT) analysis. Comparing the first molecule and the second molecule using Local Global Alignment Scoring function (LGA_S) analysis. Verifying constructed alignment and repeating the steps to find the regions of 3D similarities in protein structures.

  2. Detecting Local Ligand-Binding Site Similarity in Non-Homologous Proteins by Surface Patch Comparison

    Science.gov (United States)

    Sael, Lee; Kihara, Daisuke

    2012-01-01

    Functional elucidation of proteins is one of the essential tasks in biology. Function of a protein, specifically, small ligand molecules that bind to a protein, can be predicted by finding similar local surface regions in binding sites of known proteins. Here, we developed an alignment free local surface comparison method for predicting a ligand molecule which binds to a query protein. The algorithm, named Patch-Surfer, represents a binding pocket as a combination of segmented surface patches, each of which is characterized by its geometrical shape, the electrostatic potential, the hydrophobicity, and the concaveness. Representing a pocket by a set of patches is effective to absorb difference of global pocket shape while capturing local similarity of pockets. The shape and the physicochemical properties of surface patches are represented using the 3D Zernike descriptor, which is a series expansion of mathematical 3D function. Two pockets are compared using a modified weighted bipartite matching algorithm, which matches similar patches from the two pockets. Patch-Surfer was benchmarked on three datasets, which consist in total of 390 proteins that bind to one of 21 ligands. Patch-Surfer showed superior performance to existing methods including a global pocket comparison method, Pocket-Surfer, which we have previously introduced. Particularly, as intended, the accuracy showed large improvement for flexible ligand molecules, which bind to pockets in different conformations. PMID:22275074

  3. Detecting local ligand-binding site similarity in nonhomologous proteins by surface patch comparison.

    Science.gov (United States)

    Sael, Lee; Kihara, Daisuke

    2012-04-01

    Functional elucidation of proteins is one of the essential tasks in biology. Function of a protein, specifically, small ligand molecules that bind to a protein, can be predicted by finding similar local surface regions in binding sites of known proteins. Here, we developed an alignment free local surface comparison method for predicting a ligand molecule which binds to a query protein. The algorithm, named Patch-Surfer, represents a binding pocket as a combination of segmented surface patches, each of which is characterized by its geometrical shape, the electrostatic potential, the hydrophobicity, and the concaveness. Representing a pocket by a set of patches is effective to absorb difference of global pocket shape while capturing local similarity of pockets. The shape and the physicochemical properties of surface patches are represented using the 3D Zernike descriptor, which is a series expansion of mathematical 3D function. Two pockets are compared using a modified weighted bipartite matching algorithm, which matches similar patches from the two pockets. Patch-Surfer was benchmarked on three datasets, which consist in total of 390 proteins that bind to one of 21 ligands. Patch-Surfer showed superior performance to existing methods including a global pocket comparison method, Pocket-Surfer, which we have previously introduced. Particularly, as intended, the accuracy showed large improvement for flexible ligand molecules, which bind to pockets in different conformations. Copyright © 2011 Wiley Periodicals, Inc.

  4. Fast protein tertiary structure retrieval based on global surface shape similarity.

    Science.gov (United States)

    Sael, Lee; Li, Bin; La, David; Fang, Yi; Ramani, Karthik; Rustamov, Raif; Kihara, Daisuke

    2008-09-01

    Characterization and identification of similar tertiary structure of proteins provides rich information for investigating function and evolution. The importance of structure similarity searches is increasing as structure databases continue to expand, partly due to the structural genomics projects. A crucial drawback of conventional protein structure comparison methods, which compare structures by their main-chain orientation or the spatial arrangement of secondary structure, is that a database search is too slow to be done in real-time. Here we introduce a global surface shape representation by three-dimensional (3D) Zernike descriptors, which represent a protein structure compactly as a series expansion of 3D functions. With this simplified representation, the search speed against a few thousand structures takes less than a minute. To investigate the agreement between surface representation defined by 3D Zernike descriptor and conventional main-chain based representation, a benchmark was performed against a protein classification generated by the combinatorial extension algorithm. Despite the different representation, 3D Zernike descriptor retrieved proteins of the same conformation defined by combinatorial extension in 89.6% of the cases within the top five closest structures. The real-time protein structure search by 3D Zernike descriptor will open up new possibility of large-scale global and local protein surface shape comparison. 2008 Wiley-Liss, Inc.

  5. Using sequence similarity networks for visualization of relationships across diverse protein superfamilies.

    Directory of Open Access Journals (Sweden)

    Holly J Atkinson

    Full Text Available The dramatic increase in heterogeneous types of biological data--in particular, the abundance of new protein sequences--requires fast and user-friendly methods for organizing this information in a way that enables functional inference. The most widely used strategy to link sequence or structure to function, homology-based function prediction, relies on the fundamental assumption that sequence or structural similarity implies functional similarity. New tools that extend this approach are still urgently needed to associate sequence data with biological information in ways that accommodate the real complexity of the problem, while being accessible to experimental as well as computational biologists. To address this, we have examined the application of sequence similarity networks for visualizing functional trends across protein superfamilies from the context of sequence similarity. Using three large groups of homologous proteins of varying types of structural and functional diversity--GPCRs and kinases from humans, and the crotonase superfamily of enzymes--we show that overlaying networks with orthogonal information is a powerful approach for observing functional themes and revealing outliers. In comparison to other primary methods, networks provide both a good representation of group-wise sequence similarity relationships and a strong visual and quantitative correlation with phylogenetic trees, while enabling analysis and visualization of much larger sets of sequences than trees or multiple sequence alignments can easily accommodate. We also define important limitations and caveats in the application of these networks. As a broadly accessible and effective tool for the exploration of protein superfamilies, sequence similarity networks show great potential for generating testable hypotheses about protein structure-function relationships.

  6. Using sequence similarity networks for visualization of relationships across diverse protein superfamilies.

    Science.gov (United States)

    Atkinson, Holly J; Morris, John H; Ferrin, Thomas E; Babbitt, Patricia C

    2009-01-01

    The dramatic increase in heterogeneous types of biological data--in particular, the abundance of new protein sequences--requires fast and user-friendly methods for organizing this information in a way that enables functional inference. The most widely used strategy to link sequence or structure to function, homology-based function prediction, relies on the fundamental assumption that sequence or structural similarity implies functional similarity. New tools that extend this approach are still urgently needed to associate sequence data with biological information in ways that accommodate the real complexity of the problem, while being accessible to experimental as well as computational biologists. To address this, we have examined the application of sequence similarity networks for visualizing functional trends across protein superfamilies from the context of sequence similarity. Using three large groups of homologous proteins of varying types of structural and functional diversity--GPCRs and kinases from humans, and the crotonase superfamily of enzymes--we show that overlaying networks with orthogonal information is a powerful approach for observing functional themes and revealing outliers. In comparison to other primary methods, networks provide both a good representation of group-wise sequence similarity relationships and a strong visual and quantitative correlation with phylogenetic trees, while enabling analysis and visualization of much larger sets of sequences than trees or multiple sequence alignments can easily accommodate. We also define important limitations and caveats in the application of these networks. As a broadly accessible and effective tool for the exploration of protein superfamilies, sequence similarity networks show great potential for generating testable hypotheses about protein structure-function relationships.

  7. Statistical potential-based amino acid similarity matrices for aligning distantly related protein sequences.

    Science.gov (United States)

    Tan, Yen Hock; Huang, He; Kihara, Daisuke

    2006-08-15

    Aligning distantly related protein sequences is a long-standing problem in bioinformatics, and a key for successful protein structure prediction. Its importance is increasing recently in the context of structural genomics projects because more and more experimentally solved structures are available as templates for protein structure modeling. Toward this end, recent structure prediction methods employ profile-profile alignments, and various ways of aligning two profiles have been developed. More fundamentally, a better amino acid similarity matrix can improve a profile itself; thereby resulting in more accurate profile-profile alignments. Here we have developed novel amino acid similarity matrices from knowledge-based amino acid contact potentials. Contact potentials are used because the contact propensity to the other amino acids would be one of the most conserved features of each position of a protein structure. The derived amino acid similarity matrices are tested on benchmark alignments at three different levels, namely, the family, the superfamily, and the fold level. Compared to BLOSUM45 and the other existing matrices, the contact potential-based matrices perform comparably in the family level alignments, but clearly outperform in the fold level alignments. The contact potential-based matrices perform even better when suboptimal alignments are considered. Comparing the matrices themselves with each other revealed that the contact potential-based matrices are very different from BLOSUM45 and the other matrices, indicating that they are located in a different basin in the amino acid similarity matrix space.

  8. Structural similarity-based predictions of protein interactions between HIV-1 and Homo sapiens

    Directory of Open Access Journals (Sweden)

    Gomez Shawn M

    2010-04-01

    Full Text Available Abstract Background In the course of infection, viruses such as HIV-1 must enter a cell, travel to sites where they can hijack host machinery to transcribe their genes and translate their proteins, assemble, and then leave the cell again, all while evading the host immune system. Thus, successful infection depends on the pathogen's ability to manipulate the biological pathways and processes of the organism it infects. Interactions between HIV-encoded and human proteins provide one means by which HIV-1 can connect into cellular pathways to carry out these survival processes. Results We developed and applied a computational approach to predict interactions between HIV and human proteins based on structural similarity of 9 HIV-1 proteins to human proteins having known interactions. Using functional data from RNAi studies as a filter, we generated over 2000 interaction predictions between HIV proteins and 406 unique human proteins. Additional filtering based on Gene Ontology cellular component annotation reduced the number of predictions to 502 interactions involving 137 human proteins. We find numerous known interactions as well as novel interactions showing significant functional relevance based on supporting Gene Ontology and literature evidence. Conclusions Understanding the interplay between HIV-1 and its human host will help in understanding the viral lifecycle and the ways in which this virus is able to manipulate its host. The results shown here provide a potential set of interactions that are amenable to further experimental manipulation as well as potential targets for therapeutic intervention.

  9. Similar pathogen targets in Arabidopsis thaliana and homo sapiens protein networks.

    Directory of Open Access Journals (Sweden)

    Paulo Shakarian

    Full Text Available We study the behavior of pathogens on host protein networks for humans and Arabidopsis - noting striking similarities. Specifically, we preform [Formula: see text]-shell decomposition analysis on these networks - which groups the proteins into various "shells" based on network structure. We observe that shells with a higher average degree are more highly targeted (with a power-law relationship and that highly targeted nodes lie in shells closer to the inner-core of the network. Additionally, we also note that the inner core of the network is significantly under-targeted. We show that these core proteins may have a role in intra-cellular communication and hypothesize that they are less attacked to ensure survival of the host. This may explain why certain high-degree proteins are not significantly attacked.

  10. Similarity of salt influences on the pH of buffers, polyelectrolytes, and proteins.

    Science.gov (United States)

    Voinescu, Alina E; Bauduin, Pierre; Pinna, M Cristina; Touraud, Didier; Ninham, Barry W; Kunz, Werner

    2006-05-04

    Changes in pH induced by the addition of electrolytes to buffers, polyelectrolytes (a polycarboxy polymethylene and a polyethyleneimine), and proteins (casein, whey, and lysozyme) solutions are explored systematically. The two buffer systems are triethanolamine/triethanolammonium chloride and citric acid/sodium citrate. These are chosen because of the similarity of their acid-base equilibria with those of amino acids predominant in most proteins, that is, amino acids that include carboxylate or ammonium groups in their structures. The pH of triethanolamine and of citrate buffers respectively increases and decreases when salt is added. At low electrolyte concentrations (buffer solutions. It is even possible to qualitatively predict these changes in protein solutions simply from the primary protein structure. At least in the systems considered here, the specific ion effects on pH seem to correlate with the bulk activity coefficients of the added electrolytes, at least at moderate salt concentrations.

  11. Searching the protein structure database for ligand-binding site similarities using CPASS v.2

    Directory of Open Access Journals (Sweden)

    Caprez Adam

    2011-01-01

    Full Text Available Abstract Background A recent analysis of protein sequences deposited in the NCBI RefSeq database indicates that ~8.5 million protein sequences are encoded in prokaryotic and eukaryotic genomes, where ~30% are explicitly annotated as "hypothetical" or "uncharacterized" protein. Our Comparison of Protein Active-Site Structures (CPASS v.2 database and software compares the sequence and structural characteristics of experimentally determined ligand binding sites to infer a functional relationship in the absence of global sequence or structure similarity. CPASS is an important component of our Functional Annotation Screening Technology by NMR (FAST-NMR protocol and has been successfully applied to aid the annotation of a number of proteins of unknown function. Findings We report a major upgrade to our CPASS software and database that significantly improves its broad utility. CPASS v.2 is designed with a layered architecture to increase flexibility and portability that also enables job distribution over the Open Science Grid (OSG to increase speed. Similarly, the CPASS interface was enhanced to provide more user flexibility in submitting a CPASS query. CPASS v.2 now allows for both automatic and manual definition of ligand-binding sites and permits pair-wise, one versus all, one versus list, or list versus list comparisons. Solvent accessible surface area, ligand root-mean square difference, and Cβ distances have been incorporated into the CPASS similarity function to improve the quality of the results. The CPASS database has also been updated. Conclusions CPASS v.2 is more than an order of magnitude faster than the original implementation, and allows for multiple simultaneous job submissions. Similarly, the CPASS database of ligand-defined binding sites has increased in size by ~ 38%, dramatically increasing the likelihood of a positive search result. The modification to the CPASS similarity function is effective in reducing CPASS similarity scores

  12. Functionally Similar WRKY Proteins Regulate Vacuolar Acidification in Petunia and Hair Development in Arabidopsis.

    Science.gov (United States)

    Verweij, Walter; Spelt, Cornelis E; Bliek, Mattijs; de Vries, Michel; Wit, Niek; Faraco, Marianna; Koes, Ronald; Quattrocchio, Francesca M

    2016-03-01

    The WD40 proteins ANTHOCYANIN11 (AN11) from petunia (Petunia hybrida) and TRANSPARENT TESTA GLABRA1 (TTG1) from Arabidopsis thaliana and associated basic helix-loop-helix (bHLH) and MYB transcription factors activate a variety of differentiation processes. In petunia petals, AN11 and the bHLH protein AN1 activate, together with the MYB protein AN2, anthocyanin biosynthesis and, together with the MYB protein PH4, distinct genes, such as PH1 and PH5, that acidify the vacuole. To understand how AN1 and AN11 activate anthocyanin biosynthetic and PH genes independently, we isolated PH3. We found that PH3 is a target gene of the AN11-AN1-PH4 complex and encodes a WRKY protein that can bind to AN11 and is required, in a feed-forward loop, together with AN11-AN1-PH4 for transcription of PH5. PH3 is highly similar to TTG2, which regulates hair development, tannin accumulation, and mucilage production in Arabidopsis. Like PH3, TTG2 can bind to petunia AN11 and the Arabidopsis homolog TTG1, complement ph3 in petunia, and reactivate the PH3 target gene PH5. Our findings show that the specificity of WD40-bHLH-MYB complexes is in part determined by interacting proteins, such as PH3 and TTG2, and reveal an unanticipated similarity in the regulatory circuitry that controls petunia vacuolar acidification and Arabidopsis hair development. © 2016 American Society of Plant Biologists. All rights reserved.

  13. Cloud4Psi: cloud computing for 3D protein structure similarity searching.

    Science.gov (United States)

    Mrozek, Dariusz; Małysiak-Mrozek, Bożena; Kłapciński, Artur

    2014-10-01

    Popular methods for 3D protein structure similarity searching, especially those that generate high-quality alignments such as Combinatorial Extension (CE) and Flexible structure Alignment by Chaining Aligned fragment pairs allowing Twists (FATCAT) are still time consuming. As a consequence, performing similarity searching against large repositories of structural data requires increased computational resources that are not always available. Cloud computing provides huge amounts of computational power that can be provisioned on a pay-as-you-go basis. We have developed the cloud-based system that allows scaling of the similarity searching process vertically and horizontally. Cloud4Psi (Cloud for Protein Similarity) was tested in the Microsoft Azure cloud environment and provided good, almost linearly proportional acceleration when scaled out onto many computational units. Cloud4Psi is available as Software as a Service for testing purposes at: http://cloud4psi.cloudapp.net/. For source code and software availability, please visit the Cloud4Psi project home page at http://zti.polsl.pl/dmrozek/science/cloud4psi.htm. © The Author 2014. Published by Oxford University Press.

  14. Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles.

    Science.gov (United States)

    Zhu, Jie; Qin, Yufang; Liu, Taigang; Wang, Jun; Zheng, Xiaoqi

    2013-01-01

    Identification of gene-phenotype relationships is a fundamental challenge in human health clinic. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, a lot of network-based approaches were proposed based on different underlying models. A recent comparative study showed that diffusion-based methods achieve the state-of-the-art predictive performance. In this paper, a new diffusion-based method was proposed to prioritize candidate disease genes. Diffusion profile of a disease was defined as the stationary distribution of candidate genes given a random walk with restart where similarities between phenotypes are incorporated. Then, candidate disease genes are prioritized by comparing their diffusion profiles with that of the disease. Finally, the effectiveness of our method was demonstrated through the leave-one-out cross-validation against control genes from artificial linkage intervals and randomly chosen genes. Comparative study showed that our method achieves improved performance compared to some classical diffusion-based methods. To further illustrate our method, we used our algorithm to predict new causing genes of 16 multifactorial diseases including Prostate cancer and Alzheimer's disease, and the top predictions were in good consistent with literature reports. Our study indicates that integration of multiple information sources, especially the phenotype similarity profile data, and introduction of global similarity measure between disease and gene diffusion profiles are helpful for prioritizing candidate disease genes. Programs and data are available upon request.

  15. Decoding the Divergent Subcellular Location of Two Highly Similar Paralogous LEA Proteins

    Directory of Open Access Journals (Sweden)

    Marie-Hélène Avelange-Macherel

    2018-05-01

    Full Text Available Many mitochondrial proteins are synthesized as precursors in the cytosol with an N-terminal mitochondrial targeting sequence (MTS which is cleaved off upon import. Although much is known about import mechanisms and MTS structural features, the variability of MTS still hampers robust sub-cellular software predictions. Here, we took advantage of two paralogous late embryogenesis abundant proteins (LEA from Arabidopsis with different subcellular locations to investigate structural determinants of mitochondrial import and gain insight into the evolution of the LEA genes. LEA38 and LEA2 are short proteins of the LEA_3 family, which are very similar along their whole sequence, but LEA38 is targeted to mitochondria while LEA2 is cytosolic. Differences in the N-terminal protein sequences were used to generate a series of mutated LEA2 which were expressed as GFP-fusion proteins in leaf protoplasts. By combining three types of mutation (substitution, charge inversion, and segment replacement, we were able to redirect the mutated LEA2 to mitochondria. Analysis of the effect of the mutations and determination of the LEA38 MTS cleavage site highlighted important structural features within and beyond the MTS. Overall, these results provide an explanation for the likely loss of mitochondrial location after duplication of the ancestral gene.

  16. Pre- versus post-exercise protein intake has similar effects on muscular adaptations.

    Science.gov (United States)

    Schoenfeld, Brad Jon; Aragon, Alan; Wilborn, Colin; Urbina, Stacie L; Hayward, Sara E; Krieger, James

    2017-01-01

    The purpose of this study was to test the anabolic window theory by investigating muscle strength, hypertrophy, and body composition changes in response to an equal dose of protein consumed either immediately pre- versus post-resistance training (RT) in trained men. Subjects were 21 resistance-trained men (>1 year RT experience) recruited from a university population. After baseline testing, participants were randomly assigned to 1 of 2 experimental groups: a group that consumed a supplement containing 25 g protein and 1 g carbohydrate immediately prior to exercise (PRE-SUPP) ( n  = 9) or a group that consumed the same supplement immediately post-exercise (POST-SUPP) ( n  = 12). The RT protocol consisted of three weekly sessions performed on non-consecutive days for 10 weeks. A total-body routine was employed with three sets of 8-12 repetitions for each exercise. Results showed that pre- and post-workout protein consumption had similar effects on all measures studied ( p  > 0.05). These findings refute the contention of a narrow post-exercise anabolic window to maximize the muscular response and instead lends support to the theory that the interval for protein intake may be as wide as several hours or perhaps more after a training bout depending on when the pre-workout meal was consumed.

  17. Pre- versus post-exercise protein intake has similar effects on muscular adaptations

    Directory of Open Access Journals (Sweden)

    Brad Jon Schoenfeld

    2017-01-01

    Full Text Available The purpose of this study was to test the anabolic window theory by investigating muscle strength, hypertrophy, and body composition changes in response to an equal dose of protein consumed either immediately pre- versus post-resistance training (RT in trained men. Subjects were 21 resistance-trained men (>1 year RT experience recruited from a university population. After baseline testing, participants were randomly assigned to 1 of 2 experimental groups: a group that consumed a supplement containing 25 g protein and 1 g carbohydrate immediately prior to exercise (PRE-SUPP (n = 9 or a group that consumed the same supplement immediately post-exercise (POST-SUPP (n = 12. The RT protocol consisted of three weekly sessions performed on non-consecutive days for 10 weeks. A total-body routine was employed with three sets of 8–12 repetitions for each exercise. Results showed that pre- and post-workout protein consumption had similar effects on all measures studied (p > 0.05. These findings refute the contention of a narrow post-exercise anabolic window to maximize the muscular response and instead lends support to the theory that the interval for protein intake may be as wide as several hours or perhaps more after a training bout depending on when the pre-workout meal was consumed.

  18. Ice Shaping Properties, Similar to That of Antifreeze Proteins, of a Zirconium Acetate Complex

    Science.gov (United States)

    Deville, Sylvain; Viazzi, Céline; Leloup, Jérôme; Lasalle, Audrey; Guizard, Christian; Maire, Eric; Adrien, Jérôme; Gremillard, Laurent

    2011-01-01

    The control of the growth morphologies of ice crystals is a critical issue in fields as diverse as biomineralization, medicine, biology, civil or food engineering. Such control can be achieved through the ice-shaping properties of specific compounds. The development of synthetic ice-shaping compounds is inspired by the natural occurrence of such properties exhibited by antifreeze proteins. We reveal how a particular zirconium acetate complex is exhibiting ice-shaping properties very similar to that of antifreeze proteins, albeit being a radically different compound. We use these properties as a bioinspired approach to template unique faceted pores in cellular materials. These results suggest that ice-structuring properties are not exclusive to long organic molecules and should broaden the field of investigations and applications of such substances. PMID:22028886

  19. On the Power and Limits of Sequence Similarity Based Clustering of Proteins Into Families

    DEFF Research Database (Denmark)

    Wiwie, Christian; Röttger, Richard

    2017-01-01

    Over the last decades, we have observed an ongoing tremendous growth of available sequencing data fueled by the advancements in wet-lab technology. The sequencing information is only the beginning of the actual understanding of how organisms survive and prosper. It is, for instance, equally...... important to also unravel the proteomic repertoire of an organism. A classical computational approach for detecting protein families is a sequence-based similarity calculation coupled with a subsequent cluster analysis. In this work we have intensively analyzed various clustering tools on a large scale. We...... used the data to investigate the behavior of the tools' parameters underlining the diversity of the protein families. Furthermore, we trained regression models for predicting the expected performance of a clustering tool for an unknown data set and aimed to also suggest optimal parameters...

  20. Ice shaping properties, similar to that of antifreeze proteins, of a zirconium acetate complex.

    Directory of Open Access Journals (Sweden)

    Sylvain Deville

    Full Text Available The control of the growth morphologies of ice crystals is a critical issue in fields as diverse as biomineralization, medicine, biology, civil or food engineering. Such control can be achieved through the ice-shaping properties of specific compounds. The development of synthetic ice-shaping compounds is inspired by the natural occurrence of such properties exhibited by antifreeze proteins. We reveal how a particular zirconium acetate complex is exhibiting ice-shaping properties very similar to that of antifreeze proteins, albeit being a radically different compound. We use these properties as a bioinspired approach to template unique faceted pores in cellular materials. These results suggest that ice-structuring properties are not exclusive to long organic molecules and should broaden the field of investigations and applications of such substances.

  1. Identification of polycystic ovary syndrome potential drug targets based on pathobiological similarity in the protein-protein interaction network

    Science.gov (United States)

    Li, Wan; Wei, Wenqing; Li, Yiran; Xie, Ruiqiang; Guo, Shanshan; Wang, Yahui; Jiang, Jing; Chen, Binbin; Lv, Junjie; Zhang, Nana; Chen, Lina; He, Weiming

    2016-01-01

    Polycystic ovary syndrome (PCOS) is one of the most common endocrinological disorders in reproductive aged women. PCOS and Type 2 Diabetes (T2D) are closely linked in multiple levels and possess high pathobiological similarity. Here, we put forward a new computational approach based on the pathobiological similarity to identify PCOS potential drug target modules (PPDT-Modules) and PCOS potential drug targets in the protein-protein interaction network (PPIN). From the systems level and biological background, 1 PPDT-Module and 22 PCOS potential drug targets were identified, 21 of which were verified by literatures to be associated with the pathogenesis of PCOS. 42 drugs targeting to 13 PCOS potential drug targets were investigated experimentally or clinically for PCOS. Evaluated by independent datasets, the whole PPDT-Module and 22 PCOS potential drug targets could not only reveal the drug response, but also distinguish the statuses between normal and disease. Our identified PPDT-Module and PCOS potential drug targets would shed light on the treatment of PCOS. And our approach would provide valuable insights to research on the pathogenesis and drug response of other diseases. PMID:27191267

  2. Geomfinder: a multi-feature identifier of similar three-dimensional protein patterns: a ligand-independent approach.

    Science.gov (United States)

    Núñez-Vivanco, Gabriel; Valdés-Jiménez, Alejandro; Besoaín, Felipe; Reyes-Parada, Miguel

    2016-01-01

    Since the structure of proteins is more conserved than the sequence, the identification of conserved three-dimensional (3D) patterns among a set of proteins, can be important for protein function prediction, protein clustering, drug discovery and the establishment of evolutionary relationships. Thus, several computational applications to identify, describe and compare 3D patterns (or motifs) have been developed. Often, these tools consider a 3D pattern as that described by the residues surrounding co-crystallized/docked ligands available from X-ray crystal structures or homology models. Nevertheless, many of the protein structures stored in public databases do not provide information about the location and characteristics of ligand binding sites and/or other important 3D patterns such as allosteric sites, enzyme-cofactor interaction motifs, etc. This makes necessary the development of new ligand-independent methods to search and compare 3D patterns in all available protein structures. Here we introduce Geomfinder, an intuitive, flexible, alignment-free and ligand-independent web server for detailed estimation of similarities between all pairs of 3D patterns detected in any two given protein structures. We used around 1100 protein structures to form pairs of proteins which were assessed with Geomfinder. In these analyses each protein was considered in only one pair (e.g. in a subset of 100 different proteins, 50 pairs of proteins can be defined). Thus: (a) Geomfinder detected identical pairs of 3D patterns in a series of monoamine oxidase-B structures, which corresponded to the effectively similar ligand binding sites at these proteins; (b) we identified structural similarities among pairs of protein structures which are targets of compounds such as acarbose, benzamidine, adenosine triphosphate and pyridoxal phosphate; these similar 3D patterns are not detected using sequence-based methods; (c) the detailed evaluation of three specific cases showed the versatility

  3. Bidirectional gene sequences with similar homology to functional proteins of alkane degrading bacterium pseudomonas fredriksbergensis DNA

    International Nuclear Information System (INIS)

    Megeed, A.A.

    2011-01-01

    The potential for two overlapping fragments of DNA from a clone of newly isolated alkanes degrading bacterium Pseudomonas frederiksbergensis encoding sequences with similar homology to two parts of functional proteins is described. One strand contains a sequence with high homology to alkanes monooxygenase (alkB), a member of the alkanes hydroxylase family, and the other strand contains a sequence with some homology to alcohol dehydrogenase gene (alkJ). Overlapping of the genes on opposite strands has been reported in eukaryotic species, and is now reported in a bacterial species. The sequence comparisons and ORFS results revealed that the regulation and the genes organization involved in alkane oxidation represented in Pseudomonas frederiksberghensis varies among the different known alkane degrading bacteria. The alk gene cluster containing homologues to the known alkane monooxygenase (alkB), and rubredoxin (alkG) are oriented in the same direction, whereas alcohol dehydrogenase (alkJ) is oriented in the opposite direction. Such genomes encode messages on both strands of the DNA, or in an overlapping but different reading frames, of the same strand of DNA. The possibility of creating novel genes from pre-existing sequences, known as overprinting, which is a widespread phenomenon in small viruses. Here, the origin and evolution of the gene overlap to bacteriophages belonging to the family Microviridae have been investigated. Such a phenomenon is most widely described in extremely small genomes such as those of viruses or small plasmids, yet here is a unique phenomenon. (author)

  4. Design of compound libraries based on natural product scaffolds and protein structure similarity clustering (PSSC)

    NARCIS (Netherlands)

    Balamurugan, Rengarajan; Dekker, Frank J; Waldmann, Herbert; Dekker, Frans

    Recent advances in structural biology, bioinformatics and combinatorial chemistry have significantly impacted the discovery of small molecules that modulate protein functions. Natural products which have evolved to bind to proteins may serve as biologically validated starting points for the design

  5. MOCASSIN-prot: A multi-objective clustering approach for protein similarity networks

    Science.gov (United States)

    Motivation: Proteins often include multiple conserved domains. Various evolutionary events including duplication and loss of domains, domain shuffling, as well as sequence divergence contribute to generating complexities in protein structures, and consequently, in their functions. The evolutionary h...

  6. Right- and left-handed three-helix proteins. II. Similarity and differences in mechanical unfolding of proteins.

    Science.gov (United States)

    Glyakina, Anna V; Likhachev, Ilya V; Balabaev, Nikolay K; Galzitskaya, Oxana V

    2014-01-01

    Here, we study mechanical properties of eight 3-helix proteins (four right-handed and four left-handed ones), which are similar in size under stretching at a constant speed and at a constant force on the atomic level using molecular dynamics simulations. The analysis of 256 trajectories from molecular dynamics simulations with explicit water showed that the right-handed three-helix domains are more mechanically resistant than the left-handed domains. Such results are observed at different extension velocities studied (192 trajectories obtained at the following conditions: v = 0.1, 0.05, and 0.01 Å ps(-1) , T = 300 K) and under constant stretching force (64 trajectories, F = 800 pN, T = 300 K). We can explain this by the fact, at least in part, that the right-handed domains have a larger number of contacts per residue and the radius of cross section than the left-handed domains. Copyright © 2013 Wiley Periodicals, Inc.

  7. Evaluation of GO-based functional similarity measures using S. cerevisiae protein interaction and expression profile data

    Directory of Open Access Journals (Sweden)

    Du LinFang

    2008-11-01

    Full Text Available Abstract Background Researchers interested in analysing the expression patterns of functionally related genes usually hope to improve the accuracy of their results beyond the boundaries of currently available experimental data. Gene ontology (GO data provides a novel way to measure the functional relationship between gene products. Many approaches have been reported for calculating the similarities between two GO terms, known as semantic similarities. However, biologists are more interested in the relationship between gene products than in the scores linking the GO terms. To highlight the relationships among genes, recent studies have focused on functional similarities. Results In this study, we evaluated five functional similarity methods using both protein-protein interaction (PPI and expression data of S. cerevisiae. The receiver operating characteristics (ROC and correlation coefficient analysis of these methods showed that the maximum method outperformed the other methods. Statistical comparison of multiple- and single-term annotated proteins in biological process ontology indicated that genes with multiple GO terms may be more reliable for separating true positives from noise. Conclusion This study demonstrated the reliability of current approaches that elevate the similarity of GO terms to the similarity of proteins. Suggestions for further improvements in functional similarity analysis are also provided.

  8. Monoaminylation of Fibrinogen and Glia-Derived Proteins: Indication for Similar Mechanisms in Posttranslational Protein Modification in Blood and Brain.

    Science.gov (United States)

    Hummerich, René; Costina, Victor; Findeisen, Peter; Schloss, Patrick

    2015-07-15

    Distinct proteins have been demonstrated to be posttranslationally modified by covalent transamidation of serotonin (5-hydropxytryptamin) to glutamine residues of the target proteins. This process is mediated by transglutaminase (TGase) and has been termed "serotonylation." It has also been shown that other biogenic amines, including the neurotransmitters dopamine and norepinephrine, can substitute for serotonin, implying a more general mechanism of "monoaminylation" for this kind of protein modification. Here we transamidated the autofluorescent monoamine monodansylcadaverine (MDC) to purified plasma fibrinogen and to proteins from a primary glia cell culture. Electrophoretic separation of MDC-conjugated proteins followed by mass spectrometry identified three fibrinogen subunits (Aα, Bβ, γ), a homomeric Aα2 dimer, and adducts of >250 kDa molecular weight, as well as several glial proteins. TGase-mediated MDC incorporation was strongly reduced by serotonin, underlining the general mechanism of monoaminylation.

  9. Functionally Similar WRKY Proteins Regulate Vacuolar Acidification in Petunia and Hair Development in Arabidopsis

    NARCIS (Netherlands)

    Verweij, W.; Spelt, C.E.; Bliek, M.; de Vries, M.; Wit, N.; Faraco, M.; Koes, R.; Quattrocchio, F.

    2016-01-01

    The WD40 proteins ANTHOCYANIN11 (AN11) from petunia (Petunia hybrida) and TRANSPARENT TESTA GLABRA1 (TTG1) fromArabidopsis thalianaand associated basic helix-loop-helix (bHLH) and MYB transcription factors activate a variety of differentiation processes. In petunia petals, AN11 and the bHLH protein

  10. Chlamydia trachomatis contains a protein similar to the Legionella pneumophila mip gene product

    DEFF Research Database (Denmark)

    Lundemose, AG; Birkelund, Svend; Fey, SJ

    1991-01-01

    A 27kDa Chlamydia trachomatis L2 protein was characterized by the use of monoclonal antibodies and by two-dimensional gel electrophoresis. The protein was shown to be located in the membrane of reticulate bodies as well as elementary bodies. Its synthesis could be detected from 10 hours post-infe...... potentiator (mip) gene of Legionella pneumophila....

  11. Ascorbic acid glycation of lens proteins produces UVA sensitizers similar to those in human lens

    International Nuclear Information System (INIS)

    Ortwerth, B.J.; Linetsky, Mikhail; Olesen, P.R.

    1995-01-01

    Soluble calf lens proteins were extensively glycated during a 4 week incubation with ascorbic acid in the presence of oxygen. Amino acids analysis of the dialyzed proteins removed at weekly intervals showed an increasing loss of lysine, arginine and histidine, consistent with the extensive protein cross-linking observed. Irradiation of the dialyzed samples with UVA light (1.0 kJ/cm 2 total illumination through a 338 nm cutoff filter) caused an increasing loss of tryptophan, an additional loss of histidine and the production of micromolar concentrations of hydrogen peroxide. No alteration in amino acid content and no photolytic effects were seen in proteins incubated without ascorbic acid in proteins incubated with glucose for 4 weeks. The rate of hydrogen peroxide formation was linear with each glycated sample with a maximum production of 25 nmol/mg protein illuminated. The possibility that the sensitizer activity was due to an ascorbate-induced oxidation of tryptophan was eliminated by the presence of a heavy metal ion chelator during the incubation and by showing equivalent effects with ascorbate-incubated ribonuclease A, which is devoid of tryptophan. The ascorbate-incubated samples displayed increasing absorbance at wavelengths above 300 nm and increasing fluorescence (340/430) as glycation proceeded. The spectra of the 4 week glycated proteins were identical to those obtained with a solubilized water-insoluble fraction from human lens, which is known to have UVA sensitizer activity. (Author)

  12. A protein-tyrosine phosphatase with sequence similarity to the SH2 domain of the protein-tyrosine kinases.

    Science.gov (United States)

    Shen, S H; Bastien, L; Posner, B I; Chrétien, P

    1991-08-22

    The phosphorylation of proteins at tyrosine residues is critical in cellular signal transduction, neoplastic transformation and control of the mitotic cycle. These mechanisms are regulated by the activities of both protein-tyrosine kinases (PTKs) and protein-tyrosine phosphatases (PTPases). As in the PTKs, there are two classes of PTPases: membrane associated, receptor-like enzymes and soluble proteins. Here we report the isolation of a complementary DNA clone encoding a new form of soluble PTPase, PTP1C. The enzyme possesses a large noncatalytic region at the N terminus which unexpectedly contains two adjacent copies of the Src homology region 2 (the SH2 domain) found in various nonreceptor PTKs and other cytoplasmic signalling proteins. As with other SH2 sequences, the SH2 domains of PTP1C formed high-affinity complexes with the activated epidermal growth factor receptor and other phosphotyrosine-containing proteins. These results suggest that the SH2 regions in PTP1C may interact with other cellular components to modulate its own phosphatase activity against interacting substrates. PTPase activity may thus directly link growth factor receptors and other signalling proteins through protein-tyrosine phosphorylation.

  13. Structural and Sequence Similarities of Hydra Xeroderma Pigmentosum A Protein to Human Homolog Suggest Early Evolution and Conservation

    Directory of Open Access Journals (Sweden)

    Apurva Barve

    2013-01-01

    Full Text Available Xeroderma pigmentosum group A (XPA is a protein that binds to damaged DNA, verifies presence of a lesion, and recruits other proteins of the nucleotide excision repair (NER pathway to the site. Though its homologs from yeast, Drosophila, humans, and so forth are well studied, XPA has not so far been reported from protozoa and lower animal phyla. Hydra is a fresh-water cnidarian with a remarkable capacity for regeneration and apparent lack of organismal ageing. Cnidarians are among the first metazoa with a defined body axis, tissue grade organisation, and nervous system. We report here for the first time presence of XPA gene in hydra. Putative protein sequence of hydra XPA contains nuclear localization signal and bears the zinc-finger motif. It contains two conserved Pfam domains and various characterized features of XPA proteins like regions for binding to excision repair cross-complementing protein-1 (ERCC1 and replication protein A 70 kDa subunit (RPA70 proteins. Hydra XPA shows a high degree of similarity with vertebrate homologs and clusters with deuterostomes in phylogenetic analysis. Homology modelling corroborates the very close similarity between hydra and human XPA. The protein thus most likely functions in hydra in the same manner as in other animals, indicating that it arose early in evolution and has been conserved across animal phyla.

  14. Age- and Hypertension-Associated Protein Aggregates in Mouse Heart Have Similar Proteomic Profiles.

    Science.gov (United States)

    Ayyadevara, Srinivas; Mercanti, Federico; Wang, Xianwei; Mackintosh, Samuel G; Tackett, Alan J; Prayaga, Sastry V S; Romeo, Francesco; Shmookler Reis, Robert J; Mehta, Jawahar L

    2016-05-01

    Neurodegenerative diseases are largely defined by protein aggregates in affected tissues. Aggregates contain some shared components as well as proteins thought to be specific for each disease. Aggregation has not previously been reported in the normal, aging heart or the hypertensive heart. Detergent-insoluble protein aggregates were isolated from mouse heart and characterized on 2-dimensional gels. Their levels increased markedly and significantly with aging and after sustained angiotensin II-induced hypertension. Of the aggregate components identified by high-resolution proteomics, half changed in abundance with age (392/787) or with sustained hypertension (459/824), whereas 30% (273/901) changed concordantly in both, each Phypertensive hearts, we posited that aging of fibroblasts may contribute to the aggregates observed in cardiac tissue. Indeed, as cardiac myofibroblasts "senesced" (approached their replicative limit) in vitro, they accrued aggregates with many of the same constituent proteins observed in vivo during natural aging or sustained hypertension. In summary, we have shown for the first time that compact (detergent-insoluble) protein aggregates accumulate during natural aging, chronic hypertension, and in vitro myofibroblast senescence, sharing many common proteins. Thus, aggregates that arise from disparate causes (aging, hypertension, and replicative senescence) may have common underlying mechanisms of accrual. © 2016 American Heart Association, Inc.

  15. The dynamics of single protein molecules is non-equilibrium and self-similar over thirteen decades in time

    Science.gov (United States)

    Hu, Xiaohu; Hong, Liang; Dean Smith, Micholas; Neusius, Thomas; Cheng, Xiaolin; Smith, Jeremy C.

    2016-02-01

    Internal motions of proteins are essential to their function. The time dependence of protein structural fluctuations is highly complex, manifesting subdiffusive, non-exponential behaviour with effective relaxation times existing over many decades in time, from ps up to ~102 s (refs ,,,). Here, using molecular dynamics simulations, we show that, on timescales from 10-12 to 10-5 s, motions in single proteins are self-similar, non-equilibrium and exhibit ageing. The characteristic relaxation time for a distance fluctuation, such as inter-domain motion, is observation-time-dependent, increasing in a simple, power-law fashion, arising from the fractal nature of the topology and geometry of the energy landscape explored. Diffusion over the energy landscape follows a non-ergodic continuous time random walk. Comparison with single-molecule experiments suggests that the non-equilibrium self-similar dynamical behaviour persists up to timescales approaching the in vivo lifespan of individual protein molecules.

  16. Identification of similar regions of protein structures using integrated sequence and structure analysis tools

    Directory of Open Access Journals (Sweden)

    Heiland Randy

    2006-03-01

    Full Text Available Abstract Background Understanding protein function from its structure is a challenging problem. Sequence based approaches for finding homology have broad use for annotation of both structure and function. 3D structural information of protein domains and their interactions provide a complementary view to structure function relationships to sequence information. We have developed a web site http://www.sblest.org/ and an API of web services that enables users to submit protein structures and identify statistically significant neighbors and the underlying structural environments that make that match using a suite of sequence and structure analysis tools. To do this, we have integrated S-BLEST, PSI-BLAST and HMMer based superfamily predictions to give a unique integrated view to prediction of SCOP superfamilies, EC number, and GO term, as well as identification of the protein structural environments that are associated with that prediction. Additionally, we have extended UCSF Chimera and PyMOL to support our web services, so that users can characterize their own proteins of interest. Results Users are able to submit their own queries or use a structure already in the PDB. Currently the databases that a user can query include the popular structural datasets ASTRAL 40 v1.69, ASTRAL 95 v1.69, CLUSTER50, CLUSTER70 and CLUSTER90 and PDBSELECT25. The results can be downloaded directly from the site and include function prediction, analysis of the most conserved environments and automated annotation of query proteins. These results reflect both the hits found with PSI-BLAST, HMMer and with S-BLEST. We have evaluated how well annotation transfer can be performed on SCOP ID's, Gene Ontology (GO ID's and EC Numbers. The method is very efficient and totally automated, generally taking around fifteen minutes for a 400 residue protein. Conclusion With structural genomics initiatives determining structures with little, if any, functional characterization

  17. PrPC has nucleic acid chaperoning properties similar to the nucleocapsid protein of HIV-1.

    Science.gov (United States)

    Derrington, Edmund; Gabus, Caroline; Leblanc, Pascal; Chnaidermann, Jonas; Grave, Linda; Dormont, Dominique; Swietnicki, Wieslaw; Morillas, Manuel; Marck, Daniel; Nandi, Pradip; Darlix, Jean-Luc

    2002-01-01

    The function of the cellular prion protein (PrPC) remains obscure. Studies suggest that PrPC functions in several processes including signal transduction and Cu2+ metabolism. PrPC has also been established to bind nucleic acids. Therefore we investigated the properties of PrPC as a putative nucleic acid chaperone. Surprisingly, PrPC possesses all the nucleic acid chaperoning properties previously specific to retroviral nucleocapsid proteins. PrPC appears to be a molecular mimic of NCP7, the nucleocapsid protein of HIV-1. Thus PrPC, like NCP7, chaperones the annealing of tRNA(Lys) to the HIV-1 primer binding site, the initial step of retrovirus replication. PrPC also chaperones the two DNA strand transfers required for production of a complete proviral DNA with LTRs. Concerning the functions of NCP7 during budding, PrPC also mimices NCP7 by dimerizing the HIV-1 genomic RNA. These data are unprecedented because, although many cellular proteins have been identified as nucleic acid chaperones, none have the properties of retroviral nucleocapsid proteins.

  18. Herbaspirillum seropedicae signal transduction protein PII is structurally similar to the enteric GlnK.

    Science.gov (United States)

    Machado Benelli, Elaine; Buck, Martin; Polikarpov, Igor; Maltempi de Souza, Emanuel; Cruz, Leonardo M; Pedrosa, Fábio O

    2002-07-01

    PII-like proteins are signal transduction proteins found in bacteria, archaea and eukaryotes. They mediate a variety of cellular responses. A second PII-like protein, called GlnK, has been found in several organisms. In the diazotroph Herbaspirillum seropedicae, PII protein is involved in sensing nitrogen levels and controlling nitrogen fixation genes. In this work, the crystal structure of the unliganded H. seropedicae PII was solved by X-ray diffraction. H. seropedicae PII has a Gly residue, Gly108 preceding Pro109 and the main-chain forms a beta turn. The glycine at position 108 allows a bend in the C-terminal main-chain, thereby modifying the surface of the cleft between monomers and potentially changing function. The structure suggests that the C-terminal region of PII proteins may be involved in specificity of function, and nonenteric diazotrophs are found to have the C-terminal consensus XGXDAX(107-112). We are also proposing binding sites for ATP and 2-oxoglutarate based on the structural alignment of PII with PII-ATP/GlnK-ATP, 5-carboxymethyl-2-hydroxymuconate isomerase and 4-oxalocrotonate tautomerase bound to the inhibitor 2-oxo-3-pentynoate.

  19. MAPA distinguishes genotype-specific variability of highly similar regulatory protein isoforms in potato tuber.

    Science.gov (United States)

    Hoehenwarter, Wolfgang; Larhlimi, Abdelhalim; Hummel, Jan; Egelhofer, Volker; Selbig, Joachim; van Dongen, Joost T; Wienkoop, Stefanie; Weckwerth, Wolfram

    2011-07-01

    Mass Accuracy Precursor Alignment is a fast and flexible method for comparative proteome analysis that allows the comparison of unprecedented numbers of shotgun proteomics analyses on a personal computer in a matter of hours. We compared 183 LC-MS analyses and more than 2 million MS/MS spectra and could define and separate the proteomic phenotypes of field grown tubers of 12 tetraploid cultivars of the crop plant Solanum tuberosum. Protein isoforms of patatin as well as other major gene families such as lipoxygenase and cysteine protease inhibitor that regulate tuber development were found to be the primary source of variability between the cultivars. This suggests that differentially expressed protein isoforms modulate genotype specific tuber development and the plant phenotype. We properly assigned the measured abundance of tryptic peptides to different protein isoforms that share extensive stretches of primary structure and thus inferred their abundance. Peptides unique to different protein isoforms were used to classify the remaining peptides assigned to the entire subset of isoforms based on a common abundance profile using multivariate statistical procedures. We identified nearly 4000 proteins which we used for quantitative functional annotation making this the most extensive study of the tuber proteome to date.

  20. Loss of Niemann-Pick C1 or C2 protein results in similar biochemical changes suggesting that these proteins function in a common lysosomal pathway.

    Directory of Open Access Journals (Sweden)

    Sayali S Dixit

    Full Text Available Niemann-Pick Type C (NPC disease is a lysosomal storage disorder characterized by accumulation of unesterified cholesterol and other lipids in the endolysosomal system. NPC disease results from a defect in either of two distinct cholesterol-binding proteins: a transmembrane protein, NPC1, and a small soluble protein, NPC2. NPC1 and NPC2 are thought to function closely in the export of lysosomal cholesterol with both proteins binding cholesterol in vitro but they may have unrelated lysosomal roles. To investigate this possibility, we compared biochemical consequences of the loss of either protein. Analyses of lysosome-enriched subcellular fractions from brain and liver revealed similar decreases in buoyant densities of lysosomes from NPC1 or NPC2 deficient mice compared to controls. The subcellular distribution of both proteins was similar and paralleled a lysosomal marker. In liver, absence of either NPC1 or NPC2 resulted in similar alterations in the carbohydrate processing of the lysosomal protease, tripeptidyl peptidase I. These results highlight biochemical alterations in the lysosomal system of the NPC-mutant mice that appear secondary to lipid storage. In addition, the similarity in biochemical phenotypes resulting from either NPC1 or NPC2 deficiency supports models in which the function of these two proteins within lysosomes are linked closely.

  1. Characterization of two bacterial hydroxynitrile lyases with high similarity to cupin superfamily proteins

    NARCIS (Netherlands)

    Hussain, Z.; Wiedner, R.; Steiner, K.; Hajek, T.; Avi, M.; Hecher, B.; Sessitsch, A.; Schwab, H.

    2012-01-01

    Hydroxynitrile lyases (HNLs) catalyze the cleavage of cyanohydrins. In the reverse reaction, they catalyze the formation of carbon-carbon bonds by enantioselective condensation of hydrocyanic acid with carbonyls. In this study, we describe two proteins from endophytic bacteria that display activity

  2. Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM

    Directory of Open Access Journals (Sweden)

    Yunyun Liang

    2015-01-01

    Full Text Available Prediction of protein structural classes for low-similarity sequences is useful for understanding fold patterns, regulation, functions, and interactions of proteins. It is well known that feature extraction is significant to prediction of protein structural class and it mainly uses protein primary sequence, predicted secondary structure sequence, and position-specific scoring matrix (PSSM. Currently, prediction solely based on the PSSM has played a key role in improving the prediction accuracy. In this paper, we propose a novel method called CSP-SegPseP-SegACP by fusing consensus sequence (CS, segmented PsePSSM, and segmented autocovariance transformation (ACT based on PSSM. Three widely used low-similarity datasets (1189, 25PDB, and 640 are adopted in this paper. Then a 700-dimensional (700D feature vector is constructed and the dimension is decreased to 224D by using principal component analysis (PCA. To verify the performance of our method, rigorous jackknife cross-validation tests are performed on 1189, 25PDB, and 640 datasets. Comparison of our results with the existing PSSM-based methods demonstrates that our method achieves the favorable and competitive performance. This will offer an important complementary to other PSSM-based methods for prediction of protein structural classes for low-similarity sequences.

  3. Solenopsis invicta virus 3: mapping of structural proteins, ribosomal frameshifting, and similarities to Acyrthosiphon pisum virus and Kelp fly virus.

    Directory of Open Access Journals (Sweden)

    Steven M Valles

    Full Text Available Solenopsis invicta virus 3 (SINV-3 is a positive-sense single-stranded RNA virus that infects the red imported fire ant, Solenopsis invicta. We show that the second open reading frame (ORF of the dicistronic genome is expressed via a frameshifting mechanism and that the sequences encoding the structural proteins map to both ORF2 and the 3' end of ORF1, downstream of the sequence that encodes the RNA-dependent RNA polymerase. The genome organization and structural protein expression strategy resemble those of Acyrthosiphon pisum virus (APV, an aphid virus. The capsid protein that is encoded by the 3' end of ORF1 in SINV-3 and APV is predicted to have a jelly-roll fold similar to the capsid proteins of picornaviruses and caliciviruses. The capsid-extension protein that is produced by frameshifting, includes the jelly-roll fold domain encoded by ORF1 as its N-terminus, while the C-terminus encoded by the 5' half of ORF2 has no clear homology with other viral structural proteins. A third protein, encoded by the 3' half of ORF2, is associated with purified virions at sub-stoichiometric ratios. Although the structural proteins can be translated from the genomic RNA, we show that SINV-3 also produces a subgenomic RNA encoding the structural proteins. Circumstantial evidence suggests that APV may also produce such a subgenomic RNA. Both SINV-3 and APV are unclassified picorna-like viruses distantly related to members of the order Picornavirales and the family Caliciviridae. Within this grouping, features of the genome organization and capsid domain structure of SINV-3 and APV appear more similar to caliciviruses, perhaps suggesting the basis for a "Calicivirales" order.

  4. Pre- versus post-exercise protein intake has similar effects on muscular adaptations

    OpenAIRE

    Schoenfeld, Brad Jon; Aragon, Alan; Wilborn, Colin; Urbina, Stacie L.; Hayward, Sara E.; Krieger, James

    2017-01-01

    The purpose of this study was to test the anabolic window theory by investigating muscle strength, hypertrophy, and body composition changes in response to an equal dose of protein consumed either immediately pre- versus post-resistance training (RT) in trained men. Subjects were 21 resistance-trained men (>1 year RT experience) recruited from a university population. After baseline testing, participants were randomly assigned to 1 of 2 experimental groups: a group that consumed a supplement ...

  5. Towards predictive resistance models for agrochemicals by combining chemical and protein similarity via proteochemometric modelling.

    Science.gov (United States)

    van Westen, Gerard J P; Bender, Andreas; Overington, John P

    2014-10-01

    Resistance to pesticides is an increasing problem in agriculture. Despite practices such as phased use and cycling of 'orthogonally resistant' agents, resistance remains a major risk to national and global food security. To combat this problem, there is a need for both new approaches for pesticide design, as well as for novel chemical entities themselves. As summarized in this opinion article, a technique termed 'proteochemometric modelling' (PCM), from the field of chemoinformatics, could aid in the quantification and prediction of resistance that acts via point mutations in the target proteins of an agent. The technique combines information from both the chemical and biological domain to generate bioactivity models across large numbers of ligands as well as protein targets. PCM has previously been validated in prospective, experimental work in the medicinal chemistry area, and it draws on the growing amount of bioactivity information available in the public domain. Here, two potential applications of proteochemometric modelling to agrochemical data are described, based on previously published examples from the medicinal chemistry literature.

  6. Recognition of HIV-1 peptides by host CTL is related to HIV-1 similarity to human proteins.

    Directory of Open Access Journals (Sweden)

    Morgane Rolland

    Full Text Available BACKGROUND: While human immunodeficiency virus type 1 (HIV-1-specific cytotoxic T lymphocytes preferentially target specific regions of the viral proteome, HIV-1 features that contribute to immune recognition are not well understood. One hypothesis is that similarities between HIV and human proteins influence the host immune response, i.e., resemblance between viral and host peptides could preclude reactivity against certain HIV epitopes. METHODOLOGY/PRINCIPAL FINDINGS: We analyzed the extent of similarity between HIV-1 and the human proteome. Proteins from the HIV-1 B consensus sequence from 2001 were dissected into overlapping k-mers, which were then probed against a non-redundant database of the human proteome in order to identify segments of high similarity. We tested the relationship between HIV-1 similarity to host encoded peptides and immune recognition in HIV-infected individuals, and found that HIV immunogenicity could be partially modulated by the sequence similarity to the host proteome. ELISpot responses to peptides spanning the entire viral proteome evaluated in 314 individuals showed a trend indicating an inverse relationship between the similarity to the host proteome and the frequency of recognition. In addition, analysis of responses by a group of 30 HIV-infected individuals against 944 overlapping peptides representing a broad range of individual HIV-1B Nef variants, affirmed that the degree of similarity to the host was significantly lower for peptides with reactive epitopes than for those that were not recognized. CONCLUSIONS/SIGNIFICANCE: Our results suggest that antigenic motifs that are scarcely represented in human proteins might represent more immunogenic CTL targets not selected against in the host. This observation could provide guidance in the design of more effective HIV immunogens, as sequences devoid of host-like features might afford superior immune reactivity.

  7. ProBiS-2012: web server and web services for detection of structurally similar binding sites in proteins.

    Science.gov (United States)

    Konc, Janez; Janezic, Dusanka

    2012-07-01

    The ProBiS web server is a web server for detection of structurally similar binding sites in the PDB and for local pairwise alignment of protein structures. In this article, we present a new version of the ProBiS web server that is 10 times faster than earlier versions, due to the efficient parallelization of the ProBiS algorithm, which now allows significantly faster comparison of a protein query against the PDB and reduces the calculation time for scanning the entire PDB from hours to minutes. It also features new web services, and an improved user interface. In addition, the new web server is united with the ProBiS-Database and thus provides instant access to pre-calculated protein similarity profiles for over 29 000 non-redundant protein structures. The ProBiS web server is particularly adept at detection of secondary binding sites in proteins. It is freely available at http://probis.cmm.ki.si/old-version, and the new ProBiS web server is at http://probis.cmm.ki.si.

  8. CMASA: an accurate algorithm for detecting local protein structural similarity and its application to enzyme catalytic site annotation

    Directory of Open Access Journals (Sweden)

    Li Gong-Hua

    2010-08-01

    Full Text Available Abstract Background The rapid development of structural genomics has resulted in many "unknown function" proteins being deposited in Protein Data Bank (PDB, thus, the functional prediction of these proteins has become a challenge for structural bioinformatics. Several sequence-based and structure-based methods have been developed to predict protein function, but these methods need to be improved further, such as, enhancing the accuracy, sensitivity, and the computational speed. Here, an accurate algorithm, the CMASA (Contact MAtrix based local Structural Alignment algorithm, has been developed to predict unknown functions of proteins based on the local protein structural similarity. This algorithm has been evaluated by building a test set including 164 enzyme families, and also been compared to other methods. Results The evaluation of CMASA shows that the CMASA is highly accurate (0.96, sensitive (0.86, and fast enough to be used in the large-scale functional annotation. Comparing to both sequence-based and global structure-based methods, not only the CMASA can find remote homologous proteins, but also can find the active site convergence. Comparing to other local structure comparison-based methods, the CMASA can obtain the better performance than both FFF (a method using geometry to predict protein function and SPASM (a local structure alignment method; and the CMASA is more sensitive than PINTS and is more accurate than JESS (both are local structure alignment methods. The CMASA was applied to annotate the enzyme catalytic sites of the non-redundant PDB, and at least 166 putative catalytic sites have been suggested, these sites can not be observed by the Catalytic Site Atlas (CSA. Conclusions The CMASA is an accurate algorithm for detecting local protein structural similarity, and it holds several advantages in predicting enzyme active sites. The CMASA can be used in large-scale enzyme active site annotation. The CMASA can be available by the

  9. Accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities.

    Directory of Open Access Journals (Sweden)

    Eric Venner

    Full Text Available High-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies which proteins have local evolutionary and structural features in common; next, these proteins are linked together into a proteomic network of ETA similarities; then, starting from proteins with known functions, competing functional labels diffuse link-by-link over the entire network. Every node is thus assigned a likelihood z-score for every function, and the most significant one at each node wins and defines its annotation. In high-throughput controls, this competitive diffusion process recovered enzyme activity annotations with 99% and 97% accuracy at half-coverage for the third and fourth Enzyme Commission (EC levels, respectively. This corresponds to false positive rates 4-fold lower than nearest-neighbor and 5-fold lower than sequence-based annotations. In practice, experimental validation of the predicted carboxylesterase activity in a protein from Staphylococcus aureus illustrated the effectiveness of this approach in the context of an increasingly drug-resistant microbe. This study further links molecular function to a small number of evolutionarily important residues recognizable by Evolutionary Tracing and it points to the specificity and sensitivity of functional annotation by competitive global network diffusion. A web server is at http://mammoth.bcm.tmc.edu/networks.

  10. Protein sequences and redox titrations indicate that the electron acceptors in reaction centers from heliobacteria are similar to Photosystem I

    Science.gov (United States)

    Trost, J. T.; Brune, D. C.; Blankenship, R. E.

    1992-01-01

    Photosynthetic reaction centers isolated from Heliobacillus mobilis exhibit a single major protein on SDS-PAGE of 47 000 Mr. Attempts to sequence the reaction center polypeptide indicated that the N-terminus is blocked. After enzymatic and chemical cleavage, four peptide fragments were sequenced from the Heliobacillus mobilis apoprotein. Only one of these sequences showed significant specific similarity to any of the protein and deduced protein sequences in the GenBank data base. This fragment is identical with 56% of the residues, including both cysteines, found in highly conserved region that is proposed to bind iron-sulfur center Fx in the Photosystem I reaction center peptide that is the psaB gene product. The similarity to the psaA gene product in this region is 48%. Redox titrations of laser-flash-induced photobleaching with millisecond decay kinetics on isolated reaction centers from Heliobacterium gestii indicate a midpoint potential of -414 mV with n = 2 titration behavior. In membranes, the behavior is intermediate between n = 1 and n = 2, and the apparent midpoint potential is -444 mV. This is compared to the behavior in Photosystem I, where the intermediate electron acceptor A1, thought to be a phylloquinone molecule, has been proposed to undergo a double reduction at low redox potentials in the presence of viologen redox mediators. These results strongly suggest that the acceptor side electron transfer system in reaction centers from heliobacteria is indeed analogous to that found in Photosystem I. The sequence similarities indicate that the divergence of the heliobacteria from the Photosystem I line occurred before the gene duplication and subsequent divergence that lead to the heterodimeric protein core of the Photosystem I reaction center.

  11. Adaptor proteins intersectin 1 and 2 bind similar proline-rich ligands but are differentially recognized by SH2 domain-containing proteins.

    Directory of Open Access Journals (Sweden)

    Olga Novokhatska

    Full Text Available BACKGROUND: Scaffolding proteins of the intersectin (ITSN family, ITSN1 and ITSN2, are crucial for the initiation stage of clathrin-mediated endocytosis. These proteins are closely related but have implications in distinct pathologies. To determine how these proteins could be separated in certain cell pathways we performed a comparative study of ITSNs. METHODOLOGY/PRINCIPAL FINDINGS: We have shown that endogenous ITSN1 and ITSN2 colocalize and form a complex in cells. A structural comparison of five SH3 domains, which mediated most ITSNs protein-protein interactions, demonstrated a similarity of their ligand-binding sites. We showed that the SH3 domains of ITSN2 bound well-established interactors of ITSN1 as well as newly identified ITSNs protein partners. A search for a novel interacting interface revealed multiple tyrosines that could be phosphorylated in ITSN2. Phosphorylation of ITSN2 isoforms but not ITSN1 short isoform was observed in various cell lines. EGF stimulation of HeLa cells enhanced tyrosine phosphorylation of ITSN2 isoforms and enabled their recognition by the SH2 domains of the Fyn, Fgr and Abl1 kinases, the regulatory subunit of PI3K, the adaptor proteins Grb2 and Crk, and phospholipase C gamma. The SH2 domains mentioned were unable to bind ITSN1 short isoform. CONCLUSIONS/SIGNIFICANCE: Our results indicate that during evolution of vertebrates ITSN2 acquired a novel protein-interaction interface that allows its specific recognition by the SH2 domains of signaling proteins. We propose that these data could be important to understand the functional diversity of paralogous ITSN proteins.

  12. Biochemical characterization of an exonuclease from Arabidopsis thaliana reveals similarities to the DNA exonuclease of the human Werner syndrome protein.

    Science.gov (United States)

    Plchova, Helena; Hartung, Frank; Puchta, Holger

    2003-11-07

    The human Werner syndrome protein (hWRN-p) possessing DNA helicase and exonuclease activities is essential for genome stability. Plants have no homologue of this bifunctional protein, but surprisingly the Arabidopsis genome contains a small open reading frame (ORF) (AtWRNexo) with homology to the exonuclease domain of hWRN-p. Expression of this ORF in Escherichia coli revealed an exonuclease activity for AtWRN-exo-p with similarities but also some significant differences to hWRN-p. The protein digests recessed strands of DNA duplexes in the 3' --> 5' direction but hardly single-stranded DNA or blunt-ended duplexes. In contrast to the Werner exonuclease, AtWRNexo-p is also able to digest 3'-protruding strands. DNA with recessed 3'-PO4 and 3'-OH termini is degraded to a similar extent. AtWRNexo-p hydrolyzes the 3'-recessed strand termini of duplexes containing mismatched bases. AtWRNexo-p needs the divalent cation Mg2+ for activity, which can be replaced by Mn2+. Apurinic sites, cholesterol adducts, and oxidative DNA damage (such as 8-oxoadenine and 8-oxoguanine) inhibit or block the enzyme. Other DNA modifications, including uracil, hypoxanthine and ethenoadenine, did not inhibit AtWRNexo-p. A mutation of a conserved residue within the exonuclease domain (E135A) completely abolished the exonucleolytic activity. Our results indicate that a type of WRN-like exonuclease activity seems to be a common feature of the DNA metabolism of animals and plants.

  13. Sterol regulatory element binding protein-1 (SREBP1) gene expression is similarly increased in polycystic ovary syndrome and endometrial cancer.

    Science.gov (United States)

    Shafiee, Mohamad N; Mongan, Nigel; Seedhouse, Claire; Chapman, Caroline; Deen, Suha; Abu, Jafaru; Atiomo, William

    2017-05-01

    Women with polycystic ovary syndrome have a three-fold higher risk of endometrial cancer. Insulin resistance and hyperlipidemia may be pertinent factors in the pathogenesis of both conditions. The aim of this study was to investigate endometrial sterol regulatory element binding protein-1 gene expression in polycystic ovary syndrome and endometrial cancer endometrium, and to correlate endometrial sterol regulatory element binding protein-1 gene expression with serum lipid profiles. A cross-sectional study was performed at Nottingham University Hospital, UK. A total of 102 women (polycystic ovary syndrome, endometrial cancer and controls; 34 participants in each group) were recruited. Clinical and biochemical assessments were performed before endometrial biopsies were obtained from all participants. Taqman real-time polymerase chain reaction for endometrial sterol regulatory element binding protein-1 gene and its systemic protein expression were analyzed. The body mass indices of women with polycystic ovary syndrome (29.28 ± 2.91 kg/m 2 ) and controls (28.58 ± 2.62 kg/m 2 ) were not significantly different. Women with endometrial cancer had a higher mean body mass index (32.22 ± 5.70 kg/m 2 ). Sterol regulatory element binding protein-1 gene expression was significantly increased in polycystic ovary syndrome and endometrial cancer endometrium compared with controls (p ovary syndrome, but this was not statistically significant. Similarly, statistically insignificant positive correlations were found between endometrial sterol regulatory element binding protein-1 gene expression and body mass index in endometrial cancer (r = 0.643, p = 0.06) and waist-hip ratio (r = 0.096, p = 0.073). Sterol regulatory element binding protein-1 gene expression was significantly positively correlated with triglyceride in both polycystic ovary syndrome and endometrial cancer (p = 0.028 and p = 0.027, respectively). Quantitative serum sterol regulatory element

  14. RNA-binding domain of the A protein component of the U1 small nuclear ribonucleoprotein analyzed by NMR spectroscopy is structurally similar to ribosomal proteins

    International Nuclear Information System (INIS)

    Hoffman, D.W.; Query, C.C.; Golden, B.L.; White, S.W.; Keene, J.D.

    1991-01-01

    An RNA recognition motif (RRM) of ∼80 amino acids constitutes the core of RNA-binding domains found in a large family of proteins involved in RNA processing. The U1 RNA-binding domain of the A protein component of the human U1 small nuclear ribonucleoprotein (RNP), which encompasses the RRM sequence, was analyzed by using NMR spectroscopy. The domain of the A protein is a highly stable monomer in solution consisting of four antiparallel β-strands and two α-helices. The highly conserved RNP1 and RNP2 consensus sequences, containing residues previously suggested to be involved in nucleic acid binding, are juxtaposed in adjacent β-strands. Conserved aromatic side chains that are critical for RNA binding are clustered on the surface to the molecule adjacent to a variable loop that influences recognition of specific RNA sequences. The secondary structure and topology of the RRM are similar to those of ribosomal proteins L12 and L30, suggesting a distant evolutionary relationship between these two types of RNA-associated proteins

  15. Similar rates of protein adaptation in Drosophila miranda and D. melanogaster, two species with different current effective population sizes

    Directory of Open Access Journals (Sweden)

    Bachtrog Doris

    2008-12-01

    Full Text Available Abstract Background Adaptive protein evolution is common in several Drosophila species investigated. Some studies point to very weak selection operating on amino-acid mutations, with average selection intensities on the order of Nes ~ 5 in D. melanogaster and D. simulans. Species with lower effective population sizes should undergo less adaptation since they generate fewer mutations and selection is ineffective on a greater proportion of beneficial mutations. Results Here I study patterns of polymorphism and divergence at 91 X-linked loci in D. miranda, a species with a roughly 5-fold smaller effective population size than D. melanogaster. Surprisingly, I find a similar fraction of amino-acid mutations being driven to fixation by positive selection in D. miranda and D. melanogaster. Genes with higher rates of amino-acid evolution show lower levels of neutral diversity, a pattern predicted by recurrent adaptive protein evolution. I fit a hitchhiking model to patterns of polymorphism in D. miranda and D. melanogaster and estimate an order of magnitude higher selection coefficients for beneficial mutations in D. miranda. Conclusion This analysis suggests that effective population size may not be a major determinant in rates of protein adaptation. Instead, adaptation may not be mutation-limited, or the distribution of fitness effects for beneficial mutations might differ vastly between different species or populations. Alternative explanation such as biases in estimating the fraction of beneficial mutations or slightly deleterious mutation models are also discussed.

  16. SVM-Prot 2016: A Web-Server for Machine Learning Prediction of Protein Functional Families from Sequence Irrespective of Similarity.

    Science.gov (United States)

    Li, Ying Hong; Xu, Jing Yu; Tao, Lin; Li, Xiao Feng; Li, Shuang; Zeng, Xian; Chen, Shang Ying; Zhang, Peng; Qin, Chu; Zhang, Cheng; Chen, Zhe; Zhu, Feng; Chen, Yu Zong

    2016-01-01

    Knowledge of protein function is important for biological, medical and therapeutic studies, but many proteins are still unknown in function. There is a need for more improved functional prediction methods. Our SVM-Prot web-server employed a machine learning method for predicting protein functional families from protein sequences irrespective of similarity, which complemented those similarity-based and other methods in predicting diverse classes of proteins including the distantly-related proteins and homologous proteins of different functions. Since its publication in 2003, we made major improvements to SVM-Prot with (1) expanded coverage from 54 to 192 functional families, (2) more diverse protein descriptors protein representation, (3) improved predictive performances due to the use of more enriched training datasets and more variety of protein descriptors, (4) newly integrated BLAST analysis option for assessing proteins in the SVM-Prot predicted functional families that were similar in sequence to a query protein, and (5) newly added batch submission option for supporting the classification of multiple proteins. Moreover, 2 more machine learning approaches, K nearest neighbor and probabilistic neural networks, were added for facilitating collective assessment of protein functions by multiple methods. SVM-Prot can be accessed at http://bidd2.nus.edu.sg/cgi-bin/svmprot/svmprot.cgi.

  17. Intracellular Localization and Cellular Factors Interaction of HTLV-1 and HTLV-2 Tax Proteins: Similarities and Functional Differences

    Directory of Open Access Journals (Sweden)

    Maria Grazia Romanelli

    2011-05-01

    Full Text Available Human T-lymphotropic viruses type 1 (HTLV-1 and type 2 (HTLV-2 present very similar genomic structures but HTLV-1 is more pathogenic than HTLV-2. Is this difference due to their transactivating Tax proteins, Tax-1 and Tax-2, which are responsible for viral and cellular gene activation? Do Tax-1 and Tax-2 differ in their cellular localization and in their interaction pattern with cellular factors? In this review, we summarize Tax-1 and Tax-2 structural and phenotypic properties, their interaction with factors involved in signal transduction and their localization-related behavior within the cell. Special attention will be given to the distinctions between Tax-1 and Tax-2 that likely play an important role in their transactivation activity.

  18. "Venom" of the slow loris: sequence similarity of prosimian skin gland protein and Fel d 1 cat allergen.

    Science.gov (United States)

    Krane, Sonja; Itagaki, Yasuhiro; Nakanishi, Koji; Weldon, Paul J

    2003-02-01

    Bites inflicted on humans by the slow loris (Nycticebus coucang), a prosimian from Indonesia, are painful and elicit anaphylaxis. Toxins from N. coucang are thought to originate in the brachial organ, a naked, gland-laden area of skin situated on the flexor surface of the arm that is licked during grooming. We isolated a major component of the brachial organ secretions from N. coucang, an approximately 18 kDa protein composed of two 70-90 amino-acid chains linked by one or more disulfide bonds. The N-termini of these peptide chains exhibit nearly 70% sequence similarity (37% identity, chain 1; 54% identity, chain 2) with the two chains of Fel d 1, the major allergen from the domestic cat (Felis catus). The extensive sequence similarity between the brachial organ component of N. coucang and the cat allergen suggests that they exhibit immunogenic cross-reactivity. This work clarifies the chemical nature of the brachial organ exudate and suggests a possible mode of action underlying the noxious effects of slow loris bites.

  19. The G protein-coupled receptor subset of the dog genome is more similar to that in humans than rodents.

    Science.gov (United States)

    Haitina, Tatjana; Fredriksson, Robert; Foord, Steven M; Schiöth, Helgi B; Gloriam, David E

    2009-01-15

    The dog is an important model organism and it is considered to be closer to humans than rodents regarding metabolism and responses to drugs. The close relationship between humans and dogs over many centuries has lead to the diversity of the canine species, important genetic discoveries and an appreciation of the effects of old age in another species. The superfamily of G protein-coupled receptors (GPCRs) is one of the largest gene families in most mammals and the most exploited in terms of drug discovery. An accurate comparison of the GPCR repertoires in dog and human is valuable for the prediction of functional similarities and differences between the species. We searched the dog genome for non-olfactory GPCRs and obtained 353 full-length GPCR gene sequences, 18 incomplete sequences and 13 pseudogenes. We established relationships between human, dog, rat and mouse GPCRs resolving orthologous pairs and species-specific duplicates. We found that 12 dog GPCR genes are missing in humans while 24 human GPCR genes are not part of the dog GPCR repertoire. There is a higher number of orthologous pairs between dog and human that are conserved as compared with either mouse or rat. In almost all cases the differences observed between the dog and human genomes coincide with other variations in the rodent species. Several GPCR gene expansions characteristic for rodents are not found in dog. The repertoire of dog non-olfactory GPCRs is more similar to the repertoire in humans as compared with the one in rodents. The comparison of the dog, human and rodent repertoires revealed several examples of species-specific gene duplications and deletions. This information is useful in the selection of model organisms for pharmacological experiments.

  20. The G protein-coupled receptor subset of the dog genome is more similar to that in humans than rodents

    Directory of Open Access Journals (Sweden)

    Schiöth Helgi B

    2009-01-01

    Full Text Available Abstract Background The dog is an important model organism and it is considered to be closer to humans than rodents regarding metabolism and responses to drugs. The close relationship between humans and dogs over many centuries has lead to the diversity of the canine species, important genetic discoveries and an appreciation of the effects of old age in another species. The superfamily of G protein-coupled receptors (GPCRs is one of the largest gene families in most mammals and the most exploited in terms of drug discovery. An accurate comparison of the GPCR repertoires in dog and human is valuable for the prediction of functional similarities and differences between the species. Results We searched the dog genome for non-olfactory GPCRs and obtained 353 full-length GPCR gene sequences, 18 incomplete sequences and 13 pseudogenes. We established relationships between human, dog, rat and mouse GPCRs resolving orthologous pairs and species-specific duplicates. We found that 12 dog GPCR genes are missing in humans while 24 human GPCR genes are not part of the dog GPCR repertoire. There is a higher number of orthologous pairs between dog and human that are conserved as compared with either mouse or rat. In almost all cases the differences observed between the dog and human genomes coincide with other variations in the rodent species. Several GPCR gene expansions characteristic for rodents are not found in dog. Conclusion The repertoire of dog non-olfactory GPCRs is more similar to the repertoire in humans as compared with the one in rodents. The comparison of the dog, human and rodent repertoires revealed several examples of species-specific gene duplications and deletions. This information is useful in the selection of model organisms for pharmacological experiments.

  1. Protein structure similarity clustering (PSSC) and natural product structure as inspiration sources for drug development and chemical genomics

    NARCIS (Netherlands)

    Dekker, Frank J; Koch, Marcus A; Waldmann, Herbert; Dekker, Frans

    Finding small molecules that modulate protein function is of primary importance in drug development and in the emerging field of chemical genomics. To facilitate the identification of such molecules, we developed a novel strategy making use of structural conservatism found in protein domain

  2. UPF201 Archaeal Specific Family Members Reveals Structural Similarity to RNA-Binding Proteins but Low Likelihood for RNA-Binding Function

    Energy Technology Data Exchange (ETDEWEB)

    Rao, K.N.; Swaminathan, S.; Burley, S. K.

    2008-12-11

    We have determined X-ray crystal structures of four members of an archaeal specific family of proteins of unknown function (UPF0201; Pfam classification: DUF54) to advance our understanding of the genetic repertoire of archaea. Despite low pairwise amino acid sequence identities (10-40%) and the absence of conserved sequence motifs, the three-dimensional structures of these proteins are remarkably similar to one another. Their common polypeptide chain fold, encompassing a five-stranded antiparallel {beta}-sheet and five {alpha}-helices, proved to be quite unexpectedly similar to that of the RRM-type RNA-binding domain of the ribosomal L5 protein, which is responsible for binding the 5S- rRNA. Structure-based sequence alignments enabled construction of a phylogenetic tree relating UPF0201 family members to L5 ribosomal proteins and other structurally similar RNA binding proteins, thereby expanding our understanding of the evolutionary purview of the RRM superfamily. Analyses of the surfaces of these newly determined UPF0201 structures suggest that they probably do not function as RNA binding proteins, and that this domain specific family of proteins has acquired a novel function in archaebacteria, which awaits experimental elucidation.

  3. Different methods of membrane domains isolation result in similar 2-D distribution patterns of membrane domain proteins

    Czech Academy of Sciences Publication Activity Database

    Matoušek, Petr; Hodný, Zdeněk; Švandová, I.; Svoboda, Petr

    2003-01-01

    Roč. 81, č. 6 (2003), s. 365-372 ISSN 0829-8211 R&D Projects: GA MŠk LN00A026 Grant - others:Wellcome Trust(GB) xx Institutional research plan: CEZ:AV0Z5011922; CEZ:MSM 113100003; CEZ:AV0Z5039906 Keywords : membrane domain * G protein * two-dimensional electrophoresis * GPI-ancored proteins Subject RIV: CE - Biochemistry Impact factor: 2.456, year: 2003

  4. Similarities and differences in the nucleic acid chaperone activity of HIV-2 and HIV-1 nucleocapsid proteins in vitro.

    Science.gov (United States)

    Pachulska-Wieczorek, Katarzyna; Stefaniak, Agnieszka K; Purzycka, Katarzyna J

    2014-07-03

    The nucleocapsid domain of Gag and mature nucleocapsid protein (NC) act as nucleic acid chaperones and facilitate folding of nucleic acids at critical steps of retroviral replication cycle. The basic N-terminus of HIV-1 NC protein was shown most important for the chaperone activity. The HIV-2 NC (NCp8) and HIV-1 NC (NCp7) proteins possess two highly conserved zinc fingers, flanked by basic residues. However, the NCp8 N-terminal domain is significantly shorter and contains less positively charged residues. This study characterizes previously unknown, nucleic acid chaperone activity of the HIV-2 NC protein. We have comparatively investigated the in vitro nucleic acid chaperone properties of the HIV-2 and HIV-1 NC proteins. Using substrates derived from the HIV-1 and HIV-2 genomes, we determined the ability of both proteins to chaperone nucleic acid aggregation, annealing and strand exchange in duplex structures. Both NC proteins displayed comparable, high annealing activity of HIV-1 TAR DNA and its complementary nucleic acid. Interesting differences between the two NC proteins were discovered when longer HIV substrates, particularly those derived from the HIV-2 genome, were used in chaperone assays. In contrast to NCp7, NCp8 weakly facilitates annealing of HIV-2 TAR RNA to its complementary TAR (-) DNA. NCp8 is also unable to efficiently stimulate tRNALys3 annealing to its respective HIV-2 PBS motif. Using truncated NCp8 peptide, we demonstrated that despite the fact that the N-terminus of NCp8 differs from that of NCp7, this domain is essential for NCp8 activity. Our data demonstrate that the HIV-2 NC protein displays reduced nucleic acid chaperone activity compared to that of HIV-1 NC. We found that NCp8 activity is limited by substrate length and stability to a greater degree than that of NCp7. This is especially interesting in light of the fact that the HIV-2 5'UTR is more structured than that of HIV-1. The reduced chaperone activity observed with NCp8 may

  5. Vaccinia protein F12 has structural similarity to kinesin light chain and contains a motor binding motif required for virion export.

    Directory of Open Access Journals (Sweden)

    Gareth W Morgan

    2010-02-01

    Full Text Available Vaccinia virus (VACV uses microtubules for export of virions to the cell surface and this process requires the viral protein F12. Here we show that F12 has structural similarity to kinesin light chain (KLC, a subunit of the kinesin-1 motor that binds cargo. F12 and KLC share similar size, pI, hydropathy and cargo-binding tetratricopeptide repeats (TPRs. Moreover, molecular modeling of F12 TPRs upon the crystal structure of KLC2 TPRs showed a striking conservation of structure. We also identified multiple TPRs in VACV proteins E2 and A36. Data presented demonstrate that F12 is critical for recruitment of kinesin-1 to virions and that a conserved tryptophan and aspartic acid (WD motif, which is conserved in the kinesin-1-binding sequence (KBS of the neuronal protein calsyntenin/alcadein and several other cellular kinesin-1 binding proteins, is essential for kinesin-1 recruitment and virion transport. In contrast, mutation of WD motifs in protein A36 revealed they were not required for kinesin-1 recruitment or IEV transport. This report of a viral KLC-like protein containing a KBS that is conserved in several cellular proteins advances our understanding of how VACV recruits the kinesin motor to virions, and exemplifies how viruses use molecular mimicry of cellular components to their advantage.

  6. HOMOLOGY BETWEEN SEGMENTS OF HUMAN HEMOSTATIC PROTEINS AND PROTEINS OF VIRUSES WHICH CAUSE ACUTE RESPIRATORY INFECTIONS OR DISEASES WITH SIMILAR SYMPTOMS

    Directory of Open Access Journals (Sweden)

    I. N. Zhilinskaya

    2017-01-01

    Full Text Available Objectives: To identify homologous segments of human hemostatic and viral proteins and to assess the role of human hemostatic proteins in viral replication. Materials and Methods: The following viruses were chosen for comparison: influenza B (B/Astrakhan/2/2017, coronaviruses (Hcov229E and SARS-Co, type 1 adenovirus (adenoid 71, measles (ICHINOSE-BA and rubella (Therien. The primary structures of viral proteins and 41 human hemostatic proteins were obtained from open–access www.ncbi.nlm.nih. gov and www.nextprot.org databases, respectively. Sequence homology was determined by comparing 12-amino-acid segments. Those sequences identical in ≥ 8 positions were considered homologous. Results: The analysis shows that viral proteins contain segments which mimic a number of human hemostatic proteins. Most of these segments, except those of adenovirus proteins, are homologous with coagulation factors. The increase in viral virulence, as in case of SARS-Co, correlates with an increased number of segments homologous with hemostatic proteins. Conclusion: Hemostasis plays an important role in viral replication and pathogenesis. The conclusion is consistent with the literature data about the relationship of hemostasis and inflammatory response to viral infections.

  7. The G protein-coupled receptor subset of the dog genome is more similar to that in humans than rodents

    OpenAIRE

    Schiöth Helgi B; Foord Steven M; Fredriksson Robert; Haitina Tatjana; Gloriam David E

    2009-01-01

    Abstract Background The dog is an important model organism and it is considered to be closer to humans than rodents regarding metabolism and responses to drugs. The close relationship between humans and dogs over many centuries has lead to the diversity of the canine species, important genetic discoveries and an appreciation of the effects of old age in another species. The superfamily of G protein-coupled receptors (GPCRs) is one of the largest gene families in most mammals and the most expl...

  8. Identifying Potential Protein Targets for Toluene Using a Molecular Similarity Search, in Silico Docking and in Vitro Validation

    Science.gov (United States)

    2015-01-01

    performed under standard conditions. Ana- lysis of purified hemoglobin using SDS and native polyacryl - amide gel electrophoresis (PAGE) indicated that the...search of T3DB. They represent several families of proteins (calcium-transporting ATPases, sodium/ potassium -transporting ATPase, cytochrome P450...REPORT DOCUMENTATION PAGE Form Approved OMB No. 0704-0188 Public reporting burden for this collection of information is estimated to average 1

  9. Duplicate Abalone Egg Coat Proteins Bind Sperm Lysin Similarly, but Evolve Oppositely, Consistent with Molecular Mimicry at Fertilization

    Science.gov (United States)

    Aagaard, Jan E.; Springer, Stevan A.; Soelberg, Scott D.; Swanson, Willie J.

    2013-01-01

    Sperm and egg proteins constitute a remarkable paradigm in evolutionary biology: despite their fundamental role in mediating fertilization (suggesting stasis), some of these molecules are among the most rapidly evolving ones known, and their divergence can lead to reproductive isolation. Because of strong selection to maintain function among interbreeding individuals, interacting fertilization proteins should also exhibit a strong signal of correlated divergence among closely related species. We use evidence of such molecular co-evolution to target biochemical studies of fertilization in North Pacific abalone (Haliotis spp.), a model system of reproductive protein evolution. We test the evolutionary rates (d N/d S) of abalone sperm lysin and two duplicated egg coat proteins (VERL and VEZP14), and find a signal of co-evolution specific to ZP-N, a putative sperm binding motif previously identified by homology modeling. Positively selected residues in VERL and VEZP14 occur on the same face of the structural model, suggesting a common mode of interaction with sperm lysin. We test this computational prediction biochemically, confirming that the ZP-N motif is sufficient to bind lysin and that the affinities of VERL and VEZP14 are comparable. However, we also find that on phylogenetic lineages where lysin and VERL evolve rapidly, VEZP14 evolves slowly, and vice versa. We describe a model of sexual conflict that can recreate this pattern of anti-correlated evolution by assuming that VEZP14 acts as a VERL mimic, reducing the intensity of sexual conflict and slowing the co-evolution of lysin and VERL. PMID:23408913

  10. Duplicate abalone egg coat proteins bind sperm lysin similarly, but evolve oppositely, consistent with molecular mimicry at fertilization.

    Directory of Open Access Journals (Sweden)

    Jan E Aagaard

    Full Text Available Sperm and egg proteins constitute a remarkable paradigm in evolutionary biology: despite their fundamental role in mediating fertilization (suggesting stasis, some of these molecules are among the most rapidly evolving ones known, and their divergence can lead to reproductive isolation. Because of strong selection to maintain function among interbreeding individuals, interacting fertilization proteins should also exhibit a strong signal of correlated divergence among closely related species. We use evidence of such molecular co-evolution to target biochemical studies of fertilization in North Pacific abalone (Haliotis spp., a model system of reproductive protein evolution. We test the evolutionary rates (d(N/d(S of abalone sperm lysin and two duplicated egg coat proteins (VERL and VEZP14, and find a signal of co-evolution specific to ZP-N, a putative sperm binding motif previously identified by homology modeling. Positively selected residues in VERL and VEZP14 occur on the same face of the structural model, suggesting a common mode of interaction with sperm lysin. We test this computational prediction biochemically, confirming that the ZP-N motif is sufficient to bind lysin and that the affinities of VERL and VEZP14 are comparable. However, we also find that on phylogenetic lineages where lysin and VERL evolve rapidly, VEZP14 evolves slowly, and vice versa. We describe a model of sexual conflict that can recreate this pattern of anti-correlated evolution by assuming that VEZP14 acts as a VERL mimic, reducing the intensity of sexual conflict and slowing the co-evolution of lysin and VERL.

  11. Native whey protein with high levels of leucine results in similar post-exercise muscular anabolic responses as regular whey protein: a randomized controlled trial.

    Science.gov (United States)

    Hamarsland, Håvard; Nordengen, Anne Lene; Nyvik Aas, Sigve; Holte, Kristin; Garthe, Ina; Paulsen, Gøran; Cotter, Matthew; Børsheim, Elisabet; Benestad, Haakon B; Raastad, Truls

    2017-01-01

    Protein intake is essential to maximally stimulate muscle protein synthesis, and the amino acid leucine seems to possess a superior effect on muscle protein synthesis compared to other amino acids. Native whey has higher leucine content and thus a potentially greater anabolic effect on muscle than regular whey (WPC-80). This study compared the acute anabolic effects of ingesting 2 × 20 g of native whey protein, WPC-80 or milk protein after a resistance exercise session. A total of 24 young resistance trained men and women took part in this double blind, randomized, partial crossover, controlled study. Participants received either WPC-80 and native whey ( n  = 10), in a crossover design, or milk ( n  = 12). Supplements were ingested immediately (20 g) and two hours after (20 g) a bout of heavy-load lower body resistance exercise. Blood samples and muscle biopsies were collected to measure plasma concentrations of amino acids by gas-chromatography mass spectrometry, muscle phosphorylation of p70S6K, 4E-BP1 and eEF-2 by immunoblotting, and mixed muscle protein synthesis by use of [ 2 H 5 ]phenylalanine-infusion, gas-chromatography mass spectrometry and isotope-ratio mass spectrometry. Being the main comparison, differences between native whey and WPC-80 were analysed by a one-way ANOVA and comparisons between the whey supplements and milk were analysed by a two-way ANOVA. Native whey increased blood leucine concentrations more than WPC-80 and milk ( P  whey ingestion induced a greater phosphorylation of p70S6K than milk 180 min after exercise ( P  = 0.03). Muscle protein synthesis rates increased 1-3 h hours after exercise with WPC-80 (0.119%), and 1-5 h after exercise with native whey (0.112%). Muscle protein synthesis rates were higher 1-5 h after exercise with native whey than with milk (0.112% vs. 0.064, P  = 0.023). Despite higher-magnitude increases in blood leucine concentrations with native whey, it was not superior to WPC-80

  12. An Alignment-Free Algorithm in Comparing the Similarity of Protein Sequences Based on Pseudo-Markov Transition Probabilities among Amino Acids.

    Science.gov (United States)

    Li, Yushuang; Song, Tian; Yang, Jiasheng; Zhang, Yi; Yang, Jialiang

    2016-01-01

    In this paper, we have proposed a novel alignment-free method for comparing the similarity of protein sequences. We first encode a protein sequence into a 440 dimensional feature vector consisting of a 400 dimensional Pseudo-Markov transition probability vector among the 20 amino acids, a 20 dimensional content ratio vector, and a 20 dimensional position ratio vector of the amino acids in the sequence. By evaluating the Euclidean distances among the representing vectors, we compare the similarity of protein sequences. We then apply this method into the ND5 dataset consisting of the ND5 protein sequences of 9 species, and the F10 and G11 datasets representing two of the xylanases containing glycoside hydrolase families, i.e., families 10 and 11. As a result, our method achieves a correlation coefficient of 0.962 with the canonical protein sequence aligner ClustalW in the ND5 dataset, much higher than those of other 5 popular alignment-free methods. In addition, we successfully separate the xylanases sequences in the F10 family and the G11 family and illustrate that the F10 family is more heat stable than the G11 family, consistent with a few previous studies. Moreover, we prove mathematically an identity equation involving the Pseudo-Markov transition probability vector and the amino acids content ratio vector.

  13. Obese Hypertensive Men Have Plasma Concentrations of C-Reactive Protein Similar to That of Obese Normotensive Men

    DEFF Research Database (Denmark)

    Asferg, Camilla L; Andersen, Ulrik B; Linneberg, Allan

    2014-01-01

    BACKGROUND: Low-grade chronic inflammation is a characteristic feature of obesity, the most important lifestyle risk factor for hypertension. Elevated plasma concentrations of the inflammatory biomarker C-reactive protein (CRP) are associated with an increased risk of hypertension, but elevated...... plasma CRP concentrations are also closely associated with obesity. It is uncertain whether CRP is directly involved in the pathogenesis of hypertension or is only a marker of other pathogenic processes closely related to obesity. METHODS: We studied 103 obese men (body mass index (BMI) ≥ 30.0 kg/m(2......)); 63 of these men had 24-hour ambulatory blood pressure (ABP) ≥ 130/80 mm Hg and comprised the obese hypertensive (OHT) group. The 40 remaining obese men had 24-hour ABP obese normotensive (ONT) group. Our control group comprised 27 lean normotensive (LNT) men. All...

  14. Protein-pacing caloric-restriction enhances body composition similarly in obese men and women during weight loss and sustains efficacy during long-term weight maintenance

    DEFF Research Database (Denmark)

    Arciero, Paul J; Edmonds, Rohan; He, Feng

    2016-01-01

    Short-Term protein-pacing (P; ~6 meals/day, >30% protein/day) and caloric restriction (CR, ~25% energy deficit) improves total (TBF), abdominal (ABF) and visceral (VAT) fat loss, energy expenditure, and biomarkers compared to heart healthy (HH) recommendations (3 meals/day, 15% protein/day) in ob......Short-Term protein-pacing (P; ~6 meals/day, >30% protein/day) and caloric restriction (CR, ~25% energy deficit) improves total (TBF), abdominal (ABF) and visceral (VAT) fat loss, energy expenditure, and biomarkers compared to heart healthy (HH) recommendations (3 meals/day, 15% protein....../day) in obese adults. Less is known whether obese men and women respond similarly to P-CR during weight loss (WL) and whether a modified P-CR (mP-CR) is more efficacious than a HH diet during long-term (52 week) weight maintenance (WM). The purposes of this study were to evaluate the efficacy of: (1) P......-CR on TBF, ABF, resting metabolic rate (RMR), and biomarkers between obese men and women during WL (weeks 0-12); and (2) mP-CR compared to a HH diet during WM (weeks 13-64). During WL, men (n = 21) and women (n = 19) were assessed for TBF, ABF, VAT, RMR, and biomarkers at weeks 0 (pre) and 12 (post). Men...

  15. An alignment-free method to find similarity among protein sequences via the general form of Chou's pseudo amino acid composition.

    Science.gov (United States)

    Gupta, M K; Niyogi, R; Misra, M

    2013-01-01

    In this paper, we propose a method to create the 60-dimensional feature vector for protein sequences via the general form of pseudo amino acid composition. The construction of the feature vector is based on the contents of amino acids, total distance of each amino acid from the first amino acid in the protein sequence and the distribution of 20 amino acids. The obtained cosine distance metric (also called the similarity matrix) is used to construct the phylogenetic tree by the neighbour joining method. In order to show the applicability of our approach, we tested it on three proteins: 1) ND5 protein sequences from nine species, 2) ND6 protein sequences from eight species, and 3) 50 coronavirus spike proteins. The results are in agreement with known history and the output from the multiple sequence alignment program ClustalW, which is widely used. We have also compared our phylogenetic results with six other recently proposed alignment-free methods. These comparisons show that our proposed method gives a more consistent biological relationship than the others. In addition, the time complexity is linear and space required is less as compared with other alignment-free methods that use graphical representation. It should be noted that the multiple sequence alignment method has exponential time complexity.

  16. The Impact of Protein Structure and Sequence Similarity on the Accuracy of Machine-Learning Scoring Functions for Binding Affinity Prediction.

    Science.gov (United States)

    Li, Hongjian; Peng, Jiangjun; Leung, Yee; Leung, Kwong-Sak; Wong, Man-Hon; Lu, Gang; Ballester, Pedro J

    2018-03-14

    It has recently been claimed that the outstanding performance of machine-learning scoring functions (SFs) is exclusively due to the presence of training complexes with highly similar proteins to those in the test set. Here, we revisit this question using 24 similarity-based training sets, a widely used test set, and four SFs. Three of these SFs employ machine learning instead of the classical linear regression approach of the fourth SF (X-Score which has the best test set performance out of 16 classical SFs). We have found that random forest (RF)-based RF-Score-v3 outperforms X-Score even when 68% of the most similar proteins are removed from the training set. In addition, unlike X-Score, RF-Score-v3 is able to keep learning with an increasing training set size, becoming substantially more predictive than X-Score when the full 1105 complexes are used for training. These results show that machine-learning SFs owe a substantial part of their performance to training on complexes with dissimilar proteins to those in the test set, against what has been previously concluded using the same data. Given that a growing amount of structural and interaction data will be available from academic and industrial sources, this performance gap between machine-learning SFs and classical SFs is expected to enlarge in the future.

  17. Enzymatic protein hydrolysates from high pressure-pretreated isolated pea proteins have better antioxidant properties than similar hydrolysates produced from heat pretreatment.

    Science.gov (United States)

    Girgih, Abraham T; Chao, Dongfang; Lin, Lin; He, Rong; Jung, Stephanie; Aluko, Rotimi E

    2015-12-01

    Isolated pea protein (IPP) dispersions (1%, w/v) were pretreated with high pressure (HP) of 200, 400, or 600 MPa for 5 min at 24 °C or high temperature (HT) for 30 min at 100 °C prior to hydrolysis with 1% (w/w) Alcalase. HP pretreatment of IPP at 400 and 600 MPa levels led to significantly (P40%) oxygen radical absorption capacity (ORAC) of hydrolysates. 2,2-Diphenyl-1-picrylhydrazyl, superoxide radical and hydroxyl radical scavenging activities of pea protein hydrolysates were also significantly (PProtein hydrolysates from HT IPP showed no ORAC, superoxide or hydroxyl scavenging activity but had significantly (Pprotein hydrolysates had weaker antioxidant properties than glutathione but overall, the HP pretreatment was superior to HT pretreatment in facilitating enzymatic release of antioxidant peptides from IPP. Copyright © 2015 Elsevier Ltd. All rights reserved.

  18. Possibly similar genetic basis of resistance to Bacillus thuringiensis Cry1Ab protein in 3 resistant colonies of the sugarcane borer collected from Louisiana, USA.

    Science.gov (United States)

    Yang, Fei; Chen, Mao; Gowda, Anilkumar; Kerns, David L; Huang, Fangneng

    2018-04-01

    The sugarcane borer, Diatraea saccharalis (F.), is a major maize borer pest and a target of transgenic maize expressing Bacillus thuringiensis (Bt) proteins in South America and the mid-southern region of the United States. Evolution of resistance in target pest populations is a great threat to the long-term efficacy of Bt crops. In this study, we compared the genetic basis of resistance to Cry1Ab protein in 3 resistant colonies of sugarcane borer established from field populations in Louisiana, USA. Responses of larvae to the Cry1Ab protein for the parental and 10 other cross colonies were assayed in a diet-incorporated bioassay. All 3 resistant colonies were highly resistant to the Cry1Ab protein with a resistance ratio of >555.6 fold. No maternal effect or sex linkage was evident for the resistance in the 3 colonies; and the resistance was functionally nonrecessive at the Cry1Ab concentrations of ≤ 3.16 μg/g, but it became recessive at ≥10 μg/g. In an interstrain complementation test for allelism, the F 1 progeny from crosses between any 2 of the 3 resistant colonies exhibited the similar resistance levels as their parental colonies, indicating that the 3 colonies most likely shared a locus of Cry1Ab resistance. Results generated from this study should provide useful information in developing effective strategies for managing Bt resistance in the insect. © 2016 Institute of Zoology, Chinese Academy of Sciences.

  19. An effective approach for annotation of protein families with low sequence similarity and conserved motifs: identifying GDSL hydrolases across the plant kingdom.

    Science.gov (United States)

    Vujaklija, Ivan; Bielen, Ana; Paradžik, Tina; Biđin, Siniša; Goldstein, Pavle; Vujaklija, Dušica

    2016-02-18

    The massive accumulation of protein sequences arising from the rapid development of high-throughput sequencing, coupled with automatic annotation, results in high levels of incorrect annotations. In this study, we describe an approach to decrease annotation errors of protein families characterized by low overall sequence similarity. The GDSL lipolytic family comprises proteins with multifunctional properties and high potential for pharmaceutical and industrial applications. The number of proteins assigned to this family has increased rapidly over the last few years. In particular, the natural abundance of GDSL enzymes reported recently in plants indicates that they could be a good source of novel GDSL enzymes. We noticed that a significant proportion of annotated sequences lack specific GDSL motif(s) or catalytic residue(s). Here, we applied motif-based sequence analyses to identify enzymes possessing conserved GDSL motifs in selected proteomes across the plant kingdom. Motif-based HMM scanning (Viterbi decoding-VD and posterior decoding-PD) and the here described PD/VD protocol were successfully applied on 12 selected plant proteomes to identify sequences with GDSL motifs. A significant number of identified GDSL sequences were novel. Moreover, our scanning approach successfully detected protein sequences lacking at least one of the essential motifs (171/820) annotated by Pfam profile search (PfamA) as GDSL. Based on these analyses we provide a curated list of GDSL enzymes from the selected plants. CLANS clustering and phylogenetic analysis helped us to gain a better insight into the evolutionary relationship of all identified GDSL sequences. Three novel GDSL subfamilies as well as unreported variations in GDSL motifs were discovered in this study. In addition, analyses of selected proteomes showed a remarkable expansion of GDSL enzymes in the lycophyte, Selaginella moellendorffii. Finally, we provide a general motif-HMM scanner which is easily accessible through

  20. In silico peptide-binding predictions of passerine MHC class I reveal similarities across distantly related species, suggesting convergence on the level of protein function.

    Science.gov (United States)

    Follin, Elna; Karlsson, Maria; Lundegaard, Claus; Nielsen, Morten; Wallin, Stefan; Paulsson, Kajsa; Westerdahl, Helena

    2013-04-01

    The major histocompatibility complex (MHC) genes are the most polymorphic genes found in the vertebrate genome, and they encode proteins that play an essential role in the adaptive immune response. Many songbirds (passerines) have been shown to have a large number of transcribed MHC class I genes compared to most mammals. To elucidate the reason for this large number of genes, we compared 14 MHC class I alleles (α1-α3 domains), from great reed warbler, house sparrow and tree sparrow, via phylogenetic analysis, homology modelling and in silico peptide-binding predictions to investigate their functional and genetic relationships. We found more pronounced clustering of the MHC class I allomorphs (allele specific proteins) in regards to their function (peptide-binding specificities) compared to their genetic relationships (amino acid sequences), indicating that the high number of alleles is of functional significance. The MHC class I allomorphs from house sparrow and tree sparrow, species that diverged 10 million years ago (MYA), had overlapping peptide-binding specificities, and these similarities across species were also confirmed in phylogenetic analyses based on amino acid sequences. Notably, there were also overlapping peptide-binding specificities in the allomorphs from house sparrow and great reed warbler, although these species diverged 30 MYA. This overlap was not found in a tree based on amino acid sequences. Our interpretation is that convergent evolution on the level of the protein function, possibly driven by selection from shared pathogens, has resulted in allomorphs with similar peptide-binding repertoires, although trans-species evolution in combination with gene conversion cannot be ruled out.

  1. Protein-Pacing Caloric-Restriction Enhances Body Composition Similarly in Obese Men and Women during Weight Loss and Sustains Efficacy during Long-Term Weight Maintenance.

    Science.gov (United States)

    Arciero, Paul J; Edmonds, Rohan; He, Feng; Ward, Emery; Gumpricht, Eric; Mohr, Alex; Ormsbee, Michael J; Astrup, Arne

    2016-07-30

    Short-Term protein-pacing (P; ~6 meals/day, >30% protein/day) and caloric restriction (CR, ~25% energy deficit) improves total (TBF), abdominal (ABF) and visceral (VAT) fat loss, energy expenditure, and biomarkers compared to heart healthy (HH) recommendations (3 meals/day, 15% protein/day) in obese adults. Less is known whether obese men and women respond similarly to P-CR during weight loss (WL) and whether a modified P-CR (mP-CR) is more efficacious than a HH diet during long-term (52 week) weight maintenance (WM). The purposes of this study were to evaluate the efficacy of: (1) P-CR on TBF, ABF, resting metabolic rate (RMR), and biomarkers between obese men and women during WL (weeks 0-12); and (2) mP-CR compared to a HH diet during WM (weeks 13-64). During WL, men (n = 21) and women (n = 19) were assessed for TBF, ABF, VAT, RMR, and biomarkers at weeks 0 (pre) and 12 (post). Men and women had similar reductions (p 50%) and increase in % lean body mass (9%). RMR (kcals/kg bodyweight) was unchanged and respiratory quotient decreased 9%. Twenty-four subjects (mP-CR, n = 10; HH, n = 14) completed WM. mP-CR regained significantly less body weight (6%), TBF (12%), and ABF (17%) compared to HH (p < 0.05). Our results demonstrate P-CR enhances weight loss, body composition and biomarkers, and maintains these changes for 52-weeks compared to a traditional HH diet.

  2. Species B adenovirus serotypes 3, 7, 11 and 35 share similar binding sites on the membrane cofactor protein CD46 receptor.

    Science.gov (United States)

    Fleischli, Christoph; Sirena, Dominique; Lesage, Guillaume; Havenga, Menzo J E; Cattaneo, Roberto; Greber, Urs F; Hemmi, Silvio

    2007-11-01

    We recently characterized the domains of the human cofactor protein CD46 involved in binding species B2 adenovirus (Ad) serotype 35. Here, the CD46 binding determinants are mapped for the species B1 Ad serotypes 3 and 7 and for the species B2 Ad11. Ad3, 7 and 11 bound and transduced CD46-positive rodent BHK cells at levels similar to Ad35. By using antibody-blocking experiments, hybrid CD46-CD4 receptor constructs and CD46 single point mutants, it is shown that Ad3, 7 and 11 share many of the Ad35-binding features on CD46. Both CD46 short consensus repeat domains SCR I and SCR II were necessary and sufficient for optimal binding and transgene expression, provided that they were positioned at an appropriate distance from the cell membrane. Similar to Ad35, most of the putative binding residues of Ad3, 7 and 11 were located on the same glycan-free, solvent-exposed face of the SCR I or SCR II domains, largely overlapping with the binding surface of the recently solved fiber knob Ad11-SCR I-II three-dimensional structure. Differences between species B1 and B2 Ads were documented with competition experiments based on anti-CD46 antibodies directed against epitopes flanking the putative Ad-binding sites, and with competition experiments based on soluble CD46 protein. It is concluded that the B1 and B2 species of Ad engage CD46 through similar binding surfaces.

  3. Evolutionary genomics of plant genes encoding N-terminal-TM-C2 domain proteins and the similar FAM62 genes and synaptotagmin genes of metazoans

    Directory of Open Access Journals (Sweden)

    Craxton Molly

    2007-07-01

    Full Text Available Abstract Background Synaptotagmin genes are found in animal genomes and are known to function in the nervous system. Genes with a similar domain architecture as well as sequence similarity to synaptotagmin C2 domains have also been found in plant genomes. The plant genes share an additional region of sequence similarity with a group of animal genes named FAM62. FAM62 genes also have a similar domain architecture. Little is known about the functions of the plant genes and animal FAM62 genes. Indeed, many members of the large and diverse Syt gene family await functional characterization. Understanding the evolutionary relationships among these genes will help to realize the full implications of functional studies and lead to improved genome annotation. Results I collected and compared plant Syt-like sequences from the primary nucleotide sequence databases at NCBI. The collection comprises six groups of plant genes conserved in embryophytes: NTMC2Type1 to NTMC2Type6. I collected and compared metazoan FAM62 sequences and identified some similar sequences from other eukaryotic lineages. I found evidence of RNA editing and alternative splicing. I compared the intron patterns of Syt genes. I also compared Rabphilin and Doc2 genes. Conclusion Genes encoding proteins with N-terminal-transmembrane-C2 domain architectures resembling synaptotagmins, are widespread in eukaryotes. A collection of these genes is presented here. The collection provides a resource for studies of intron evolution. I have classified the collection into homologous gene families according to distinctive patterns of sequence conservation and intron position. The evolutionary histories of these gene families are traceable through the appearance of family members in different eukaryotic lineages. Assuming an intron-rich eukaryotic ancestor, the conserved intron patterns distinctive of individual gene families, indicate independent origins of Syt, FAM62 and NTMC2 genes. Resemblances

  4. Carbohydrates Alone or Mixing With Beef or Whey Protein Promote Similar Training Outcomes in Resistance Training Males: A Double-Blind, Randomized Controlled Clinical Trial.

    Science.gov (United States)

    Naclerio, Fernando; Seijo-Bujia, Marco; Larumbe-Zabala, Eneko; Earnest, Conrad P

    2017-10-01

    Beef powder is a new high-quality protein source scarcely researched relative to exercise performance. The present study examined the impact of ingesting hydrolyzed beef protein, whey protein, and carbohydrate on strength performance (1RM), body composition (via plethysmography), limb circumferences and muscular thickness (via ultrasonography), following an 8-week resistance-training program. After being randomly assigned to one of the following groups: Beef, Whey, or Carbohydrate, twenty four recreationally physically active males (n = 8 per treatment) ingested 20 g of supplement, mixed with orange juice, once a day (immediately after workout or before breakfast). Post intervention changes were examined as percent change and 95% CIs. Beef (2.0%, CI, 0.2-2.38%) and Whey (1.4%, CI, 0.2-2.6%) but not Carbohydrate (0.0%, CI, -1.2-1.2%) increased fat-free mass. All groups increased vastus medialis thickness: Beef (11.1%, CI, 6.3-15.9%), Whey (12.1%, CI, 4.0, -20.2%), Carbohydrate (6.3%, CI, 1.9-10.6%). Beef (11.2%, CI, 5.9-16.5%) and Carbohydrate (4.5%, CI, 1.6-7.4%), but not Whey (1.1%, CI, -1.7-4.0%), increased biceps brachialis thickness, while only Beef increased arm (4.8%, CI, 2.3-7.3%) and thigh (11.2%, 95%CI 0.4-5.9%) circumferences. Although the three groups significantly improved 1RM Squat (Beef 21.6%, CI 5.5-37.7%; Whey 14.6%, CI, 5.9-23.3%; Carbohydrate 19.6%, CI, 2.2-37.1%), for the 1RM bench press the improvements were significant for Beef (15.8% CI 7.0-24.7%) and Whey (5.8%, CI, 1.7-9.8%) but not for carbohydrate (11.4%, CI, -0.9-23.6%). Protein-carbohydrate supplementation supports fat-free mass accretion and lower body hypertrophy. Hydrolyzed beef promotes upper body hypertrophy along with similar performance outcomes as observed when supplementing with whey isolate or maltodextrin.

  5. Molecular cloning and expression of a novel keratinocyte protein (psoriasis-associated fatty acid-binding protein [PA-FABP]) that is highly up-regulated in psoriatic skin and that shares similarity to fatty acid-binding proteins

    DEFF Research Database (Denmark)

    Madsen, Peder; Rasmussen, H H; Leffers, H

    1992-01-01

    termed PA-FABP (psoriasis-associated fatty acid-binding protein). The deduced sequence predicted a protein with molecular weight of 15,164 daltons and a calculated pI of 6.96, values that are close to those recorded in the keratinocyte 2D gel protein database. The protein comigrated with PA......-FABP as determined by 2D gel analysis of [35S]-methionine-labeled proteins expressed by transformed human amnion (AMA) cells transfected with clone 1592 using the vaccinia virus expression system and reacted with a rabbit polyclonal antibody raised against 2D gel purified PA-FABP. Structural analysis of the amino...... with epidermal growth factor (EGF), pituitary extract, and 10% fetal calf serum] revealed a strong up-regulation of PA-FABP, psoriasin, calgranulins A and B, and a few other proteins that are highly expressed in psoriatic skin. The levels of these proteins exceeded by far those observed in non-cultured normal...

  6. Human immunodeficiency virus type 1 subtype B ancestral envelope protein is functional and elicits neutralizing antibodies in rabbits similar to those elicited by a circulating subtype B envelope.

    Science.gov (United States)

    Doria-Rose, N A; Learn, G H; Rodrigo, A G; Nickle, D C; Li, F; Mahalanabis, M; Hensel, M T; McLaughlin, S; Edmonson, P F; Montefiori, D; Barnett, S W; Haigwood, N L; Mullins, J I

    2005-09-01

    Human immunodeficiency virus type 1 (HIV-1) is a difficult target for vaccine development, in part because of its ever-expanding genetic diversity and attendant capacity to escape immunologic recognition. Vaccine efficacy might be improved by maximizing immunogen antigenic similarity to viruses likely to be encountered by vaccinees. To this end, we designed a prototype HIV-1 envelope vaccine using a deduced ancestral state for the env gene. The ancestral state reconstruction method was shown to be >95% accurate by computer simulation and 99.8% accurate when estimating the known inoculum used in an experimental infection study in rhesus macaques. Furthermore, the deduced ancestor gene differed from the set of sequences used to derive the ancestor by an average of 12.3%, while these latter sequences were an average of 17.3% different from each other. A full-length ancestral subtype B HIV-1 env gene was constructed and shown to produce a glycoprotein of 160 kDa that bound and fused with cells expressing the HIV-1 coreceptor CCR5. This Env was also functional in a virus pseudotype assay. When either gp160- or gp140-expressing plasmids and recombinant gp120 were used to immunize rabbits in a DNA prime-protein boost regimen, the artificial gene induced immunoglobulin G antibodies capable of weakly neutralizing heterologous primary HIV-1 strains. The results were similar for rabbits immunized in parallel with a natural isolate, HIV-1 SF162. Further design efforts to better present conserved neutralization determinants are warranted.

  7. The crystal structure of Erwinia amylovora AmyR, a member of the YbjN protein family, shows similarity to type III secretion chaperones but suggests different cellular functions.

    Science.gov (United States)

    Bartho, Joseph D; Bellini, Dom; Wuerges, Jochen; Demitri, Nicola; Toccafondi, Mirco; Schmitt, Armin O; Zhao, Youfu; Walsh, Martin A; Benini, Stefano

    2017-01-01

    AmyR is a stress and virulence associated protein from the plant pathogenic Enterobacteriaceae species Erwinia amylovora, and is a functionally conserved ortholog of YbjN from Escherichia coli. The crystal structure of E. amylovora AmyR reveals a class I type III secretion chaperone-like fold, despite the lack of sequence similarity between these two classes of protein and lacking any evidence of a secretion-associated role. The results indicate that AmyR, and YbjN proteins in general, function through protein-protein interactions without any enzymatic action. The YbjN proteins of Enterobacteriaceae show remarkably low sequence similarity with other members of the YbjN protein family in Eubacteria, yet a high level of structural conservation is observed. Across the YbjN protein family sequence conservation is limited to residues stabilising the protein core and dimerization interface, while interacting regions are only conserved between closely related species. This study presents the first structure of a YbjN protein from Enterobacteriaceae, the most highly divergent and well-studied subgroup of YbjN proteins, and an in-depth sequence and structural analysis of this important but poorly understood protein family.

  8. Molecular cloning and expression of a novel keratinocyte protein (psoriasis-associated fatty acid-binding protein [PA-FABP]) that is highly up-regulated in psoriatic skin and that shares similarity to fatty acid-binding proteins

    DEFF Research Database (Denmark)

    Madsen, Peder; Rasmussen, H H; Leffers, H

    1992-01-01

    termed PA-FABP (psoriasis-associated fatty acid-binding protein). The deduced sequence predicted a protein with molecular weight of 15,164 daltons and a calculated pI of 6.96, values that are close to those recorded in the keratinocyte 2D gel protein database. The protein comigrated with PA-FABP...... as determined by 2D gel analysis of [35S]-methionine-labeled proteins expressed by transformed human amnion (AMA) cells transfected with clone 1592 using the vaccinia virus expression system and reacted with a rabbit polyclonal antibody raised against 2D gel purified PA-FABP. Structural analysis of the amino...... acid sequence revealed 48%, 52%, and 56% identity to known low-molecular-weight fatty acid-binding proteins belonging to the FABP family. Northern blot analysis showed that PA-FABP mRNA is indeed highly up-regulated in psoriatic keratinocytes. The transcript is present in human cell lines of epithelial...

  9. Protein from meat or vegetable sources in meals matched for fiber content has similar effects on subjective appetite sensations and energy intake - A randomized acute cross-over meal test study

    DEFF Research Database (Denmark)

    Nielsen, Lone Vestergaard; Kristensen, Marlene D; Klingenberg, Lars

    2018-01-01

    Higher-protein meals decrease hunger and increase satiety compared to lower-protein meals. However, no consensus exists about the different effects of animal and vegetable proteins on appetite. We investigated how a meal based on vegetable protein (fava beans/split peas) affected ad libitum energy......-balanced, fiber-matched meals based on vegetable protein (fava beans/split peas) or animal protein (veal/pork or eggs) had similar effects on ad libitum energy intake and appetite sensations....... intake and appetite sensations, compared to macronutrient-balanced, iso-caloric meals based on animal protein (veal/pork or eggs). Thirty-five healthy men were enrolled in this acute cross-over study. On each test day, participants were presented with one of four test meals (~3550 kilojoules (kJ) 19...

  10. Immunization with Brucella VirB proteins reduces organ colonization in mice through a Th1-type immune response and elicits a similar immune response in dogs.

    Science.gov (United States)

    Pollak, Cora N; Wanke, María Magdalena; Estein, Silvia M; Delpino, M Victoria; Monachesi, Norma E; Comercio, Elida A; Fossati, Carlos A; Baldi, Pablo C

    2015-03-01

    VirB proteins from Brucella spp. constitute the type IV secretion system, a key virulence factor mediating the intracellular survival of these bacteria. Here, we assessed whether a Th1-type immune response against VirB proteins may protect mice from Brucella infection and whether this response can be induced in the dog, a natural host for Brucella. Splenocytes from mice immunized with VirB7 or VirB9 responded to their respective antigens with significant and specific production of gamma interferon (IFN-γ), whereas interleukin-4 (IL-4) was not detected. Thirty days after an intraperitoneal challenge with live Brucella abortus, the spleen load of bacteria was almost 1 log lower in mice immunized with VirB proteins than in unvaccinated animals. As colonization reduction seemed to correlate with a Th1-type immune response against VirB proteins, we decided to assess whether such a response could be elicited in the dog. Peripheral blood mononuclear cells (PBMCs) from dogs immunized with VirB proteins (three subcutaneous doses in QuilA adjuvant) produced significantly higher levels of IFN-γ than cells from control animals upon in vitro stimulation with VirB proteins. A skin test to assess specific delayed-type hypersensitivity was positive in 4 out of 5 dogs immunized with either VirB7 or VirB9. As both proteins are predicted to locate in the outer membrane of Brucella organisms, the ability of anti-VirB antibodies to mediate complement-dependent bacteriolysis of B. canis was assessed in vitro. Sera from dogs immunized with either VirB7 or VirB9, but not from those receiving phosphate-buffered saline (PBS), produced significant bacteriolysis. These results suggest that VirB-specific responses that reduce organ colonization by Brucella in mice can be also elicited in dogs. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  11. Fusion protein gene nucleotide sequence similarities, shared antigenic sites and phylogenetic analysis suggest that phocid distemper virus 2 and canine distemper virus belong to the same virus entity.

    NARCIS (Netherlands)

    I.K.G. Visser (Ilona); R.W.J. van der Heijden (Roger); M.W.G. van de Bildt (Marco); M.J.H. Kenter (Marcel); C. Örvell; A.D.M.E. Osterhaus (Albert)

    1993-01-01

    textabstractNucleotide sequencing of the fusion protein (F) gene of phocid distemper virus-2 (PDV-2), recently isolated from Baikal seals (Phoca sibirica), revealed an open reading frame (nucleotides 84 to 2075) with two potential in-frame ATG translation initiation codons. We suggest that the

  12. In silico peptide-binding predictions of passerine MHC class I reveal similarities across distantly related species, suggesting convergence on the level of protein function

    DEFF Research Database (Denmark)

    Follin, Elna; Karlsson, Maria; Lundegaard, Claus

    2013-01-01

    The major histocompatibility complex (MHC) genes are the most polymorphic genes found in the vertebrate genome, and they encode proteins that play an essential role in the adaptive immune response. Many songbirds (passerines) have been shown to have a large number of transcribed MHC class I genes...

  13. The remarkable similarity between the acid-base properties of ISFETs and proteins and the consequences for the design of ISFET biosensors

    NARCIS (Netherlands)

    Bergveld, Piet; van Hal, R.E.G.; van Hal, R.E.G.; Eijkel, Jan C.T.

    1995-01-01

    Studying the acid-base properties of protein molecules led us to reconsider the operational mechanism of ISFETs. Based on the site-dissociation model, applied to the amphoteric metal oxide gate materials used in ISFETs, the sensitivity of ISFETs is described in terms of the intrinsic buffer capacity

  14. Protein from Meat or Vegetable Sources in Meals Matched for Fiber Content has Similar Effects on Subjective Appetite Sensations and Energy Intake—A Randomized Acute Cross-Over Meal Test Study

    Directory of Open Access Journals (Sweden)

    Lone V. Nielsen

    2018-01-01

    Full Text Available Higher-protein meals decrease hunger and increase satiety compared to lower-protein meals. However, no consensus exists about the different effects of animal and vegetable proteins on appetite. We investigated how a meal based on vegetable protein (fava beans/split peas affected ad libitum energy intake and appetite sensations, compared to macronutrient-balanced, iso-caloric meals based on animal protein (veal/pork or eggs. Thirty-five healthy men were enrolled in this acute cross-over study. On each test day, participants were presented with one of four test meals (~3550 kilojoules (kJ 19% of energy from protein, based on fava beans/split peas (28.5 g fiber, pork/veal or eggs supplemented with pea fiber to control for fiber content (28.5 g fiber, or eggs without supplementation of fiber (6.0 g fiber. Subjective appetite sensations were recorded at baseline and every half hour until the ad libitum meal three hours later. There were no differences in ad libitum energy intake across test meals (p > 0.05. Further, no differences were found across meals for hunger, satiety, fullness, prospective food consumption, or composite appetite score (all p > 0.05. Iso-caloric, macronutrient-balanced, fiber-matched meals based on vegetable protein (fava beans/split peas or animal protein (veal/pork or eggs had similar effects on ad libitum energy intake and appetite sensations.

  15. Protein from Meat or Vegetable Sources in Meals Matched for Fiber Content has Similar Effects on Subjective Appetite Sensations and Energy Intake-A Randomized Acute Cross-Over Meal Test Study.

    Science.gov (United States)

    Nielsen, Lone V; Kristensen, Marlene D; Klingenberg, Lars; Ritz, Christian; Belza, Anita; Astrup, Arne; Raben, Anne

    2018-01-16

    Higher-protein meals decrease hunger and increase satiety compared to lower-protein meals. However, no consensus exists about the different effects of animal and vegetable proteins on appetite. We investigated how a meal based on vegetable protein (fava beans/split peas) affected ad libitum energy intake and appetite sensations, compared to macronutrient-balanced, iso-caloric meals based on animal protein (veal/pork or eggs). Thirty-five healthy men were enrolled in this acute cross-over study. On each test day, participants were presented with one of four test meals (~3550 kilojoules (kJ) 19% of energy from protein), based on fava beans/split peas (28.5 g fiber), pork/veal or eggs supplemented with pea fiber to control for fiber content (28.5 g fiber), or eggs without supplementation of fiber (6.0 g fiber). Subjective appetite sensations were recorded at baseline and every half hour until the ad libitum meal three hours later. There were no differences in ad libitum energy intake across test meals ( p > 0.05). Further, no differences were found across meals for hunger, satiety, fullness, prospective food consumption, or composite appetite score (all p > 0.05). Iso-caloric, macronutrient-balanced, fiber-matched meals based on vegetable protein (fava beans/split peas) or animal protein (veal/pork or eggs) had similar effects on ad libitum energy intake and appetite sensations.

  16. Protein from Meat or Vegetable Sources in Meals Matched for Fiber Content has Similar Effects on Subjective Appetite Sensations and Energy Intake—A Randomized Acute Cross-Over Meal Test Study

    Science.gov (United States)

    Nielsen, Lone V.; Kristensen, Marlene D.; Klingenberg, Lars; Belza, Anita

    2018-01-01

    Higher-protein meals decrease hunger and increase satiety compared to lower-protein meals. However, no consensus exists about the different effects of animal and vegetable proteins on appetite. We investigated how a meal based on vegetable protein (fava beans/split peas) affected ad libitum energy intake and appetite sensations, compared to macronutrient-balanced, iso-caloric meals based on animal protein (veal/pork or eggs). Thirty-five healthy men were enrolled in this acute cross-over study. On each test day, participants were presented with one of four test meals (~3550 kilojoules (kJ) 19% of energy from protein), based on fava beans/split peas (28.5 g fiber), pork/veal or eggs supplemented with pea fiber to control for fiber content (28.5 g fiber), or eggs without supplementation of fiber (6.0 g fiber). Subjective appetite sensations were recorded at baseline and every half hour until the ad libitum meal three hours later. There were no differences in ad libitum energy intake across test meals (p > 0.05). Further, no differences were found across meals for hunger, satiety, fullness, prospective food consumption, or composite appetite score (all p > 0.05). Iso-caloric, macronutrient-balanced, fiber-matched meals based on vegetable protein (fava beans/split peas) or animal protein (veal/pork or eggs) had similar effects on ad libitum energy intake and appetite sensations. PMID:29337861

  17. The remarkable similarity between the acid-base properties of ISFETs and proteins and the consequences for the design of ISFET biosensors

    OpenAIRE

    Bergveld, Piet; van Hal, R.E.G.; van Hal, R.E.G.; Eijkel, Jan C.T.

    1995-01-01

    Studying the acid-base properties of protein molecules led us to reconsider the operational mechanism of ISFETs. Based on the site-dissociation model, applied to the amphoteric metal oxide gate materials used in ISFETs, the sensitivity of ISFETs is described in terms of the intrinsic buffer capacity of the oxide surface, ßs, and the electrical surface capacitance, Cs. The ISFET sensitivity towards changes in the bulk pH is fully described by the ratio ßs/Cs. Practical measurements support thi...

  18. Domain similarity based orthology detection.

    Science.gov (United States)

    Bitard-Feildel, Tristan; Kemena, Carsten; Greenwood, Jenny M; Bornberg-Bauer, Erich

    2015-05-13

    Orthologous protein detection software mostly uses pairwise comparisons of amino-acid sequences to assert whether two proteins are orthologous or not. Accordingly, when the number of sequences for comparison increases, the number of comparisons to compute grows in a quadratic order. A current challenge of bioinformatic research, especially when taking into account the increasing number of sequenced organisms available, is to make this ever-growing number of comparisons computationally feasible in a reasonable amount of time. We propose to speed up the detection of orthologous proteins by using strings of domains to characterize the proteins. We present two new protein similarity measures, a cosine and a maximal weight matching score based on domain content similarity, and new software, named porthoDom. The qualities of the cosine and the maximal weight matching similarity measures are compared against curated datasets. The measures show that domain content similarities are able to correctly group proteins into their families. Accordingly, the cosine similarity measure is used inside porthoDom, the wrapper developed for proteinortho. porthoDom makes use of domain content similarity measures to group proteins together before searching for orthologs. By using domains instead of amino acid sequences, the reduction of the search space decreases the computational complexity of an all-against-all sequence comparison. We demonstrate that representing and comparing proteins as strings of discrete domains, i.e. as a concatenation of their unique identifiers, allows a drastic simplification of search space. porthoDom has the advantage of speeding up orthology detection while maintaining a degree of accuracy similar to proteinortho. The implementation of porthoDom is released using python and C++ languages and is available under the GNU GPL licence 3 at http://www.bornberglab.org/pages/porthoda .

  19. IsoCleft Finder – a web-based tool for the detection and analysis of protein binding-site geometric and chemical similarities [v2; ref status: indexed, http://f1000r.es/13y

    Directory of Open Access Journals (Sweden)

    Natalja Kurbatova

    2013-05-01

    Full Text Available IsoCleft Finder is a web-based tool for the detection of local geometric and chemical similarities between potential small-molecule binding cavities and a non-redundant dataset of ligand-bound known small-molecule binding-sites. The non-redundant dataset developed as part of this study is composed of 7339 entries representing unique Pfam/PDB-ligand (hetero group code combinations with known levels of cognate ligand similarity. The query cavity can be uploaded by the user or detected automatically by the system using existing PDB entries as well as user-provided structures in PDB format. In all cases, the user can refine the definition of the cavity interactively via a browser-based Jmol 3D molecular visualization interface. Furthermore, users can restrict the search to a subset of the dataset using a cognate-similarity threshold. Local structural similarities are detected using the IsoCleft software and ranked according to two criteria (number of atoms in common and Tanimoto score of local structural similarity and the associated Z-score and p-value measures of statistical significance. The results, including predicted ligands, target proteins, similarity scores, number of atoms in common, etc., are shown in a powerful interactive graphical interface. This interface permits the visualization of target ligands superimposed on the query cavity and additionally provides a table of pairwise ligand topological similarities. Similarities between top scoring ligands serve as an additional tool to judge the quality of the results obtained. We present several examples where IsoCleft Finder provides useful functional information. IsoCleft Finder results are complementary to existing approaches for the prediction of protein function from structure, rational drug design and x-ray crystallography. IsoCleft Finder can be found at: http://bcb.med.usherbrooke.ca/isocleftfinder.

  20. Leveraging 3D chemical similarity, target and phenotypic data in the identification of drug-protein and drug-adverse effect associations.

    Science.gov (United States)

    Vilar, Santiago; Hripcsak, George

    2016-01-01

    Drug-target identification is crucial to discover novel applications for existing drugs and provide more insights about mechanisms of biological actions, such as adverse drug effects (ADEs). Computational methods along with the integration of current big data sources provide a useful framework for drug-target and drug-adverse effect discovery. In this article, we propose a method based on the integration of 3D chemical similarity, target and adverse effect data to generate a drug-target-adverse effect predictor along with a simple leveraging system to improve identification of drug-targets and drug-adverse effects. In the first step, we generated a system for multiple drug-target identification based on the application of 3D drug similarity into a large target dataset extracted from the ChEMBL. Next, we developed a target-adverse effect predictor combining targets from ChEMBL with phenotypic information provided by SIDER data source. Both modules were linked to generate a final predictor that establishes hypothesis about new drug-target-adverse effect candidates. Additionally, we showed that leveraging drug-target candidates with phenotypic data is very useful to improve the identification of drug-targets. The integration of phenotypic data into drug-target candidates yielded up to twofold precision improvement. In the opposite direction, leveraging drug-phenotype candidates with target data also yielded a significant enhancement in the performance. The modeling described in the current study is simple and efficient and has applications at large scale in drug repurposing and drug safety through the identification of mechanism of action of biological effects.

  1. Sequence similarity between the erythrocyte binding domain 1 of the Plasmodium vivax Duffy binding protein and the V3 loop of HIV-1 strain MN reveals binding residues for the Duffy Antigen Receptor for Chemokines

    OpenAIRE

    Bolton, Michael J; Garry, Robert F

    2011-01-01

    Abstract Background The surface glycoprotein (SU, gp120) of the human immunodeficiency virus (HIV) must bind to a chemokine receptor, CCR5 or CXCR4, to invade CD4+ cells. Plasmodium vivax uses the Duffy Binding Protein (DBP) to bind the Duffy Antigen Receptor for Chemokines (DARC) and invade reticulocytes. Results Variable loop 3 (V3) of HIV-1 SU and domain 1 of the Plasmodium vivax DBP share a sequence similarity. The site of amino acid sequence similarity was necessary, but not sufficient, ...

  2. Rapid detection of hypoxia-inducible factor-1-active tumours: pretargeted imaging with a protein degrading in a mechanism similar to hypoxia-inducible factor-1{alpha}

    Energy Technology Data Exchange (ETDEWEB)

    Ueda, Masashi [Kyoto University, Radioisotopes Research Laboratory, Kyoto University Hospital, Faculty of Medicine, Kyoto (Japan); Kyoto University, Department of Patho-Functional Bioanalysis, Graduate School of Pharmaceutical Sciences, Kyoto (Japan); Kudo, Takashi; Konishi, Hiroaki; Miyano, Azusa; Ono, Masahiro; Saji, Hideo [Kyoto University, Department of Patho-Functional Bioanalysis, Graduate School of Pharmaceutical Sciences, Kyoto (Japan); Kuge, Yuji [Kyoto University, Department of Patho-Functional Bioanalysis, Graduate School of Pharmaceutical Sciences, Kyoto (Japan); Hokkaido University, Central Institute of Isotope Science, Sapporo (Japan); Mukai, Takahiro [Kyushu University, Department of Biomolecular Recognition Chemistry, Graduate School of Pharmaceutical Sciences, Fukuoka (Japan); Tanaka, Shotaro; Kizaka-Kondoh, Shinae; Hiraoka, Masahiro [Kyoto University, Department of Radiation Oncology and Image-applied Therapy, Graduate School of Medicine, Kyoto (Japan)

    2010-08-15

    Hypoxia-inducible factor-1 (HIF-1) plays an important role in malignant tumour progression. For the imaging of HIF-1-active tumours, we previously developed a protein, POS, which is effectively delivered to and selectively stabilized in HIF-1-active cells, and a radioiodinated biotin derivative, (3-{sup 123}I-iodobenzoyl)norbiotinamide ({sup 123}I-IBB), which can bind to the streptavidin moiety of POS. In this study, we aimed to investigate the feasibility of the pretargeting method using POS and {sup 123}I-IBB for rapid imaging of HIF-1-active tumours. Tumour-implanted mice were pretargeted with POS. After 24 h, {sup 125}I-IBB was administered and subsequently, the biodistribution of radioactivity was investigated at several time points. In vivo planar imaging, comparison between {sup 125}I-IBB accumulation and HIF-1 transcriptional activity, and autoradiography were performed at 6 h after the administration of {sup 125}I-IBB. The same sections that were used in autoradiographic analysis were subjected to HIF-1{alpha} immunohistochemistry. {sup 125}I-IBB accumulation was observed in tumours of mice pretargeted with POS (1.6%ID/g at 6 h). This result is comparable to the data derived from {sup 125}I-IBB-conjugated POS-treated mice (1.4%ID/g at 24 h). In vivo planar imaging provided clear tumour images. The tumoral accumulation of {sup 125}I-IBB significantly correlated with HIF-1-dependent luciferase bioluminescence (R=0.84, p<0.01). The intratumoral distribution of {sup 125}I-IBB was heterogeneous and was significantly correlated with HIF-1{alpha}-positive regions (R=0.58, p<0.0001). POS pretargeting with {sup 123}I-IBB is a useful technique in the rapid imaging and detection of HIF-1-active regions in tumours. (orig.)

  3. Sequence similarity between the erythrocyte binding domain 1 of the Plasmodium vivax Duffy binding protein and the V3 loop of HIV-1 strain MN reveals binding residues for the Duffy Antigen Receptor for Chemokines

    Directory of Open Access Journals (Sweden)

    Garry Robert F

    2011-01-01

    Full Text Available Abstract Background The surface glycoprotein (SU, gp120 of the human immunodeficiency virus (HIV must bind to a chemokine receptor, CCR5 or CXCR4, to invade CD4+ cells. Plasmodium vivax uses the Duffy Binding Protein (DBP to bind the Duffy Antigen Receptor for Chemokines (DARC and invade reticulocytes. Results Variable loop 3 (V3 of HIV-1 SU and domain 1 of the Plasmodium vivax DBP share a sequence similarity. The site of amino acid sequence similarity was necessary, but not sufficient, for DARC binding and contained a consensus heparin binding site essential for DARC binding. Both HIV-1 and P. vivax can be blocked from binding to their chemokine receptors by the chemokine, RANTES and its analog AOP-RANTES. Site directed mutagenesis of the heparin binding motif in members of the DBP family, the P. knowlesi alpha, beta and gamma proteins abrogated their binding to erythrocytes. Positively charged residues within domain 1 are required for binding of P. vivax and P. knowlesi erythrocyte binding proteins. Conclusion A heparin binding site motif in members of the DBP family may form part of a conserved erythrocyte receptor binding pocket.

  4. Crystal structure of full-length Zika virus NS5 protein reveals a conformation similar to Japanese encephalitis virus NS5

    Energy Technology Data Exchange (ETDEWEB)

    Upadhyay, Anup K.; Cyr, Matthew; Longenecker, Kenton; Tripathi, Rakesh; Sun, Chaohong; Kempf, Dale J. (AbbVie)

    2017-02-21

    The rapid spread of the recentZika virus(ZIKV) epidemic across various countries in the American continent poses a major health hazard for the unborn fetuses of pregnant women. To date, there is no effective medical intervention. The nonstructural protein 5 ofZika virus(ZIKV-NS5) is critical for ZIKV replication through the 5'-RNA capping and RNA polymerase activities present in its N-terminal methyltransferase (MTase) and C-terminal RNA-dependent RNA polymerase (RdRp) domains, respectively. The crystal structure of the full-length ZIKV-NS5 protein has been determined at 3.05 Å resolution from a crystal belonging to space groupP21212 and containing two protein molecules in the asymmetric unit. The structure is similar to that reported for the NS5 protein fromJapanese encephalitis virusand suggests opportunities for structure-based drug design targeting either its MTase or RdRp domain.

  5. Sequence similarity between the erythrocyte binding domain of the Plasmodium vivax Duffy binding protein and the V3 loop of HIV-1 strain MN reveals a functional heparin binding motif involved in binding to the Duffy antigen receptor for chemokines

    Directory of Open Access Journals (Sweden)

    Bolton Michael J

    2011-11-01

    Full Text Available Abstract Background The HIV surface glycoprotein gp120 (SU, gp120 and the Plasmodium vivax Duffy binding protein (PvDBP bind to chemokine receptors during infection and have a site of amino acid sequence similarity in their binding domains that often includes a heparin binding motif (HBM. Infection by either pathogen has been found to be inhibited by polyanions. Results Specific polyanions that inhibit HIV infection and bind to the V3 loop of X4 strains also inhibited DBP-mediated infection of erythrocytes and DBP binding to the Duffy Antigen Receptor for Chemokines (DARC. A peptide including the HBM of PvDBP had similar affinity for heparin as RANTES and V3 loop peptides, and could be specifically inhibited from heparin binding by the same polyanions that inhibit DBP binding to DARC. However, some V3 peptides can competitively inhibit RANTES binding to heparin, but not the PvDBP HBM peptide. Three other members of the DBP family have an HBM sequence that is necessary for erythrocyte binding, however only the protein which binds to DARC, the P. knowlesi alpha protein, is inhibited by heparin from binding to erythrocytes. Heparitinase digestion does not affect the binding of DBP to erythrocytes. Conclusion The HBMs of DBPs that bind to DARC have similar heparin binding affinities as some V3 loop peptides and chemokines, are responsible for specific sulfated polysaccharide inhibition of parasite binding and invasion of red blood cells, and are more likely to bind to negative charges on the receptor than cell surface glycosaminoglycans.

  6. Sequence similarity between the erythrocyte binding domain of the Plasmodium vivax Duffy binding protein and the V3 loop of HIV-1 strain MN reveals a functional heparin binding motif involved in binding to the Duffy antigen receptor for chemokines

    OpenAIRE

    Bolton, Michael J; Garry, Robert F

    2011-01-01

    Abstract Background The HIV surface glycoprotein gp120 (SU, gp120) and the Plasmodium vivax Duffy binding protein (PvDBP) bind to chemokine receptors during infection and have a site of amino acid sequence similarity in their binding domains that often includes a heparin binding motif (HBM). Infection by either pathogen has been found to be inhibited by polyanions. Results Specific polyanions that inhibit HIV infection and bind to the V3 loop of X4 strains also inhibited DBP-mediated infectio...

  7. Secretome analysis of Aspergillus fumigatus reveals Asp-hemolysin as a major secreted protein.

    Science.gov (United States)

    Wartenberg, Dirk; Lapp, Katrin; Jacobsen, Ilse D; Dahse, Hans-Martin; Kniemeyer, Olaf; Heinekamp, Thorsten; Brakhage, Axel A

    2011-11-01

    Surface-associated and secreted proteins represent primarily exposed components of Aspergillus fumigatus during host infection. Several secreted proteins are known to be involved in defense mechanisms or immune evasion, thus, probably contributing to pathogenicity. Furthermore, several secreted antigens were identified as possible biomarkers for the verification of diseases caused by Aspergillus species. Nevertheless, there is only limited knowledge about the composition of the secretome and about molecular functions of particular proteins. To identify secreted proteins potentially essential for virulence, the core secretome of A. fumigatus grown in minimal medium was determined. Two-dimensional gel electrophoretic separation and subsequent MALDI-TOF-MS/MS analyses resulted in the identification of 64 different proteins. Additionally, secretome analyses of A. fumigatus utilizing elastin, collagen or keratin as main carbon and nitrogen source were performed. Thereby, the alkaline serine protease Alp1 was identified as the most abundant protein and hence presumably represents an important protease during host infection. Interestingly, the Asp-hemolysin (Asp-HS), which belongs to the protein family of aegerolysins and which was often suggested to be involved in fungal virulence, was present in the secretome under all growth conditions tested. In addition, a second, non-secreted protein with an aegerolysin domain annotated as Asp-hemolysin-like (HS-like) protein can be found to be encoded in the genome of A. fumigatus. Generation and analysis of Asp-HS and HS-like deletion strains revealed no differences in phenotype compared to the corresponding wild-type strain. Furthermore, hemolysis and cytotoxicity was not altered in both single-deletion and double-deletion mutants lacking both aegerolysin genes. All mutant strains showed no attenuation in virulence in a mouse infection model for invasive pulmonary aspergillosis. Overall, this study provides a comprehensive

  8. New Similarity Functions

    DEFF Research Database (Denmark)

    Yazdani, Hossein; Ortiz-Arroyo, Daniel; Kwasnicka, Halina

    2016-01-01

    spaces, in addition to their similarity in the vector space. Prioritized Weighted Feature Distance (PWFD) works similarly as WFD, but provides the ability to give priorities to desirable features. The accuracy of the proposed functions are compared with other similarity functions on several data sets....... Our results show that the proposed functions work better than other methods proposed in the literature....

  9. Phoneme Similarity and Confusability

    Science.gov (United States)

    Bailey, T.M.; Hahn, U.

    2005-01-01

    Similarity between component speech sounds influences language processing in numerous ways. Explanation and detailed prediction of linguistic performance consequently requires an understanding of these basic similarities. The research reported in this paper contrasts two broad classes of approach to the issue of phoneme similarity-theoretically…

  10. Similarities between the Epstein-Barr Virus (EBV) Nuclear Protein EBNA1 and the Pioneer Transcription Factor FoxA: Is EBNA1 a “Bookmarking” Oncoprotein that Alters the Host Cell Epigenotype?

    Science.gov (United States)

    Niller, Hans Helmut; Minarovits, Janos

    2012-01-01

    EBNA1, a nuclear protein expressed in all EBV-associated neoplasms is indispensable for the maintenance of the viral episomes in latently infected cells. EBNA1 may induce genetic alterations by upregulating cellular recombinases, production of reactive oxygen species (ROS) and affecting p53 levels and function. All these changes may contribute to tumorigenesis. In this overview we focus, however, on the epigenetic alterations elicited by EBNA1 by drawing a parallel between EBNA1 and the FoxA family of pioneer transcription factors. Both EBNA1 and FoxA induce local DNA demethylation, nucleosome destabilization and bind to mitotic chromosomes. Local DNA demethylation and nucleosome rearrangement mark active promoters and enhancers. In addition, EBNA1 and FoxA, when associated with mitotic chromatin may “bookmark” active genes and ensure their reactivation in postmitotic cells (epigenetic memory). We speculate that DNA looping induced by EBNA1-EBNA1 interactions may reorganize the cellular genome. Such chromatin loops, sustained in mitotic chromatin similarly to the long-distance interactions mediated by the insulator protein CTCF, may also mediate the epigenetic inheritance of gene expression patterns. We suggest that EBNA1 has the potential to induce patho-epigenetic alterations contributing to tumorigenesis. PMID:25436603

  11. Molecular similarity measures.

    Science.gov (United States)

    Maggiora, Gerald M; Shanmugasundaram, Veerabahu

    2011-01-01

    Molecular similarity is a pervasive concept in chemistry. It is essential to many aspects of chemical reasoning and analysis and is perhaps the fundamental assumption underlying medicinal chemistry. Dissimilarity, the complement of similarity, also plays a major role in a growing number of applications of molecular diversity in combinatorial chemistry, high-throughput screening, and related fields. How molecular information is represented, called the representation problem, is important to the type of molecular similarity analysis (MSA) that can be carried out in any given situation. In this work, four types of mathematical structure are used to represent molecular information: sets, graphs, vectors, and functions. Molecular similarity is a pairwise relationship that induces structure into sets of molecules, giving rise to the concept of chemical space. Although all three concepts - molecular similarity, molecular representation, and chemical space - are treated in this chapter, the emphasis is on molecular similarity measures. Similarity measures, also called similarity coefficients or indices, are functions that map pairs of compatible molecular representations that are of the same mathematical form into real numbers usually, but not always, lying on the unit interval. This chapter presents a somewhat pedagogical discussion of many types of molecular similarity measures, their strengths and limitations, and their relationship to one another. An expanded account of the material on chemical spaces presented in the first edition of this book is also provided. It includes a discussion of the topography of activity landscapes and the role that activity cliffs in these landscapes play in structure-activity studies.

  12. Similarity Measure of Graphs

    Directory of Open Access Journals (Sweden)

    Amine Labriji

    2017-07-01

    Full Text Available The topic of identifying the similarity of graphs was considered as highly recommended research field in the Web semantic, artificial intelligence, the shape recognition and information research. One of the fundamental problems of graph databases is finding similar graphs to a graph query. Existing approaches dealing with this problem are usually based on the nodes and arcs of the two graphs, regardless of parental semantic links. For instance, a common connection is not identified as being part of the similarity of two graphs in cases like two graphs without common concepts, the measure of similarity based on the union of two graphs, or the one based on the notion of maximum common sub-graph (SCM, or the distance of edition of graphs. This leads to an inadequate situation in the context of information research. To overcome this problem, we suggest a new measure of similarity between graphs, based on the similarity measure of Wu and Palmer. We have shown that this new measure satisfies the properties of a measure of similarities and we applied this new measure on examples. The results show that our measure provides a run time with a gain of time compared to existing approaches. In addition, we compared the relevance of the similarity values obtained, it appears that this new graphs measure is advantageous and  offers a contribution to solving the problem mentioned above.

  13. Processes of Similarity Judgment

    Science.gov (United States)

    Larkey, Levi B.; Markman, Arthur B.

    2005-01-01

    Similarity underlies fundamental cognitive capabilities such as memory, categorization, decision making, problem solving, and reasoning. Although recent approaches to similarity appreciate the structure of mental representations, they differ in the processes posited to operate over these representations. We present an experiment that…

  14. Judgments of brand similarity

    NARCIS (Netherlands)

    Bijmolt, THA; Wedel, M; Pieters, RGM; DeSarbo, WS

    This paper provides empirical insight into the way consumers make pairwise similarity judgments between brands, and how familiarity with the brands, serial position of the pair in a sequence, and the presentation format affect these judgments. Within the similarity judgment process both the

  15. All 17 S-locus F-box proteins of the S2 - and S3 -haplotypes of Petunia inflata are assembled into similar SCF complexes with a specific function in self-incompatibility.

    Science.gov (United States)

    Li, Shu; Williams, Justin S; Sun, Penglin; Kao, Teh-Hui

    2016-09-01

    The collaborative non-self-recognition model for S-RNase-based self-incompatibility predicts that multiple S-locus F-box proteins (SLFs) produced by pollen of a given S-haplotype collectively mediate ubiquitination and degradation of all non-self S-RNases, but not self S-RNases, in the pollen tube, thereby resulting in cross-compatible pollination but self-incompatible pollination. We had previously used pollen extracts containing GFP-fused S2 -SLF1 (SLF1 with an S2 -haplotype) of Petunia inflata for co-immunoprecipitation (Co-IP) and mass spectrometry (MS), and identified PiCUL1-P (a pollen-specific Cullin1), PiSSK1 (a pollen-specific Skp1-like protein) and PiRBX1 (a conventional Rbx1) as components of the SCF(S) (2-) (SLF) (1) complex. Using pollen extracts containing PiSSK1:FLAG:GFP for Co-IP/MS, we identified two additional SLFs (SLF4 and SLF13) that were assembled into SCF(SLF) complexes. As 17 SLF genes (SLF1 to SLF17) have been identified in S2 and S3 pollen, here we examined whether all 17 SLFs are assembled into similar complexes and, if so, whether these complexes are unique to SLFs. We modified the previous Co-IP/MS procedure, including the addition of style extracts from four different S-genotypes to pollen extracts containing PiSSK1:FLAG:GFP, to perform four separate experiments. The results taken together show that all 17 SLFs and an SLF-like protein, SLFLike1 (encoded by an S-locus-linked gene), co-immunoprecipitated with PiSSK1:FLAG:GFP. Moreover, of the 179 other F-box proteins predicted by S2 and S3 pollen transcriptomes, only a pair with 94.9% identity and another pair with 99.7% identity co-immunoprecipitated with PiSSK1:FLAG:GFP. These results suggest that SCF(SLF) complexes have evolved specifically to function in self-incompatibility. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  16. The semantic similarity ensemble

    Directory of Open Access Journals (Sweden)

    Andrea Ballatore

    2013-12-01

    Full Text Available Computational measures of semantic similarity between geographic terms provide valuable support across geographic information retrieval, data mining, and information integration. To date, a wide variety of approaches to geo-semantic similarity have been devised. A judgment of similarity is not intrinsically right or wrong, but obtains a certain degree of cognitive plausibility, depending on how closely it mimics human behavior. Thus selecting the most appropriate measure for a specific task is a significant challenge. To address this issue, we make an analogy between computational similarity measures and soliciting domain expert opinions, which incorporate a subjective set of beliefs, perceptions, hypotheses, and epistemic biases. Following this analogy, we define the semantic similarity ensemble (SSE as a composition of different similarity measures, acting as a panel of experts having to reach a decision on the semantic similarity of a set of geographic terms. The approach is evaluated in comparison to human judgments, and results indicate that an SSE performs better than the average of its parts. Although the best member tends to outperform the ensemble, all ensembles outperform the average performance of each ensemble's member. Hence, in contexts where the best measure is unknown, the ensemble provides a more cognitively plausible approach.

  17. Gender similarities and differences.

    Science.gov (United States)

    Hyde, Janet Shibley

    2014-01-01

    Whether men and women are fundamentally different or similar has been debated for more than a century. This review summarizes major theories designed to explain gender differences: evolutionary theories, cognitive social learning theory, sociocultural theory, and expectancy-value theory. The gender similarities hypothesis raises the possibility of theorizing gender similarities. Statistical methods for the analysis of gender differences and similarities are reviewed, including effect sizes, meta-analysis, taxometric analysis, and equivalence testing. Then, relying mainly on evidence from meta-analyses, gender differences are reviewed in cognitive performance (e.g., math performance), personality and social behaviors (e.g., temperament, emotions, aggression, and leadership), and psychological well-being. The evidence on gender differences in variance is summarized. The final sections explore applications of intersectionality and directions for future research.

  18. Native tandem and ion mobility mass spectrometry highlight structural and modular similarities in clustered-regularly-interspaced shot-palindromic-repeats (CRISPR)-associated protein complexes from Escherichia coli and Pseudomonas aeruginosa.

    Science.gov (United States)

    van Duijn, Esther; Barbu, Ioana M; Barendregt, Arjan; Jore, Matthijs M; Wiedenheft, Blake; Lundgren, Magnus; Westra, Edze R; Brouns, Stan J J; Doudna, Jennifer A; van der Oost, John; Heck, Albert J R

    2012-11-01

    The CRISPR/Cas (clustered regularly interspaced short palindromic repeats/CRISPR-associated genes) immune system of bacteria and archaea provides acquired resistance against viruses and plasmids, by a strategy analogous to RNA-interference. Key components of the defense system are ribonucleoprotein complexes, the composition of which appears highly variable in different CRISPR/Cas subtypes. Previous studies combined mass spectrometry, electron microscopy, and small angle x-ray scattering to demonstrate that the E. coli Cascade complex (405 kDa) and the P. aeruginosa Csy-complex (350 kDa) are similar in that they share a central spiral-shaped hexameric structure, flanked by associating proteins and one CRISPR RNA. Recently, a cryo-electron microscopy structure of Cascade revealed that the CRISPR RNA molecule resides in a groove of the hexameric backbone. For both complexes we here describe the use of native mass spectrometry in combination with ion mobility mass spectrometry to assign a stable core surrounded by more loosely associated modules. Via computational modeling subcomplex structures were proposed that relate to the experimental IMMS data. Despite the absence of obvious sequence homology between several subunits, detailed analysis of sub-complexes strongly suggests analogy between subunits of the two complexes. Probing the specific association of E. coli Cascade/crRNA to its complementary DNA target reveals a conformational change. All together these findings provide relevant new information about the potential assembly process of the two CRISPR-associated complexes.

  19. Similarity or difference?

    DEFF Research Database (Denmark)

    Villadsen, Anders Ryom

    2013-01-01

    While the organizational structures and strategies of public organizations have attracted substantial research attention among public management scholars, little research has explored how these organizational core dimensions are interconnected and influenced by pressures for similarity....... In this paper I address this topic by exploring the relation between expenditure strategy isomorphism and structure isomorphism in Danish municipalities. Different literatures suggest that organizations exist in concurrent pressures for being similar to and different from other organizations in their field......-shaped relation exists between expenditure strategy isomorphism and structure isomorphism in a longitudinal quantitative study of Danish municipalities....

  20. Comparing Harmonic Similarity Measures

    NARCIS (Netherlands)

    de Haas, W.B.; Robine, M.; Hanna, P.; Veltkamp, R.C.; Wiering, F.

    2010-01-01

    We present an overview of the most recent developments in polyphonic music retrieval and an experiment in which we compare two harmonic similarity measures. In contrast to earlier work, in this paper we specifically focus on the symbolic chord description as the primary musical representation and

  1. Similar or different?

    DEFF Research Database (Denmark)

    Cornér, Solveig; Pyhältö, Kirsi; Peltonen, Jouni

    2018-01-01

    Previous research has identified researcher community and supervisory support as key determinants of the doctoral journey contributing to students’ persistence and robustness. However, we still know little about cross-cultural variation in the researcher community and supervisory support experien...... counter partners, whereas the Finnish students perceived lower levels of instrumental support than the Danish students. The findings imply that seemingly similar contexts hold valid differences in experienced social support and educational strategies at the PhD level....... experienced by PhD students within the same discipline. This study explores the support experiences of 381 PhD students within the humanities and social sciences from three research-intensive universities in Denmark (n=145) and Finland (n=236). The mixed methods design was utilized. The data were collected...... counter partners. The results also indicated that the only form of support in which the students expressed more matched support than mismatched support was informational support. Further investigation showed that the Danish students reported a high level of mismatch in emotional support than their Finnish...

  2. Isolation of a novel promoter for efficient protein expression by Aspergillus oryzae in solid-state culture.

    Science.gov (United States)

    Bando, Hiroki; Hisada, Hiromoto; Ishida, Hiroki; Hata, Yoji; Katakura, Yoshio; Kondo, Akihiko

    2011-11-01

    A novel promoter from a hemolysin-like protein encoding the gene, hlyA, was characterized for protein overexpression in Aspergillus oryzae grown in solid-state culture. Using endo-1,4-β-glucanase from A. oryzae (CelA) as the reporter, promoter activity was found to be higher than that of the α-amylase (amyA) and manganese superoxide dismutase (sodM) genes not only in wheat bran solid-state culture but also in liquid culture. Expression of the A. oryzae endoglucanase CelB and two heterologous endoglucanases (TrEglI and TrEglIII from Trichoderma reesei) under the control of the hlyA promoter were also found to be stronger than under the control of the amyA promoter in A. oryzae grown in wheat bran solid-state culture, suggesting that the hlyA promoter may be useful for the overproduction of other proteins as well. In wheat bran solid-state culture, the productivity of the hlyA promoter in terms of protein produced was high when the cultivation temperature was 30°C or 37°C, when the water content was 0.6 or 0.8 ml/g wheat bran, and from 48 to 72 h after inoculation. Because A. oryzae sporulated actively under these conditions and because hemolysin has been reported to play a role in fungal fruiting body formation, high-level expression of hlyA may be related to sporulation.

  3. A novel seven-octapeptide repeat insertion in the prion protein gene (PRNP) in a Dutch pedigree with Gerstmann-Sträussler-Scheinker disease phenotype: comparison with similar cases from the literature

    NARCIS (Netherlands)

    Jansen, Casper; Voet, Willem; Head, Mark W.; Parchi, Piero; Yull, Helen; Verrips, Aad; Wesseling, Pieter; Meulstee, Jan; Baas, Frank; van Gool, Willem A.; Ironside, James W.; Rozemuller, Annemieke J. M.

    2011-01-01

    Human prion diseases can be sporadic, inherited or acquired by infection and show considerable phenotypic heterogeneity. We describe the clinical, histopathological and pathological prion protein (PrP(Sc)) characteristics of a Dutch family with a novel 7-octapeptide repeat insertion (7-OPRI) in

  4. Ultrastructural changes and Heat Shock Proteins 70 induced by atmospheric pollution are similar to the effects observed under in vitro heavy metals stress in Conocephalum conicum (Marchantiales--Bryophyta).

    Science.gov (United States)

    Basile, Adriana; Sorbo, Sergio; Conte, Barbara; Cardi, Manuela; Esposito, Sergio

    2013-11-01

    Changes in ultrastructure and induction of Heat Shock Proteins 70 have been studied in Conocephalum conicum (Marchantiales) collected in different urban and country sites in Italy. These results were compared to the effects in vitro of exposition to different heavy metals for several days. At urban sites, cellular ultrastructure was modified, and heavy metals could be observed accumulating in cell walls. Simultaneously, a strong increment in Hsp70 was detected, compared with results observed on control specimens. When C. conicum was exposed to heavy metals in vitro, comparable effects as in polluted sites were observed: Cd and Pb accumulated mostly within parenchyma and, within cells, were absorbed to cell walls or concentrated in vacuoles. Moreover, severe alterations were observed in organelles. Concomitantly, a progressive accumulation of Hsp70 was detected following heavy metals exposition. These effects are discussed in order to describe the dose and time-dependent response to heavy metal stress in C. conicum. Copyright © 2013 Elsevier Ltd. All rights reserved.

  5. Identifying mechanistic similarities in drug responses

    KAUST Repository

    Zhao, C.

    2012-05-15

    Motivation: In early drug development, it would be beneficial to be able to identify those dynamic patterns of gene response that indicate that drugs targeting a particular gene will be likely or not to elicit the desired response. One approach would be to quantitate the degree of similarity between the responses that cells show when exposed to drugs, so that consistencies in the regulation of cellular response processes that produce success or failure can be more readily identified.Results: We track drug response using fluorescent proteins as transcription activity reporters. Our basic assumption is that drugs inducing very similar alteration in transcriptional regulation will produce similar temporal trajectories on many of the reporter proteins and hence be identified as having similarities in their mechanisms of action (MOA). The main body of this work is devoted to characterizing similarity in temporal trajectories/signals. To do so, we must first identify the key points that determine mechanistic similarity between two drug responses. Directly comparing points on the two signals is unrealistic, as it cannot handle delays and speed variations on the time axis. Hence, to capture the similarities between reporter responses, we develop an alignment algorithm that is robust to noise, time delays and is able to find all the contiguous parts of signals centered about a core alignment (reflecting a core mechanism in drug response). Applying the proposed algorithm to a range of real drug experiments shows that the result agrees well with the prior drug MOA knowledge. © The Author 2012. Published by Oxford University Press. All rights reserved.

  6. BLAST and FASTA similarity searching for multiple sequence alignment.

    Science.gov (United States)

    Pearson, William R

    2014-01-01

    BLAST, FASTA, and other similarity searching programs seek to identify homologous proteins and DNA sequences based on excess sequence similarity. If two sequences share much more similarity than expected by chance, the simplest explanation for the excess similarity is common ancestry-homology. The most effective similarity searches compare protein sequences, rather than DNA sequences, for sequences that encode proteins, and use expectation values, rather than percent identity, to infer homology. The BLAST and FASTA packages of sequence comparison programs provide programs for comparing protein and DNA sequences to protein databases (the most sensitive searches). Protein and translated-DNA comparisons to protein databases routinely allow evolutionary look back times from 1 to 2 billion years; DNA:DNA searches are 5-10-fold less sensitive. BLAST and FASTA can be run on popular web sites, but can also be downloaded and installed on local computers. With local installation, target databases can be customized for the sequence data being characterized. With today's very large protein databases, search sensitivity can also be improved by searching smaller comprehensive databases, for example, a complete protein set from an evolutionarily neighboring model organism. By default, BLAST and FASTA use scoring strategies target for distant evolutionary relationships; for comparisons involving short domains or queries, or searches that seek relatively close homologs (e.g. mouse-human), shallower scoring matrices will be more effective. Both BLAST and FASTA provide very accurate statistical estimates, which can be used to reliably identify protein sequences that diverged more than 2 billion years ago.

  7. A COMPARISON OF SEMANTIC SIMILARITY MODELS IN EVALUATING CONCEPT SIMILARITY

    Directory of Open Access Journals (Sweden)

    Q. X. Xu

    2012-08-01

    Full Text Available The semantic similarities are important in concept definition, recognition, categorization, interpretation, and integration. Many semantic similarity models have been established to evaluate semantic similarities of objects or/and concepts. To find out the suitability and performance of different models in evaluating concept similarities, we make a comparison of four main types of models in this paper: the geometric model, the feature model, the network model, and the transformational model. Fundamental principles and main characteristics of these models are introduced and compared firstly. Land use and land cover concepts of NLCD92 are employed as examples in the case study. The results demonstrate that correlations between these models are very high for a possible reason that all these models are designed to simulate the similarity judgement of human mind.

  8. Renewing the Respect for Similarity

    Directory of Open Access Journals (Sweden)

    Shimon eEdelman

    2012-07-01

    Full Text Available In psychology, the concept of similarity has traditionally evoked a mixture of respect, stemmingfrom its ubiquity and intuitive appeal, and concern, due to its dependence on the framing of the problemat hand and on its context. We argue for a renewed focus on similarity as an explanatory concept, bysurveying established results and new developments in the theory and methods of similarity-preservingassociative lookup and dimensionality reduction — critical components of many cognitive functions, aswell as of intelligent data management in computer vision. We focus in particular on the growing familyof algorithms that support associative memory by performing hashing that respects local similarity, andon the uses of similarity in representing structured objects and scenes. Insofar as these similarity-basedideas and methods are useful in cognitive modeling and in AI applications, they should be included inthe core conceptual toolkit of computational neuroscience.

  9. Domain similarity based orthology detection

    OpenAIRE

    Bitard-Feildel, Tristan; Kemena, Carsten; Greenwood, Jenny M; Bornberg-Bauer, Erich

    2015-01-01

    Background Orthologous protein detection software mostly uses pairwise comparisons of amino-acid sequences to assert whether two proteins are orthologous or not. Accordingly, when the number of sequences for comparison increases, the number of comparisons to compute grows in a quadratic order. A current challenge of bioinformatic research, especially when taking into account the increasing number of sequenced organisms available, is to make this ever-growing number of comparisons computationa...

  10. Self-similar cosmological models

    Energy Technology Data Exchange (ETDEWEB)

    Chao, W Z [Cambridge Univ. (UK). Dept. of Applied Mathematics and Theoretical Physics

    1981-07-01

    The kinematics and dynamics of self-similar cosmological models are discussed. The degrees of freedom of the solutions of Einstein's equations for different types of models are listed. The relation between kinematic quantities and the classifications of the self-similarity group is examined. All dust local rotational symmetry models have been found.

  11. Self-similar factor approximants

    International Nuclear Information System (INIS)

    Gluzman, S.; Yukalov, V.I.; Sornette, D.

    2003-01-01

    The problem of reconstructing functions from their asymptotic expansions in powers of a small variable is addressed by deriving an improved type of approximants. The derivation is based on the self-similar approximation theory, which presents the passage from one approximant to another as the motion realized by a dynamical system with the property of group self-similarity. The derived approximants, because of their form, are called self-similar factor approximants. These complement the obtained earlier self-similar exponential approximants and self-similar root approximants. The specific feature of self-similar factor approximants is that their control functions, providing convergence of the computational algorithm, are completely defined from the accuracy-through-order conditions. These approximants contain the Pade approximants as a particular case, and in some limit they can be reduced to the self-similar exponential approximants previously introduced by two of us. It is proved that the self-similar factor approximants are able to reproduce exactly a wide class of functions, which include a variety of nonalgebraic functions. For other functions, not pertaining to this exactly reproducible class, the factor approximants provide very accurate approximations, whose accuracy surpasses significantly that of the most accurate Pade approximants. This is illustrated by a number of examples showing the generality and accuracy of the factor approximants even when conventional techniques meet serious difficulties

  12. Dynamic similarity in erosional processes

    Science.gov (United States)

    Scheidegger, A.E.

    1963-01-01

    A study is made of the dynamic similarity conditions obtaining in a variety of erosional processes. The pertinent equations for each type of process are written in dimensionless form; the similarity conditions can then easily be deduced. The processes treated are: raindrop action, slope evolution and river erosion. ?? 1963 Istituto Geofisico Italiano.

  13. Personalized recommendation with corrected similarity

    International Nuclear Information System (INIS)

    Zhu, Xuzhen; Tian, Hui; Cai, Shimin

    2014-01-01

    Personalized recommendation has attracted a surge of interdisciplinary research. Especially, similarity-based methods in applications of real recommendation systems have achieved great success. However, the computations of similarities are overestimated or underestimated, in particular because of the defective strategy of unidirectional similarity estimation. In this paper, we solve this drawback by leveraging mutual correction of forward and backward similarity estimations, and propose a new personalized recommendation index, i.e., corrected similarity based inference (CSI). Through extensive experiments on four benchmark datasets, the results show a greater improvement of CSI in comparison with these mainstream baselines. And a detailed analysis is presented to unveil and understand the origin of such difference between CSI and mainstream indices. (paper)

  14. Towards Personalized Medicine: Leveraging Patient Similarity and Drug Similarity Analytics

    Science.gov (United States)

    Zhang, Ping; Wang, Fei; Hu, Jianying; Sorrentino, Robert

    2014-01-01

    The rapid adoption of electronic health records (EHR) provides a comprehensive source for exploratory and predictive analytic to support clinical decision-making. In this paper, we investigate how to utilize EHR to tailor treatments to individual patients based on their likelihood to respond to a therapy. We construct a heterogeneous graph which includes two domains (patients and drugs) and encodes three relationships (patient similarity, drug similarity, and patient-drug prior associations). We describe a novel approach for performing a label propagation procedure to spread the label information representing the effectiveness of different drugs for different patients over this heterogeneous graph. The proposed method has been applied on a real-world EHR dataset to help identify personalized treatments for hypercholesterolemia. The experimental results demonstrate the effectiveness of the approach and suggest that the combination of appropriate patient similarity and drug similarity analytics could lead to actionable insights for personalized medicine. Particularly, by leveraging drug similarity in combination with patient similarity, our method could perform well even on new or rarely used drugs for which there are few records of known past performance. PMID:25717413

  15. Similarity measures for face recognition

    CERN Document Server

    Vezzetti, Enrico

    2015-01-01

    Face recognition has several applications, including security, such as (authentication and identification of device users and criminal suspects), and in medicine (corrective surgery and diagnosis). Facial recognition programs rely on algorithms that can compare and compute the similarity between two sets of images. This eBook explains some of the similarity measures used in facial recognition systems in a single volume. Readers will learn about various measures including Minkowski distances, Mahalanobis distances, Hansdorff distances, cosine-based distances, among other methods. The book also summarizes errors that may occur in face recognition methods. Computer scientists "facing face" and looking to select and test different methods of computing similarities will benefit from this book. The book is also useful tool for students undertaking computer vision courses.

  16. Revisiting Inter-Genre Similarity

    DEFF Research Database (Denmark)

    Sturm, Bob L.; Gouyon, Fabien

    2013-01-01

    We revisit the idea of ``inter-genre similarity'' (IGS) for machine learning in general, and music genre recognition in particular. We show analytically that the probability of error for IGS is higher than naive Bayes classification with zero-one loss (NB). We show empirically that IGS does...... not perform well, even for data that satisfies all its assumptions....

  17. Fast business process similarity search

    NARCIS (Netherlands)

    Yan, Z.; Dijkman, R.M.; Grefen, P.W.P.J.

    2012-01-01

    Nowadays, it is common for organizations to maintain collections of hundreds or even thousands of business processes. Techniques exist to search through such a collection, for business process models that are similar to a given query model. However, those techniques compare the query model to each

  18. Glove boxes and similar containments

    International Nuclear Information System (INIS)

    Anon.

    1975-01-01

    According to the present invention a glove box or similar containment is provided with an exhaust system including a vortex amplifier venting into the system, the vortex amplifier also having its main inlet in fluid flow connection with the containment and a control inlet in fluid flow connection with the atmosphere outside the containment. (U.S.)

  19. Large margin classification with indefinite similarities

    KAUST Repository

    Alabdulmohsin, Ibrahim

    2016-01-07

    Classification with indefinite similarities has attracted attention in the machine learning community. This is partly due to the fact that many similarity functions that arise in practice are not symmetric positive semidefinite, i.e. the Mercer condition is not satisfied, or the Mercer condition is difficult to verify. Examples of such indefinite similarities in machine learning applications are ample including, for instance, the BLAST similarity score between protein sequences, human-judged similarities between concepts and words, and the tangent distance or the shape matching distance in computer vision. Nevertheless, previous works on classification with indefinite similarities are not fully satisfactory. They have either introduced sources of inconsistency in handling past and future examples using kernel approximation, settled for local-minimum solutions using non-convex optimization, or produced non-sparse solutions by learning in Krein spaces. Despite the large volume of research devoted to this subject lately, we demonstrate in this paper how an old idea, namely the 1-norm support vector machine (SVM) proposed more than 15 years ago, has several advantages over more recent work. In particular, the 1-norm SVM method is conceptually simpler, which makes it easier to implement and maintain. It is competitive, if not superior to, all other methods in terms of predictive accuracy. Moreover, it produces solutions that are often sparser than more recent methods by several orders of magnitude. In addition, we provide various theoretical justifications by relating 1-norm SVM to well-established learning algorithms such as neural networks, SVM, and nearest neighbor classifiers. Finally, we conduct a thorough experimental evaluation, which reveals that the evidence in favor of 1-norm SVM is statistically significant.

  20. An Alfven eigenmode similarity experiment

    International Nuclear Information System (INIS)

    Heidbrink, W W; Fredrickson, E; Gorelenkov, N N; Hyatt, A W; Kramer, G; Luo, Y

    2003-01-01

    The major radius dependence of Alfven mode stability is studied by creating plasmas with similar minor radius, shape, magnetic field (0.5 T), density (n e ≅3x10 19 m -3 ), electron temperature (1.0 keV) and beam ion population (near-tangential 80 keV deuterium injection) on both NSTX and DIII-D. The major radius of NSTX is half the major radius of DIII-D. The super-Alfvenic beam ions that drive the modes have overlapping values of v f /v A in the two devices. Observed beam-driven instabilities include toroidicity-induced Alfven eigenmodes (TAE). The stability threshold for the TAE is similar in the two devices. As expected theoretically, the most unstable toroidal mode number n is larger in DIII-D

  1. Compressional Alfven Eigenmode Similarity Study

    Science.gov (United States)

    Heidbrink, W. W.; Fredrickson, E. D.; Gorelenkov, N. N.; Rhodes, T. L.

    2004-11-01

    NSTX and DIII-D are nearly ideal for Alfven eigenmode (AE) similarity experiments, having similar neutral beams, fast-ion to Alfven speed v_f/v_A, fast-ion pressure, and shape of the plasma, but with a factor of 2 difference in the major radius. Toroidicity-induced AE with ˜100 kHz frequencies were compared in an earlier study [1]; this paper focuses on higher frequency AE with f ˜ 1 MHz. Compressional AE (CAE) on NSTX have a polarization, dependence on the fast-ion distribution function, frequency scaling, and low-frequency limit that are qualitatively consistent with CAE theory [2]. Global AE (GAE) are also observed. On DIII-D, coherent modes in this frequency range are observed during low-field (0.6 T) similarity experiments. Experiments will compare the CAE stability limits on DIII-D with the NSTX stability limits, with the aim of determining if CAE will be excited by alphas in a reactor. Predicted differences in the frequency splitting Δ f between excited modes will also be used. \\vspace0.25em [1] W.W. Heidbrink, et al., Plasmas Phys. Control. Fusion 45, 983 (2003). [2] E.D. Fredrickson, et al., Princeton Plasma Physics Laboratory Report PPPL-3955 (2004).

  2. Prioritization of candidate disease genes by combining topological similarity and semantic similarity.

    Science.gov (United States)

    Liu, Bin; Jin, Min; Zeng, Pan

    2015-10-01

    The identification of gene-phenotype relationships is very important for the treatment of human diseases. Studies have shown that genes causing the same or similar phenotypes tend to interact with each other in a protein-protein interaction (PPI) network. Thus, many identification methods based on the PPI network model have achieved good results. However, in the PPI network, some interactions between the proteins encoded by candidate gene and the proteins encoded by known disease genes are very weak. Therefore, some studies have combined the PPI network with other genomic information and reported good predictive performances. However, we believe that the results could be further improved. In this paper, we propose a new method that uses the semantic similarity between the candidate gene and known disease genes to set the initial probability vector of a random walk with a restart algorithm in a human PPI network. The effectiveness of our method was demonstrated by leave-one-out cross-validation, and the experimental results indicated that our method outperformed other methods. Additionally, our method can predict new causative genes of multifactor diseases, including Parkinson's disease, breast cancer and obesity. The top predictions were good and consistent with the findings in the literature, which further illustrates the effectiveness of our method. Copyright © 2015 Elsevier Inc. All rights reserved.

  3. Similarity analysis between quantum images

    Science.gov (United States)

    Zhou, Ri-Gui; Liu, XingAo; Zhu, Changming; Wei, Lai; Zhang, Xiafen; Ian, Hou

    2018-06-01

    Similarity analyses between quantum images are so essential in quantum image processing that it provides fundamental research for the other fields, such as quantum image matching, quantum pattern recognition. In this paper, a quantum scheme based on a novel quantum image representation and quantum amplitude amplification algorithm is proposed. At the end of the paper, three examples and simulation experiments show that the measurement result must be 0 when two images are same, and the measurement result has high probability of being 1 when two images are different.

  4. Similarity flows in relativistic hydrodynamics

    International Nuclear Information System (INIS)

    Blaizot, J.P.; Ollitrault, J.Y.

    1986-01-01

    In ultra-relativistic heavy ion collisions, one expects in particular to observe a deconfinement transition leading to a formation of quark gluon plasma. In the framework of the hydrodynamic model, experimental signatures of such a plasma may be looked for as observable consequences of a first order transition on the evolution of the system. In most of the possible scenario, the phase transition is accompanied with discontinuities in the hydrodynamic flow, such as shock waves. The method presented in this paper has been developed to treat without too much numerical effort such discontinuous flow. It relies heavily on the use of similarity solutions of the hydrodynamic equations

  5. Self-similar gravitational clustering

    International Nuclear Information System (INIS)

    Efstathiou, G.; Fall, S.M.; Hogan, C.

    1979-01-01

    The evolution of gravitational clustering is considered and several new scaling relations are derived for the multiplicity function. These include generalizations of the Press-Schechter theory to different densities and cosmological parameters. The theory is then tested against multiplicity function and correlation function estimates for a series of 1000-body experiments. The results are consistent with the theory and show some dependence on initial conditions and cosmological density parameter. The statistical significance of the results, however, is fairly low because of several small number effects in the experiments. There is no evidence for a non-linear bootstrap effect or a dependence of the multiplicity function on the internal dynamics of condensed groups. Empirical estimates of the multiplicity function by Gott and Turner have a feature near the characteristic luminosity predicted by the theory. The scaling relations allow the inference from estimates of the galaxy luminosity function that galaxies must have suffered considerable dissipation if they originally formed from a self-similar hierarchy. A method is also developed for relating the multiplicity function to similar measures of clustering, such as those of Bhavsar, for the distribution of galaxies on the sky. These are shown to depend on the luminosity function in a complicated way. (author)

  6. Seniority bosons from similarity transformations

    International Nuclear Information System (INIS)

    Geyer, H.B.

    1986-01-01

    The requirement of associating in the boson space seniority with twice the number of non-s bosons defines a similarity transformation which re-expresses the Dyson pair boson images in terms of seniority bosons. In particular the fermion S-pair creation operator is mapped onto an operator which, unlike the pair boson image, does not change the number of non-s bosons. The original results of Otsuka, Arima and Iachello are recovered by this procedure while at the same time they are generalized to include g-bosons or even bosons with J>4 as well as any higher order boson terms. Furthermore the seniority boson images are valid for an arbitrary number of d- or g-bosons - a result which is not readily obtainable within the framework of the usual Marumori- or OAI-method

  7. Alaska, Gulf spills share similarities

    International Nuclear Information System (INIS)

    Usher, D.

    1991-01-01

    The accidental Exxon Valdez oil spill in Alaska and the deliberate dumping of crude oil into the Persian Gulf as a tactic of war contain both glaring differences and surprising similarities. Public reaction and public response was much greater to the Exxon Valdez spill in pristine Prince William Sound than to the war-related tragedy in the Persian Gulf. More than 12,000 workers helped in the Alaskan cleanup; only 350 have been involved in Kuwait. But in both instances, environmental damages appear to be less than anticipated. Natures highly effective self-cleansing action is primarily responsible for minimizing the damages. One positive action growing out of the two incidents is increased international cooperation and participation in oil-spill clean-up efforts. In 1990, in the aftermath of the Exxon Valdez spill, 94 nations signed an international accord on cooperation in future spills. The spills can be historic environmental landmarks leading to creation of more sophisticated response systems worldwide

  8. Hilar cholangiocarcinoma is pathologically similar to pancreatic duct adenocarcinoma: suggestions of similar background and development.

    Science.gov (United States)

    Nakanuma, Yasuni; Sato, Yasunori

    2014-07-01

    Routine experiences suggest that cholangiocarcinomas (CCAs) show different clinicopathological behaviors along the biliary tree, and hilar CCA apparently resembles pancreatic duct adenocarcinoma (PDAC). Herein, the backgrounds for these similarities were reviewed. While all cases of PDAC, hilar CCA, intrahepatic CCA (ICCA) and CCA components of combined hepatocellular-cholangiocarcinoma (cHC-CCA) were adenocarcinomas, micropapillary patterns and columnar carcinoma cells were common in PDAC and hilar CCA, and trabecular components and cuboidal carcinoma cells were common in ICCA and CCA components of cHC-CCA. Anterior gradient protein-2 and S100P were frequently expressed in perihilar CCA and PDAC, while neural cell adhesion molecule and luminal epithelial membrane antigen were common in CCA components of c-HC-CCA. Pdx1 and Hes1 were frequently and markedly expressed aberrantly in PDAC and perihilar CCA, although their expression was rare and mild in CCA components in cHC-CCA and ICCA. Hilar CCA showed a similar postoperative prognosis to PDAC but differed from ICCA and cHC-CCA. Taken together, hilar CCA may differ from ICCA and CCA components of cHC-CCA but have a similar development to PDAC. These similarities may be explained by the unique anatomical, embryological and reactive nature of the pancreatobiliary tract. Further studies of these intractable malignancies are warranted. © 2014 Japanese Society of Hepato-Biliary-Pancreatic Surgery.

  9. Development of similarity theory for control systems

    Science.gov (United States)

    Myshlyaev, L. P.; Evtushenko, V. F.; Ivushkin, K. A.; Makarov, G. V.

    2018-05-01

    The area of effective application of the traditional similarity theory and the need necessity of its development for systems are discussed. The main statements underlying the similarity theory of control systems are given. The conditions for the similarity of control systems and the need for similarity control control are formulated. Methods and algorithms for estimating and similarity control of control systems and the results of research of control systems based on their similarity are presented. The similarity control of systems includes the current evaluation of the degree of similarity of control systems and the development of actions controlling similarity, and the corresponding targeted change in the state of any element of control systems.

  10. Using SQL Databases for Sequence Similarity Searching and Analysis.

    Science.gov (United States)

    Pearson, William R; Mackey, Aaron J

    2017-09-13

    Relational databases can integrate diverse types of information and manage large sets of similarity search results, greatly simplifying genome-scale analyses. By focusing on taxonomic subsets of sequences, relational databases can reduce the size and redundancy of sequence libraries and improve the statistical significance of homologs. In addition, by loading similarity search results into a relational database, it becomes possible to explore and summarize the relationships between all of the proteins in an organism and those in other biological kingdoms. This unit describes how to use relational databases to improve the efficiency of sequence similarity searching and demonstrates various large-scale genomic analyses of homology-related data. It also describes the installation and use of a simple protein sequence database, seqdb_demo, which is used as a basis for the other protocols. The unit also introduces search_demo, a database that stores sequence similarity search results. The search_demo database is then used to explore the evolutionary relationships between E. coli proteins and proteins in other organisms in a large-scale comparative genomic analysis. © 2017 by John Wiley & Sons, Inc. Copyright © 2017 John Wiley & Sons, Inc.

  11. Marriage Matters: Spousal Similarity in Life Satisfaction

    OpenAIRE

    Ulrich Schimmack; Richard Lucas

    2006-01-01

    Examined the concurrent and cross-lagged spousal similarity in life satisfaction over a 21-year period. Analyses were based on married couples (N = 847) in the German Socio-Economic Panel (SOEP). Concurrent spousal similarity was considerably higher than one-year retest similarity, revealing spousal similarity in the variable component of life satisfac-tion. Spousal similarity systematically decreased with length of retest interval, revealing simi-larity in the changing component of life sati...

  12. Exploring the relationship between sequence similarity and accurate phylogenetic trees.

    Science.gov (United States)

    Cantarel, Brandi L; Morrison, Hilary G; Pearson, William

    2006-11-01

    We have characterized the relationship between accurate phylogenetic reconstruction and sequence similarity, testing whether high levels of sequence similarity can consistently produce accurate evolutionary trees. We generated protein families with known phylogenies using a modified version of the PAML/EVOLVER program that produces insertions and deletions as well as substitutions. Protein families were evolved over a range of 100-400 point accepted mutations; at these distances 63% of the families shared significant sequence similarity. Protein families were evolved using balanced and unbalanced trees, with ancient or recent radiations. In families sharing statistically significant similarity, about 60% of multiple sequence alignments were 95% identical to true alignments. To compare recovered topologies with true topologies, we used a score that reflects the fraction of clades that were correctly clustered. As expected, the accuracy of the phylogenies was greatest in the least divergent families. About 88% of phylogenies clustered over 80% of clades in families that shared significant sequence similarity, using Bayesian, parsimony, distance, and maximum likelihood methods. However, for protein families with short ancient branches (ancient radiation), only 30% of the most divergent (but statistically significant) families produced accurate phylogenies, and only about 70% of the second most highly conserved families, with median expectation values better than 10(-60), produced accurate trees. These values represent upper bounds on expected tree accuracy for sequences with a simple divergence history; proteins from 700 Giardia families, with a similar range of sequence similarities but considerably more gaps, produced much less accurate trees. For our simulated insertions and deletions, correct multiple sequence alignments did not perform much better than those produced by T-COFFEE, and including sequences with expressed sequence tag-like sequencing errors did not

  13. On different forms of self similarity

    International Nuclear Information System (INIS)

    Aswathy, R.K.; Mathew, Sunil

    2016-01-01

    Fractal geometry is mainly based on the idea of self-similar forms. To be self-similar, a shape must able to be divided into parts that are smaller copies, which are more or less similar to the whole. There are different forms of self similarity in nature and mathematics. In this paper, some of the topological properties of super self similar sets are discussed. It is proved that in a complete metric space with two or more elements, the set of all non super self similar sets are dense in the set of all non-empty compact sub sets. It is also proved that the product of self similar sets are super self similar in product metric spaces and that the super self similarity is preserved under isometry. A characterization of super self similar sets using contracting sub self similarity is also presented. Some relevant counterexamples are provided. The concepts of exact super and sub self similarity are introduced and a necessary and sufficient condition for a set to be exact super self similar in terms of condensation iterated function systems (Condensation IFS’s) is obtained. A method to generate exact sub self similar sets using condensation IFS’s and the denseness of exact super self similar sets are also discussed.

  14. Large margin classification with indefinite similarities

    KAUST Repository

    Alabdulmohsin, Ibrahim; Cisse, Moustapha; Gao, Xin; Zhang, Xiangliang

    2016-01-01

    Classification with indefinite similarities has attracted attention in the machine learning community. This is partly due to the fact that many similarity functions that arise in practice are not symmetric positive semidefinite, i.e. the Mercer

  15. Testing Self-Similarity Through Lamperti Transformations

    KAUST Repository

    Lee, Myoungji; Genton, Marc G.; Jun, Mikyoung

    2016-01-01

    extensively, while statistical tests for self-similarity are scarce and limited to processes indexed in one dimension. This paper proposes a statistical hypothesis test procedure for self-similarity of a stochastic process indexed in one dimension and multi

  16. Personality similarity and life satisfaction in couples

    OpenAIRE

    Furler Katrin; Gomez Veronica; Grob Alexander

    2013-01-01

    The present study examined the association between personality similarity and life satisfaction in a large nationally representative sample of 1608 romantic couples. Similarity effects were computed for the Big Five personality traits as well as for personality profiles with global and differentiated indices of similarity. Results showed substantial actor and partner effects indicating that both partners' personality traits were related to both partners' life satisfaction. Personality similar...

  17. Protein sequence comparison and protein evolution

    Energy Technology Data Exchange (ETDEWEB)

    Pearson, W.R. [Univ. of Virginia, Charlottesville, VA (United States). Dept. of Biochemistry

    1995-12-31

    This tutorial was one of eight tutorials selected to be presented at the Third International Conference on Intelligent Systems for Molecular Biology which was held in the United Kingdom from July 16 to 19, 1995. This tutorial examines how the information conserved during the evolution of a protein molecule can be used to infer reliably homology, and thus a shared proteinfold and possibly a shared active site or function. The authors start by reviewing a geological/evolutionary time scale. Next they look at the evolution of several protein families. During the tutorial, these families will be used to demonstrate that homologous protein ancestry can be inferred with confidence. They also examine different modes of protein evolution and consider some hypotheses that have been presented to explain the very earliest events in protein evolution. The next part of the tutorial will examine the technical aspects of protein sequence comparison. Both optimal and heuristic algorithms and their associated parameters that are used to characterize protein sequence similarities are discussed. Perhaps more importantly, they survey the statistics of local similarity scores, and how these statistics can both be used to improve the selectivity of a search and to evaluate the significance of a match. They them examine distantly related members of three protein families, the serine proteases, the glutathione transferases, and the G-protein-coupled receptors (GCRs). Finally, the discuss how sequence similarity can be used to examine internal repeated or mosaic structures in proteins.

  18. Testing Self-Similarity Through Lamperti Transformations

    KAUST Repository

    Lee, Myoungji

    2016-07-14

    Self-similar processes have been widely used in modeling real-world phenomena occurring in environmetrics, network traffic, image processing, and stock pricing, to name but a few. The estimation of the degree of self-similarity has been studied extensively, while statistical tests for self-similarity are scarce and limited to processes indexed in one dimension. This paper proposes a statistical hypothesis test procedure for self-similarity of a stochastic process indexed in one dimension and multi-self-similarity for a random field indexed in higher dimensions. If self-similarity is not rejected, our test provides a set of estimated self-similarity indexes. The key is to test stationarity of the inverse Lamperti transformations of the process. The inverse Lamperti transformation of a self-similar process is a strongly stationary process, revealing a theoretical connection between the two processes. To demonstrate the capability of our test, we test self-similarity of fractional Brownian motions and sheets, their time deformations and mixtures with Gaussian white noise, and the generalized Cauchy family. We also apply the self-similarity test to real data: annual minimum water levels of the Nile River, network traffic records, and surface heights of food wrappings. © 2016, International Biometric Society.

  19. Similarity increases altruistic punishment in humans.

    Science.gov (United States)

    Mussweiler, Thomas; Ockenfels, Axel

    2013-11-26

    Humans are attracted to similar others. As a consequence, social networks are homogeneous in sociodemographic, intrapersonal, and other characteristics--a principle called homophily. Despite abundant evidence showing the importance of interpersonal similarity and homophily for human relationships, their behavioral correlates and cognitive foundations are poorly understood. Here, we show that perceived similarity substantially increases altruistic punishment, a key mechanism underlying human cooperation. We induced (dis)similarity perception by manipulating basic cognitive mechanisms in an economic cooperation game that included a punishment phase. We found that similarity-focused participants were more willing to punish others' uncooperative behavior. This influence of similarity is not explained by group identity, which has the opposite effect on altruistic punishment. Our findings demonstrate that pure similarity promotes reciprocity in ways known to encourage cooperation. At the same time, the increased willingness to punish norm violations among similarity-focused participants provides a rationale for why similar people are more likely to build stable social relationships. Finally, our findings show that altruistic punishment is differentially involved in encouraging cooperation under pure similarity vs. in-group conditions.

  20. Notions of similarity for systems biology models.

    Science.gov (United States)

    Henkel, Ron; Hoehndorf, Robert; Kacprowski, Tim; Knüpfer, Christian; Liebermeister, Wolfram; Waltemath, Dagmar

    2018-01-01

    Systems biology models are rapidly increasing in complexity, size and numbers. When building large models, researchers rely on software tools for the retrieval, comparison, combination and merging of models, as well as for version control. These tools need to be able to quantify the differences and similarities between computational models. However, depending on the specific application, the notion of 'similarity' may greatly vary. A general notion of model similarity, applicable to various types of models, is still missing. Here we survey existing methods for the comparison of models, introduce quantitative measures for model similarity, and discuss potential applications of combined similarity measures. To frame model comparison as a general problem, we describe a theoretical approach to defining and computing similarities based on a combination of different model aspects. The six aspects that we define as potentially relevant for similarity are underlying encoding, references to biological entities, quantitative behaviour, qualitative behaviour, mathematical equations and parameters and network structure. We argue that future similarity measures will benefit from combining these model aspects in flexible, problem-specific ways to mimic users' intuition about model similarity, and to support complex model searches in databases. © The Author 2016. Published by Oxford University Press.

  1. Similar speaker recognition using nonlinear analysis

    International Nuclear Information System (INIS)

    Seo, J.P.; Kim, M.S.; Baek, I.C.; Kwon, Y.H.; Lee, K.S.; Chang, S.W.; Yang, S.I.

    2004-01-01

    Speech features of the conventional speaker identification system, are usually obtained by linear methods in spectral space. However, these methods have the drawback that speakers with similar voices cannot be distinguished, because the characteristics of their voices are also similar in spectral space. To overcome the difficulty in linear methods, we propose to use the correlation exponent in the nonlinear space as a new feature vector for speaker identification among persons with similar voices. We show that our proposed method surprisingly reduces the error rate of speaker identification system to speakers with similar voices

  2. Immunoinformatics and Similarity Analysis of House Dust Mite Tropomyosin

    Directory of Open Access Journals (Sweden)

    Mohammad Mehdi Ranjbar

    2015-10-01

    Full Text Available Background: Dermatophagoides farinae and Dermatophagoides pteronyssinus are house dust mites (HDM that they cause severe asthma and allergic symptoms. Tropomyosin protein plays an important role in mentioned immune and allergic reactions to HDMs. Here, tropomyosin protein from Dermatophagoides spp. was comprehensively screened in silico for its allergenicity, antigenicity and similarity/conservation.Materials and Methods: The amino acid sequences of D. farinae tropomyosin, D. pteronyssinus and other mites were retrieved. We included alignments and evaluated conserved/ variable regions along sequences, constructed their phylogenetic tree and estimated overall mean distances. Then, followed by with prediction of linear B-cell epitope based on different approaches, and besides in-silico evaluation of IgE epitopes allergenicity (by SVMc, IgE epitope, ARPs BLAST, MAST and hybrid method. Finally, comparative analysis of results by different approaches was made.Results: Alignment results revealed near complete identity between D. farina and D. pteronyssinus members, and also there was close similarity among Dermatophagoides spp. Most of the variations among mites' tropomyosin were approximately located at amino acids 23 to 80, 108 to 120, 142 to 153 and 220 to 230. Topology of tree showed close relationships among mites in tropomyosin protein sequence, although their sequences in D. farina, D. pteronyssinus and Psoroptes ovis are more similar to each other and clustered. Dermanyssus gallinae (AC: Q2WBI0 has less relationship to other mites, being located in a separate branch. Hydrophilicity and flexibility plots revealed that many parts of this protein have potential to be hydrophilic and flexible. Surface accessibility represented 7 different epitopes. Beta-turns in this protein are with high probability in the middle part and its two terminals. Kolaskar and Tongaonkar method analysis represented 11 immunogenic epitopes between amino acids 7-16. From

  3. On self-similar Tolman models

    International Nuclear Information System (INIS)

    Maharaj, S.D.

    1988-01-01

    The self-similar spherically symmetric solutions of the Einstein field equation for the case of dust are identified. These form a subclass of the Tolman models. These self-similar models contain the solution recently presented by Chi [J. Math. Phys. 28, 1539 (1987)], thereby refuting the claim of having found a new solution to the Einstein field equations

  4. Mining Diagnostic Assessment Data for Concept Similarity

    Science.gov (United States)

    Madhyastha, Tara; Hunt, Earl

    2009-01-01

    This paper introduces a method for mining multiple-choice assessment data for similarity of the concepts represented by the multiple choice responses. The resulting similarity matrix can be used to visualize the distance between concepts in a lower-dimensional space. This gives an instructor a visualization of the relative difficulty of concepts…

  5. Similarity indices I: what do they measure

    International Nuclear Information System (INIS)

    Johnston, J.W.

    1976-11-01

    A method for estimating the effects of environmental effusions on ecosystems is described. The characteristics of 25 similarity indices used in studies of ecological communities were investigated. The type of data structure, to which these indices are frequently applied, was described as consisting of vectors of measurements on attributes (species) observed in a set of samples. A general similarity index was characterized as the result of a two-step process defined on a pair of vectors. In the first step an attribute similarity score is obtained for each attribute by comparing the attribute values observed in the pair of vectors. The result is a vector of attribute similarity scores. These are combined in the second step to arrive at the similarity index. The operation in the first step was characterized as a function, g, defined on pairs of attribute values. The second operation was characterized as a function, F, defined on the vector of attribute similarity scores from the first step. Usually, F was a simple sum or weighted sum of the attribute similarity scores. It is concluded that similarity indices should not be used as the test statistic to discriminate between two ecological communities

  6. Measuring transferring similarity via local information

    Science.gov (United States)

    Yin, Likang; Deng, Yong

    2018-05-01

    Recommender systems have developed along with the web science, and how to measure the similarity between users is crucial for processing collaborative filtering recommendation. Many efficient models have been proposed (i.g., the Pearson coefficient) to measure the direct correlation. However, the direct correlation measures are greatly affected by the sparsity of dataset. In other words, the direct correlation measures would present an inauthentic similarity if two users have a very few commonly selected objects. Transferring similarity overcomes this drawback by considering their common neighbors (i.e., the intermediates). Yet, the transferring similarity also has its drawback since it can only provide the interval of similarity. To break the limitations, we propose the Belief Transferring Similarity (BTS) model. The contributions of BTS model are: (1) BTS model addresses the issue of the sparsity of dataset by considering the high-order similarity. (2) BTS model transforms uncertain interval to a certain state based on fuzzy systems theory. (3) BTS model is able to combine the transferring similarity of different intermediates using information fusion method. Finally, we compare BTS models with nine different link prediction methods in nine different networks, and we also illustrate the convergence property and efficiency of the BTS model.

  7. On distributional assumptions and whitened cosine similarities

    DEFF Research Database (Denmark)

    Loog, Marco

    2008-01-01

    Recently, an interpretation of the whitened cosine similarity measure as a Bayes decision rule was proposed (C. Liu, "The Bayes Decision Rule Induced Similarity Measures,'' IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 6, pp. 1086-1090, June 2007. This communication makes th...

  8. Self-Similar Traffic In Wireless Networks

    OpenAIRE

    Jerjomins, R.; Petersons, E.

    2005-01-01

    Many studies have shown that traffic in Ethernet and other wired networks is self-similar. This paper reveals that wireless network traffic is also self-similar and long-range dependant by analyzing big amount of data captured from the wireless router.

  9. Similarity Structure of Wave-Collapse

    DEFF Research Database (Denmark)

    Rypdal, Kristoffer; Juul Rasmussen, Jens; Thomsen, Kenneth

    1985-01-01

    Similarity transformations of the cubic Schrödinger equation (CSE) are investigated. The transformations are used to remove the explicit time variation in the CSE and reduce it to differential equations in the spatial variables only. Two different methods for similarity reduction are employed and...

  10. Similarity indices I: what do they measure.

    Energy Technology Data Exchange (ETDEWEB)

    Johnston, J.W.

    1976-11-01

    A method for estimating the effects of environmental effusions on ecosystems is described. The characteristics of 25 similarity indices used in studies of ecological communities were investigated. The type of data structure, to which these indices are frequently applied, was described as consisting of vectors of measurements on attributes (species) observed in a set of samples. A general similarity index was characterized as the result of a two-step process defined on a pair of vectors. In the first step an attribute similarity score is obtained for each attribute by comparing the attribute values observed in the pair of vectors. The result is a vector of attribute similarity scores. These are combined in the second step to arrive at the similarity index. The operation in the first step was characterized as a function, g, defined on pairs of attribute values. The second operation was characterized as a function, F, defined on the vector of attribute similarity scores from the first step. Usually, F was a simple sum or weighted sum of the attribute similarity scores. It is concluded that similarity indices should not be used as the test statistic to discriminate between two ecological communities.

  11. Information filtering based on transferring similarity.

    Science.gov (United States)

    Sun, Duo; Zhou, Tao; Liu, Jian-Guo; Liu, Run-Ran; Jia, Chun-Xiao; Wang, Bing-Hong

    2009-07-01

    In this Brief Report, we propose an index of user similarity, namely, the transferring similarity, which involves all high-order similarities between users. Accordingly, we design a modified collaborative filtering algorithm, which provides remarkably higher accurate predictions than the standard collaborative filtering. More interestingly, we find that the algorithmic performance will approach its optimal value when the parameter, contained in the definition of transferring similarity, gets close to its critical value, before which the series expansion of transferring similarity is convergent and after which it is divergent. Our study is complementary to the one reported in [E. A. Leicht, P. Holme, and M. E. J. Newman, Phys. Rev. E 73, 026120 (2006)], and is relevant to the missing link prediction problem.

  12. Self-similar continued root approximants

    International Nuclear Information System (INIS)

    Gluzman, S.; Yukalov, V.I.

    2012-01-01

    A novel method of summing asymptotic series is advanced. Such series repeatedly arise when employing perturbation theory in powers of a small parameter for complicated problems of condensed matter physics, statistical physics, and various applied problems. The method is based on the self-similar approximation theory involving self-similar root approximants. The constructed self-similar continued roots extrapolate asymptotic series to finite values of the expansion parameter. The self-similar continued roots contain, as a particular case, continued fractions and Padé approximants. A theorem on the convergence of the self-similar continued roots is proved. The method is illustrated by several examples from condensed-matter physics.

  13. Correlation between social proximity and mobility similarity.

    Science.gov (United States)

    Fan, Chao; Liu, Yiding; Huang, Junming; Rong, Zhihai; Zhou, Tao

    2017-09-20

    Human behaviors exhibit ubiquitous correlations in many aspects, such as individual and collective levels, temporal and spatial dimensions, content, social and geographical layers. With rich Internet data of online behaviors becoming available, it attracts academic interests to explore human mobility similarity from the perspective of social network proximity. Existent analysis shows a strong correlation between online social proximity and offline mobility similarity, namely, mobile records between friends are significantly more similar than between strangers, and those between friends with common neighbors are even more similar. We argue the importance of the number and diversity of common friends, with a counter intuitive finding that the number of common friends has no positive impact on mobility similarity while the diversity plays a key role, disagreeing with previous studies. Our analysis provides a novel view for better understanding the coupling between human online and offline behaviors, and will help model and predict human behaviors based on social proximity.

  14. Scalar Similarity for Relaxed Eddy Accumulation Methods

    Science.gov (United States)

    Ruppert, Johannes; Thomas, Christoph; Foken, Thomas

    2006-07-01

    The relaxed eddy accumulation (REA) method allows the measurement of trace gas fluxes when no fast sensors are available for eddy covariance measurements. The flux parameterisation used in REA is based on the assumption of scalar similarity, i.e., similarity of the turbulent exchange of two scalar quantities. In this study changes in scalar similarity between carbon dioxide, sonic temperature and water vapour were assessed using scalar correlation coefficients and spectral analysis. The influence on REA measurements was assessed by simulation. The evaluation is based on observations over grassland, irrigated cotton plantation and spruce forest. Scalar similarity between carbon dioxide, sonic temperature and water vapour showed a distinct diurnal pattern and change within the day. Poor scalar similarity was found to be linked to dissimilarities in the energy contained in the low frequency part of the turbulent spectra ( definition.

  15. Surf similarity and solitary wave runup

    DEFF Research Database (Denmark)

    Fuhrman, David R.; Madsen, Per A.

    2008-01-01

    The notion of surf similarity in the runup of solitary waves is revisited. We show that the surf similarity parameter for solitary waves may be effectively reduced to the beach slope divided by the offshore wave height to depth ratio. This clarifies its physical interpretation relative to a previ...... functional dependence on their respective surf similarity parameters. Important equivalencies in the runup of sinusoidal and solitary waves are thus revealed.......The notion of surf similarity in the runup of solitary waves is revisited. We show that the surf similarity parameter for solitary waves may be effectively reduced to the beach slope divided by the offshore wave height to depth ratio. This clarifies its physical interpretation relative...... to a previous parameterization, which was not given in an explicit form. Good coherency with experimental (breaking) runup data is preserved with this simpler parameter. A recasting of analytical (nonbreaking) runup expressions for sinusoidal and solitary waves additionally shows that they contain identical...

  16. Similarity in Bilateral Isolated Internal Orbital Fractures.

    Science.gov (United States)

    Chen, Hung-Chang; Cox, Jacob T; Sanyal, Abanti; Mahoney, Nicholas R

    2018-04-13

    In evaluating patients sustaining bilateral isolated internal orbital fractures, the authors have observed both similar fracture locations and also similar expansion of orbital volumes. In this study, we aim to investigate if there is a propensity for the 2 orbits to fracture in symmetrically similar patterns when sustaining similar trauma. A retrospective chart review was performed studying all cases at our institution of bilateral isolated internal orbital fractures involving the medial wall and/or the floor at the time of presentation. The similarity of the bilateral fracture locations was evaluated using the Fisher's exact test. The bilateral expanded orbital volumes were analyzed using the Wilcoxon signed-rank test to assess for orbital volume similarity. Twenty-four patients with bilateral internal orbital fractures were analyzed for fracture location similarity. Seventeen patients (70.8%) had 100% concordance in the orbital subregion fractured, and the association between the right and the left orbital fracture subregion locations was statistically significant (P < 0.0001). Fifteen patients were analyzed for orbital volume similarity. The average orbital cavity volume was 31.2 ± 3.8 cm on the right and 32.0 ± 3.7 cm on the left. There was a statistically significant difference between right and left orbital cavity volumes (P = 0.0026). The data from this study suggest that an individual who suffers isolated bilateral internal orbital fractures has a statistically significant similarity in the location of their orbital fractures. However, there does not appear to be statistically significant similarity in the expansion of the orbital volumes in these patients.

  17. Measure of Node Similarity in Multilayer Networks.

    Directory of Open Access Journals (Sweden)

    Anders Mollgaard

    Full Text Available The weight of links in a network is often related to the similarity of the nodes. Here, we introduce a simple tunable measure for analysing the similarity of nodes across different link weights. In particular, we use the measure to analyze homophily in a group of 659 freshman students at a large university. Our analysis is based on data obtained using smartphones equipped with custom data collection software, complemented by questionnaire-based data. The network of social contacts is represented as a weighted multilayer network constructed from different channels of telecommunication as well as data on face-to-face contacts. We find that even strongly connected individuals are not more similar with respect to basic personality traits than randomly chosen pairs of individuals. In contrast, several socio-demographics variables have a significant degree of similarity. We further observe that similarity might be present in one layer of the multilayer network and simultaneously be absent in the other layers. For a variable such as gender, our measure reveals a transition from similarity between nodes connected with links of relatively low weight to dis-similarity for the nodes connected by the strongest links. We finally analyze the overlap between layers in the network for different levels of acquaintanceships.

  18. Notions of similarity for computational biology models

    KAUST Repository

    Waltemath, Dagmar

    2016-03-21

    Computational models used in biology are rapidly increasing in complexity, size, and numbers. To build such large models, researchers need to rely on software tools for model retrieval, model combination, and version control. These tools need to be able to quantify the differences and similarities between computational models. However, depending on the specific application, the notion of similarity may greatly vary. A general notion of model similarity, applicable to various types of models, is still missing. Here, we introduce a general notion of quantitative model similarities, survey the use of existing model comparison methods in model building and management, and discuss potential applications of model comparison. To frame model comparison as a general problem, we describe a theoretical approach to defining and computing similarities based on different model aspects. Potentially relevant aspects of a model comprise its references to biological entities, network structure, mathematical equations and parameters, and dynamic behaviour. Future similarity measures could combine these model aspects in flexible, problem-specific ways in order to mimic users\\' intuition about model similarity, and to support complex model searches in databases.

  19. Trajectory similarity join in spatial networks

    KAUST Repository

    Shang, Shuo

    2017-09-07

    The matching of similar pairs of objects, called similarity join, is fundamental functionality in data management. We consider the case of trajectory similarity join (TS-Join), where the objects are trajectories of vehicles moving in road networks. Thus, given two sets of trajectories and a threshold θ, the TS-Join returns all pairs of trajectories from the two sets with similarity above θ. This join targets applications such as trajectory near-duplicate detection, data cleaning, ridesharing recommendation, and traffic congestion prediction. With these applications in mind, we provide a purposeful definition of similarity. To enable efficient TS-Join processing on large sets of trajectories, we develop search space pruning techniques and take into account the parallel processing capabilities of modern processors. Specifically, we present a two-phase divide-and-conquer algorithm. For each trajectory, the algorithm first finds similar trajectories. Then it merges the results to achieve a final result. The algorithm exploits an upper bound on the spatiotemporal similarity and a heuristic scheduling strategy for search space pruning. The algorithm\\'s per-trajectory searches are independent of each other and can be performed in parallel, and the merging has constant cost. An empirical study with real data offers insight in the performance of the algorithm and demonstrates that is capable of outperforming a well-designed baseline algorithm by an order of magnitude.

  20. The baryonic self similarity of dark matter

    International Nuclear Information System (INIS)

    Alard, C.

    2014-01-01

    The cosmological simulations indicates that dark matter halos have specific self-similar properties. However, the halo similarity is affected by the baryonic feedback. By using momentum-driven winds as a model to represent the baryon feedback, an equilibrium condition is derived which directly implies the emergence of a new type of similarity. The new self-similar solution has constant acceleration at a reference radius for both dark matter and baryons. This model receives strong support from the observations of galaxies. The new self-similar properties imply that the total acceleration at larger distances is scale-free, the transition between the dark matter and baryons dominated regime occurs at a constant acceleration, and the maximum amplitude of the velocity curve at larger distances is proportional to M 1/4 . These results demonstrate that this self-similar model is consistent with the basics of modified Newtonian dynamics (MOND) phenomenology. In agreement with the observations, the coincidence between the self-similar model and MOND breaks at the scale of clusters of galaxies. Some numerical experiments show that the behavior of the density near the origin is closely approximated by a Einasto profile.

  1. Notions of similarity for computational biology models

    KAUST Repository

    Waltemath, Dagmar; Henkel, Ron; Hoehndorf, Robert; Kacprowski, Tim; Knuepfer, Christian; Liebermeister, Wolfram

    2016-01-01

    Computational models used in biology are rapidly increasing in complexity, size, and numbers. To build such large models, researchers need to rely on software tools for model retrieval, model combination, and version control. These tools need to be able to quantify the differences and similarities between computational models. However, depending on the specific application, the notion of similarity may greatly vary. A general notion of model similarity, applicable to various types of models, is still missing. Here, we introduce a general notion of quantitative model similarities, survey the use of existing model comparison methods in model building and management, and discuss potential applications of model comparison. To frame model comparison as a general problem, we describe a theoretical approach to defining and computing similarities based on different model aspects. Potentially relevant aspects of a model comprise its references to biological entities, network structure, mathematical equations and parameters, and dynamic behaviour. Future similarity measures could combine these model aspects in flexible, problem-specific ways in order to mimic users' intuition about model similarity, and to support complex model searches in databases.

  2. A Similarity Search Using Molecular Topological Graphs

    Directory of Open Access Journals (Sweden)

    Yoshifumi Fukunishi

    2009-01-01

    Full Text Available A molecular similarity measure has been developed using molecular topological graphs and atomic partial charges. Two kinds of topological graphs were used. One is the ordinary adjacency matrix and the other is a matrix which represents the minimum path length between two atoms of the molecule. The ordinary adjacency matrix is suitable to compare the local structures of molecules such as functional groups, and the other matrix is suitable to compare the global structures of molecules. The combination of these two matrices gave a similarity measure. This method was applied to in silico drug screening, and the results showed that it was effective as a similarity measure.

  3. Similarity-based pattern analysis and recognition

    CERN Document Server

    Pelillo, Marcello

    2013-01-01

    This accessible text/reference presents a coherent overview of the emerging field of non-Euclidean similarity learning. The book presents a broad range of perspectives on similarity-based pattern analysis and recognition methods, from purely theoretical challenges to practical, real-world applications. The coverage includes both supervised and unsupervised learning paradigms, as well as generative and discriminative models. Topics and features: explores the origination and causes of non-Euclidean (dis)similarity measures, and how they influence the performance of traditional classification alg

  4. HYPOTHESIS TESTING WITH THE SIMILARITY INDEX

    Science.gov (United States)

    Mulltilocus DNA fingerprinting methods have been used extensively to address genetic issues in wildlife populations. Hypotheses concerning population subdivision and differing levels of diversity can be addressed through the use of the similarity index (S), a band-sharing coeffic...

  5. On self-similarity of crack layer

    Science.gov (United States)

    Botsis, J.; Kunin, B.

    1987-01-01

    The crack layer (CL) theory of Chudnovsky (1986), based on principles of thermodynamics of irreversible processes, employs a crucial hypothesis of self-similarity. The self-similarity hypothesis states that the value of the damage density at a point x of the active zone at a time t coincides with that at the corresponding point in the initial (t = 0) configuration of the active zone, the correspondence being given by a time-dependent affine transformation of the space variables. In this paper, the implications of the self-similarity hypothesis for qusi-static CL propagation is investigated using polystyrene as a model material and examining the evolution of damage distribution along the trailing edge which is approximated by a straight segment perpendicular to the crack path. The results support the self-similarity hypothesis adopted by the CL theory.

  6. Bilateral Trade Flows and Income Distribution Similarity

    Science.gov (United States)

    2016-01-01

    Current models of bilateral trade neglect the effects of income distribution. This paper addresses the issue by accounting for non-homothetic consumer preferences and hence investigating the role of income distribution in the context of the gravity model of trade. A theoretically justified gravity model is estimated for disaggregated trade data (Dollar volume is used as dependent variable) using a sample of 104 exporters and 108 importers for 1980–2003 to achieve two main goals. We define and calculate new measures of income distribution similarity and empirically confirm that greater similarity of income distribution between countries implies more trade. Using distribution-based measures as a proxy for demand similarities in gravity models, we find consistent and robust support for the hypothesis that countries with more similar income-distributions trade more with each other. The hypothesis is also confirmed at disaggregated level for differentiated product categories. PMID:27137462

  7. Discovering Music Structure via Similarity Fusion

    DEFF Research Database (Denmark)

    for representing music structure is studied in a simplified scenario consisting of 4412 songs and two similarity measures among them. The results suggest that the PLSA model is a useful framework to combine different sources of information, and provides a reasonable space for song representation.......Automatic methods for music navigation and music recommendation exploit the structure in the music to carry out a meaningful exploration of the “song space”. To get a satisfactory performance from such systems, one should incorporate as much information about songs similarity as possible; however...... semantics”, in such a way that all observed similarities can be satisfactorily explained using the latent semantics. Therefore, one can think of these semantics as the real structure in music, in the sense that they can explain the observed similarities among songs. The suitability of the PLSA model...

  8. Abundance estimation of spectrally similar minerals

    CSIR Research Space (South Africa)

    Debba, Pravesh

    2009-07-01

    Full Text Available This paper evaluates a spectral unmixing method for estimating the partial abundance of spectrally similar minerals in complex mixtures. The method requires formulation of a linear function of individual spectra of individual minerals. The first...

  9. Lagrangian-similarity diffusion-deposition model

    International Nuclear Information System (INIS)

    Horst, T.W.

    1979-01-01

    A Lagrangian-similarity diffusion model has been incorporated into the surface-depletion deposition model. This model predicts vertical concentration profiles far downwind of the source that agree with those of a one-dimensional gradient-transfer model

  10. Discovering Music Structure via Similarity Fusion

    DEFF Research Database (Denmark)

    Arenas-García, Jerónimo; Parrado-Hernandez, Emilio; Meng, Anders

    Automatic methods for music navigation and music recommendation exploit the structure in the music to carry out a meaningful exploration of the “song space”. To get a satisfactory performance from such systems, one should incorporate as much information about songs similarity as possible; however...... semantics”, in such a way that all observed similarities can be satisfactorily explained using the latent semantics. Therefore, one can think of these semantics as the real structure in music, in the sense that they can explain the observed similarities among songs. The suitability of the PLSA model...... for representing music structure is studied in a simplified scenario consisting of 4412 songs and two similarity measures among them. The results suggest that the PLSA model is a useful framework to combine different sources of information, and provides a reasonable space for song representation....

  11. Outsourced similarity search on metric data assets

    KAUST Repository

    Yiu, Man Lung

    2012-02-01

    This paper considers a cloud computing setting in which similarity querying of metric data is outsourced to a service provider. The data is to be revealed only to trusted users, not to the service provider or anyone else. Users query the server for the most similar data objects to a query example. Outsourcing offers the data owner scalability and a low-initial investment. The need for privacy may be due to the data being sensitive (e.g., in medicine), valuable (e.g., in astronomy), or otherwise confidential. Given this setting, the paper presents techniques that transform the data prior to supplying it to the service provider for similarity queries on the transformed data. Our techniques provide interesting trade-offs between query cost and accuracy. They are then further extended to offer an intuitive privacy guarantee. Empirical studies with real data demonstrate that the techniques are capable of offering privacy while enabling efficient and accurate processing of similarity queries.

  12. Similarity search processing. Paralelization and indexing technologies.

    Directory of Open Access Journals (Sweden)

    Eder Dos Santos

    2015-08-01

    The next Scientific-Technical Report addresses the similarity search and the implementation of metric structures on parallel environments. It also presents the state of the art related to similarity search on metric structures and parallelism technologies. Comparative analysis are also proposed, seeking to identify the behavior of a set of metric spaces and metric structures over processing platforms multicore-based and GPU-based.

  13. Parallel trajectory similarity joins in spatial networks

    KAUST Repository

    Shang, Shuo

    2018-04-04

    The matching of similar pairs of objects, called similarity join, is fundamental functionality in data management. We consider two cases of trajectory similarity joins (TS-Joins), including a threshold-based join (Tb-TS-Join) and a top-k TS-Join (k-TS-Join), where the objects are trajectories of vehicles moving in road networks. Given two sets of trajectories and a threshold θ, the Tb-TS-Join returns all pairs of trajectories from the two sets with similarity above θ. In contrast, the k-TS-Join does not take a threshold as a parameter, and it returns the top-k most similar trajectory pairs from the two sets. The TS-Joins target diverse applications such as trajectory near-duplicate detection, data cleaning, ridesharing recommendation, and traffic congestion prediction. With these applications in mind, we provide purposeful definitions of similarity. To enable efficient processing of the TS-Joins on large sets of trajectories, we develop search space pruning techniques and enable use of the parallel processing capabilities of modern processors. Specifically, we present a two-phase divide-and-conquer search framework that lays the foundation for the algorithms for the Tb-TS-Join and the k-TS-Join that rely on different pruning techniques to achieve efficiency. For each trajectory, the algorithms first find similar trajectories. Then they merge the results to obtain the final result. The algorithms for the two joins exploit different upper and lower bounds on the spatiotemporal trajectory similarity and different heuristic scheduling strategies for search space pruning. Their per-trajectory searches are independent of each other and can be performed in parallel, and the mergings have constant cost. An empirical study with real data offers insight in the performance of the algorithms and demonstrates that they are capable of outperforming well-designed baseline algorithms by an order of magnitude.

  14. Parallel trajectory similarity joins in spatial networks

    KAUST Repository

    Shang, Shuo; Chen, Lisi; Wei, Zhewei; Jensen, Christian S.; Zheng, Kai; Kalnis, Panos

    2018-01-01

    The matching of similar pairs of objects, called similarity join, is fundamental functionality in data management. We consider two cases of trajectory similarity joins (TS-Joins), including a threshold-based join (Tb-TS-Join) and a top-k TS-Join (k-TS-Join), where the objects are trajectories of vehicles moving in road networks. Given two sets of trajectories and a threshold θ, the Tb-TS-Join returns all pairs of trajectories from the two sets with similarity above θ. In contrast, the k-TS-Join does not take a threshold as a parameter, and it returns the top-k most similar trajectory pairs from the two sets. The TS-Joins target diverse applications such as trajectory near-duplicate detection, data cleaning, ridesharing recommendation, and traffic congestion prediction. With these applications in mind, we provide purposeful definitions of similarity. To enable efficient processing of the TS-Joins on large sets of trajectories, we develop search space pruning techniques and enable use of the parallel processing capabilities of modern processors. Specifically, we present a two-phase divide-and-conquer search framework that lays the foundation for the algorithms for the Tb-TS-Join and the k-TS-Join that rely on different pruning techniques to achieve efficiency. For each trajectory, the algorithms first find similar trajectories. Then they merge the results to obtain the final result. The algorithms for the two joins exploit different upper and lower bounds on the spatiotemporal trajectory similarity and different heuristic scheduling strategies for search space pruning. Their per-trajectory searches are independent of each other and can be performed in parallel, and the mergings have constant cost. An empirical study with real data offers insight in the performance of the algorithms and demonstrates that they are capable of outperforming well-designed baseline algorithms by an order of magnitude.

  15. Are calanco landforms similar to river basins?

    Science.gov (United States)

    Caraballo-Arias, N A; Ferro, V

    2017-12-15

    In the past badlands have been often considered as ideal field laboratories for studying landscape evolution because of their geometrical similarity to larger fluvial systems. For a given hydrological process, no scientific proof exists that badlands can be considered a model of river basin prototypes. In this paper the measurements carried out on 45 Sicilian calanchi, a type of badlands that appears as a small-scale hydrographic unit, are used to establish their morphological similarity with river systems whose data are available in the literature. At first the geomorphological similarity is studied by identifying the dimensionless groups, which can assume the same value or a scaled one in a fixed ratio, representing drainage basin shape, stream network and relief properties. Then, for each property, the dimensionless groups are calculated for the investigated calanchi and the river basins and their corresponding scale ratio is evaluated. The applicability of Hack's, Horton's and Melton's laws for establishing similarity criteria is also tested. The developed analysis allows to conclude that a quantitative morphological similarity between calanco landforms and river basins can be established using commonly applied dimensionless groups. In particular, the analysis showed that i) calanchi and river basins have a geometrically similar shape respect to the parameters Rf and Re with a scale factor close to 1, ii) calanchi and river basins are similar respect to the bifurcation and length ratios (λ=1), iii) for the investigated calanchi the Melton number assumes values less than that (0.694) corresponding to the river case and a scale ratio ranging from 0.52 and 0.78 can be used, iv) calanchi and river basins have similar mean relief ratio values (λ=1.13) and v) calanchi present active geomorphic processes and therefore fall in a more juvenile stage with respect to river basins. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. Functional enrichment analyses and construction of functional similarity networks with high confidence function prediction by PFP

    Directory of Open Access Journals (Sweden)

    Kihara Daisuke

    2010-05-01

    Full Text Available Abstract Background A new paradigm of biological investigation takes advantage of technologies that produce large high throughput datasets, including genome sequences, interactions of proteins, and gene expression. The ability of biologists to analyze and interpret such data relies on functional annotation of the included proteins, but even in highly characterized organisms many proteins can lack the functional evidence necessary to infer their biological relevance. Results Here we have applied high confidence function predictions from our automated prediction system, PFP, to three genome sequences, Escherichia coli, Saccharomyces cerevisiae, and Plasmodium falciparum (malaria. The number of annotated genes is increased by PFP to over 90% for all of the genomes. Using the large coverage of the function annotation, we introduced the functional similarity networks which represent the functional space of the proteomes. Four different functional similarity networks are constructed for each proteome, one each by considering similarity in a single Gene Ontology (GO category, i.e. Biological Process, Cellular Component, and Molecular Function, and another one by considering overall similarity with the funSim score. The functional similarity networks are shown to have higher modularity than the protein-protein interaction network. Moreover, the funSim score network is distinct from the single GO-score networks by showing a higher clustering degree exponent value and thus has a higher tendency to be hierarchical. In addition, examining function assignments to the protein-protein interaction network and local regions of genomes has identified numerous cases where subnetworks or local regions have functionally coherent proteins. These results will help interpreting interactions of proteins and gene orders in a genome. Several examples of both analyses are highlighted. Conclusion The analyses demonstrate that applying high confidence predictions from PFP

  17. Semantic similarity between ontologies at different scales

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, Qingpeng; Haglin, David J.

    2016-04-01

    In the past decade, existing and new knowledge and datasets has been encoded in different ontologies for semantic web and biomedical research. The size of ontologies is often very large in terms of number of concepts and relationships, which makes the analysis of ontologies and the represented knowledge graph computational and time consuming. As the ontologies of various semantic web and biomedical applications usually show explicit hierarchical structures, it is interesting to explore the trade-offs between ontological scales and preservation/precision of results when we analyze ontologies. This paper presents the first effort of examining the capability of this idea via studying the relationship between scaling biomedical ontologies at different levels and the semantic similarity values. We evaluate the semantic similarity between three Gene Ontology slims (Plant, Yeast, and Candida, among which the latter two belong to the same kingdom—Fungi) using four popular measures commonly applied to biomedical ontologies (Resnik, Lin, Jiang-Conrath, and SimRel). The results of this study demonstrate that with proper selection of scaling levels and similarity measures, we can significantly reduce the size of ontologies without losing substantial detail. In particular, the performance of Jiang-Conrath and Lin are more reliable and stable than that of the other two in this experiment, as proven by (a) consistently showing that Yeast and Candida are more similar (as compared to Plant) at different scales, and (b) small deviations of the similarity values after excluding a majority of nodes from several lower scales. This study provides a deeper understanding of the application of semantic similarity to biomedical ontologies, and shed light on how to choose appropriate semantic similarity measures for biomedical engineering.

  18. Measure of Node Similarity in Multilayer Networks

    DEFF Research Database (Denmark)

    Møllgaard, Anders; Zettler, Ingo; Dammeyer, Jesper

    2016-01-01

    The weight of links in a network is often related to the similarity of thenodes. Here, we introduce a simple tunable measure for analysing the similarityof nodes across different link weights. In particular, we use the measure toanalyze homophily in a group of 659 freshman students at a large...... university.Our analysis is based on data obtained using smartphones equipped with customdata collection software, complemented by questionnaire-based data. The networkof social contacts is represented as a weighted multilayer network constructedfrom different channels of telecommunication as well as data...... might bepresent in one layer of the multilayer network and simultaneously be absent inthe other layers. For a variable such as gender, our measure reveals atransition from similarity between nodes connected with links of relatively lowweight to dis-similarity for the nodes connected by the strongest...

  19. A Novel Hybrid Similarity Calculation Model

    Directory of Open Access Journals (Sweden)

    Xiaoping Fan

    2017-01-01

    Full Text Available This paper addresses the problems of similarity calculation in the traditional recommendation algorithms of nearest neighbor collaborative filtering, especially the failure in describing dynamic user preference. Proceeding from the perspective of solving the problem of user interest drift, a new hybrid similarity calculation model is proposed in this paper. This model consists of two parts, on the one hand the model uses the function fitting to describe users’ rating behaviors and their rating preferences, and on the other hand it employs the Random Forest algorithm to take user attribute features into account. Furthermore, the paper combines the two parts to build a new hybrid similarity calculation model for user recommendation. Experimental results show that, for data sets of different size, the model’s prediction precision is higher than the traditional recommendation algorithms.

  20. Universal self-similarity of propagating populations.

    Science.gov (United States)

    Eliazar, Iddo; Klafter, Joseph

    2010-07-01

    This paper explores the universal self-similarity of propagating populations. The following general propagation model is considered: particles are randomly emitted from the origin of a d-dimensional Euclidean space and propagate randomly and independently of each other in space; all particles share a statistically common--yet arbitrary--motion pattern; each particle has its own random propagation parameters--emission epoch, motion frequency, and motion amplitude. The universally self-similar statistics of the particles' displacements and first passage times (FPTs) are analyzed: statistics which are invariant with respect to the details of the displacement and FPT measurements and with respect to the particles' underlying motion pattern. Analysis concludes that the universally self-similar statistics are governed by Poisson processes with power-law intensities and by the Fréchet and Weibull extreme-value laws.

  1. Universal self-similarity of propagating populations

    Science.gov (United States)

    Eliazar, Iddo; Klafter, Joseph

    2010-07-01

    This paper explores the universal self-similarity of propagating populations. The following general propagation model is considered: particles are randomly emitted from the origin of a d -dimensional Euclidean space and propagate randomly and independently of each other in space; all particles share a statistically common—yet arbitrary—motion pattern; each particle has its own random propagation parameters—emission epoch, motion frequency, and motion amplitude. The universally self-similar statistics of the particles’ displacements and first passage times (FPTs) are analyzed: statistics which are invariant with respect to the details of the displacement and FPT measurements and with respect to the particles’ underlying motion pattern. Analysis concludes that the universally self-similar statistics are governed by Poisson processes with power-law intensities and by the Fréchet and Weibull extreme-value laws.

  2. Trajectory similarity join in spatial networks

    KAUST Repository

    Shang, Shuo; Chen, Lisi; Wei, Zhewei; Jensen, Christian S.; Zheng, Kai; Kalnis, Panos

    2017-01-01

    With these applications in mind, we provide a purposeful definition of similarity. To enable efficient TS-Join processing on large sets of trajectories, we develop search space pruning techniques and take into account the parallel processing capabilities of modern processors. Specifically, we present a two-phase divide-and-conquer algorithm. For each trajectory, the algorithm first finds similar trajectories. Then it merges the results to achieve a final result. The algorithm exploits an upper bound on the spatiotemporal similarity and a heuristic scheduling strategy for search space pruning. The algorithm's per-trajectory searches are independent of each other and can be performed in parallel, and the merging has constant cost. An empirical study with real data offers insight in the performance of the algorithm and demonstrates that is capable of outperforming a well-designed baseline algorithm by an order of magnitude.

  3. Phonological similarity in working memory span tasks.

    Science.gov (United States)

    Chow, Michael; Macnamara, Brooke N; Conway, Andrew R A

    2016-08-01

    In a series of four experiments, we explored what conditions are sufficient to produce a phonological similarity facilitation effect in working memory span tasks. By using the same set of memoranda, but differing the secondary-task requirements across experiments, we showed that a phonological similarity facilitation effect is dependent upon the semantic relationship between the memoranda and the secondary-task stimuli, and is robust to changes in the representation, ordering, and pool size of the secondary-task stimuli. These findings are consistent with interference accounts of memory (Brown, Neath, & Chater, Psychological Review, 114, 539-576, 2007; Oberauer, Lewandowsky, Farrell, Jarrold, & Greaves, Psychonomic Bulletin & Review, 19, 779-819, 2012), whereby rhyming stimuli provide a form of categorical similarity that allows distractors to be excluded from retrieval at recall.

  4. Unveiling Music Structure Via PLSA Similarity Fusion

    DEFF Research Database (Denmark)

    Arenas-García, Jerónimo; Meng, Anders; Petersen, Kaare Brandt

    2007-01-01

    Nowadays there is an increasing interest in developing methods for building music recommendation systems. In order to get a satisfactory performance from such a system, one needs to incorporate as much information about songs similarity as possible; however, how to do so is not obvious. In this p......Nowadays there is an increasing interest in developing methods for building music recommendation systems. In order to get a satisfactory performance from such a system, one needs to incorporate as much information about songs similarity as possible; however, how to do so is not obvious...... observed similarities can be satisfactorily explained using the latent semantics. Additionally, this approach significantly simplifies the song retrieval phase, leading to a more practical system implementation. The suitability of the PLSA model for representing music structure is studied in a simplified...

  5. Multidrug transporters from bacteria to man : similarities in structure and function

    NARCIS (Netherlands)

    van Veen, HW; Konings, WN

    Organisms ranging from bacteria to man possess transmembrane transporters which confer resistance to toxic corn pounds. Underlining their biological significance, prokaryotic and eukaryotic multidrug transport proteins are very similar in structure and function. Therefore, a study of the factors

  6. Bacterial Ice Crystal Controlling Proteins

    Science.gov (United States)

    Lorv, Janet S. H.; Rose, David R.; Glick, Bernard R.

    2014-01-01

    Across the world, many ice active bacteria utilize ice crystal controlling proteins for aid in freezing tolerance at subzero temperatures. Ice crystal controlling proteins include both antifreeze and ice nucleation proteins. Antifreeze proteins minimize freezing damage by inhibiting growth of large ice crystals, while ice nucleation proteins induce formation of embryonic ice crystals. Although both protein classes have differing functions, these proteins use the same ice binding mechanisms. Rather than direct binding, it is probable that these protein classes create an ice surface prior to ice crystal surface adsorption. Function is differentiated by molecular size of the protein. This paper reviews the similar and different aspects of bacterial antifreeze and ice nucleation proteins, the role of these proteins in freezing tolerance, prevalence of these proteins in psychrophiles, and current mechanisms of protein-ice interactions. PMID:24579057

  7. Similarity joins in relational database systems

    CERN Document Server

    Augsten, Nikolaus

    2013-01-01

    State-of-the-art database systems manage and process a variety of complex objects, including strings and trees. For such objects equality comparisons are often not meaningful and must be replaced by similarity comparisons. This book describes the concepts and techniques to incorporate similarity into database systems. We start out by discussing the properties of strings and trees, and identify the edit distance as the de facto standard for comparing complex objects. Since the edit distance is computationally expensive, token-based distances have been introduced to speed up edit distance comput

  8. Outsourced Similarity Search on Metric Data Assets

    DEFF Research Database (Denmark)

    Yiu, Man Lung; Assent, Ira; Jensen, Christian S.

    2012-01-01

    . Outsourcing offers the data owner scalability and a low initial investment. The need for privacy may be due to the data being sensitive (e.g., in medicine), valuable (e.g., in astronomy), or otherwise confidential. Given this setting, the paper presents techniques that transform the data prior to supplying......This paper considers a cloud computing setting in which similarity querying of metric data is outsourced to a service provider. The data is to be revealed only to trusted users, not to the service provider or anyone else. Users query the server for the most similar data objects to a query example...

  9. Measure of Node Similarity in Multilayer Networks

    DEFF Research Database (Denmark)

    Møllgaard, Anders; Zettler, Ingo; Dammeyer, Jesper

    2016-01-01

    university.Our analysis is based on data obtained using smartphones equipped with customdata collection software, complemented by questionnaire-based data. The networkof social contacts is represented as a weighted multilayer network constructedfrom different channels of telecommunication as well as data...... might bepresent in one layer of the multilayer network and simultaneously be absent inthe other layers. For a variable such as gender, our measure reveals atransition from similarity between nodes connected with links of relatively lowweight to dis-similarity for the nodes connected by the strongest...

  10. Cultural similarity and adjustment of expatriate academics

    DEFF Research Database (Denmark)

    Selmer, Jan; Lauring, Jakob

    2009-01-01

    The findings of a number of recent empirical studies of business expatriates, using different samples and methodologies, seem to support the counter-intuitive proposition that cultural similarity may be as difficult to adjust to as cultural dissimilarity. However, it is not obvious...... and non-EU countries. Results showed that although the perceived cultural similarity between host and home country for the two groups of investigated respondents was different, there was neither any difference in their adjustment nor in the time it took for them to become proficient. Implications...

  11. Nuclear markers reveal that inter-lake cichlids' similar morphologies do not reflect similar genealogy.

    Science.gov (United States)

    Kassam, Daud; Seki, Shingo; Horic, Michio; Yamaoka, Kosaku

    2006-08-01

    The apparent inter-lake morphological similarity among East African Great Lakes' cichlid species/genera has left evolutionary biologists asking whether such similarity is due to sharing of common ancestor or mere convergent evolution. In order to answer such question, we first used Geometric Morphometrics, GM, to quantify morphological similarity and then subsequently used Amplified Fragment Length Polymorphism, AFLP, to determine if similar morphologies imply shared ancestry or convergent evolution. GM revealed that not all presumed morphological similar pairs were indeed similar, and the dendrogram generated from AFLP data indicated distinct clusters corresponding to each lake and not inter-lake morphological similar pairs. Such results imply that the morphological similarity is due to convergent evolution and not shared ancestry. The congruency of GM and AFLP generated dendrograms imply that GM is capable of picking up phylogenetic signal, and thus GM can be potential tool in phylogenetic systematics.

  12. Prediction of Protein-Protein Interactions Related to Protein Complexes Based on Protein Interaction Networks

    Directory of Open Access Journals (Sweden)

    Peng Liu

    2015-01-01

    Full Text Available A method for predicting protein-protein interactions based on detected protein complexes is proposed to repair deficient interactions derived from high-throughput biological experiments. Protein complexes are pruned and decomposed into small parts based on the adaptive k-cores method to predict protein-protein interactions associated with the complexes. The proposed method is adaptive to protein complexes with different structure, number, and size of nodes in a protein-protein interaction network. Based on different complex sets detected by various algorithms, we can obtain different prediction sets of protein-protein interactions. The reliability of the predicted interaction sets is proved by using estimations with statistical tests and direct confirmation of the biological data. In comparison with the approaches which predict the interactions based on the cliques, the overlap of the predictions is small. Similarly, the overlaps among the predicted sets of interactions derived from various complex sets are also small. Thus, every predicted set of interactions may complement and improve the quality of the original network data. Meanwhile, the predictions from the proposed method replenish protein-protein interactions associated with protein complexes using only the network topology.

  13. Clustering biomolecular complexes by residue contacts similarity

    NARCIS (Netherlands)

    Garcia Lopes Maia Rodrigues, João; Trellet, Mikaël; Schmitz, Christophe; Kastritis, Panagiotis; Karaca, Ezgi; Melquiond, Adrien S J; Bonvin, Alexandre M J J; Garcia Lopes Maia Rodrigues, João

    Inaccuracies in computational molecular modeling methods are often counterweighed by brute-force generation of a plethora of putative solutions. These are then typically sieved via structural clustering based on similarity measures such as the root mean square deviation (RMSD) of atomic positions.

  14. Similarity principles for equipment qualification by experience

    International Nuclear Information System (INIS)

    Kana, D.D.; Pomerening, D.J.

    1988-07-01

    A methodology is developed for seismic qualification of nuclear plant equipment by applying similarity principles to existing experience data. Experience data are available from previous qualifications by analysis or testing, or from actual earthquake events. Similarity principles are defined in terms of excitation, equipment physical characteristics, and equipment response. Physical similarity is further defined in terms of a critical transfer function for response at a location on a primary structure, whose response can be assumed directly related to ultimate fragility of the item under elevated levels of excitation. Procedures are developed for combining experience data into composite specifications for qualification of equipment that can be shown to be physically similar to the reference equipment. Other procedures are developed for extending qualifications beyond the original specifications under certain conditions. Some examples for application of the procedures and verification of them are given for certain cases that can be approximated by a two degree of freedom simple primary/secondary system. Other examples are based on use of actual test data available from previous qualifications. Relationships of the developments with other previously-published methods are discussed. The developments are intended to elaborate on the rather broad revised guidelines developed by the IEEE 344 Standards Committee for equipment qualification in new nuclear plants. However, the results also contribute to filling a gap that exists between the IEEE 344 methodology and that previously developed by the Seismic Qualification Utilities Group. The relationship of the results to safety margin methodology is also discussed. (author)

  15. 7 CFR 51.1997 - Similar type.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar type. 51.1997 Section 51.1997 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards, Inspections, Marketing Practices), DEPARTMENT OF AGRICULTURE REGULATIONS AND STANDARDS UNDER THE AGRICULTURAL MARKETING ACT OF 1946...

  16. Efficient Similarity Retrieval in Music Databases

    DEFF Research Database (Denmark)

    Ruxanda, Maria Magdalena; Jensen, Christian Søndergaard

    2006-01-01

    Audio music is increasingly becoming available in digital form, and the digital music collections of individuals continue to grow. Addressing the need for effective means of retrieving music from such collections, this paper proposes new techniques for content-based similarity search. Each music...

  17. Similarity search of business process models

    NARCIS (Netherlands)

    Dumas, M.; García-Bañuelos, L.; Dijkman, R.M.

    2009-01-01

    Similarity search is a general class of problems in which a given object, called a query object, is compared against a collection of objects in order to retrieve those that most closely resemble the query object. This paper reviews recent work on an instance of this class of problems, where the

  18. Evaluating gender similarities and differences using metasynthesis.

    Science.gov (United States)

    Zell, Ethan; Krizan, Zlatan; Teeter, Sabrina R

    2015-01-01

    Despite the common lay assumption that males and females are profoundly different, Hyde (2005) used data from 46 meta-analyses to demonstrate that males and females are highly similar. Nonetheless, the gender similarities hypothesis has remained controversial. Since Hyde's provocative report, there has been an explosion of meta-analytic interest in psychological gender differences. We utilized this enormous collection of 106 meta-analyses and 386 individual meta-analytic effects to reevaluate the gender similarities hypothesis. Furthermore, we employed a novel data-analytic approach called metasynthesis (Zell & Krizan, 2014) to estimate the average difference between males and females and to explore moderators of gender differences. The average, absolute difference between males and females across domains was relatively small (d = 0.21, SD = 0.14), with the majority of effects being either small (46%) or very small (39%). Magnitude of differences fluctuated somewhat as a function of the psychological domain (e.g., cognitive variables, social and personality variables, well-being), but remained largely constant across age, culture, and generations. These findings provide compelling support for the gender similarities hypothesis, but also underscore conditions under which gender differences are most pronounced. PsycINFO Database Record (c) 2015 APA, all rights reserved.

  19. Cross-kingdom similarities in microbiome functions

    NARCIS (Netherlands)

    Mendes, R.; Raaijmakers, J.M.

    2015-01-01

    Recent advances in medical research have revealed how humans rely on their microbiome for diverse traits and functions. Similarly, microbiomes of other higher organisms play key roles in disease, health, growth and development of their host. Exploring microbiome functions across kingdoms holds

  20. Measuring structural similarity in large online networks.

    Science.gov (United States)

    Shi, Yongren; Macy, Michael

    2016-09-01

    Structural similarity based on bipartite graphs can be used to detect meaningful communities, but the networks have been tiny compared to massive online networks. Scalability is important in applications involving tens of millions of individuals with highly skewed degree distributions. Simulation analysis holding underlying similarity constant shows that two widely used measures - Jaccard index and cosine similarity - are biased by the distribution of out-degree in web-scale networks. However, an alternative measure, the Standardized Co-incident Ratio (SCR), is unbiased. We apply SCR to members of Congress, musical artists, and professional sports teams to show how massive co-following on Twitter can be used to map meaningful affiliations among cultural entities, even in the absence of direct connections to one another. Our results show how structural similarity can be used to map cultural alignments and demonstrate the potential usefulness of social media data in the study of culture, politics, and organizations across the social and behavioral sciences. Copyright © 2016 Elsevier Inc. All rights reserved.

  1. Phonological Similarity in American Sign Language.

    Science.gov (United States)

    Hildebrandt, Ursula; Corina, David

    2002-01-01

    Investigates deaf and hearing subjects' ratings of American Sign Language (ASL) signs to assess whether linguistic experience shapes judgments of sign similarity. Findings are consistent with linguistic theories that posit movement and location as core structural elements of syllable structure in ASL. (Author/VWL)

  2. Structural similarity and category-specificity

    DEFF Research Database (Denmark)

    Gerlach, Christian; Law, Ian; Paulson, Olaf B

    2004-01-01

    It has been suggested that category-specific recognition disorders for natural objects may reflect that natural objects are more structurally (visually) similar than artefacts and therefore more difficult to recognize following brain damage. On this account one might expect a positive relationshi...

  3. Music Retrieval based on Melodic Similarity

    NARCIS (Netherlands)

    Typke, R.

    2007-01-01

    This thesis introduces a method for measuring melodic similarity for notated music such as MIDI files. This music search algorithm views music as sets of notes that are represented as weighted points in the two-dimensional space of time and pitch. Two point sets can be compared by calculating how

  4. Measurement of Similarity in Academic Contexts

    Directory of Open Access Journals (Sweden)

    Omid Mahian

    2017-06-01

    Full Text Available We propose some reflections, comments and suggestions about the measurement of similar and matched content in scientific papers and documents, and the need to develop appropriate tools and standards for an ethically fair and equitable treatment of authors.

  5. Appropriate Similarity Measures for Author Cocitation Analysis

    NARCIS (Netherlands)

    N.J.P. van Eck (Nees Jan); L. Waltman (Ludo)

    2007-01-01

    textabstractWe provide a number of new insights into the methodological discussion about author cocitation analysis. We first argue that the use of the Pearson correlation for measuring the similarity between authors’ cocitation profiles is not very satisfactory. We then discuss what kind of

  6. Similarity of Experience and Empathy in Preschoolers.

    Science.gov (United States)

    Barnett, Mark A.

    The present study examined the role of similarity of experience in young children's affective reactions to others. Some preschoolers played one of two games (Puzzle Board or Buckets) and were informed that they had either failed or succeeded; others merely observed the games being played and were given no evaluative feedback. Subsequently, each…

  7. Cultural Similarities and Differences on Idiom Translation

    Institute of Scientific and Technical Information of China (English)

    黄频频; 陈于全

    2010-01-01

    Both English and Chinese are abound with idioms. Idioms are an important part of the hnguage and culture of a society. English and Chinese idioms carved with cultural characteristics account for a great part in the tramlation. This paper studies the translation of idioms concerning their cultural similarities, cultural differences and transhtion principles.

  8. Learning by similarity in coordination problems

    Czech Academy of Sciences Publication Activity Database

    Steiner, Jakub; Stewart, C.

    -, č. 324 (2007), s. 1-40 ISSN 1211-3298 R&D Projects: GA MŠk LC542 Institutional research plan: CEZ:AV0Z70850503 Keywords : similarity * learning * case-based reasoning Subject RIV: AH - Economics http://www.cerge-ei.cz/pdf/wp/Wp324.pdf

  9. Outsourced similarity search on metric data assets

    KAUST Repository

    Yiu, Man Lung; Assent, Ira; Jensen, Christian Sø ndergaard; Kalnis, Panos

    2012-01-01

    for the most similar data objects to a query example. Outsourcing offers the data owner scalability and a low-initial investment. The need for privacy may be due to the data being sensitive (e.g., in medicine), valuable (e.g., in astronomy), or otherwise

  10. Unique Features of Halophilic Proteins.

    Science.gov (United States)

    Arakawa, Tsutomu; Yamaguchi, Rui; Tokunaga, Hiroko; Tokunaga, Masao

    2017-01-01

    Proteins from moderate and extreme halophiles have unique characteristics. They are highly acidic and hydrophilic, similar to intrinsically disordered proteins. These characteristics make the halophilic proteins soluble in water and fold reversibly. In addition to reversible folding, the rate of refolding of halophilic proteins from denatured structure is generally slow, often taking several days, for example, for extremely halophilic proteins. This slow folding rate makes the halophilic proteins a novel model system for folding mechanism analysis. High solubility and reversible folding also make the halophilic proteins excellent fusion partners for soluble expression of recombinant proteins.

  11. Extending the Similarity-Attraction Effect : The effects of When-Similarity in mediated communication

    NARCIS (Netherlands)

    Kaptein, M.C.; Castaneda, D.; Fernandez, N.; Nass, C.

    2014-01-01

    The feeling of connectedness experienced in computer-mediated relationships can be explained by the similarity-attraction effect (SAE). Though SAE is well established in psychology, the effects of some types of similarity have not yet been explored. In 2 studies, we demonstrate similarity-attraction

  12. Popularity versus similarity in growing networks

    Science.gov (United States)

    Krioukov, Dmitri; Papadopoulos, Fragkiskos; Kitsak, Maksim; Serrano, Mariangeles; Boguna, Marian

    2012-02-01

    Preferential attachment is a powerful mechanism explaining the emergence of scaling in growing networks. If new connections are established preferentially to more popular nodes in a network, then the network is scale-free. Here we show that not only popularity but also similarity is a strong force shaping the network structure and dynamics. We develop a framework where new connections, instead of preferring popular nodes, optimize certain trade-offs between popularity and similarity. The framework admits a geometric interpretation, in which preferential attachment emerges from local optimization processes. As opposed to preferential attachment, the optimization framework accurately describes large-scale evolution of technological (Internet), social (web of trust), and biological (E.coli metabolic) networks, predicting the probability of new links in them with a remarkable precision. The developed framework can thus be used for predicting new links in evolving networks, and provides a different perspective on preferential attachment as an emergent phenomenon.

  13. Similarity, trust in institutions, affect, and populism

    DEFF Research Database (Denmark)

    Scholderer, Joachim; Finucane, Melissa L.

    -based evaluations are fundamental to human information processing, they can contribute significantly to other judgments (such as the risk, cost-effectiveness, trustworthiness) of the same stimulus object. Although deliberation and analysis are certainly important in some decision-making circumstances, reliance...... on affect is a quicker, easier, and a more efficient way of navigating in a complex and uncertain world. Hence, many theorists give affect a direct and primary role in motivating behavior. Taken together, the results provide uncannily strong support for the value-similarity hypothesis, strengthening...... types of information about gene technology. The materials were attributed to different institutions. The results indicated that participants' trust in an institution was a function of the similarity between the position advocated in the materials and participants' own attitudes towards gene technology...

  14. Contingency and similarity in response selection.

    Science.gov (United States)

    Prinz, Wolfgang

    2018-05-09

    This paper explores issues of task representation in choice reaction time tasks. How is it possible, and what does it take, to represent such a task in a way that enables a performer to do the task in line with the prescriptions entailed in the instructions? First, a framework for task representation is outlined which combines the implementation of task sets and their use for performance with different kinds of representational operations (pertaining to feature compounds for event codes and code assemblies for task sets, respectively). Then, in a second step, the framework is itself embedded in the bigger picture of the classical debate on the roles of contingency and similarity for the formation of associations. The final conclusion is that both principles are needed and that the operation of similarity at the level of task sets requires and presupposes the operation of contingency at the level of event codes. Copyright © 2018 The Author. Published by Elsevier Inc. All rights reserved.

  15. Similarity and Modeling in Science and Engineering

    CERN Document Server

    Kuneš, Josef

    2012-01-01

    The present text sets itself in relief to other titles on the subject in that it addresses the means and methodologies versus a narrow specific-task oriented approach. Concepts and their developments which evolved to meet the changing needs of applications are addressed. This approach provides the reader with a general tool-box to apply to their specific needs. Two important tools are presented: dimensional analysis and the similarity analysis methods. The fundamental point of view, enabling one to sort all models, is that of information flux between a model and an original expressed by the similarity and abstraction. Each chapter includes original examples and ap-plications. In this respect, the models can be divided into several groups. The following models are dealt with separately by chapter; mathematical and physical models, physical analogues, deterministic, stochastic, and cybernetic computer models. The mathematical models are divided into asymptotic and phenomenological models. The phenomenological m...

  16. Similarity solutions for phase-change problems

    Science.gov (United States)

    Canright, D.; Davis, S. H.

    1989-01-01

    A modification of Ivantsov's (1947) similarity solutions is proposed which can describe phase-change processes which are limited by diffusion. The method has application to systems that have n-components and possess cross-diffusion and Soret and Dufour effects, along with convection driven by density discontinuities at the two-phase interface. Local thermal equilibrium is assumed at the interface. It is shown that analytic solutions are possible when the material properties are constant.

  17. Stochastic self-similar and fractal universe

    International Nuclear Information System (INIS)

    Iovane, G.; Laserra, E.; Tortoriello, F.S.

    2004-01-01

    The structures formation of the Universe appears as if it were a classically self-similar random process at all astrophysical scales. An agreement is demonstrated for the present hypotheses of segregation with a size of astrophysical structures by using a comparison between quantum quantities and astrophysical ones. We present the observed segregated Universe as the result of a fundamental self-similar law, which generalizes the Compton wavelength relation. It appears that the Universe has a memory of its quantum origin as suggested by R. Penrose with respect to quasi-crystal. A more accurate analysis shows that the present theory can be extended from the astrophysical to the nuclear scale by using generalized (stochastically) self-similar random process. This transition is connected to the relevant presence of the electromagnetic and nuclear interactions inside the matter. In this sense, the presented rule is correct from a subatomic scale to an astrophysical one. We discuss the near full agreement at organic cell scale and human scale too. Consequently the Universe, with its structures at all scales (atomic nucleus, organic cell, human, planet, solar system, galaxy, clusters of galaxy, super clusters of galaxy), could have a fundamental quantum reason. In conclusion, we analyze the spatial dimensions of the objects in the Universe as well as space-time dimensions. The result is that it seems we live in an El Naschie's E-infinity Cantorian space-time; so we must seriously start considering fractal geometry as the geometry of nature, a type of arena where the laws of physics appear at each scale in a self-similar way as advocated long ago by the Swedish school of astrophysics

  18. Similarity-based Polymorphic Shellcode Detection

    Directory of Open Access Journals (Sweden)

    Denis Yurievich Gamayunov

    2013-02-01

    Full Text Available In the work the method for polymorphic shellcode dedection based on the set of known shellcodes is proposed. The method’s main idea is in sequential applying of deobfuscating transformations to a data analyzed and then recognizing similarity with malware samples. The method has been tested on the sets of shellcodes generated using Metasploit Framework v.4.1.0 and PELock Obfuscator and shows 87 % precision with zero false positives rate.

  19. Quasi-Similarity Model of Synthetic Jets

    Czech Academy of Sciences Publication Activity Database

    Tesař, Václav; Kordík, Jozef

    2009-01-01

    Roč. 149, č. 2 (2009), s. 255-265 ISSN 0924-4247 R&D Projects: GA AV ČR IAA200760705; GA ČR GA101/07/1499 Institutional research plan: CEZ:AV0Z20760514 Keywords : jets * synthetic jets * similarity solution Subject RIV: BK - Fluid Dynamics Impact factor: 1.674, year: 2009 http://www.sciencedirect.com

  20. Multidimensional Scaling Visualization using Parametric Similarity Indices

    OpenAIRE

    Machado, J. A. Tenreiro; Lopes, António M.; Galhano, A.M.

    2015-01-01

    In this paper, we apply multidimensional scaling (MDS) and parametric similarity indices (PSI) in the analysis of complex systems (CS). Each CS is viewed as a dynamical system, exhibiting an output time-series to be interpreted as a manifestation of its behavior. We start by adopting a sliding window to sample the original data into several consecutive time periods. Second, we define a given PSI for tracking pieces of data. We then compare the windows for different values of the parameter, an...

  1. The fluid similarity of the boiling crisis

    International Nuclear Information System (INIS)

    Katsaounis, A.

    1986-01-01

    Most of the measurements related to the boiling crisis have, until now, been undertaken for a wide parameter variation in the water, and were mainly related to the water-cooled reactor. This article investigates, whether or how the measuring results can be transferred to other fluids. Derived dimensionless similarity figures and those taken from literature are verified by measurements from complex geometries in water and freon 12. (orig.) [de

  2. The fluid similarity of the boiling crisis

    International Nuclear Information System (INIS)

    Katsaounis, A.

    1987-01-01

    Most of the measurements related to the boiling crisis have, until now, been undertaken for a wide parameter variation in the water, and were mainly related to the water-cooled reactor. This article investigates, whether or how the measuring results can be transferred to other fluids. Derived dimensionless similarity figures and those taken from literature are verified by measurements from complex geometries in water and freon 12. (orig./GL) [de

  3. Semantic Similarity between Web Documents Using Ontology

    Science.gov (United States)

    Chahal, Poonam; Singh Tomer, Manjeet; Kumar, Suresh

    2018-06-01

    The World Wide Web is the source of information available in the structure of interlinked web pages. However, the procedure of extracting significant information with the assistance of search engine is incredibly critical. This is for the reason that web information is written mainly by using natural language, and further available to individual human. Several efforts have been made in semantic similarity computation between documents using words, concepts and concepts relationship but still the outcome available are not as per the user requirements. This paper proposes a novel technique for computation of semantic similarity between documents that not only takes concepts available in documents but also relationships that are available between the concepts. In our approach documents are being processed by making ontology of the documents using base ontology and a dictionary containing concepts records. Each such record is made up of the probable words which represents a given concept. Finally, document ontology's are compared to find their semantic similarity by taking the relationships among concepts. Relevant concepts and relations between the concepts have been explored by capturing author and user intention. The proposed semantic analysis technique provides improved results as compared to the existing techniques.

  4. Semantic Similarity between Web Documents Using Ontology

    Science.gov (United States)

    Chahal, Poonam; Singh Tomer, Manjeet; Kumar, Suresh

    2018-03-01

    The World Wide Web is the source of information available in the structure of interlinked web pages. However, the procedure of extracting significant information with the assistance of search engine is incredibly critical. This is for the reason that web information is written mainly by using natural language, and further available to individual human. Several efforts have been made in semantic similarity computation between documents using words, concepts and concepts relationship but still the outcome available are not as per the user requirements. This paper proposes a novel technique for computation of semantic similarity between documents that not only takes concepts available in documents but also relationships that are available between the concepts. In our approach documents are being processed by making ontology of the documents using base ontology and a dictionary containing concepts records. Each such record is made up of the probable words which represents a given concept. Finally, document ontology's are compared to find their semantic similarity by taking the relationships among concepts. Relevant concepts and relations between the concepts have been explored by capturing author and user intention. The proposed semantic analysis technique provides improved results as compared to the existing techniques.

  5. Emergent self-similarity of cluster coagulation

    Science.gov (United States)

    Pushkin, Dmtiri O.

    A wide variety of nonequilibrium processes, such as coagulation of colloidal particles, aggregation of bacteria into colonies, coalescence of rain drops, bond formation between polymerization sites, and formation of planetesimals, fall under the rubric of cluster coagulation. We predict emergence of self-similar behavior in such systems when they are 'forced' by an external source of the smallest particles. The corresponding self-similar coagulation spectra prove to be power laws. Starting from the classical Smoluchowski coagulation equation, we identify the conditions required for emergence of self-similarity and show that the power-law exponent value for a particular coagulation mechanism depends on the homogeneity index of the corresponding coagulation kernel only. Next, we consider the current wave of mergers of large American banks as an 'unorthodox' application of coagulation theory. We predict that the bank size distribution has propensity to become a power law, and verify our prediction in a statistical study of the available economical data. We conclude this chapter by discussing economically significant phenomenon of capital condensation and predicting emergence of power-law distributions in other economical and social data. Finally, we turn to apparent semblance between cluster coagulation and turbulence and conclude that it is not accidental: both of these processes are instances of nonlinear cascades. This class of processes also includes river network formation models, certain force-chain models in granular mechanics, fragmentation due to collisional cascades, percolation, and growing random networks. We characterize a particular cascade by three indicies and show that the resulting power-law spectrum exponent depends on the indicies values only. The ensuing algebraic formula is remarkable for its simplicity.

  6. FRESCO: Referential compression of highly similar sequences.

    Science.gov (United States)

    Wandelt, Sebastian; Leser, Ulf

    2013-01-01

    In many applications, sets of similar texts or sequences are of high importance. Prominent examples are revision histories of documents or genomic sequences. Modern high-throughput sequencing technologies are able to generate DNA sequences at an ever-increasing rate. In parallel to the decreasing experimental time and cost necessary to produce DNA sequences, computational requirements for analysis and storage of the sequences are steeply increasing. Compression is a key technology to deal with this challenge. Recently, referential compression schemes, storing only the differences between a to-be-compressed input and a known reference sequence, gained a lot of interest in this field. In this paper, we propose a general open-source framework to compress large amounts of biological sequence data called Framework for REferential Sequence COmpression (FRESCO). Our basic compression algorithm is shown to be one to two orders of magnitudes faster than comparable related work, while achieving similar compression ratios. We also propose several techniques to further increase compression ratios, while still retaining the advantage in speed: 1) selecting a good reference sequence; and 2) rewriting a reference sequence to allow for better compression. In addition,we propose a new way of further boosting the compression ratios by applying referential compression to already referentially compressed files (second-order compression). This technique allows for compression ratios way beyond state of the art, for instance,4,000:1 and higher for human genomes. We evaluate our algorithms on a large data set from three different species (more than 1,000 genomes, more than 3 TB) and on a collection of versions of Wikipedia pages. Our results show that real-time compression of highly similar sequences at high compression ratios is possible on modern hardware.

  7. Spherically symmetric self-similar universe

    Energy Technology Data Exchange (ETDEWEB)

    Dyer, C C [Toronto Univ., Ontario (Canada)

    1979-10-01

    A spherically symmetric self-similar dust-filled universe is considered as a simple model of a hierarchical universe. Observable differences between the model in parabolic expansion and the corresponding homogeneous Einstein-de Sitter model are considered in detail. It is found that an observer at the centre of the distribution has a maximum observable redshift and can in principle see arbitrarily large blueshifts. It is found to yield an observed density-distance law different from that suggested by the observations of de Vaucouleurs. The use of these solutions as central objects for Swiss-cheese vacuoles is discussed.

  8. Image magnification based on similarity analogy

    International Nuclear Information System (INIS)

    Chen Zuoping; Ye Zhenglin; Wang Shuxun; Peng Guohua

    2009-01-01

    Aiming at the high time complexity of the decoding phase in the traditional image enlargement methods based on fractal coding, a novel image magnification algorithm is proposed in this paper, which has the advantage of iteration-free decoding, by using the similarity analogy between an image and its zoom-out and zoom-in. A new pixel selection technique is also presented to further improve the performance of the proposed method. Furthermore, by combining some existing fractal zooming techniques, an efficient image magnification algorithm is obtained, which can provides the image quality as good as the state of the art while greatly decrease the time complexity of the decoding phase.

  9. Modeling Timbre Similarity of Short Music Clips.

    Science.gov (United States)

    Siedenburg, Kai; Müllensiefen, Daniel

    2017-01-01

    There is evidence from a number of recent studies that most listeners are able to extract information related to song identity, emotion, or genre from music excerpts with durations in the range of tenths of seconds. Because of these very short durations, timbre as a multifaceted auditory attribute appears as a plausible candidate for the type of features that listeners make use of when processing short music excerpts. However, the importance of timbre in listening tasks that involve short excerpts has not yet been demonstrated empirically. Hence, the goal of this study was to develop a method that allows to explore to what degree similarity judgments of short music clips can be modeled with low-level acoustic features related to timbre. We utilized the similarity data from two large samples of participants: Sample I was obtained via an online survey, used 16 clips of 400 ms length, and contained responses of 137,339 participants. Sample II was collected in a lab environment, used 16 clips of 800 ms length, and contained responses from 648 participants. Our model used two sets of audio features which included commonly used timbre descriptors and the well-known Mel-frequency cepstral coefficients as well as their temporal derivates. In order to predict pairwise similarities, the resulting distances between clips in terms of their audio features were used as predictor variables with partial least-squares regression. We found that a sparse selection of three to seven features from both descriptor sets-mainly encoding the coarse shape of the spectrum as well as spectrotemporal variability-best predicted similarities across the two sets of sounds. Notably, the inclusion of non-acoustic predictors of musical genre and record release date allowed much better generalization performance and explained up to 50% of shared variance ( R 2 ) between observations and model predictions. Overall, the results of this study empirically demonstrate that both acoustic features related

  10. Similar on the Inside (pre-grinding)

    Science.gov (United States)

    2004-01-01

    This approximate true-color image taken by the panoramic camera on the Mars Exploration Rover Opportunity show the rock called 'Pilbara' located in the small crater dubbed 'Fram.' The rock appears to be dotted with the same 'blueberries,' or spherules, found at 'Eagle Crater.' Spirit drilled into this rock with its rock abrasion tool. After analyzing the hole with the rover's scientific instruments, scientists concluded that Pilbara has a similar chemical make-up, and thus watery past, to rocks studied at Eagle Crater. This image was taken with the panoramic camera's 480-, 530- and 600-nanometer filters.

  11. Similar on the Inside (post-grinding)

    Science.gov (United States)

    2004-01-01

    This approximate true-color image taken by the panoramic camera on the Mars Exploration Rover Opportunity show the hole drilled into the rock called 'Pilbara,' which is located in the small crater dubbed 'Fram.' Spirit drilled into this rock with its rock abrasion tool. The rock appears to be dotted with the same 'blueberries,' or spherules, found at 'Eagle Crater.' After analyzing the hole with the rover's scientific instruments, scientists concluded that Pilbara has a similar chemical make-up, and thus watery past, to rocks studied at Eagle Crater. This image was taken with the panoramic camera's 480-, 530- and 600-nanometer filters.

  12. Self-similar magnetohydrodynamic boundary layers

    Energy Technology Data Exchange (ETDEWEB)

    Nunez, Manuel; Lastra, Alberto, E-mail: mnjmhd@am.uva.e [Departamento de Analisis Matematico, Universidad de Valladolid, 47005 Valladolid (Spain)

    2010-10-15

    The boundary layer created by parallel flow in a magnetized fluid of high conductivity is considered in this paper. Under appropriate boundary conditions, self-similar solutions analogous to the ones studied by Blasius for the hydrodynamic problem may be found. It is proved that for these to be stable, the size of the Alfven velocity at the outer flow must be smaller than the flow velocity, a fact that has a ready physical explanation. The process by which the transverse velocity and the thickness of the layer grow with the size of the Alfven velocity is detailed.

  13. Self-similar magnetohydrodynamic boundary layers

    International Nuclear Information System (INIS)

    Nunez, Manuel; Lastra, Alberto

    2010-01-01

    The boundary layer created by parallel flow in a magnetized fluid of high conductivity is considered in this paper. Under appropriate boundary conditions, self-similar solutions analogous to the ones studied by Blasius for the hydrodynamic problem may be found. It is proved that for these to be stable, the size of the Alfven velocity at the outer flow must be smaller than the flow velocity, a fact that has a ready physical explanation. The process by which the transverse velocity and the thickness of the layer grow with the size of the Alfven velocity is detailed.

  14. [Similarity system theory to evaluate similarity of chromatographic fingerprints of traditional Chinese medicine].

    Science.gov (United States)

    Liu, Yongsuo; Meng, Qinghua; Jiang, Shumin; Hu, Yuzhu

    2005-03-01

    The similarity evaluation of the fingerprints is one of the most important problems in the quality control of the traditional Chinese medicine (TCM). Similarity measures used to evaluate the similarity of the common peaks in the chromatogram of TCM have been discussed. Comparative studies were carried out among correlation coefficient, cosine of the angle and an improved extent similarity method using simulated data and experimental data. Correlation coefficient and cosine of the angle are not sensitive to the differences of the data set. They are still not sensitive to the differences of the data even after normalization. According to the similarity system theory, an improved extent similarity method was proposed. The improved extent similarity is more sensitive to the differences of the data sets than correlation coefficient and cosine of the angle. And the character of the data sets needs not to be changed compared with log-transformation. The improved extent similarity can be used to evaluate the similarity of the chromatographic fingerprints of TCM.

  15. Protein carbonylation in plants

    DEFF Research Database (Denmark)

    Møller, Ian Max; Havelund, Jesper; Rogowska-Wrzesinska, Adelina

    2017-01-01

    This chapter provides an overview of the current knowledge on protein carbonylation in plants and its role in plant physiology. It starts with a brief outline of the turnover and production sites of reactive oxygen species (ROS) in plants and the causes of protein carbonylation. This is followed...... by a description of the methods used to study protein carbonylation in plants, which is also very brief as the methods are similar to those used in studies on animals. The chapter also focuses on protein carbonylation in plants in general and in mitochondria and in seeds in particular, as case stories where...... specific carbonylated proteins have been identified. Protein carbonylation appears to accumulate at all stages of seed development and germination investigated to date. In some cases, such as seed aging, it is probably simply an accumulation of oxidative damage. However, in other cases protein...

  16. Gait Recognition Using Image Self-Similarity

    Directory of Open Access Journals (Sweden)

    Chiraz BenAbdelkader

    2004-04-01

    Full Text Available Gait is one of the few biometrics that can be measured at a distance, and is hence useful for passive surveillance as well as biometric applications. Gait recognition research is still at its infancy, however, and we have yet to solve the fundamental issue of finding gait features which at once have sufficient discrimination power and can be extracted robustly and accurately from low-resolution video. This paper describes a novel gait recognition technique based on the image self-similarity of a walking person. We contend that the similarity plot encodes a projection of gait dynamics. It is also correspondence-free, robust to segmentation noise, and works well with low-resolution video. The method is tested on multiple data sets of varying sizes and degrees of difficulty. Performance is best for fronto-parallel viewpoints, whereby a recognition rate of 98% is achieved for a data set of 6 people, and 70% for a data set of 54 people.

  17. Self-similarity in applied superconductivity

    International Nuclear Information System (INIS)

    Dresner, Lawrence

    1981-09-01

    Self-similarity is a descriptive term applying to a family of curves. It means that the family is invariant to a one-parameter group of affine (stretching) transformations. The property of self-similarity has been exploited in a wide variety of problems in applied superconductivity, namely, (i) transient distribution of the current among the filaments of a superconductor during charge-up, (ii) steady distribution of current among the filaments of a superconductor near the current leads, (iii) transient heat transfer in superfluid helium, (iv) transient diffusion in cylindrical geometry (important in studying the growth rate of the reacted layer in A15 materials), (v) thermal expulsion of helium from quenching cable-in-conduit conductors, (vi) eddy current heating of irregular plates by slow, ramped fields, and (vii) the specific heat of type-II superconductors. Most, but not all, of the applications involve differential equations, both ordinary and partial. The novel methods explained in this report should prove of great value in other fields, just as they already have done in applied superconductivity. (author)

  18. Phonological similarity effect in complex span task.

    Science.gov (United States)

    Camos, Valérie; Mora, Gérôme; Barrouillet, Pierre

    2013-01-01

    The aim of our study was to test the hypothesis that two systems are involved in verbal working memory; one is specifically dedicated to the maintenance of phonological representations through verbal rehearsal while the other would maintain multimodal representations through attentional refreshing. This theoretical framework predicts that phonologically related phenomena such as the phonological similarity effect (PSE) should occur when the domain-specific system is involved in maintenance, but should disappear when concurrent articulation hinders its use. Impeding maintenance in the domain-general system by a concurrent attentional demand should impair recall performance without affecting PSE. In three experiments, we manipulated the concurrent articulation and the attentional demand induced by the processing component of complex span tasks in which participants had to maintain lists of either similar or dissimilar words. Confirming our predictions, PSE affected recall performance in complex span tasks. Although both the attentional demand and the articulatory requirement of the concurrent task impaired recall, only the induction of an articulatory suppression during maintenance made the PSE disappear. These results suggest a duality in the systems devoted to verbal maintenance in the short term, constraining models of working memory.

  19. Popularity versus similarity in growing networks.

    Science.gov (United States)

    Papadopoulos, Fragkiskos; Kitsak, Maksim; Serrano, M Ángeles; Boguñá, Marián; Krioukov, Dmitri

    2012-09-27

    The principle that 'popularity is attractive' underlies preferential attachment, which is a common explanation for the emergence of scaling in growing networks. If new connections are made preferentially to more popular nodes, then the resulting distribution of the number of connections possessed by nodes follows power laws, as observed in many real networks. Preferential attachment has been directly validated for some real networks (including the Internet), and can be a consequence of different underlying processes based on node fitness, ranking, optimization, random walks or duplication. Here we show that popularity is just one dimension of attractiveness; another dimension is similarity. We develop a framework in which new connections optimize certain trade-offs between popularity and similarity, instead of simply preferring popular nodes. The framework has a geometric interpretation in which popularity preference emerges from local optimization. As opposed to preferential attachment, our optimization framework accurately describes the large-scale evolution of technological (the Internet), social (trust relationships between people) and biological (Escherichia coli metabolic) networks, predicting the probability of new links with high precision. The framework that we have developed can thus be used for predicting new links in evolving networks, and provides a different perspective on preferential attachment as an emergent phenomenon.

  20. Predicting the performance of fingerprint similarity searching.

    Science.gov (United States)

    Vogt, Martin; Bajorath, Jürgen

    2011-01-01

    Fingerprints are bit string representations of molecular structure that typically encode structural fragments, topological features, or pharmacophore patterns. Various fingerprint designs are utilized in virtual screening and their search performance essentially depends on three parameters: the nature of the fingerprint, the active compounds serving as reference molecules, and the composition of the screening database. It is of considerable interest and practical relevance to predict the performance of fingerprint similarity searching. A quantitative assessment of the potential that a fingerprint search might successfully retrieve active compounds, if available in the screening database, would substantially help to select the type of fingerprint most suitable for a given search problem. The method presented herein utilizes concepts from information theory to relate the fingerprint feature distributions of reference compounds to screening libraries. If these feature distributions do not sufficiently differ, active database compounds that are similar to reference molecules cannot be retrieved because they disappear in the "background." By quantifying the difference in feature distribution using the Kullback-Leibler divergence and relating the divergence to compound recovery rates obtained for different benchmark classes, fingerprint search performance can be quantitatively predicted.

  1. K-nearest uphill clustering in the protein structure space

    KAUST Repository

    Cui, Xuefeng; Gao, Xin

    2016-01-01

    The protein structure classification problem, which is to assign a protein structure to a cluster of similar proteins, is one of the most fundamental problems in the construction and application of the protein structure space. Early manually curated

  2. Protein - Which is Best?

    Science.gov (United States)

    Hoffman, Jay R; Falvo, Michael J

    2004-09-01

    Protein intake that exceeds the recommended daily allowance is widely accepted for both endurance and power athletes. However, considering the variety of proteins that are available much less is known concerning the benefits of consuming one protein versus another. The purpose of this paper is to identify and analyze key factors in order to make responsible recommendations to both the general and athletic populations. Evaluation of a protein is fundamental in determining its appropriateness in the human diet. Proteins that are of inferior content and digestibility are important to recognize and restrict or limit in the diet. Similarly, such knowledge will provide an ability to identify proteins that provide the greatest benefit and should be consumed. The various techniques utilized to rate protein will be discussed. Traditionally, sources of dietary protein are seen as either being of animal or vegetable origin. Animal sources provide a complete source of protein (i.e. containing all essential amino acids), whereas vegetable sources generally lack one or more of the essential amino acids. Animal sources of dietary protein, despite providing a complete protein and numerous vitamins and minerals, have some health professionals concerned about the amount of saturated fat common in these foods compared to vegetable sources. The advent of processing techniques has shifted some of this attention and ignited the sports supplement marketplace with derivative products such as whey, casein and soy. Individually, these products vary in quality and applicability to certain populations. The benefits that these particular proteins possess are discussed. In addition, the impact that elevated protein consumption has on health and safety issues (i.e. bone health, renal function) are also reviewed. Key PointsHigher protein needs are seen in athletic populations.Animal proteins is an important source of protein, however potential health concerns do exist from a diet of protein

  3. Similar or different?: the importance of similarities and differences for support between siblings

    NARCIS (Netherlands)

    Voorpostel, M.; van der Lippe, T.; Dykstra, P.A.; Flap, H.

    2007-01-01

    Using a large-scale Dutch national sample (N = 7,126), the authors examine the importance of similarities and differences in the sibling dyad for the provision of support. Similarities are assumed to enhance attraction and empathy; differences are assumed to be related to different possibilities for

  4. Similar or Different? The Importance of Similarities and Differences for Support Between Siblings

    NARCIS (Netherlands)

    Voorpostel, Marieke; Lippe, Tanja van der; Dykstra, Pearl A.; Flap, Henk

    2007-01-01

    Using a large-scale Dutch national sample (N = 7,126), the authors examine the importance of similarities and differences in the sibling dyad for the provision of support. Similarities are assumed to enhance attraction and empathy; differences are assumed to be related to different possibilities for

  5. Similarity problems and completely bounded maps

    CERN Document Server

    Pisier, Gilles

    2001-01-01

    These notes revolve around three similarity problems, appearing in three different contexts, but all dealing with the space B(H) of all bounded operators on a complex Hilbert space H. The first one deals with group representations, the second one with C* -algebras and the third one with the disc algebra. We describe them in detail in the introduction which follows. This volume is devoted to the background necessary to understand these three problems, to the solutions that are known in some special cases and to numerous related concepts, results, counterexamples or extensions which their investigation has generated. While the three problems seem different, it is possible to place them in a common framework using the key concept of "complete boundedness", which we present in detail. Using this notion, the three problems can all be formulated as asking whether "boundedness" implies "complete boundedness" for linear maps satisfying certain additional algebraic identities. Two chapters have been added on the HALMO...

  6. Social values as arguments: similar is convincing

    Science.gov (United States)

    Maio, Gregory R.; Hahn, Ulrike; Frost, John-Mark; Kuppens, Toon; Rehman, Nadia; Kamble, Shanmukh

    2014-01-01

    Politicians, philosophers, and rhetors engage in co-value argumentation: appealing to one value in order to support another value (e.g., “equality leads to freedom”). Across four experiments in the United Kingdom and India, we found that the psychological relatedness of values affects the persuasiveness of the arguments that bind them. Experiment 1 found that participants were more persuaded by arguments citing values that fulfilled similar motives than by arguments citing opposing values. Experiments 2 and 3 replicated this result using a wider variety of values, while finding that the effect is stronger among people higher in need for cognition and that the effect is mediated by the greater plausibility of co-value arguments that link motivationally compatible values. Experiment 4 extended the effect to real-world arguments taken from political propaganda and replicated the mediating effect of argument plausibility. The findings highlight the importance of value relatedness in argument persuasiveness. PMID:25147529

  7. Image Steganalysis with Binary Similarity Measures

    Directory of Open Access Journals (Sweden)

    Kharrazi Mehdi

    2005-01-01

    Full Text Available We present a novel technique for steganalysis of images that have been subjected to embedding by steganographic algorithms. The seventh and eighth bit planes in an image are used for the computation of several binary similarity measures. The basic idea is that the correlation between the bit planes as well as the binary texture characteristics within the bit planes will differ between a stego image and a cover image. These telltale marks are used to construct a classifier that can distinguish between stego and cover images. We also provide experimental results using some of the latest steganographic algorithms. The proposed scheme is found to have complementary performance vis-à-vis Farid's scheme in that they outperform each other in alternate embedding techniques.

  8. A Lithium Vapor Box Divertor Similarity Experiment

    Science.gov (United States)

    Cohen, Robert A.; Emdee, Eric D.; Goldston, Robert J.; Jaworski, Michael A.; Schwartz, Jacob A.

    2017-10-01

    A lithium vapor box divertor offers an alternate means of managing the extreme power density of divertor plasmas by leveraging gaseous lithium to volumetrically extract power. The vapor box divertor is a baffled slot with liquid lithium coated walls held at temperatures which increase toward the divertor floor. The resulting vapor pressure differential drives gaseous lithium from hotter chambers into cooler ones, where the lithium condenses and returns. A similarity experiment was devised to investigate the advantages offered by a vapor box divertor design. We discuss the design, construction, and early findings of the vapor box divertor experiment including vapor can construction, power transfer calculations, joint integrity tests, and thermocouple data logging. Heat redistribution of an incident plasma-based heat flux from a typical linear plasma device is also presented. This work supported by DOE Contract No. DE-AC02-09CH11466 and The Princeton Environmental Institute.

  9. Correct Bayesian and frequentist intervals are similar

    International Nuclear Information System (INIS)

    Atwood, C.L.

    1986-01-01

    This paper argues that Bayesians and frequentists will normally reach numerically similar conclusions, when dealing with vague data or sparse data. It is shown that both statistical methodologies can deal reasonably with vague data. With sparse data, in many important practical cases Bayesian interval estimates and frequentist confidence intervals are approximately equal, although with discrete data the frequentist intervals are somewhat longer. This is not to say that the two methodologies are equally easy to use: The construction of a frequentist confidence interval may require new theoretical development. Bayesians methods typically require numerical integration, perhaps over many variables. Also, Bayesian can easily fall into the trap of over-optimism about their amount of prior knowledge. But in cases where both intervals are found correctly, the two intervals are usually not very different. (orig.)

  10. Soldier motivation – different or similar?

    DEFF Research Database (Denmark)

    Brænder, Morten; Andersen, Lotte Bøgh

    Recent research in military sociology has shown that in addition to their strong peer motivation modern soldiers are oriented toward contributing to society. It has not, however, been tested how soldier motivation differs from the motivation of other citizens in this respect. In this paper......, by means of public service motivation, a concept developed within the public administration literature, we compare soldier and civilian motivation. The contribution of this paper is an analysis of whether and how Danish combat soldiers differs from other Danes in regard to public service motivation? Using...... surveys with similar questions, we find that soldiers are more normatively motivated to contribute to society than other citizens (higher commitment to the public interest), while their affectively based motivation is lower (lower compassion). This points towards a potential problem in regard...

  11. Social Values as Arguments: Similar is Convincing

    Directory of Open Access Journals (Sweden)

    Gregory R Maio

    2014-08-01

    Full Text Available Politicians, philosophers, and rhetors engage in co-value argumentation: appealing to one value in order to support another value (e.g., equality leads to freedom. Across four experiments in the United Kingdom and India, we found that the psychological relatedness of values affects the persuasiveness of the arguments that bind them. Experiment 1 found that participants were more persuaded by arguments citing values that fulfilled similar motives than by arguments citing opposing values. Experiments 2 and 3 replicated this result using a wider variety of values, while finding that the effect is stronger among people higher in need for cognition and that the effect is mediated by the greater plausibility of co-value arguments that link motivationally compatible values. Experiment 4 extended the effect to real-world arguments taken from political propaganda and replicated the mediating effect of argument plausibility. The findings highlight the importance of value relatedness in argument persuasiveness.

  12. Formulation of similarity porous media systems

    International Nuclear Information System (INIS)

    Anderson, R.M.; Ford, W.T.; Ruttan, A.; Strauss, M.J.

    1982-01-01

    The mathematical formulation of the Porous Media System (PMS) describing two-phase, immiscible, compressible fluid flow in linear, homogeneous porous media is reviewed and expanded. It is shown that families of common vertex, coaxial parabolas and families of parallel lines are the only families of curves on which solutions of the PMS may be constant. A coordinate transformation is used to change the partial differential equations of the PMS to a system of ordinary differential equations, referred to as a similarity Porous Media System (SPMS), in which the independent variable denotes movement from curve to curve in a selected family of curves. Properties of solutions of the first boundary value problem are developed for the SPMS

  13. Contextual Factors for Finding Similar Experts

    DEFF Research Database (Denmark)

    Hofmann, Katja; Balog, Krisztian; Bogers, Toine

    2010-01-01

    -seeking models, are rarely taken into account. In this article, we extend content-based expert-finding approaches with contextual factors that have been found to influence human expert finding. We focus on a task of science communicators in a knowledge-intensive environment, the task of finding similar experts......, given an example expert. Our approach combines expertise-seeking and retrieval research. First, we conduct a user study to identify contextual factors that may play a role in the studied task and environment. Then, we design expert retrieval models to capture these factors. We combine these with content......-based retrieval models and evaluate them in a retrieval experiment. Our main finding is that while content-based features are the most important, human participants also take contextual factors into account, such as media experience and organizational structure. We develop two principled ways of modeling...

  14. Similarity queries for temporal toxicogenomic expression profiles.

    Directory of Open Access Journals (Sweden)

    Adam A Smith

    2008-07-01

    Full Text Available We present an approach for answering similarity queries about gene expression time series that is motivated by the task of characterizing the potential toxicity of various chemicals. Our approach involves two key aspects. First, our method employs a novel alignment algorithm based on time warping. Our time warping algorithm has several advantages over previous approaches. It allows the user to impose fairly strong biases on the form that the alignments can take, and it permits a type of local alignment in which the entirety of only one series has to be aligned. Second, our method employs a relaxed spline interpolation to predict expression responses for unmeasured time points, such that the spline does not necessarily exactly fit every observed point. We evaluate our approach using expression time series from the Edge toxicology database. Our experiments show the value of using spline representations for sparse time series. More significantly, they show that our time warping method provides more accurate alignments and classifications than previous standard alignment methods for time series.

  15. Humans and mice express similar olfactory preferences.

    Directory of Open Access Journals (Sweden)

    Nathalie Mandairon

    Full Text Available In humans, the pleasantness of odors is a major contributor to social relationships and food intake. Smells evoke attraction and repulsion responses, reflecting the hedonic value of the odorant. While olfactory preferences are known to be strongly modulated by experience and learning, it has been recently suggested that, in humans, the pleasantness of odors may be partly explained by the physicochemical properties of the odorant molecules themselves. If odor hedonic value is indeed predetermined by odorant structure, then it could be hypothesized that other species will show similar odor preferences to humans. Combining behavioral and psychophysical approaches, we here show that odorants rated as pleasant by humans were also those which, behaviorally, mice investigated longer and human subjects sniffed longer, thereby revealing for the first time a component of olfactory hedonic perception conserved across species. Consistent with this, we further show that odor pleasantness rating in humans and investigation time in mice were both correlated with the physicochemical properties of the molecules, suggesting that olfactory preferences are indeed partly engraved in the physicochemical structure of the odorant. That odor preferences are shared between mammal species and are guided by physicochemical features of odorant stimuli strengthens the view that odor preference is partially predetermined. These findings open up new perspectives for the study of the neural mechanisms of hedonic perception.

  16. Different-but-Similar Judgments by Bumblebees

    Directory of Open Access Journals (Sweden)

    Vicki Xu

    2016-08-01

    Full Text Available This study examines picture perception in an invertebrate. Two questions regarding possible picture-object correspondence are addressed for bumblebees (Bombus impatiens: (1 Do bees perceive the difference between an object and its corresponding picture even when they have not been trained to do so? (2 Do they also perceive the similarity? Twenty bees from each of four colonies underwent discrimination training of stimuli placed in a radial maze. Bees were trained to discriminate between two objects (artificial flowers in one group and between photos of those objects in another. Subsequent testing on unrewarding stimuli revealed, for both groups, a significant discrimination between the object and its photo: discrimination training was not necessary for bees to detect a difference between corresponding objects and pictures. We obtained not only object-to-picture transfer, as in previous research, but also the reverse: picture-to-object transfer. In the absence of the rewarding object, its photo, though never seen before by the bees, was accepted as a substitute. The reverse was also true. Bumblebees treated pictures as “different-but-similar” without having been trained to do so, which is in turn useful in floral categorization.

  17. Block generators for the similarity renormalization group

    Energy Technology Data Exchange (ETDEWEB)

    Huether, Thomas; Roth, Robert [TU Darmstadt (Germany)

    2016-07-01

    The Similarity Renormalization Group (SRG) is a powerful tool to improve convergence behavior of many-body calculations using NN and 3N interactions from chiral effective field theory. The SRG method decouples high and low-energy physics, through a continuous unitary transformation implemented via a flow equation approach. The flow is determined by a generator of choice. This generator governs the decoupling pattern and, thus, the improvement of convergence, but it also induces many-body interactions. Through the design of the generator we can optimize the balance between convergence and induced forces. We explore a new class of block generators that restrict the decoupling to the high-energy sector and leave the diagonalization in the low-energy sector to the many-body method. In this way one expects a suppression of induced forces. We analyze the induced many-body forces and the convergence behavior in light and medium-mass nuclei in No-Core Shell Model and In-Medium SRG calculations.

  18. State and Mafia, Differences and Similarities

    Directory of Open Access Journals (Sweden)

    Alfano Vincenzo

    2015-02-01

    Full Text Available The purpose of this article is to investigate about the differences and, if any, the similarities among the modern State and the mafia criminal organizations. In particular, starting from their definitions, I will try to find the differences between State and mafia, to then focus on the operational aspects of the functioning of these two organizations, with specific reference to the effect/impact that both these human constructs have on citizens’ existences, and especially on citizen’s economic lives. All this in order to understand whether it is possible to identify an objective difference – beside morals – between taxation by the modern State and extortion by criminal organizations. With this of course I do not want to argue that the mafia is in any way justifiable or absolvable, nor that it is better than the State. However, I want to investigate whether there is a real, logical reason why the State should be considered by the citizens more desirable than the criminal organizations oppressing Southern Italy, from a strictly logical point of view and not from the point of view of ethics and morality.

  19. Similarity of eigenstates in generalized labyrinth tilings

    International Nuclear Information System (INIS)

    Thiem, Stefanie; Schreiber, Michael

    2010-01-01

    The eigenstates of d-dimensional quasicrystalline models with a separable Hamiltonian are studied within the tight-binding model. The approach is based on mathematical sequences, constructed by an inflation rule P = {w → s,s → sws b-1 } describing the weak/strong couplings of atoms in a quasiperiodic chain. Higher-dimensional quasiperiodic tilings are constructed as a direct product of these chains and their eigenstates can be directly calculated by multiplying the energies E or wave functions ψ of the chain, respectively. Applying this construction rule, the grid in d dimensions splits into 2 d-1 different tilings, for which we investigated the characteristics of the wave functions. For the standard two-dimensional labyrinth tiling constructed from the octonacci sequence (b = 2) the lattice breaks up into two identical lattices, which consequently yield the same eigenstates. While this is not the case for b ≠ 2, our numerical results show that the wave functions of the different grids become increasingly similar for large system sizes. This can be explained by the fact that the structure of the 2 d-1 grids mainly differs at the boundaries and thus for large systems the eigenstates approach each other. This property allows us to analytically derive properties of the higher-dimensional generalized labyrinth tilings from the one-dimensional results. In particular participation numbers and corresponding scaling exponents have been determined.

  20. Genetic and 'cultural' similarity in wild chimpanzees.

    Science.gov (United States)

    Langergraber, Kevin E; Boesch, Christophe; Inoue, Eiji; Inoue-Murayama, Miho; Mitani, John C; Nishida, Toshisada; Pusey, Anne; Reynolds, Vernon; Schubert, Grit; Wrangham, Richard W; Wroblewski, Emily; Vigilant, Linda

    2011-02-07

    The question of whether animals possess 'cultures' or 'traditions' continues to generate widespread theoretical and empirical interest. Studies of wild chimpanzees have featured prominently in this discussion, as the dominant approach used to identify culture in wild animals was first applied to them. This procedure, the 'method of exclusion,' begins by documenting behavioural differences between groups and then infers the existence of culture by eliminating ecological explanations for their occurrence. The validity of this approach has been questioned because genetic differences between groups have not explicitly been ruled out as a factor contributing to between-group differences in behaviour. Here we investigate this issue directly by analysing genetic and behavioural data from nine groups of wild chimpanzees. We find that the overall levels of genetic and behavioural dissimilarity between groups are highly and statistically significantly correlated. Additional analyses show that only a very small number of behaviours vary between genetically similar groups, and that there is no obvious pattern as to which classes of behaviours (e.g. tool-use versus communicative) have a distribution that matches patterns of between-group genetic dissimilarity. These results indicate that genetic dissimilarity cannot be eliminated as playing a major role in generating group differences in chimpanzee behaviour.

  1. Multidimensional Scaling Visualization Using Parametric Similarity Indices

    Directory of Open Access Journals (Sweden)

    J. A. Tenreiro Machado

    2015-03-01

    Full Text Available In this paper, we apply multidimensional scaling (MDS and parametric similarity indices (PSI in the analysis of complex systems (CS. Each CS is viewed as a dynamical system, exhibiting an output time-series to be interpreted as a manifestation of its behavior. We start by adopting a sliding window to sample the original data into several consecutive time periods. Second, we define a given PSI for tracking pieces of data. We then compare the windows for different values of the parameter, and we generate the corresponding MDS maps of ‘points’. Third, we use Procrustes analysis to linearly transform the MDS charts for maximum superposition and to build a globalMDS map of “shapes”. This final plot captures the time evolution of the phenomena and is sensitive to the PSI adopted. The generalized correlation, theMinkowski distance and four entropy-based indices are tested. The proposed approach is applied to the Dow Jones Industrial Average stock market index and the Europe Brent Spot Price FOB time-series.

  2. Exploring similarities among many species distributions

    Science.gov (United States)

    Simmerman, Scott; Wang, Jingyuan; Osborne, James; Shook, Kimberly; Huang, Jian; Godsoe, William; Simons, Theodore R.

    2012-01-01

    Collecting species presence data and then building models to predict species distribution has been long practiced in the field of ecology for the purpose of improving our understanding of species relationships with each other and with the environment. Due to limitations of computing power as well as limited means of using modeling software on HPC facilities, past species distribution studies have been unable to fully explore diverse data sets. We build a system that can, for the first time to our knowledge, leverage HPC to support effective exploration of species similarities in distribution as well as their dependencies on common environmental conditions. Our system can also compute and reveal uncertainties in the modeling results enabling domain experts to make informed judgments about the data. Our work was motivated by and centered around data collection efforts within the Great Smoky Mountains National Park that date back to the 1940s. Our findings present new research opportunities in ecology and produce actionable field-work items for biodiversity management personnel to include in their planning of daily management activities.

  3. Similarities and differences in vapor explosion criteria

    International Nuclear Information System (INIS)

    Cronenberg, A.W.

    1978-01-01

    An overview of recent ideas pertaining to vapor explosion criteria indicates that in general sense, a consensus of opinion is emerging on the conditions applicable to explosive vaporization. Experimental and theoretical work has lead a number of investigators to the formulation of such conditions which are quite similar in many respects, although the quantitative details of the model formulation of such conditions are somewhat different. All model concepts are consistent in that an initial period of stable film boiling, separating molten fuel from coolant, is considered necessary (at least for large-scale interactions and efficient intermixing), with subsequent breakdown of film boiling due to pressure and/or thermal effects, followed by intimate fuel-coolant contact and a rapid vaporization process which is sufficient to cause shock pressurization. Although differences arise as to the conditions for and the energetics associated with film boiling destabilization and the mode and energetics of fragmentation and intermixing. However, the principal area of difference seems to be the question of what constitutes the requisite condition(s) for rapid vapor production to cause shock pressurization

  4. Decoding the similarities and differences among mycobacterial species.

    Directory of Open Access Journals (Sweden)

    Sony Malhotra

    2017-08-01

    Full Text Available Mycobacteriaceae comprises pathogenic species such as Mycobacterium tuberculosis, M. leprae and M. abscessus, as well as non-pathogenic species, for example, M. smegmatis and M. thermoresistibile. Genome comparison and annotation studies provide insights into genome evolutionary relatedness, identify unique and pathogenicity-related genes in each species, and explore new targets that could be used for developing new diagnostics and therapeutics. Here, we present a comparative analysis of ten-mycobacterial genomes with the objective of identifying similarities and differences between pathogenic and non-pathogenic species. We identified 1080 core orthologous clusters that were enriched in proteins involved in amino acid and purine/pyrimidine biosynthetic pathways, DNA-related processes (replication, transcription, recombination and repair, RNA-methylation and modification, and cell-wall polysaccharide biosynthetic pathways. For their pathogenicity and survival in the host cell, pathogenic species have gained specific sets of genes involved in repair and protection of their genomic DNA. M. leprae is of special interest owing to its smallest genome (1600 genes and ~1300 psuedogenes, yet poor genome annotation. More than 75% of the pseudogenes were found to have a functional ortholog in the other mycobacterial genomes and belong to protein families such as transferases, oxidoreductases and hydrolases.

  5. Similarly shaped letters evoke similar colors in grapheme-color synesthesia.

    Science.gov (United States)

    Brang, David; Rouw, Romke; Ramachandran, V S; Coulson, Seana

    2011-04-01

    Grapheme-color synesthesia is a neurological condition in which viewing numbers or letters (graphemes) results in the concurrent sensation of color. While the anatomical substrates underlying this experience are well understood, little research to date has investigated factors influencing the particular colors associated with particular graphemes or how synesthesia occurs developmentally. A recent suggestion of such an interaction has been proposed in the cascaded cross-tuning (CCT) model of synesthesia, which posits that in synesthetes connections between grapheme regions and color area V4 participate in a competitive activation process, with synesthetic colors arising during the component-stage of grapheme processing. This model more directly suggests that graphemes sharing similar component features (lines, curves, etc.) should accordingly activate more similar synesthetic colors. To test this proposal, we created and regressed synesthetic color-similarity matrices for each of 52 synesthetes against a letter-confusability matrix, an unbiased measure of visual similarity among graphemes. Results of synesthetes' grapheme-color correspondences indeed revealed that more similarly shaped graphemes corresponded with more similar synesthetic colors, with stronger effects observed in individuals with more intense synesthetic experiences (projector synesthetes). These results support the CCT model of synesthesia, implicate early perceptual mechanisms as driving factors in the elicitation of synesthetic hues, and further highlight the relationship between conceptual and perceptual factors in this phenomenon. Copyright © 2011 Elsevier Ltd. All rights reserved.

  6. Asteroid clusters similar to asteroid pairs

    Science.gov (United States)

    Pravec, P.; Fatka, P.; Vokrouhlický, D.; Scheeres, D. J.; Kušnirák, P.; Hornoch, K.; Galád, A.; Vraštil, J.; Pray, D. P.; Krugly, Yu. N.; Gaftonyuk, N. M.; Inasaridze, R. Ya.; Ayvazian, V. R.; Kvaratskhelia, O. I.; Zhuzhunadze, V. T.; Husárik, M.; Cooney, W. R.; Gross, J.; Terrell, D.; Világi, J.; Kornoš, L.; Gajdoš, Š.; Burkhonov, O.; Ehgamberdiev, Sh. A.; Donchev, Z.; Borisov, G.; Bonev, T.; Rumyantsev, V. V.; Molotov, I. E.

    2018-04-01

    We studied the membership, size ratio and rotational properties of 13 asteroid clusters consisting of between 3 and 19 known members that are on similar heliocentric orbits. By backward integrations of their orbits, we confirmed their cluster membership and estimated times elapsed since separation of the secondaries (the smaller cluster members) from the primary (i.e., cluster age) that are between 105 and a few 106 years. We ran photometric observations for all the cluster primaries and a sample of secondaries and we derived their accurate absolute magnitudes and rotation periods. We found that 11 of the 13 clusters follow the same trend of primary rotation period vs mass ratio as asteroid pairs that was revealed by Pravec et al. (2010). We generalized the model of the post-fission system for asteroid pairs by Pravec et al. (2010) to a system of N components formed by rotational fission and we found excellent agreement between the data for the 11 asteroid clusters and the prediction from the theory of their formation by rotational fission. The two exceptions are the high-mass ratio (q > 0.7) clusters of (18777) Hobson and (22280) Mandragora for which a different formation mechanism is needed. Two candidate mechanisms for formation of more than one secondary by rotational fission were published: the secondary fission process proposed by Jacobson and Scheeres (2011) and a cratering collision event onto a nearly critically rotating primary proposed by Vokrouhlický et al. (2017). It will have to be revealed from future studies which of the clusters were formed by one or the other process. To that point, we found certain further interesting properties and features of the asteroid clusters that place constraints on the theories of their formation, among them the most intriguing being the possibility of a cascade disruption for some of the clusters.

  7. Expanding the boundaries of local similarity analysis.

    Science.gov (United States)

    Durno, W Evan; Hanson, Niels W; Konwar, Kishori M; Hallam, Steven J

    2013-01-01

    Pairwise comparison of time series data for both local and time-lagged relationships is a computationally challenging problem relevant to many fields of inquiry. The Local Similarity Analysis (LSA) statistic identifies the existence of local and lagged relationships, but determining significance through a p-value has been algorithmically cumbersome due to an intensive permutation test, shuffling rows and columns and repeatedly calculating the statistic. Furthermore, this p-value is calculated with the assumption of normality -- a statistical luxury dissociated from most real world datasets. To improve the performance of LSA on big datasets, an asymptotic upper bound on the p-value calculation was derived without the assumption of normality. This change in the bound calculation markedly improved computational speed from O(pm²n) to O(m²n), where p is the number of permutations in a permutation test, m is the number of time series, and n is the length of each time series. The bounding process is implemented as a computationally efficient software package, FASTLSA, written in C and optimized for threading on multi-core computers, improving its practical computation time. We computationally compare our approach to previous implementations of LSA, demonstrate broad applicability by analyzing time series data from public health, microbial ecology, and social media, and visualize resulting networks using the Cytoscape software. The FASTLSA software package expands the boundaries of LSA allowing analysis on datasets with millions of co-varying time series. Mapping metadata onto force-directed graphs derived from FASTLSA allows investigators to view correlated cliques and explore previously unrecognized network relationships. The software is freely available for download at: http://www.cmde.science.ubc.ca/hallam/fastLSA/.

  8. UNSOLVED AND LATENT CRIME: DIFFERENCES AND SIMILARITIES

    Directory of Open Access Journals (Sweden)

    Mikhail Kleymenov

    2017-01-01

    Full Text Available УДК 343Purpose of the article is to study the specific legal and informational nature of the unsolved crime in comparison with the phenomenon of delinquency, special study and analysis to improve the efficiency of law enforcement.Methods of research are abstract-logical, systematic, statistical, study of documents. The main results of research. Unsolved crime has specific legal, statistical and informational na-ture as the crime phenomenon, which is expressed in cumulative statistical population of unsolved crimes. An array of unsolved crimes is the sum of the number of acts, things of which is suspended and not terminated. The fault of the perpetrator in these cases is not proven, they are not considered by the court, it is not a conviction. Unsolved crime must be registered. Latent crime has a different informational nature. The main symptom of latent crimes is the uncertainty for the subjects of law enforcement, which delegated functions of identification, registration and accounting. Latent crime is not recorded. At the same time, there is a "border" area between the latent and unsolved crimes, which includes covered from the account of the crime. In modern Russia the majority of crimes covered from accounting by passing the decision about refusal in excitation of criminal case. Unsolved crime on their criminogenic consequences represents a significant danger to the public is higher compared to latent crime.It is conducted in the article a special analysis of the differences and similarities in the unsolved latent crime for the first time in criminological literature.The analysis proves the need for radical changes in the current Russian assessment of the state of crime and law enforcement to solve crimes. The article argues that an unsolved crime is a separate and, in contrast to latent crime, poorly understood phenomenon. However unsolved latent crime and have common features and areas of interaction.

  9. Self-similar pattern formation and continuous mechanics of self-similar systems

    Directory of Open Access Journals (Sweden)

    A. V. Dyskin

    2007-01-01

    Full Text Available In many cases, the critical state of systems that reached the threshold is characterised by self-similar pattern formation. We produce an example of pattern formation of this kind – formation of self-similar distribution of interacting fractures. Their formation starts with the crack growth due to the action of stress fluctuations. It is shown that even when the fluctuations have zero average the cracks generated by them could grow far beyond the scale of stress fluctuations. Further development of the fracture system is controlled by crack interaction leading to the emergence of self-similar crack distributions. As a result, the medium with fractures becomes discontinuous at any scale. We develop a continuum fractal mechanics to model its physical behaviour. We introduce a continuous sequence of continua of increasing scales covering this range of scales. The continuum of each scale is specified by the representative averaging volume elements of the corresponding size. These elements determine the resolution of the continuum. Each continuum hides the cracks of scales smaller than the volume element size while larger fractures are modelled explicitly. Using the developed formalism we investigate the stability of self-similar crack distributions with respect to crack growth and show that while the self-similar distribution of isotropically oriented cracks is stable, the distribution of parallel cracks is not. For the isotropically oriented cracks scaling of permeability is determined. For permeable materials (rocks with self-similar crack distributions permeability scales as cube of crack radius. This property could be used for detecting this specific mechanism of formation of self-similar crack distributions.

  10. On fuzzy semantic similarity measure for DNA coding.

    Science.gov (United States)

    Ahmad, Muneer; Jung, Low Tang; Bhuiyan, Md Al-Amin

    2016-02-01

    A coding measure scheme numerically translates the DNA sequence to a time domain signal for protein coding regions identification. A number of coding measure schemes based on numerology, geometry, fixed mapping, statistical characteristics and chemical attributes of nucleotides have been proposed in recent decades. Such coding measure schemes lack the biologically meaningful aspects of nucleotide data and hence do not significantly discriminate coding regions from non-coding regions. This paper presents a novel fuzzy semantic similarity measure (FSSM) coding scheme centering on FSSM codons׳ clustering and genetic code context of nucleotides. Certain natural characteristics of nucleotides i.e. appearance as a unique combination of triplets, preserving special structure and occurrence, and ability to own and share density distributions in codons have been exploited in FSSM. The nucleotides׳ fuzzy behaviors, semantic similarities and defuzzification based on the center of gravity of nucleotides revealed a strong correlation between nucleotides in codons. The proposed FSSM coding scheme attains a significant enhancement in coding regions identification i.e. 36-133% as compared to other existing coding measure schemes tested over more than 250 benchmarked and randomly taken DNA datasets of different organisms. Copyright © 2015 Elsevier Ltd. All rights reserved.

  11. Similar Symmetries: The Role of Wallpaper Groups in Perceptual Texture Similarity

    Directory of Open Access Journals (Sweden)

    Fraser Halley

    2011-05-01

    Full Text Available Periodic patterns and symmetries are striking visual properties that have been used decoratively around the world throughout human history. Periodic patterns can be mathematically classified into one of 17 different Wallpaper groups, and while computational models have been developed which can extract an image's symmetry group, very little work has been done on how humans perceive these patterns. This study presents the results from a grouping experiment using stimuli from the different wallpaper groups. We find that while different images from the same wallpaper group are perceived as similar to one another, not all groups have the same degree of self-similarity. The similarity relationships between wallpaper groups appear to be dominated by rotations.

  12. Similarity and self-similarity in high energy density physics: application to laboratory astrophysics

    International Nuclear Information System (INIS)

    Falize, E.

    2008-10-01

    The spectacular recent development of powerful facilities allows the astrophysical community to explore, in laboratory, astrophysical phenomena where radiation and matter are strongly coupled. The titles of the nine chapters of the thesis are: from high energy density physics to laboratory astrophysics; Lie groups, invariance and self-similarity; scaling laws and similarity properties in High-Energy-Density physics; the Burgan-Feix-Munier transformation; dynamics of polytropic gases; stationary radiating shocks and the POLAR project; structure, dynamics and stability of optically thin fluids; from young star jets to laboratory jets; modelling and experiences for laboratory jets

  13. Sequence Similarity Presenter: a tool for the graphic display of similarities of long sequences for use in presentations.

    Science.gov (United States)

    Fröhlich, K U

    1994-04-01

    A new method for the presentation of alignments of long sequences is described. The degree of identity for the aligned sequences is averaged for sections of a fixed number of residues. The resulting values are converted to shades of gray, with white corresponding to lack of identity and black corresponding to perfect identity. A sequence alignment is represented as a bar filled with varying shades of gray. The display is compact and allows for a fast and intuitive recognition of the distribution of regions with a high similarity. It is well suited for the presentation of alignments of long sequences, e.g. of protein superfamilies, in plenary lectures. The method is implemented as a HyperCard stack for Apple Macintosh computers. Several options for the modification of the output are available (e.g. background reduction, size of the summation window, consideration of amino acid similarity, inclusion of graphic markers to indicate specific domains). The output is a PostScript file which can be printed, imported as EPS or processed further with Adobe Illustrator.

  14. Improved cosine similarity measures of simplified neutrosophic setsfor medical diagnoses

    OpenAIRE

    Jun Ye

    2014-01-01

    In pattern recognition and medical diagnosis, similarity measure is an important mathematicaltool. To overcome some disadvantages of existing cosine similarity measures of simplified neutrosophicsets (SNSs) in vector space, this paper proposed improved cosine similarity measures of SNSs based oncosine function, including single valued neutrosophic cosine similarity measures and interval neutro-sophic cosine similarity measures. Then, weighted cosine similarity measures of SNSs were introduced...

  15. When high similarity copycats lose and moderate similarity copycats gain: The impact of comparative evaluation

    NARCIS (Netherlands)

    Van Horen, F.; Pieters, R.

    2012-01-01

    Copycats imitate features of leading brands to free ride on their equity. The prevailing belief is that the more similar copycats are to the leader brand, the more positive their evaluation is, and thus the more they free ride. Three studies demonstrate when the reverse holds true:

  16. When high similarity copycats lose and moderate similarity copycats gain : The impact of comparative evaluation

    NARCIS (Netherlands)

    van Horen, F.; Pieters, R.

    2012-01-01

    Copycats imitate features of leading brands to free ride on their equity. The prevailing belief is that the more similar copycats are to the leader brand, the more positive their evaluation is, and thus the more they free ride. Three studies demonstrate when the reverse holds true:

  17. A new measure for functional similarity of gene products based on Gene Ontology

    Directory of Open Access Journals (Sweden)

    Lengauer Thomas

    2006-06-01

    Full Text Available Abstract Background Gene Ontology (GO is a standard vocabulary of functional terms and allows for coherent annotation of gene products. These annotations provide a basis for new methods that compare gene products regarding their molecular function and biological role. Results We present a new method for comparing sets of GO terms and for assessing the functional similarity of gene products. The method relies on two semantic similarity measures; simRel and funSim. One measure (simRel is applied in the comparison of the biological processes found in different groups of organisms. The other measure (funSim is used to find functionally related gene products within the same or between different genomes. Results indicate that the method, in addition to being in good agreement with established sequence similarity approaches, also provides a means for the identification of functionally related proteins independent of evolutionary relationships. The method is also applied to estimating functional similarity between all proteins in Saccharomyces cerevisiae and to visualizing the molecular function space of yeast in a map of the functional space. A similar approach is used to visualize the functional relationships between protein families. Conclusion The approach enables the comparison of the underlying molecular biology of different taxonomic groups and provides a new comparative genomics tool identifying functionally related gene products independent of homology. The proposed map of the functional space provides a new global view on the functional relationships between gene products or protein families.

  18. PROTEIN - WHICH IS BEST?

    Directory of Open Access Journals (Sweden)

    Michael J. Falvo

    2004-09-01

    Full Text Available Protein intake that exceeds the recommended daily allowance is widely accepted for both endurance and power athletes. However, considering the variety of proteins that are available much less is known concerning the benefits of consuming one protein versus another. The purpose of this paper is to identify and analyze key factors in order to make responsible recommendations to both the general and athletic populations. Evaluation of a protein is fundamental in determining its appropriateness in the human diet. Proteins that are of inferior content and digestibility are important to recognize and restrict or limit in the diet. Similarly, such knowledge will provide an ability to identify proteins that provide the greatest benefit and should be consumed. The various techniques utilized to rate protein will be discussed. Traditionally, sources of dietary protein are seen as either being of animal or vegetable origin. Animal sources provide a complete source of protein (i.e. containing all essential amino acids, whereas vegetable sources generally lack one or more of the essential amino acids. Animal sources of dietary protein, despite providing a complete protein and numerous vitamins and minerals, have some health professionals concerned about the amount of saturated fat common in these foods compared to vegetable sources. The advent of processing techniques has shifted some of this attention and ignited the sports supplement marketplace with derivative products such as whey, casein and soy. Individually, these products vary in quality and applicability to certain populations. The benefits that these particular proteins possess are discussed. In addition, the impact that elevated protein consumption has on health and safety issues (i.e. bone health, renal function are also reviewed

  19. Why fibrous proteins are romantic.

    Science.gov (United States)

    Cohen, C

    1998-01-01

    Here I give a personal account of the great history of fibrous protein structure. I describe how Astbury first recognized the essential simplicity of fibrous proteins and their paradigmatic role in protein structure. The poor diffraction patterns yielded by these proteins were then deciphered by Pauling, Crick, Ramachandran and others (in part by model building) to reveal alpha-helical coiled coils, beta-sheets, and the collagen triple helical coiled coil-all characterized by different local sequence periodicities. Longer-range sequence periodicities (or "magic numbers") present in diverse fibrous proteins, such as collagen, tropomyosin, paramyosin, myosin, and were then shown to account for the characteristic axial repeats observed in filaments of these proteins. More recently, analysis of fibrous protein structure has been extended in many cases to atomic resolution, and some systems, such as "leucine zippers," are providing a deeper understanding of protein design than similar studies of globular proteins. In the last sections, I provide some dramatic examples of fibrous protein dynamics. One example is the so-called "spring-loaded" mechanism for viral fusion by the hemagglutinin protein of influenza. Another is the possible conformational changes in prion proteins, implicated in "mad cow disease," which may be related to similar transitions in a variety of globular and fibrous proteins. Copyright 1998 Academic Press.

  20. Application of clustering methods: Regularized Markov clustering (R-MCL) for analyzing dengue virus similarity

    Science.gov (United States)

    Lestari, D.; Raharjo, D.; Bustamam, A.; Abdillah, B.; Widhianto, W.

    2017-07-01

    Dengue virus consists of 10 different constituent proteins and are classified into 4 major serotypes (DEN 1 - DEN 4). This study was designed to perform clustering against 30 protein sequences of dengue virus taken from Virus Pathogen Database and Analysis Resource (VIPR) using Regularized Markov Clustering (R-MCL) algorithm and then we analyze the result. By using Python program 3.4, R-MCL algorithm produces 8 clusters with more than one centroid in several clusters. The number of centroid shows the density level of interaction. Protein interactions that are connected in a tissue, form a complex protein that serves as a specific biological process unit. The analysis of result shows the R-MCL clustering produces clusters of dengue virus family based on the similarity role of their constituent protein, regardless of serotypes.

  1. Engaging narratives evoke similar neural activity and lead to similar time perception.

    Science.gov (United States)

    Cohen, Samantha S; Henin, Simon; Parra, Lucas C

    2017-07-04

    It is said that we lose track of time - that "time flies" - when we are engrossed in a story. How does engagement with the story cause this distorted perception of time, and what are its neural correlates? People commit both time and attentional resources to an engaging stimulus. For narrative videos, attentional engagement can be represented as the level of similarity between the electroencephalographic responses of different viewers. Here we show that this measure of neural engagement predicted the duration of time that viewers were willing to commit to narrative videos. Contrary to popular wisdom, engagement did not distort the average perception of time duration. Rather, more similar brain responses resulted in a more uniform perception of time across viewers. These findings suggest that by capturing the attention of an audience, narrative videos bring both neural processing and the subjective perception of time into synchrony.

  2. A Similarity Analysis of Audio Signal to Develop a Human Activity Recognition Using Similarity Networks

    Directory of Open Access Journals (Sweden)

    Alejandra García-Hernández

    2017-11-01

    Full Text Available Human Activity Recognition (HAR is one of the main subjects of study in the areas of computer vision and machine learning due to the great benefits that can be achieved. Examples of the study areas are: health prevention, security and surveillance, automotive research, and many others. The proposed approaches are carried out using machine learning techniques and present good results. However, it is difficult to observe how the descriptors of human activities are grouped. In order to obtain a better understanding of the the behavior of descriptors, it is important to improve the abilities to recognize the human activities. This paper proposes a novel approach for the HAR based on acoustic data and similarity networks. In this approach, we were able to characterize the sound of the activities and identify those activities looking for similarity in the sound pattern. We evaluated the similarity of the sounds considering mainly two features: the sound location and the materials that were used. As a result, the materials are a good reference classifying the human activities compared with the location.

  3. Phonological similarity and orthographic similarity affect probed serial recall of Chinese characters.

    Science.gov (United States)

    Lin, Yi-Chen; Chen, Hsiang-Yu; Lai, Yvonne C; Wu, Denise H

    2015-04-01

    The previous literature on working memory (WM) has indicated that verbal materials are dominantly retained in phonological representations, whereas other linguistic information (e.g., orthography, semantics) only contributes to verbal WM minimally, if not negligibly. Although accumulating evidence has suggested that multiple linguistic components jointly support verbal WM, the visual/orthographic contribution has rarely been addressed in alphabetic languages, possibly due to the difficulty of dissociating the effects of word forms from the effects of their pronunciations in relatively shallow orthography. In the present study, we examined whether the orthographic representations of Chinese characters support the retention of verbal materials in this language of deep orthography. In Experiments 1a and 2, we independently manipulated the phonological and orthographic similarity of horizontal and vertical characters, respectively, and found that participants' accuracy of probed serial recall was reduced by both similar pronunciations and shared phonetic radicals in the to-be-remembered stimuli. Moreover, Experiment 1b showed that only the effect of phonological, but not that of orthographic, similarity was affected by concurrent articulatory suppression. Taken together, the present results indicate the indispensable contribution of orthographic representations to verbal WM of Chinese characters, and suggest that the linguistic characteristics of a specific language not only determine long-term linguistic-processing mechanisms, but also delineate the organization of verbal WM for that language.

  4. Similar or different? The role of the ventrolateral prefrontal cortex in similarity detection.

    Directory of Open Access Journals (Sweden)

    Béatrice Garcin

    Full Text Available Patients with frontal lobe syndrome can exhibit two types of abnormal behaviour when asked to place a banana and an orange in a single category: some patients categorize them at a concrete level (e.g., "both have peel", while others continue to look for differences between these objects (e.g., "one is yellow, the other is orange". These observations raise the question of whether abstraction and similarity detection are distinct processes involved in abstract categorization, and that depend on separate areas of the prefrontal cortex (PFC. We designed an original experimental paradigm for a functional magnetic resonance imaging (fMRI study involving healthy subjects, confirming the existence of two distinct processes relying on different prefrontal areas, and thus explaining the behavioural dissociation in frontal lesion patients. We showed that: 1 Similarity detection involves the anterior ventrolateral PFC bilaterally with a right-left asymmetry: the right anterior ventrolateral PFC is only engaged in detecting physical similarities; 2 Abstraction per se activates the left dorsolateral PFC.

  5. [-25]A Similarity Analysis of Audio Signal to Develop a Human Activity Recognition Using Similarity Networks.

    Science.gov (United States)

    García-Hernández, Alejandra; Galván-Tejada, Carlos E; Galván-Tejada, Jorge I; Celaya-Padilla, José M; Gamboa-Rosales, Hamurabi; Velasco-Elizondo, Perla; Cárdenas-Vargas, Rogelio

    2017-11-21

    Human Activity Recognition (HAR) is one of the main subjects of study in the areas of computer vision and machine learning due to the great benefits that can be achieved. Examples of the study areas are: health prevention, security and surveillance, automotive research, and many others. The proposed approaches are carried out using machine learning techniques and present good results. However, it is difficult to observe how the descriptors of human activities are grouped. In order to obtain a better understanding of the the behavior of descriptors, it is important to improve the abilities to recognize the human activities. This paper proposes a novel approach for the HAR based on acoustic data and similarity networks. In this approach, we were able to characterize the sound of the activities and identify those activities looking for similarity in the sound pattern. We evaluated the similarity of the sounds considering mainly two features: the sound location and the materials that were used. As a result, the materials are a good reference classifying the human activities compared with the location.

  6. ProDis-ContSHC: Learning protein dissimilarity measures and hierarchical context coherently for protein-protein comparison in protein database retrieval

    KAUST Repository

    Wang, Jim Jing-Yan; Gao, Xin; Wang, Quanquan; Li, Yongping

    2012-01-01

    Background: The need to retrieve or classify protein molecules using structure or sequence-based similarity measures underlies a wide range of biomedical applications. Traditional protein search methods rely on a pairwise dissimilarity

  7. Similarities in Self-Assembly of Proteins and Surfactants: an Attempt to Bridge the Gap

    NARCIS (Netherlands)

    Linden, van der E.; Venema, P.

    2007-01-01

    The area of surfactant self assembly has already received attention for more than half a century. Considerable progress has been made in regards to connecting the molecular properties to the assembly morphology and the phase behaviour, where a multitude of different (rather exotic) types of

  8. Similar phenotypes caused by mutations in OTOG and OTOGL

    Science.gov (United States)

    Oonk, Anne M.M.; Leijendeckers, Joop M.; Huygen, Patrick L.M.; Schraders, Margit; del Campo, Miguel; del Castillo, Ignacio; Tekin, Mustafa; Feenstra, Ilse; Beynon, Andy J.; Kunst, Henricus P.M.; Snik, Ad F.M.; Kremer, Hannie; Admiraal, Ronald J.C.; Pennings, Ronald J.E.

    2013-01-01

    Objectives recently, OTOG and OTOGL were identified as human deafness genes. Currently, only four families are known to have autosomal recessive hearing loss based on mutations in these genes. Since the two genes code for proteins (otogelin and otogelin-like) that are strikingly similar in structure and localization in the inner ear, this study is focused on characterizing and comparing the hearing loss caused by mutations in these genes. Design To evaluate this type of hearing, an extensive set of audiometric and vestibular examinations was performed in the 13 patients from four families. Results all families show a flat to downsloping configuration of the audiogram with mild to moderate sensorineural hearing loss. Speech recognition scores remain good (>90%). Hearing loss is not significantly different in the four families and the psychophysical test results also do not differ between the families. Vestibular examinations show evidence for vestibular hyporeflexia. Conclusion since otogelin and otogelin-like are localized in the tectorial membrane, one could expect a cochlear conductive hearing loss, as was previously shown in DFNA13 (COL11A2) and DFNA8/12 (TECTA) patients. Results of psychophysical examinations, however, do not support this. Furthermore, the authors can conclude that there are no phenotypic differences between hearing loss based on mutations in OTOG or OTOGL. This phenotype description will facilitate counseling of hearing loss caused by defects in either of these two genes. PMID:24378291

  9. Defining a similarity threshold for a functional proteinsequence pattern: The signal peptide cleavage site

    DEFF Research Database (Denmark)

    Nielsen, Henrik; Engelbrecht, Jacob; von Heijne, Gunnar

    1996-01-01

    When preparing data sets of amino acid or nucleotide sequences it is necessary to exclude redundant or homologous sequences in order to avoid overestimating the predictive performance of an algorithm. For some time methods for doing this have been available in the area of protein structure...... prediction. We have developed a similar procedure based on pair-wise alignments for sequences with functional sites. We show how a correlation coefficient between sequence similarity and functional homology can be used to compare the efficiency of different similarity measures and choose a nonarbitrary...

  10. Chemical shift homology in proteins

    International Nuclear Information System (INIS)

    Potts, Barbara C.M.; Chazin, Walter J.

    1998-01-01

    The degree of chemical shift similarity for homologous proteins has been determined from a chemical shift database of over 50 proteins representing a variety of families and folds, and spanning a wide range of sequence homologies. After sequence alignment, the similarity of the secondary chemical shifts of C α protons was examined as a function of amino acid sequence identity for 37 pairs of structurally homologous proteins. A correlation between sequence identity and secondary chemical shift rmsd was observed. Important insights are provided by examining the sequence identity of homologous proteins versus percentage of secondary chemical shifts that fall within 0.1 and 0.3 ppm thresholds. These results begin to establish practical guidelines for the extent of chemical shift similarity to expect among structurally homologous proteins

  11. General protein-protein cross-linking.

    Science.gov (United States)

    Alegria-Schaffer, Alice

    2014-01-01

    This protocol describes a general protein-to-protein cross-linking procedure using the water-soluble amine-reactive homobifunctional BS(3) (bis[sulfosuccinimidyl] suberate); however, the protocol can be easily adapted using other cross-linkers of similar properties. BS(3) is composed of two sulfo-NHS ester groups and an 11.4 Å linker. Sulfo-NHS ester groups react with primary amines in slightly alkaline conditions (pH 7.2-8.5) and yield stable amide bonds. The reaction releases N-hydroxysuccinimide (see an application of NHS esters on Labeling a protein with fluorophores using NHS ester derivitization). © 2014 Elsevier Inc. All rights reserved.

  12. Total protein

    Science.gov (United States)

    ... page: //medlineplus.gov/ency/article/003483.htm Total protein To use the sharing features on this page, please enable JavaScript. The total protein test measures the total amount of two classes ...

  13. Proteins engineering

    International Nuclear Information System (INIS)

    2000-01-01

    At the - Departement d'Ingenierie et d'etudes de proteines (Deip) of the CEA more than seventy researchers are working hard to understand the function of proteins. For that they use the molecular labelling technique (F.M.)

  14. Whey Protein

    Science.gov (United States)

    ... reliable information about the safety of taking whey protein if you are pregnant or breast feeding. Stay on the safe side and avoid use. Milk allergy: If you are allergic to cow's milk, avoid using whey protein.

  15. Determination of subjective similarity for pairs of masses and pairs of clustered microcalcifications on mammograms: Comparison of similarity ranking scores and absolute similarity ratings

    International Nuclear Information System (INIS)

    Muramatsu, Chisako; Li Qiang; Schmidt, Robert A.; Shiraishi, Junji; Suzuki, Kenji; Newstead, Gillian M.; Doi, Kunio

    2007-01-01

    The presentation of images that are similar to that of an unknown lesion seen on a mammogram may be helpful for radiologists to correctly diagnose that lesion. For similar images to be useful, they must be quite similar from the radiologists' point of view. We have been trying to quantify the radiologists' impression of similarity for pairs of lesions and to establish a ''gold standard'' for development and evaluation of a computerized scheme for selecting such similar images. However, it is considered difficult to reliably and accurately determine similarity ratings, because they are subjective. In this study, we compared the subjective similarities obtained by two different methods, an absolute rating method and a 2-alternative forced-choice (2AFC) method, to demonstrate that reliable similarity ratings can be determined by the responses of a group of radiologists. The absolute similarity ratings were previously obtained for pairs of masses and pairs of microcalcifications from five and nine radiologists, respectively. In this study, similarity ranking scores for eight pairs of masses and eight pairs of microcalcifications were determined by use of the 2AFC method. In the first session, the eight pairs of masses and eight pairs of microcalcifications were grouped and compared separately for determining the similarity ranking scores. In the second session, another similarity ranking score was determined by use of mixed pairs, i.e., by comparison of the similarity of a mass pair with that of a calcification pair. Four pairs of masses and four pairs of microcalcifications were grouped together to create two sets of eight pairs. The average absolute similarity ratings and the average similarity ranking scores showed very good correlations in the first study (Pearson's correlation coefficients: 0.94 and 0.98 for masses and microcalcifications, respectively). Moreover, in the second study, the correlations between the absolute ratings and the ranking scores were also

  16. Characterization of a Smad motif similar to Drosophila mad in the mouse Msx 1 promoter.

    Science.gov (United States)

    Alvarez Martinez, Cristina E; Binato, Renata; Gonzalez, Sayonara; Pereira, Monica; Robert, Benoit; Abdelhay, Eliana

    2002-03-01

    Mouse Msx 1 gene, orthologous of the Drosophila msh, is involved in several developmental processes. BMP family members are major proteins in the regulation of Msx 1 expression. BMP signaling activates Smad 1/5/8 proteins, which associate to Smad 4 before translocating to the nucleus. Analysis of Msx 1 promoter revealed the presence of three elements similar to the consensus established for Mad, the Smad 1 Drosophila counterpart. Notably, such an element was identified in an enhancer important for Msx 1 regulation. Gel shift analysis demonstrated that proteins from 13.5 dpc embryo associate to this enhancer. Remarkably, supershift assays showed that Smad proteins are present in the complex. Purified Smad 1 and 4 also bind to this fragment. We demonstrate that functional binding sites in this enhancer are confined to the Mad motif and flanking region. Our data suggest that this Mad motif may be functional in response to BMP signaling. ©2002 Elsevier Science (USA).

  17. Neutrosophic Refined Similarity Measure Based on Cosine Function

    Directory of Open Access Journals (Sweden)

    Said Broumi

    2014-12-01

    Full Text Available In this paper, the cosine similarity measure of neutrosophic refined (multi- sets is proposed and its properties are studied. The concept of this cosine similarity measure of neutrosophic refined sets is the extension of improved cosine similarity measure of single valued neutrosophic. Finally, using this cosine similarity measure of neutrosophic refined set, the application of medical diagnosis is presented.

  18. Composition of Overlapping Protein-Protein and Protein-Ligand Interfaces.

    Directory of Open Access Journals (Sweden)

    Ruzianisra Mohamed

    Full Text Available Protein-protein interactions (PPIs play a major role in many biological processes and they represent an important class of targets for therapeutic intervention. However, targeting PPIs is challenging because often no convenient natural substrates are available as starting point for small-molecule design. Here, we explored the characteristics of protein interfaces in five non-redundant datasets of 174 protein-protein (PP complexes, and 161 protein-ligand (PL complexes from the ABC database, 436 PP complexes, and 196 PL complexes from the PIBASE database and a dataset of 89 PL complexes from the Timbal database. In all cases, the small molecule ligands must bind at the respective PP interface. We observed similar amino acid frequencies in all three datasets. Remarkably, also the characteristics of PP contacts and overlapping PL contacts are highly similar.

  19. Similar nature of ionic imbalances in cardiovascular and renal disorders

    International Nuclear Information System (INIS)

    Shahid, S.M.; Jawed, M.; Akram, H.; Mahboob, T.

    2004-01-01

    Background: Several studies have reported improper ionic environment in cardiovascular and renal patients but how the diseases are associated on ionic basis is still not clear. Objective: The present study was aimed to investigate sodium and potassium concentrations and their transport abnormalities in cardiovascular and renal patients. Patients and Methods: Thirty patients of various cardiovascular and thirty patients of various renal disorders (53.33% males, 46.67% females) were selected. Erythrocytes were isolated from freshly drawn blood samples, washed and used for the estimation of sodium and potassium levels using flame photometer (Corning 410). Serum sodium and potassium were measured by flame photometer. RBC membranes were prepared for the estimation of Na/sup +/-K/sup +/-ATPase activity in terms of inorganic phosphate released/mg protein/hour. Results: Intra-erythrocyte and serum sodium and potassium concentrations and Na/sup +/-K/sup +/-ATPase activity were different in cardiovascular and renal patients from controls. Intra-erythrocyte sodium level was increased significantly (P<0.01) in cardiovascular patients and non-significantly in renal patients as compared to controls. Na/sup +/-K/sup +/-ATPase activity and serum sodium level were decreased significantly (P<0.01) in both the groups as compared to controls. Serum potassium was found to be decreased significantly (P<0.01) in cardiovascular patients whereas it was raised significantly (P<0.01) in renal patients as compared to control subjects. Conclusion: The results indicated similar nature of ionic and electrolyte imbalances in cardiovascular and renal disorders resulting from impaired Na/sup +/-K/sup +/-ATPase system. Further investigations in the same area, may be of help to establish an understanding of the progression of diseases, associated complications and the preventive steps that should-be taken to arrest the progression of these disorders. (author)

  20. Protein trapping of nanoparticles

    International Nuclear Information System (INIS)

    Ang, Joo C.; Lin, Jack M.; Yaron, Peter N.; White, John W.

    2009-01-01

    Full text: We have observed the formation of protein-nanoparticle complexes at the air-water interfaces from three different methods of presenting the nanoparticles to proteins. The structures formed resemble the 'protein-nanoparticle corona' proposed by Lynch et al. [1-3) in relation to a possible route for nanoparticle entry into living cells. To do this, the methods of x-ray and neutron reflectivity (with isotopic contrast variation between the protein and nanoparticles) have been used to study the structures formed at the air-water interface of l 3 - casein presented to silica nanoparticle dispersions. Whilst the silica dispersions showed no observable reflectivity, strong signals appear in the reflectivity when protein is present. Drop-wise spreading of a small amount of protein at the air-silica sol interface and presentation of the silica sol to an isolated monomolecular protein film (made by the 'flow-trough' method [4]) gave an immediate signal. Mixing the components in solution only produces a slow response but in all cases a similar structure is formed. The different responses are interpreted in structural and stoichiometric ways.

  1. Cuticular proteins from the horseshoe crab, Limulus polyphemus

    DEFF Research Database (Denmark)

    Ditzel, Nicholas; Andersen, Svend Olav; Højrup, Peter

    2003-01-01

    Proteins were purified from the carapace cuticle of a juvenile horseshoe crab, Limulus polyphemus, and several of them were characterized by amino acid sequence determination. The proteins are small (7-16 kDa) and their isoelectric points range from 6.5 to 9.2. They have high contents of tyrosine......, ranging from 13.5 to 35.4%. Some of the proteins show sequence similarity to cuticular proteins from other arthropod groups, with the most pronounced similarity to proteins from the cuticle of the spider Araneus diadematus. Two proteins show sequence similarity to a hexamerin storage protein from Blaberus...

  2. Chlamydia trachomatis Mip-like protein

    DEFF Research Database (Denmark)

    Lundemose, AG; Rousch, DA; Birkelund, Svend

    1992-01-01

    venereum (LGV) biovar) is presented. The sequence shows high similarity to the legionella Mip protein and its C-terminal region, like that of the legionella Mip, has high amino acid similarity to eukaryotic and prokaryotic FK506-binding proteins. The chlamydial mip-like gene was detected by polymerase...

  3. Protein complex prediction in large ontology attributed protein-protein interaction networks.

    Science.gov (United States)

    Zhang, Yijia; Lin, Hongfei; Yang, Zhihao; Wang, Jian; Li, Yanpeng; Xu, Bo

    2013-01-01

    Protein complexes are important for unraveling the secrets of cellular organization and function. Many computational approaches have been developed to predict protein complexes in protein-protein interaction (PPI) networks. However, most existing approaches focus mainly on the topological structure of PPI networks, and largely ignore the gene ontology (GO) annotation information. In this paper, we constructed ontology attributed PPI networks with PPI data and GO resource. After constructing ontology attributed networks, we proposed a novel approach called CSO (clustering based on network structure and ontology attribute similarity). Structural information and GO attribute information are complementary in ontology attributed networks. CSO can effectively take advantage of the correlation between frequent GO annotation sets and the dense subgraph for protein complex prediction. Our proposed CSO approach was applied to four different yeast PPI data sets and predicted many well-known protein complexes. The experimental results showed that CSO was valuable in predicting protein complexes and achieved state-of-the-art performance.

  4. Lineup member similarity effects on children's eyewitness identification

    OpenAIRE

    Fitzgerald, Ryan J.; Whiting, Brittany F.; Therrien, Natalie M.; Price, Heather L.

    2014-01-01

    To date, research investigating the similarity among lineup members has focused on adult eyewitnesses. In the present research, children made identifications from lineups containing members of lower or higher similarity to a target person. In Experiment 1, following a live interaction, children's (6–14 years) correct identification rate was reduced in higher-similarity relative to lower-similarity lineups. In Experiment 2, children (6–12 years) and adults watched a video containing a target p...

  5. Resource use by two morphologically similar insectivorous bats ...

    African Journals Online (AJOL)

    Studies of morphologically dissimilar insectivorous bats have lead to the conclusion that morphology is the prime correlate of habitat use, and consequently of diet. This has lead to the prediction that morphologically similar bats should have similar diets. We examined the diet and morphology of two morphologically similar ...

  6. Articulation of Phonologically Similar Items Disrupts Free Recall of Nonwords

    Science.gov (United States)

    Nishiyama, Ryoji; Ukita, Jun

    2013-01-01

    The present study sought to clarify whether phonological similarity of encoded information impairs free recall performance (the phonological similarity effect: PSE) for nonwords. Five experiments examined the influence of the encoding process on the PSE in a step-by-step fashion, by using lists that consisted of phonologically similar (decoy)…

  7. Detecting Distortion: Bridging Visual and Quantitative Reasoning on Similarity Tasks

    Science.gov (United States)

    Cox, Dana C.; Lo, Jane-Jane

    2014-01-01

    This study is focused on identifying and describing the reasoning patterns of middle grade students when examining potentially similar figures. Described here is a framework that includes 11 strategies that students used during clinical interview to differentiate similar and non-similar figures. Two factors were found to influence the strategies…

  8. 7 CFR 51.632 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.632 Section 51.632 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards..., and Arizona) Definitions § 51.632 Similar varietal characteristics. Similar varietal characteristics...

  9. 7 CFR 51.3202 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.3202 Section 51.3202 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards... Similar varietal characteristics. Similar varietal characteristics means that the onions in any container...

  10. 7 CFR 51.567 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.567 Section 51... STANDARDS) United States Standards for Celery Definitions § 51.567 Similar varietal characteristics. Similar varietal characteristics means that the stalks in any package have the same general appearance and...

  11. 7 CFR 51.763 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.763 Section 51.763 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards... characteristics. Similar varietal characteristics means that the fruits in any container are similar in color and...

  12. 7 CFR 51.3057 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.3057 Section 51.3057 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards... characteristics. Similar varietal characteristics means that the avocados in any container are similar in shape...

  13. 7 CFR 51.694 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.694 Section 51.694 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards..., and Arizona) Definitions § 51.694 Similar varietal characteristics. Similar varietal characteristics...

  14. 7 CFR 51.2650 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.2650 Section 51.2650 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards... characteristics. Similar varietal characteristics means that the cherries in any container are similar in color...

  15. Towards Modelling Variation in Music as Foundation for Similarity

    NARCIS (Netherlands)

    Volk, A.; de Haas, W.B.; van Kranenburg, P.; Cambouropoulos, E.; Tsougras, C.; Mavromatis, P.; Pastiadis, K.

    2012-01-01

    This paper investigates the concept of variation in music from the perspective of music similarity. Music similarity is a central concept in Music Information Retrieval (MIR), however there exists no comprehensive approach to music similarity yet. As a consequence, MIR faces the challenge on how to

  16. On Similarity Invariance of Balancing for Nonlinear Systems

    NARCIS (Netherlands)

    Scherpen, Jacquelien M.A.

    1995-01-01

    A previously obtained balancing method for nonlinear systems is investigated on similarity in variance by generalization of the observations on the similarity invariance of the linear balanced realization theory. For linear systems it is well known that the Hankel singular values are similarity

  17. Being similar while judging right and wrong: The effects of personal and situational similarity on moral judgements.

    Science.gov (United States)

    Pascal, Emilia

    2017-07-20

    This study investigated the effects of similarity with the transgressor and the victim on the perceived immorality of the transgression. Participants read two stories describing a person that cheated on their partner and a police officer that mistreated somebody. In the first story we manipulated participants' personal similarity to the transgressor and in the second their personal similarity to the victim. In each story, participants' past situational similarity to the target character was assessed according to their previous experiences of being in the same position. Results show that both personal and past situational similarity to the transgressor determine less severe moral judgements, while personal and past situational similarity with the victim have the opposite effect. We also tested several potential mediators of these effects, derived from competing theoretical accounts of the influence of similarity on perceived responsibility. Empathy emerged as mediating most of the effects of similarity on moral judgements, except those induced by past situational similarity with the victim. The foreseen probability of being in a similar situation mediated only the effects of similarity to the transgressor, and not those of similarity to the victim. Overall, results highlight the complex mechanisms of the influences of similarity on moral judgements. © 2017 International Union of Psychological Science.

  18. Self-similar analysis of the spherical implosion process

    International Nuclear Information System (INIS)

    Ishiguro, Yukio; Katsuragi, Satoru.

    1976-07-01

    The implosion processes caused by laser-heating ablation has been studied by self-similarity analysis. Attention is paid to the possibility of existence of the self-similar solution which reproduces the implosion process of high compression. Details of the self-similar analysis are reproduced and conclusions are drawn quantitatively on the gas compression by a single shock. The compression process by a sequence of shocks is discussed in self-similarity. The gas motion followed by a homogeneous isentropic compression is represented by a self-similar motion. (auth.)

  19. Structural similarities in DNA packaging and delivery apparatuses in Herpesvirus and dsDNA bacteriophages.

    Science.gov (United States)

    Rixon, Frazer J; Schmid, Michael F

    2014-04-01

    Structural information can inform our understanding of virus origins and evolution. The herpesviruses and tailed bacteriophages constitute two large families of dsDNA viruses which infect vertebrates and prokaryotes respectively. A relationship between these disparate groups was initially suggested by similarities in their capsid assembly and DNA packaging strategies. This relationship has now been confirmed by a range of studies that have revealed common structural features in their capsid proteins, and similar organizations and sequence conservation in their DNA packaging machinery and maturational proteases. This concentration of conserved traits in proteins involved in essential and primordial capsid/packaging functions is evidence that these structures are derived from an ancient, common ancestor and is in sharp contrast to the lack of such evidence for other virus functions. Copyright © 2014. Published by Elsevier B.V.

  20. Topology-function conservation in protein-protein interaction networks.

    Science.gov (United States)

    Davis, Darren; Yaveroğlu, Ömer Nebil; Malod-Dognin, Noël; Stojmirovic, Aleksandar; Pržulj, Nataša

    2015-05-15

    Proteins underlay the functioning of a cell and the wiring of proteins in protein-protein interaction network (PIN) relates to their biological functions. Proteins with similar wiring in the PIN (topology around them) have been shown to have similar functions. This property has been successfully exploited for predicting protein functions. Topological similarity is also used to guide network alignment algorithms that find similarly wired proteins between PINs of different species; these similarities are used to transfer annotation across PINs, e.g. from model organisms to human. To refine these functional predictions and annotation transfers, we need to gain insight into the variability of the topology-function relationships. For example, a function may be significantly associated with specific topologies, while another function may be weakly associated with several different topologies. Also, the topology-function relationships may differ between different species. To improve our understanding of topology-function relationships and of their conservation among species, we develop a statistical framework that is built upon canonical correlation analysis. Using the graphlet degrees to represent the wiring around proteins in PINs and gene ontology (GO) annotations to describe their functions, our framework: (i) characterizes statistically significant topology-function relationships in a given species, and (ii) uncovers the functions that have conserved topology in PINs of different species, which we term topologically orthologous functions. We apply our framework to PINs of yeast and human, identifying seven biological process and two cellular component GO terms to be topologically orthologous for the two organisms. © The Author 2015. Published by Oxford University Press.

  1. Similarity analysis between chromosomes of Homo sapiens and monkeys with correlation coefficient, rank correlation coefficient and cosine similarity measures

    OpenAIRE

    Someswara Rao, Chinta; Viswanadha Raju, S.

    2016-01-01

    In this paper, we consider correlation coefficient, rank correlation coefficient and cosine similarity measures for evaluating similarity between Homo sapiens and monkeys. We used DNA chromosomes of genome wide genes to determine the correlation between the chromosomal content and evolutionary relationship. The similarity among the H. sapiens and monkeys is measured for a total of 210 chromosomes related to 10 species. The similarity measures of these different species show the relationship b...

  2. Protein politics

    NARCIS (Netherlands)

    Vijver, Marike

    2005-01-01

    This study is part of the program of the interdisciplinary research group Profetas (protein foods, environment, technology and society). Profetas consists of technological, environmental and socio-economic research projects on protein food systems which result in the development of scenarios and

  3. Protein adhesives

    Science.gov (United States)

    Charles R. Frihart; Linda F. Lorenz

    2018-01-01

    Nature uses a wide variety of chemicals for providing adhesion internally (e.g., cell to cell) and externally (e.g., mussels to ships and piers). This adhesive bonding is chemically and mechanically complex, involving a variety of proteins, carbohydrates, and other compounds.Consequently,the effect of protein structures on adhesive properties is only partially...

  4. On the role of electrostatics on protein-protein interactions

    Science.gov (United States)

    Zhang, Zhe; Witham, Shawn; Alexov, Emil

    2011-01-01

    The role of electrostatics on protein-protein interactions and binding is reviewed in this article. A brief outline of the computational modeling, in the framework of continuum electrostatics, is presented and basic electrostatic effects occurring upon the formation of the complex are discussed. The role of the salt concentration and pH of the water phase on protein-protein binding free energy is demonstrated and indicates that the increase of the salt concentration tends to weaken the binding, an observation that is attributed to the optimization of the charge-charge interactions across the interface. It is pointed out that the pH-optimum (pH of optimal binding affinity) varies among the protein-protein complexes, and perhaps is a result of their adaptation to particular subcellular compartment. At the end, the similarities and differences between hetero- and homo-complexes are outlined and discussed with respect to the binding mode and charge complementarity. PMID:21572182

  5. Extending the Similarity-Attraction Effect: The Effects of When-Similarity in Computer-Mediated Communication

    NARCIS (Netherlands)

    Kaptein, M.C.; Castaneda, D.; Fernandez, N.; Nass, C.

    2014-01-01

    The feeling of connectedness experienced in computer-mediated relationships can be explained by the similarity-attraction effect (SAE). Though SAE is well established in psychology, the effects of some types of similarity have not yet been explored. In 2 studies, we demonstrate similarity-attraction

  6. Tau protein

    DEFF Research Database (Denmark)

    Frederiksen, Jette Lautrup Battistini; Kristensen, Kim; Bahl, Jmc

    2011-01-01

    Background: Tau protein has been proposed as biomarker of axonal damage leading to irreversible neurological impairment in MS. CSF concentrations may be useful when determining risk of progression from ON to MS. Objective: To investigate the association between tau protein concentration and 14......-3-3 protein in the cerebrospinal fluid (CSF) of patients with monosymptomatic optic neuritis (ON) versus patients with monosymptomatic onset who progressed to multiple sclerosis (MS). To evaluate results against data found in a complete literature review. Methods: A total of 66 patients with MS and/or ON from...... the Department of Neurology of Glostrup Hospital, University of Copenhagen, Denmark, were included. CSF samples were analysed for tau protein and 14-3-3 protein, and clinical and paraclinical information was obtained from medical records. Results: The study shows a significantly increased concentration of tau...

  7. Assessing Analytical Similarity of Proposed Amgen Biosimilar ABP 501 to Adalimumab.

    Science.gov (United States)

    Liu, Jennifer; Eris, Tamer; Li, Cynthia; Cao, Shawn; Kuhns, Scott

    2016-08-01

    ABP 501 is being developed as a biosimilar to adalimumab. Comprehensive comparative analytical characterization studies have been conducted and completed. The objective of this study was to assess analytical similarity between ABP 501 and two adalimumab reference products (RPs), licensed by the United States Food and Drug Administration (adalimumab [US]) and authorized by the European Union (adalimumab [EU]), using state-of-the-art analytical methods. Comprehensive analytical characterization incorporating orthogonal analytical techniques was used to compare products. Physicochemical property comparisons comprised the primary structure related to amino acid sequence and post-translational modifications including glycans; higher-order structure; primary biological properties mediated by target and receptor binding; product-related substances and impurities; host-cell impurities; general properties of the finished drug product, including strength and formulation; subvisible and submicron particles and aggregates; and forced thermal degradation. ABP 501 had the same amino acid sequence and similar post-translational modification profiles compared with adalimumab RPs. Primary structure, higher-order structure, and biological activities were similar for the three products. Product-related size and charge variants and aggregate and particle levels were also similar. ABP 501 had very low residual host-cell protein and DNA. The finished ABP 501 drug product has the same strength with regard to protein concentration and fill volume as adalimumab RPs. ABP 501 and the RPs had a similar stability profile both in normal storage and thermal stress conditions. Based on the comprehensive analytical similarity assessment, ABP 501 was found to be similar to adalimumab with respect to physicochemical and biological properties.

  8. An electrophysiological signature of summed similarity in visual working memory.

    Science.gov (United States)

    van Vugt, Marieke K; Sekuler, Robert; Wilson, Hugh R; Kahana, Michael J

    2013-05-01

    Summed-similarity models of short-term item recognition posit that participants base their judgments of an item's prior occurrence on that item's summed similarity to the ensemble of items on the remembered list. We examined the neural predictions of these models in 3 short-term recognition memory experiments using electrocorticographic/depth electrode recordings and scalp electroencephalography. On each experimental trial, participants judged whether a test face had been among a small set of recently studied faces. Consistent with summed-similarity theory, participants' tendency to endorse a test item increased as a function of its summed similarity to the items on the just-studied list. To characterize this behavioral effect of summed similarity, we successfully fit a summed-similarity model to individual participant data from each experiment. Using the parameters determined from fitting the summed-similarity model to the behavioral data, we examined the relation between summed similarity and brain activity. We found that 4-9 Hz theta activity in the medial temporal lobe and 2-4 Hz delta activity recorded from frontal and parietal cortices increased with summed similarity. These findings demonstrate direct neural correlates of the similarity computations that form the foundation of several major cognitive theories of human recognition memory. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  9. Recombinant Collagenlike Proteins

    Science.gov (United States)

    Fertala, Andzej

    2007-01-01

    A group of collagenlike recombinant proteins containing high densities of biologically active sites has been invented. The method used to express these proteins is similar to a method of expressing recombinant procollagens and collagens described in U. S. Patent 5,593,859, "Synthesis of human procollagens and collagens in recombinant DNA systems." Customized collagenous proteins are needed for biomedical applications. In particular, fibrillar collagens are attractive for production of matrices needed for tissue engineering and drug delivery. Prior to this invention, there was no way of producing customized collagenous proteins for these and other applications. Heretofore, collagenous proteins have been produced by use of such biological systems as yeasts, bacteria, and transgenic animals and plants. These products are normal collagens that can also be extracted from such sources as tendons, bones, and hides. These products cannot be made to consist only of biologically active, specific amino acid sequences that may be needed for specific applications. Prior to this invention, it had been established that fibrillar collagens consist of domains that are responsible for such processes as interaction with cells, binding of growth factors, and interaction with a number of structural proteins present in the extracellular matrix. A normal collagen consists of a sequence of domains that can be represented by a corresponding sequence of labels, e.g., D1D2D3D4. A collagenlike protein of the present invention contains regions of collagen II that contain multiples of a single domain (e.g., D1D1D1D1 or D4D4D4D4) chosen for its specific biological activity. By virtue of the multiplicity of the chosen domain, the density of sites having that specific biological activity is greater than it is in a normal collagen. A collagenlike protein according to this invention can thus be made to have properties that are necessary for tissue engineering.

  10. Similarity analysis between chromosomes of Homo sapiens and monkeys with correlation coefficient, rank correlation coefficient and cosine similarity measures.

    Science.gov (United States)

    Someswara Rao, Chinta; Viswanadha Raju, S

    2016-03-01

    In this paper, we consider correlation coefficient, rank correlation coefficient and cosine similarity measures for evaluating similarity between Homo sapiens and monkeys. We used DNA chromosomes of genome wide genes to determine the correlation between the chromosomal content and evolutionary relationship. The similarity among the H. sapiens and monkeys is measured for a total of 210 chromosomes related to 10 species. The similarity measures of these different species show the relationship between the H. sapiens and monkey. This similarity will be helpful at theft identification, maternity identification, disease identification, etc.

  11. Disorder in Protein Crystals.

    Science.gov (United States)

    Clarage, James Braun, II

    1990-01-01

    Methods have been developed for analyzing the diffuse x-ray scattering in the halos about a crystal's Bragg reflections as a means of determining correlations in atomic displacements in protein crystals. The diffuse intensity distribution for rhombohedral insulin, tetragonal lysozyme, and triclinic lysozyme crystals was best simulated in terms of exponential displacement correlation functions. About 90% of the disorder can be accounted for by internal movements correlated with a decay distance of about 6A; the remaining 10% corresponds to intermolecular movements that decay in a distance the order of size of the protein molecule. The results demonstrate that protein crystals fit into neither the Einstein nor the Debye paradigms for thermally fluctuating crystalline solids. Unlike the Einstein model, there are correlations in the atomic displacements, but these correlations decay more steeply with distance than predicted by the Debye-Waller model for an elastic solid. The observed displacement correlations are liquid -like in the sense that they decay exponentially with the distance between atoms, just as positional correlations in a liquid. This liquid-like disorder is similar to the disorder observed in 2-D crystals of polystyrene latex spheres, and similar systems where repulsive interactions dominate; hence, these colloidal crystals appear to provide a better analogy for the dynamics of protein crystals than perfectly elastic lattices.

  12. Constructing an integrated gene similarity network for the identification of disease genes.

    Science.gov (United States)

    Tian, Zhen; Guo, Maozu; Wang, Chunyu; Xing, LinLin; Wang, Lei; Zhang, Yin

    2017-09-20

    Discovering novel genes that are involved human diseases is a challenging task in biomedical research. In recent years, several computational approaches have been proposed to prioritize candidate disease genes. Most of these methods are mainly based on protein-protein interaction (PPI) networks. However, since these PPI networks contain false positives and only cover less half of known human genes, their reliability and coverage are very low. Therefore, it is highly necessary to fuse multiple genomic data to construct a credible gene similarity network and then infer disease genes on the whole genomic scale. We proposed a novel method, named RWRB, to infer causal genes of interested diseases. First, we construct five individual gene (protein) similarity networks based on multiple genomic data of human genes. Then, an integrated gene similarity network (IGSN) is reconstructed based on similarity network fusion (SNF) method. Finally, we employee the random walk with restart algorithm on the phenotype-gene bilayer network, which combines phenotype similarity network, IGSN as well as phenotype-gene association network, to prioritize candidate disease genes. We investigate the effectiveness of RWRB through leave-one-out cross-validation methods in inferring phenotype-gene relationships. Results show that RWRB is more accurate than state-of-the-art methods on most evaluation metrics. Further analysis shows that the success of RWRB is benefited from IGSN which has a wider coverage and higher reliability comparing with current PPI networks. Moreover, we conduct a comprehensive case study for Alzheimer's disease and predict some novel disease genes that supported by literature. RWRB is an effective and reliable algorithm in prioritizing candidate disease genes on the genomic scale. Software and supplementary information are available at http://nclab.hit.edu.cn/~tianzhen/RWRB/ .

  13. Lipschitz equivalence of self-similar sets with touching structures

    International Nuclear Information System (INIS)

    Ruan, Huo-Jun; Wang, Yang; Xi, Li-Feng

    2014-01-01

    Lipschitz equivalence of self-similar sets is an important area in the study of fractal geometry. It is known that two dust-like self-similar sets with the same contraction ratios are always Lipschitz equivalent. However, when self-similar sets have touching structures the problem of Lipschitz equivalence becomes much more challenging and intriguing at the same time. So far, all the known results only cover self-similar sets in R with no more than three branches. In this study we establish results for the Lipschitz equivalence of self-similar sets with touching structures in R with arbitrarily many branches. Key to our study is the introduction of a geometric condition for self-similar sets called substitutable. (paper)

  14. Perceptions of Ideal and Former Partners’ Personality and Similarity

    Directory of Open Access Journals (Sweden)

    Pieternel Dijkstra

    2010-12-01

    Full Text Available The present study aimed to test predictions based on both the ‗similarity-attraction‘ hypothesis and the ‗attraction-similarity‘ hypothesis, by studying perceptions of ideal and former partners. Based on the ‗similarity-attraction‘ hypothesis, we expected individuals to desire ideal partners who are similar to the self in personality. In addition, based on the ‗attraction-similarity hypothesis‘, we expected individuals to perceive former partners as dissimilar to them in terms of personality. Findings showed that, whereas the ideal partner was seen as similar to and more positive than the self, the former partner was seen as dissimilar to and more negative than the self. In addition, our study showed that individuals did not rate similarity in personality as very important when seeking a mate. Our findings may help understand why so many relationships end in divorce due to mismatches in personality.

  15. The efficiency of similarity-focused comparisons in person perception.

    Science.gov (United States)

    Corcoran, Katja

    2013-01-01

    Comparison processes are ubiquitous in person perception. Comparative thinking can follow two routes: People either search for similarities or for dissimilarities while comparing. Which of these two routes is more efficient? Previous research indicates that people could compare two geometrical figures faster if they focused on similarities rather than dissimilarities. I examine comparisons of people and measure the consumption of cognitive resources as indicator for efficiency. The results confirm an efficiency-advantage of similarity-focused comparisons for social stimuli.

  16. A New Trajectory Similarity Measure for GPS Data

    KAUST Repository

    Ismail, Anas; Vigneron, Antoine E.

    2016-01-01

    We present a new algorithm for measuring the similarity between trajectories, and in particular between GPS traces. We call this new similarity measure the Merge Distance (MD). Our approach is robust against subsampling and supersampling. We perform experiments to compare this new similarity measure with the two main approaches that have been used so far: Dynamic Time Warping (DTW) and the Euclidean distance. © 2015 ACM.

  17. A Minimum Spanning Tree Representation of Anime Similarities

    OpenAIRE

    Wibowo, Canggih Puspo

    2016-01-01

    In this work, a new way to represent Japanese animation (anime) is presented. We applied a minimum spanning tree to show the relation between anime. The distance between anime is calculated through three similarity measurements, namely crew, score histogram, and topic similarities. Finally, the centralities are also computed to reveal the most significance anime. The result shows that the minimum spanning tree can be used to determine the similarity anime. Furthermore, by using centralities c...

  18. A New Trajectory Similarity Measure for GPS Data

    KAUST Repository

    Ismail, Anas

    2016-08-08

    We present a new algorithm for measuring the similarity between trajectories, and in particular between GPS traces. We call this new similarity measure the Merge Distance (MD). Our approach is robust against subsampling and supersampling. We perform experiments to compare this new similarity measure with the two main approaches that have been used so far: Dynamic Time Warping (DTW) and the Euclidean distance. © 2015 ACM.

  19. Common neighbour structure and similarity intensity in complex networks

    Science.gov (United States)

    Hou, Lei; Liu, Kecheng

    2017-10-01

    Complex systems as networks always exhibit strong regularities, implying underlying mechanisms governing their evolution. In addition to the degree preference, the similarity has been argued to be another driver for networks. Assuming a network is randomly organised without similarity preference, the present paper studies the expected number of common neighbours between vertices. A symmetrical similarity index is accordingly developed by removing such expected number from the observed common neighbours. The developed index can not only describe the similarities between vertices, but also the dissimilarities. We further apply the proposed index to measure of the influence of similarity on the wring patterns of networks. Fifteen empirical networks as well as artificial networks are examined in terms of similarity intensity and degree heterogeneity. Results on real networks indicate that, social networks are strongly governed by the similarity as well as the degree preference, while the biological networks and infrastructure networks show no apparent similarity governance. Particularly, classical network models, such as the Barabási-Albert model, the Erdös-Rényi model and the Ring Lattice, cannot well describe the social networks in terms of the degree heterogeneity and similarity intensity. The findings may shed some light on the modelling and link prediction of different classes of networks.

  20. A Survey of Binary Similarity and Distance Measures

    Directory of Open Access Journals (Sweden)

    Seung-Seok Choi

    2010-02-01

    Full Text Available The binary feature vector is one of the most common representations of patterns and measuring similarity and distance measures play a critical role in many problems such as clustering, classification, etc. Ever since Jaccard proposed a similarity measure to classify ecological species in 1901, numerous binary similarity and distance measures have been proposed in various fields. Applying appropriate measures results in more accurate data analysis. Notwithstanding, few comprehensive surveys on binary measures have been conducted. Hence we collected 76 binary similarity and distance measures used over the last century and reveal their correlations through the hierarchical clustering technique.

  1. Systematic characterizations of text similarity in full text biomedical publications.

    Science.gov (United States)

    Sun, Zhaohui; Errami, Mounir; Long, Tara; Renard, Chris; Choradia, Nishant; Garner, Harold

    2010-09-15

    Computational methods have been used to find duplicate biomedical publications in MEDLINE. Full text articles are becoming increasingly available, yet the similarities among them have not been systematically studied. Here, we quantitatively investigated the full text similarity of biomedical publications in PubMed Central. 72,011 full text articles from PubMed Central (PMC) were parsed to generate three different datasets: full texts, sections, and paragraphs. Text similarity comparisons were performed on these datasets using the text similarity algorithm eTBLAST. We measured the frequency of similar text pairs and compared it among different datasets. We found that high abstract similarity can be used to predict high full text similarity with a specificity of 20.1% (95% CI [17.3%, 23.1%]) and sensitivity of 99.999%. Abstract similarity and full text similarity have a moderate correlation (Pearson correlation coefficient: -0.423) when the similarity ratio is above 0.4. Among pairs of articles in PMC, method sections are found to be the most repetitive (frequency of similar pairs, methods: 0.029, introduction: 0.0076, results: 0.0043). In contrast, among a set of manually verified duplicate articles, results are the most repetitive sections (frequency of similar pairs, results: 0.94, methods: 0.89, introduction: 0.82). Repetition of introduction and methods sections is more likely to be committed by the same authors (odds of a highly similar pair having at least one shared author, introduction: 2.31, methods: 1.83, results: 1.03). There is also significantly more similarity in pairs of review articles than in pairs containing one review and one nonreview paper (frequency of similar pairs: 0.0167 and 0.0023, respectively). While quantifying abstract similarity is an effective approach for finding duplicate citations, a comprehensive full text analysis is necessary to uncover all potential duplicate citations in the scientific literature and is helpful when

  2. Application of the principle of similarity fluid mechanics

    International Nuclear Information System (INIS)

    Hendricks, R.C.; Sengers, J.V.

    1979-01-01

    Possible applications of the principle of similarity to fluid mechanics is described and illustrated. In correlating thermophysical properties of fluids, the similarity principle transcends the traditional corresponding states principle. In fluid mechanics the similarity principle is useful in correlating flow processes that can be modeled adequately with one independent variable (i.e., one-dimensional flows). In this paper we explore the concept of transforming the conservation equations by combining similarity principles for thermophysical properties with those for fluid flow. We illustrate the usefulness of the procedure by applying such a transformation to calculate two phase critical mass flow through a nozzle

  3. A measure of association between vectors based on "similarity covariance"

    OpenAIRE

    Pascual-Marqui, Roberto D.; Lehmann, Dietrich; Kochi, Kieko; Kinoshita, Toshihiko; Yamada, Naoto

    2013-01-01

    The "maximum similarity correlation" definition introduced in this study is motivated by the seminal work of Szekely et al on "distance covariance" (Ann. Statist. 2007, 35: 2769-2794; Ann. Appl. Stat. 2009, 3: 1236-1265). Instead of using Euclidean distances "d" as in Szekely et al, we use "similarity", which can be defined as "exp(-d/s)", where the scaling parameter s>0 controls how rapidly the similarity falls off with distance. Scale parameters are chosen by maximizing the similarity corre...

  4. HIV and influenza share a similar structural blueprint

    Science.gov (United States)

    HIV uses a protein called the envelope glycoprotein spike to attach itself and fuse with the cell membrane; NCI scientists have now defined the structure of this spike in its pre-fusion state using cryo-electron microscopy

  5. Using relational databases for improved sequence similarity searching and large-scale genomic analyses.

    Science.gov (United States)

    Mackey, Aaron J; Pearson, William R

    2004-10-01

    Relational databases are designed to integrate diverse types of information and manage large sets of search results, greatly simplifying genome-scale analyses. Relational databases are essential for management and analysis of large-scale sequence analyses, and can also be used to improve the statistical significance of similarity searches by focusing on subsets of sequence libraries most likely to contain homologs. This unit describes using relational databases to improve the efficiency of sequence similarity searching and to demonstrate various large-scale genomic analyses of homology-related data. This unit describes the installation and use of a simple protein sequence database, seqdb_demo, which is used as a basis for the other protocols. These include basic use of the database to generate a novel sequence library subset, how to extend and use seqdb_demo for the storage of sequence similarity search results and making use of various kinds of stored search results to address aspects of comparative genomic analysis.

  6. Detecting atypical examples of known domain types by sequence similarity searching: the SBASE domain library approach.

    Science.gov (United States)

    Dhir, Somdutta; Pacurar, Mircea; Franklin, Dino; Gáspári, Zoltán; Kertész-Farkas, Attila; Kocsor, András; Eisenhaber, Frank; Pongor, Sándor

    2010-11-01

    SBASE is a project initiated to detect known domain types and predicting domain architectures using sequence similarity searching (Simon et al., Protein Seq Data Anal, 5: 39-42, 1992, Pongor et al, Nucl. Acids. Res. 21:3111-3115, 1992). The current approach uses a curated collection of domain sequences - the SBASE domain library - and standard similarity search algorithms, followed by postprocessing which is based on a simple statistics of the domain similarity network (http://hydra.icgeb.trieste.it/sbase/). It is especially useful in detecting rare, atypical examples of known domain types which are sometimes missed even by more sophisticated methodologies. This approach does not require multiple alignment or machine learning techniques, and can be a useful complement to other domain detection methodologies. This article gives an overview of the project history as well as of the concepts and principles developed within this the project.

  7. Correlating Information Contents of Gene Ontology Terms to Infer Semantic Similarity of Gene Products

    Directory of Open Access Journals (Sweden)

    Mingxin Gan

    2014-01-01

    Full Text Available Successful applications of the gene ontology to the inference of functional relationships between gene products in recent years have raised the need for computational methods to automatically calculate semantic similarity between gene products based on semantic similarity of gene ontology terms. Nevertheless, existing methods, though having been widely used in a variety of applications, may significantly overestimate semantic similarity between genes that are actually not functionally related, thereby yielding misleading results in applications. To overcome this limitation, we propose to represent a gene product as a vector that is composed of information contents of gene ontology terms annotated for the gene product, and we suggest calculating similarity between two gene products as the relatedness of their corresponding vectors using three measures: Pearson’s correlation coefficient, cosine similarity, and the Jaccard index. We focus on the biological process domain of the gene ontology and annotations of yeast proteins to study the effectiveness of the proposed measures. Results show that semantic similarity scores calculated using the proposed measures are more consistent with known biological knowledge than those derived using a list of existing methods, suggesting the effectiveness of our method in characterizing functional relationships between gene products.

  8. Calculating the knowledge-based similarity of functional groups using crystallographic data

    Science.gov (United States)

    Watson, Paul; Willett, Peter; Gillet, Valerie J.; Verdonk, Marcel L.

    2001-09-01

    A knowledge-based method for calculating the similarity of functional groups is described and validated. The method is based on experimental information derived from small molecule crystal structures. These data are used in the form of scatterplots that show the likelihood of a non-bonded interaction being formed between functional group A (the `central group') and functional group B (the `contact group' or `probe'). The scatterplots are converted into three-dimensional maps that show the propensity of the probe at different positions around the central group. Here we describe how to calculate the similarity of a pair of central groups based on these maps. The similarity method is validated using bioisosteric functional group pairs identified in the Bioster database and Relibase. The Bioster database is a critical compilation of thousands of bioisosteric molecule pairs, including drugs, enzyme inhibitors and agrochemicals. Relibase is an object-oriented database containing structural data about protein-ligand interactions. The distributions of the similarities of the bioisosteric functional group pairs are compared with similarities for all the possible pairs in IsoStar, and are found to be significantly different. Enrichment factors are also calculated showing the similarity method is statistically significantly better than random in predicting bioisosteric functional group pairs.

  9. Semantic similarity measure in biomedical domain leverage web search engine.

    Science.gov (United States)

    Chen, Chi-Huang; Hsieh, Sheau-Ling; Weng, Yung-Ching; Chang, Wen-Yung; Lai, Feipei

    2010-01-01

    Semantic similarity measure plays an essential role in Information Retrieval and Natural Language Processing. In this paper we propose a page-count-based semantic similarity measure and apply it in biomedical domains. Previous researches in semantic web related applications have deployed various semantic similarity measures. Despite the usefulness of the measurements in those applications, measuring semantic similarity between two terms remains a challenge task. The proposed method exploits page counts returned by the Web Search Engine. We define various similarity scores for two given terms P and Q, using the page counts for querying P, Q and P AND Q. Moreover, we propose a novel approach to compute semantic similarity using lexico-syntactic patterns with page counts. These different similarity scores are integrated adapting support vector machines, to leverage the robustness of semantic similarity measures. Experimental results on two datasets achieve correlation coefficients of 0.798 on the dataset provided by A. Hliaoutakis, 0.705 on the dataset provide by T. Pedersen with physician scores and 0.496 on the dataset provided by T. Pedersen et al. with expert scores.

  10. Visual reconciliation of alternative similarity spaces in climate modeling

    Science.gov (United States)

    J Poco; A Dasgupta; Y Wei; William Hargrove; C.R. Schwalm; D.N. Huntzinger; R Cook; E Bertini; C.T. Silva

    2015-01-01

    Visual data analysis often requires grouping of data objects based on their similarity. In many application domains researchers use algorithms and techniques like clustering and multidimensional scaling to extract groupings from data. While extracting these groups using a single similarity criteria is relatively straightforward, comparing alternative criteria poses...

  11. Phonological Similarity in Serial Recall: Constraints on Theories of Memory

    Science.gov (United States)

    Lewandowsky, Stephan; Farrell, Simon

    2008-01-01

    In short-term serial recall, similar-sounding items are remembered more poorly than items that do not sound alike. When lists mix similar and dissimilar items, performance on the dissimilar items is of considerable theoretical interest. Farrell and Lewandowsky [Farrell, S., & Lewandowsky, S. (2003). Dissimilar items benefit from phonological…

  12. 7 CFR 51.2116 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... blanchable varieties within the “California” Marketing Classification. In addition, Nonpareil or similar... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.2116 Section 51.2116 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards...

  13. Relationship between genetic similarity and some productive traits ...

    African Journals Online (AJOL)

    Admin

    Random amplified polymorphic DNA (RAPD) technique was applied to detect genetic similarity between five local chicken strains that have been selected for eggs and meat production in Egypt. Based on six oligonucleotide primers, the genetic similarity between the egg-producing strains (Anshas, Silver. Montazah and ...

  14. Self-similar solutions of certain coupled integrable systems

    CERN Document Server

    Chakravarty, S; Kent, S L

    2003-01-01

    Similarity reductions of the coupled nonlinear Schroedinger equation and an integrable version of the coupled Maxwell-Bloch system are obtained by applying non-translational symmetries. The reduced system of coupled ordinary differential equations are solved in terms of Painleve transcendents, leading to new exact self-similar solutions for these integrable equations.

  15. Self-similar solutions of certain coupled integrable systems

    International Nuclear Information System (INIS)

    Chakravarty, S; Halburd, R G; Kent, S L

    2003-01-01

    Similarity reductions of the coupled nonlinear Schroedinger equation and an integrable version of the coupled Maxwell-Bloch system are obtained by applying non-translational symmetries. The reduced system of coupled ordinary differential equations are solved in terms of Painleve transcendents, leading to new exact self-similar solutions for these integrable equations

  16. Perceptions of ideal and former partner's personality and similarity

    NARCIS (Netherlands)

    Dijkstra, Pieternel; Barelds, Dick P.H.

    2010-01-01

    The present study aimed to test predictions based on both the ‗similarity-attraction‘ hypothesis and the ‗attraction-similarity‘ hypothesis, by studying perceptions of ideal and former partners. Based on the ‗similarity-attraction‘ hypothesis, we expected individuals to desire ideal partners who are

  17. A Framework for Analysis of Music Similarity Measures

    DEFF Research Database (Denmark)

    Jensen, Jesper Højvang; Christensen, Mads G.; Jensen, Søren Holdt

    2007-01-01

    To analyze specific properties of music similarity measures that the commonly used genre classification evaluation procedure does not reveal, we introduce a MIDI based test framework for music similarity measures. We introduce the framework by example and thus outline an experiment to analyze the...

  18. Density-based retrieval from high-similarity image databases

    DEFF Research Database (Denmark)

    Hansen, Michael Edberg; Carstensen, Jens Michael

    2004-01-01

    Many image classification problems can fruitfully be thought of as image retrieval in a "high similarity image database" (HSID) characterized by being tuned towards a specific application and having a high degree of visual similarity between entries that should be distinguished. We introduce a me...

  19. Epistemic Similarities between Students' Scientific and Supernatural Beliefs

    Science.gov (United States)

    Shtulman, Andrew

    2013-01-01

    The evidential support for scientific claims is quantitatively and qualitatively superior to that for supernatural claims, yet students may not appreciate this difference in light of the fact that both types of claims are learned in similar ways (through testimony rather than firsthand observation) and perform similar functions (explaining…

  20. Mixed-List Phonological Similarity Effects in Delayed Serial Recall

    Science.gov (United States)

    Farrell, Simon

    2006-01-01

    Recent experiments have shown that placing dissimilar items on lists of phonologically similar items enhances accuracy of ordered recall of the dissimilar items [Farrell, S., & Lewandowsky, S. (2003). Dissimilar items benefit from phonological similarity in serial recall. "Journal of Experimental Psychology: Learning, Memory, and Cognition," 29,…

  1. Multicriteria decision-making method based on a cosine similarity ...

    African Journals Online (AJOL)

    the cosine similarity measure is often used in information retrieval, citation analysis, and automatic classification. However, it scarcely deals with trapezoidal fuzzy information and multicriteria decision-making problems. For this purpose, a cosine similarity measure between trapezoidal fuzzy numbers is proposed based on ...

  2. Self-similar solution for coupled thermal electromagnetic model ...

    African Journals Online (AJOL)

    An investigation into the existence and uniqueness solution of self-similar solution for the coupled Maxwell and Pennes Bio-heat equations have been done. Criteria for existence and uniqueness of self-similar solution are revealed in the consequent theorems. Journal of the Nigerian Association of Mathematical Physics ...

  3. 36 CFR 1002.20 - Skating, skateboards and similar devices.

    Science.gov (United States)

    2010-07-01

    ... 36 Parks, Forests, and Public Property 3 2010-07-01 2010-07-01 false Skating, skateboards and similar devices. 1002.20 Section 1002.20 Parks, Forests, and Public Property PRESIDIO TRUST RESOURCE PROTECTION, PUBLIC USE AND RECREATION § 1002.20 Skating, skateboards and similar devices. Using roller skates...

  4. 7 CFR 51.1550 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.1550 Section 51.1550 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards... characteristics. Similar varietal characteristics means that the potatoes in any lot have the same general shape...

  5. 7 CFR 51.1154 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.1154 Section 51.1154 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards... varietal characteristics. Similar varietal characteristics means that the fruits in any container are...

  6. 7 CFR 51.2756 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.2756 Section 51.2756 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards... characteristics. Similar varietal characteristics means that the peanut kernels in the lot are not of distinctly...

  7. 7 CFR 51.1906 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.1906 Section 51.1906 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards... characteristics. Similar varietal characteristics means that the tomatoes are alike as to color, i.e., bright red...

  8. 7 CFR 51.2714 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.2714 Section 51.2714 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards... characteristics. Similar varietal characteristics means that the peanut kernels in the lot are not of distinctly...

  9. 7 CFR 51.603 - Similar varietal characteristics.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Similar varietal characteristics. 51.603 Section 51.603 Agriculture Regulations of the Department of Agriculture AGRICULTURAL MARKETING SERVICE (Standards... characteristics. Similar varietal characteristics means that the stalks in any container have the same character...

  10. Efficient estimation for high similarities using odd sketches

    DEFF Research Database (Denmark)

    Mitzenmacher, Michael; Pagh, Rasmus; Pham, Ninh Dang

    2014-01-01

    . This means that Odd Sketches provide a highly space-efficient estimator for sets of high similarity, which is relevant in applications such as web duplicate detection, collaborative filtering, and association rule learning. The method extends to weighted Jaccard similarity, relevant e.g. for TF-IDF vector...... and web duplicate detection tasks....

  11. An electrophysiological signature of summed similarity in visual working memory

    NARCIS (Netherlands)

    Van Vugt, Marieke K.; Sekuler, Robert; Wilson, Hugh R.; Kahana, Michael J.

    Summed-similarity models of short-term item recognition posit that participants base their judgments of an item's prior occurrence on that item's summed similarity to the ensemble of items on the remembered list. We examined the neural predictions of these models in 3 short-term recognition memory

  12. Interpersonal Similarity and Knowledge Sharing within Multinational Organizations

    DEFF Research Database (Denmark)

    Mäkelä, Kristiina; Andersson, Ulf; Seppälä, Tomi

    2012-01-01

    Previous research has established that interpersonal similarity can influence knowledge sharing in such a way that similar people are more likely to share knowledge than those who are dissimilar. We contribute to the literature by showing that in the MNC context, cultural and functional similarit....... These microfoundations of inter-unit knowledge exchange point to important theoretical and practical implications for international management....

  13. The study on the cephalometric similarity between parents and offspring

    Energy Technology Data Exchange (ETDEWEB)

    Kang, Woo Ghon; Ahn, Hyung Kyu [Department of Radiology, College of Dentistry, Seoul National University, Seoul (Korea, Republic of)

    1975-11-15

    The study was performed to investigate cephalometric similarity between parents and offspring of the Korean family by lateral cephalometric analysis. The lateral cephalograms consist of the 8 families comprising 16 parents, 5 sons and 7 daughters. In order to make an investigation of the similarity, 12 measuring points were set up, and 22 linear measurements on each depth, height and 5 angular measurements were made. The author drew up the profilograms to compare parents with offspring in each family group. The obtained results were as follows: 1. There was no common similarity on specific region between parents and offspring in each family group. 2. There was partial similarity between single parent and offspring. 3. The partial similarity between single parent and offspring was noted on the upper face in general.

  14. Relativistic quantum similarities in atoms in position and momentum spaces

    International Nuclear Information System (INIS)

    Maldonado, P.; Sarsa, A.; Buendia, E.; Galvez, F.J.

    2011-01-01

    A study of different quantum similarity measures and their corresponding quantum similarity indices is carried out for the atoms from H to Lr (Z=1-103). Relativistic effects in both position and momentum spaces have been studied by comparing the relativistic values to the non-relativistic ones. We have used the atomic electron density in both position and momentum spaces obtained within relativistic and non-relativistic numerical-parameterized optimized effective potential approximations. -- Highlights: → Quantum similarity measures and indices in electronic structure of atoms. → Position and momentum electronic densities. → Similarity of relativistic and non-relativistic densities. → Similarity of core and valence regions of different atoms. → Dependence with Z along the Periodic Table.

  15. Improved collaborative filtering recommendation algorithm of similarity measure

    Science.gov (United States)

    Zhang, Baofu; Yuan, Baoping

    2017-05-01

    The Collaborative filtering recommendation algorithm is one of the most widely used recommendation algorithm in personalized recommender systems. The key is to find the nearest neighbor set of the active user by using similarity measure. However, the methods of traditional similarity measure mainly focus on the similarity of user common rating items, but ignore the relationship between the user common rating items and all items the user rates. And because rating matrix is very sparse, traditional collaborative filtering recommendation algorithm is not high efficiency. In order to obtain better accuracy, based on the consideration of common preference between users, the difference of rating scale and score of common items, this paper presents an improved similarity measure method, and based on this method, a collaborative filtering recommendation algorithm based on similarity improvement is proposed. Experimental results show that the algorithm can effectively improve the quality of recommendation, thus alleviate the impact of data sparseness.

  16. Classification of Unknown Thermocouple Types Using Similarity Factor Measurement

    Directory of Open Access Journals (Sweden)

    Seshu K. DAMARLA

    2011-01-01

    Full Text Available In contrast to classification using PCA method, a new methodology is proposed for type identification of unknown thermocouple. The new methodology is based on calculating the degree of similarity between two multivariate datasets using two types of similarity factors. One similarity factor is based on principle component analysis and the angles between the principle component subspaces while the other is based on the Mahalanobis distance between the datasets. Datasets containing thermo-emfs against given temperature ranges are formed for each type of thermocouple (e.g. J, K, S, T, R, E, B and N type by experimentation are considered as reference datasets. Datasets corresponding to unknown type are captured. Similarity factor between the datasets one of which being the unknown type and the other being each known type are compared. When maximum similarity factor occurs, then the class of unknown type is allocated to that of known type.

  17. The study on the cephalometric similarity between parents and offspring

    International Nuclear Information System (INIS)

    Kang, Woo Ghon; Ahn, Hyung Kyu

    1975-01-01

    The study was performed to investigate cephalometric similarity between parents and offspring of the Korean family by lateral cephalometric analysis. The lateral cephalograms consist of the 8 families comprising 16 parents, 5 sons and 7 daughters. In order to make an investigation of the similarity, 12 measuring points were set up, and 22 linear measurements on each depth, height and 5 angular measurements were made. The author drew up the profilograms to compare parents with offspring in each family group. The obtained results were as follows: 1. There was no common similarity on specific region between parents and offspring in each family group. 2. There was partial similarity between single parent and offspring. 3. The partial similarity between single parent and offspring was noted on the upper face in general.

  18. A Model-Based Approach to Constructing Music Similarity Functions

    Science.gov (United States)

    West, Kris; Lamere, Paul

    2006-12-01

    Several authors have presented systems that estimate the audio similarity of two pieces of music through the calculation of a distance metric, such as the Euclidean distance, between spectral features calculated from the audio, related to the timbre or pitch of the signal. These features can be augmented with other, temporally or rhythmically based features such as zero-crossing rates, beat histograms, or fluctuation patterns to form a more well-rounded music similarity function. It is our contention that perceptual or cultural labels, such as the genre, style, or emotion of the music, are also very important features in the perception of music. These labels help to define complex regions of similarity within the available feature spaces. We demonstrate a machine-learning-based approach to the construction of a similarity metric, which uses this contextual information to project the calculated features into an intermediate space where a music similarity function that incorporates some of the cultural information may be calculated.

  19. Similar words analysis based on POS-CBOW language model

    Directory of Open Access Journals (Sweden)

    Dongru RUAN

    2015-10-01

    Full Text Available Similar words analysis is one of the important aspects in the field of natural language processing, and it has important research and application values in text classification, machine translation and information recommendation. Focusing on the features of Sina Weibo's short text, this paper presents a language model named as POS-CBOW, which is a kind of continuous bag-of-words language model with the filtering layer and part-of-speech tagging layer. The proposed approach can adjust the word vectors' similarity according to the cosine similarity and the word vectors' part-of-speech metrics. It can also filter those similar words set on the base of the statistical analysis model. The experimental result shows that the similar words analysis algorithm based on the proposed POS-CBOW language model is better than that based on the traditional CBOW language model.

  20. Natural texture retrieval based on perceptual similarity measurement

    Science.gov (United States)

    Gao, Ying; Dong, Junyu; Lou, Jianwen; Qi, Lin; Liu, Jun

    2018-04-01

    A typical texture retrieval system performs feature comparison and might not be able to make human-like judgments of image similarity. Meanwhile, it is commonly known that perceptual texture similarity is difficult to be described by traditional image features. In this paper, we propose a new texture retrieval scheme based on texture perceptual similarity. The key of the proposed scheme is that prediction of perceptual similarity is performed by learning a non-linear mapping from image features space to perceptual texture space by using Random Forest. We test the method on natural texture dataset and apply it on a new wallpapers dataset. Experimental results demonstrate that the proposed texture retrieval scheme with perceptual similarity improves the retrieval performance over traditional image features.

  1. Cultural similarity, cultural competence, and nurse workforce diversity.

    Science.gov (United States)

    McGinnis, Sandra L; Brush, Barbara L; Moore, Jean

    2010-11-01

    Proponents of health workforce diversity argue that increasing the number of minority health care providers will enhance cultural similarity between patients and providers as well as the health system's capacity to provide culturally competent care. Measuring cultural similarity has been difficult, however, given that current benchmarks of workforce diversity categorize health workers by major racial/ethnic classifications rather than by cultural measures. This study examined the use of national racial/ethnic categories in both patient and registered nurse (RN) populations and found them to be a poor indicator of cultural similarity. Rather, we found that cultural similarity between RN and patient populations needs to be established at the level of local labor markets and broadened to include other cultural parameters such as country of origin, primary language, and self-identified ancestry. Only then can the relationship between cultural similarity and cultural competence be accurately determined and its outcomes measured.

  2. Average is Boring: How Similarity Kills a Meme's Success

    Science.gov (United States)

    Coscia, Michele

    2014-09-01

    Every day we are exposed to different ideas, or memes, competing with each other for our attention. Previous research explained popularity and persistence heterogeneity of memes by assuming them in competition for limited attention resources, distributed in a heterogeneous social network. Little has been said about what characteristics make a specific meme more likely to be successful. We propose a similarity-based explanation: memes with higher similarity to other memes have a significant disadvantage in their potential popularity. We employ a meme similarity measure based on semantic text analysis and computer vision to prove that a meme is more likely to be successful and to thrive if its characteristics make it unique. Our results show that indeed successful memes are located in the periphery of the meme similarity space and that our similarity measure is a promising predictor of a meme success.

  3. Investigation of psychophysical similarity measures for selection of similar images in the diagnosis of clustered microcalcifications on mammograms

    Energy Technology Data Exchange (ETDEWEB)

    Muramatsu, Chisako; Li Qiang; Schmidt, Robert; Shiraishi, Junji; Doi, Kunio [Department of Radiology, University of Chicago, 5841 South Maryland Avenue, Chicago, Illinois 60637 (United States) and Department of Intelligent Image Information, Gifu University, 1-1 Yanagido, Gifu (Japan); Department of Radiology, Duke Advanced Imaging Labs, Duke University, 2424 Erwin Road, Suite 302, Durham, North Carolina 27705 (United States); Department of Radiology, University of Chicago, 5841 South Maryland Avenue, Chicago, Illinois 60637 (United States)

    2008-12-15

    The presentation of images with lesions of known pathology that are similar to an unknown lesion may be helpful to radiologists in the diagnosis of challenging cases for improving the diagnostic accuracy and also for reducing variation among different radiologists. The authors have been developing a computerized scheme for automatically selecting similar images with clustered microcalcifications on mammograms from a large database. For similar images to be useful, they must be similar from the point of view of the diagnosing radiologists. In order to select such images, subjective similarity ratings were obtained for a number of pairs of clustered microcalcifications by breast radiologists for establishment of a ''gold standard'' of image similarity, and the gold standard was employed for determination and evaluation of the selection of similar images. The images used in this study were obtained from the Digital Database for Screening Mammography developed by the University of South Florida. The subjective similarity ratings for 300 pairs of images with clustered microcalcifications were determined by ten breast radiologists. The authors determined a number of image features which represent the characteristics of clustered microcalcifications that radiologists would use in their diagnosis. For determination of objective similarity measures, an artificial neural network (ANN) was employed. The ANN was trained with the average subjective similarity ratings as teacher and selected image features as input data. The ANN was trained to learn the relationship between the image features and the radiologists' similarity ratings; therefore, once the training was completed, the ANN was able to determine the similarity, called a psychophysical similarity measure, which was expected to be close to radiologists' impressions, for an unknown pair of clustered microcalcifications. By use of a leave-one-out test method, the best combination of features

  4. Investigation of psychophysical similarity measures for selection of similar images in the diagnosis of clustered microcalcifications on mammograms

    International Nuclear Information System (INIS)

    Muramatsu, Chisako; Li Qiang; Schmidt, Robert; Shiraishi, Junji; Doi, Kunio

    2008-01-01

    The presentation of images with lesions of known pathology that are similar to an unknown lesion may be helpful to radiologists in the diagnosis of challenging cases for improving the diagnostic accuracy and also for reducing variation among different radiologists. The authors have been developing a computerized scheme for automatically selecting similar images with clustered microcalcifications on mammograms from a large database. For similar images to be useful, they must be similar from the point of view of the diagnosing radiologists. In order to select such images, subjective similarity ratings were obtained for a number of pairs of clustered microcalcifications by breast radiologists for establishment of a ''gold standard'' of image similarity, and the gold standard was employed for determination and evaluation of the selection of similar images. The images used in this study were obtained from the Digital Database for Screening Mammography developed by the University of South Florida. The subjective similarity ratings for 300 pairs of images with clustered microcalcifications were determined by ten breast radiologists. The authors determined a number of image features which represent the characteristics of clustered microcalcifications that radiologists would use in their diagnosis. For determination of objective similarity measures, an artificial neural network (ANN) was employed. The ANN was trained with the average subjective similarity ratings as teacher and selected image features as input data. The ANN was trained to learn the relationship between the image features and the radiologists' similarity ratings; therefore, once the training was completed, the ANN was able to determine the similarity, called a psychophysical similarity measure, which was expected to be close to radiologists' impressions, for an unknown pair of clustered microcalcifications. By use of a leave-one-out test method, the best combination of features was selected. The correlation

  5. Text Mining for Protein Docking.

    Directory of Open Access Journals (Sweden)

    Varsha D Badal

    2015-12-01

    Full Text Available The rapidly growing amount of publicly available information from biomedical research is readily accessible on the Internet, providing a powerful resource for predictive biomolecular modeling. The accumulated data on experimentally determined structures transformed structure prediction of proteins and protein complexes. Instead of exploring the enormous search space, predictive tools can simply proceed to the solution based on similarity to the existing, previously determined structures. A similar major paradigm shift is emerging due to the rapidly expanding amount of information, other than experimentally determined structures, which still can be used as constraints in biomolecular structure prediction. Automated text mining has been widely used in recreating protein interaction networks, as well as in detecting small ligand binding sites on protein structures. Combining and expanding these two well-developed areas of research, we applied the text mining to structural modeling of protein-protein complexes (protein docking. Protein docking can be significantly improved when constraints on the docking mode are available. We developed a procedure that retrieves published abstracts on a specific protein-protein interaction and extracts information relevant to docking. The procedure was assessed on protein complexes from Dockground (http://dockground.compbio.ku.edu. The results show that correct information on binding residues can be extracted for about half of the complexes. The amount of irrelevant information was reduced by conceptual analysis of a subset of the retrieved abstracts, based on the bag-of-words (features approach. Support Vector Machine models were trained and validated on the subset. The remaining abstracts were filtered by the best-performing models, which decreased the irrelevant information for ~ 25% complexes in the dataset. The extracted constraints were incorporated in the docking protocol and tested on the Dockground unbound

  6. Protein function prediction using neighbor relativity in protein-protein interaction network.

    Science.gov (United States)

    Moosavi, Sobhan; Rahgozar, Masoud; Rahimi, Amir

    2013-04-01

    There is a large gap between the number of discovered proteins and the number of functionally annotated ones. Due to the high cost of determining protein function by wet-lab research, function prediction has become a major task for computational biology and bioinformatics. Some researches utilize the proteins interaction information to predict function for un-annotated proteins. In this paper, we propose a novel approach called "Neighbor Relativity Coefficient" (NRC) based on interaction network topology which estimates the functional similarity between two proteins. NRC is calculated for each pair of proteins based on their graph-based features including distance, common neighbors and the number of paths between them. In order to ascribe function to an un-annotated protein, NRC estimates a weight for each neighbor to transfer its annotation to the unknown protein. Finally, the unknown protein will be annotated by the top score transferred functions. We also investigate the effect of using different coefficients for various types of functions. The proposed method has been evaluated on Saccharomyces cerevisiae and Homo sapiens interaction networks. The performance analysis demonstrates that NRC yields better results in comparison with previous protein function prediction approaches that utilize interaction network. Copyright © 2012 Elsevier Ltd. All rights reserved.

  7. New similarity of triangular fuzzy number and its application.

    Science.gov (United States)

    Zhang, Xixiang; Ma, Weimin; Chen, Liping

    2014-01-01

    The similarity of triangular fuzzy numbers is an important metric for application of it. There exist several approaches to measure similarity of triangular fuzzy numbers. However, some of them are opt to be large. To make the similarity well distributed, a new method SIAM (Shape's Indifferent Area and Midpoint) to measure triangular fuzzy number is put forward, which takes the shape's indifferent area and midpoint of two triangular fuzzy numbers into consideration. Comparison with other similarity measurements shows the effectiveness of the proposed method. Then, it is applied to collaborative filtering recommendation to measure users' similarity. A collaborative filtering case is used to illustrate users' similarity based on cloud model and triangular fuzzy number; the result indicates that users' similarity based on triangular fuzzy number can obtain better discrimination. Finally, a simulated collaborative filtering recommendation system is developed which uses cloud model and triangular fuzzy number to express users' comprehensive evaluation on items, and result shows that the accuracy of collaborative filtering recommendation based on triangular fuzzy number is higher.

  8. Self-similarity in incompressible Navier-Stokes equations.

    Science.gov (United States)

    Ercan, Ali; Kavvas, M Levent

    2015-12-01

    The self-similarity conditions of the 3-dimensional (3D) incompressible Navier-Stokes equations are obtained by utilizing one-parameter Lie group of point scaling transformations. It is found that the scaling exponents of length dimensions in i = 1, 2, 3 coordinates in 3-dimensions are not arbitrary but equal for the self-similarity of 3D incompressible Navier-Stokes equations. It is also shown that the self-similarity in this particular flow process can be achieved in different time and space scales when the viscosity of the fluid is also scaled in addition to other flow variables. In other words, the self-similarity of Navier-Stokes equations is achievable under different fluid environments in the same or different gravity conditions. Self-similarity criteria due to initial and boundary conditions are also presented. Utilizing the proposed self-similarity conditions of the 3D hydrodynamic flow process, the value of a flow variable at a specified time and space can be scaled to a corresponding value in a self-similar domain at the corresponding time and space.

  9. Mapping monomeric threading to protein-protein structure prediction.

    Science.gov (United States)

    Guerler, Aysam; Govindarajoo, Brandon; Zhang, Yang

    2013-03-25

    The key step of template-based protein-protein structure prediction is the recognition of complexes from experimental structure libraries that have similar quaternary fold. Maintaining two monomer and dimer structure libraries is however laborious, and inappropriate library construction can degrade template recognition coverage. We propose a novel strategy SPRING to identify complexes by mapping monomeric threading alignments to protein-protein interactions based on the original oligomer entries in the PDB, which does not rely on library construction and increases the efficiency and quality of complex template recognitions. SPRING is tested on 1838 nonhomologous protein complexes which can recognize correct quaternary template structures with a TM score >0.5 in 1115 cases after excluding homologous proteins. The average TM score of the first model is 60% and 17% higher than that by HHsearch and COTH, respectively, while the number of targets with an interface RMSD benchmark proteins. Although the relative performance of SPRING and ZDOCK depends on the level of homology filters, a combination of the two methods can result in a significantly higher model quality than ZDOCK at all homology thresholds. These data demonstrate a new efficient approach to quaternary structure recognition that is ready to use for genome-scale modeling of protein-protein interactions due to the high speed and accuracy.

  10. Examining Similarity Structure: Multidimensional Scaling and Related Approaches in Neuroimaging

    Directory of Open Access Journals (Sweden)

    Svetlana V. Shinkareva

    2013-01-01

    Full Text Available This paper covers similarity analyses, a subset of multivariate pattern analysis techniques that are based on similarity spaces defined by multivariate patterns. These techniques offer several advantages and complement other methods for brain data analyses, as they allow for comparison of representational structure across individuals, brain regions, and data acquisition methods. Particular attention is paid to multidimensional scaling and related approaches that yield spatial representations or provide methods for characterizing individual differences. We highlight unique contributions of these methods by reviewing recent applications to functional magnetic resonance imaging data and emphasize areas of caution in applying and interpreting similarity analysis methods.

  11. Multi-Scale Scattering Transform in Music Similarity Measuring

    Science.gov (United States)

    Wang, Ruobai

    Scattering transform is a Mel-frequency spectrum based, time-deformation stable method, which can be used in evaluating music similarity. Compared with Dynamic time warping, it has better performance in detecting similar audio signals under local time-frequency deformation. Multi-scale scattering means to combine scattering transforms of different window lengths. This paper argues that, multi-scale scattering transform is a good alternative of dynamic time warping in music similarity measuring. We tested the performance of multi-scale scattering transform against other popular methods, with data designed to represent different conditions.

  12. IntelliGO: a new vector-based semantic similarity measure including annotation origin

    Directory of Open Access Journals (Sweden)

    Devignes Marie-Dominique

    2010-12-01

    Full Text Available Abstract Background The Gene Ontology (GO is a well known controlled vocabulary describing the biological process, molecular function and cellular component aspects of gene annotation. It has become a widely used knowledge source in bioinformatics for annotating genes and measuring their semantic similarity. These measures generally involve the GO graph structure, the information content of GO aspects, or a combination of both. However, only a few of the semantic similarity measures described so far can handle GO annotations differently according to their origin (i.e. their evidence codes. Results We present here a new semantic similarity measure called IntelliGO which integrates several complementary properties in a novel vector space model. The coefficients associated with each GO term that annotates a given gene or protein include its information content as well as a customized value for each type of GO evidence code. The generalized cosine similarity measure, used for calculating the dot product between two vectors, has been rigorously adapted to the context of the GO graph. The IntelliGO similarity measure is tested on two benchmark datasets consisting of KEGG pathways and Pfam domains grouped as clans, considering the GO biological process and molecular function terms, respectively, for a total of 683 yeast and human genes and involving more than 67,900 pair-wise comparisons. The ability of the IntelliGO similarity measure to express the biological cohesion of sets of genes compares favourably to four existing similarity measures. For inter-set comparison, it consistently discriminates between distinct sets of genes. Furthermore, the IntelliGO similarity measure allows the influence of weights assigned to evidence codes to be checked. Finally, the results obtained with a complementary reference technique give intermediate but correct correlation values with the sequence similarity, Pfam, and Enzyme classifications when compared to

  13. Human cancer protein-protein interaction network: a structural perspective.

    Directory of Open Access Journals (Sweden)

    Gozde Kar

    2009-12-01

    Full Text Available Protein-protein interaction networks provide a global picture of cellular function and biological processes. Some proteins act as hub proteins, highly connected to others, whereas some others have few interactions. The dysfunction of some interactions causes many diseases, including cancer. Proteins interact through their interfaces. Therefore, studying the interface properties of cancer-related proteins will help explain their role in the interaction networks. Similar or overlapping binding sites should be used repeatedly in single interface hub proteins, making them promiscuous. Alternatively, multi-interface hub proteins make use of several distinct binding sites to bind to different partners. We propose a methodology to integrate protein interfaces into cancer interaction networks (ciSPIN, cancer structural protein interface network. The interactions in the human protein interaction network are replaced by interfaces, coming from either known or predicted complexes. We provide a detailed analysis of cancer related human protein-protein interfaces and the topological properties of the cancer network. The results reveal that cancer-related proteins have smaller, more planar, more charged and less hydrophobic binding sites than non-cancer proteins, which may indicate low affinity and high specificity of the cancer-related interactions. We also classified the genes in ciSPIN according to phenotypes. Within phenotypes, for breast cancer, colorectal cancer and leukemia, interface properties were found to be discriminating from non-cancer interfaces with an accuracy of 71%, 67%, 61%, respectively. In addition, cancer-related proteins tend to interact with their partners through distinct interfaces, corresponding mostly to multi-interface hubs, which comprise 56% of cancer-related proteins, and constituting the nodes with higher essentiality in the network (76%. We illustrate the interface related affinity properties of two cancer-related hub

  14. Earthworm coelomocyte extracellular traps: structural and functional similarities with neutrophil NETs.

    Science.gov (United States)

    Homa, Joanna

    2018-03-01

    Invertebrate immunity is associated with natural mechanisms that include cellular and humoral elements, similar to those that play a role in vertebrate innate immune responses. Formation of extracellular traps (ETs) is a newly discovered mechanism to combat pathogens, operating not only in vertebrate leucocytes but also in invertebrate immune cells. The ET components include extracellular DNA (exDNA), antimicrobial proteins and histones. Formation of mammalian ETs depends on enzymes such as neutrophil elastase, myeloperoxidase, the citrullination of histones and protease activity. It was confirmed that coelomocytes-immunocompetent cells of the earthworm Eisenia andrei-are also able to release ETs in a protease-dependent manner, dependent or independent of the formation of reactive oxygen species and rearrangement of the cell cytoskeleton. Similar to vertebrate leukocytes (e.g., neutrophil), coelomocytes are responsible for many immune functions like phagocytosis, cytotoxicity and secretion of humoral factors. ETs formed by coelomocyte analogues to neutrophil ETs consist of exDNA, histone H3 and attached to these structures proteins, e.g., heat shock proteins HSP27. The latter fact confirms that mechanisms of ET release are conserved in evolution. The study on Annelida adds this animal group to the list of invertebrates capable of ET release, but most importantly provides insides into innate mechanisms of ET formation in lower animal taxa.

  15. Immunization of mice with LRP4 induces myasthenia similar to MuSK-associated myasthenia gravis.

    Science.gov (United States)

    Mori, Shuuichi; Motohashi, Norio; Takashima, Rumi; Kishi, Masahiko; Nishimune, Hiroshi; Shigemoto, Kazuhiro

    2017-11-01

    Since the first report of experimental animal models of myasthenia gravis (MG) with autoantibodies against low-density lipoprotein receptor-related protein 4 (LRP4), there have not been any major reports replicating the pathogenicity of anti-LRP4 antibodies (Abs). Recent clinical studies have cast doubt on the specificity and pathogenicity of anti-LRP4 antibodies for MG, highlighting the need for further research. In this study, we purified antigens corresponding to the extracellular region of human LRP4 stably expressed with chaperones in 293 cells and used these antigens to immunize female A/J mice. Immunization with LRP4 protein caused mice to develop myasthenia having similar electrophysiological and histological features as are observed in MG patients with circulating Abs against muscle-specific kinase (MuSK). Our results clearly demonstrate that active immunization of mice with LRP4 proteins causes myasthenia similar to the MG induced by anti-MuSK Abs. Further experimental and clinical studies are required to prove the pathogenicity of anti-LRP4 Abs in MG patients. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  16. Adhesives from modified soy protein

    Science.gov (United States)

    Sun, Susan [Manhattan, KS; Wang, Donghai [Manhattan, KS; Zhong, Zhikai [Manhattan, KS; Yang, Guang [Shanghai, CN

    2008-08-26

    The present invention provides useful adhesive compositions having similar adhesive properties to conventional UF and PPF resins. The compositions generally include a protein portion and modifying ingredient portion selected from the group consisting of carboxyl-containing compounds, aldehyde-containing compounds, epoxy group-containing compounds, and mixtures thereof. The composition is preferably prepared at a pH level at or near the isoelectric point of the protein. In other preferred forms, the adhesive composition includes a protein portion and a carboxyl-containing group portion.

  17. Neutrosophic Cubic MCGDM Method Based on Similarity Measure

    Directory of Open Access Journals (Sweden)

    Surapati Pramanik

    2017-06-01

    Full Text Available The notion of neutrosophic cubic set is originated from the hybridization of the concept of neutrosophic set and interval valued neutrosophic set. We define similarity measure for neutrosophic cubic sets and prove some of its basic properties.

  18. Aviation Safety: FAA and DOD Response to Similar Safety Concerns

    National Research Council Canada - National Science Library

    2002-01-01

    .... The Federal Aviation Administration (FAA) and the military services often face common safety issues as they oversee the operation of similar aircraft or even dissimilar aircraft that use common parts and materials...

  19. Efficient data retrieval method for similar plasma waveforms in EAST

    Energy Technology Data Exchange (ETDEWEB)

    Liu, Ying, E-mail: liuying-ipp@szu.edu.cn [SZU-CASIPP Joint Laboratory for Applied Plasma, Shenzhen University, Shenzhen 518060 (China); Huang, Jianjun; Zhou, Huasheng; Wang, Fan [SZU-CASIPP Joint Laboratory for Applied Plasma, Shenzhen University, Shenzhen 518060 (China); Wang, Feng [Institute of Plasma Physics Chinese Academy of Sciences, Hefei 230031 (China)

    2016-11-15

    Highlights: • The proposed method is carried out by means of bounding envelope and angle distance. • It allows retrieving for whole similar waveforms of any time length. • In addition, the proposed method is also possible to retrieve subsequences. - Abstract: Fusion research relies highly on data analysis due to its massive-sized database. In the present work, we propose an efficient method for searching and retrieving similar plasma waveforms in Experimental Advanced Superconducting Tokamak (EAST). Based on Piecewise Linear Aggregate Approximation (PLAA) for extracting feature values, the searching process is accomplished in two steps. The first one is coarse searching to narrow down the search space, which is carried out by means of bounding envelope. The second step is fine searching to retrieval similar waveforms, which is implemented by the angle distance. The proposed method is tested in EAST databases and turns out to have good performance in retrieving similar waveforms.

  20. Musical structure analysis using similarity matrix and dynamic programming

    Science.gov (United States)

    Shiu, Yu; Jeong, Hong; Kuo, C.-C. Jay

    2005-10-01

    Automatic music segmentation and structure analysis from audio waveforms based on a three-level hierarchy is examined in this research, where the three-level hierarchy includes notes, measures and parts. The pitch class profile (PCP) feature is first extracted at the note level. Then, a similarity matrix is constructed at the measure level, where a dynamic time warping (DTW) technique is used to enhance the similarity computation by taking the temporal distortion of similar audio segments into account. By processing the similarity matrix, we can obtain a coarse-grain music segmentation result. Finally, dynamic programming is applied to the coarse-grain segments so that a song can be decomposed into several major parts such as intro, verse, chorus, bridge and outro. The performance of the proposed music structure analysis system is demonstrated for pop and rock music.

  1. Similarity-Based Interference and the Acquisition of Adjunct Control

    Directory of Open Access Journals (Sweden)

    Juliana Gerard

    2017-10-01

    Full Text Available Previous research on the acquisition of adjunct control has observed non-adultlike behavior for sentences like “John bumped Mary after tripping on the sidewalk.” While adults only allow a subject control interpretation for these sentences (that John tripped on the sidewalk, preschool-aged children have been reported to allow a much wider range of interpretations. A number of different tasks have been used with the aim of identifying a grammatical source of children’s errors. In this paper, we consider the role of extragrammatical factors. In two comprehension experiments, we demonstrate that error rates go up when the similarity increases between an antecedent and a linearly intervening noun phrase, first with similarity in gender, and next with similarity in number marking. This suggests that difficulties with adjunct control are to be explained (at least in part by the sentence processing mechanisms that underlie similarity-based interference in adults.

  2. Interference effects in learning similar sequences of discrete movements

    NARCIS (Netherlands)

    Koedijker, J.M.; Oudejans, R.R.D.; Beek, P.J.

    2010-01-01

    Three experiments were conducted to examine proactive and retroactive interference effects in learning two similar sequences of discrete movements. In each experiment, the participants in the experimental group practiced two movement sequences on consecutive days (1 on each day, order

  3. Interbehavioral psychology and radical behaviorism: Some similarities and differences

    Science.gov (United States)

    Morris, Edward K.

    1984-01-01

    Both J. R. Kantor's interbehavioral psychology and B. F. Skinner's radical behaviorism represent wellarticulated approaches to a natural science of behavior. As such, they share a number of similar features, yet they also differ on a number of dimensions. Some of these similarities and differences are examined by describing their emergence in the professional literature and by comparing the respective units of analysis of the two approaches—the interbehavioral field and the three-term contingency. An evaluation of the similarities and differences shows the similarities to be largely fundamental, and the differences largely ones of emphasis. Nonetheless, the two approaches do make unique contributions to a natural science of behavior, the integration of which can facilitate the development of that science and its acceptance among other sciences and within society at large. PMID:22478612

  4. Temporal self-similar synchronization patterns and scaling in ...

    Indian Academy of Sciences (India)

    Repulsively coupled oscillators; synchronization patterns; self-similar ... system, one expects multistable behavior in analogy to ..... More about the scaling relation between the long-period ... The third type of representation of phases is via.

  5. On finding similar items in a stream of transactions

    DEFF Research Database (Denmark)

    Campagna, Andrea; Pagh, Rasmus

    2010-01-01

    While there has been a lot of work on finding frequent itemsets in transaction data streams, none of these solve the problem of finding similar pairs according to standard similarity measures. This paper is a first attempt at dealing with this, arguably more important, problem. We start out with ...... in random order, and show that surprisingly, not only is small-space similarity mining possible for the most common similarity measures, but the mining accuracy {\\em improves\\/} with the length of the stream for any fixed support threshold....... with a negative result that also explains the lack of theoretical upper bounds on the space usage of data mining algorithms for finding frequent itemsets: Any algorithm that (even only approximately and with a chance of error) finds the most frequent $k$-itemset must use space $\\Omega...

  6. Comparative mapping reveals similar linkage of functional genes to ...

    Indian Academy of Sciences (India)

    genes between O. sativa and B. napus may have consistent function and control similar traits, which may be ..... acea chromosomes reveals islands of conserved organization. ... 1998 Conserved structure and function of the Arabidopsis flow-.

  7. Scaling, Similarity, and the Fourth Paradigm for Hydrology

    Science.gov (United States)

    Peters-Lidard, Christa D.; Clark, Martyn; Samaniego, Luis; Verhoest, Niko E. C.; van Emmerik, Tim; Uijlenhoet, Remko; Achieng, Kevin; Franz, Trenton E.; Woods, Ross

    2017-01-01

    In this synthesis paper addressing hydrologic scaling and similarity, we posit that roadblocks in the search for universal laws of hydrology are hindered by our focus on computational simulation (the third paradigm), and assert that it is time for hydrology to embrace a fourth paradigm of data-intensive science. Advances in information-based hydrologic science, coupled with an explosion of hydrologic data and advances in parameter estimation and modelling, have laid the foundation for a data-driven framework for scrutinizing hydrological scaling and similarity hypotheses. We summarize important scaling and similarity concepts (hypotheses) that require testing, describe a mutual information framework for testing these hypotheses, describe boundary condition, state flux, and parameter data requirements across scales to support testing these hypotheses, and discuss some challenges to overcome while pursuing the fourth hydrological paradigm. We call upon the hydrologic sciences community to develop a focused effort towards adopting the fourth paradigm and apply this to outstanding challenges in scaling and similarity.

  8. Tax-1 and Tax-2 similarities and differences: Focus on post-translational modifications and NF-кB activation

    Directory of Open Access Journals (Sweden)

    Margret eShirinian

    2013-08-01

    Full Text Available ABSTRACTAlthough human T-cell leukemia virus type 1 and 2 (HTLV-1 and HTLV-2 share similar genetic organization, they have major differences in their pathogenesis and disease manifestation. HTLV-1 is capable of transforming T lymphocytes in infected patients and subsequently leads to adult T cell leukemia/lymphoma (ATL whereas HTLV-2 is not clearly associated with lymphoproliferative diseases. Numerous studies have provided accumulating evidence on the involvement of the viral transactivators Tax-1 versus Tax-2 in T cell transformation. Tax-1 is a potent transcriptional activator of both viral and cellular genes. Tax-1 posttranslational modifications and specifically ubiquitylation and SUMOylation have been implicated in NF-кB activation and may contribute to its transformation capacity. Although Tax-2 has similar protein structure compared to Tax-1, the two proteins display differences both in their protein-protein interaction and activation of signal transduction pathways. Recent studies on Tax-2 have suggested ubiquitylation and SUMOylation independent mechanisms of NF-кB activation. In this present review, structural and functional differences between Tax-1 and Tax- 2 will be summarized. Specifically, we will address their subcellular localization, nuclear trafficking and their effect on cellular regulatory proteins. A special attention will be given to Tax-1/Tax-2 post-translational modification such as ubiquitylation, SUMOylation, phosphorylation, acetylation, NF-кB activation and protein-protein interactions involved in oncogenecity both in vivo and in vitro.

  9. Graphemes Sharing Phonetic Features Tend to Induce Similar Synesthetic Colors

    OpenAIRE

    Kang, Mi-Jeong; Kim, Yeseul; Shin, Ji-Young; Kim, Chai-Youn

    2017-01-01

    Individuals with grapheme-color synesthesia experience idiosyncratic colors when viewing achromatic letters or digits. Despite large individual differences in grapheme-color association, synesthetes tend to associate graphemes sharing a perceptual feature with similar synesthetic colors. Sound has been suggested as one such feature. In the present study, we investigated whether graphemes of which representative phonemes have similar phonetic features tend to be associated with analogous synes...

  10. Continuous Improvement and Collaborative Improvement: Similarities and Differences

    DEFF Research Database (Denmark)

    Middel, Rick; Boer, Harry; Fisscher, Olaf

    2006-01-01

    the similarities and differences between key components of continuous and collaborative improvement by assessing what is specific for continuous improvement, what for collaborative improvement, and where the two areas of application meet and overlap. The main conclusions are that there are many more similarities...... between continuous and collaborative improvement. The main differences relate to the role of hierarchy/market, trust, power and commitment to collaboration, all of which are related to differences between the settings in which continuous and collaborative improvement unfold....

  11. Decoding Decoders: Finding Optimal Representation Spaces for Unsupervised Similarity Tasks

    OpenAIRE

    Zhelezniak, Vitalii; Busbridge, Dan; Shen, April; Smith, Samuel L.; Hammerla, Nils Y.

    2018-01-01

    Experimental evidence indicates that simple models outperform complex deep networks on many unsupervised similarity tasks. We provide a simple yet rigorous explanation for this behaviour by introducing the concept of an optimal representation space, in which semantically close symbols are mapped to representations that are close under a similarity measure induced by the model's objective function. In addition, we present a straightforward procedure that, without any retraining or architectura...

  12. Inter Genre Similarity Modelling For Automatic Music Genre Classification

    OpenAIRE

    Bagci, Ulas; Erzin, Engin

    2009-01-01

    Music genre classification is an essential tool for music information retrieval systems and it has been finding critical applications in various media platforms. Two important problems of the automatic music genre classification are feature extraction and classifier design. This paper investigates inter-genre similarity modelling (IGS) to improve the performance of automatic music genre classification. Inter-genre similarity information is extracted over the mis-classified feature population....

  13. Word Similarity from Dictionaries: Inferring Fuzzy Measures from Fuzzy Graphs

    Directory of Open Access Journals (Sweden)

    Vicenc Torra

    2008-01-01

    Full Text Available WORD SIMILARITY FROM DICTIONARIES: INFERRING FUZZY MEASURES FROM FUZZY GRAPHS The computation of similarities between words is a basic element of information retrieval systems, when retrieval is not solely based on word matching. In this work we consider a measure between words based on dictionaries. This is achieved assuming that a dictionary is formalized as a fuzzy graph. We show that the approach permits to compute measures not only for pairs of words but for sets of them.

  14. Chromatographic fingerprint similarity analysis for pollutant source identification

    International Nuclear Information System (INIS)

    Xie, Juan-Ping; Ni, Hong-Gang

    2015-01-01

    In the present study, a similarity analysis method was proposed to evaluate the source-sink relationships among environmental media for polybrominated diphenyl ethers (PBDEs), which were taken as the representative contaminants. Chromatographic fingerprint analysis has been widely used in the fields of natural products chemistry and forensic chemistry, but its application to environmental science has been limited. We established a library of various sources of media containing contaminants (e.g., plastics), recognizing that the establishment of a more comprehensive library allows for a better understanding of the sources of contamination. We then compared an environmental complex mixture (e.g., sediment, soil) with the profiles in the library. These comparisons could be used as the first step in source tracking. The cosine similarities between plastic and soil or sediment ranged from 0.53 to 0.68, suggesting that plastic in electronic waste is an important source of PBDEs in the environment, but it is not the only source. A similarity analysis between soil and sediment indicated that they have a source-sink relationship. Generally, the similarity analysis method can encompass more relevant information of complex mixtures in the environment than a profile-based approach that only focuses on target pollutants. There is an inherent advantage to creating a data matrix containing all peaks and their relative levels after matching the peaks based on retention times and peak areas. This data matrix can be used for source identification via a similarity analysis without quantitative or qualitative analysis of all chemicals in a sample. - Highlights: • Chromatographic fingerprint analysis can be used as the first step in source tracking. • Similarity analysis method can encompass more relevant information of pollution. • The fingerprints strongly depend on the chromatographic conditions. • A more effective and robust method for identifying similarities is required

  15. Effective Results Analysis for the Similar Software Products’ Orthogonality

    OpenAIRE

    Ion Ivan; Daniel Milodin

    2009-01-01

    It is defined the concept of similar software. There are established conditions of archiving the software components. It is carried out the orthogonality evaluation and the correlation between the orthogonality and the complexity of the homogenous software components is analyzed. Shall proceed to build groups of similar software products, belonging to the orthogonality intervals. There are presented in graphical form the results of the analysis. There are detailed aspects of the functioning o...

  16. Effective Results Analysis for the Similar Software Products’ Orthogonality

    Directory of Open Access Journals (Sweden)

    Ion Ivan

    2009-10-01

    Full Text Available It is defined the concept of similar software. There are established conditions of archiving the software components. It is carried out the orthogonality evaluation and the correlation between the orthogonality and the complexity of the homogenous software components is analyzed. Shall proceed to build groups of similar software products, belonging to the orthogonality intervals. There are presented in graphical form the results of the analysis. There are detailed aspects of the functioning of the software product allocated for the orthogonality.

  17. Plant injury due to air pollution - similar symptoms. Part I

    Energy Technology Data Exchange (ETDEWEB)

    Matsuoka, Y

    1976-01-01

    Many plant diseases cause injuries to leaves which mimic the damage inflicted by air pollution. The relationship between air pollution injuries and those caused by meteorological conditions are discussed. Rice plants often contract akagare which causes reddish-brown spots on leaves similar to the symptoms caused by photochemical oxidants. Spider mites produce leaf damage in kidney beans which mimics the spotting caused by photochemical oxidants. Lace bugs produce minute white spots on azaleas similar to those caused by photochemical oxidants.

  18. Mixed quantization dimensions of self-similar measures

    International Nuclear Information System (INIS)

    Dai Meifeng; Wang Xiaoli; Chen Dandan

    2012-01-01

    Highlights: ► We define the mixed quantization dimension of finitely many measures. ► Formula of mixed quantization dimensions of self-similar measures is given. ► Illustrate the behavior of mixed quantization dimension as a function of order. - Abstract: Classical multifractal analysis studies the local scaling behaviors of a single measure. However recently mixed multifractal has generated interest. The purpose of this paper is some results about the mixed quantization dimensions of self-similar measures.

  19. Estimating correlation and covariance matrices by weighting of market similarity

    OpenAIRE

    Michael C. M\\"unnix; Rudi Sch\\"afer; Oliver Grothe

    2010-01-01

    We discuss a weighted estimation of correlation and covariance matrices from historical financial data. To this end, we introduce a weighting scheme that accounts for similarity of previous market conditions to the present one. The resulting estimators are less biased and show lower variance than either unweighted or exponentially weighted estimators. The weighting scheme is based on a similarity measure which compares the current correlation structure of the market to the structures at past ...

  20. Protein intrinsic disorder in plants.

    Science.gov (United States)

    Pazos, Florencio; Pietrosemoli, Natalia; García-Martín, Juan A; Solano, Roberto

    2013-09-12

    To some extent contradicting the classical paradigm of the relationship between protein 3D structure and function, now it is clear that large portions of the proteomes, especially in higher organisms, lack a fixed structure and still perform very important functions. Proteins completely or partially unstructured in their native (functional) form are involved in key cellular processes underlain by complex networks of protein interactions. The intrinsic conformational flexibility of these disordered proteins allows them to bind multiple partners in transient interactions of high specificity and low affinity. In concordance, in plants this type of proteins has been found in processes requiring these complex and versatile interaction networks. These include transcription factor networks, where disordered proteins act as integrators of different signals or link different transcription factor subnetworks due to their ability to interact (in many cases simultaneously) with different partners. Similarly, they also serve as signal integrators in signaling cascades, such as those related to response to external stimuli. Disordered proteins have also been found in plants in many stress-response processes, acting as protein chaperones or protecting other cellular components and structures. In plants, it is especially important to have complex and versatile networks able to quickly and efficiently respond to changing environmental conditions since these organisms cannot escape and have no other choice than adapting to them. Consequently, protein disorder can play an especially important role in plants, providing them with a fast mechanism to obtain complex, interconnected and versatile molecular networks.

  1. Protein intrinsic disorder in plants

    Directory of Open Access Journals (Sweden)

    Florencio ePazos

    2013-09-01

    Full Text Available To some extent contradicting the classical paradigm of the relationship between protein 3D structure and function, now it is clear that large portions of the proteomes, especially in higher organisms, lack a fixed structure and still perform very important functions. Proteins completely or partially unstructured in their native (functional form are involved in key cellular processes underlain by complex networks of protein interactions. The intrinsic conformational flexibility of these disordered proteins allows them to bind multiple partners in transient interactions of high specificity and low affinity. In concordance, in plants this type of proteins has been found in processes requiring these complex and versatile interaction networks. These include transcription factor networks, where disordered proteins act as integrators of different signals or link different transcription factor subnetworks due to their ability to interact (in many cases simultaneously with different partners. Similarly, they also serve as signal integrators in signalling cascades, such as those related to response to external stimuli. Disordered proteins have also been found in plants in many stress-response processes, acting as protein chaperones or protecting other cellular components and structures. In plants, it is especially important to have complex and versatile networks able to quickly and efficiently respond to changing environmental conditions since these organisms can not escape and have no other choice than adapting to them. Consequently, protein disorder can play an especially important role in plants, providing them with a fast mechanism to obtain complex, interconnected and versatile molecular networks.

  2. Distributional Similarity for Chinese: Exploiting Characters and Radicals

    Directory of Open Access Journals (Sweden)

    Peng Jin

    2012-01-01

    Full Text Available Distributional Similarity has attracted considerable attention in the field of natural language processing as an automatic means of countering the ubiquitous problem of sparse data. As a logographic language, Chinese words consist of characters and each of them is composed of one or more radicals. The meanings of characters are usually highly related to the words which contain them. Likewise, radicals often make a predictable contribution to the meaning of a character: characters that have the same components tend to have similar or related meanings. In this paper, we utilize these properties of the Chinese language to improve Chinese word similarity computation. Given a content word, we first extract similar words based on a large corpus and a similarity score for ranking. This rank is then adjusted according to the characters and components shared between the similar word and the target word. Experiments on two gold standard datasets show that the adjusted rank is superior and closer to human judgments than the original rank. In addition to quantitative evaluation, we examine the reasons behind errors drawing on linguistic phenomena for our explanations.

  3. Phishing Detection: Analysis of Visual Similarity Based Approaches

    Directory of Open Access Journals (Sweden)

    Ankit Kumar Jain

    2017-01-01

    Full Text Available Phishing is one of the major problems faced by cyber-world and leads to financial losses for both industries and individuals. Detection of phishing attack with high accuracy has always been a challenging issue. At present, visual similarities based techniques are very useful for detecting phishing websites efficiently. Phishing website looks very similar in appearance to its corresponding legitimate website to deceive users into believing that they are browsing the correct website. Visual similarity based phishing detection techniques utilise the feature set like text content, text format, HTML tags, Cascading Style Sheet (CSS, image, and so forth, to make the decision. These approaches compare the suspicious website with the corresponding legitimate website by using various features and if the similarity is greater than the predefined threshold value then it is declared phishing. This paper presents a comprehensive analysis of phishing attacks, their exploitation, some of the recent visual similarity based approaches for phishing detection, and its comparative study. Our survey provides a better understanding of the problem, current solution space, and scope of future research to deal with phishing attacks efficiently using visual similarity based approaches.

  4. Information assessment on predicting protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Gerstein Mark

    2004-10-01

    Full Text Available Abstract Background Identifying protein-protein interactions is fundamental for understanding the molecular machinery of the cell. Proteome-wide studies of protein-protein interactions are of significant value, but the high-throughput experimental technologies suffer from high rates of both false positive and false negative predictions. In addition to high-throughput experimental data, many diverse types of genomic data can help predict protein-protein interactions, such as mRNA expression, localization, essentiality, and functional annotation. Evaluations of the information contributions from different evidences help to establish more parsimonious models with comparable or better prediction accuracy, and to obtain biological insights of the relationships between protein-protein interactions and other genomic information. Results Our assessment is based on the genomic features used in a Bayesian network approach to predict protein-protein interactions genome-wide in yeast. In the special case, when one does not have any missing information about any of the features, our analysis shows that there is a larger information contribution from the functional-classification than from expression correlations or essentiality. We also show that in this case alternative models, such as logistic regression and random forest, may be more effective than Bayesian networks for predicting interactions. Conclusions In the restricted problem posed by the complete-information subset, we identified that the MIPS and Gene Ontology (GO functional similarity datasets as the dominating information contributors for predicting the protein-protein interactions under the framework proposed by Jansen et al. Random forests based on the MIPS and GO information alone can give highly accurate classifications. In this particular subset of complete information, adding other genomic data does little for improving predictions. We also found that the data discretizations used in the

  5. Novel OBP genes similar to hamster Aphrodisin in the bank vole, Myodes glareolus

    Directory of Open Access Journals (Sweden)

    Šandera Martin

    2010-01-01

    level we have detected further variants and thus we assume that similarly as Major Urinary Proteins in mice, these proteins may be important in chemical communication in this Cricetid rodent.

  6. A similarity measure method combining location feature for mammogram retrieval.

    Science.gov (United States)

    Wang, Zhiqiong; Xin, Junchang; Huang, Yukun; Li, Chen; Xu, Ling; Li, Yang; Zhang, Hao; Gu, Huizi; Qian, Wei

    2018-05-28

    Breast cancer, the most common malignancy among women, has a high mortality rate in clinical practice. Early detection, diagnosis and treatment can reduce the mortalities of breast cancer greatly. The method of mammogram retrieval can help doctors to find the early breast lesions effectively and determine a reasonable feature set for image similarity measure. This will improve the accuracy effectively for mammogram retrieval. This paper proposes a similarity measure method combining location feature for mammogram retrieval. Firstly, the images are pre-processed, the regions of interest are detected and the lesions are segmented in order to get the center point and radius of the lesions. Then, the method, namely Coherent Point Drift, is used for image registration with the pre-defined standard image. The center point and radius of the lesions after registration are obtained and the standard location feature of the image is constructed. This standard location feature can help figure out the location similarity between the image pair from the query image to each dataset image in the database. Next, the content feature of the image is extracted, including the Histogram of Oriented Gradients, the Edge Direction Histogram, the Local Binary Pattern and the Gray Level Histogram, and the image pair content similarity can be calculated using the Earth Mover's Distance. Finally, the location similarity and content similarity are fused to form the image fusion similarity, and the specified number of the most similar images can be returned according to it. In the experiment, 440 mammograms, which are from Chinese women in Northeast China, are used as the database. When fusing 40% lesion location feature similarity and 60% content feature similarity, the results have obvious advantages. At this time, precision is 0.83, recall is 0.76, comprehensive indicator is 0.79, satisfaction is 96.0%, mean is 4.2 and variance is 17.7. The results show that the precision and recall of this

  7. BSSF: a fingerprint based ultrafast binding site similarity search and function analysis server

    Directory of Open Access Journals (Sweden)

    Jiang Hualiang

    2010-01-01

    Full Text Available Abstract Background Genome sequencing and post-genomics projects such as structural genomics are extending the frontier of the study of sequence-structure-function relationship of genes and their products. Although many sequence/structure-based methods have been devised with the aim of deciphering this delicate relationship, there still remain large gaps in this fundamental problem, which continuously drives researchers to develop novel methods to extract relevant information from sequences and structures and to infer the functions of newly identified genes by genomics technology. Results Here we present an ultrafast method, named BSSF(Binding Site Similarity & Function, which enables researchers to conduct similarity searches in a comprehensive three-dimensional binding site database extracted from PDB structures. This method utilizes a fingerprint representation of the binding site and a validated statistical Z-score function scheme to judge the similarity between the query and database items, even if their similarities are only constrained in a sub-pocket. This fingerprint based similarity measurement was also validated on a known binding site dataset by comparing with geometric hashing, which is a standard 3D similarity method. The comparison clearly demonstrated the utility of this ultrafast method. After conducting the database searching, the hit list is further analyzed to provide basic statistical information about the occurrences of Gene Ontology terms and Enzyme Commission numbers, which may benefit researchers by helping them to design further experiments to study the query proteins. Conclusions This ultrafast web-based system will not only help researchers interested in drug design and structural genomics to identify similar binding sites, but also assist them by providing further analysis of hit list from database searching.

  8. Link-Based Similarity Measures Using Reachability Vectors

    Directory of Open Access Journals (Sweden)

    Seok-Ho Yoon

    2014-01-01

    Full Text Available We present a novel approach for computing link-based similarities among objects accurately by utilizing the link information pertaining to the objects involved. We discuss the problems with previous link-based similarity measures and propose a novel approach for computing link based similarities that does not suffer from these problems. In the proposed approach each target object is represented by a vector. Each element of the vector corresponds to all the objects in the given data, and the value of each element denotes the weight for the corresponding object. As for this weight value, we propose to utilize the probability of reaching from the target object to the specific object, computed using the “Random Walk with Restart” strategy. Then, we define the similarity between two objects as the cosine similarity of the two vectors. In this paper, we provide examples to show that our approach does not suffer from the aforementioned problems. We also evaluate the performance of the proposed methods in comparison with existing link-based measures, qualitatively and quantitatively, with respect to two kinds of data sets, scientific papers and Web documents. Our experimental results indicate that the proposed methods significantly outperform the existing measures.

  9. Brand name confusion: Subjective and objective measures of orthographic similarity.

    Science.gov (United States)

    Burt, Jennifer S; McFarlane, Kimberley A; Kelly, Sarah J; Humphreys, Michael S; Weatherall, Kimberlee; Burrell, Robert G

    2017-09-01

    Determining brand name similarity is vital in areas of trademark registration and brand confusion. Students rated the orthographic (spelling) similarity of word pairs (Experiments 1, 2, and 4) and brand name pairs (Experiment 5). Similarity ratings were consistently higher when words shared beginnings rather than endings, whereas shared pronunciation of the stressed vowel had small and less consistent effects on ratings. In Experiment 3 a behavioral task confirmed the similarity of shared beginnings in lexical processing. Specifically, in a task requiring participants to decide whether 2 words presented in the clear (a probe and a later target) were the same or different, a masked prime word preceding the target shortened response latencies if it shared its initial 3 letters with the target. The ratings of students for word and brand name pairs were strongly predicted by metrics of orthographic similarity from the visual word identification literature based on the number of shared letters and their relative positions. The results indicate a potential use for orthographic metrics in brand name registration and trademark law. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  10. A Model-Based Approach to Constructing Music Similarity Functions

    Directory of Open Access Journals (Sweden)

    Lamere Paul

    2007-01-01

    Full Text Available Several authors have presented systems that estimate the audio similarity of two pieces of music through the calculation of a distance metric, such as the Euclidean distance, between spectral features calculated from the audio, related to the timbre or pitch of the signal. These features can be augmented with other, temporally or rhythmically based features such as zero-crossing rates, beat histograms, or fluctuation patterns to form a more well-rounded music similarity function. It is our contention that perceptual or cultural labels, such as the genre, style, or emotion of the music, are also very important features in the perception of music. These labels help to define complex regions of similarity within the available feature spaces. We demonstrate a machine-learning-based approach to the construction of a similarity metric, which uses this contextual information to project the calculated features into an intermediate space where a music similarity function that incorporates some of the cultural information may be calculated.

  11. Mechanics of ultra-stretchable self-similar serpentine interconnects

    International Nuclear Information System (INIS)

    Zhang, Yihui; Fu, Haoran; Su, Yewang; Xu, Sheng

    2013-01-01

    Graphical abstract: We developed analytical models of flexibility and elastic-stretchability for self-similar interconnect. The analytic solutions agree very well with the finite element analyses, both demonstrating that the elastic-stretchability more than doubles when the order of self-similar structure increases by one. Design optimization yields 90% and 50% elastic stretchability for systems with surface filling ratios of 50% and 70% of active devices, respectively. The analytic models are useful for the development of stretchable electronics that simultaneously demand large coverage of active devices, such as stretchable photovoltaics and electronic eye-ball cameras. -- Abstract: Electrical interconnects that adopt self-similar, serpentine layouts offer exceptional levels of stretchability in systems that consist of collections of small, non-stretchable active devices in the so-called island–bridge design. This paper develops analytical models of flexibility and elastic stretchability for such structures, and establishes recursive formulae at different orders of self-similarity. The analytic solutions agree well with finite element analysis, with both demonstrating that the elastic stretchability more than doubles when the order of the self-similar structure increases by one. Design optimization yields 90% and 50% elastic stretchability for systems with surface filling ratios of 50% and 70% of active devices, respectively

  12. Similarity relations in visual search predict rapid visual categorization

    Science.gov (United States)

    Mohan, Krithika; Arun, S. P.

    2012-01-01

    How do we perform rapid visual categorization?It is widely thought that categorization involves evaluating the similarity of an object to other category items, but the underlying features and similarity relations remain unknown. Here, we hypothesized that categorization performance is based on perceived similarity relations between items within and outside the category. To this end, we measured the categorization performance of human subjects on three diverse visual categories (animals, vehicles, and tools) and across three hierarchical levels (superordinate, basic, and subordinate levels among animals). For the same subjects, we measured their perceived pair-wise similarities between objects using a visual search task. Regardless of category and hierarchical level, we found that the time taken to categorize an object could be predicted using its similarity to members within and outside its category. We were able to account for several classic categorization phenomena, such as (a) the longer times required to reject category membership; (b) the longer times to categorize atypical objects; and (c) differences in performance across tasks and across hierarchical levels. These categorization times were also accounted for by a model that extracts coarse structure from an image. The striking agreement observed between categorization and visual search suggests that these two disparate tasks depend on a shared coarse object representation. PMID:23092947

  13. How similar are recognition memory and inductive reasoning?

    Science.gov (United States)

    Hayes, Brett K; Heit, Evan

    2013-07-01

    Conventionally, memory and reasoning are seen as different types of cognitive activities driven by different processes. In two experiments, we challenged this view by examining the relationship between recognition memory and inductive reasoning involving multiple forms of similarity. A common study set (members of a conjunctive category) was followed by a test set containing old and new category members, as well as items that matched the study set on only one dimension. The study and test sets were presented under recognition or induction instructions. In Experiments 1 and 2, the inductive property being generalized was varied in order to direct attention to different dimensions of similarity. When there was no time pressure on decisions, patterns of positive responding were strongly affected by property type, indicating that different types of similarity were driving recognition and induction. By comparison, speeded judgments showed weaker property effects and could be explained by generalization based on overall similarity. An exemplar model, GEN-EX (GENeralization from EXamples), could account for both the induction and recognition data. These findings show that induction and recognition share core component processes, even when the tasks involve flexible forms of similarity.

  14. A Measure of Similarity Between Trajectories of Vessels

    Directory of Open Access Journals (Sweden)

    Le QI

    2016-03-01

    Full Text Available The measurement of similarity between trajectories of vessels is one of the kernel problems that must be addressed to promote the development of maritime intelligent traffic system (ITS. In this study, a new model of trajectory similarity measurement was established to improve the data processing efficiency in dynamic application and to reflect actual sailing behaviors of vessels. In this model, a feature point detection algorithm was proposed to extract feature points, reduce data storage space and save computational resources. A new synthesized distance algorithm was also created to measure the similarity between trajectories by using the extracted feature points. An experiment was conducted to measure the similarity between the real trajectories of vessels. The growth of these trajectories required measurements to be conducted under different voyages. The results show that the similarity measurement between the vessel trajectories is efficient and correct. Comparison of the synthesized distance with the sailing behaviors of vessels proves that results are consistent with actual situations. The experiment results demonstrate the promising application of the proposed model in studying vessel traffic and in supplying reliable data for the development of maritime ITS.

  15. AREAL FEATURE MATCHING BASED ON SIMILARITY USING CRITIC METHOD

    Directory of Open Access Journals (Sweden)

    J. Kim

    2015-10-01

    Full Text Available In this paper, we propose an areal feature matching method that can be applied for many-to-many matching, which involves matching a simple entity with an aggregate of several polygons or two aggregates of several polygons with fewer user intervention. To this end, an affine transformation is applied to two datasets by using polygon pairs for which the building name is the same. Then, two datasets are overlaid with intersected polygon pairs that are selected as candidate matching pairs. If many polygons intersect at this time, we calculate the inclusion function between such polygons. When the value is more than 0.4, many of the polygons are aggregated as single polygons by using a convex hull. Finally, the shape similarity is calculated between the candidate pairs according to the linear sum of the weights computed in CRITIC method and the position similarity, shape ratio similarity, and overlap similarity. The candidate pairs for which the value of the shape similarity is more than 0.7 are determined as matching pairs. We applied the method to two geospatial datasets: the digital topographic map and the KAIS map in South Korea. As a result, the visual evaluation showed two polygons that had been well detected by using the proposed method. The statistical evaluation indicates that the proposed method is accurate when using our test dataset with a high F-measure of 0.91.

  16. Areal Feature Matching Based on Similarity Using Critic Method

    Science.gov (United States)

    Kim, J.; Yu, K.

    2015-10-01

    In this paper, we propose an areal feature matching method that can be applied for many-to-many matching, which involves matching a simple entity with an aggregate of several polygons or two aggregates of several polygons with fewer user intervention. To this end, an affine transformation is applied to two datasets by using polygon pairs for which the building name is the same. Then, two datasets are overlaid with intersected polygon pairs that are selected as candidate matching pairs. If many polygons intersect at this time, we calculate the inclusion function between such polygons. When the value is more than 0.4, many of the polygons are aggregated as single polygons by using a convex hull. Finally, the shape similarity is calculated between the candidate pairs according to the linear sum of the weights computed in CRITIC method and the position similarity, shape ratio similarity, and overlap similarity. The candidate pairs for which the value of the shape similarity is more than 0.7 are determined as matching pairs. We applied the method to two geospatial datasets: the digital topographic map and the KAIS map in South Korea. As a result, the visual evaluation showed two polygons that had been well detected by using the proposed method. The statistical evaluation indicates that the proposed method is accurate when using our test dataset with a high F-measure of 0.91.

  17. SpolSimilaritySearch - A web tool to compare and search similarities between spoligotypes of Mycobacterium tuberculosis complex.

    Science.gov (United States)

    Couvin, David; Zozio, Thierry; Rastogi, Nalin

    2017-07-01

    Spoligotyping is one of the most commonly used polymerase chain reaction (PCR)-based methods for identification and study of genetic diversity of Mycobacterium tuberculosis complex (MTBC). Despite its known limitations if used alone, the methodology is particularly useful when used in combination with other methods such as mycobacterial interspersed repetitive units - variable number of tandem DNA repeats (MIRU-VNTRs). At a worldwide scale, spoligotyping has allowed identification of information on 103,856 MTBC isolates (corresponding to 98049 clustered strains plus 5807 unique isolates from 169 countries of patient origin) contained within the SITVIT2 proprietary database of the Institut Pasteur de la Guadeloupe. The SpolSimilaritySearch web-tool described herein (available at: http://www.pasteur-guadeloupe.fr:8081/SpolSimilaritySearch) incorporates a similarity search algorithm allowing users to get a complete overview of similar spoligotype patterns (with information on presence or absence of 43 spacers) in the aforementioned worldwide database. This tool allows one to analyze spread and evolutionary patterns of MTBC by comparing similar spoligotype patterns, to distinguish between widespread, specific and/or confined patterns, as well as to pinpoint patterns with large deleted blocks, which play an intriguing role in the genetic epidemiology of M. tuberculosis. Finally, the SpolSimilaritySearch tool also provides with the country distribution patterns for each queried spoligotype. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. Protein nanoparticles for therapeutic protein delivery.

    Science.gov (United States)

    Herrera Estrada, L P; Champion, J A

    2015-06-01

    Therapeutic proteins can face substantial challenges to their activity, requiring protein modification or use of a delivery vehicle. Nanoparticles can significantly enhance delivery of encapsulated cargo, but traditional small molecule carriers have some limitations in their use for protein delivery. Nanoparticles made from protein have been proposed as alternative carriers and have benefits specific to therapeutic protein delivery. This review describes protein nanoparticles made by self-assembly, including protein cages, protein polymers, and charged or amphipathic peptides, and by desolvation. It presents particle fabrication and delivery characterization for a variety of therapeutic and model proteins, as well as comparison of the features of different protein nanoparticles.

  19. Modularity in protein structures: study on all-alpha proteins.

    Science.gov (United States)

    Khan, Taushif; Ghosh, Indira

    2015-01-01

    Modularity is known as one of the most important features of protein's robust and efficient design. The architecture and topology of proteins play a vital role by providing necessary robust scaffolds to support organism's growth and survival in constant evolutionary pressure. These complex biomolecules can be represented by several layers of modular architecture, but it is pivotal to understand and explore the smallest biologically relevant structural component. In the present study, we have developed a component-based method, using protein's secondary structures and their arrangements (i.e. patterns) in order to investigate its structural space. Our result on all-alpha protein shows that the known structural space is highly populated with limited set of structural patterns. We have also noticed that these frequently observed structural patterns are present as modules or "building blocks" in large proteins (i.e. higher secondary structure content). From structural descriptor analysis, observed patterns are found to be within similar deviation; however, frequent patterns are found to be distinctly occurring in diverse functions e.g. in enzymatic classes and reactions. In this study, we are introducing a simple approach to explore protein structural space using combinatorial- and graph-based geometry methods, which can be used to describe modularity in protein structures. Moreover, analysis indicates that protein function seems to be the driving force that shapes the known structure space.

  20. The Hofmethode: Computing Semantic Similarities between E-Learning Products

    Directory of Open Access Journals (Sweden)

    Oliver Michel

    2009-11-01

    Full Text Available The key task in building useful e-learning repositories is to develop a system with an algorithm allowing users to retrieve information that corresponds to their specific requirements. To achieve this, products (or their verbal descriptions, i.e. presented in metadata need to be compared and structured according to the results of this comparison. Such structuring is crucial insofar as there are many search results that correspond to the entered keyword. The Hofmethode is an algorithm (based on psychological considerations to compute semantic similarities between texts and therefore offer a way to compare e-learning products. The computed similarity values are used to build semantic maps in which the products are visually arranged according to their similarities. The paper describes how the Hofmethode is implemented in the online database edulap, and how it contributes to help the user to explore the data in which he is interested.