Sable, Rushikesh; Jois, Seetharama
2015-06-23
Blocking protein-protein interactions (PPI) using small molecules or peptides modulates biochemical pathways and has therapeutic significance. PPI inhibition for designing drug-like molecules is a new area that has been explored extensively during the last decade. Considering the number of available PPI inhibitor databases and the limited number of 3D structures available for proteins, docking and scoring methods play a major role in designing PPI inhibitors as well as stabilizers. Docking methods are used in the design of PPI inhibitors at several stages of finding a lead compound, including modeling the protein complex, screening for hot spots on the protein-protein interaction interface and screening small molecules or peptides that bind to the PPI interface. There are three major challenges to the use of docking on the relatively flat surfaces of PPI. In this review we will provide some examples of the use of docking in PPI inhibitor design as well as its limitations. The combination of experimental and docking methods with improved scoring function has thus far resulted in few success stories of PPI inhibitors for therapeutic purposes. Docking algorithms used for PPI are in the early stages, however, and as more data are available docking will become a highly promising area in the design of PPI inhibitors or stabilizers.
PPI finder: a mining tool for human protein-protein interactions.
Directory of Open Access Journals (Sweden)
Min He
Full Text Available BACKGROUND: The exponential increase of published biomedical literature prompts the use of text mining tools to manage the information overload automatically. One of the most common applications is to mine protein-protein interactions (PPIs from PubMed abstracts. Currently, most tools in mining PPIs from literature are using co-occurrence-based approaches or rule-based approaches. Hybrid methods (frame-based approaches by combining these two methods may have better performance in predicting PPIs. However, the predicted PPIs from these methods are rarely evaluated by known PPI databases and co-occurred terms in Gene Ontology (GO database. METHODOLOGY/PRINCIPAL FINDINGS: We here developed a web-based tool, PPI Finder, to mine human PPIs from PubMed abstracts based on their co-occurrences and interaction words, followed by evidences in human PPI databases and shared terms in GO database. Only 28% of the co-occurred pairs in PubMed abstracts appeared in any of the commonly used human PPI databases (HPRD, BioGRID and BIND. On the other hand, of the known PPIs in HPRD, 69% showed co-occurrences in the literature, and 65% shared GO terms. CONCLUSIONS: PPI Finder provides a useful tool for biologists to uncover potential novel PPIs. It is freely accessible at http://liweilab.genetics.ac.cn/tm/.
Li, Chuanxi
2014-01-01
Mining Protein-Protein Interactions (PPIs) from the fast-growing biomedical literature resources has been proven as an effective approach for the identifi cation of biological regulatory networks. This paper presents a novel method based on the idea of Interaction Relation Ontology (IRO), which specifi es and organises words of various proteins interaction relationships. Our method is a two-stage PPI extraction method. At fi rst, IRO is applied in a binary classifi er to determine whether sentences contain a relation or not. Then, IRO is taken to guide PPI extraction by building sentence dependency parse tree. Comprehensive and quantitative evaluations and detailed analyses are used to demonstrate the signifi cant performance of IRO on relation sentences classifi cation and PPI extraction. Our PPI extraction method yielded a recall of around 80% and 90% and an F1 of around 54% and 66% on corpora of AIMed and Bioinfer, respectively, which are superior to most existing extraction methods. Copyright © 2014 Inderscience Enterprises Ltd.
Directory of Open Access Journals (Sweden)
Rushikesh Sable
2015-06-01
Full Text Available Blocking protein-protein interactions (PPI using small molecules or peptides modulates biochemical pathways and has therapeutic significance. PPI inhibition for designing drug-like molecules is a new area that has been explored extensively during the last decade. Considering the number of available PPI inhibitor databases and the limited number of 3D structures available for proteins, docking and scoring methods play a major role in designing PPI inhibitors as well as stabilizers. Docking methods are used in the design of PPI inhibitors at several stages of finding a lead compound, including modeling the protein complex, screening for hot spots on the protein-protein interaction interface and screening small molecules or peptides that bind to the PPI interface. There are three major challenges to the use of docking on the relatively flat surfaces of PPI. In this review we will provide some examples of the use of docking in PPI inhibitor design as well as its limitations. The combination of experimental and docking methods with improved scoring function has thus far resulted in few success stories of PPI inhibitors for therapeutic purposes. Docking algorithms used for PPI are in the early stages, however, and as more data are available docking will become a highly promising area in the design of PPI inhibitors or stabilizers.
HomPPI: a class of sequence homology based protein-protein interface prediction methods
Directory of Open Access Journals (Sweden)
Dobbs Drena
2011-06-01
Full Text Available Abstract Background Although homology-based methods are among the most widely used methods for predicting the structure and function of proteins, the question as to whether interface sequence conservation can be effectively exploited in predicting protein-protein interfaces has been a subject of debate. Results We studied more than 300,000 pair-wise alignments of protein sequences from structurally characterized protein complexes, including both obligate and transient complexes. We identified sequence similarity criteria required for accurate homology-based inference of interface residues in a query protein sequence. Based on these analyses, we developed HomPPI, a class of sequence homology-based methods for predicting protein-protein interface residues. We present two variants of HomPPI: (i NPS-HomPPI (Non partner-specific HomPPI, which can be used to predict interface residues of a query protein in the absence of knowledge of the interaction partner; and (ii PS-HomPPI (Partner-specific HomPPI, which can be used to predict the interface residues of a query protein with a specific target protein. Our experiments on a benchmark dataset of obligate homodimeric complexes show that NPS-HomPPI can reliably predict protein-protein interface residues in a given protein, with an average correlation coefficient (CC of 0.76, sensitivity of 0.83, and specificity of 0.78, when sequence homologs of the query protein can be reliably identified. NPS-HomPPI also reliably predicts the interface residues of intrinsically disordered proteins. Our experiments suggest that NPS-HomPPI is competitive with several state-of-the-art interface prediction servers including those that exploit the structure of the query proteins. The partner-specific classifier, PS-HomPPI can, on a large dataset of transient complexes, predict the interface residues of a query protein with a specific target, with a CC of 0.65, sensitivity of 0.69, and specificity of 0.70, when homologs of
Directory of Open Access Journals (Sweden)
Ching-Tai Chen
Full Text Available Protein-protein interactions are key to many biological processes. Computational methodologies devised to predict protein-protein interaction (PPI sites on protein surfaces are important tools in providing insights into the biological functions of proteins and in developing therapeutics targeting the protein-protein interaction sites. One of the general features of PPI sites is that the core regions from the two interacting protein surfaces are complementary to each other, similar to the interior of proteins in packing density and in the physicochemical nature of the amino acid composition. In this work, we simulated the physicochemical complementarities by constructing three-dimensional probability density maps of non-covalent interacting atoms on the protein surfaces. The interacting probabilities were derived from the interior of known structures. Machine learning algorithms were applied to learn the characteristic patterns of the probability density maps specific to the PPI sites. The trained predictors for PPI sites were cross-validated with the training cases (consisting of 432 proteins and were tested on an independent dataset (consisting of 142 proteins. The residue-based Matthews correlation coefficient for the independent test set was 0.423; the accuracy, precision, sensitivity, specificity were 0.753, 0.519, 0.677, and 0.779 respectively. The benchmark results indicate that the optimized machine learning models are among the best predictors in identifying PPI sites on protein surfaces. In particular, the PPI site prediction accuracy increases with increasing size of the PPI site and with increasing hydrophobicity in amino acid composition of the PPI interface; the core interface regions are more likely to be recognized with high prediction confidence. The results indicate that the physicochemical complementarity patterns on protein surfaces are important determinants in PPIs, and a substantial portion of the PPI sites can be predicted
Chen, Ching-Tai; Peng, Hung-Pin; Jian, Jhih-Wei; Tsai, Keng-Chang; Chang, Jeng-Yih; Yang, Ei-Wen; Chen, Jun-Bo; Ho, Shinn-Ying; Hsu, Wen-Lian; Yang, An-Suei
2012-01-01
Protein-protein interactions are key to many biological processes. Computational methodologies devised to predict protein-protein interaction (PPI) sites on protein surfaces are important tools in providing insights into the biological functions of proteins and in developing therapeutics targeting the protein-protein interaction sites. One of the general features of PPI sites is that the core regions from the two interacting protein surfaces are complementary to each other, similar to the interior of proteins in packing density and in the physicochemical nature of the amino acid composition. In this work, we simulated the physicochemical complementarities by constructing three-dimensional probability density maps of non-covalent interacting atoms on the protein surfaces. The interacting probabilities were derived from the interior of known structures. Machine learning algorithms were applied to learn the characteristic patterns of the probability density maps specific to the PPI sites. The trained predictors for PPI sites were cross-validated with the training cases (consisting of 432 proteins) and were tested on an independent dataset (consisting of 142 proteins). The residue-based Matthews correlation coefficient for the independent test set was 0.423; the accuracy, precision, sensitivity, specificity were 0.753, 0.519, 0.677, and 0.779 respectively. The benchmark results indicate that the optimized machine learning models are among the best predictors in identifying PPI sites on protein surfaces. In particular, the PPI site prediction accuracy increases with increasing size of the PPI site and with increasing hydrophobicity in amino acid composition of the PPI interface; the core interface regions are more likely to be recognized with high prediction confidence. The results indicate that the physicochemical complementarity patterns on protein surfaces are important determinants in PPIs, and a substantial portion of the PPI sites can be predicted correctly with
Coarse-grain modelling of protein-protein interactions
Baaden, Marc; Marrink, Siewert J.
2013-01-01
Here, we review recent advances towards the modelling of protein-protein interactions (PPI) at the coarse-grained (CG) level, a technique that is now widely used to understand protein affinity, aggregation and self-assembly behaviour. PPI models of soluble proteins and membrane proteins are
Protein complex detection in PPI networks based on data integration and supervised learning method.
Yu, Feng; Yang, Zhi; Hu, Xiao; Sun, Yuan; Lin, Hong; Wang, Jian
2015-01-01
Revealing protein complexes are important for understanding principles of cellular organization and function. High-throughput experimental techniques have produced a large amount of protein interactions, which makes it possible to predict protein complexes from protein-protein interaction (PPI) networks. However, the small amount of known physical interactions may limit protein complex detection. The new PPI networks are constructed by integrating PPI datasets with the large and readily available PPI data from biomedical literature, and then the less reliable PPI between two proteins are filtered out based on semantic similarity and topological similarity of the two proteins. Finally, the supervised learning protein complex detection (SLPC), which can make full use of the information of available known complexes, is applied to detect protein complex on the new PPI networks. The experimental results of SLPC on two different categories yeast PPI networks demonstrate effectiveness of the approach: compared with the original PPI networks, the best average improvements of 4.76, 6.81 and 15.75 percentage units in the F-score, accuracy and maximum matching ratio (MMR) are achieved respectively; compared with the denoising PPI networks, the best average improvements of 3.91, 4.61 and 12.10 percentage units in the F-score, accuracy and MMR are achieved respectively; compared with ClusterONE, the start-of the-art complex detection method, on the denoising extended PPI networks, the average improvements of 26.02 and 22.40 percentage units in the F-score and MMR are achieved respectively. The experimental results show that the performances of SLPC have a large improvement through integration of new receivable PPI data from biomedical literature into original PPI networks and denoising PPI networks. In addition, our protein complexes detection method can achieve better performance than ClusterONE.
A Novel Approach for Protein-Named Entity Recognition and Protein-Protein Interaction Extraction
Directory of Open Access Journals (Sweden)
Meijing Li
2015-01-01
Full Text Available Many researchers focus on developing protein-named entity recognition (Protein-NER or PPI extraction systems. However, the studies about these two topics cannot be merged well; then existing PPI extraction systems’ Protein-NER still needs to improve. In this paper, we developed the protein-protein interaction extraction system named PPIMiner based on Support Vector Machine (SVM and parsing tree. PPIMiner consists of three main models: natural language processing (NLP model, Protein-NER model, and PPI discovery model. The Protein-NER model, which is named ProNER, identifies the protein names based on two methods: dictionary-based method and machine learning-based method. ProNER is capable of identifying more proteins than dictionary-based Protein-NER model in other existing systems. The final discovered PPIs extracted via PPI discovery model are represented in detail because we showed the protein interaction types and the occurrence frequency through two different methods. In the experiments, the result shows that the performances achieved by our ProNER and PPI discovery model are better than other existing tools. PPIMiner applied this protein-named entity recognition approach and parsing tree based PPI extraction method to improve the performance of PPI extraction. We also provide an easy-to-use interface to access PPIs database and an online system for PPIs extraction and Protein-NER.
Lam, Winnie W M; Chan, Keith C C
2012-04-01
Protein molecules interact with each other in protein complexes to perform many vital functions, and different computational techniques have been developed to identify protein complexes in protein-protein interaction (PPI) networks. These techniques are developed to search for subgraphs of high connectivity in PPI networks under the assumption that the proteins in a protein complex are highly interconnected. While these techniques have been shown to be quite effective, it is also possible that the matching rate between the protein complexes they discover and those that are previously determined experimentally be relatively low and the "false-alarm" rate can be relatively high. This is especially the case when the assumption of proteins in protein complexes being more highly interconnected be relatively invalid. To increase the matching rate and reduce the false-alarm rate, we have developed a technique that can work effectively without having to make this assumption. The name of the technique called protein complex identification by discovering functional interdependence (PCIFI) searches for protein complexes in PPI networks by taking into consideration both the functional interdependence relationship between protein molecules and the network topology of the network. The PCIFI works in several steps. The first step is to construct a multiple-function protein network graph by labeling each vertex with one or more of the molecular functions it performs. The second step is to filter out protein interactions between protein pairs that are not functionally interdependent of each other in the statistical sense. The third step is to make use of an information-theoretic measure to determine the strength of the functional interdependence between all remaining interacting protein pairs. Finally, the last step is to try to form protein complexes based on the measure of the strength of functional interdependence and the connectivity between proteins. For performance evaluation
Protein complex prediction in large ontology attributed protein-protein interaction networks.
Zhang, Yijia; Lin, Hongfei; Yang, Zhihao; Wang, Jian; Li, Yanpeng; Xu, Bo
2013-01-01
Protein complexes are important for unraveling the secrets of cellular organization and function. Many computational approaches have been developed to predict protein complexes in protein-protein interaction (PPI) networks. However, most existing approaches focus mainly on the topological structure of PPI networks, and largely ignore the gene ontology (GO) annotation information. In this paper, we constructed ontology attributed PPI networks with PPI data and GO resource. After constructing ontology attributed networks, we proposed a novel approach called CSO (clustering based on network structure and ontology attribute similarity). Structural information and GO attribute information are complementary in ontology attributed networks. CSO can effectively take advantage of the correlation between frequent GO annotation sets and the dense subgraph for protein complex prediction. Our proposed CSO approach was applied to four different yeast PPI data sets and predicted many well-known protein complexes. The experimental results showed that CSO was valuable in predicting protein complexes and achieved state-of-the-art performance.
Prediction and characterization of protein-protein interaction networks in swine
Directory of Open Access Journals (Sweden)
Wang Fen
2012-01-01
Full Text Available Abstract Background Studying the large-scale protein-protein interaction (PPI network is important in understanding biological processes. The current research presents the first PPI map of swine, which aims to give new insights into understanding their biological processes. Results We used three methods, Interolog-based prediction of porcine PPI network, domain-motif interactions from structural topology-based prediction of porcine PPI network and motif-motif interactions from structural topology-based prediction of porcine PPI network, to predict porcine protein interactions among 25,767 porcine proteins. We predicted 20,213, 331,484, and 218,705 porcine PPIs respectively, merged the three results into 567,441 PPIs, constructed four PPI networks, and analyzed the topological properties of the porcine PPI networks. Our predictions were validated with Pfam domain annotations and GO annotations. Averages of 70, 10,495, and 863 interactions were related to the Pfam domain-interacting pairs in iPfam database. For comparison, randomized networks were generated, and averages of only 4.24, 66.79, and 44.26 interactions were associated with Pfam domain-interacting pairs in iPfam database. In GO annotations, we found 52.68%, 75.54%, 27.20% of the predicted PPIs sharing GO terms respectively. However, the number of PPI pairs sharing GO terms in the 10,000 randomized networks reached 52.68%, 75.54%, 27.20% is 0. Finally, we determined the accuracy and precision of the methods. The methods yielded accuracies of 0.92, 0.53, and 0.50 at precisions of about 0.93, 0.74, and 0.75, respectively. Conclusion The results reveal that the predicted PPI networks are considerably reliable. The present research is an important pioneering work on protein function research. The porcine PPI data set, the confidence score of each interaction and a list of related data are available at (http://pppid.biositemap.com/.
Analysis of protein-protein interaction networks by means of annotated graph mining algorithms
Rahmani, Hossein
2012-01-01
This thesis discusses solutions to several open problems in Protein-Protein Interaction (PPI) networks with the aid of Knowledge Discovery. PPI networks are usually represented as undirected graphs, with nodes corresponding to proteins and edges representing interactions among protein pairs. A large
Novel Technology for Protein-Protein Interaction-based Targeted Drug Discovery
Directory of Open Access Journals (Sweden)
Jung Me Hwang
2011-12-01
Full Text Available We have developed a simple but highly efficient in-cell protein-protein interaction (PPI discovery system based on the translocation properties of protein kinase C- and its C1a domain in live cells. This system allows the visual detection of trimeric and dimeric protein interactions including cytosolic, nuclear, and/or membrane proteins with their cognate ligands. In addition, this system can be used to identify pharmacological small compounds that inhibit specific PPIs. These properties make this PPI system an attractive tool for screening drug candidates and mapping the protein interactome.
False positive reduction in protein-protein interaction predictions using gene ontology annotations
Directory of Open Access Journals (Sweden)
Lin Yen-Han
2007-07-01
Full Text Available Abstract Background Many crucial cellular operations such as metabolism, signalling, and regulations are based on protein-protein interactions. However, the lack of robust protein-protein interaction information is a challenge. One reason for the lack of solid protein-protein interaction information is poor agreement between experimental findings and computational sets that, in turn, comes from huge false positive predictions in computational approaches. Reduction of false positive predictions and enhancing true positive fraction of computationally predicted protein-protein interaction datasets based on highly confident experimental results has not been adequately investigated. Results Gene Ontology (GO annotations were used to reduce false positive protein-protein interactions (PPI pairs resulting from computational predictions. Using experimentally obtained PPI pairs as a training dataset, eight top-ranking keywords were extracted from GO molecular function annotations. The sensitivity of these keywords is 64.21% in the yeast experimental dataset and 80.83% in the worm experimental dataset. The specificities, a measure of recovery power, of these keywords applied to four predicted PPI datasets for each studied organisms, are 48.32% and 46.49% (by average of four datasets in yeast and worm, respectively. Based on eight top-ranking keywords and co-localization of interacting proteins a set of two knowledge rules were deduced and applied to remove false positive protein pairs. The 'strength', a measure of improvement provided by the rules was defined based on the signal-to-noise ratio and implemented to measure the applicability of knowledge rules applying to the predicted PPI datasets. Depending on the employed PPI-predicting methods, the strength varies between two and ten-fold of randomly removing protein pairs from the datasets. Conclusion Gene Ontology annotations along with the deduced knowledge rules could be implemented to partially
HKC: An Algorithm to Predict Protein Complexes in Protein-Protein Interaction Networks
Directory of Open Access Journals (Sweden)
Xiaomin Wang
2011-01-01
Full Text Available With the availability of more and more genome-scale protein-protein interaction (PPI networks, research interests gradually shift to Systematic Analysis on these large data sets. A key topic is to predict protein complexes in PPI networks by identifying clusters that are densely connected within themselves but sparsely connected with the rest of the network. In this paper, we present a new topology-based algorithm, HKC, to detect protein complexes in genome-scale PPI networks. HKC mainly uses the concepts of highest k-core and cohesion to predict protein complexes by identifying overlapping clusters. The experiments on two data sets and two benchmarks show that our algorithm has relatively high F-measure and exhibits better performance compared with some other methods.
Schlecht, Ulrich; Liu, Zhimin; Blundell, Jamie R; St Onge, Robert P; Levy, Sasha F
2017-05-25
Several large-scale efforts have systematically catalogued protein-protein interactions (PPIs) of a cell in a single environment. However, little is known about how the protein interactome changes across environmental perturbations. Current technologies, which assay one PPI at a time, are too low throughput to make it practical to study protein interactome dynamics. Here, we develop a highly parallel protein-protein interaction sequencing (PPiSeq) platform that uses a novel double barcoding system in conjunction with the dihydrofolate reductase protein-fragment complementation assay in Saccharomyces cerevisiae. PPiSeq detects PPIs at a rate that is on par with current assays and, in contrast with current methods, quantitatively scores PPIs with enough accuracy and sensitivity to detect changes across environments. Both PPI scoring and the bulk of strain construction can be performed with cell pools, making the assay scalable and easily reproduced across environments. PPiSeq is therefore a powerful new tool for large-scale investigations of dynamic PPIs.
The CTD2 Center at Emory has developed a new NanoLuc®-based protein-fragment complementation assay (NanoPCA) which allows the detection of novel protein-protein interactions (PPI). NanoPCA allows the study of PPI dynamics with reversible interactions. Read the abstract. Experimental Approaches Read the detailed Experimetnal Approaches.
Directory of Open Access Journals (Sweden)
Lei Yang
2014-01-01
Full Text Available Cliques (maximal complete subnets in protein-protein interaction (PPI network are an important resource used to analyze protein complexes and functional modules. Clique-based methods of predicting PPI complement the data defection from biological experiments. However, clique-based predicting methods only depend on the topology of network. The false-positive and false-negative interactions in a network usually interfere with prediction. Therefore, we propose a method combining clique-based method of prediction and gene ontology (GO annotations to overcome the shortcoming and improve the accuracy of predictions. According to different GO correcting rules, we generate two predicted interaction sets which guarantee the quality and quantity of predicted protein interactions. The proposed method is applied to the PPI network from the Database of Interacting Proteins (DIP and most of the predicted interactions are verified by another biological database, BioGRID. The predicted protein interactions are appended to the original protein network, which leads to clique extension and shows the significance of biological meaning.
Hayashi, Takanori; Matsuzaki, Yuri; Yanagisawa, Keisuke; Ohue, Masahito; Akiyama, Yutaka
2018-05-08
Protein-protein interactions (PPIs) play several roles in living cells, and computational PPI prediction is a major focus of many researchers. The three-dimensional (3D) structure and binding surface are important for the design of PPI inhibitors. Therefore, rigid body protein-protein docking calculations for two protein structures are expected to allow elucidation of PPIs different from known complexes in terms of 3D structures because known PPI information is not explicitly required. We have developed rapid PPI prediction software based on protein-protein docking, called MEGADOCK. In order to fully utilize the benefits of computational PPI predictions, it is necessary to construct a comprehensive database to gather prediction results and their predicted 3D complex structures and to make them easily accessible. Although several databases exist that provide predicted PPIs, the previous databases do not contain a sufficient number of entries for the purpose of discovering novel PPIs. In this study, we constructed an integrated database of MEGADOCK PPI predictions, named MEGADOCK-Web. MEGADOCK-Web provides more than 10 times the number of PPI predictions than previous databases and enables users to conduct PPI predictions that cannot be found in conventional PPI prediction databases. In MEGADOCK-Web, there are 7528 protein chains and 28,331,628 predicted PPIs from all possible combinations of those proteins. Each protein structure is annotated with PDB ID, chain ID, UniProt AC, related KEGG pathway IDs, and known PPI pairs. Additionally, MEGADOCK-Web provides four powerful functions: 1) searching precalculated PPI predictions, 2) providing annotations for each predicted protein pair with an experimentally known PPI, 3) visualizing candidates that may interact with the query protein on biochemical pathways, and 4) visualizing predicted complex structures through a 3D molecular viewer. MEGADOCK-Web provides a huge amount of comprehensive PPI predictions based on
Directory of Open Access Journals (Sweden)
Xianjun Shen
Full Text Available How to identify protein complex is an important and challenging task in proteomics. It would make great contribution to our knowledge of molecular mechanism in cell life activities. However, the inherent organization and dynamic characteristic of cell system have rarely been incorporated into the existing algorithms for detecting protein complexes because of the limitation of protein-protein interaction (PPI data produced by high throughput techniques. The availability of time course gene expression profile enables us to uncover the dynamics of molecular networks and improve the detection of protein complexes. In order to achieve this goal, this paper proposes a novel algorithm DCA (Dynamic Core-Attachment. It detects protein-complex core comprising of continually expressed and highly connected proteins in dynamic PPI network, and then the protein complex is formed by including the attachments with high adhesion into the core. The integration of core-attachment feature into the dynamic PPI network is responsible for the superiority of our algorithm. DCA has been applied on two different yeast dynamic PPI networks and the experimental results show that it performs significantly better than the state-of-the-art techniques in terms of prediction accuracy, hF-measure and statistical significance in biology. In addition, the identified complexes with strong biological significance provide potential candidate complexes for biologists to validate.
The role of exon shuffling in shaping protein-protein interaction networks
Directory of Open Access Journals (Sweden)
França Gustavo S
2010-12-01
Full Text Available Abstract Background Physical protein-protein interaction (PPI is a critical phenomenon for the function of most proteins in living organisms and a significant fraction of PPIs are the result of domain-domain interactions. Exon shuffling, intron-mediated recombination of exons from existing genes, is known to have been a major mechanism of domain shuffling in metazoans. Thus, we hypothesized that exon shuffling could have a significant influence in shaping the topology of PPI networks. Results We tested our hypothesis by compiling exon shuffling and PPI data from six eukaryotic species: Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Cryptococcus neoformans and Arabidopsis thaliana. For all four metazoan species, genes enriched in exon shuffling events presented on average higher vertex degree (number of interacting partners in PPI networks. Furthermore, we verified that a set of protein domains that are simultaneously promiscuous (known to interact to multiple types of other domains, self-interacting (able to interact with another copy of themselves and abundant in the genomes presents a stronger signal for exon shuffling. Conclusions Exon shuffling appears to have been a recurrent mechanism for the emergence of new PPIs along metazoan evolution. In metazoan genomes, exon shuffling also promoted the expansion of some protein domains. We speculate that their promiscuous and self-interacting properties may have been decisive for that expansion.
Protein-Protein Interactions (PPI) reagents: | Office of Cancer Genomics
The CTD2 Center at Emory University has a library of genes used to study protein-protein interactions in mammalian cells. These genes are cloned in different mammalian expression vectors. A list of available cancer-associated genes can be accessed below.
Protein-Protein Interaction Network and Gene Ontology
Choi, Yunkyu; Kim, Seok; Yi, Gwan-Su; Park, Jinah
Evolution of computer technologies makes it possible to access a large amount and various kinds of biological data via internet such as DNA sequences, proteomics data and information discovered about them. It is expected that the combination of various data could help researchers find further knowledge about them. Roles of a visualization system are to invoke human abilities to integrate information and to recognize certain patterns in the data. Thus, when the various kinds of data are examined and analyzed manually, an effective visualization system is an essential part. One instance of these integrated visualizations can be combination of protein-protein interaction (PPI) data and Gene Ontology (GO) which could help enhance the analysis of PPI network. We introduce a simple but comprehensive visualization system that integrates GO and PPI data where GO and PPI graphs are visualized side-by-side and supports quick reference functions between them. Furthermore, the proposed system provides several interactive visualization methods for efficiently analyzing the PPI network and GO directedacyclic- graph such as context-based browsing and common ancestors finding.
CellMap visualizes protein-protein interactions and subcellular localization
Dallago, Christian; Goldberg, Tatyana; Andrade-Navarro, Miguel Angel; Alanis-Lobato, Gregorio; Rost, Burkhard
2018-01-01
Many tools visualize protein-protein interaction (PPI) networks. The tool introduced here, CellMap, adds one crucial novelty by visualizing PPI networks in the context of subcellular localization, i.e. the location in the cell or cellular component in which a PPI happens. Users can upload images of cells and define areas of interest against which PPIs for selected proteins are displayed (by default on a cartoon of a cell). Annotations of localization are provided by the user or through our in-house database. The visualizer and server are written in JavaScript, making CellMap easy to customize and to extend by researchers and developers. PMID:29497493
Application of Machine Learning Approaches for Protein-protein Interactions Prediction.
Zhang, Mengying; Su, Qiang; Lu, Yi; Zhao, Manman; Niu, Bing
2017-01-01
Proteomics endeavors to study the structures, functions and interactions of proteins. Information of the protein-protein interactions (PPIs) helps to improve our knowledge of the functions and the 3D structures of proteins. Thus determining the PPIs is essential for the study of the proteomics. In this review, in order to study the application of machine learning in predicting PPI, some machine learning approaches such as support vector machine (SVM), artificial neural networks (ANNs) and random forest (RF) were selected, and the examples of its applications in PPIs were listed. SVM and RF are two commonly used methods. Nowadays, more researchers predict PPIs by combining more than two methods. This review presents the application of machine learning approaches in predicting PPI. Many examples of success in identification and prediction in the area of PPI prediction have been discussed, and the PPIs research is still in progress. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Unified Alignment of Protein-Protein Interaction Networks.
Malod-Dognin, Noël; Ban, Kristina; Pržulj, Nataša
2017-04-19
Paralleling the increasing availability of protein-protein interaction (PPI) network data, several network alignment methods have been proposed. Network alignments have been used to uncover functionally conserved network parts and to transfer annotations. However, due to the computational intractability of the network alignment problem, aligners are heuristics providing divergent solutions and no consensus exists on a gold standard, or which scoring scheme should be used to evaluate them. We comprehensively evaluate the alignment scoring schemes and global network aligners on large scale PPI data and observe that three methods, HUBALIGN, L-GRAAL and NATALIE, regularly produce the most topologically and biologically coherent alignments. We study the collective behaviour of network aligners and observe that PPI networks are almost entirely aligned with a handful of aligners that we unify into a new tool, Ulign. Ulign enables complete alignment of two networks, which traditional global and local aligners fail to do. Also, multiple mappings of Ulign define biologically relevant soft clusterings of proteins in PPI networks, which may be used for refining the transfer of annotations across networks. Hence, PPI networks are already well investigated by current aligners, so to gain additional biological insights, a paradigm shift is needed. We propose such a shift come from aligning all available data types collectively rather than any particular data type in isolation from others.
Dr. PIAS: an integrative system for assessing the druggability of protein-protein interactions
Directory of Open Access Journals (Sweden)
Furuya Toshio
2011-02-01
Full Text Available Abstract Background The amount of data on protein-protein interactions (PPIs available in public databases and in the literature has rapidly expanded in recent years. PPI data can provide useful information for researchers in pharmacology and medicine as well as those in interactome studies. There is urgent need for a novel methodology or software allowing the efficient utilization of PPI data in pharmacology and medicine. Results To address this need, we have developed the 'Druggable Protein-protein Interaction Assessment System' (Dr. PIAS. Dr. PIAS has a meta-database that stores various types of information (tertiary structures, drugs/chemicals, and biological functions associated with PPIs retrieved from public sources. By integrating this information, Dr. PIAS assesses whether a PPI is druggable as a target for small chemical ligands by using a supervised machine-learning method, support vector machine (SVM. Dr. PIAS holds not only known druggable PPIs but also all PPIs of human, mouse, rat, and human immunodeficiency virus (HIV proteins identified to date. Conclusions The design concept of Dr. PIAS is distinct from other published PPI databases in that it focuses on selecting the PPIs most likely to make good drug targets, rather than merely collecting PPI data.
Protein-Protein Interaction Reagents | Office of Cancer Genomics
The CTD2 Center at Emory University has a library of genes used to study protein-protein interactions in mammalian cells. These genes are cloned in different mammalian expression vectors. A list of available cancer-associated genes can be accessed below. Emory_CTD^2_PPI_Reagents.xlsx Contact: Haian Fu
Understanding Protein-Protein Interactions Using Local Structural Features
DEFF Research Database (Denmark)
Planas-Iglesias, Joan; Bonet, Jaume; García-García, Javier
2013-01-01
Protein-protein interactions (PPIs) play a relevant role among the different functions of a cell. Identifying the PPI network of a given organism (interactome) is useful to shed light on the key molecular mechanisms within a biological system. In this work, we show the role of structural features...... interacting and non-interacting protein pairs to classify the structural features that sustain the binding (or non-binding) behavior. Our study indicates that not only the interacting region but also the rest of the protein surface are important for the interaction fate. The interpretation...... to score the likelihood of the interaction between two proteins and to develop a method for the prediction of PPIs. We have tested our method on several sets with unbalanced ratios of interactions and non-interactions to simulate real conditions, obtaining accuracies higher than 25% in the most unfavorable...
msiDBN: A Method of Identifying Critical Proteins in Dynamic PPI Networks
Directory of Open Access Journals (Sweden)
Yuan Zhang
2014-01-01
Full Text Available Dynamics of protein-protein interactions (PPIs reveals the recondite principles of biological processes inside a cell. Shown in a wealth of study, just a small group of proteins, rather than the majority, play more essential roles at crucial points of biological processes. This present work focuses on identifying these critical proteins exhibiting dramatic structural changes in dynamic PPI networks. First, a comprehensive way of modeling the dynamic PPIs is presented which simultaneously analyzes the activity of proteins and assembles the dynamic coregulation correlation between proteins at each time point. Second, a novel method is proposed, named msiDBN, which models a common representation of multiple PPI networks using a deep belief network framework and analyzes the reconstruction errors and the variabilities across the time courses in the biological process. Experiments were implemented on data of yeast cell cycles. We evaluated our network construction method by comparing the functional representations of the derived networks with two other traditional construction methods. The ranking results of critical proteins in msiDBN were compared with the results from the baseline methods. The results of comparison showed that msiDBN had better reconstruction rate and identified more proteins of critical value to yeast cell cycle process.
A domain-based approach to predict protein-protein interactions
Directory of Open Access Journals (Sweden)
Resat Haluk
2007-06-01
Full Text Available Abstract Background Knowing which proteins exist in a certain organism or cell type and how these proteins interact with each other are necessary for the understanding of biological processes at the whole cell level. The determination of the protein-protein interaction (PPI networks has been the subject of extensive research. Despite the development of reasonably successful methods, serious technical difficulties still exist. In this paper we present DomainGA, a quantitative computational approach that uses the information about the domain-domain interactions to predict the interactions between proteins. Results DomainGA is a multi-parameter optimization method in which the available PPI information is used to derive a quantitative scoring scheme for the domain-domain pairs. Obtained domain interaction scores are then used to predict whether a pair of proteins interacts. Using the yeast PPI data and a series of tests, we show the robustness and insensitivity of the DomainGA method to the selection of the parameter sets, score ranges, and detection rules. Our DomainGA method achieves very high explanation ratios for the positive and negative PPIs in yeast. Based on our cross-verification tests on human PPIs, comparison of the optimized scores with the structurally observed domain interactions obtained from the iPFAM database, and sensitivity and specificity analysis; we conclude that our DomainGA method shows great promise to be applicable across multiple organisms. Conclusion We envision the DomainGA as a first step of a multiple tier approach to constructing organism specific PPIs. As it is based on fundamental structural information, the DomainGA approach can be used to create potential PPIs and the accuracy of the constructed interaction template can be further improved using complementary methods. Explanation ratios obtained in the reported test case studies clearly show that the false prediction rates of the template networks constructed
Targeting protein-protein interactions for parasite control.
Directory of Open Access Journals (Sweden)
Christina M Taylor
2011-04-01
Full Text Available Finding new drug targets for pathogenic infections would be of great utility for humanity, as there is a large need to develop new drugs to fight infections due to the developing resistance and side effects of current treatments. Current drug targets for pathogen infections involve only a single protein. However, proteins rarely act in isolation, and the majority of biological processes occur via interactions with other proteins, so protein-protein interactions (PPIs offer a realm of unexplored potential drug targets and are thought to be the next-generation of drug targets. Parasitic worms were chosen for this study because they have deleterious effects on human health, livestock, and plants, costing society billions of dollars annually and many sequenced genomes are available. In this study, we present a computational approach that utilizes whole genomes of 6 parasitic and 1 free-living worm species and 2 hosts. The species were placed in orthologous groups, then binned in species-specific orthologous groups. Proteins that are essential and conserved among species that span a phyla are of greatest value, as they provide foundations for developing broad-control strategies. Two PPI databases were used to find PPIs within the species specific bins. PPIs with unique helminth proteins and helminth proteins with unique features relative to the host, such as indels, were prioritized as drug targets. The PPIs were scored based on RNAi phenotype and homology to the PDB (Protein DataBank. EST data for the various life stages, GO annotation, and druggability were also taken into consideration. Several PPIs emerged from this study as potential drug targets. A few interactions were supported by co-localization of expression in M. incognita (plant parasite and B. malayi (H. sapiens parasite, which have extremely different modes of parasitism. As more genomes of pathogens are sequenced and PPI databases expanded, this methodology will become increasingly
Li, Jian-Feng; Bush, Jenifer; Xiong, Yan; Li, Lei; McCormack, Matthew
2011-01-01
Protein-protein interactions (PPIs) constitute the regulatory network that coordinates diverse cellular functions. There are growing needs in plant research for creating protein interaction maps behind complex cellular processes and at a systems biology level. However, only a few approaches have been successfully used for large-scale surveys of PPIs in plants, each having advantages and disadvantages. Here we present split firefly luciferase complementation (SFLC) as a highly sensitive and noninvasive technique for in planta PPI investigation. In this assay, the separate halves of a firefly luciferase can come into close proximity and transiently restore its catalytic activity only when their fusion partners, namely the two proteins of interest, interact with each other. This assay was conferred with quantitativeness and high throughput potential when the Arabidopsis mesophyll protoplast system and a microplate luminometer were employed for protein expression and luciferase measurement, respectively. Using the SFLC assay, we could monitor the dynamics of rapamycin-induced and ascomycin-disrupted interaction between Arabidopsis FRB and human FKBP proteins in a near real-time manner. As a proof of concept for large-scale PPI survey, we further applied the SFLC assay to testing 132 binary PPIs among 8 auxin response factors (ARFs) and 12 Aux/IAA proteins from Arabidopsis. Our results demonstrated that the SFLC assay is ideal for in vivo quantitative PPI analysis in plant cells and is particularly powerful for large-scale binary PPI screens.
Directory of Open Access Journals (Sweden)
Jian-Feng Li
Full Text Available Protein-protein interactions (PPIs constitute the regulatory network that coordinates diverse cellular functions. There are growing needs in plant research for creating protein interaction maps behind complex cellular processes and at a systems biology level. However, only a few approaches have been successfully used for large-scale surveys of PPIs in plants, each having advantages and disadvantages. Here we present split firefly luciferase complementation (SFLC as a highly sensitive and noninvasive technique for in planta PPI investigation. In this assay, the separate halves of a firefly luciferase can come into close proximity and transiently restore its catalytic activity only when their fusion partners, namely the two proteins of interest, interact with each other. This assay was conferred with quantitativeness and high throughput potential when the Arabidopsis mesophyll protoplast system and a microplate luminometer were employed for protein expression and luciferase measurement, respectively. Using the SFLC assay, we could monitor the dynamics of rapamycin-induced and ascomycin-disrupted interaction between Arabidopsis FRB and human FKBP proteins in a near real-time manner. As a proof of concept for large-scale PPI survey, we further applied the SFLC assay to testing 132 binary PPIs among 8 auxin response factors (ARFs and 12 Aux/IAA proteins from Arabidopsis. Our results demonstrated that the SFLC assay is ideal for in vivo quantitative PPI analysis in plant cells and is particularly powerful for large-scale binary PPI screens.
Sequence-based prediction of protein protein interaction using a deep-learning algorithm.
Sun, Tanlin; Zhou, Bo; Lai, Luhua; Pei, Jianfeng
2017-05-25
Protein-protein interactions (PPIs) are critical for many biological processes. It is therefore important to develop accurate high-throughput methods for identifying PPI to better understand protein function, disease occurrence, and therapy design. Though various computational methods for predicting PPI have been developed, their robustness for prediction with external datasets is unknown. Deep-learning algorithms have achieved successful results in diverse areas, but their effectiveness for PPI prediction has not been tested. We used a stacked autoencoder, a type of deep-learning algorithm, to study the sequence-based PPI prediction. The best model achieved an average accuracy of 97.19% with 10-fold cross-validation. The prediction accuracies for various external datasets ranged from 87.99% to 99.21%, which are superior to those achieved with previous methods. To our knowledge, this research is the first to apply a deep-learning algorithm to sequence-based PPI prediction, and the results demonstrate its potential in this field.
Jelínek, Jan; Škoda, Petr; Hoksza, David
2017-12-06
Protein-protein interactions (PPI) play a key role in an investigation of various biochemical processes, and their identification is thus of great importance. Although computational prediction of which amino acids take part in a PPI has been an active field of research for some time, the quality of in-silico methods is still far from perfect. We have developed a novel prediction method called INSPiRE which benefits from a knowledge base built from data available in Protein Data Bank. All proteins involved in PPIs were converted into labeled graphs with nodes corresponding to amino acids and edges to pairs of neighboring amino acids. A structural neighborhood of each node was then encoded into a bit string and stored in the knowledge base. When predicting PPIs, INSPiRE labels amino acids of unknown proteins as interface or non-interface based on how often their structural neighborhood appears as interface or non-interface in the knowledge base. We evaluated INSPiRE's behavior with respect to different types and sizes of the structural neighborhood. Furthermore, we examined the suitability of several different features for labeling the nodes. Our evaluations showed that INSPiRE clearly outperforms existing methods with respect to Matthews correlation coefficient. In this paper we introduce a new knowledge-based method for identification of protein-protein interaction sites called INSPiRE. Its knowledge base utilizes structural patterns of known interaction sites in the Protein Data Bank which are then used for PPI prediction. Extensive experiments on several well-established datasets show that INSPiRE significantly surpasses existing PPI approaches.
Efficient prediction of human protein-protein interactions at a global scale.
Schoenrock, Andrew; Samanfar, Bahram; Pitre, Sylvain; Hooshyar, Mohsen; Jin, Ke; Phillips, Charles A; Wang, Hui; Phanse, Sadhna; Omidi, Katayoun; Gui, Yuan; Alamgir, Md; Wong, Alex; Barrenäs, Fredrik; Babu, Mohan; Benson, Mikael; Langston, Michael A; Green, James R; Dehne, Frank; Golshani, Ashkan
2014-12-10
Our knowledge of global protein-protein interaction (PPI) networks in complex organisms such as humans is hindered by technical limitations of current methods. On the basis of short co-occurring polypeptide regions, we developed a tool called MP-PIPE capable of predicting a global human PPI network within 3 months. With a recall of 23% at a precision of 82.1%, we predicted 172,132 putative PPIs. We demonstrate the usefulness of these predictions through a range of experiments. The speed and accuracy associated with MP-PIPE can make this a potential tool to study individual human PPI networks (from genomic sequences alone) for personalized medicine.
Inferring high-confidence human protein-protein interactions
Directory of Open Access Journals (Sweden)
Yu Xueping
2012-05-01
Full Text Available Abstract Background As numerous experimental factors drive the acquisition, identification, and interpretation of protein-protein interactions (PPIs, aggregated assemblies of human PPI data invariably contain experiment-dependent noise. Ascertaining the reliability of PPIs collected from these diverse studies and scoring them to infer high-confidence networks is a non-trivial task. Moreover, a large number of PPIs share the same number of reported occurrences, making it impossible to distinguish the reliability of these PPIs and rank-order them. For example, for the data analyzed here, we found that the majority (>83% of currently available human PPIs have been reported only once. Results In this work, we proposed an unsupervised statistical approach to score a set of diverse, experimentally identified PPIs from nine primary databases to create subsets of high-confidence human PPI networks. We evaluated this ranking method by comparing it with other methods and assessing their ability to retrieve protein associations from a number of diverse and independent reference sets. These reference sets contain known biological data that are either directly or indirectly linked to interactions between proteins. We quantified the average effect of using ranked protein interaction data to retrieve this information and showed that, when compared to randomly ranked interaction data sets, the proposed method created a larger enrichment (~134% than either ranking based on the hypergeometric test (~109% or occurrence ranking (~46%. Conclusions From our evaluations, it was clear that ranked interactions were always of value because higher-ranked PPIs had a higher likelihood of retrieving high-confidence experimental data. Reducing the noise inherent in aggregated experimental PPIs via our ranking scheme further increased the accuracy and enrichment of PPIs derived from a number of biologically relevant data sets. These results suggest that using our high
Feature generation and representations for protein-protein interaction classification.
Lan, Man; Tan, Chew Lim; Su, Jian
2009-10-01
Automatic detecting protein-protein interaction (PPI) relevant articles is a crucial step for large-scale biological database curation. The previous work adopted POS tagging, shallow parsing and sentence splitting techniques, but they achieved worse performance than the simple bag-of-words representation. In this paper, we generated and investigated multiple types of feature representations in order to further improve the performance of PPI text classification task. Besides the traditional domain-independent bag-of-words approach and the term weighting methods, we also explored other domain-dependent features, i.e. protein-protein interaction trigger keywords, protein named entities and the advanced ways of incorporating Natural Language Processing (NLP) output. The integration of these multiple features has been evaluated on the BioCreAtIvE II corpus. The experimental results showed that both the advanced way of using NLP output and the integration of bag-of-words and NLP output improved the performance of text classification. Specifically, in comparison with the best performance achieved in the BioCreAtIvE II IAS, the feature-level and classifier-level integration of multiple features improved the performance of classification 2.71% and 3.95%, respectively.
Directory of Open Access Journals (Sweden)
Dimitrakopoulou Konstantina
2007-10-01
Full Text Available Abstract Background Nowadays modern biology aims at unravelling the strands of complex biological structures such as the protein-protein interaction (PPI networks. A key concept in the organization of PPI networks is the existence of dense subnetworks (functional modules in them. In recent approaches clustering algorithms were applied at these networks and the resulting subnetworks were evaluated by estimating the coverage of well-established protein complexes they contained. However, most of these algorithms elaborate on an unweighted graph structure which in turn fails to elevate those interactions that would contribute to the construction of biologically more valid and coherent functional modules. Results In the current study, we present a method that corroborates the integration of protein interaction and microarray data via the discovery of biologically valid functional modules. Initially the gene expression information is overlaid as weights onto the PPI network and the enriched PPI graph allows us to exploit its topological aspects, while simultaneously highlights enhanced functional association in specific pairs of proteins. Then we present an algorithm that unveils the functional modules of the weighted graph by expanding a kernel protein set, which originates from a given 'seed' protein used as starting-point. Conclusion The integrated data and the concept of our approach provide reliable functional modules. We give proofs based on yeast data that our method manages to give accurate results in terms both of structural coherency, as well as functional consistency.
HCVpro: Hepatitis C virus protein interaction database
Kwofie, Samuel K.; Schaefer, Ulf; Sundararajan, Vijayaraghava Seshadri; Bajic, Vladimir B.; Christoffels, Alan G.
2011-01-01
It is essential to catalog characterized hepatitis C virus (HCV) protein-protein interaction (PPI) data and the associated plethora of vital functional information to augment the search for therapies, vaccines and diagnostic biomarkers
Hot spot-based design of small-molecule inhibitors for protein-protein interactions.
Guo, Wenxing; Wisniewski, John A; Ji, Haitao
2014-06-01
Protein-protein interactions (PPIs) are important targets for the development of chemical probes and therapeutic agents. From the initial discovery of the existence of hot spots at PPI interfaces, it has been proposed that hot spots might provide the key for developing small-molecule PPI inhibitors. However, there has been no review on the ways in which the knowledge of hot spots can be used to achieve inhibitor design, nor critical examination of successful examples. This Digest discusses the characteristics of hot spots and the identification of druggable hot spot pockets. An analysis of four examples of hot spot-based design reveals the importance of this strategy in discovering potent and selective PPI inhibitors. A general procedure for hot spot-based design of PPI inhibitors is outlined. Copyright © 2014 Elsevier Ltd. All rights reserved.
Agarwal, Shashank; Liu, Feifan; Yu, Hong
2011-10-03
Protein-protein interaction (PPI) is an important biomedical phenomenon. Automatically detecting PPI-relevant articles and identifying methods that are used to study PPI are important text mining tasks. In this study, we have explored domain independent features to develop two open source machine learning frameworks. One performs binary classification to determine whether the given article is PPI relevant or not, named "Simple Classifier", and the other one maps the PPI relevant articles with corresponding interaction method nodes in a standardized PSI-MI (Proteomics Standards Initiative-Molecular Interactions) ontology, named "OntoNorm". We evaluated our system in the context of BioCreative challenge competition using the standardized data set. Our systems are amongst the top systems reported by the organizers, attaining 60.8% F1-score for identifying relevant documents, and 52.3% F1-score for mapping articles to interaction method ontology. Our results show that domain-independent machine learning frameworks can perform competitively well at the tasks of detecting PPI relevant articles and identifying the methods that were used to study the interaction in such articles. Simple Classifier is available at http://sourceforge.net/p/simpleclassify/home/ and OntoNorm at http://sourceforge.net/p/ontonorm/home/.
Data management of protein interaction networks
Cannataro, Mario
2012-01-01
Interactomics: a complete survey from data generation to knowledge extraction With the increasing use of high-throughput experimental assays, more and more protein interaction databases are becoming available. As a result, computational analysis of protein-to-protein interaction (PPI) data and networks, now known as interactomics, has become an essential tool to determine functionally associated proteins. From wet lab technologies to data management to knowledge extraction, this timely book guides readers through the new science of interactomics, giving them the tools needed to: Generate
Song, Xiao; Lu, Lu-Yi; Passioura, Toby; Suga, Hiroaki
2017-06-21
Ebola virus infection leads to severe hemorrhagic fever in human and non-human primates with an average case fatality rate of 50%. To date, numerous potential therapies are in development, but FDA-approved drugs or vaccines are yet unavailable. Ebola viral protein 24 (VP24) is a multifunctional protein that plays critical roles in the pathogenesis of Ebola virus infection, e.g. innate immune suppression by blocking the interaction between KPNA and PY-STAT1. Here we report macrocyclic peptide inhibitors of the VP24-KPNA5 protein-protein interaction (PPI) by means of the RaPID (Random non-standard Peptides Integrated Discovery) system. These macrocyclic peptides showed remarkably high affinity to recombinant Zaire Ebola virus VP24 (eVP24), with a dissociation constant in the single digit nanomolar range, and could also successfully disrupt the eVP24-KPNA interaction. This work provides for the first time a chemical probe capable of modulating this PPI interaction and is the starting point for the development of unique anti-viral drugs against the Ebola virus.
Detection of protein complex from protein-protein interaction network using Markov clustering
International Nuclear Information System (INIS)
Ochieng, P J; Kusuma, W A; Haryanto, T
2017-01-01
Detection of complexes, or groups of functionally related proteins, is an important challenge while analysing biological networks. However, existing algorithms to identify protein complexes are insufficient when applied to dense networks of experimentally derived interaction data. Therefore, we introduced a graph clustering method based on Markov clustering algorithm to identify protein complex within highly interconnected protein-protein interaction networks. Protein-protein interaction network was first constructed to develop geometrical network, the network was then partitioned using Markov clustering to detect protein complexes. The interest of the proposed method was illustrated by its application to Human Proteins associated to type II diabetes mellitus. Flow simulation of MCL algorithm was initially performed and topological properties of the resultant network were analysed for detection of the protein complex. The results indicated the proposed method successfully detect an overall of 34 complexes with 11 complexes consisting of overlapping modules and 20 non-overlapping modules. The major complex consisted of 102 proteins and 521 interactions with cluster modularity and density of 0.745 and 0.101 respectively. The comparison analysis revealed MCL out perform AP, MCODE and SCPS algorithms with high clustering coefficient (0.751) network density and modularity index (0.630). This demonstrated MCL was the most reliable and efficient graph clustering algorithm for detection of protein complexes from PPI networks. (paper)
Directory of Open Access Journals (Sweden)
Zimmer Ralf
2007-08-01
Full Text Available Abstract Background The existence of negative correlations between degrees of interacting proteins is being discussed since such negative degree correlations were found for the large-scale yeast protein-protein interaction (PPI network of Ito et al. More recent studies observed no such negative correlations for high-confidence interaction sets. In this article, we analyzed a range of experimentally derived interaction networks to understand the role and prevalence of degree correlations in PPI networks. We investigated how degree correlations influence the structure of networks and their tolerance against perturbations such as the targeted deletion of hubs. Results For each PPI network, we simulated uncorrelated, positively and negatively correlated reference networks. Here, a simple model was developed which can create different types of degree correlations in a network without changing the degree distribution. Differences in static properties associated with degree correlations were compared by analyzing the network characteristics of the original PPI and reference networks. Dynamics were compared by simulating the effect of a selective deletion of hubs in all networks. Conclusion Considerable differences between the network types were found for the number of components in the original networks. Negatively correlated networks are fragmented into significantly less components than observed for positively correlated networks. On the other hand, the selective deletion of hubs showed an increased structural tolerance to these deletions for the positively correlated networks. This results in a lower rate of interaction loss in these networks compared to the negatively correlated networks and a decreased disintegration rate. Interestingly, real PPI networks are most similar to the randomly correlated references with respect to all properties analyzed. Thus, although structural properties of networks can be modified considerably by degree
PIE the search: searching PubMed literature for protein interaction information.
Kim, Sun; Kwon, Dongseop; Shin, Soo-Yong; Wilbur, W John
2012-02-15
Finding protein-protein interaction (PPI) information from literature is challenging but an important issue. However, keyword search in PubMed(®) is often time consuming because it requires a series of actions that refine keywords and browse search results until it reaches a goal. Due to the rapid growth of biomedical literature, it has become more difficult for biologists and curators to locate PPI information quickly. Therefore, a tool for prioritizing PPI informative articles can be a useful assistant for finding this PPI-relevant information. PIE (Protein Interaction information Extraction) the search is a web service implementing a competition-winning approach utilizing word and syntactic analyses by machine learning techniques. For easy user access, PIE the search provides a PubMed-like search environment, but the output is the list of articles prioritized by PPI confidence scores. By obtaining PPI-related articles at high rank, researchers can more easily find the up-to-date PPI information, which cannot be found in manually curated PPI databases. http://www.ncbi.nlm.nih.gov/IRET/PIE/.
Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming
2015-01-01
In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.
Discovering disease-associated genes in weighted protein-protein interaction networks
Cui, Ying; Cai, Meng; Stanley, H. Eugene
2018-04-01
Although there have been many network-based attempts to discover disease-associated genes, most of them have not taken edge weight - which quantifies their relative strength - into consideration. We use connection weights in a protein-protein interaction (PPI) network to locate disease-related genes. We analyze the topological properties of both weighted and unweighted PPI networks and design an improved random forest classifier to distinguish disease genes from non-disease genes. We use a cross-validation test to confirm that weighted networks are better able to discover disease-associated genes than unweighted networks, which indicates that including link weight in the analysis of network properties provides a better model of complex genotype-phenotype associations.
An ensemble self-training protein interaction article classifier.
Chen, Yifei; Hou, Ping; Manderick, Bernard
2014-01-01
Protein-protein interaction (PPI) is essential to understand the fundamental processes governing cell biology. The mining and curation of PPI knowledge are critical for analyzing proteomics data. Hence it is desired to classify articles PPI-related or not automatically. In order to build interaction article classification systems, an annotated corpus is needed. However, it is usually the case that only a small number of labeled articles can be obtained manually. Meanwhile, a large number of unlabeled articles are available. By combining ensemble learning and semi-supervised self-training, an ensemble self-training interaction classifier called EST_IACer is designed to classify PPI-related articles based on a small number of labeled articles and a large number of unlabeled articles. A biological background based feature weighting strategy is extended using the category information from both labeled and unlabeled data. Moreover, a heuristic constraint is put forward to select optimal instances from unlabeled data to improve the performance further. Experiment results show that the EST_IACer can classify the PPI related articles effectively and efficiently.
Li, Chuanxi; Chen, Peng; Wang, Rujing; Wang, Xiujie; Su, Yaru; Li, Jinyan
2014-01-01
Mining Protein-Protein Interactions (PPIs) from the fast-growing biomedical literature resources has been proven as an effective approach for the identifi cation of biological regulatory networks. This paper presents a novel method based on the idea
A coevolution analysis for identifying protein-protein interactions by Fourier transform
Yin, Changchuan; Yau, Stephen S. -T.
2017-01-01
Protein-protein interactions (PPIs) play key roles in life processes, such as signal transduction, transcription regulations, and immune response, etc. Identification of PPIs enables better understanding of the functional networks within a cell. Common experimental methods for identifying PPIs are time consuming and expensive. However, recent developments in computational approaches for inferring PPIs from protein sequences based on coevolution theory avoid these problems. In the coevolution theory model, interacted proteins may show coevolutionary mutations and have similar phylogenetic trees. The existing coevolution methods depend on multiple sequence alignments (MSA); however, the MSA-based coevolution methods often produce high false positive interactions. In this paper, we present a computational method using an alignment-free approach to accurately detect PPIs and reduce false positives. In the method, protein sequences are numerically represented by biochemical properties of amino acids, which reflect the structural and functional differences of proteins. Fourier transform is applied to the numerical representation of protein sequences to capture the dissimilarities of protein sequences in biophysical context. The method is assessed for predicting PPIs in Ebola virus. The results indicate strong coevolution between the protein pairs (NP-VP24, NP-VP30, NP-VP40, VP24-VP30, VP24-VP40, and VP30-VP40). The method is also validated for PPIs in influenza and E.coli genomes. Since our method can reduce false positive and increase the specificity of PPI prediction, it offers an effective tool to understand mechanisms of disease pathogens and find potential targets for drug design. The Python programs in this study are available to public at URL (https://github.com/cyinbox/PPI). PMID:28430779
A coevolution analysis for identifying protein-protein interactions by Fourier transform.
Directory of Open Access Journals (Sweden)
Changchuan Yin
Full Text Available Protein-protein interactions (PPIs play key roles in life processes, such as signal transduction, transcription regulations, and immune response, etc. Identification of PPIs enables better understanding of the functional networks within a cell. Common experimental methods for identifying PPIs are time consuming and expensive. However, recent developments in computational approaches for inferring PPIs from protein sequences based on coevolution theory avoid these problems. In the coevolution theory model, interacted proteins may show coevolutionary mutations and have similar phylogenetic trees. The existing coevolution methods depend on multiple sequence alignments (MSA; however, the MSA-based coevolution methods often produce high false positive interactions. In this paper, we present a computational method using an alignment-free approach to accurately detect PPIs and reduce false positives. In the method, protein sequences are numerically represented by biochemical properties of amino acids, which reflect the structural and functional differences of proteins. Fourier transform is applied to the numerical representation of protein sequences to capture the dissimilarities of protein sequences in biophysical context. The method is assessed for predicting PPIs in Ebola virus. The results indicate strong coevolution between the protein pairs (NP-VP24, NP-VP30, NP-VP40, VP24-VP30, VP24-VP40, and VP30-VP40. The method is also validated for PPIs in influenza and E.coli genomes. Since our method can reduce false positive and increase the specificity of PPI prediction, it offers an effective tool to understand mechanisms of disease pathogens and find potential targets for drug design. The Python programs in this study are available to public at URL (https://github.com/cyinbox/PPI.
Zhou, Hufeng; Gao, Shangzhi; Nguyen, Nam Ninh; Fan, Mengyuan; Jin, Jingjing; Liu, Bing; Zhao, Liang; Xiong, Geng; Tan, Min; Li, Shijun; Wong, Limsoon
2014-04-08
H. sapiens-M. tuberculosis H37Rv protein-protein interaction (PPI) data are essential for understanding the infection mechanism of the formidable pathogen M. tuberculosis H37Rv. Computational prediction is an important strategy to fill the gap in experimental H. sapiens-M. tuberculosis H37Rv PPI data. Homology-based prediction is frequently used in predicting both intra-species and inter-species PPIs. However, some limitations are not properly resolved in several published works that predict eukaryote-prokaryote inter-species PPIs using intra-species template PPIs. We develop a stringent homology-based prediction approach by taking into account (i) differences between eukaryotic and prokaryotic proteins and (ii) differences between inter-species and intra-species PPI interfaces. We compare our stringent homology-based approach to a conventional homology-based approach for predicting host-pathogen PPIs, based on cellular compartment distribution analysis, disease gene list enrichment analysis, pathway enrichment analysis and functional category enrichment analysis. These analyses support the validity of our prediction result, and clearly show that our approach has better performance in predicting H. sapiens-M. tuberculosis H37Rv PPIs. Using our stringent homology-based approach, we have predicted a set of highly plausible H. sapiens-M. tuberculosis H37Rv PPIs which might be useful for many of related studies. Based on our analysis of the H. sapiens-M. tuberculosis H37Rv PPI network predicted by our stringent homology-based approach, we have discovered several interesting properties which are reported here for the first time. We find that both host proteins and pathogen proteins involved in the host-pathogen PPIs tend to be hubs in their own intra-species PPI network. Also, both host and pathogen proteins involved in host-pathogen PPIs tend to have longer primary sequence, tend to have more domains, tend to be more hydrophilic, etc. And the protein domains from both
Theofilatos, Konstantinos; Pavlopoulou, Niki; Papasavvas, Christoforos; Likothanassis, Spiros; Dimitrakopoulos, Christos; Georgopoulos, Efstratios; Moschopoulos, Charalampos; Mavroudi, Seferina
2015-03-01
Proteins are considered to be the most important individual components of biological systems and they combine to form physical protein complexes which are responsible for certain molecular functions. Despite the large availability of protein-protein interaction (PPI) information, not much information is available about protein complexes. Experimental methods are limited in terms of time, efficiency, cost and performance constraints. Existing computational methods have provided encouraging preliminary results, but they phase certain disadvantages as they require parameter tuning, some of them cannot handle weighted PPI data and others do not allow a protein to participate in more than one protein complex. In the present paper, we propose a new fully unsupervised methodology for predicting protein complexes from weighted PPI graphs. The proposed methodology is called evolutionary enhanced Markov clustering (EE-MC) and it is a hybrid combination of an adaptive evolutionary algorithm and a state-of-the-art clustering algorithm named enhanced Markov clustering. EE-MC was compared with state-of-the-art methodologies when applied to datasets from the human and the yeast Saccharomyces cerevisiae organisms. Using public available datasets, EE-MC outperformed existing methodologies (in some datasets the separation metric was increased by 10-20%). Moreover, when applied to new human datasets its performance was encouraging in the prediction of protein complexes which consist of proteins with high functional similarity. In specific, 5737 protein complexes were predicted and 72.58% of them are enriched for at least one gene ontology (GO) function term. EE-MC is by design able to overcome intrinsic limitations of existing methodologies such as their inability to handle weighted PPI networks, their constraint to assign every protein in exactly one cluster and the difficulties they face concerning the parameter tuning. This fact was experimentally validated and moreover, new
Directory of Open Access Journals (Sweden)
Yao-Cheng Li
2014-12-01
Full Text Available Summary: Protein-protein interactions (PPIs play central roles in orchestrating biological processes. While some PPIs are stable, many important ones are transient and hard to detect with conventional approaches. We developed ReBiL, a recombinase enhanced bimolecular luciferase complementation platform, to enable detection of weak PPIs in living cells. ReBiL readily identified challenging transient interactions between an E3 ubiquitin ligase and an E2 ubiquitin-conjugating enzyme. ReBiL’s ability to rapidly interrogate PPIs in diverse conditions revealed that some stapled α-helical peptides, a class of PPI antagonists, induce target-independent cytosolic leakage and cytotoxicity that is antagonized by serum. These results explain the requirement for serum-free conditions to detect stapled peptide activity, and define a required parameter to evaluate for peptide antagonist approaches. ReBiL’s ability to expedite PPI analysis, assess target specificity and cell permeability, and reveal off-target effects of PPI modifiers should facilitate the development of effective, cell-permeable PPI therapeutics and the elaboration of diverse biological mechanisms. : Li et al. developed a recombinase-enhanced bimolecular luciferase complementation platform, termed ReBiL, to evaluate low-affinity protein-protein interactions (PPIs that are not detectable by other methods and to analyze PPI antagonists in living cells. ReBiL showed that small-molecule p53-Mdm2 antagonists disrupt their intended targets effectively in cells, whereas stapled peptides did not. Stapled peptides unexpectedly induced cell membrane disruption resulting in p53-independent death associated with cytoplasmic leakage. ReBiL is also valuable for high-throughput screening and for deciphering signaling mechanisms mediated by protein interactions.
Jeong, Hyundoo; Qian, Xiaoning; Yoon, Byung-Jun
2016-10-06
Comparative analysis of protein-protein interaction (PPI) networks provides an effective means of detecting conserved functional network modules across different species. Such modules typically consist of orthologous proteins with conserved interactions, which can be exploited to computationally predict the modules through network comparison. In this work, we propose a novel probabilistic framework for comparing PPI networks and effectively predicting the correspondence between proteins, represented as network nodes, that belong to conserved functional modules across the given PPI networks. The basic idea is to estimate the steady-state network flow between nodes that belong to different PPI networks based on a Markov random walk model. The random walker is designed to make random moves to adjacent nodes within a PPI network as well as cross-network moves between potential orthologous nodes with high sequence similarity. Based on this Markov random walk model, we estimate the steady-state network flow - or the long-term relative frequency of the transitions that the random walker makes - between nodes in different PPI networks, which can be used as a probabilistic score measuring their potential correspondence. Subsequently, the estimated scores can be used for detecting orthologous proteins in conserved functional modules through network alignment. Through evaluations based on multiple real PPI networks, we demonstrate that the proposed scheme leads to improved alignment results that are biologically more meaningful at reduced computational cost, outperforming the current state-of-the-art algorithms. The source code and datasets can be downloaded from http://www.ece.tamu.edu/~bjyoon/CUFID .
Directory of Open Access Journals (Sweden)
Yosef Y Kuttner
2013-04-01
Full Text Available Knowledge of the structural basis of protein-protein interactions (PPI is of fundamental importance for understanding the organization and functioning of biological networks and advancing the design of therapeutics which target PPI. Allosteric modulators play an important role in regulating such interactions by binding at site(s orthogonal to the complex interface and altering the protein's propensity for complex formation. In this work, we apply an approach recently developed by us for analyzing protein surfaces based on steered molecular dynamics simulation (SMD to the study of the dynamic properties of functionally distinct conformations of a model protein, calmodulin (CaM, whose ability to interact with target proteins is regulated by the presence of the allosteric modulator Ca(2+. Calmodulin is a regulatory protein that acts as an intracellular Ca(2+ sensor to control a wide variety of cellular processes. We demonstrate that SMD analysis is capable of pinpointing CaM surfaces implicated in the recognition of both the allosteric modulator Ca(2+ and target proteins. Our analysis of changes in the dynamic properties of the CaM backbone elicited by Ca(2+ binding yielded new insights into the molecular mechanism of allosteric regulation of CaM-target interactions.
Defining the protein interaction network of human malaria parasite Plasmodium falciparum
Ramaprasad, Abhinay
2012-02-01
Malaria, caused by the protozoan parasite Plasmodium falciparum, affects around 225. million people yearly and a huge international effort is directed towards combating this grave threat to world health and economic development. Considerable advances have been made in malaria research triggered by the sequencing of its genome in 2002, followed by several high-throughput studies defining the malaria transcriptome and proteome. A protein-protein interaction (PPI) network seeks to trace the dynamic interactions between proteins, thereby elucidating their local and global functional relationships. Experimentally derived PPI network from high-throughput methods such as yeast two hybrid (Y2H) screens are inherently noisy, but combining these independent datasets by computational methods tends to give a greater accuracy and coverage. This review aims to discuss the computational approaches used till date to construct a malaria protein interaction network and to catalog the functional predictions and biological inferences made from analysis of the PPI network. © 2011 Elsevier Inc.
Exploring overlapping functional units with various structure in protein interaction networks.
Directory of Open Access Journals (Sweden)
Xiao-Fei Zhang
Full Text Available Revealing functional units in protein-protein interaction (PPI networks are important for understanding cellular functional organization. Current algorithms for identifying functional units mainly focus on cohesive protein complexes which have more internal interactions than external interactions. Most of these approaches do not handle overlaps among complexes since they usually allow a protein to belong to only one complex. Moreover, recent studies have shown that other non-cohesive structural functional units beyond complexes also exist in PPI networks. Thus previous algorithms that just focus on non-overlapping cohesive complexes are not able to present the biological reality fully. Here, we develop a new regularized sparse random graph model (RSRGM to explore overlapping and various structural functional units in PPI networks. RSRGM is principally dominated by two model parameters. One is used to define the functional units as groups of proteins that have similar patterns of connections to others, which allows RSRGM to detect non-cohesive structural functional units. The other one is used to represent the degree of proteins belonging to the units, which supports a protein belonging to more than one revealed unit. We also propose a regularizer to control the smoothness between the estimators of these two parameters. Experimental results on four S. cerevisiae PPI networks show that the performance of RSRGM on detecting cohesive complexes and overlapping complexes is superior to that of previous competing algorithms. Moreover, RSRGM has the ability to discover biological significant functional units besides complexes.
Stringent DDI-based prediction of H. sapiens-M. tuberculosis H37Rv protein-protein interactions.
Zhou, Hufeng; Rezaei, Javad; Hugo, Willy; Gao, Shangzhi; Jin, Jingjing; Fan, Mengyuan; Yong, Chern-Han; Wozniak, Michal; Wong, Limsoon
2013-01-01
H. sapiens-M. tuberculosis H37Rv protein-protein interaction (PPI) data are very important information to illuminate the infection mechanism of M. tuberculosis H37Rv. But current H. sapiens-M. tuberculosis H37Rv PPI data are very scarce. This seriously limits the study of the interaction between this important pathogen and its host H. sapiens. Computational prediction of H. sapiens-M. tuberculosis H37Rv PPIs is an important strategy to fill in the gap. Domain-domain interaction (DDI) based prediction is one of the frequently used computational approaches in predicting both intra-species and inter-species PPIs. However, the performance of DDI-based host-pathogen PPI prediction has been rather limited. We develop a stringent DDI-based prediction approach with emphasis on (i) differences between the specific domain sequences on annotated regions of proteins under the same domain ID and (ii) calculation of the interaction strength of predicted PPIs based on the interacting residues in their interaction interfaces. We compare our stringent DDI-based approach to a conventional DDI-based approach for predicting PPIs based on gold standard intra-species PPIs and coherent informative Gene Ontology terms assessment. The assessment results show that our stringent DDI-based approach achieves much better performance in predicting PPIs than the conventional approach. Using our stringent DDI-based approach, we have predicted a small set of reliable H. sapiens-M. tuberculosis H37Rv PPIs which could be very useful for a variety of related studies. We also analyze the H. sapiens-M. tuberculosis H37Rv PPIs predicted by our stringent DDI-based approach using cellular compartment distribution analysis, functional category enrichment analysis and pathway enrichment analysis. The analyses support the validity of our prediction result. Also, based on an analysis of the H. sapiens-M. tuberculosis H37Rv PPI network predicted by our stringent DDI-based approach, we have discovered some
Directory of Open Access Journals (Sweden)
Jiawei Luo
Full Text Available Computational approaches aided by computer science have been used to predict essential proteins and are faster than expensive, time-consuming, laborious experimental approaches. However, the performance of such approaches is still poor, making practical applications of computational approaches difficult in some fields. Hence, the development of more suitable and efficient computing methods is necessary for identification of essential proteins.In this paper, we propose a new method for predicting essential proteins in a protein interaction network, local interaction density combined with protein complexes (LIDC, based on statistical analyses of essential proteins and protein complexes. First, we introduce a new local topological centrality, local interaction density (LID, of the yeast PPI network; second, we discuss a new integration strategy for multiple bioinformatics. The LIDC method was then developed through a combination of LID and protein complex information based on our new integration strategy. The purpose of LIDC is discovery of important features of essential proteins with their neighbors in real protein complexes, thereby improving the efficiency of identification.Experimental results based on three different PPI(protein-protein interaction networks of Saccharomyces cerevisiae and Escherichia coli showed that LIDC outperformed classical topological centrality measures and some recent combinational methods. Moreover, when predicting MIPS datasets, the better improvement of performance obtained by LIDC is over all nine reference methods (i.e., DC, BC, NC, LID, PeC, CoEWC, WDC, ION, and UC.LIDC is more effective for the prediction of essential proteins than other recently developed methods.
Directory of Open Access Journals (Sweden)
Nan Zhao
2014-05-01
Full Text Available Single nucleotide polymorphisms (SNPs are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs have been found near or inside the protein-protein interaction (PPI interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor. Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1 a 2-class problem (strengthening/weakening PPI mutations, (2 another 2-class problem (mutations that disrupt/preserve a PPI, and (3 a 3-class classification (detrimental/neutral/beneficial mutation effects. In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. The accurate and balanced performance of SNP-IN tool makes it readily available to study the
Zhao, Nan; Han, Jing Ginger; Shyu, Chi-Ren; Korkin, Dmitry
2014-01-01
Single nucleotide polymorphisms (SNPs) are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs) have been found near or inside the protein-protein interaction (PPI) interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor). Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1) a 2-class problem (strengthening/weakening PPI mutations), (2) another 2-class problem (mutations that disrupt/preserve a PPI), and (3) a 3-class classification (detrimental/neutral/beneficial mutation effects). In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. The accurate and balanced performance of SNP-IN tool makes it readily available to study the rewiring of
BIPS: BIANA Interolog Prediction Server. A tool for protein-protein interaction inference.
Garcia-Garcia, Javier; Schleker, Sylvia; Klein-Seetharaman, Judith; Oliva, Baldo
2012-07-01
Protein-protein interactions (PPIs) play a crucial role in biology, and high-throughput experiments have greatly increased the coverage of known interactions. Still, identification of complete inter- and intraspecies interactomes is far from being complete. Experimental data can be complemented by the prediction of PPIs within an organism or between two organisms based on the known interactions of the orthologous genes of other organisms (interologs). Here, we present the BIANA (Biologic Interactions and Network Analysis) Interolog Prediction Server (BIPS), which offers a web-based interface to facilitate PPI predictions based on interolog information. BIPS benefits from the capabilities of the framework BIANA to integrate the several PPI-related databases. Additional metadata can be used to improve the reliability of the predicted interactions. Sensitivity and specificity of the server have been calculated using known PPIs from different interactomes using a leave-one-out approach. The specificity is between 72 and 98%, whereas sensitivity varies between 1 and 59%, depending on the sequence identity cut-off used to calculate similarities between sequences. BIPS is freely accessible at http://sbi.imim.es/BIPS.php.
P-Finder: Reconstruction of Signaling Networks from Protein-Protein Interactions and GO Annotations.
Young-Rae Cho; Yanan Xin; Speegle, Greg
2015-01-01
Because most complex genetic diseases are caused by defects of cell signaling, illuminating a signaling cascade is essential for understanding their mechanisms. We present three novel computational algorithms to reconstruct signaling networks between a starting protein and an ending protein using genome-wide protein-protein interaction (PPI) networks and gene ontology (GO) annotation data. A signaling network is represented as a directed acyclic graph in a merged form of multiple linear pathways. An advanced semantic similarity metric is applied for weighting PPIs as the preprocessing of all three methods. The first algorithm repeatedly extends the list of nodes based on path frequency towards an ending protein. The second algorithm repeatedly appends edges based on the occurrence of network motifs which indicate the link patterns more frequently appearing in a PPI network than in a random graph. The last algorithm uses the information propagation technique which iteratively updates edge orientations based on the path strength and merges the selected directed edges. Our experimental results demonstrate that the proposed algorithms achieve higher accuracy than previous methods when they are tested on well-studied pathways of S. cerevisiae. Furthermore, we introduce an interactive web application tool, called P-Finder, to visualize reconstructed signaling networks.
Subramani, Suresh; Kalpana, Raja; Monickaraj, Pankaj Moses; Natarajan, Jeyakumar
2015-04-01
The knowledge on protein-protein interactions (PPI) and their related pathways are equally important to understand the biological functions of the living cell. Such information on human proteins is highly desirable to understand the mechanism of several diseases such as cancer, diabetes, and Alzheimer's disease. Because much of that information is buried in biomedical literature, an automated text mining system for visualizing human PPI and pathways is highly desirable. In this paper, we present HPIminer, a text mining system for visualizing human protein interactions and pathways from biomedical literature. HPIminer extracts human PPI information and PPI pairs from biomedical literature, and visualize their associated interactions, networks and pathways using two curated databases HPRD and KEGG. To our knowledge, HPIminer is the first system to build interaction networks from literature as well as curated databases. Further, the new interactions mined only from literature and not reported earlier in databases are highlighted as new. A comparative study with other similar tools shows that the resultant network is more informative and provides additional information on interacting proteins and their associated networks. Copyright © 2015 Elsevier Inc. All rights reserved.
Directory of Open Access Journals (Sweden)
Naoki Orii
Full Text Available Protein-protein interactions (PPIs are the basis of biological functions. Knowledge of the interactions of a protein can help understand its molecular function and its association with different biological processes and pathways. Several publicly available databases provide comprehensive information about individual proteins, such as their sequence, structure, and function. There also exist databases that are built exclusively to provide PPIs by curating them from published literature. The information provided in these web resources is protein-centric, and not PPI-centric. The PPIs are typically provided as lists of interactions of a given gene with links to interacting partners; they do not present a comprehensive view of the nature of both the proteins involved in the interactions. A web database that allows search and retrieval based on biomedical characteristics of PPIs is lacking, and is needed. We present Wiki-Pi (read Wiki-π, a web-based interface to a database of human PPIs, which allows users to retrieve interactions by their biomedical attributes such as their association to diseases, pathways, drugs and biological functions. Each retrieved PPI is shown with annotations of both of the participant proteins side-by-side, creating a basis to hypothesize the biological function facilitated by the interaction. Conceptually, it is a search engine for PPIs analogous to PubMed for scientific literature. Its usefulness in generating novel scientific hypotheses is demonstrated through the study of IGSF21, a little-known gene that was recently identified to be associated with diabetic retinopathy. Using Wiki-Pi, we infer that its association to diabetic retinopathy may be mediated through its interactions with the genes HSPB1, KRAS, TMSB4X and DGKD, and that it may be involved in cellular response to external stimuli, cytoskeletal organization and regulation of molecular activity. The website also provides a wiki-like capability allowing users
Directory of Open Access Journals (Sweden)
Moschopoulos Charalampos
2011-06-01
Full Text Available Abstract Background Recent technological advances applied to biology such as yeast-two-hybrid, phage display and mass spectrometry have enabled us to create a detailed map of protein interaction networks. These interaction networks represent a rich, yet noisy, source of data that could be used to extract meaningful information, such as protein complexes. Several interaction network weighting schemes have been proposed so far in the literature in order to eliminate the noise inherent in interactome data. In this paper, we propose a novel weighting scheme and apply it to the S. cerevisiae interactome. Complex prediction rates are improved by up to 39%, depending on the clustering algorithm applied. Results We adopt a two step procedure. During the first step, by applying both novel and well established protein-protein interaction (PPI weighting methods, weights are introduced to the original interactome graph based on the confidence level that a given interaction is a true-positive one. The second step applies clustering using established algorithms in the field of graph theory, as well as two variations of Spectral clustering. The clustered interactome networks are also cross-validated against the confirmed protein complexes present in the MIPS database. Conclusions The results of our experimental work demonstrate that interactome graph weighting methods clearly improve the clustering results of several clustering algorithms. Moreover, our proposed weighting scheme outperforms other approaches of PPI graph weighting.
Directory of Open Access Journals (Sweden)
Jun Ren
2014-01-01
Full Text Available Many evidences have demonstrated that protein complexes are overlapping and hierarchically organized in PPI networks. Meanwhile, the large size of PPI network wants complex detection methods have low time complexity. Up to now, few methods can identify overlapping and hierarchical protein complexes in a PPI network quickly. In this paper, a novel method, called MCSE, is proposed based on λ-module and “seed-expanding.” First, it chooses seeds as essential PPIs or edges with high edge clustering values. Then, it identifies protein complexes by expanding each seed to a λ-module. MCSE is suitable for large PPI networks because of its low time complexity. MCSE can identify overlapping protein complexes naturally because a protein can be visited by different seeds. MCSE uses the parameter λ_th to control the range of seed expanding and can detect a hierarchical organization of protein complexes by tuning the value of λ_th. Experimental results of S. cerevisiae show that this hierarchical organization is similar to that of known complexes in MIPS database. The experimental results also show that MCSE outperforms other previous competing algorithms, such as CPM, CMC, Core-Attachment, Dpclus, HC-PIN, MCL, and NFC, in terms of the functional enrichment and matching with known protein complexes.
Predicting Protein-Protein Interaction Sites with a Novel Membership Based Fuzzy SVM Classifier.
Sriwastava, Brijesh K; Basu, Subhadip; Maulik, Ujjwal
2015-01-01
Predicting residues that participate in protein-protein interactions (PPI) helps to identify, which amino acids are located at the interface. In this paper, we show that the performance of the classical support vector machine (SVM) algorithm can further be improved with the use of a custom-designed fuzzy membership function, for the partner-specific PPI interface prediction problem. We evaluated the performances of both classical SVM and fuzzy SVM (F-SVM) on the PPI databases of three different model proteomes of Homo sapiens, Escherichia coli and Saccharomyces Cerevisiae and calculated the statistical significance of the developed F-SVM over classical SVM algorithm. We also compared our performance with the available state-of-the-art fuzzy methods in this domain and observed significant performance improvements. To predict interaction sites in protein complexes, local composition of amino acids together with their physico-chemical characteristics are used, where the F-SVM based prediction method exploits the membership function for each pair of sequence fragments. The average F-SVM performance (area under ROC curve) on the test samples in 10-fold cross validation experiment are measured as 77.07, 78.39, and 74.91 percent for the aforementioned organisms respectively. Performances on independent test sets are obtained as 72.09, 73.24 and 82.74 percent respectively. The software is available for free download from http://code.google.com/p/cmater-bioinfo.
Prediction of Protein–Protein Interactions by Evidence Combining Methods
Directory of Open Access Journals (Sweden)
Ji-Wei Chang
2016-11-01
Full Text Available Most cellular functions involve proteins’ features based on their physical interactions with other partner proteins. Sketching a map of protein–protein interactions (PPIs is therefore an important inception step towards understanding the basics of cell functions. Several experimental techniques operating in vivo or in vitro have made significant contributions to screening a large number of protein interaction partners, especially high-throughput experimental methods. However, computational approaches for PPI predication supported by rapid accumulation of data generated from experimental techniques, 3D structure definitions, and genome sequencing have boosted the map sketching of PPIs. In this review, we shed light on in silico PPI prediction methods that integrate evidence from multiple sources, including evolutionary relationship, function annotation, sequence/structure features, network topology and text mining. These methods are developed for integration of multi-dimensional evidence, for designing the strategies to predict novel interactions, and for making the results consistent with the increase of prediction coverage and accuracy.
MPact: the MIPS protein interaction resource on yeast.
Güldener, Ulrich; Münsterkötter, Martin; Oesterheld, Matthias; Pagel, Philipp; Ruepp, Andreas; Mewes, Hans-Werner; Stümpflen, Volker
2006-01-01
In recent years, the Munich Information Center for Protein Sequences (MIPS) yeast protein-protein interaction (PPI) dataset has been used in numerous analyses of protein networks and has been called a gold standard because of its quality and comprehensiveness [H. Yu, N. M. Luscombe, H. X. Lu, X. Zhu, Y. Xia, J. D. Han, N. Bertin, S. Chung, M. Vidal and M. Gerstein (2004) Genome Res., 14, 1107-1118]. MPact and the yeast protein localization catalog provide information related to the proximity of proteins in yeast. Beside the integration of high-throughput data, information about experimental evidence for PPIs in the literature was compiled by experts adding up to 4300 distinct PPIs connecting 1500 proteins in yeast. As the interaction data is a complementary part of CYGD, interactive mapping of data on other integrated data types such as the functional classification catalog [A. Ruepp, A. Zollner, D. Maier, K. Albermann, J. Hani, M. Mokrejs, I. Tetko, U. Güldener, G. Mannhaupt, M. Münsterkötter and H. W. Mewes (2004) Nucleic Acids Res., 32, 5539-5545] is possible. A survey of signaling proteins and comparison with pathway data from KEGG demonstrates that based on these manually annotated data only an extensive overview of the complexity of this functional network can be obtained in yeast. The implementation of a web-based PPI-analysis tool allows analysis and visualization of protein interaction networks and facilitates integration of our curated data with high-throughput datasets. The complete dataset as well as user-defined sub-networks can be retrieved easily in the standardized PSI-MI format. The resource can be accessed through http://mips.gsf.de/genre/proj/mpact.
Reynès, Christelle; Host, Hélène; Camproux, Anne-Claude; Laconde, Guillaume; Leroux, Florence; Mazars, Anne; Deprez, Benoit; Fahraeus, Robin; Villoutreix, Bruno O.; Sperandio, Olivier
2010-01-01
Protein-protein interactions (PPIs) may represent one of the next major classes of therapeutic targets. So far, only a minute fraction of the estimated 650,000 PPIs that comprise the human interactome are known with a tiny number of complexes being drugged. Such intricate biological systems cannot be cost-efficiently tackled using conventional high-throughput screening methods. Rather, time has come for designing new strategies that will maximize the chance for hit identification through a rationalization of the PPI inhibitor chemical space and the design of PPI-focused compound libraries (global or target-specific). Here, we train machine-learning-based models, mainly decision trees, using a dataset of known PPI inhibitors and of regular drugs in order to determine a global physico-chemical profile for putative PPI inhibitors. This statistical analysis unravels two important molecular descriptors for PPI inhibitors characterizing specific molecular shapes and the presence of a privileged number of aromatic bonds. The best model has been transposed into a computer program, PPI-HitProfiler, that can output from any drug-like compound collection a focused chemical library enriched in putative PPI inhibitors. Our PPI inhibitor profiler is challenged on the experimental screening results of 11 different PPIs among which the p53/MDM2 interaction screened within our own CDithem platform, that in addition to the validation of our concept led to the identification of 4 novel p53/MDM2 inhibitors. Collectively, our tool shows a robust behavior on the 11 experimental datasets by correctly profiling 70% of the experimentally identified hits while removing 52% of the inactive compounds from the initial compound collections. We strongly believe that this new tool can be used as a global PPI inhibitor profiler prior to screening assays to reduce the size of the compound collections to be experimentally screened while keeping most of the true PPI inhibitors. PPI-HitProfiler is
Directory of Open Access Journals (Sweden)
Christelle Reynès
2010-03-01
Full Text Available Protein-protein interactions (PPIs may represent one of the next major classes of therapeutic targets. So far, only a minute fraction of the estimated 650,000 PPIs that comprise the human interactome are known with a tiny number of complexes being drugged. Such intricate biological systems cannot be cost-efficiently tackled using conventional high-throughput screening methods. Rather, time has come for designing new strategies that will maximize the chance for hit identification through a rationalization of the PPI inhibitor chemical space and the design of PPI-focused compound libraries (global or target-specific. Here, we train machine-learning-based models, mainly decision trees, using a dataset of known PPI inhibitors and of regular drugs in order to determine a global physico-chemical profile for putative PPI inhibitors. This statistical analysis unravels two important molecular descriptors for PPI inhibitors characterizing specific molecular shapes and the presence of a privileged number of aromatic bonds. The best model has been transposed into a computer program, PPI-HitProfiler, that can output from any drug-like compound collection a focused chemical library enriched in putative PPI inhibitors. Our PPI inhibitor profiler is challenged on the experimental screening results of 11 different PPIs among which the p53/MDM2 interaction screened within our own CDithem platform, that in addition to the validation of our concept led to the identification of 4 novel p53/MDM2 inhibitors. Collectively, our tool shows a robust behavior on the 11 experimental datasets by correctly profiling 70% of the experimentally identified hits while removing 52% of the inactive compounds from the initial compound collections. We strongly believe that this new tool can be used as a global PPI inhibitor profiler prior to screening assays to reduce the size of the compound collections to be experimentally screened while keeping most of the true PPI inhibitors. PPI
Reynès, Christelle; Host, Hélène; Camproux, Anne-Claude; Laconde, Guillaume; Leroux, Florence; Mazars, Anne; Deprez, Benoit; Fahraeus, Robin; Villoutreix, Bruno O; Sperandio, Olivier
2010-03-05
Protein-protein interactions (PPIs) may represent one of the next major classes of therapeutic targets. So far, only a minute fraction of the estimated 650,000 PPIs that comprise the human interactome are known with a tiny number of complexes being drugged. Such intricate biological systems cannot be cost-efficiently tackled using conventional high-throughput screening methods. Rather, time has come for designing new strategies that will maximize the chance for hit identification through a rationalization of the PPI inhibitor chemical space and the design of PPI-focused compound libraries (global or target-specific). Here, we train machine-learning-based models, mainly decision trees, using a dataset of known PPI inhibitors and of regular drugs in order to determine a global physico-chemical profile for putative PPI inhibitors. This statistical analysis unravels two important molecular descriptors for PPI inhibitors characterizing specific molecular shapes and the presence of a privileged number of aromatic bonds. The best model has been transposed into a computer program, PPI-HitProfiler, that can output from any drug-like compound collection a focused chemical library enriched in putative PPI inhibitors. Our PPI inhibitor profiler is challenged on the experimental screening results of 11 different PPIs among which the p53/MDM2 interaction screened within our own CDithem platform, that in addition to the validation of our concept led to the identification of 4 novel p53/MDM2 inhibitors. Collectively, our tool shows a robust behavior on the 11 experimental datasets by correctly profiling 70% of the experimentally identified hits while removing 52% of the inactive compounds from the initial compound collections. We strongly believe that this new tool can be used as a global PPI inhibitor profiler prior to screening assays to reduce the size of the compound collections to be experimentally screened while keeping most of the true PPI inhibitors. PPI-HitProfiler is
Directory of Open Access Journals (Sweden)
Amin R Mazloom
2011-12-01
Full Text Available Coregulator proteins (CoRegs are part of multi-protein complexes that transiently assemble with transcription factors and chromatin modifiers to regulate gene expression. In this study we analyzed data from 3,290 immuno-precipitations (IP followed by mass spectrometry (MS applied to human cell lines aimed at identifying CoRegs complexes. Using the semi-quantitative spectral counts, we scored binary protein-protein and domain-domain associations with several equations. Unlike previous applications, our methods scored prey-prey protein-protein interactions regardless of the baits used. We also predicted domain-domain interactions underlying predicted protein-protein interactions. The quality of predicted protein-protein and domain-domain interactions was evaluated using known binary interactions from the literature, whereas one protein-protein interaction, between STRN and CTTNBP2NL, was validated experimentally; and one domain-domain interaction, between the HEAT domain of PPP2R1A and the Pkinase domain of STK25, was validated using molecular docking simulations. The scoring schemes presented here recovered known, and predicted many new, complexes, protein-protein, and domain-domain interactions. The networks that resulted from the predictions are provided as a web-based interactive application at http://maayanlab.net/HT-IP-MS-2-PPI-DDI/.
BioPlex Display: An Interactive Suite for Large-Scale AP-MS Protein-Protein Interaction Data.
Schweppe, Devin K; Huttlin, Edward L; Harper, J Wade; Gygi, Steven P
2018-01-05
The development of large-scale data sets requires a new means to display and disseminate research studies to large audiences. Knowledge of protein-protein interaction (PPI) networks has become a principle interest of many groups within the field of proteomics. At the confluence of technologies, such as cross-linking mass spectrometry, yeast two-hybrid, protein cofractionation, and affinity purification mass spectrometry (AP-MS), detection of PPIs can uncover novel biological inferences at a high-throughput. Thus new platforms to provide community access to large data sets are necessary. To this end, we have developed a web application that enables exploration and dissemination of the growing BioPlex interaction network. BioPlex is a large-scale interactome data set based on AP-MS of baits from the human ORFeome. The latest BioPlex data set release (BioPlex 2.0) contains 56 553 interactions from 5891 AP-MS experiments. To improve community access to this vast compendium of interactions, we developed BioPlex Display, which integrates individual protein querying, access to empirical data, and on-the-fly annotation of networks within an easy-to-use and mobile web application. BioPlex Display enables rapid acquisition of data from BioPlex and development of hypotheses based on protein interactions.
Next-Generation Sequencing for Binary Protein-Protein Interactions
Directory of Open Access Journals (Sweden)
Bernhard eSuter
2015-12-01
Full Text Available The yeast two-hybrid (Y2H system exploits host cell genetics in order to display binary protein-protein interactions (PPIs via defined and selectable phenotypes. Numerous improvements have been made to this method, adapting the screening principle for diverse applications, including drug discovery and the scale-up for proteome wide interaction screens in human and other organisms. Here we discuss a systematic workflow and analysis scheme for screening data generated by Y2H and related assays that includes high-throughput selection procedures, readout of comprehensive results via next-generation sequencing (NGS, and the interpretation of interaction data via quantitative statistics. The novel assays and tools will serve the broader scientific community to harness the power of NGS technology to address PPI networks in health and disease. We discuss examples of how this next-generation platform can be applied to address specific questions in diverse fields of biology and medicine.
DEFF Research Database (Denmark)
Neergaard, Martin S; Kalonia, Devendra S; Parshad, Henrik
2013-01-01
The purpose of this work was to explore the relation between protein-protein interactions (PPIs) and solution viscosity at high protein concentration using three monoclonal antibodies (mAbs), two of the IgG4 subclass and one of the IgG1 subclass. A range of methods was used to quantify the PPI...... low or high protein concentration determined using DLS. The PPI measurements were correlated with solution viscosity (measured by DLS using polystyrene nanospheres and ultrasonic shear rheology) as a function of pH (4-9) and ionic strength (10, 50 and 150mM). Our measurements showed that the highest...... solution viscosity was observed under conditions with the most negative kD, the highest apparent radius and the lowest net charge. An increase in ionic strength resulted in a change in the nature of the PPI at low pH from repulsive to attractive. In the neutral to alkaline pH region the mAbs behaved...
Cannistraci, Carlo; Alanis Lobato, Gregorio; Ravasi, Timothy
2013-01-01
Motivation: Most functions within the cell emerge thanks to protein-protein interactions (PPIs), yet experimental determination of PPIs is both expensive and time-consuming. PPI networks present significant levels of noise and incompleteness
Rationalizing the chemical space of protein-protein interaction inhibitors.
Sperandio, Olivier; Reynès, Christelle H; Camproux, Anne-Claude; Villoutreix, Bruno O
2010-03-01
Protein-protein interactions (PPIs) are one of the next major classes of therapeutic targets, although they are too intricate to tackle with standard approaches. This is due, in part, to the inadequacy of today's chemical libraries. However, the emergence of a growing number of experimentally validated inhibitors of PPIs (i-PPIs) allows drug designers to use chemoinformatics and machine learning technologies to unravel the nature of the chemical space covered by the reported compounds. Key characteristics of i-PPIs can then be revealed and highlight the importance of specific shapes and/or aromatic bonds, enabling the design of i-PPI-enriched focused libraries and, therefore, of cost-effective screening strategies. 2009 Elsevier Ltd. All rights reserved.
Computational prediction of protein-protein interactions in Leishmania predicted proteomes.
Directory of Open Access Journals (Sweden)
Antonio M Rezende
Full Text Available The Trypanosomatids parasites Leishmania braziliensis, Leishmania major and Leishmania infantum are important human pathogens. Despite of years of study and genome availability, effective vaccine has not been developed yet, and the chemotherapy is highly toxic. Therefore, it is clear just interdisciplinary integrated studies will have success in trying to search new targets for developing of vaccines and drugs. An essential part of this rationale is related to protein-protein interaction network (PPI study which can provide a better understanding of complex protein interactions in biological system. Thus, we modeled PPIs for Trypanosomatids through computational methods using sequence comparison against public database of protein or domain interaction for interaction prediction (Interolog Mapping and developed a dedicated combined system score to address the predictions robustness. The confidence evaluation of network prediction approach was addressed using gold standard positive and negative datasets and the AUC value obtained was 0.94. As result, 39,420, 43,531 and 45,235 interactions were predicted for L. braziliensis, L. major and L. infantum respectively. For each predicted network the top 20 proteins were ranked by MCC topological index. In addition, information related with immunological potential, degree of protein sequence conservation among orthologs and degree of identity compared to proteins of potential parasite hosts was integrated. This information integration provides a better understanding and usefulness of the predicted networks that can be valuable to select new potential biological targets for drug and vaccine development. Network modularity which is a key when one is interested in destabilizing the PPIs for drug or vaccine purposes along with multiple alignments of the predicted PPIs were performed revealing patterns associated with protein turnover. In addition, around 50% of hypothetical protein present in the networks
DomPep--a general method for predicting modular domain-mediated protein-protein interactions.
Directory of Open Access Journals (Sweden)
Lei Li
Full Text Available Protein-protein interactions (PPIs are frequently mediated by the binding of a modular domain in one protein to a short, linear peptide motif in its partner. The advent of proteomic methods such as peptide and protein arrays has led to the accumulation of a wealth of interaction data for modular interaction domains. Although several computational programs have been developed to predict modular domain-mediated PPI events, they are often restricted to a given domain type. We describe DomPep, a method that can potentially be used to predict PPIs mediated by any modular domains. DomPep combines proteomic data with sequence information to achieve high accuracy and high coverage in PPI prediction. Proteomic binding data were employed to determine a simple yet novel parameter Ligand-Binding Similarity which, in turn, is used to calibrate Domain Sequence Identity and Position-Weighted-Matrix distance, two parameters that are used in constructing prediction models. Moreover, DomPep can be used to predict PPIs for both domains with experimental binding data and those without. Using the PDZ and SH2 domain families as test cases, we show that DomPep can predict PPIs with accuracies superior to existing methods. To evaluate DomPep as a discovery tool, we deployed DomPep to identify interactions mediated by three human PDZ domains. Subsequent in-solution binding assays validated the high accuracy of DomPep in predicting authentic PPIs at the proteome scale. Because DomPep makes use of only interaction data and the primary sequence of a domain, it can be readily expanded to include other types of modular domains.
FMFinder: A Functional Module Detector for PPI Networks
Directory of Open Access Journals (Sweden)
M. Modi
2017-10-01
Full Text Available Bioinformatics is an integrated area of data mining, statistics and computational biology. Protein-Protein Interaction (PPI network is the most important biological process in living beings. In this network a protein module interacts with another module and so on, forming a large network of proteins. The same set of proteins which takes part in the organic courses of biological actions is detected through the Function Module Detection method. Clustering process when applied in PPI networks is made of proteins which are part of a larger communication network. As a result of this, we can define the limits for module detection as well as clarify the construction of a PPI network. For understating the bio-mechanism of various living beings, a detailed study of FMFinder detection by clustering process is called for.
Kubis, Sybille; Baldwin, Amy; Patel, Ramesh; Razzaq, Azam; Dupree, Paul; Lilley, Kathryn; Kurth, Joachim; Leister, Dario; Jarvis, Paul
2003-01-01
The import of nucleus-encoded proteins into chloroplasts is mediated by translocon complexes in the envelope membranes. A component of the translocon in the outer envelope membrane, Toc34, is encoded in Arabidopsis by two homologous genes, atTOC33 and atTOC34. Whereas atTOC34 displays relatively uniform expression throughout development, atTOC33 is strongly upregulated in rapidly growing, photosynthetic tissues. To understand the reason for the existence of these two related genes, we characterized the atTOC33 knockout mutant ppi1. Immunoblotting and proteomics revealed that components of the photosynthetic apparatus are deficient in ppi1 chloroplasts and that nonphotosynthetic chloroplast proteins are unchanged or enriched slightly. Furthermore, DNA array analysis of 3292 transcripts revealed that photosynthetic genes are moderately, but specifically, downregulated in ppi1. Proteome differences in ppi1 could be correlated with protein import rates: ppi1 chloroplasts imported the ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit and 33-kD oxygen-evolving complex precursors at significantly reduced rates, but the import of a 50S ribosomal subunit precursor was largely unaffected. The ppi1 import defect occurred at the level of preprotein binding, which is consistent with a role for atToc33 during preprotein recognition. The data suggest that atToc33 is involved preferentially in the import of photosynthetic proteins and, by extension, that atToc34 is involved in the import of nonphotosynthetic chloroplast proteins. PMID:12897258
Kinase Substrate Sensor (KISS), a mammalian in situ protein interaction sensor.
Lievens, Sam; Gerlo, Sarah; Lemmens, Irma; De Clercq, Dries J H; Risseeuw, Martijn D P; Vanderroost, Nele; De Smet, Anne-Sophie; Ruyssinck, Elien; Chevet, Eric; Van Calenbergh, Serge; Tavernier, Jan
2014-12-01
Probably every cellular process is governed by protein-protein interaction (PPIs), which are often highly dynamic in nature being modulated by in- or external stimuli. Here we present KISS, for KInase Substrate Sensor, a mammalian two-hybrid approach designed to map intracellular PPIs and some of the dynamic features they exhibit. Benchmarking experiments indicate that in terms of sensitivity and specificity KISS is on par with other binary protein interaction technologies while being complementary with regard to the subset of PPIs it is able to detect. We used KISS to evaluate interactions between different types of proteins, including transmembrane proteins, expressed at their native subcellular location. In situ analysis of endoplasmic reticulum stress-induced clustering of the endoplasmic reticulum stress sensor ERN1 and ligand-dependent β-arrestin recruitment to GPCRs illustrated the method's potential to study functional PPI modulation in complex cellular processes. Exploring its use as a tool for in cell evaluation of pharmacological interference with PPIs, we showed that reported effects of known GPCR antagonists and PPI inhibitors are properly recapitulated. In a three-hybrid setup, KISS was able to map interactions between small molecules and proteins. Taken together, we established KISS as a sensitive approach for in situ analysis of protein interactions and their modulation in a changing cellular context or in response to pharmacological challenges. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.
Villa, Stefania; Legnani, Laura; Colombo, Diego; Gelain, Arianna; Lammi, Carmen; Bongiorno, Daniele; Ilboudo, Denise P.; McGee, Kellen E.; Bosch, Jürgen; Grazioso, Giovanni
2018-03-01
The proteins involved in the autophagy (Atg) pathway have recently been considered promising targets for the development of new antimalarial drugs. In particular, inhibitors of the protein-protein interaction (PPI) between Atg3 and Atg8 of Plasmodium falciparum retarded the blood- and liver-stages of parasite growth. In this paper, we used computational techniques to design a new class of peptidomimetics mimicking the Atg3 interaction motif, which were then synthesized by click-chemistry. Surface plasmon resonance has been employed to measure the ability of these compounds to inhibit the Atg3-Atg8 reciprocal protein-protein interaction. Moreover, P. falciparum growth inhibition in red blood cell cultures was evaluated as well as the cyto-toxicity of the compounds.
Detection of Locally Over-Represented GO Terms in Protein-Protein Interaction Networks
LAVALLÉE-ADAM, MATHIEU; COULOMBE, BENOIT; BLANCHETTE, MATHIEU
2015-01-01
High-throughput methods for identifying protein-protein interactions produce increasingly complex and intricate interaction networks. These networks are extremely rich in information, but extracting biologically meaningful hypotheses from them and representing them in a human-readable manner is challenging. We propose a method to identify Gene Ontology terms that are locally over-represented in a subnetwork of a given biological network. Specifically, we propose several methods to evaluate the degree of clustering of proteins associated to a particular GO term in both weighted and unweighted PPI networks, and describe efficient methods to estimate the statistical significance of the observed clustering. We show, using Monte Carlo simulations, that our best approximation methods accurately estimate the true p-value, for random scale-free graphs as well as for actual yeast and human networks. When applied to these two biological networks, our approach recovers many known complexes and pathways, but also suggests potential functions for many subnetworks. Online Supplementary Material is available at www.liebertonline.com. PMID:20377456
Peng, Yifan; Arighi, Cecilia; Wu, Cathy H; Vijay-Shanker, K
2016-01-01
There has been a large growth in the number of biomedical publications that report experimental results. Many of these results concern detection of protein-protein interactions (PPI). In BioCreative V, we participated in the BioC task and developed a PPI system to detect text passages with PPIs in the full-text articles. By adopting the BioC format, the output of the system can be seamlessly added to the biocuration pipeline with little effort required for the system integration. A distinctive feature of our PPI system is that it utilizes extended dependency graph, an intermediate level of representation that attempts to abstract away syntactic variations in text. As a result, we are able to use only a limited set of rules to extract PPI pairs in the sentences, and additional rules to detect additional passages for PPI pairs. For evaluation, we used the 95 articles that were provided for the BioC annotation task. We retrieved the unique PPIs from the BioGRID database for these articles and show that our system achieves a recall of 83.5%. In order to evaluate the detection of passages with PPIs, we further annotated Abstract and Results sections of 20 documents from the dataset and show that an f-value of 80.5% was obtained. To evaluate the generalizability of the system, we also conducted experiments on AIMed, a well-known PPI corpus. We achieved an f-value of 76.1% for sentence detection and an f-value of 64.7% for unique PPI detection.Database URL: http://proteininformationresource.org/iprolink/corpora. © The Author(s) 2016. Published by Oxford University Press.
Multi-level machine learning prediction of protein–protein interactions in Saccharomyces cerevisiae
Directory of Open Access Journals (Sweden)
Julian Zubek
2015-07-01
Full Text Available Accurate identification of protein–protein interactions (PPI is the key step in understanding proteins’ biological functions, which are typically context-dependent. Many existing PPI predictors rely on aggregated features from protein sequences, however only a few methods exploit local information about specific residue contacts. In this work we present a two-stage machine learning approach for prediction of protein–protein interactions. We start with the carefully filtered data on protein complexes available for Saccharomyces cerevisiae in the Protein Data Bank (PDB database. First, we build linear descriptions of interacting and non-interacting sequence segment pairs based on their inter-residue distances. Secondly, we train machine learning classifiers to predict binary segment interactions for any two short sequence fragments. The final prediction of the protein–protein interaction is done using the 2D matrix representation of all-against-all possible interacting sequence segments of both analysed proteins. The level-I predictor achieves 0.88 AUC for micro-scale, i.e., residue-level prediction. The level-II predictor improves the results further by a more complex learning paradigm. We perform 30-fold macro-scale, i.e., protein-level cross-validation experiment. The level-II predictor using PSIPRED-predicted secondary structure reaches 0.70 precision, 0.68 recall, and 0.70 AUC, whereas other popular methods provide results below 0.6 threshold (recall, precision, AUC. Our results demonstrate that multi-scale sequence features aggregation procedure is able to improve the machine learning results by more than 10% as compared to other sequence representations. Prepared datasets and source code for our experimental pipeline are freely available for download from: http://zubekj.github.io/mlppi/ (open source Python implementation, OS independent.
HCVpro: Hepatitis C virus protein interaction database
Kwofie, Samuel K.
2011-12-01
It is essential to catalog characterized hepatitis C virus (HCV) protein-protein interaction (PPI) data and the associated plethora of vital functional information to augment the search for therapies, vaccines and diagnostic biomarkers. In furtherance of these goals, we have developed the hepatitis C virus protein interaction database (HCVpro) by integrating manually verified hepatitis C virus-virus and virus-human protein interactions curated from literature and databases. HCVpro is a comprehensive and integrated HCV-specific knowledgebase housing consolidated information on PPIs, functional genomics and molecular data obtained from a variety of virus databases (VirHostNet, VirusMint, HCVdb and euHCVdb), and from BIND and other relevant biology repositories. HCVpro is further populated with information on hepatocellular carcinoma (HCC) related genes that are mapped onto their encoded cellular proteins. Incorporated proteins have been mapped onto Gene Ontologies, canonical pathways, Online Mendelian Inheritance in Man (OMIM) and extensively cross-referenced to other essential annotations. The database is enriched with exhaustive reviews on structure and functions of HCV proteins, current state of drug and vaccine development and links to recommended journal articles. Users can query the database using specific protein identifiers (IDs), chromosomal locations of a gene, interaction detection methods, indexed PubMed sources as well as HCVpro, BIND and VirusMint IDs. The use of HCVpro is free and the resource can be accessed via http://apps.sanbi.ac.za/hcvpro/ or http://cbrc.kaust.edu.sa/hcvpro/. © 2011 Elsevier B.V.
Sriwastava, Brijesh Kumar; Basu, Subhadip; Maulik, Ujjwal
2015-10-01
Protein-protein interaction (PPI) site prediction aids to ascertain the interface residues that participate in interaction processes. Fuzzy support vector machine (F-SVM) is proposed as an effective method to solve this problem, and we have shown that the performance of the classical SVM can be enhanced with the help of an interaction-affinity based fuzzy membership function. The performances of both SVM and F-SVM on the PPI databases of the Homo sapiens and E. coli organisms are evaluated and estimated the statistical significance of the developed method over classical SVM and other fuzzy membership-based SVM methods available in the literature. Our membership function uses the residue-level interaction affinity scores for each pair of positive and negative sequence fragments. The average AUC scores in the 10-fold cross-validation experiments are measured as 79.94% and 80.48% for the Homo sapiens and E. coli organisms respectively. On the independent test datasets, AUC scores are obtained as 76.59% and 80.17% respectively for the two organisms. In almost all cases, the developed F-SVM method improves the performances obtained by the corresponding classical SVM and the other classifiers, available in the literature.
AlphaSpace: Fragment-Centric Topographical Mapping To Target Protein–Protein Interaction Interfaces
2016-01-01
Inhibition of protein–protein interactions (PPIs) is emerging as a promising therapeutic strategy despite the difficulty in targeting such interfaces with drug-like small molecules. PPIs generally feature large and flat binding surfaces as compared to typical drug targets. These features pose a challenge for structural characterization of the surface using geometry-based pocket-detection methods. An attractive mapping strategy—that builds on the principles of fragment-based drug discovery (FBDD)—is to detect the fragment-centric modularity at the protein surface and then characterize the large PPI interface as a set of localized, fragment-targetable interaction regions. Here, we introduce AlphaSpace, a computational analysis tool designed for fragment-centric topographical mapping (FCTM) of PPI interfaces. Our approach uses the alpha sphere construct, a geometric feature of a protein’s Voronoi diagram, to map out concave interaction space at the protein surface. We introduce two new features—alpha-atom and alpha-space—and the concept of the alpha-atom/alpha-space pair to rank pockets for fragment-targetability and to facilitate the evaluation of pocket/fragment complementarity. The resulting high-resolution interfacial map of targetable pocket space can be used to guide the rational design and optimization of small molecule or biomimetic PPI inhibitors. PMID:26225450
Directory of Open Access Journals (Sweden)
Peng Liu
2015-01-01
Full Text Available A method for predicting protein-protein interactions based on detected protein complexes is proposed to repair deficient interactions derived from high-throughput biological experiments. Protein complexes are pruned and decomposed into small parts based on the adaptive k-cores method to predict protein-protein interactions associated with the complexes. The proposed method is adaptive to protein complexes with different structure, number, and size of nodes in a protein-protein interaction network. Based on different complex sets detected by various algorithms, we can obtain different prediction sets of protein-protein interactions. The reliability of the predicted interaction sets is proved by using estimations with statistical tests and direct confirmation of the biological data. In comparison with the approaches which predict the interactions based on the cliques, the overlap of the predictions is small. Similarly, the overlaps among the predicted sets of interactions derived from various complex sets are also small. Thus, every predicted set of interactions may complement and improve the quality of the original network data. Meanwhile, the predictions from the proposed method replenish protein-protein interactions associated with protein complexes using only the network topology.
Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction.
Wang, Yijie; Qian, Xiaoning
2017-03-14
Intuitively, proteins in the same protein complexes should highly interact with each other but rarely interact with the other proteins in protein-protein interaction (PPI) networks. Surprisingly, many existing computational algorithms do not directly detect protein complexes based on both of these topological properties. Most of them, depending on mathematical definitions of either "modularity" or "conductance", have their own limitations: Modularity has the inherent resolution problem ignoring small protein complexes; and conductance characterizes the separability of complexes but fails to capture the interaction density within complexes. In this paper, we propose a two-step algorithm FLCD (Finding Low-Conductance sets with Dense interactions) to predict overlapping protein complexes with the desired topological structure, which is densely connected inside and well separated from the rest of the networks. First, FLCD detects well-separated subnetworks based on approximating a potential low-conductance set through a personalized PageRank vector from a protein and then solving a mixed integer programming (MIP) problem to find the minimum-conductance set within the identified low-conductance set. At the second step, the densely connected parts in those subnetworks are discovered as the protein complexes by solving another MIP problem that aims to find the dense subnetwork in the minimum-conductance set. Experiments on four large-scale yeast PPI networks from different public databases demonstrate that the complexes predicted by FLCD have better correspondence with the yeast protein complex gold standards than other three state-of-the-art algorithms (ClusterONE, LinkComm, and SR-MCL). Additionally, results of FLCD show higher biological relevance with respect to Gene Ontology (GO) terms by GO enrichment analysis.
Su, Min-Gang; Weng, Julia Tzu-Ya; Hsu, Justin Bo-Kai; Huang, Kai-Yao; Chi, Yu-Hsiang; Lee, Tzong-Yi
2017-12-21
Protein post-translational modification (PTM) plays an essential role in various cellular processes that modulates the physical and chemical properties, folding, conformation, stability and activity of proteins, thereby modifying the functions of proteins. The improved throughput of mass spectrometry (MS) or MS/MS technology has not only brought about a surge in proteome-scale studies, but also contributed to a fruitful list of identified PTMs. However, with the increase in the number of identified PTMs, perhaps the more crucial question is what kind of biological mechanisms these PTMs are involved in. This is particularly important in light of the fact that most protein-based pharmaceuticals deliver their therapeutic effects through some form of PTM. Yet, our understanding is still limited with respect to the local effects and frequency of PTM sites near pharmaceutical binding sites and the interfaces of protein-protein interaction (PPI). Understanding PTM's function is critical to our ability to manipulate the biological mechanisms of protein. In this study, to understand the regulation of protein functions by PTMs, we mapped 25,835 PTM sites to proteins with available three-dimensional (3D) structural information in the Protein Data Bank (PDB), including 1785 modified PTM sites on the 3D structure. Based on the acquired structural PTM sites, we proposed to use five properties for the structural characterization of PTM substrate sites: the spatial composition of amino acids, residues and side-chain orientations surrounding the PTM substrate sites, as well as the secondary structure, division of acidity and alkaline residues, and solvent-accessible surface area. We further mapped the structural PTM sites to the structures of drug binding and PPI sites, identifying a total of 1917 PTM sites that may affect PPI and 3951 PTM sites associated with drug-target binding. An integrated analytical platform (CruxPTM), with a variety of methods and online molecular docking
Directory of Open Access Journals (Sweden)
Chien-Hung Huang
2015-01-01
Full Text Available Many proteins are known to be associated with cancer diseases. It is quite often that their precise functional role in disease pathogenesis remains unclear. A strategy to gain a better understanding of the function of these proteins is to make use of a combination of different aspects of proteomics data types. In this study, we extended Aragues’s method by employing the protein-protein interaction (PPI data, domain-domain interaction (DDI data, weighted domain frequency score (DFS, and cancer linker degree (CLD data to predict cancer proteins. Performances were benchmarked based on three kinds of experiments as follows: (I using individual algorithm, (II combining algorithms, and (III combining the same classification types of algorithms. When compared with Aragues’s method, our proposed methods, that is, machine learning algorithm and voting with the majority, are significantly superior in all seven performance measures. We demonstrated the accuracy of the proposed method on two independent datasets. The best algorithm can achieve a hit ratio of 89.4% and 72.8% for lung cancer dataset and lung cancer microarray study, respectively. It is anticipated that the current research could help understand disease mechanisms and diagnosis.
Sail, Vibhavari; Rizzo, Alessandro A; Chatterjee, Nimrat; Dash, Radha C; Ozen, Zuleyha; Walker, Graham C; Korzhnev, Dmitry M; Hadden, M Kyle
2017-07-21
Translesion synthesis (TLS) is an important mechanism through which proliferating cells tolerate DNA damage during replication. The mutagenic Rev1/Polζ-dependent branch of TLS helps cancer cells survive first-line genotoxic chemotherapy and introduces mutations that can contribute to the acquired resistance so often observed with standard anticancer regimens. As such, inhibition of Rev1/Polζ-dependent TLS has recently emerged as a strategy to enhance the efficacy of first-line chemotherapy and reduce the acquisition of chemoresistance by decreasing tumor mutation rate. The TLS DNA polymerase Rev1 serves as an integral scaffolding protein that mediates the assembly of the active multiprotein TLS complexes. Protein-protein interactions (PPIs) between the C-terminal domain of Rev1 (Rev1-CT) and the Rev1-interacting region (RIR) of other TLS DNA polymerases play an essential role in regulating TLS activity. To probe whether disrupting the Rev1-CT/RIR PPI is a valid approach for developing a new class of targeted anticancer agents, we designed a fluorescence polarization-based assay that was utilized in a pilot screen for small molecule inhibitors of this PPI. Two small molecule scaffolds that disrupt this interaction were identified, and secondary validation assays confirmed that compound 5 binds to Rev1-CT at the RIR interface. Finally, survival and mutagenesis assays in mouse embryonic fibroblasts and human fibrosarcoma HT1080 cells treated with cisplatin and ultraviolet light indicate that these compounds inhibit mutagenic Rev1/Polζ-dependent TLS in cells, validating the Rev1-CT/RIR PPI for future anticancer drug discovery and identifying the first small molecule inhibitors of TLS that target Rev1-CT.
CHEN, CHEN; SHEN, HONG; ZHANG, LI-GUO; LIU, JIAN; CAO, XIAO-GE; YAO, AN-LIANG; KANG, SHAO-SAN; GAO, WEI-XING; HAN, HUI; CAO, FENG-HONG; LI, ZHI-GUO
2016-01-01
Currently, using human prostate cancer (PCa) tissue samples to conduct proteomics research has generated a large amount of data; however, only a very small amount has been thoroughly investigated. In this study, we manually carried out the mining of the full text of proteomics literature that involved comparisons between PCa and normal or benign tissue and identified 41 differentially expressed proteins verified or reported more than 2 times from different research studies. We regarded these proteins as seed proteins to construct a protein-protein interaction (PPI) network. The extended network included one giant network, which consisted of 1,264 nodes connected via 1,744 edges, and 3 small separate components. The backbone network was then constructed, which was derived from key nodes and the subnetwork consisting of the shortest path between seed proteins. Topological analyses of these networks were conducted to identify proteins essential for the genesis of PCa. Solute carrier family 2 (facilitated glucose transporter), member 4 (SLC2A4) had the highest closeness centrality located in the center of each network, and the highest betweenness centrality and largest degree in the backbone network. Tubulin, beta 2C (TUBB2C) had the largest degree in the giant network and subnetwork. In addition, using module analysis of the whole PPI network, we obtained a densely connected region. Functional annotation indicated that the Ras protein signal transduction biological process, mitogen-activated protein kinase (MAPK), neurotrophin and the gonadotropin-releasing hormone (GnRH) signaling pathway may play an important role in the genesis and development of PCa. Further investigation of the SLC2A4, TUBB2C proteins, and these biological processes and pathways may therefore provide a potential target for the diagnosis and treatment of PCa. PMID:27121963
Tucker, George; Loh, Po-Ru; Berger, Bonnie
2013-10-04
Comprehensive protein-protein interaction (PPI) maps are a powerful resource for uncovering the molecular basis of genetic interactions and providing mechanistic insights. Over the past decade, high-throughput experimental techniques have been developed to generate PPI maps at proteome scale, first using yeast two-hybrid approaches and more recently via affinity purification combined with mass spectrometry (AP-MS). Unfortunately, data from both protocols are prone to both high false positive and false negative rates. To address these issues, many methods have been developed to post-process raw PPI data. However, with few exceptions, these methods only analyze binary experimental data (in which each potential interaction tested is deemed either observed or unobserved), neglecting quantitative information available from AP-MS such as spectral counts. We propose a novel method for incorporating quantitative information from AP-MS data into existing PPI inference methods that analyze binary interaction data. Our approach introduces a probabilistic framework that models the statistical noise inherent in observations of co-purifications. Using a sampling-based approach, we model the uncertainty of interactions with low spectral counts by generating an ensemble of possible alternative experimental outcomes. We then apply the existing method of choice to each alternative outcome and aggregate results over the ensemble. We validate our approach on three recent AP-MS data sets and demonstrate performance comparable to or better than state-of-the-art methods. Additionally, we provide an in-depth discussion comparing the theoretical bases of existing approaches and identify common aspects that may be key to their performance. Our sampling framework extends the existing body of work on PPI analysis using binary interaction data to apply to the richer quantitative data now commonly available through AP-MS assays. This framework is quite general, and many enhancements are likely
Kanwal, Attiya; Fazal, Sahar
2018-01-05
Ankylosing spondylitis, a systemic illness is a foundation of progressing joint swelling that for the most part influences the spine. However, it frequently causes aggravation in different joints far from the spine, and in addition organs, for example, the eyes, heart, lungs, and kidneys. It's an immune system ailment that may be activated by specific sorts of bacterial or viral diseases that initiate an invulnerable reaction that don't close off after the contamination is recuperated. The particular reason for ankylosing spondylitis is obscure, yet hereditary qualities assume a huge part in this condition. The rising apparatuses of network medicine offer a stage to investigate an unpredictable illness at framework level. In this study, we meant to recognize the key proteins and the biological regulator pathways including in AS and further investigating the molecular connectivity between these pathways by the topological examination of the Protein-protein communication (PPI) system. The extended network including of 93 nodes and have 199 interactions respectively scanned from STRING database and some separated small networks. 24 proteins with high BC at the threshold of 0.01 and 55 proteins with large degree at the threshold of 1 have been identified. CD4 with highest BC and Closeness centrality located in the centre of the network. The backbone network derived from high BC proteins presents a clear and visual overview which shows all important regulatory pathways for AS and the crosstalk between them. The finding of this research suggests that AS variation is orchestrated by an integrated PPI network centered on CD4 out of 93 nodes. Ankylosing spondylitis, a systemic disease is an establishment of advancing joint swelling that generally impacts the spine. Be that as it may, it as often as possible causes disturbance in various joints a long way from the spine, and what's more organs. It's a resistant framework affliction that might be actuated by particular sorts
Exploring hierarchical and overlapping modular structure in the yeast protein interaction network
Directory of Open Access Journals (Sweden)
Zhao Yi
2010-12-01
Full Text Available Abstract Background Developing effective strategies to reveal modular structures in protein interaction networks is crucial for better understanding of molecular mechanisms of underlying biological processes. In this paper, we propose a new density-based algorithm (ADHOC for clustering vertices of a protein interaction network using a novel subgraph density measurement. Results By statistically evaluating several independent criteria, we found that ADHOC could significantly improve the outcome as compared with five previously reported density-dependent methods. We further applied ADHOC to investigate the hierarchical and overlapping modular structure in the yeast PPI network. Our method could effectively detect both protein modules and the overlaps between them, and thus greatly promote the precise prediction of protein functions. Moreover, by further assaying the intermodule layer of the yeast PPI network, we classified hubs into two types, module hubs and inter-module hubs. Each type presents distinct characteristics both in network topology and biological functions, which could conduce to the better understanding of relationship between network architecture and biological implications. Conclusions Our proposed algorithm based on the novel subgraph density measurement makes it possible to more precisely detect hierarchical and overlapping modular structures in protein interaction networks. In addition, our method also shows a strong robustness against the noise in network, which is quite critical for analyzing such a high noise network.
Huang, Liang-Chin; Wu, Xiaogang; Chen, Jake Y
2013-01-01
The prediction of adverse drug reactions (ADRs) has become increasingly important, due to the rising concern on serious ADRs that can cause drugs to fail to reach or stay in the market. We proposed a framework for predicting ADR profiles by integrating protein-protein interaction (PPI) networks with drug structures. We compared ADR prediction performances over 18 ADR categories through four feature groups-only drug targets, drug targets with PPI networks, drug structures, and drug targets with PPI networks plus drug structures. The results showed that the integration of PPI networks and drug structures can significantly improve the ADR prediction performance. The median AUC values for the four groups were 0.59, 0.61, 0.65, and 0.70. We used the protein features in the best two models, "Cardiac disorders" (median-AUC: 0.82) and "Psychiatric disorders" (median-AUC: 0.76), to build ADR-specific PPI networks with literature supports. For validation, we examined 30 drugs withdrawn from the U.S. market to see if our approach can predict their ADR profiles and explain why they were withdrawn. Except for three drugs having ADRs in the categories we did not predict, 25 out of 27 withdrawn drugs (92.6%) having severe ADRs were successfully predicted by our approach. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Directory of Open Access Journals (Sweden)
Browne Fiona
2006-12-01
Full Text Available Protein-protein interactions (PPI play a key role in many biological systems. Over the past few years, an explosion in availability of functional biological data obtained from high-throughput technologies to infer PPI has been observed. However, results obtained from such experiments show high rates of false positives and false negatives predictions as well as systematic predictive bias. Recent research has revealed that several machine and statistical learning methods applied to integrate relatively weak, diverse sources of large-scale functional data may provide improved predictive accuracy and coverage of PPI. In this paper we describe the effects of applying different computational, integrative methods to predict PPI in Saccharomyces cerevisiae. We investigated the predictive ability of combining different sets of relatively strong and weak predictive datasets. We analysed several genomic datasets ranging from mRNA co-expression to marginal essentiality. Moreover, we expanded an existing multi-source dataset from S. cerevisiae by constructing a new set of putative interactions extracted from Gene Ontology (GO- driven annotations in the Saccharomyces Genome Database. Different classification techniques: Simple Naive Bayesian (SNB, Multilayer Perceptron (MLP and K-Nearest Neighbors (KNN were evaluated. Relatively simple classification methods (i.e. less computing intensive and mathematically complex, such as SNB, have been proven to be proficient at predicting PPI. SNB produced the “highest” predictive quality obtaining an area under Receiver Operating Characteristic (ROC curve (AUC value of 0.99. The lowest AUC value of 0.90 was obtained by the KNN classifier. This assessment also demonstrates the strong predictive power of GO-driven models, which offered predictive performance above 0.90 using the different machine learning and statistical techniques. As the predictive power of single-source datasets became weaker MLP and SNB performed
Yim, Soorin; Yu, Hasun; Jang, Dongjin; Lee, Doheon
2018-04-11
Signaling pathways can be reconstructed by identifying 'effect types' (i.e. activation/inhibition) of protein-protein interactions (PPIs). Effect types are composed of 'directions' (i.e. upstream/downstream) and 'signs' (i.e. positive/negative), thereby requiring directions as well as signs of PPIs to predict signaling events from PPI networks. Here, we propose a computational method for systemically annotating effect types to PPIs using relations between functional information of proteins. We used regulates, positively regulates, and negatively regulates relations in Gene Ontology (GO) to predict directions and signs of PPIs. These relations indicate both directions and signs between GO terms so that we can project directions and signs between relevant GO terms to PPIs. Independent test results showed that our method is effective for predicting both directions and signs of PPIs. Moreover, our method outperformed a previous GO-based method that did not consider the relations between GO terms. We annotated effect types to human PPIs and validated several highly confident effect types against literature. The annotated human PPIs are available in Additional file 2 to aid signaling pathway reconstruction and network biology research. We annotated effect types to PPIs by using regulates, positively regulates, and negatively regulates relations in GO. We demonstrated that those relations are effective for predicting not only signs, but also directions of PPIs. The usefulness of those relations suggests their potential applications to other types of interactions such as protein-DNA interactions.
Cannistraci, Carlo
2013-06-21
Motivation: Most functions within the cell emerge thanks to protein-protein interactions (PPIs), yet experimental determination of PPIs is both expensive and time-consuming. PPI networks present significant levels of noise and incompleteness. Predicting interactions using only PPI-network topology (topological prediction) is difficult but essential when prior biological knowledge is absent or unreliable.Methods: Network embedding emphasizes the relations between network proteins embedded in a low-dimensional space, in which protein pairs that are closer to each other represent good candidate interactions. To achieve network denoising, which boosts prediction performance, we first applied minimum curvilinear embedding (MCE), and then adopted shortest path (SP) in the reduced space to assign likelihood scores to candidate interactions. Furthermore, we introduce (i) a new valid variation of MCE, named non-centred MCE (ncMCE); (ii) two automatic strategies for selecting the appropriate embedding dimension; and (iii) two new randomized procedures for evaluating predictions.Results: We compared our method against several unsupervised and supervisedly tuned embedding approaches and node neighbourhood techniques. Despite its computational simplicity, ncMCE-SP was the overall leader, outperforming the current methods in topological link prediction.Conclusion: Minimum curvilinearity is a valuable non-linear framework that we successfully applied to the embedding of protein networks for the unsupervised prediction of novel PPIs. The rationale for our approach is that biological and evolutionary information is imprinted in the non-linear patterns hidden behind the protein network topology, and can be exploited for predicting new protein links. The predicted PPIs represent good candidates for testing in high-throughput experiments or for exploitation in systems biology tools such as those used for network-based inference and prediction of disease-related functional modules. The
The vast datasets generated by next generation gene sequencing and expression profiling have transformed biological and translational research. However, technologies to produce large-scale functional genomics datasets, such as high-throughput detection of protein-protein interactions (PPIs), are still in early development. While a number of powerful technologies have been employed to detect PPIs, a singular PPI biosensor platform featured with both high sensitivity and robustness in a mammalian cell environment remains to be established.
Spectral affinity in protein networks.
Voevodski, Konstantin; Teng, Shang-Hua; Xia, Yu
2009-11-29
Protein-protein interaction (PPI) networks enable us to better understand the functional organization of the proteome. We can learn a lot about a particular protein by querying its neighborhood in a PPI network to find proteins with similar function. A spectral approach that considers random walks between nodes of interest is particularly useful in evaluating closeness in PPI networks. Spectral measures of closeness are more robust to noise in the data and are more precise than simpler methods based on edge density and shortest path length. We develop a novel affinity measure for pairs of proteins in PPI networks, which uses personalized PageRank, a random walk based method used in context-sensitive search on the Web. Our measure of closeness, which we call PageRank Affinity, is proportional to the number of times the smaller-degree protein is visited in a random walk that restarts at the larger-degree protein. PageRank considers paths of all lengths in a network, therefore PageRank Affinity is a precise measure that is robust to noise in the data. PageRank Affinity is also provably related to cluster co-membership, making it a meaningful measure. In our experiments on protein networks we find that our measure is better at predicting co-complex membership and finding functionally related proteins than other commonly used measures of closeness. Moreover, our experiments indicate that PageRank Affinity is very resilient to noise in the network. In addition, based on our method we build a tool that quickly finds nodes closest to a queried protein in any protein network, and easily scales to much larger biological networks. We define a meaningful way to assess the closeness of two proteins in a PPI network, and show that our closeness measure is more biologically significant than other commonly used methods. We also develop a tool, accessible at http://xialab.bu.edu/resources/pnns, that allows the user to quickly find nodes closest to a queried vertex in any protein
Isvoran, Adriana; Craciun, Dana; Martiny, Virginie; Sperandio, Olivier; Miteva, Maria A
2013-06-14
Protein-Protein Interactions (PPIs) are key for many cellular processes. The characterization of PPI interfaces and the prediction of putative ligand binding sites and hot spot residues are essential to design efficient small-molecule modulators of PPI. Terphenyl and its derivatives are small organic molecules known to mimic one face of protein-binding alpha-helical peptides. In this work we focus on several PPIs mediated by alpha-helical peptides. We performed computational sequence- and structure-based analyses in order to evaluate several key physicochemical and surface properties of proteins known to interact with alpha-helical peptides and/or terphenyl and its derivatives. Sequence-based analysis revealed low sequence identity between some of the analyzed proteins binding alpha-helical peptides. Structure-based analysis was performed to calculate the volume, the fractal dimension roughness and the hydrophobicity of the binding regions. Besides the overall hydrophobic character of the binding pockets, some specificities were detected. We showed that the hydrophobicity is not uniformly distributed in different alpha-helix binding pockets that can help to identify key hydrophobic hot spots. The presence of hydrophobic cavities at the protein surface with a more complex shape than the entire protein surface seems to be an important property related to the ability of proteins to bind alpha-helical peptides and low molecular weight mimetics. Characterization of similarities and specificities of PPI binding sites can be helpful for further development of small molecules targeting alpha-helix binding proteins.
DEFF Research Database (Denmark)
Brorsson, C.; Hansen, Niclas Tue; Hansen, Kasper Lage
2009-01-01
genes. We have developed a novel method that combines single nucleotide polymorphism (SNP) genotyping data with protein-protein interaction (ppi) networks to identify disease-associated network modules enriched for proteins encoded from the MHC region. Approximately 2500 SNPs located in the 4 Mb MHC......To develop novel methods for identifying new genes that contribute to the risk of developing type 1 diabetes within the Major Histocompatibility Complex (MHC) region on chromosome 6, independently of the known linkage disequilibrium (LD) between human leucocyte antigen (HLA)-DRB1, -DQA1, -DQB1...... region were analysed in 1000 affected offspring trios generated by the Type 1 Diabetes Genetics Consortium (T1DGC). The most associated SNP in each gene was chosen and genes were mapped to ppi networks for identification of interaction partners. The association testing and resulting interacting protein...
Asadzadeh-Aghdaee, Hamid; Mansouri, Vahid; Peyvandi, Ali Asghar; Moztarzadeh, Fathollah; Okhovatian, Farshad; Lahmi, Farhad; Vafaee, Reza; Zali, Mohammad Reza
2016-01-01
Aim: The corresponding proteins are important for network mapping since the interaction analysis can provide a new interpretation about disease underlying mechanisms as the aim of this study. Backgroud: Nonalcoholic steatohepatitis (NASH) is one of the main causes of liver disease in the world. It has been known with many susceptible proteins that play essential role in its pathogenesis. Methods: In this paper, protein-protein interaction (PPI) network analysis of fatty liver disease retrieved from STRING db by the application of Cytoscape Software. ClueGO analyzed the associated pathways for the selected top proteins. Results: INS, PPARA, LEP, SREBF1, and ALB are the introduced biomarker panel for fatty liver disease. Conclusion: It seems that pathways related to insulin have a prominent role in fatty liver disease. Therefore, investigation in this case is required to confirm the possible linkage of introduced panel and involvement of insulin pathway in the disease. PMID:28224024
Specificity of molecular interactions in transient protein-protein interaction interfaces.
Cho, Kyu-il; Lee, KiYoung; Lee, Kwang H; Kim, Dongsup; Lee, Doheon
2006-11-15
In this study, we investigate what types of interactions are specific to their biological function, and what types of interactions are persistent regardless of their functional category in transient protein-protein heterocomplexes. This is the first approach to analyze protein-protein interfaces systematically at the molecular interaction level in the context of protein functions. We perform systematic analysis at the molecular interaction level using classification and feature subset selection technique prevalent in the field of pattern recognition. To represent the physicochemical properties of protein-protein interfaces, we design 18 molecular interaction types using canonical and noncanonical interactions. Then, we construct input vector using the frequency of each interaction type in protein-protein interface. We analyze the 131 interfaces of transient protein-protein heterocomplexes in PDB: 33 protease-inhibitors, 52 antibody-antigens, 46 signaling proteins including 4 cyclin dependent kinase and 26 G-protein. Using kNN classification and feature subset selection technique, we show that there are specific interaction types based on their functional category, and such interaction types are conserved through the common binding mechanism, rather than through the sequence or structure conservation. The extracted interaction types are C(alpha)-- H...O==C interaction, cation...anion interaction, amine...amine interaction, and amine...cation interaction. With these four interaction types, we achieve the classification success rate up to 83.2% with leave-one-out cross-validation at k = 15. Of these four interaction types, C(alpha)--H...O==C shows binding specificity for protease-inhibitor complexes, while cation-anion interaction is predominant in signaling complexes. The amine ... amine and amine...cation interaction give a minor contribution to the classification accuracy. When combined with these two interactions, they increase the accuracy by 3.8%. In the case of
Evolution of protein-protein interactions
Indian Academy of Sciences (India)
Evolution of protein-protein interactions · Our interests in protein-protein interactions · Slide 3 · Slide 4 · Slide 5 · Slide 6 · Slide 7 · Slide 8 · Slide 9 · Slide 10 · Slide 11 · Slide 12 · Slide 13 · Slide 14 · Slide 15 · Slide 16 · Slide 17 · Slide 18 · Slide 19 · Slide 20.
Completing sparse and disconnected protein-protein network by deep learning.
Huang, Lei; Liao, Li; Wu, Cathy H
2018-03-22
Protein-protein interaction (PPI) prediction remains a central task in systems biology to achieve a better and holistic understanding of cellular and intracellular processes. Recently, an increasing number of computational methods have shifted from pair-wise prediction to network level prediction. Many of the existing network level methods predict PPIs under the assumption that the training network should be connected. However, this assumption greatly affects the prediction power and limits the application area because the current golden standard PPI networks are usually very sparse and disconnected. Therefore, how to effectively predict PPIs based on a training network that is sparse and disconnected remains a challenge. In this work, we developed a novel PPI prediction method based on deep learning neural network and regularized Laplacian kernel. We use a neural network with an autoencoder-like architecture to implicitly simulate the evolutionary processes of a PPI network. Neurons of the output layer correspond to proteins and are labeled with values (1 for interaction and 0 for otherwise) from the adjacency matrix of a sparse disconnected training PPI network. Unlike autoencoder, neurons at the input layer are given all zero input, reflecting an assumption of no a priori knowledge about PPIs, and hidden layers of smaller sizes mimic ancient interactome at different times during evolution. After the training step, an evolved PPI network whose rows are outputs of the neural network can be obtained. We then predict PPIs by applying the regularized Laplacian kernel to the transition matrix that is built upon the evolved PPI network. The results from cross-validation experiments show that the PPI prediction accuracies for yeast data and human data measured as AUC are increased by up to 8.4 and 14.9% respectively, as compared to the baseline. Moreover, the evolved PPI network can also help us leverage complementary information from the disconnected training network
Proteins interacting with cloning scars: a source of false positive protein-protein interactions.
Banks, Charles A S; Boanca, Gina; Lee, Zachary T; Florens, Laurence; Washburn, Michael P
2015-02-23
A common approach for exploring the interactome, the network of protein-protein interactions in cells, uses a commercially available ORF library to express affinity tagged bait proteins; these can be expressed in cells and endogenous cellular proteins that copurify with the bait can be identified as putative interacting proteins using mass spectrometry. Control experiments can be used to limit false-positive results, but in many cases, there are still a surprising number of prey proteins that appear to copurify specifically with the bait. Here, we have identified one source of false-positive interactions in such studies. We have found that a combination of: 1) the variable sequence of the C-terminus of the bait with 2) a C-terminal valine "cloning scar" present in a commercially available ORF library, can in some cases create a peptide motif that results in the aberrant co-purification of endogenous cellular proteins. Control experiments may not identify false positives resulting from such artificial motifs, as aberrant binding depends on sequences that vary from one bait to another. It is possible that such cryptic protein binding might occur in other systems using affinity tagged proteins; this study highlights the importance of conducting careful follow-up studies where novel protein-protein interactions are suspected.
Spectral affinity in protein networks
Directory of Open Access Journals (Sweden)
Teng Shang-Hua
2009-11-01
Full Text Available Abstract Background Protein-protein interaction (PPI networks enable us to better understand the functional organization of the proteome. We can learn a lot about a particular protein by querying its neighborhood in a PPI network to find proteins with similar function. A spectral approach that considers random walks between nodes of interest is particularly useful in evaluating closeness in PPI networks. Spectral measures of closeness are more robust to noise in the data and are more precise than simpler methods based on edge density and shortest path length. Results We develop a novel affinity measure for pairs of proteins in PPI networks, which uses personalized PageRank, a random walk based method used in context-sensitive search on the Web. Our measure of closeness, which we call PageRank Affinity, is proportional to the number of times the smaller-degree protein is visited in a random walk that restarts at the larger-degree protein. PageRank considers paths of all lengths in a network, therefore PageRank Affinity is a precise measure that is robust to noise in the data. PageRank Affinity is also provably related to cluster co-membership, making it a meaningful measure. In our experiments on protein networks we find that our measure is better at predicting co-complex membership and finding functionally related proteins than other commonly used measures of closeness. Moreover, our experiments indicate that PageRank Affinity is very resilient to noise in the network. In addition, based on our method we build a tool that quickly finds nodes closest to a queried protein in any protein network, and easily scales to much larger biological networks. Conclusion We define a meaningful way to assess the closeness of two proteins in a PPI network, and show that our closeness measure is more biologically significant than other commonly used methods. We also develop a tool, accessible at http://xialab.bu.edu/resources/pnns, that allows the user to
Visualization of protein interaction networks: problems and solutions
Directory of Open Access Journals (Sweden)
Agapito Giuseppe
2013-01-01
Full Text Available Abstract Background Visualization concerns the representation of data visually and is an important task in scientific research. Protein-protein interactions (PPI are discovered using either wet lab techniques, such mass spectrometry, or in silico predictions tools, resulting in large collections of interactions stored in specialized databases. The set of all interactions of an organism forms a protein-protein interaction network (PIN and is an important tool for studying the behaviour of the cell machinery. Since graphic representation of PINs may highlight important substructures, e.g. protein complexes, visualization is more and more used to study the underlying graph structure of PINs. Although graphs are well known data structures, there are different open problems regarding PINs visualization: the high number of nodes and connections, the heterogeneity of nodes (proteins and edges (interactions, the possibility to annotate proteins and interactions with biological information extracted by ontologies (e.g. Gene Ontology that enriches the PINs with semantic information, but complicates their visualization. Methods In these last years many software tools for the visualization of PINs have been developed. Initially thought for visualization only, some of them have been successively enriched with new functions for PPI data management and PIN analysis. The paper analyzes the main software tools for PINs visualization considering four main criteria: (i technology, i.e. availability/license of the software and supported OS (Operating System platforms; (ii interoperability, i.e. ability to import/export networks in various formats, ability to export data in a graphic format, extensibility of the system, e.g. through plug-ins; (iii visualization, i.e. supported layout and rendering algorithms and availability of parallel implementation; (iv analysis, i.e. availability of network analysis functions, such as clustering or mining of the graph, and the
Li, Min; Li, Wenkai; Wu, Fang-Xiang; Pan, Yi; Wang, Jianxin
2018-06-14
Essential proteins are important participants in various life activities and play a vital role in the survival and reproduction of living organisms. Identification of essential proteins from protein-protein interaction (PPI) networks has great significance to facilitate the study of human complex diseases, the design of drugs and the development of bioinformatics and computational science. Studies have shown that highly connected proteins in a PPI network tend to be essential. A series of computational methods have been proposed to identify essential proteins by analyzing topological structures of PPI networks. However, the high noise in the PPI data can degrade the accuracy of essential protein prediction. Moreover, proteins must be located in the appropriate subcellular localization to perform their functions, and only when the proteins are located in the same subcellular localization, it is possible that they can interact with each other. In this paper, we propose a new network-based essential protein discovery method based on sub-network partition and prioritization by integrating subcellular localization information, named SPP. The proposed method SPP was tested on two different yeast PPI networks obtained from DIP database and BioGRID database. The experimental results show that SPP can effectively reduce the effect of false positives in PPI networks and predict essential proteins more accurately compared with other existing computational methods DC, BC, CC, SC, EC, IC, NC. Copyright © 2018 Elsevier Ltd. All rights reserved.
A comprehensive benchmark of kernel methods to extract protein-protein interactions from literature.
Directory of Open Access Journals (Sweden)
Domonkos Tikk
Full Text Available The most important way of conveying new findings in biomedical research is scientific publication. Extraction of protein-protein interactions (PPIs reported in scientific publications is one of the core topics of text mining in the life sciences. Recently, a new class of such methods has been proposed - convolution kernels that identify PPIs using deep parses of sentences. However, comparing published results of different PPI extraction methods is impossible due to the use of different evaluation corpora, different evaluation metrics, different tuning procedures, etc. In this paper, we study whether the reported performance metrics are robust across different corpora and learning settings and whether the use of deep parsing actually leads to an increase in extraction quality. Our ultimate goal is to identify the one method that performs best in real-life scenarios, where information extraction is performed on unseen text and not on specifically prepared evaluation data. We performed a comprehensive benchmarking of nine different methods for PPI extraction that use convolution kernels on rich linguistic information. Methods were evaluated on five different public corpora using cross-validation, cross-learning, and cross-corpus evaluation. Our study confirms that kernels using dependency trees generally outperform kernels based on syntax trees. However, our study also shows that only the best kernel methods can compete with a simple rule-based approach when the evaluation prevents information leakage between training and test corpora. Our results further reveal that the F-score of many approaches drops significantly if no corpus-specific parameter optimization is applied and that methods reaching a good AUC score often perform much worse in terms of F-score. We conclude that for most kernels no sensible estimation of PPI extraction performance on new text is possible, given the current heterogeneity in evaluation data. Nevertheless, our study
Specificity and affinity quantification of protein-protein interactions.
Yan, Zhiqiang; Guo, Liyong; Hu, Liang; Wang, Jin
2013-05-01
Most biological processes are mediated by the protein-protein interactions. Determination of the protein-protein structures and insight into their interactions are vital to understand the mechanisms of protein functions. Currently, compared with the isolated protein structures, only a small fraction of protein-protein structures are experimentally solved. Therefore, the computational docking methods play an increasing role in predicting the structures and interactions of protein-protein complexes. The scoring function of protein-protein interactions is the key responsible for the accuracy of the computational docking. Previous scoring functions were mostly developed by optimizing the binding affinity which determines the stability of the protein-protein complex, but they are often lack of the consideration of specificity which determines the discrimination of native protein-protein complex against competitive ones. We developed a scoring function (named as SPA-PP, specificity and affinity of the protein-protein interactions) by incorporating both the specificity and affinity into the optimization strategy. The testing results and comparisons with other scoring functions show that SPA-PP performs remarkably on both predictions of binding pose and binding affinity. Thus, SPA-PP is a promising quantification of protein-protein interactions, which can be implemented into the protein docking tools and applied for the predictions of protein-protein structure and affinity. The algorithm is implemented in C language, and the code can be downloaded from http://dl.dropbox.com/u/1865642/Optimization.cpp.
Our interests in protein-protein interactions
Indian Academy of Sciences (India)
protein interactions. Evolution of P-P partnerships. Evolution of P-P structures. Evolutionary dynamics of P-P interactions. Dynamics of P-P interaction network. Host-pathogen interactions. CryoEM mapping of gigantic protein assemblies.
Oligomeric protein structure networks: insights into protein-protein interactions
Directory of Open Access Journals (Sweden)
Brinda KV
2005-12-01
Full Text Available Abstract Background Protein-protein association is essential for a variety of cellular processes and hence a large number of investigations are being carried out to understand the principles of protein-protein interactions. In this study, oligomeric protein structures are viewed from a network perspective to obtain new insights into protein association. Structure graphs of proteins have been constructed from a non-redundant set of protein oligomer crystal structures by considering amino acid residues as nodes and the edges are based on the strength of the non-covalent interactions between the residues. The analysis of such networks has been carried out in terms of amino acid clusters and hubs (highly connected residues with special emphasis to protein interfaces. Results A variety of interactions such as hydrogen bond, salt bridges, aromatic and hydrophobic interactions, which occur at the interfaces are identified in a consolidated manner as amino acid clusters at the interface, from this study. Moreover, the characterization of the highly connected hub-forming residues at the interfaces and their comparison with the hubs from the non-interface regions and the non-hubs in the interface regions show that there is a predominance of charged interactions at the interfaces. Further, strong and weak interfaces are identified on the basis of the interaction strength between amino acid residues and the sizes of the interface clusters, which also show that many protein interfaces are stronger than their monomeric protein cores. The interface strengths evaluated based on the interface clusters and hubs also correlate well with experimentally determined dissociation constants for known complexes. Finally, the interface hubs identified using the present method correlate very well with experimentally determined hotspots in the interfaces of protein complexes obtained from the Alanine Scanning Energetics database (ASEdb. A few predictions of interface hot
Senachak, Jittisak; Cheevadhanarak, Supapon; Hongsthong, Apiradee
2015-07-29
Spirulina (Arthrospira) platensis is the only cyanobacterium that in addition to being studied at the molecular level and subjected to gene manipulation, can also be mass cultivated in outdoor ponds for commercial use as a food supplement. Thus, encountering environmental changes, including temperature stresses, is common during the mass production of Spirulina. The use of cyanobacteria as an experimental platform, especially for photosynthetic gene manipulation in plants and bacteria, is becoming increasingly important. Understanding the mechanisms and protein-protein interaction networks that underlie low- and high-temperature responses is relevant to Spirulina mass production. To accomplish this goal, high-throughput techniques such as OMICs analyses are used. Thus, large datasets must be collected, managed and subjected to information extraction. Therefore, databases including (i) proteomic analysis and protein-protein interaction (PPI) data and (ii) domain/motif visualization tools are required for potential use in temperature response models for plant chloroplasts and photosynthetic bacteria. A web-based repository was developed including an embedded database, SpirPro, and tools for network visualization. Proteome data were analyzed integrated with protein-protein interactions and/or metabolic pathways from KEGG. The repository provides various information, ranging from raw data (2D-gel images) to associated results, such as data from interaction and/or pathway analyses. This integration allows in silico analyses of protein-protein interactions affected at the metabolic level and, particularly, analyses of interactions between and within the affected metabolic pathways under temperature stresses for comparative proteomic analysis. The developed tool, which is coded in HTML with CSS/JavaScript and depicted in Scalable Vector Graphics (SVG), is designed for interactive analysis and exploration of the constructed network. SpirPro is publicly available on the web
Vella, Danila; Zoppis, Italo; Mauri, Giancarlo; Mauri, Pierluigi; Di Silvestre, Dario
2017-12-01
The reductionist approach of dissecting biological systems into their constituents has been successful in the first stage of the molecular biology to elucidate the chemical basis of several biological processes. This knowledge helped biologists to understand the complexity of the biological systems evidencing that most biological functions do not arise from individual molecules; thus, realizing that the emergent properties of the biological systems cannot be explained or be predicted by investigating individual molecules without taking into consideration their relations. Thanks to the improvement of the current -omics technologies and the increasing understanding of the molecular relationships, even more studies are evaluating the biological systems through approaches based on graph theory. Genomic and proteomic data are often combined with protein-protein interaction (PPI) networks whose structure is routinely analyzed by algorithms and tools to characterize hubs/bottlenecks and topological, functional, and disease modules. On the other hand, co-expression networks represent a complementary procedure that give the opportunity to evaluate at system level including organisms that lack information on PPIs. Based on these premises, we introduce the reader to the PPI and to the co-expression networks, including aspects of reconstruction and analysis. In particular, the new idea to evaluate large-scale proteomic data by means of co-expression networks will be discussed presenting some examples of application. Their use to infer biological knowledge will be shown, and a special attention will be devoted to the topological and module analysis.
Directory of Open Access Journals (Sweden)
Matthew D Dyer
2010-08-01
Full Text Available Bacillus anthracis, Francisella tularensis, and Yersinia pestis are bacterial pathogens that can cause anthrax, lethal acute pneumonic disease, and bubonic plague, respectively, and are listed as NIAID Category A priority pathogens for possible use as biological weapons. However, the interactions between human proteins and proteins in these bacteria remain poorly characterized leading to an incomplete understanding of their pathogenesis and mechanisms of immune evasion.In this study, we used a high-throughput yeast two-hybrid assay to identify physical interactions between human proteins and proteins from each of these three pathogens. From more than 250,000 screens performed, we identified 3,073 human-B. anthracis, 1,383 human-F. tularensis, and 4,059 human-Y. pestis protein-protein interactions including interactions involving 304 B. anthracis, 52 F. tularensis, and 330 Y. pestis proteins that are uncharacterized. Computational analysis revealed that pathogen proteins preferentially interact with human proteins that are hubs and bottlenecks in the human PPI network. In addition, we computed modules of human-pathogen PPIs that are conserved amongst the three networks. Functionally, such conserved modules reveal commonalities between how the different pathogens interact with crucial host pathways involved in inflammation and immunity.These data constitute the first extensive protein interaction networks constructed for bacterial pathogens and their human hosts. This study provides novel insights into host-pathogen interactions.
Protein- protein interaction detection system using fluorescent protein microdomains
Waldo, Geoffrey S.; Cabantous, Stephanie
2010-02-23
The invention provides a protein labeling and interaction detection system based on engineered fragments of fluorescent and chromophoric proteins that require fused interacting polypeptides to drive the association of the fragments, and further are soluble and stable, and do not change the solubility of polypeptides to which they are fused. In one embodiment, a test protein X is fused to a sixteen amino acid fragment of GFP (.beta.-strand 10, amino acids 198-214), engineered to not perturb fusion protein solubility. A second test protein Y is fused to a sixteen amino acid fragment of GFP (.beta.-strand 11, amino acids 215-230), engineered to not perturb fusion protein solubility. When X and Y interact, they bring the GFP strands into proximity, and are detected by complementation with a third GFP fragment consisting of GFP amino acids 1-198 (strands 1-9). When GFP strands 10 and 11 are held together by interaction of protein X and Y, they spontaneous association with GFP strands 1-9, resulting in structural complementation, folding, and concomitant GFP fluorescence.
Protein-protein interactions: an application of Tus-Ter mediated protein microarray system.
Sitaraman, Kalavathy; Chatterjee, Deb K
2011-01-01
In this chapter, we present a novel, cost-effective microarray strategy that utilizes expression-ready plasmid DNAs to generate protein arrays on-demand and its use to validate protein-protein interactions. These expression plasmids were constructed in such a way so as to serve a dual purpose of synthesizing the protein of interest as well as capturing the synthesized protein. The microarray system is based on the high affinity binding of Escherichia coli "Tus" protein to "Ter," a 20 bp DNA sequence involved in the regulation of DNA replication. The protein expression is carried out in a cell-free protein synthesis system, with rabbit reticulocyte lysates, and the target proteins are detected either by labeled incorporated tag specific or by gene-specific antibodies. This microarray system has been successfully used for the detection of protein-protein interaction because both the target protein and the query protein can be transcribed and translated simultaneously in the microarray slides. The utility of this system for detecting protein-protein interaction is demonstrated by a few well-known examples: Jun/Fos, FRB/FKBP12, p53/MDM2, and CDK4/p16. In all these cases, the presence of protein complexes resulted in the localization of fluorophores at the specific sites of the immobilized target plasmids. Interestingly, during our interactions studies we also detected a previously unknown interaction between CDK2 and p16. Thus, this Tus-Ter based system of protein microarray can be used for the validation of known protein interactions as well as for identifying new protein-protein interactions. In addition, it can be used to examine and identify targets of nucleic acid-protein, ligand-receptor, enzyme-substrate, and drug-protein interactions.
Aquaporin Protein-Protein Interactions
Directory of Open Access Journals (Sweden)
Jennifer Virginia Roche
2017-10-01
Full Text Available Aquaporins are tetrameric membrane-bound channels that facilitate transport of water and other small solutes across cell membranes. In eukaryotes, they are frequently regulated by gating or trafficking, allowing for the cell to control membrane permeability in a specific manner. Protein–protein interactions play crucial roles in both regulatory processes and also mediate alternative functions such as cell adhesion. In this review, we summarize recent knowledge about aquaporin protein–protein interactions; dividing the interactions into three types: (1 interactions between aquaporin tetramers; (2 interactions between aquaporin monomers within a tetramer (hetero-tetramerization; and (3 transient interactions with regulatory proteins. We particularly focus on the structural aspects of the interactions, discussing the small differences within a conserved overall fold that allow for aquaporins to be differentially regulated in an organism-, tissue- and trigger-specific manner. A deep knowledge about these differences is needed to fully understand aquaporin function and regulation in many physiological processes, and may enable design of compounds targeting specific aquaporins for treatment of human disease.
Integration of multiple biological features yields high confidence human protein interactome.
Karagoz, Kubra; Sevimoglu, Tuba; Arga, Kazim Yalcin
2016-08-21
The biological function of a protein is usually determined by its physical interaction with other proteins. Protein-protein interactions (PPIs) are identified through various experimental methods and are stored in curated databases. The noisiness of the existing PPI data is evident, and it is essential that a more reliable data is generated. Furthermore, the selection of a set of PPIs at different confidence levels might be necessary for many studies. Although different methodologies were introduced to evaluate the confidence scores for binary interactions, a highly reliable, almost complete PPI network of Homo sapiens is not proposed yet. The quality and coverage of human protein interactome need to be improved to be used in various disciplines, especially in biomedicine. In the present work, we propose an unsupervised statistical approach to assign confidence scores to PPIs of H. sapiens. To achieve this goal PPI data from six different databases were collected and a total of 295,288 non-redundant interactions between 15,950 proteins were acquired. The present scoring system included the context information that was assigned to PPIs derived from eight biological attributes. A high confidence network, which included 147,923 binary interactions between 13,213 proteins, had scores greater than the cutoff value of 0.80, for which sensitivity, specificity, and coverage were 94.5%, 80.9%, and 82.8%, respectively. We compared the present scoring method with others for evaluation. Reducing the noise inherent in experimental PPIs via our scoring scheme increased the accuracy significantly. As it was demonstrated through the assessment of process and cancer subnetworks, this study allows researchers to construct and analyze context-specific networks via valid PPI sets and one can easily achieve subnetworks around proteins of interest at a specified confidence level. Copyright © 2016 Elsevier Ltd. All rights reserved.
Bioinformatic Prediction of WSSV-Host Protein-Protein Interaction
Directory of Open Access Journals (Sweden)
Zheng Sun
2014-01-01
Full Text Available WSSV is one of the most dangerous pathogens in shrimp aquaculture. However, the molecular mechanism of how WSSV interacts with shrimp is still not very clear. In the present study, bioinformatic approaches were used to predict interactions between proteins from WSSV and shrimp. The genome data of WSSV (NC_003225.1 and the constructed transcriptome data of F. chinensis were used to screen potentially interacting proteins by searching in protein interaction databases, including STRING, Reactome, and DIP. Forty-four pairs of proteins were suggested to have interactions between WSSV and the shrimp. Gene ontology analysis revealed that 6 pairs of these interacting proteins were classified into “extracellular region” or “receptor complex” GO-terms. KEGG pathway analysis showed that they were involved in the “ECM-receptor interaction pathway.” In the 6 pairs of interacting proteins, an envelope protein called “collagen-like protein” (WSSV-CLP encoded by an early virus gene “wsv001” in WSSV interacted with 6 deduced proteins from the shrimp, including three integrin alpha (ITGA, two integrin beta (ITGB, and one syndecan (SDC. Sequence analysis on WSSV-CLP, ITGA, ITGB, and SDC revealed that they possessed the sequence features for protein-protein interactions. This study might provide new insights into the interaction mechanisms between WSSV and shrimp.
Information assessment on predicting protein-protein interactions
Directory of Open Access Journals (Sweden)
Gerstein Mark
2004-10-01
Full Text Available Abstract Background Identifying protein-protein interactions is fundamental for understanding the molecular machinery of the cell. Proteome-wide studies of protein-protein interactions are of significant value, but the high-throughput experimental technologies suffer from high rates of both false positive and false negative predictions. In addition to high-throughput experimental data, many diverse types of genomic data can help predict protein-protein interactions, such as mRNA expression, localization, essentiality, and functional annotation. Evaluations of the information contributions from different evidences help to establish more parsimonious models with comparable or better prediction accuracy, and to obtain biological insights of the relationships between protein-protein interactions and other genomic information. Results Our assessment is based on the genomic features used in a Bayesian network approach to predict protein-protein interactions genome-wide in yeast. In the special case, when one does not have any missing information about any of the features, our analysis shows that there is a larger information contribution from the functional-classification than from expression correlations or essentiality. We also show that in this case alternative models, such as logistic regression and random forest, may be more effective than Bayesian networks for predicting interactions. Conclusions In the restricted problem posed by the complete-information subset, we identified that the MIPS and Gene Ontology (GO functional similarity datasets as the dominating information contributors for predicting the protein-protein interactions under the framework proposed by Jansen et al. Random forests based on the MIPS and GO information alone can give highly accurate classifications. In this particular subset of complete information, adding other genomic data does little for improving predictions. We also found that the data discretizations used in the
Huang, Sing-Han; Lo, Yu-Shu; Luo, Yong-Chun; Tseng, Yu-Yao; Yang, Jinn-Moon
2018-03-19
One of the crucial steps toward understanding the associations among molecular interactions, pathways, and diseases in a cell is to investigate detailed atomic protein-protein interactions (PPIs) in the structural interactome. Despite the availability of large-scale methods for analyzing PPI networks, these methods often focused on PPI networks using genome-scale data and/or known experimental PPIs. However, these methods are unable to provide structurally resolved interaction residues and their conservations in PPI networks. Here, we reconstructed a human three-dimensional (3D) structural PPI network (hDiSNet) with the detailed atomic binding models and disease-associated mutations by enhancing our PPI families and 3D-domain interologs from 60,618 structural complexes and complete genome database with 6,352,363 protein sequences across 2274 species. hDiSNet is a scale-free network (γ = 2.05), which consists of 5177 proteins and 19,239 PPIs with 5843 mutations. These 19,239 structurally resolved PPIs not only expanded the number of PPIs compared to present structural PPI network, but also achieved higher agreement with gene ontology similarities and higher co-expression correlation than the ones of 181,868 experimental PPIs recorded in public databases. Among 5843 mutations, 1653 and 790 mutations involved in interacting domains and contacting residues, respectively, are highly related to diseases. Our hDiSNet can provide detailed atomic interactions of human disease and their associated proteins with mutations. Our results show that the disease-related mutations are often located at the contacting residues forming the hydrogen bonds or conserved in the PPI family. In addition, hDiSNet provides the insights of the FGFR (EGFR)-MAPK pathway for interpreting the mechanisms of breast cancer and ErbB signaling pathway in brain cancer. Our results demonstrate that hDiSNet can explore structural-based interactions insights for understanding the mechanisms of disease
Detecting mutually exclusive interactions in protein-protein interaction maps.
Sánchez Claros, Carmen
2012-06-08
Comprehensive protein interaction maps can complement genetic and biochemical experiments and allow the formulation of new hypotheses to be tested in the system of interest. The computational analysis of the maps may help to focus on interesting cases and thereby to appropriately prioritize the validation experiments. We show here that, by automatically comparing and analyzing structurally similar regions of proteins of known structure interacting with a common partner, it is possible to identify mutually exclusive interactions present in the maps with a sensitivity of 70% and a specificity higher than 85% and that, in about three fourth of the correctly identified complexes, we also correctly recognize at least one residue (five on average) belonging to the interaction interface. Given the present and continuously increasing number of proteins of known structure, the requirement of the knowledge of the structure of the interacting proteins does not substantially impact on the coverage of our strategy that can be estimated to be around 25%. We also introduce here the Estrella server that embodies this strategy, is designed for users interested in validating specific hypotheses about the functional role of a protein-protein interaction and it also allows access to pre-computed data for seven organisms.
Detecting mutually exclusive interactions in protein-protein interaction maps.
Sá nchez Claros, Carmen; Tramontano, Anna
2012-01-01
Comprehensive protein interaction maps can complement genetic and biochemical experiments and allow the formulation of new hypotheses to be tested in the system of interest. The computational analysis of the maps may help to focus on interesting cases and thereby to appropriately prioritize the validation experiments. We show here that, by automatically comparing and analyzing structurally similar regions of proteins of known structure interacting with a common partner, it is possible to identify mutually exclusive interactions present in the maps with a sensitivity of 70% and a specificity higher than 85% and that, in about three fourth of the correctly identified complexes, we also correctly recognize at least one residue (five on average) belonging to the interaction interface. Given the present and continuously increasing number of proteins of known structure, the requirement of the knowledge of the structure of the interacting proteins does not substantially impact on the coverage of our strategy that can be estimated to be around 25%. We also introduce here the Estrella server that embodies this strategy, is designed for users interested in validating specific hypotheses about the functional role of a protein-protein interaction and it also allows access to pre-computed data for seven organisms.
Non-interacting surface solvation and dynamics in protein-protein interactions
Visscher, Koen M.; Kastritis, Panagiotis L.|info:eu-repo/dai/nl/315886668; Bonvin, Alexandre M J J|info:eu-repo/dai/nl/113691238
2015-01-01
Protein-protein interactions control a plethora of cellular processes, including cell proliferation, differentiation, apoptosis, and signal transduction. Understanding how and why proteins interact will inevitably lead to novel structure-based drug design methods, as well as design of de novo
Mapping Protein-Protein Interactions by Quantitative Proteomics
DEFF Research Database (Denmark)
Dengjel, Joern; Kratchmarova, Irina; Blagoev, Blagoy
2010-01-01
spectrometry (MS)-based proteomics in combination with affinity purification protocols has become the method of choice to map and track the dynamic changes in protein-protein interactions, including the ones occurring during cellular signaling events. Different quantitative MS strategies have been used...... to characterize protein interaction networks. In this chapter we describe in detail the use of stable isotope labeling by amino acids in cell culture (SILAC) for the quantitative analysis of stimulus-dependent dynamic protein interactions.......Proteins exert their function inside a cell generally in multiprotein complexes. These complexes are highly dynamic structures changing their composition over time and cell state. The same protein may thereby fulfill different functions depending on its binding partners. Quantitative mass...
Directory of Open Access Journals (Sweden)
Shaoying Lee
2011-10-01
Full Text Available Genome-wide yeast two-hybrid (Y2H screens were conducted to elucidate the molecular functions of open reading frames (ORFs encoded by murine γ-herpesvirus 68 (MHV-68. A library of 84 MHV-68 genes and gene fragments was generated in a Gateway entry plasmid and transferred to Y2H vectors. All possible pair-wise interactions between viral proteins were tested in the Y2H assay, resulting in the identification of 23 intra-viral protein-protein interactions (PPIs. Seventy percent of the interactions between viral proteins were confirmed by co-immunoprecipitation experiments. To systematically investigate virus-cellular protein interactions, the MHV-68 Y2H constructs were screened against a cellular cDNA library, yielding 243 viral-cellular PPIs involving 197 distinct cellar proteins. Network analyses indicated that cellular proteins targeted by MHV-68 had more partners in the cellular PPI network and were located closer to each other than expected by chance. Taking advantage of this observation, we scored the cellular proteins based on their network distances from other MHV-68-interacting proteins and segregated them into high (Y2H-HP and low priority/not-scored (Y2H-LP/NS groups. Significantly more genes from Y2H-HP altered MHV-68 replication when their expression was inhibited with siRNAs (53% of genes from Y2H-HP, 21% of genes from Y2H-LP/NS, and 16% of genes randomly chosen from the human PPI network; p<0.05. Enriched Gene Ontology (GO terms in the Y2H-HP group included regulation of apoptosis, protein kinase cascade, post-translational protein modification, transcription from RNA polymerase II promoter, and IκB kinase/NFκB cascade. Functional validation assays indicated that PCBP1, which interacted with MHV-68 ORF34, may be involved in regulating late virus gene expression in a manner consistent with the effects of its viral interacting partner. Our study integrated Y2H screening with multiple functional validation approaches to create
Inferring domain-domain interactions from protein-protein interactions with formal concept analysis.
Directory of Open Access Journals (Sweden)
Susan Khor
Full Text Available Identifying reliable domain-domain interactions will increase our ability to predict novel protein-protein interactions, to unravel interactions in protein complexes, and thus gain more information about the function and behavior of genes. One of the challenges of identifying reliable domain-domain interactions is domain promiscuity. Promiscuous domains are domains that can occur in many domain architectures and are therefore found in many proteins. This becomes a problem for a method where the score of a domain-pair is the ratio between observed and expected frequencies because the protein-protein interaction network is sparse. As such, many protein-pairs will be non-interacting and domain-pairs with promiscuous domains will be penalized. This domain promiscuity challenge to the problem of inferring reliable domain-domain interactions from protein-protein interactions has been recognized, and a number of work-arounds have been proposed. This paper reports on an application of Formal Concept Analysis to this problem. It is found that the relationship between formal concepts provides a natural way for rare domains to elevate the rank of promiscuous domain-pairs and enrich highly ranked domain-pairs with reliable domain-domain interactions. This piggybacking of promiscuous domain-pairs onto less promiscuous domain-pairs is possible only with concept lattices whose attribute-labels are not reduced and is enhanced by the presence of proteins that comprise both promiscuous and rare domains.
Inferring Domain-Domain Interactions from Protein-Protein Interactions with Formal Concept Analysis
Khor, Susan
2014-01-01
Identifying reliable domain-domain interactions will increase our ability to predict novel protein-protein interactions, to unravel interactions in protein complexes, and thus gain more information about the function and behavior of genes. One of the challenges of identifying reliable domain-domain interactions is domain promiscuity. Promiscuous domains are domains that can occur in many domain architectures and are therefore found in many proteins. This becomes a problem for a method where the score of a domain-pair is the ratio between observed and expected frequencies because the protein-protein interaction network is sparse. As such, many protein-pairs will be non-interacting and domain-pairs with promiscuous domains will be penalized. This domain promiscuity challenge to the problem of inferring reliable domain-domain interactions from protein-protein interactions has been recognized, and a number of work-arounds have been proposed. This paper reports on an application of Formal Concept Analysis to this problem. It is found that the relationship between formal concepts provides a natural way for rare domains to elevate the rank of promiscuous domain-pairs and enrich highly ranked domain-pairs with reliable domain-domain interactions. This piggybacking of promiscuous domain-pairs onto less promiscuous domain-pairs is possible only with concept lattices whose attribute-labels are not reduced and is enhanced by the presence of proteins that comprise both promiscuous and rare domains. PMID:24586450
Yang, Ming; Ge, Yan; Wu, Jiayan; Xiao, Jingfa; Yu, Jun
2011-05-20
Coevolution can be seen as the interdependency between evolutionary histories. In the context of protein evolution, functional correlation proteins are ever-present coordinated evolutionary characters without disruption of organismal integrity. As to complex system, there are two forms of protein--protein interactions in vivo, which refer to inter-complex interaction and intra-complex interaction. In this paper, we studied the difference of coevolution characters between inter-complex interaction and intra-complex interaction using "Mirror tree" method on the respiratory chain (RC) proteins. We divided the correlation coefficients of every pairwise RC proteins into two groups corresponding to the binary protein--protein interaction in intra-complex and the binary protein--protein interaction in inter-complex, respectively. A dramatical discrepancy is detected between the coevolution characters of the two sets of protein interactions (Wilcoxon test, p-value = 4.4 × 10(-6)). Our finding reveals some critical information on coevolutionary study and assists the mechanical investigation of protein--protein interaction. Furthermore, the results also provide some unique clue for supramolecular organization of protein complexes in the mitochondrial inner membrane. More detailed binding sites map and genome information of nuclear encoded RC proteins will be extraordinary valuable for the further mitochondria dynamics study. Copyright © 2011. Published by Elsevier Ltd.
Evolutionary reprograming of protein-protein interaction specificity.
Akiva, Eyal; Babbitt, Patricia C
2015-10-22
Using mutation libraries and deep sequencing, Aakre et al. study the evolution of protein-protein interactions using a toxin-antitoxin model. The results indicate probable trajectories via "intermediate" proteins that are promiscuous, thus avoiding transitions via non-interactions. These results extend observations about other biological interactions and enzyme evolution, suggesting broadly general principles. Copyright © 2015 Elsevier Inc. All rights reserved.
Can infrared spectroscopy provide information on protein-protein interactions?
Haris, Parvez I
2010-08-01
For most biophysical techniques, characterization of protein-protein interactions is challenging; this is especially true with methods that rely on a physical phenomenon that is common to both of the interacting proteins. Thus, for example, in IR spectroscopy, the carbonyl vibration (1600-1700 cm(-1)) associated with the amide bonds from both of the interacting proteins will overlap extensively, making the interpretation of spectral changes very complicated. Isotope-edited infrared spectroscopy, where one of the interacting proteins is uniformly labelled with (13)C or (13)C,(15)N has been introduced as a solution to this problem, enabling the study of protein-protein interactions using IR spectroscopy. The large shift of the amide I band (approx. 45 cm(-1) towards lower frequency) upon (13)C labelling of one of the proteins reveals the amide I band of the unlabelled protein, enabling it to be used as a probe for monitoring conformational changes. With site-specific isotopic labelling, structural resolution at the level of individual amino acid residues can be achieved. Furthermore, the ability to record IR spectra of proteins in diverse environments means that isotope-edited IR spectroscopy can be used to structurally characterize difficult systems such as protein-protein complexes bound to membranes or large insoluble peptide/protein aggregates. In the present article, examples of application of isotope-edited IR spectroscopy for studying protein-protein interactions are provided.
An ontology-based search engine for protein-protein interactions.
Park, Byungkyu; Han, Kyungsook
2010-01-18
Keyword matching or ID matching is the most common searching method in a large database of protein-protein interactions. They are purely syntactic methods, and retrieve the records in the database that contain a keyword or ID specified in a query. Such syntactic search methods often retrieve too few search results or no results despite many potential matches present in the database. We have developed a new method for representing protein-protein interactions and the Gene Ontology (GO) using modified Gödel numbers. This representation is hidden from users but enables a search engine using the representation to efficiently search protein-protein interactions in a biologically meaningful way. Given a query protein with optional search conditions expressed in one or more GO terms, the search engine finds all the interaction partners of the query protein by unique prime factorization of the modified Gödel numbers representing the query protein and the search conditions. Representing the biological relations of proteins and their GO annotations by modified Gödel numbers makes a search engine efficiently find all protein-protein interactions by prime factorization of the numbers. Keyword matching or ID matching search methods often miss the interactions involving a protein that has no explicit annotations matching the search condition, but our search engine retrieves such interactions as well if they satisfy the search condition with a more specific term in the ontology.
Human cancer protein-protein interaction network: a structural perspective.
Directory of Open Access Journals (Sweden)
Gozde Kar
2009-12-01
Full Text Available Protein-protein interaction networks provide a global picture of cellular function and biological processes. Some proteins act as hub proteins, highly connected to others, whereas some others have few interactions. The dysfunction of some interactions causes many diseases, including cancer. Proteins interact through their interfaces. Therefore, studying the interface properties of cancer-related proteins will help explain their role in the interaction networks. Similar or overlapping binding sites should be used repeatedly in single interface hub proteins, making them promiscuous. Alternatively, multi-interface hub proteins make use of several distinct binding sites to bind to different partners. We propose a methodology to integrate protein interfaces into cancer interaction networks (ciSPIN, cancer structural protein interface network. The interactions in the human protein interaction network are replaced by interfaces, coming from either known or predicted complexes. We provide a detailed analysis of cancer related human protein-protein interfaces and the topological properties of the cancer network. The results reveal that cancer-related proteins have smaller, more planar, more charged and less hydrophobic binding sites than non-cancer proteins, which may indicate low affinity and high specificity of the cancer-related interactions. We also classified the genes in ciSPIN according to phenotypes. Within phenotypes, for breast cancer, colorectal cancer and leukemia, interface properties were found to be discriminating from non-cancer interfaces with an accuracy of 71%, 67%, 61%, respectively. In addition, cancer-related proteins tend to interact with their partners through distinct interfaces, corresponding mostly to multi-interface hubs, which comprise 56% of cancer-related proteins, and constituting the nodes with higher essentiality in the network (76%. We illustrate the interface related affinity properties of two cancer-related hub
Protein function prediction using neighbor relativity in protein-protein interaction network.
Moosavi, Sobhan; Rahgozar, Masoud; Rahimi, Amir
2013-04-01
There is a large gap between the number of discovered proteins and the number of functionally annotated ones. Due to the high cost of determining protein function by wet-lab research, function prediction has become a major task for computational biology and bioinformatics. Some researches utilize the proteins interaction information to predict function for un-annotated proteins. In this paper, we propose a novel approach called "Neighbor Relativity Coefficient" (NRC) based on interaction network topology which estimates the functional similarity between two proteins. NRC is calculated for each pair of proteins based on their graph-based features including distance, common neighbors and the number of paths between them. In order to ascribe function to an un-annotated protein, NRC estimates a weight for each neighbor to transfer its annotation to the unknown protein. Finally, the unknown protein will be annotated by the top score transferred functions. We also investigate the effect of using different coefficients for various types of functions. The proposed method has been evaluated on Saccharomyces cerevisiae and Homo sapiens interaction networks. The performance analysis demonstrates that NRC yields better results in comparison with previous protein function prediction approaches that utilize interaction network. Copyright © 2012 Elsevier Ltd. All rights reserved.
Alignment of non-covalent interactions at protein-protein interfaces.
Directory of Open Access Journals (Sweden)
Hongbo Zhu
Full Text Available BACKGROUND: The study and comparison of protein-protein interfaces is essential for the understanding of the mechanisms of interaction between proteins. While there are many methods for comparing protein structures and protein binding sites, so far no methods have been reported for comparing the geometry of non-covalent interactions occurring at protein-protein interfaces. METHODOLOGY/PRINCIPAL FINDINGS: Here we present a method for aligning non-covalent interactions between different protein-protein interfaces. The method aligns the vector representations of van der Waals interactions and hydrogen bonds based on their geometry. The method has been applied to a dataset which comprises a variety of protein-protein interfaces. The alignments are consistent to a large extent with the results obtained using two other complementary approaches. In addition, we apply the method to three examples of protein mimicry. The method successfully aligns respective interfaces and allows for recognizing conserved interface regions. CONCLUSIONS/SIGNIFICANCE: The Galinter method has been validated in the comparison of interfaces in which homologous subunits are involved, including cases of mimicry. The method is also applicable to comparing interfaces involving non-peptidic compounds. Galinter assists users in identifying local interface regions with similar patterns of non-covalent interactions. This is particularly relevant to the investigation of the molecular basis of interaction mimicry.
Which clustering algorithm is better for predicting protein complexes?
Directory of Open Access Journals (Sweden)
Moschopoulos Charalampos N
2011-12-01
Full Text Available Abstract Background Protein-Protein interactions (PPI play a key role in determining the outcome of most cellular processes. The correct identification and characterization of protein interactions and the networks, which they comprise, is critical for understanding the molecular mechanisms within the cell. Large-scale techniques such as pull down assays and tandem affinity purification are used in order to detect protein interactions in an organism. Today, relatively new high-throughput methods like yeast two hybrid, mass spectrometry, microarrays, and phage display are also used to reveal protein interaction networks. Results In this paper we evaluated four different clustering algorithms using six different interaction datasets. We parameterized the MCL, Spectral, RNSC and Affinity Propagation algorithms and applied them to six PPI datasets produced experimentally by Yeast 2 Hybrid (Y2H and Tandem Affinity Purification (TAP methods. The predicted clusters, so called protein complexes, were then compared and benchmarked with already known complexes stored in published databases. Conclusions While results may differ upon parameterization, the MCL and RNSC algorithms seem to be more promising and more accurate at predicting PPI complexes. Moreover, they predict more complexes than other reviewed algorithms in absolute numbers. On the other hand the spectral clustering algorithm achieves the highest valid prediction rate in our experiments. However, it is nearly always outperformed by both RNSC and MCL in terms of the geometrical accuracy while it generates the fewest valid clusters than any other reviewed algorithm. This article demonstrates various metrics to evaluate the accuracy of such predictions as they are presented in the text below. Supplementary material can be found at: http://www.bioacademy.gr/bioinformatics/projects/ppireview.htm
Mei, Suyu; Zhu, Hao
2015-01-26
Protein-protein interaction (PPI) prediction is generally treated as a problem of binary classification wherein negative data sampling is still an open problem to be addressed. The commonly used random sampling is prone to yield less representative negative data with considerable false negatives. Meanwhile rational constraints are seldom exerted on model selection to reduce the risk of false positive predictions for most of the existing computational methods. In this work, we propose a novel negative data sampling method based on one-class SVM (support vector machine, SVM) to predict proteome-wide protein interactions between HTLV retrovirus and Homo sapiens, wherein one-class SVM is used to choose reliable and representative negative data, and two-class SVM is used to yield proteome-wide outcomes as predictive feedback for rational model selection. Computational results suggest that one-class SVM is more suited to be used as negative data sampling method than two-class PPI predictor, and the predictive feedback constrained model selection helps to yield a rational predictive model that reduces the risk of false positive predictions. Some predictions have been validated by the recent literature. Lastly, gene ontology based clustering of the predicted PPI networks is conducted to provide valuable cues for the pathogenesis of HTLV retrovirus.
Detection of protein-protein interactions by ribosome display and protein in situ immobilisation.
He, Mingyue; Liu, Hong; Turner, Martin; Taussig, Michael J
2009-12-31
We describe a method for identification of protein-protein interactions by combining two cell-free protein technologies, namely ribosome display and protein in situ immobilisation. The method requires only PCR fragments as the starting material, the target proteins being made through cell-free protein synthesis, either associated with their encoding mRNA as ribosome complexes or immobilised on a solid surface. The use of ribosome complexes allows identification of interacting protein partners from their attached coding mRNA. To demonstrate the procedures, we have employed the lymphocyte signalling proteins Vav1 and Grb2 and confirmed the interaction between Grb2 and the N-terminal SH3 domain of Vav1. The method has promise for library screening of pairwise protein interactions, down to the analytical level of individual domain or motif mapping.
Molecular tweezers modulate 14-3-3 protein-protein interactions
Bier, David; Rose, Rolf; Bravo-Rodriguez, Kenny; Bartel, Maria; Ramirez-Anguita, Juan Manuel; Dutt, Som; Wilch, Constanze; Klärner, Frank-Gerrit; Sanchez-Garcia, Elsa; Schrader, Thomas; Ottmann, Christian
2013-03-01
Supramolecular chemistry has recently emerged as a promising way to modulate protein functions, but devising molecules that will interact with a protein in the desired manner is difficult as many competing interactions exist in a biological environment (with solvents, salts or different sites for the target biomolecule). We now show that lysine-specific molecular tweezers bind to a 14-3-3 adapter protein and modulate its interaction with partner proteins. The tweezers inhibit binding between the 14-3-3 protein and two partner proteins—a phosphorylated (C-Raf) protein and an unphosphorylated one (ExoS)—in a concentration-dependent manner. Protein crystallography shows that this effect arises from the binding of the tweezers to a single surface-exposed lysine (Lys214) of the 14-3-3 protein in the proximity of its central channel, which normally binds the partner proteins. A combination of structural analysis and computer simulations provides rules for the tweezers' binding preferences, thus allowing us to predict their influence on this type of protein-protein interactions.
Hot-spot analysis for drug discovery targeting protein-protein interactions.
Rosell, Mireia; Fernández-Recio, Juan
2018-04-01
Protein-protein interactions are important for biological processes and pathological situations, and are attractive targets for drug discovery. However, rational drug design targeting protein-protein interactions is still highly challenging. Hot-spot residues are seen as the best option to target such interactions, but their identification requires detailed structural and energetic characterization, which is only available for a tiny fraction of protein interactions. Areas covered: In this review, the authors cover a variety of computational methods that have been reported for the energetic analysis of protein-protein interfaces in search of hot-spots, and the structural modeling of protein-protein complexes by docking. This can help to rationalize the discovery of small-molecule inhibitors of protein-protein interfaces of therapeutic interest. Computational analysis and docking can help to locate the interface, molecular dynamics can be used to find suitable cavities, and hot-spot predictions can focus the search for inhibitors of protein-protein interactions. Expert opinion: A major difficulty for applying rational drug design methods to protein-protein interactions is that in the majority of cases the complex structure is not available. Fortunately, computational docking can complement experimental data. An interesting aspect to explore in the future is the integration of these strategies for targeting PPIs with large-scale mutational analysis.
Protein-protein interactions and cancer: targeting the central dogma.
Garner, Amanda L; Janda, Kim D
2011-01-01
Between 40,000 and 200,000 protein-protein interactions have been predicted to exist within the human interactome. As these interactions are of a critical nature in many important cellular functions and their dysregulation is causal of disease, the modulation of these binding events has emerged as a leading, yet difficult therapeutic arena. In particular, the targeting of protein-protein interactions relevant to cancer is of fundamental importance as the tumor-promoting function of several aberrantly expressed proteins in the cancerous state is directly resultant of its ability to interact with a protein-binding partner. Of significance, these protein complexes play a crucial role in each of the steps of the central dogma of molecular biology, the fundamental processes of genetic transmission. With the many important discoveries being made regarding the mechanisms of these genetic process, the identification of new chemical probes are needed to better understand and validate the druggability of protein-protein interactions related to the central dogma. In this review, we provide an overview of current small molecule-based protein-protein interaction inhibitors for each stage of the central dogma: transcription, mRNA splicing and translation. Importantly, through our analysis we have uncovered a lack of necessary probes targeting mRNA splicing and translation, thus, opening up the possibility for expansion of these fields.
Directory of Open Access Journals (Sweden)
Greenblatt Jack
2006-07-01
Full Text Available Abstract Background Identification of protein interaction networks has received considerable attention in the post-genomic era. The currently available biochemical approaches used to detect protein-protein interactions are all time and labour intensive. Consequently there is a growing need for the development of computational tools that are capable of effectively identifying such interactions. Results Here we explain the development and implementation of a novel Protein-Protein Interaction Prediction Engine termed PIPE. This tool is capable of predicting protein-protein interactions for any target pair of the yeast Saccharomyces cerevisiae proteins from their primary structure and without the need for any additional information or predictions about the proteins. PIPE showed a sensitivity of 61% for detecting any yeast protein interaction with 89% specificity and an overall accuracy of 75%. This rate of success is comparable to those associated with the most commonly used biochemical techniques. Using PIPE, we identified a novel interaction between YGL227W (vid30 and YMR135C (gid8 yeast proteins. This lead us to the identification of a novel yeast complex that here we term vid30 complex (vid30c. The observed interaction was confirmed by tandem affinity purification (TAP tag, verifying the ability of PIPE to predict novel protein-protein interactions. We then used PIPE analysis to investigate the internal architecture of vid30c. It appeared from PIPE analysis that vid30c may consist of a core and a secondary component. Generation of yeast gene deletion strains combined with TAP tagging analysis indicated that the deletion of a member of the core component interfered with the formation of vid30c, however, deletion of a member of the secondary component had little effect (if any on the formation of vid30c. Also, PIPE can be used to analyse yeast proteins for which TAP tagging fails, thereby allowing us to predict protein interactions that are not
Weighted Protein Interaction Network Analysis of Frontotemporal Dementia.
Ferrari, Raffaele; Lovering, Ruth C; Hardy, John; Lewis, Patrick A; Manzoni, Claudia
2017-02-03
The genetic analysis of complex disorders has undoubtedly led to the identification of a wealth of associations between genes and specific traits. However, moving from genetics to biochemistry one gene at a time has, to date, rather proved inefficient and under-powered to comprehensively explain the molecular basis of phenotypes. Here we present a novel approach, weighted protein-protein interaction network analysis (W-PPI-NA), to highlight key functional players within relevant biological processes associated with a given trait. This is exemplified in the current study by applying W-PPI-NA to frontotemporal dementia (FTD): We first built the state of the art FTD protein network (FTD-PN) and then analyzed both its topological and functional features. The FTD-PN resulted from the sum of the individual interactomes built around FTD-spectrum genes, leading to a total of 4198 nodes. Twenty nine of 4198 nodes, called inter-interactome hubs (IIHs), represented those interactors able to bridge over 60% of the individual interactomes. Functional annotation analysis not only reiterated and reinforced previous findings from single genes and gene-coexpression analyses but also indicated a number of novel potential disease related mechanisms, including DNA damage response, gene expression regulation, and cell waste disposal and potential biomarkers or therapeutic targets including EP300. These processes and targets likely represent the functional core impacted in FTD, reflecting the underlying genetic architecture contributing to disease. The approach presented in this study can be applied to other complex traits for which risk-causative genes are known as it provides a promising tool for setting the foundations for collating genomics and wet laboratory data in a bidirectional manner. This is and will be critical to accelerate molecular target prioritization and drug discovery.
Supervised maximum-likelihood weighting of composite protein networks for complex prediction
Directory of Open Access Journals (Sweden)
Yong Chern Han
2012-12-01
Full Text Available Abstract Background Protein complexes participate in many important cellular functions, so finding the set of existent complexes is essential for understanding the organization and regulation of processes in the cell. With the availability of large amounts of high-throughput protein-protein interaction (PPI data, many algorithms have been proposed to discover protein complexes from PPI networks. However, such approaches are hindered by the high rate of noise in high-throughput PPI data, including spurious and missing interactions. Furthermore, many transient interactions are detected between proteins that are not from the same complex, while not all proteins from the same complex may actually interact. As a result, predicted complexes often do not match true complexes well, and many true complexes go undetected. Results We address these challenges by integrating PPI data with other heterogeneous data sources to construct a composite protein network, and using a supervised maximum-likelihood approach to weight each edge based on its posterior probability of belonging to a complex. We then use six different clustering algorithms, and an aggregative clustering strategy, to discover complexes in the weighted network. We test our method on Saccharomyces cerevisiae and Homo sapiens, and show that complex discovery is improved: compared to previously proposed supervised and unsupervised weighting approaches, our method recalls more known complexes, achieves higher precision at all recall levels, and generates novel complexes of greater functional similarity. Furthermore, our maximum-likelihood approach allows learned parameters to be used to visualize and evaluate the evidence of novel predictions, aiding human judgment of their credibility. Conclusions Our approach integrates multiple data sources with supervised learning to create a weighted composite protein network, and uses six clustering algorithms with an aggregative clustering strategy to
A credit-card library approach for disrupting protein-protein interactions.
Xu, Yang; Shi, Jin; Yamamoto, Noboru; Moss, Jason A; Vogt, Peter K; Janda, Kim D
2006-04-15
Protein-protein interfaces are prominent in many therapeutically important targets. Using small organic molecules to disrupt protein-protein interactions is a current challenge in chemical biology. An important example of protein-protein interactions is provided by the Myc protein, which is frequently deregulated in human cancers. Myc belongs to the family of basic helix-loop-helix leucine zipper (bHLH-ZIP) transcription factors. It is biologically active only as heterodimer with the bHLH-ZIP protein Max. Herein, we report a new strategy for the disruption of protein-protein interactions that has been corroborated through the design and synthesis of a small parallel library composed of 'credit-card' compounds. These compounds are derived from a planar, aromatic scaffold and functionalized with four points of diversity. From a 285 membered library, several hits were obtained that disrupted the c-Myc-Max interaction and cellular functions of c-Myc. The IC50 values determined for this small focused library for the disruption of Myc-Max dimerization are quite potent, especially since small molecule antagonists of protein-protein interactions are notoriously difficult to find. Furthermore, several of the compounds were active at the cellular level as shown by their biological effects on Myc action in chicken embryo fibroblast assays. In light of our findings, this approach is considered a valuable addition to the armamentarium of new molecules being developed to interact with protein-protein interfaces. Finally, this strategy for disrupting protein-protein interactions should prove applicable to other families of proteins.
Improving prediction of heterodimeric protein complexes using combination with pairwise kernel.
Ruan, Peiying; Hayashida, Morihiro; Akutsu, Tatsuya; Vert, Jean-Philippe
2018-02-19
Since many proteins become functional only after they interact with their partner proteins and form protein complexes, it is essential to identify the sets of proteins that form complexes. Therefore, several computational methods have been proposed to predict complexes from the topology and structure of experimental protein-protein interaction (PPI) network. These methods work well to predict complexes involving at least three proteins, but generally fail at identifying complexes involving only two different proteins, called heterodimeric complexes or heterodimers. There is however an urgent need for efficient methods to predict heterodimers, since the majority of known protein complexes are precisely heterodimers. In this paper, we use three promising kernel functions, Min kernel and two pairwise kernels, which are Metric Learning Pairwise Kernel (MLPK) and Tensor Product Pairwise Kernel (TPPK). We also consider the normalization forms of Min kernel. Then, we combine Min kernel or its normalization form and one of the pairwise kernels by plugging. We applied kernels based on PPI, domain, phylogenetic profile, and subcellular localization properties to predicting heterodimers. Then, we evaluate our method by employing C-Support Vector Classification (C-SVC), carrying out 10-fold cross-validation, and calculating the average F-measures. The results suggest that the combination of normalized-Min-kernel and MLPK leads to the best F-measure and improved the performance of our previous work, which had been the best existing method so far. We propose new methods to predict heterodimers, using a machine learning-based approach. We train a support vector machine (SVM) to discriminate interacting vs non-interacting protein pairs, based on informations extracted from PPI, domain, phylogenetic profiles and subcellular localization. We evaluate in detail new kernel functions to encode these data, and report prediction performance that outperforms the state-of-the-art.
Yin, Baoru; Zhang, Rujing; Yao, Ping
2015-03-20
The applications of plant proteins in the food and beverage industry have been hampered by their precipitation in acidic solution. In this study, pea protein isolate (PPI) with poor dispersibility in acidic solution was used to form complexes with soybean soluble polysaccharide (SSPS), and the effects of PPI aggregates on the structure and stability of PPI/SSPS complex emulsions were investigated. Under acidic conditions, high pressure homogenization disrupts the PPI aggregates and the electrostatic attraction between PPI and SSPS facilitates the formation of dispersible PPI/SSPS complexes. The PPI/SSPS complex emulsions prepared from the PPI containing aggregates prove to possess similar droplet structure and similar stability compared with the PPI/SSPS emulsions produced from the PPI in which the aggregates have been previously removed by centrifugation. The oil droplets are protected by PPI/SSPS complex interfacial films and SSPS surfaces. The emulsions show long-term stability against pH and NaCl concentration changes. This study demonstrates that PPI aggregates can also be used to produce stable complex emulsions, which may promote the applications of plant proteins in the food and beverage industry.
Directory of Open Access Journals (Sweden)
Baoru Yin
2015-03-01
Full Text Available The applications of plant proteins in the food and beverage industry have been hampered by their precipitation in acidic solution. In this study, pea protein isolate (PPI with poor dispersibility in acidic solution was used to form complexes with soybean soluble polysaccharide (SSPS, and the effects of PPI aggregates on the structure and stability of PPI/SSPS complex emulsions were investigated. Under acidic conditions, high pressure homogenization disrupts the PPI aggregates and the electrostatic attraction between PPI and SSPS facilitates the formation of dispersible PPI/SSPS complexes. The PPI/SSPS complex emulsions prepared from the PPI containing aggregates prove to possess similar droplet structure and similar stability compared with the PPI/SSPS emulsions produced from the PPI in which the aggregates have been previously removed by centrifugation. The oil droplets are protected by PPI/SSPS complex interfacial films and SSPS surfaces. The emulsions show long-term stability against pH and NaCl concentration changes. This study demonstrates that PPI aggregates can also be used to produce stable complex emulsions, which may promote the applications of plant proteins in the food and beverage industry.
Quality control methodology for high-throughput protein-protein interaction screening.
Vazquez, Alexei; Rual, Jean-François; Venkatesan, Kavitha
2011-01-01
Protein-protein interactions are key to many aspects of the cell, including its cytoskeletal structure, the signaling processes in which it is involved, or its metabolism. Failure to form protein complexes or signaling cascades may sometimes translate into pathologic conditions such as cancer or neurodegenerative diseases. The set of all protein interactions between the proteins encoded by an organism constitutes its protein interaction network, representing a scaffold for biological function. Knowing the protein interaction network of an organism, combined with other sources of biological information, can unravel fundamental biological circuits and may help better understand the molecular basics of human diseases. The protein interaction network of an organism can be mapped by combining data obtained from both low-throughput screens, i.e., "one gene at a time" experiments and high-throughput screens, i.e., screens designed to interrogate large sets of proteins at once. In either case, quality controls are required to deal with the inherent imperfect nature of experimental assays. In this chapter, we discuss experimental and statistical methodologies to quantify error rates in high-throughput protein-protein interactions screens.
Quantifying the molecular origins of opposite solvent effects on protein-protein interactions.
Directory of Open Access Journals (Sweden)
Vincent Vagenende
Full Text Available Although the nature of solvent-protein interactions is generally weak and non-specific, addition of cosolvents such as denaturants and osmolytes strengthens protein-protein interactions for some proteins, whereas it weakens protein-protein interactions for others. This is exemplified by the puzzling observation that addition of glycerol oppositely affects the association constants of two antibodies, D1.3 and D44.1, with lysozyme. To resolve this conundrum, we develop a methodology based on the thermodynamic principles of preferential interaction theory and the quantitative characterization of local protein solvation from molecular dynamics simulations. We find that changes of preferential solvent interactions at the protein-protein interface quantitatively account for the opposite effects of glycerol on the antibody-antigen association constants. Detailed characterization of local protein solvation in the free and associated protein states reveals how opposite solvent effects on protein-protein interactions depend on the extent of dewetting of the protein-protein contact region and on structural changes that alter cooperative solvent-protein interactions at the periphery of the protein-protein interface. These results demonstrate the direct relationship between macroscopic solvent effects on protein-protein interactions and atom-scale solvent-protein interactions, and establish a general methodology for predicting and understanding solvent effects on protein-protein interactions in diverse biological environments.
Identification of Protein-Protein Interactions with Glutathione-S-Transferase (GST) Fusion Proteins.
Einarson, Margret B; Pugacheva, Elena N; Orlinick, Jason R
2007-08-01
INTRODUCTIONGlutathione-S-transferase (GST) fusion proteins have had a wide range of applications since their introduction as tools for synthesis of recombinant proteins in bacteria. GST was originally selected as a fusion moiety because of several desirable properties. First and foremost, when expressed in bacteria alone, or as a fusion, GST is not sequestered in inclusion bodies (in contrast to previous fusion protein systems). Second, GST can be affinity-purified without denaturation because it binds to immobilized glutathione, which provides the basis for simple purification. Consequently, GST fusion proteins are routinely used for antibody generation and purification, protein-protein interaction studies, and biochemical analysis. This article describes the use of GST fusion proteins as probes for the identification of protein-protein interactions.
[Detection of protein-protein interactions by FRET and BRET methods].
Matoulková, E; Vojtěšek, B
2014-01-01
Nowadays, in vivo protein-protein interaction studies have become preferable detecting meth-ods that enable to show or specify (already known) protein interactions and discover their inhibitors. They also facilitate detection of protein conformational changes and discovery or specification of signaling pathways in living cells. One group of in vivo methods enabling these findings is based on fluorescent resonance energy transfer (FRET) and its bio-luminescent modification (BRET). They are based on visualization of protein-protein interactions via light or enzymatic excitation of fluorescent or bio-luminescent proteins. These methods allow not only protein localization within the cell or its organelles (or small animals) but they also allow us to quantify fluorescent signals and to discover weak or strong interaction partners. In this review, we explain the principles of FRET and BRET, their applications in the characterization of protein-protein interactions and we describe several findings using these two methods that clarify molecular and cellular mechanisms and signals related to cancer biology.
Selection of peptides interfering with protein-protein interaction.
Gaida, Annette; Hagemann, Urs B; Mattay, Dinah; Räuber, Christina; Müller, Kristian M; Arndt, Katja M
2009-01-01
Cell physiology depends on a fine-tuned network of protein-protein interactions, and misguided interactions are often associated with various diseases. Consequently, peptides, which are able to specifically interfere with such adventitious interactions, are of high interest for analytical as well as medical purposes. One of the most abundant protein interaction domains is the coiled-coil motif, and thus provides a premier target. Coiled coils, which consist of two or more alpha-helices wrapped around each other, have one of the simplest interaction interfaces, yet they are able to confer highly specific homo- and heterotypic interactions involved in virtually any cellular process. While there are several ways to generate interfering peptides, the combination of library design with a powerful selection system seems to be one of the most effective and promising approaches. This chapter guides through all steps of such a process, starting with library options and cloning, detailing suitable selection techniques and ending with purification for further down-stream characterization. Such generated peptides will function as versatile tools to interfere with the natural function of their targets thereby illuminating their down-stream signaling and, in general, promoting understanding of factors leading to specificity and stability in protein-protein interactions. Furthermore, peptides interfering with medically relevant proteins might become important diagnostics and therapeutics.
Linking structural features of protein complexes and biological function.
Sowmya, Gopichandran; Breen, Edmond J; Ranganathan, Shoba
2015-09-01
Protein-protein interaction (PPI) establishes the central basis for complex cellular networks in a biological cell. Association of proteins with other proteins occurs at varying affinities, yet with a high degree of specificity. PPIs lead to diverse functionality such as catalysis, regulation, signaling, immunity, and inhibition, playing a crucial role in functional genomics. The molecular principle of such interactions is often elusive in nature. Therefore, a comprehensive analysis of known protein complexes from the Protein Data Bank (PDB) is essential for the characterization of structural interface features to determine structure-function relationship. Thus, we analyzed a nonredundant dataset of 278 heterodimer protein complexes, categorized into major functional classes, for distinguishing features. Interestingly, our analysis has identified five key features (interface area, interface polar residue abundance, hydrogen bonds, solvation free energy gain from interface formation, and binding energy) that are discriminatory among the functional classes using Kruskal-Wallis rank sum test. Significant correlations between these PPI interface features amongst functional categories are also documented. Salt bridges correlate with interface area in regulator-inhibitors (r = 0.75). These representative features have implications for the prediction of potential function of novel protein complexes. The results provide molecular insights for better understanding of PPIs and their relation to biological functions. © 2015 The Protein Society.
Water-Protein Interactions: The Secret of Protein Dynamics
Directory of Open Access Journals (Sweden)
Silvia Martini
2013-01-01
Full Text Available Water-protein interactions help to maintain flexible conformation conditions which are required for multifunctional protein recognition processes. The intimate relationship between the protein surface and hydration water can be analyzed by studying experimental water properties measured in protein systems in solution. In particular, proteins in solution modify the structure and the dynamics of the bulk water at the solute-solvent interface. The ordering effects of proteins on hydration water are extended for several angstroms. In this paper we propose a method for analyzing the dynamical properties of the water molecules present in the hydration shells of proteins. The approach is based on the analysis of the effects of protein-solvent interactions on water protons NMR relaxation parameters. NMR relaxation parameters, especially the nonselective (R1NS and selective (R1SE spin-lattice relaxation rates of water protons, are useful for investigating the solvent dynamics at the macromolecule-solvent interfaces as well as the perturbation effects caused by the water-macromolecule interactions on the solvent dynamical properties. In this paper we demonstrate that Nuclear Magnetic Resonance Spectroscopy can be used to determine the dynamical contributions of proteins to the water molecules belonging to their hydration shells.
A scored human protein-protein interaction network to catalyze genomic interpretation
DEFF Research Database (Denmark)
Li, Taibo; Wernersson, Rasmus; Hansen, Rasmus B
2017-01-01
Genome-scale human protein-protein interaction networks are critical to understanding cell biology and interpreting genomic data, but challenging to produce experimentally. Through data integration and quality control, we provide a scored human protein-protein interaction network (InWeb_InBioMap,......Genome-scale human protein-protein interaction networks are critical to understanding cell biology and interpreting genomic data, but challenging to produce experimentally. Through data integration and quality control, we provide a scored human protein-protein interaction network (In...
Evolutionary diversification of protein-protein interactions by interface add-ons.
Plach, Maximilian G; Semmelmann, Florian; Busch, Florian; Busch, Markus; Heizinger, Leonhard; Wysocki, Vicki H; Merkl, Rainer; Sterner, Reinhard
2017-10-03
Cells contain a multitude of protein complexes whose subunits interact with high specificity. However, the number of different protein folds and interface geometries found in nature is limited. This raises the question of how protein-protein interaction specificity is achieved on the structural level and how the formation of nonphysiological complexes is avoided. Here, we describe structural elements called interface add-ons that fulfill this function and elucidate their role for the diversification of protein-protein interactions during evolution. We identified interface add-ons in 10% of a representative set of bacterial, heteromeric protein complexes. The importance of interface add-ons for protein-protein interaction specificity is demonstrated by an exemplary experimental characterization of over 30 cognate and hybrid glutamine amidotransferase complexes in combination with comprehensive genetic profiling and protein design. Moreover, growth experiments showed that the lack of interface add-ons can lead to physiologically harmful cross-talk between essential biosynthetic pathways. In sum, our complementary in silico, in vitro, and in vivo analysis argues that interface add-ons are a practical and widespread evolutionary strategy to prevent the formation of nonphysiological complexes by specializing protein-protein interactions.
Prediction of protein–protein interactions: unifying evolution and structure at protein interfaces
International Nuclear Information System (INIS)
Tuncbag, Nurcan; Gursoy, Attila; Keskin, Ozlem
2011-01-01
The vast majority of the chores in the living cell involve protein–protein interactions. Providing details of protein interactions at the residue level and incorporating them into protein interaction networks are crucial toward the elucidation of a dynamic picture of cells. Despite the rapid increase in the number of structurally known protein complexes, we are still far away from a complete network. Given experimental limitations, computational modeling of protein interactions is a prerequisite to proceed on the way to complete structural networks. In this work, we focus on the question 'how do proteins interact?' rather than 'which proteins interact?' and we review structure-based protein–protein interaction prediction approaches. As a sample approach for modeling protein interactions, PRISM is detailed which combines structural similarity and evolutionary conservation in protein interfaces to infer structures of complexes in the protein interaction network. This will ultimately help us to understand the role of protein interfaces in predicting bound conformations
Categorizing Biases in High-Confidence High-Throughput Protein-Protein Interaction Data Sets*
Yu, Xueping; Ivanic, Joseph; Memišević, Vesna; Wallqvist, Anders; Reifman, Jaques
2011-01-01
We characterized and evaluated the functional attributes of three yeast high-confidence protein-protein interaction data sets derived from affinity purification/mass spectrometry, protein-fragment complementation assay, and yeast two-hybrid experiments. The interacting proteins retrieved from these data sets formed distinct, partially overlapping sets with different protein-protein interaction characteristics. These differences were primarily a function of the deployed experimental technologies used to recover these interactions. This affected the total coverage of interactions and was especially evident in the recovery of interactions among different functional classes of proteins. We found that the interaction data obtained by the yeast two-hybrid method was the least biased toward any particular functional characterization. In contrast, interacting proteins in the affinity purification/mass spectrometry and protein-fragment complementation assay data sets were over- and under-represented among distinct and different functional categories. We delineated how these differences affected protein complex organization in the network of interactions, in particular for strongly interacting complexes (e.g. RNA and protein synthesis) versus weak and transient interacting complexes (e.g. protein transport). We quantified methodological differences in detecting protein interactions from larger protein complexes, in the correlation of protein abundance among interacting proteins, and in their connectivity of essential proteins. In the latter case, we showed that minimizing inherent methodology biases removed many of the ambiguous conclusions about protein essentiality and protein connectivity. We used these findings to rationalize how biological insights obtained by analyzing data sets originating from different sources sometimes do not agree or may even contradict each other. An important corollary of this work was that discrepancies in biological insights did not
Protein-protein interaction network-based detection of functionally similar proteins within species.
Song, Baoxing; Wang, Fen; Guo, Yang; Sang, Qing; Liu, Min; Li, Dengyun; Fang, Wei; Zhang, Deli
2012-07-01
Although functionally similar proteins across species have been widely studied, functionally similar proteins within species showing low sequence similarity have not been examined in detail. Identification of these proteins is of significant importance for understanding biological functions, evolution of protein families, progression of co-evolution, and convergent evolution and others which cannot be obtained by detection of functionally similar proteins across species. Here, we explored a method of detecting functionally similar proteins within species based on graph theory. After denoting protein-protein interaction networks using graphs, we split the graphs into subgraphs using the 1-hop method. Proteins with functional similarities in a species were detected using a method of modified shortest path to compare these subgraphs and to find the eligible optimal results. Using seven protein-protein interaction networks and this method, some functionally similar proteins with low sequence similarity that cannot detected by sequence alignment were identified. By analyzing the results, we found that, sometimes, it is difficult to separate homologous from convergent evolution. Evaluation of the performance of our method by gene ontology term overlap showed that the precision of our method was excellent. Copyright © 2012 Wiley Periodicals, Inc.
Bengali, Aditya N; Tessier, Peter M
2009-10-01
"Reversible" protein interactions govern diverse biological behavior ranging from intracellular transport and toxic protein aggregation to protein crystallization and inactivation of protein therapeutics. Much less is known about weak protein interactions than their stronger counterparts since they are difficult to characterize, especially in a parallel format (in contrast to a sequential format) necessary for high-throughput screening. We have recently introduced a highly efficient approach of characterizing protein self-association, namely self-interaction nanoparticle spectroscopy (SINS; Tessier et al., 2008; J Am Chem Soc 130:3106-3112). This approach exploits the separation-dependent optical properties of gold nanoparticles to detect weak self-interactions between proteins immobilized on nanoparticles. A limitation of our previous work is that differences in the sequence and structure of proteins can lead to significant differences in their affinity to adsorb to nanoparticle surfaces, which complicates analysis of the corresponding protein self-association behavior. In this work we demonstrate a highly specific approach for coating nanoparticles with proteins using biotin-avidin interactions to generate protein-nanoparticle conjugates that report protein self-interactions through changes in their optical properties. Using lysozyme as a model protein that is refractory to characterization by conventional SINS, we demonstrate that surface Plasmon wavelengths for gold-avidin-lysozyme conjugates over a range of solution conditions (i.e., pH and ionic strength) are well correlated with lysozyme osmotic second virial coefficient measurements. Since SINS requires orders of magnitude less protein and time than conventional methods (e.g., static light scattering), we envision this approach will find application in large screens of protein self-association aimed at either preventing (e.g., protein aggregation) or promoting (e.g., protein crystallization) these
Protein-protein interactions in the regulation of WRKY transcription factors.
Chi, Yingjun; Yang, Yan; Zhou, Yuan; Zhou, Jie; Fan, Baofang; Yu, Jing-Quan; Chen, Zhixiang
2013-03-01
It has been almost 20 years since the first report of a WRKY transcription factor, SPF1, from sweet potato. Great progress has been made since then in establishing the diverse biological roles of WRKY transcription factors in plant growth, development, and responses to biotic and abiotic stress. Despite the functional diversity, almost all analyzed WRKY proteins recognize the TTGACC/T W-box sequences and, therefore, mechanisms other than mere recognition of the core W-box promoter elements are necessary to achieve the regulatory specificity of WRKY transcription factors. Research over the past several years has revealed that WRKY transcription factors physically interact with a wide range of proteins with roles in signaling, transcription, and chromatin remodeling. Studies of WRKY-interacting proteins have provided important insights into the regulation and mode of action of members of the important family of transcription factors. It has also emerged that the slightly varied WRKY domains and other protein motifs conserved within each of the seven WRKY subfamilies participate in protein-protein interactions and mediate complex functional interactions between WRKY proteins and between WRKY and other regulatory proteins in the modulation of important biological processes. In this review, we summarize studies of protein-protein interactions for WRKY transcription factors and discuss how the interacting partners contribute, at different levels, to the establishment of the complex regulatory and functional network of WRKY transcription factors.
Gong, Wei; He, Kun; Covington, Mike; Dinesh-Kumar, S. P.; Snyder, Michael; Harmer, Stacey L.; Zhu, Yu-Xian; Deng, Xing Wang
2009-01-01
We used our collection of Arabidopsis transcription factor (TF) ORFeome clones to construct protein microarrays containing as many as 802 TF proteins. These protein microarrays were used for both protein-DNA and protein-protein interaction analyses. For protein-DNA interaction studies, we examined AP2/ERF family TFs and their cognate cis-elements. By careful comparison of the DNA-binding specificity of 13 TFs on the protein microarray with previous non-microarray data, we showed that protein microarrays provide an efficient and high throughput tool for genome-wide analysis of TF-DNA interactions. This microarray protein-DNA interaction analysis allowed us to derive a comprehensive view of DNA-binding profiles of AP2/ERF family proteins in Arabidopsis. It also revealed four TFs that bound the EE (evening element) and had the expected phased gene expression under clock-regulation, thus providing a basis for further functional analysis of their roles in clock regulation of gene expression. We also developed procedures for detecting protein interactions using this TF protein microarray and discovered four novel partners that interact with HY5, which can be validated by yeast two-hybrid assays. Thus, plant TF protein microarrays offer an attractive high-throughput alternative to traditional techniques for TF functional characterization on a global scale. PMID:19802365
Song, Youngwoon; Yoo, Sang-Ho
2017-11-15
The quality of rice-substituted fried noodles was improved by applying interaction between pea protein isolate (PPI) and green tea extract (GTE). Radical-scavenging activities of GTE were stably maintained when exposed to acidic pH, UV light, and fluorescent light, but decreased by approximately 65% when exposed to 80°C for 168h. The RVA profiles of noodle dough showed that peak viscosity and breakdown increased significantly but that setback and final viscosity remained unchanged with 20% rice flour replacement. PPI significantly decreased the viscosity parameters of rice-supplemented dough, and the addition of GTE recovered these values significantly. The cooking loss and viscoelasticity (R max ) of cooked rice-supplemented noodles were fully restored by combined treatment of PPI and GTE. GTE decreased the peroxide value of fried noodles by 14% after storage at 63°C for 16days. Therefore, PPI+GTE treatment has great potential for use in fried noodles owing to the reinforced network and antioxidant activity. Copyright © 2017 Elsevier Ltd. All rights reserved.
Directory of Open Access Journals (Sweden)
Zheng-Yu Jiang
Full Text Available Degradation of certain proteins through the ubiquitin-proteasome pathway is a common strategy taken by the key modulators responsible for stress responses. Kelch-like ECH-associated protein-1(Keap1, a substrate adaptor component of the Cullin3 (Cul3-based ubiquitin E3 ligase complex, mediates the ubiquitination of two key modulators, NF-E2-related factor 2 (Nrf2 and IκB kinase β (IKKβ, which are involved in the redox control of gene transcription. However, compared to the Keap1-Nrf2 protein-protein interaction (PPI, the intermolecular recognition mechanism of Keap1 and IKKβ has been poorly investigated. In order to explore the binding pattern between Keap1 and IKKβ, the PPI model of Keap1 and IKKβ was investigated. The structure of human IKKβ was constructed by means of the homology modeling method and using reported crystal structure of Xenopus laevis IKKβ as the template. A protein-protein docking method was applied to develop the Keap1-IKKβ complex model. After the refinement and visual analysis of docked proteins, the chosen pose was further optimized through molecular dynamics simulations. The resulting structure was utilized to conduct the virtual alanine mutation for the exploration of hot-spots significant for the intermolecular interaction. Overall, our results provided structural insights into the PPI model of Keap1-IKKβ and suggest that the substrate specificity of Keap1 depend on the interaction with the key tyrosines, namely Tyr525, Tyr574 and Tyr334. The study presented in the current project may be useful to design molecules that selectively modulate Keap1. The selective recognition mechanism of Keap1 with IKKβ or Nrf2 will be helpful to further know the crosstalk between NF-κB and Nrf2 signaling.
Detecting protein-protein interactions in living cells
DEFF Research Database (Denmark)
Gottschalk, Marie; Bach, Anders; Hansen, Jakob Lerche
2009-01-01
to the endogenous C-terminal peptide of the NMDA receptor, as evaluated by a cell-free protein-protein interaction assay. However, it is important to address both membrane permeability and effect in living cells. Therefore a bioluminescence resonance energy transfer (BRET) assay was established, where the C......-terminal of the NMDA receptor and PDZ2 of PSD-95 were fused to green fluorescent protein (GFP) and Renilla luciferase (Rluc) and expressed in COS7 cells. A robust and specific BRET signal was obtained by expression of the appropriate partner proteins and subsequently, the assay was used to evaluate a Tat......The PDZ domain mediated interaction between the NMDA receptor and its intracellular scaffolding protein, PSD-95, is a potential target for treatment of ischemic brain diseases. We have recently developed a number of peptide analogues with improved affinity for the PDZ domains of PSD-95 compared...
Yeast Interacting Proteins Database: YLR447C, YOR047C [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available xpression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Sp...; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt15p; act
Interactive protein manipulation
International Nuclear Information System (INIS)
2003-01-01
We describe an interactive visualization and modeling program for the creation of protein structures ''from scratch''. The input to our program is an amino acid sequence -decoded from a gene- and a sequence of predicted secondary structure types for each amino acid-provided by external structure prediction programs. Our program can be used in the set-up phase of a protein structure prediction process; the structures created with it serve as input for a subsequent global internal energy minimization, or another method of protein structure prediction. Our program supports basic visualization methods for protein structures, interactive manipulation based on inverse kinematics, and visualization guides to aid a user in creating ''good'' initial structures
Gal, Maayan; Bloch, Itai; Shechter, Nelia; Romanenko, Olga; Shir, Ofer M
2016-01-01
Protein-protein interactions (PPI) play a critical role in regulating many cellular processes. Finding novel PPI inhibitors that interfere with specific binding of two proteins is considered a great challenge, mainly due to the complexity involved in characterizing multi-molecular systems and limited understanding of the physical principles governing PPIs. Here we show that the combination of virtual screening techniques, which are capable of filtering a large library of potential small molecule inhibitors, and a unique secondary screening by isothermal titration calorimetry, a label-free method capable of observing direct interactions, is an efficient tool for finding such an inhibitor. In this study we applied this strategy in a search for a small molecule capable of interfering with the interaction of the tumor-suppressor p53 and the E3-ligase MDM2. We virtually screened a library of 15 million small molecules that were filtered to a final set of 80 virtual hits. Our in vitro experimental assay, designed to validate the activity of mixtures of compounds by isothermal titration calorimetry, was used to identify an active molecule against MDM2. At the end of the process the small molecule (4S,7R)-4-(4-chlorophenyl)-5-hydroxy-2,7-dimethyl-N-(6-methylpyridin-2-yl)-4,6,7,8 tetrahydrIoquinoline-3-carboxamide was found to bind MDM2 with a dissociation constant of ~2 µM. Following the identification of this single bioactive compound, spectroscopic measurements were used to further characterize the interaction of the small molecule with the target protein. 2D NMR spectroscopy was used to map the binding region of the small molecule, and fluorescence polarization measurement confirmed that it indeed competes with p53.
Wuchty, Stefan
2006-05-23
While the analysis of unweighted biological webs as diverse as genetic, protein and metabolic networks allowed spectacular insights in the inner workings of a cell, biological networks are not only determined by their static grid of links. In fact, we expect that the heterogeneity in the utilization of connections has a major impact on the organization of cellular activities as well. We consider a web of interactions between protein domains of the Protein Family database (PFAM), which are weighted by a probability score. We apply metrics that combine the static layout and the weights of the underlying interactions. We observe that unweighted measures as well as their weighted counterparts largely share the same trends in the underlying domain interaction network. However, we only find weak signals that weights and the static grid of interactions are connected entities. Therefore assuming that a protein interaction is governed by a single domain interaction, we observe strong and significant correlations of the highest scoring domain interaction and the confidence of protein interactions in the underlying interactions of yeast and fly. Modeling an interaction between proteins if we find a high scoring protein domain interaction we obtain 1, 428 protein interactions among 361 proteins in the human malaria parasite Plasmodium falciparum. Assessing their quality by a logistic regression method we observe that increasing confidence of predicted interactions is accompanied by high scoring domain interactions and elevated levels of functional similarity and evolutionary conservation. Our results indicate that probability scores are randomly distributed, allowing to treat static grid and weights of domain interactions as separate entities. In particular, these finding confirms earlier observations that a protein interaction is a matter of a single interaction event on domain level. As an immediate application, we show a simple way to predict potential protein interactions
Drosophila protein interaction map (DPiM): a paradigm for metazoan protein complex interactions.
Guruharsha, K G; Obar, Robert A; Mintseris, Julian; Aishwarya, K; Krishnan, R T; Vijayraghavan, K; Artavanis-Tsakonas, Spyros
2012-01-01
Proteins perform essential cellular functions as part of protein complexes, often in conjunction with RNA, DNA, metabolites and other small molecules. The genome encodes thousands of proteins but not all of them are expressed in every cell type; and expressed proteins are not active at all times. Such diversity of protein expression and function accounts for the level of biological intricacy seen in nature. Defining protein-protein interactions in protein complexes, and establishing the when, what and where of potential interactions, is therefore crucial to understanding the cellular function of any protein-especially those that have not been well studied by traditional molecular genetic approaches. We generated a large-scale resource of affinity-tagged expression-ready clones and used co-affinity purification combined with tandem mass-spectrometry to identify protein partners of nearly 5,000 Drosophila melanogaster proteins. The resulting protein complex "map" provided a blueprint of metazoan protein complex organization. Here we describe how the map has provided valuable insights into protein function in addition to generating hundreds of testable hypotheses. We also discuss recent technological advancements that will be critical in addressing the next generation of questions arising from the map.
The Ser/Thr Protein Kinase Protein-Protein Interaction Map of M. tuberculosis.
Wu, Fan-Lin; Liu, Yin; Jiang, He-Wei; Luan, Yi-Zhao; Zhang, Hai-Nan; He, Xiang; Xu, Zhao-Wei; Hou, Jing-Li; Ji, Li-Yun; Xie, Zhi; Czajkowsky, Daniel M; Yan, Wei; Deng, Jiao-Yu; Bi, Li-Jun; Zhang, Xian-En; Tao, Sheng-Ce
2017-08-01
Mycobacterium tuberculosis (Mtb) is the causative agent of tuberculosis, the leading cause of death among all infectious diseases. There are 11 eukaryotic-like serine/threonine protein kinases (STPKs) in Mtb, which are thought to play pivotal roles in cell growth, signal transduction and pathogenesis. However, their underlying mechanisms of action remain largely uncharacterized. In this study, using a Mtb proteome microarray, we have globally identified the binding proteins in Mtb for all of the STPKs, and constructed the first STPK protein interaction (KPI) map that includes 492 binding proteins and 1,027 interactions. Bioinformatics analysis showed that the interacting proteins reflect diverse functions, including roles in two-component system, transcription, protein degradation, and cell wall integrity. Functional investigations confirmed that PknG regulates cell wall integrity through key components of peptidoglycan (PG) biosynthesis, e.g. MurC. The global STPK-KPIs network constructed here is expected to serve as a rich resource for understanding the key signaling pathways in Mtb, thus facilitating drug development and effective control of Mtb. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Molecular simulations of lipid-mediated protein-protein interactions
de Meyer, F.J.M.; Venturoli, M.; Smit, B.
2008-01-01
Recent experimental results revealed that lipid-mediated interactions due to hydrophobic forces may be important in determining the protein topology after insertion in the membrane, in regulating the protein activity, in protein aggregation and in signal transduction. To gain insight into the
Yeast Interacting Proteins Database: YGR013W, YKL012W [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available tion U1 snRNP protein involved in splicing, interacts with the branchpoint-binding protein during the formation of the second commitm... PRP40 U1 snRNP protein involved in splicing, interacts with the branchpoint-binding protein during the form...ation of the second commitment complex Rows with this prey as prey (1) Rows with
Emergence of modularity and disassortativity in protein-protein interaction networks.
Wan, Xi; Cai, Shuiming; Zhou, Jin; Liu, Zengrong
2010-12-01
In this paper, we present a simple evolution model of protein-protein interaction networks by introducing a rule of small-preference duplication of a node, meaning that the probability of a node chosen to duplicate is inversely proportional to its degree, and subsequent divergence plus nonuniform heterodimerization based on some plausible mechanisms in biology. We show that our model cannot only reproduce scale-free connectivity and small-world pattern, but also exhibit hierarchical modularity and disassortativity. After comparing the features of our model with those of real protein-protein interaction networks, we believe that our model can provide relevant insights into the mechanism underlying the evolution of protein-protein interaction networks. © 2010 American Institute of Physics.
Interactive protein manipulation
Energy Technology Data Exchange (ETDEWEB)
SNCrivelli@lbl.gov
2003-07-01
We describe an interactive visualization and modeling program for the creation of protein structures ''from scratch''. The input to our program is an amino acid sequence -decoded from a gene- and a sequence of predicted secondary structure types for each amino acid-provided by external structure prediction programs. Our program can be used in the set-up phase of a protein structure prediction process; the structures created with it serve as input for a subsequent global internal energy minimization, or another method of protein structure prediction. Our program supports basic visualization methods for protein structures, interactive manipulation based on inverse kinematics, and visualization guides to aid a user in creating ''good'' initial structures.
Globular and disordered – the non-identical twins in protein-protein interactions
Directory of Open Access Journals (Sweden)
Kaare eTeilum
2015-07-01
Full Text Available In biology proteins from different structural classes interact across and within classes in ways that are optimized to achieve balanced functional outputs. The interactions between intrinsically disordered proteins (IDPs and other proteins rely on changes in flexibility and this is seen as a strong determinant for their function. This has fostered the notion that IDP’s bind with low affinity but high specificity. Here we have analyzed available detailed thermodynamic data for protein-protein interactions to put to the test if the thermodynamic profiles of IDP interactions differ from those of other protein-protein interactions. We find that ordered proteins and the disordered ones act as non identical twins operating by similar principles but where the disordered proteins complexes are on average less stable by 2.5 kcal mol-1.
Scoring functions for protein-protein interactions.
Moal, Iain H; Moretti, Rocco; Baker, David; Fernández-Recio, Juan
2013-12-01
The computational evaluation of protein-protein interactions will play an important role in organising the wealth of data being generated by high-throughput initiatives. Here we discuss future applications, report recent developments and identify areas requiring further investigation. Many functions have been developed to quantify the structural and energetic properties of interacting proteins, finding use in interrelated challenges revolving around the relationship between sequence, structure and binding free energy. These include loop modelling, side-chain refinement, docking, multimer assembly, affinity prediction, affinity change upon mutation, hotspots location and interface design. Information derived from models optimised for one of these challenges can be used to benefit the others, and can be unified within the theoretical frameworks of multi-task learning and Pareto-optimal multi-objective learning. Copyright © 2013 Elsevier Ltd. All rights reserved.
Protein complex prediction based on k-connected subgraphs in protein interaction network
Habibi, Mahnaz; Eslahchi, Changiz; Wong, Limsoon
2010-01-01
Abstract Background Protein complexes play an important role in cellular mechanisms. Recently, several methods have been presented to predict protein complexes in a protein interaction network. In these methods, a protein complex is predicted as a dense subgraph of protein interactions. However, interactions data are incomplete and a protein complex does not have to be a complete or dense subgraph. Results We propose a more appropriate protein complex prediction method, CFA, that is based on ...
Globular and disordered-the non-identical twins in protein-protein interactions
DEFF Research Database (Denmark)
Teilum, Kaare; Olsen, Johan Gotthardt; Kragelund, Birthe Brandt
2015-01-01
as a strong determinant for their function. This has fostered the notion that IDP's bind with low affinity but high specificity. Here we have analyzed available detailed thermodynamic data for protein-protein interactions to put to the test if the thermodynamic profiles of IDP interactions differ from those...... of other protein-protein interactions. We find that ordered proteins and the disordered ones act as non-identical twins operating by similar principles but where the disordered proteins complexes are on average less stable by 2.5 kcal mol(-1)....
The role of electrostatics in protein-protein interactions of a monoclonal antibody.
Roberts, D; Keeling, R; Tracka, M; van der Walle, C F; Uddin, S; Warwicker, J; Curtis, R
2014-07-07
Understanding how protein-protein interactions depend on the choice of buffer, salt, ionic strength, and pH is needed to have better control over protein solution behavior. Here, we have characterized the pH and ionic strength dependence of protein-protein interactions in terms of an interaction parameter kD obtained from dynamic light scattering and the osmotic second virial coefficient B22 measured by static light scattering. A simplified protein-protein interaction model based on a Baxter adhesive potential and an electric double layer force is used to separate out the contributions of longer-ranged electrostatic interactions from short-ranged attractive forces. The ionic strength dependence of protein-protein interactions for solutions at pH 6.5 and below can be accurately captured using a Deryaguin-Landau-Verwey-Overbeek (DLVO) potential to describe the double layer forces. In solutions at pH 9, attractive electrostatics occur over the ionic strength range of 5-275 mM. At intermediate pH values (7.25 to 8.5), there is a crossover effect characterized by a nonmonotonic ionic strength dependence of protein-protein interactions, which can be rationalized by the competing effects of long-ranged repulsive double layer forces at low ionic strength and a shorter ranged electrostatic attraction, which dominates above a critical ionic strength. The change of interactions from repulsive to attractive indicates a concomitant change in the angular dependence of protein-protein interaction from isotropic to anisotropic. In the second part of the paper, we show how the Baxter adhesive potential can be used to predict values of kD from fitting to B22 measurements, thus providing a molecular basis for the linear correlation between the two protein-protein interaction parameters.
Towards a map of the Populus biomass protein-protein interaction network
Energy Technology Data Exchange (ETDEWEB)
Beers, Eric [Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States); Brunner, Amy [Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States); Helm, Richard [Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States); Dickerman, Allan [Virginia Polytechnic Inst. and State Univ. (Virginia Tech), Blacksburg, VA (United States)
2015-07-31
Biofuels can be produced from a variety of plant feedstocks. The value of a particular feedstock for biofuels production depends in part on the degree of difficulty associated with the extraction of fermentable sugars from the plant biomass. The wood of trees is potentially a rich source fermentable sugars. However, the sugars in wood exist in a tightly cross-linked matrix of cellulose, hemicellulose, and lignin, making them largely recalcitrant to release and fermentation for biofuels production. Before breeders and genetic engineers can effectively develop plants with reduced recalcitrance to fermentation, it is necessary to gain a better understanding of the fundamental biology of the mechanisms responsible for wood formation. Regulatory, structural, and enzymatic proteins are required for the complicated process of wood formation. To function properly, proteins must interact with other proteins. Yet, very few of the protein-protein interactions necessary for wood formation are known. The main objectives of this project were to 1) identify new protein-protein interactions relevant to wood formation, and 2) perform in-depth characterizations of selected protein-protein interactions. To identify relevant protein-protein interactions, we cloned a set of approximately 400 genes that were highly expressed in the wood-forming tissue (known as secondary xylem) of poplar (Populus trichocarpa). We tested whether the proteins encoded by these biomass genes interacted with each other in a binary matrix design using the yeast two-hybrid (Y2H) method for protein-protein interaction discovery. We also tested a subset of the 400 biomass proteins for interactions with all proteins present in wood-forming tissue of poplar in a biomass library screen design using Y2H. Together, these two Y2H screens yielded over 270 interactions involving over 75 biomass proteins. For the second main objective we selected several interacting pairs or groups of interacting proteins for in
Characterizing Protein Interactions Employing a Genome-Wide siRNA Cellular Phenotyping Screen
Suratanee, Apichat; Schaefer, Martin H.; Betts, Matthew J.; Soons, Zita; Mannsperger, Heiko; Harder, Nathalie; Oswald, Marcus; Gipp, Markus; Ramminger, Ellen; Marcus, Guillermo; Männer, Reinhard; Rohr, Karl; Wanker, Erich; Russell, Robert B.; Andrade-Navarro, Miguel A.; Eils, Roland; König, Rainer
2014-01-01
Characterizing the activating and inhibiting effect of protein-protein interactions (PPI) is fundamental to gain insight into the complex signaling system of a human cell. A plethora of methods has been suggested to infer PPI from data on a large scale, but none of them is able to characterize the effect of this interaction. Here, we present a novel computational development that employs mitotic phenotypes of a genome-wide RNAi knockdown screen and enables identifying the activating and inhibiting effects of PPIs. Exemplarily, we applied our technique to a knockdown screen of HeLa cells cultivated at standard conditions. Using a machine learning approach, we obtained high accuracy (82% AUC of the receiver operating characteristics) by cross-validation using 6,870 known activating and inhibiting PPIs as gold standard. We predicted de novo unknown activating and inhibiting effects for 1,954 PPIs in HeLa cells covering the ten major signaling pathways of the Kyoto Encyclopedia of Genes and Genomes, and made these predictions publicly available in a database. We finally demonstrate that the predicted effects can be used to cluster knockdown genes of similar biological processes in coherent subgroups. The characterization of the activating or inhibiting effect of individual PPIs opens up new perspectives for the interpretation of large datasets of PPIs and thus considerably increases the value of PPIs as an integrated resource for studying the detailed function of signaling pathways of the cellular system of interest. PMID:25255318
Huisman, I.H.; Prádanos, P.; Hernández, A.
2000-01-01
It was studied how protein-protein and protein-membrane interactions influence the filtration performance during the ultrafiltration of protein solutions over polymeric membranes. This was done by measuring flux, streaming potential, and protein transmission during filtration of bovine serum albumin
DEFF Research Database (Denmark)
Feliu, Elisenda; Aloy, Patrick; Oliva, Baldo
2011-01-01
Development of effective methods to screen binary interactions obtained by rigid-body protein-protein docking is key for structure prediction of complexes and for elucidating physicochemical principles of protein-protein binding. We have derived empirical knowledge-based potential functions for s...... and with independence of the partner. This information is encoded at the residue level and could be easily incorporated in the initial grid scoring for Fast Fourier Transform rigid-body docking methods.......Development of effective methods to screen binary interactions obtained by rigid-body protein-protein docking is key for structure prediction of complexes and for elucidating physicochemical principles of protein-protein binding. We have derived empirical knowledge-based potential functions...... for selecting rigid-body docking poses. These potentials include the energetic component that provides the residues with a particular secondary structure and surface accessibility. These scoring functions have been tested on a state-of-art benchmark dataset and on a decoy dataset of permanent interactions. Our...
Finding local communities in protein networks.
Voevodski, Konstantin; Teng, Shang-Hua; Xia, Yu
2009-09-18
Protein-protein interactions (PPIs) play fundamental roles in nearly all biological processes, and provide major insights into the inner workings of cells. A vast amount of PPI data for various organisms is available from BioGRID and other sources. The identification of communities in PPI networks is of great interest because they often reveal previously unknown functional ties between proteins. A large number of global clustering algorithms have been applied to protein networks, where the entire network is partitioned into clusters. Here we take a different approach by looking for local communities in PPI networks. We develop a tool, named Local Protein Community Finder, which quickly finds a community close to a queried protein in any network available from BioGRID or specified by the user. Our tool uses two new local clustering algorithms Nibble and PageRank-Nibble, which look for a good cluster among the most popular destinations of a short random walk from the queried vertex. The quality of a cluster is determined by proportion of outgoing edges, known as conductance, which is a relative measure particularly useful in undersampled networks. We show that the two local clustering algorithms find communities that not only form excellent clusters, but are also likely to be biologically relevant functional components. We compare the performance of Nibble and PageRank-Nibble to other popular and effective graph partitioning algorithms, and show that they find better clusters in the graph. Moreover, Nibble and PageRank-Nibble find communities that are more functionally coherent. The Local Protein Community Finder, accessible at http://xialab.bu.edu/resources/lpcf, allows the user to quickly find a high-quality community close to a queried protein in any network available from BioGRID or specified by the user. We show that the communities found by our tool form good clusters and are functionally coherent, making our application useful for biologists who wish to
Finding local communities in protein networks
Directory of Open Access Journals (Sweden)
Teng Shang-Hua
2009-09-01
Full Text Available Abstract Background Protein-protein interactions (PPIs play fundamental roles in nearly all biological processes, and provide major insights into the inner workings of cells. A vast amount of PPI data for various organisms is available from BioGRID and other sources. The identification of communities in PPI networks is of great interest because they often reveal previously unknown functional ties between proteins. A large number of global clustering algorithms have been applied to protein networks, where the entire network is partitioned into clusters. Here we take a different approach by looking for local communities in PPI networks. Results We develop a tool, named Local Protein Community Finder, which quickly finds a community close to a queried protein in any network available from BioGRID or specified by the user. Our tool uses two new local clustering algorithms Nibble and PageRank-Nibble, which look for a good cluster among the most popular destinations of a short random walk from the queried vertex. The quality of a cluster is determined by proportion of outgoing edges, known as conductance, which is a relative measure particularly useful in undersampled networks. We show that the two local clustering algorithms find communities that not only form excellent clusters, but are also likely to be biologically relevant functional components. We compare the performance of Nibble and PageRank-Nibble to other popular and effective graph partitioning algorithms, and show that they find better clusters in the graph. Moreover, Nibble and PageRank-Nibble find communities that are more functionally coherent. Conclusion The Local Protein Community Finder, accessible at http://xialab.bu.edu/resources/lpcf, allows the user to quickly find a high-quality community close to a queried protein in any network available from BioGRID or specified by the user. We show that the communities found by our tool form good clusters and are functionally coherent
Bahramali, Golnaz; Goliaei, Bahram; Minuchehr, Zarrin; Marashi, Sayed-Amir
2017-02-01
Chameleon proteins are proteins which include sequences that can adopt α-helix-β-strand (HE-chameleon) or α-helix-coil (HC-chameleon) or β-strand-coil (CE-chameleon) structures to operate their crucial biological functions. In this study, using a network-based approach, we examined the chameleon proteins to give a better knowledge on these proteins. We focused on proteins with identical chameleon sequences with more than or equal to seven residues long in different PDB entries, which adopt HE-chameleon, HC-chameleon, and CE-chameleon structures in the same protein. One hundred and ninety-one human chameleon proteins were identified via our in-house program. Then, protein-protein interaction (PPI) networks, Gene ontology (GO) enrichment, disease network, and pathway enrichment analyses were performed for our derived data set. We discovered that there are chameleon sequences which reside in protein-protein interaction regions between two proteins critical for their dual function. Analysis of the PPI networks for chameleon proteins introduced five hub proteins, namely TP53, EGFR, HSP90AA1, PPARA, and HIF1A, which were presented in four PPI clusters. The outcomes demonstrate that the chameleon regions are in critical domains of these proteins and are important in the development and treatment of human cancers. The present report is the first network-based functional study of chameleon proteins using computational approaches and might provide a new perspective for understanding the mechanisms of diseases helping us in developing new medical therapies along with discovering new proteins with chameleon properties which are highly important in cancer.
Protein-Protein Interaction Databases
DEFF Research Database (Denmark)
Szklarczyk, Damian; Jensen, Lars Juhl
2015-01-01
Years of meticulous curation of scientific literature and increasingly reliable computational predictions have resulted in creation of vast databases of protein interaction data. Over the years, these repositories have become a basic framework in which experiments are analyzed and new directions...
Yeast Interacting Proteins Database: YGL237C, YOR047C [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available ene expression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding prote... expression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein
Yeast Interacting Proteins Database: YKL002W, YOR047C [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available ene expression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding prote...xpression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Sp
Protein complex prediction based on k-connected subgraphs in protein interaction network
Directory of Open Access Journals (Sweden)
Habibi Mahnaz
2010-09-01
Full Text Available Abstract Background Protein complexes play an important role in cellular mechanisms. Recently, several methods have been presented to predict protein complexes in a protein interaction network. In these methods, a protein complex is predicted as a dense subgraph of protein interactions. However, interactions data are incomplete and a protein complex does not have to be a complete or dense subgraph. Results We propose a more appropriate protein complex prediction method, CFA, that is based on connectivity number on subgraphs. We evaluate CFA using several protein interaction networks on reference protein complexes in two benchmark data sets (MIPS and Aloy, containing 1142 and 61 known complexes respectively. We compare CFA to some existing protein complex prediction methods (CMC, MCL, PCP and RNSC in terms of recall and precision. We show that CFA predicts more complexes correctly at a competitive level of precision. Conclusions Many real complexes with different connectivity level in protein interaction network can be predicted based on connectivity number. Our CFA program and results are freely available from http://www.bioinf.cs.ipm.ir/softwares/cfa/CFA.rar.
Protein complex prediction via dense subgraphs and false positive analysis.
Directory of Open Access Journals (Sweden)
Cecilia Hernandez
Full Text Available Many proteins work together with others in groups called complexes in order to achieve a specific function. Discovering protein complexes is important for understanding biological processes and predict protein functions in living organisms. Large-scale and throughput techniques have made possible to compile protein-protein interaction networks (PPI networks, which have been used in several computational approaches for detecting protein complexes. Those predictions might guide future biologic experimental research. Some approaches are topology-based, where highly connected proteins are predicted to be complexes; some propose different clustering algorithms using partitioning, overlaps among clusters for networks modeled with unweighted or weighted graphs; and others use density of clusters and information based on protein functionality. However, some schemes still require much processing time or the quality of their results can be improved. Furthermore, most of the results obtained with computational tools are not accompanied by an analysis of false positives. We propose an effective and efficient mining algorithm for discovering highly connected subgraphs, which is our base for defining protein complexes. Our representation is based on transforming the PPI network into a directed acyclic graph that reduces the number of represented edges and the search space for discovering subgraphs. Our approach considers weighted and unweighted PPI networks. We compare our best alternative using PPI networks from Saccharomyces cerevisiae (yeast and Homo sapiens (human with state-of-the-art approaches in terms of clustering, biological metrics and execution times, as well as three gold standards for yeast and two for human. Furthermore, we analyze false positive predicted complexes searching the PDBe (Protein Data Bank in Europe database in order to identify matching protein complexes that have been purified and structurally characterized. Our analysis shows
PpiA, a surface PPIase of the cyclophilin family in Lactococcus lactis.
Directory of Open Access Journals (Sweden)
Nicolas Trémillon
Full Text Available BACKGROUND: Protein folding in the envelope is a crucial limiting step of protein export and secretion. In order to better understand this process in Lactococcus lactis, a lactic acid bacterium, genes encoding putative exported folding factors like Peptidyl Prolyl Isomerases (PPIases were searched for in lactococcal genomes. RESULTS: In L. lactis, a new putative membrane PPIase of the cyclophilin subfamily, PpiA, was identified and characterized. ppiA gene was found to be constitutively expressed under normal and stress (heat shock, H(2O(2 conditions. Under normal conditions, PpiA protein was synthesized and released from intact cells by an exogenously added protease, showing that it was exposed at the cell surface. No obvious phenotype could be associated to a ppiA mutant strain under several laboratory conditions including stress conditions, except a very low sensitivity to H(2O(2. Induction of a ppiA copy provided in trans had no effect i on the thermosensitivity of an mutant strain deficient for the lactococcal surface protease HtrA and ii on the secretion and stability on four exported proteins (a highly degraded hybrid protein and three heterologous secreted proteins in an otherwise wild-type strain background. However, a recombinant soluble form of PpiA that had been produced and secreted in L. lactis and purified from a culture supernatant displayed both PPIase and chaperone activities. CONCLUSIONS: Although L. lactis PpiA, a protein produced and exposed at the cell surface under normal conditions, displayed a very moderate role in vivo, it was found, as a recombinant soluble form, to be endowed with folding activities in vitro.
Directory of Open Access Journals (Sweden)
Wuchty Stefan
2006-05-01
Full Text Available Abstract Background While the analysis of unweighted biological webs as diverse as genetic, protein and metabolic networks allowed spectacular insights in the inner workings of a cell, biological networks are not only determined by their static grid of links. In fact, we expect that the heterogeneity in the utilization of connections has a major impact on the organization of cellular activities as well. Results We consider a web of interactions between protein domains of the Protein Family database (PFAM, which are weighted by a probability score. We apply metrics that combine the static layout and the weights of the underlying interactions. We observe that unweighted measures as well as their weighted counterparts largely share the same trends in the underlying domain interaction network. However, we only find weak signals that weights and the static grid of interactions are connected entities. Therefore assuming that a protein interaction is governed by a single domain interaction, we observe strong and significant correlations of the highest scoring domain interaction and the confidence of protein interactions in the underlying interactions of yeast and fly. Modeling an interaction between proteins if we find a high scoring protein domain interaction we obtain 1, 428 protein interactions among 361 proteins in the human malaria parasite Plasmodium falciparum. Assessing their quality by a logistic regression method we observe that increasing confidence of predicted interactions is accompanied by high scoring domain interactions and elevated levels of functional similarity and evolutionary conservation. Conclusion Our results indicate that probability scores are randomly distributed, allowing to treat static grid and weights of domain interactions as separate entities. In particular, these finding confirms earlier observations that a protein interaction is a matter of a single interaction event on domain level. As an immediate application, we
A conserved mammalian protein interaction network.
Directory of Open Access Journals (Sweden)
Åsa Pérez-Bercoff
Full Text Available Physical interactions between proteins mediate a variety of biological functions, including signal transduction, physical structuring of the cell and regulation. While extensive catalogs of such interactions are known from model organisms, their evolutionary histories are difficult to study given the lack of interaction data from phylogenetic outgroups. Using phylogenomic approaches, we infer a upper bound on the time of origin for a large set of human protein-protein interactions, showing that most such interactions appear relatively ancient, dating no later than the radiation of placental mammals. By analyzing paired alignments of orthologous and putatively interacting protein-coding genes from eight mammals, we find evidence for weak but significant co-evolution, as measured by relative selective constraint, between pairs of genes with interacting proteins. However, we find no strong evidence for shared instances of directional selection within an interacting pair. Finally, we use a network approach to show that the distribution of selective constraint across the protein interaction network is non-random, with a clear tendency for interacting proteins to share similar selective constraints. Collectively, the results suggest that, on the whole, protein interactions in mammals are under selective constraint, presumably due to their functional roles.
Phthalic Acid Chemical Probes Synthesized for Protein-Protein Interaction Analysis
Directory of Open Access Journals (Sweden)
Chin-Jen Wu
2013-06-01
Full Text Available Plasticizers are additives that are used to increase the flexibility of plastic during manufacturing. However, in injection molding processes, plasticizers cannot be generated with monomers because they can peel off from the plastics into the surrounding environment, water, or food, or become attached to skin. Among the various plasticizers that are used, 1,2-benzenedicarboxylic acid (phthalic acid is a typical precursor to generate phthalates. In addition, phthalic acid is a metabolite of diethylhexyl phthalate (DEHP. According to Gene_Ontology gene/protein database, phthalates can cause genital diseases, cardiotoxicity, hepatotoxicity, nephrotoxicity, etc. In this study, a silanized linker (3-aminopropyl triethoxyslane, APTES was deposited on silicon dioxides (SiO2 particles and phthalate chemical probes were manufactured from phthalic acid and APTES–SiO2. These probes could be used for detecting proteins that targeted phthalic acid and for protein-protein interactions. The phthalic acid chemical probes we produced were incubated with epithelioid cell lysates of normal rat kidney (NRK-52E cells to detect the interactions between phthalic acid and NRK-52E extracted proteins. These chemical probes interacted with a number of chaperones such as protein disulfide-isomerase A6, heat shock proteins, and Serpin H1. Ingenuity Pathways Analysis (IPA software showed that these chemical probes were a practical technique for protein-protein interaction analysis.
3DProIN: Protein-Protein Interaction Networks and Structure Visualization.
Li, Hui; Liu, Chunmei
2014-06-14
3DProIN is a computational tool to visualize protein-protein interaction networks in both two dimensional (2D) and three dimensional (3D) view. It models protein-protein interactions in a graph and explores the biologically relevant features of the tertiary structures of each protein in the network. Properties such as color, shape and name of each node (protein) of the network can be edited in either 2D or 3D views. 3DProIN is implemented using 3D Java and C programming languages. The internet crawl technique is also used to parse dynamically grasped protein interactions from protein data bank (PDB). It is a java applet component that is embedded in the web page and it can be used on different platforms including Linux, Mac and Window using web browsers such as Firefox, Internet Explorer, Chrome and Safari. It also was converted into a mac app and submitted to the App store as a free app. Mac users can also download the app from our website. 3DProIN is available for academic research at http://bicompute.appspot.com.
Potential disruption of protein-protein interactions by graphene oxide
International Nuclear Information System (INIS)
Feng, Mei; Kang, Hongsuk; Luan, Binquan; Yang, Zaixing; Zhou, Ruhong
2016-01-01
Graphene oxide (GO) is a promising novel nanomaterial with a wide range of potential biomedical applications due to its many intriguing properties. However, very little research has been conducted to study its possible adverse effects on protein-protein interactions (and thus subsequent toxicity to human). Here, the potential cytotoxicity of GO is investigated at molecular level using large-scale, all-atom molecular dynamics simulations to explore the interaction mechanism between a protein dimer and a GO nanosheet oxidized at different levels. Our theoretical results reveal that GO nanosheet could intercalate between the two monomers of HIV-1 integrase dimer, disrupting the protein-protein interactions and eventually lead to dimer disassociation as graphene does [B. Luan et al., ACS Nano 9(1), 663 (2015)], albeit its insertion process is slower when compared with graphene due to the additional steric and attractive interactions. This study helps to better understand the toxicity of GO to cell functions which could shed light on how to improve its biocompatibility and biosafety for its wide potential biomedical applications.
Potential disruption of protein-protein interactions by graphene oxide
Energy Technology Data Exchange (ETDEWEB)
Feng, Mei [Department of Physics, Institute of Quantitative Biology, Zhejiang University, Hangzhou 310027 (China); Kang, Hongsuk; Luan, Binquan [Computational Biological Center, IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598 (United States); Yang, Zaixing [Institute of Quantitative Biology and Medicine, SRMP and RAD-X, and Collaborative Innovation Center of Radiation Medicine of Jiangsu Higher Education Institutions, Soochow University, Suzhou 215123 (China); Zhou, Ruhong, E-mail: ruhong@us.ibm.com [Department of Physics, Institute of Quantitative Biology, Zhejiang University, Hangzhou 310027 (China); Computational Biological Center, IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598 (United States); Department of Chemistry, Columbia University, New York, New York 10027 (United States)
2016-06-14
Graphene oxide (GO) is a promising novel nanomaterial with a wide range of potential biomedical applications due to its many intriguing properties. However, very little research has been conducted to study its possible adverse effects on protein-protein interactions (and thus subsequent toxicity to human). Here, the potential cytotoxicity of GO is investigated at molecular level using large-scale, all-atom molecular dynamics simulations to explore the interaction mechanism between a protein dimer and a GO nanosheet oxidized at different levels. Our theoretical results reveal that GO nanosheet could intercalate between the two monomers of HIV-1 integrase dimer, disrupting the protein-protein interactions and eventually lead to dimer disassociation as graphene does [B. Luan et al., ACS Nano 9(1), 663 (2015)], albeit its insertion process is slower when compared with graphene due to the additional steric and attractive interactions. This study helps to better understand the toxicity of GO to cell functions which could shed light on how to improve its biocompatibility and biosafety for its wide potential biomedical applications.
Interaction between plate make and protein in protein crystallisation screening.
Directory of Open Access Journals (Sweden)
Gordon J King
Full Text Available BACKGROUND: Protein crystallisation screening involves the parallel testing of large numbers of candidate conditions with the aim of identifying conditions suitable as a starting point for the production of diffraction quality crystals. Generally, condition screening is performed in 96-well plates. While previous studies have examined the effects of protein construct, protein purity, or crystallisation condition ingredients on protein crystallisation, few have examined the effect of the crystallisation plate. METHODOLOGY/PRINCIPAL FINDINGS: We performed a statistically rigorous examination of protein crystallisation, and evaluated interactions between crystallisation success and plate row/column, different plates of same make, different plate makes and different proteins. From our analysis of protein crystallisation, we found a significant interaction between plate make and the specific protein being crystallised. CONCLUSIONS/SIGNIFICANCE: Protein crystal structure determination is the principal method for determining protein structure but is limited by the need to produce crystals of the protein under study. Many important proteins are difficult to crystallize, so that identification of factors that assist crystallisation could open up the structure determination of these more challenging targets. Our findings suggest that protein crystallisation success may be improved by matching a protein with its optimal plate make.
An analysis pipeline for the inference of protein-protein interaction networks
Energy Technology Data Exchange (ETDEWEB)
Taylor, Ronald C.; Singhal, Mudita; Daly, Don S.; Gilmore, Jason M.; Cannon, William R.; Domico, Kelly O.; White, Amanda M.; Auberry, Deanna L.; Auberry, Kenneth J.; Hooker, Brian S.; Hurst, G. B.; McDermott, Jason E.; McDonald, W. H.; Pelletier, Dale A.; Schmoyer, Denise A.; Wiley, H. S.
2009-12-01
An analysis pipeline has been created for deployment of a novel algorithm, the Bayesian Estimator of Protein-Protein Association Probabilities (BEPro), for use in the reconstruction of protein-protein interaction networks. We have combined the Software Environment for BIological Network Inference (SEBINI), an interactive environment for the deployment and testing of network inference algorithms that use high-throughput data, and the Collective Analysis of Biological Interaction Networks (CABIN), software that allows integration and analysis of protein-protein interaction and gene-to-gene regulatory evidence obtained from multiple sources, to allow interactions computed by BEPro to be stored, visualized, and further analyzed. Incorporating BEPro into SEBINI and automatically feeding the resulting inferred network into CABIN, we have created a structured workflow for protein-protein network inference and supplemental analysis from sets of mass spectrometry bait-prey experiment data. SEBINI demo site: https://www.emsl.pnl.gov /SEBINI/ Contact: ronald.taylor@pnl.gov. BEPro is available at http://www.pnl.gov/statistics/BEPro3/index.htm. Contact: ds.daly@pnl.gov. CABIN is available at http://www.sysbio.org/dataresources/cabin.stm. Contact: mudita.singhal@pnl.gov.
Targeting protein-protein interaction between MLL1 and reciprocal proteins for leukemia therapy.
Wang, Zhi-Hui; Li, Dong-Dong; Chen, Wei-Lin; You, Qi-Dong; Guo, Xiao-Ke
2018-01-15
The mixed lineage leukemia protein-1 (MLL1), as a lysine methyltransferase, predominantly regulates the methylation of histone H3 lysine 4 (H3K4) and functions in hematopoietic stem cell (HSC) self-renewal. MLL1 gene fuses with partner genes that results in the generation of MLL1 fusion proteins (MLL1-FPs), which are frequently detected in acute leukemia. In the progress of leukemogenesis, a great deal of proteins cooperate with MLL1 to form multiprotein complexes serving for the dysregulation of H3K4 methylation, the overexpression of homeobox (HOX) cluster genes, and the consequent generation of leukemia. Hence, disrupting the interactions between MLL1 and the reciprocal proteins has been considered to be a new treatment strategy for leukemia. Here, we reviewed potential protein-protein interactions (PPIs) between MLL1 and its reciprocal proteins, and summarized the inhibitors to target MLL1 PPIs. The druggability of MLL1 PPIs for leukemia were also discussed. Copyright © 2017. Published by Elsevier Ltd.
DEFF Research Database (Denmark)
Rossin, Elizabeth J.; Hansen, Kasper Lage; Raychaudhuri, Soumya
2011-01-01
Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these r......Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed...... in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein-protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more...... that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non...
Directory of Open Access Journals (Sweden)
Peiying Ruan
Full Text Available Since many proteins express their functional activity by interacting with other proteins and forming protein complexes, it is very useful to identify sets of proteins that form complexes. For that purpose, many prediction methods for protein complexes from protein-protein interactions have been developed such as MCL, MCODE, RNSC, PCP, RRW, and NWE. These methods have dealt with only complexes with size of more than three because the methods often are based on some density of subgraphs. However, heterodimeric protein complexes that consist of two distinct proteins occupy a large part according to several comprehensive databases of known complexes. In this paper, we propose several feature space mappings from protein-protein interaction data, in which each interaction is weighted based on reliability. Furthermore, we make use of prior knowledge on protein domains to develop feature space mappings, domain composition kernel and its combination kernel with our proposed features. We perform ten-fold cross-validation computational experiments. These results suggest that our proposed kernel considerably outperforms the naive Bayes-based method, which is the best existing method for predicting heterodimeric protein complexes.
Dong, Yadong; Sun, Yongqi; Qin, Chao
2018-01-01
The existing protein complex detection methods can be broadly divided into two categories: unsupervised and supervised learning methods. Most of the unsupervised learning methods assume that protein complexes are in dense regions of protein-protein interaction (PPI) networks even though many true complexes are not dense subgraphs. Supervised learning methods utilize the informative properties of known complexes; they often extract features from existing complexes and then use the features to train a classification model. The trained model is used to guide the search process for new complexes. However, insufficient extracted features, noise in the PPI data and the incompleteness of complex data make the classification model imprecise. Consequently, the classification model is not sufficient for guiding the detection of complexes. Therefore, we propose a new robust score function that combines the classification model with local structural information. Based on the score function, we provide a search method that works both forwards and backwards. The results from experiments on six benchmark PPI datasets and three protein complex datasets show that our approach can achieve better performance compared with the state-of-the-art supervised, semi-supervised and unsupervised methods for protein complex detection, occasionally significantly outperforming such methods.
Mistry, Divya; Wise, Roger P; Dickerson, Julie A
2017-01-01
Identification of central genes and proteins in biomolecular networks provides credible candidates for pathway analysis, functional analysis, and essentiality prediction. The DiffSLC centrality measure predicts central and essential genes and proteins using a protein-protein interaction network. Network centrality measures prioritize nodes and edges based on their importance to the network topology. These measures helped identify critical genes and proteins in biomolecular networks. The proposed centrality measure, DiffSLC, combines the number of interactions of a protein and the gene coexpression values of genes from which those proteins were translated, as a weighting factor to bias the identification of essential proteins in a protein interaction network. Potentially essential proteins with low node degree are promoted through eigenvector centrality. Thus, the gene coexpression values are used in conjunction with the eigenvector of the network's adjacency matrix and edge clustering coefficient to improve essentiality prediction. The outcome of this prediction is shown using three variations: (1) inclusion or exclusion of gene co-expression data, (2) impact of different coexpression measures, and (3) impact of different gene expression data sets. For a total of seven networks, DiffSLC is compared to other centrality measures using Saccharomyces cerevisiae protein interaction networks and gene expression data. Comparisons are also performed for the top ranked proteins against the known essential genes from the Saccharomyces Gene Deletion Project, which show that DiffSLC detects more essential proteins and has a higher area under the ROC curve than other compared methods. This makes DiffSLC a stronger alternative to other centrality methods for detecting essential genes using a protein-protein interaction network that obeys centrality-lethality principle. DiffSLC is implemented using the igraph package in R, and networkx package in Python. The python package can be
Mass spectrometric analysis of protein interactions
DEFF Research Database (Denmark)
Borch, Jonas; Jørgensen, Thomas J. D.; Roepstorff, Peter
2005-01-01
Mass spectrometry is a powerful tool for identification of interaction partners and structural characterization of protein interactions because of its high sensitivity, mass accuracy and tolerance towards sample heterogeneity. Several tools that allow studies of protein interaction are now...... available and recent developments that increase the confidence of studies of protein interaction by mass spectrometry include quantification of affinity-purified proteins by stable isotope labeling and reagents for surface topology studies that can be identified by mass-contributing reporters (e.g. isotope...... labels, cleavable cross-linkers or fragment ions. The use of mass spectrometers to study protein interactions using deuterium exchange and for analysis of intact protein complexes recently has progressed considerably....
Energy Technology Data Exchange (ETDEWEB)
Chowdhury, Saiful M. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Shi, Liang [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Yoon, Hyunjin [Dartmouth College, Hanover, NH (United States); Ansong, Charles [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Rommereim, Leah M. [Dartmouth College, Hanover, NH (United States); Norbeck, Angela D. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Auberry, Kenneth J. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Moore, R. J. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Adkins, Joshua N. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Heffron, Fred [Oregon Health and Science Univ., Portland, OR (United States); Smith, Richard D. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
2009-02-10
We successfully modified an existing method to investigate protein-protein interactions in the pathogenic bacterium Salmonella typhimurium (STM). This method includes i) addition of a histidine-biotin-histidine tag to the bait proteins via recombinant DNA techniques; ii) in vivo cross-linking with formaldehyde; iii) tandem affinity purification of bait proteins under fully denaturing conditions; and iv) identification of the proteins cross-linked to the bait proteins by liquid-chromatography in conjunction with tandem mass-spectrometry. In vivo cross-linking stabilized protein interactions permitted the subsequent two-step purification step conducted under denaturing conditions. The two-step purification greatly reduced nonspecific binding of non-cross-linked proteins to bait proteins. Two different negative controls were employed to reduce false-positive identification. In an initial demonstration of this approach, we tagged three selected STM proteins- HimD, PduB and PhoP- with known binding partners that ranged from stable (e.g., HimD) to transient (i.e., PhoP). Distinct sets of interacting proteins were identified with each bait protein, including the known binding partners such as HimA for HimD, as well as anticipated and unexpected binding partners. Our results suggest that novel protein-protein interactions may be critical to pathogenesis by Salmonella typhimurium. .
Prediction of protein-protein interactions between viruses and human by an SVM model
Directory of Open Access Journals (Sweden)
Cui Guangyu
2012-05-01
Full Text Available Abstract Background Several computational methods have been developed to predict protein-protein interactions from amino acid sequences, but most of those methods are intended for the interactions within a species rather than for interactions across different species. Methods for predicting interactions between homogeneous proteins are not appropriate for finding those between heterogeneous proteins since they do not distinguish the interactions between proteins of the same species from those of different species. Results We developed a new method for representing a protein sequence of variable length in a frequency vector of fixed length, which encodes the relative frequency of three consecutive amino acids of a sequence. We built a support vector machine (SVM model to predict human proteins that interact with virus proteins. In two types of viruses, human papillomaviruses (HPV and hepatitis C virus (HCV, our SVM model achieved an average accuracy above 80%, which is higher than that of another SVM model with a different representation scheme. Using the SVM model and Gene Ontology (GO annotations of proteins, we predicted new interactions between virus proteins and human proteins. Conclusions Encoding the relative frequency of amino acid triplets of a protein sequence is a simple yet powerful representation method for predicting protein-protein interactions across different species. The representation method has several advantages: (1 it enables a prediction model to achieve a better performance than other representations, (2 it generates feature vectors of fixed length regardless of the sequence length, and (3 the same representation is applicable to different types of proteins.
Mahajan, Gaurang; Mande, Shekhar C
2017-04-04
A comprehensive map of the human-M. tuberculosis (MTB) protein interactome would help fill the gaps in our understanding of the disease, and computational prediction can aid and complement experimental studies towards this end. Several sequence-based in silico approaches tap the existing data on experimentally validated protein-protein interactions (PPIs); these PPIs serve as templates from which novel interactions between pathogen and host are inferred. Such comparative approaches typically make use of local sequence alignment, which, in the absence of structural details about the interfaces mediating the template interactions, could lead to incorrect inferences, particularly when multi-domain proteins are involved. We propose leveraging the domain-domain interaction (DDI) information in PDB complexes to score and prioritize candidate PPIs between host and pathogen proteomes based on targeted sequence-level comparisons. Our method picks out a small set of human-MTB protein pairs as candidates for physical interactions, and the use of functional meta-data suggests that some of them could contribute to the in vivo molecular cross-talk between pathogen and host that regulates the course of the infection. Further, we present numerical data for Pfam domain families that highlights interaction specificity on the domain level. Not every instance of a pair of domains, for which interaction evidence has been found in a few instances (i.e. structures), is likely to functionally interact. Our sorting approach scores candidates according to how "distant" they are in sequence space from known examples of DDIs (templates). Thus, it provides a natural way to deal with the heterogeneity in domain-level interactions. Our method represents a more informed application of local alignment to the sequence-based search for potential human-microbial interactions that uses available PPI data as a prior. Our approach is somewhat limited in its sensitivity by the restricted size and
Molecular imaging of drug-modulated protein-protein interactions in living subjects.
Paulmurugan, Ramasamy; Massoud, Tarik F; Huang, Jing; Gambhir, Sanjiv S
2004-03-15
Networks of protein interactions mediate cellular responses to environmental stimuli and direct the execution of many different cellular functional pathways. Small molecules synthesized within cells or recruited from the external environment mediate many protein interactions. The study of small molecule-mediated interactions of proteins is important to understand abnormal signal transduction pathways in cancer and in drug development and validation. In this study, we used split synthetic renilla luciferase (hRLUC) protein fragment-assisted complementation to evaluate heterodimerization of the human proteins FRB and FKBP12 mediated by the small molecule rapamycin. The concentration of rapamycin required for efficient dimerization and that of its competitive binder ascomycin required for dimerization inhibition were studied in cell lines. The system was dually modulated in cell culture at the transcription level, by controlling nuclear factor kappaB promoter/enhancer elements using tumor necrosis factor alpha, and at the interaction level, by controlling the concentration of the dimerizer rapamycin. The rapamycin-mediated dimerization of FRB and FKBP12 also was studied in living mice by locating, quantifying, and timing the hRLUC complementation-based bioluminescence imaging signal using a cooled charged coupled device camera. This split reporter system can be used to efficiently screen small molecule drugs that modulate protein-protein interactions and also to assess drugs in living animals. Both are essential steps in the preclinical evaluation of candidate pharmaceutical agents targeting protein-protein interactions, including signaling pathways in cancer cells.
Efficient extraction of protein-protein interactions from full-text articles.
Hakenberg, Jörg; Leaman, Robert; Vo, Nguyen Ha; Jonnalagadda, Siddhartha; Sullivan, Ryan; Miller, Christopher; Tari, Luis; Baral, Chitta; Gonzalez, Graciela
2010-01-01
Proteins and their interactions govern virtually all cellular processes, such as regulation, signaling, metabolism, and structure. Most experimental findings pertaining to such interactions are discussed in research papers, which, in turn, get curated by protein interaction databases. Authors, editors, and publishers benefit from efforts to alleviate the tasks of searching for relevant papers, evidence for physical interactions, and proper identifiers for each protein involved. The BioCreative II.5 community challenge addressed these tasks in a competition-style assessment to evaluate and compare different methodologies, to make aware of the increasing accuracy of automated methods, and to guide future implementations. In this paper, we present our approaches for protein-named entity recognition, including normalization, and for extraction of protein-protein interactions from full text. Our overall goal is to identify efficient individual components, and we compare various compositions to handle a single full-text article in between 10 seconds and 2 minutes. We propose strategies to transfer document-level annotations to the sentence-level, which allows for the creation of a more fine-grained training corpus; we use this corpus to automatically derive around 5,000 patterns. We rank sentences by relevance to the task of finding novel interactions with physical evidence, using a sentence classifier built from this training corpus. Heuristics for paraphrasing sentences help to further remove unnecessary information that might interfere with patterns, such as additional adjectives, clauses, or bracketed expressions. In BioCreative II.5, we achieved an f-score of 22 percent for finding protein interactions, and 43 percent for mapping proteins to UniProt IDs; disregarding species, f-scores are 30 percent and 55 percent, respectively. On average, our best-performing setup required around 2 minutes per full text. All data and pattern sets as well as Java classes that
A selection that reports on protein-protein interactions within a thermophilic bacterium.
Nguyen, Peter Q; Silberg, Jonathan J
2010-07-01
Many proteins can be split into fragments that exhibit enhanced function upon fusion to interacting proteins. While this strategy has been widely used to create protein-fragment complementation assays (PCAs) for discovering protein-protein interactions within mesophilic organisms, similar assays have not yet been developed for studying natural and engineered protein complexes at the temperatures where thermophilic microbes grow. We describe the development of a selection for protein-protein interactions within Thermus thermophilus that is based upon growth complementation by fragments of Thermotoga neapolitana adenylate kinase (AK(Tn)). Complementation studies with an engineered thermophile (PQN1) that is not viable above 75 degrees C because its adk gene has been replaced by a Geobacillus stearothermophilus ortholog revealed that growth could be restored at 78 degrees C by a vector that coexpresses polypeptides corresponding to residues 1-79 and 80-220 of AK(Tn). In contrast, PQN1 growth was not complemented by AK(Tn) fragments harboring a C156A mutation within the zinc-binding tetracysteine motif unless these fragments were fused to Thermotoga maritima chemotaxis proteins that heterodimerize (CheA and CheY) or homodimerize (CheX). This enhanced complementation is interpreted as arising from chemotaxis protein-protein interactions, since AK(Tn)-C156A fragments having only one polypeptide fused to a chemotaxis protein did not complement PQN1 to the same extent. This selection increases the maximum temperature where a PCA can be used to engineer thermostable protein complexes and to map protein-protein interactions.
Directory of Open Access Journals (Sweden)
Ji-Yong An
2016-05-01
Full Text Available Protein-Protein Interactions (PPIs play essential roles in most cellular processes. Knowledge of PPIs is becoming increasingly more important, which has prompted the development of technologies that are capable of discovering large-scale PPIs. Although many high-throughput biological technologies have been proposed to detect PPIs, there are unavoidable shortcomings, including cost, time intensity, and inherently high false positive and false negative rates. For the sake of these reasons, in silico methods are attracting much attention due to their good performances in predicting PPIs. In this paper, we propose a novel computational method known as RVM-AB that combines the Relevance Vector Machine (RVM model and Average Blocks (AB to predict PPIs from protein sequences. The main improvements are the results of representing protein sequences using the AB feature representation on a Position Specific Scoring Matrix (PSSM, reducing the influence of noise using a Principal Component Analysis (PCA, and using a Relevance Vector Machine (RVM based classifier. We performed five-fold cross-validation experiments on yeast and Helicobacter pylori datasets, and achieved very high accuracies of 92.98% and 95.58% respectively, which is significantly better than previous works. In addition, we also obtained good prediction accuracies of 88.31%, 89.46%, 91.08%, 91.55%, and 94.81% on other five independent datasets C. elegans, M. musculus, H. sapiens, H. pylori, and E. coli for cross-species prediction. To further evaluate the proposed method, we compare it with the state-of-the-art support vector machine (SVM classifier on the yeast dataset. The experimental results demonstrate that our RVM-AB method is obviously better than the SVM-based method. The promising experimental results show the efficiency and simplicity of the proposed method, which can be an automatic decision support tool. To facilitate extensive studies for future proteomics research, we developed
Intercellular protein-protein interactions at synapses.
Yang, Xiaofei; Hou, Dongmei; Jiang, Wei; Zhang, Chen
2014-06-01
Chemical synapses are asymmetric intercellular junctions through which neurons send nerve impulses to communicate with other neurons or excitable cells. The appropriate formation of synapses, both spatially and temporally, is essential for brain function and depends on the intercellular protein-protein interactions of cell adhesion molecules (CAMs) at synaptic clefts. The CAM proteins link pre- and post-synaptic sites, and play essential roles in promoting synapse formation and maturation, maintaining synapse number and type, accumulating neurotransmitter receptors and ion channels, controlling neuronal differentiation, and even regulating synaptic plasticity directly. Alteration of the interactions of CAMs leads to structural and functional impairments, which results in many neurological disorders, such as autism, Alzheimer's disease and schizophrenia. Therefore, it is crucial to understand the functions of CAMs during development and in the mature neural system, as well as in the pathogenesis of some neurological disorders. Here, we review the function of the major classes of CAMs, and how dysfunction of CAMs relates to several neurological disorders.
Protein Annotation from Protein Interaction Networks and Gene Ontology
Nguyen, Cao D.; Gardiner, Katheleen J.; Cios, Krzysztof J.
2011-01-01
We introduce a novel method for annotating protein function that combines Naïve Bayes and association rules, and takes advantage of the underlying topology in protein interaction networks and the structure of graphs in the Gene Ontology. We apply our method to proteins from the Human Protein Reference Database (HPRD) and show that, in comparison with other approaches, it predicts protein functions with significantly higher recall with no loss of precision. Specifically, it achieves 51% precis...
From nonspecific DNA-protein encounter complexes to the prediction of DNA-protein interactions.
Directory of Open Access Journals (Sweden)
Mu Gao
2009-03-01
Full Text Available DNA-protein interactions are involved in many essential biological activities. Because there is no simple mapping code between DNA base pairs and protein amino acids, the prediction of DNA-protein interactions is a challenging problem. Here, we present a novel computational approach for predicting DNA-binding protein residues and DNA-protein interaction modes without knowing its specific DNA target sequence. Given the structure of a DNA-binding protein, the method first generates an ensemble of complex structures obtained by rigid-body docking with a nonspecific canonical B-DNA. Representative models are subsequently selected through clustering and ranking by their DNA-protein interfacial energy. Analysis of these encounter complex models suggests that the recognition sites for specific DNA binding are usually favorable interaction sites for the nonspecific DNA probe and that nonspecific DNA-protein interaction modes exhibit some similarity to specific DNA-protein binding modes. Although the method requires as input the knowledge that the protein binds DNA, in benchmark tests, it achieves better performance in identifying DNA-binding sites than three previously established methods, which are based on sophisticated machine-learning techniques. We further apply our method to protein structures predicted through modeling and demonstrate that our method performs satisfactorily on protein models whose root-mean-square Calpha deviation from native is up to 5 A from their native structures. This study provides valuable structural insights into how a specific DNA-binding protein interacts with a nonspecific DNA sequence. The similarity between the specific DNA-protein interaction mode and nonspecific interaction modes may reflect an important sampling step in search of its specific DNA targets by a DNA-binding protein.
Analysis of intraviral protein-protein interactions of the SARS coronavirus ORFeome.
Directory of Open Access Journals (Sweden)
Albrecht von Brunn
2007-05-01
Full Text Available The severe acute respiratory syndrome coronavirus (SARS-CoV genome is predicted to encode 14 functional open reading frames, leading to the expression of up to 30 structural and non-structural protein products. The functions of a large number of viral ORFs are poorly understood or unknown. In order to gain more insight into functions and modes of action and interaction of the different proteins, we cloned the viral ORFeome and performed a genome-wide analysis for intraviral protein interactions and for intracellular localization. 900 pairwise interactions were tested by yeast-two-hybrid matrix analysis, and more than 65 positive non-redundant interactions, including six self interactions, were identified. About 38% of interactions were subsequently confirmed by CoIP in mammalian cells. Nsp2, nsp8 and ORF9b showed a wide range of interactions with other viral proteins. Nsp8 interacts with replicase proteins nsp2, nsp5, nsp6, nsp7, nsp8, nsp9, nsp12, nsp13 and nsp14, indicating a crucial role as a major player within the replication complex machinery. It was shown by others that nsp8 is essential for viral replication in vitro, whereas nsp2 is not. We show that also accessory protein ORF9b does not play a pivotal role for viral replication, as it can be deleted from the virus displaying normal plaque sizes and growth characteristics in Vero cells. However, it can be expected to be important for the virus-host interplay and for pathogenicity, due to its large number of interactions, by enhancing the global stability of the SARS proteome network, or play some unrealized role in regulating protein-protein interactions. The interactions identified provide valuable material for future studies.
Integral UBL domain proteins: a family of proteasome interacting proteins
DEFF Research Database (Denmark)
Hartmann-Petersen, Rasmus; Gordon, Colin
2004-01-01
The family of ubiquitin-like (UBL) domain proteins (UDPs) comprises a conserved group of proteins involved in a multitude of different cellular activities. However, recent studies on UBL-domain proteins indicate that these proteins appear to share a common property in their ability to interact...
Hostetler, Jessica B.; Sharma, Sumana; Bartholdson, S. Josefin; Wright, Gavin J.; Fairhurst, Rick M.; Rayner, Julian C.
2015-01-01
Background A vaccine targeting Plasmodium vivax will be an essential component of any comprehensive malaria elimination program, but major gaps in our understanding of P. vivax biology, including the protein-protein interactions that mediate merozoite invasion of reticulocytes, hinder the search for candidate antigens. Only one ligand-receptor interaction has been identified, that between P. vivax Duffy Binding Protein (PvDBP) and the erythrocyte Duffy Antigen Receptor for Chemokines (DARC), and strain-specific immune responses to PvDBP make it a complex vaccine target. To broaden the repertoire of potential P. vivax merozoite-stage vaccine targets, we exploited a recent breakthrough in expressing full-length ectodomains of Plasmodium proteins in a functionally-active form in mammalian cells and initiated a large-scale study of P. vivax merozoite proteins that are potentially involved in reticulocyte binding and invasion. Methodology/Principal Findings We selected 39 P. vivax proteins that are predicted to localize to the merozoite surface or invasive secretory organelles, some of which show homology to P. falciparum vaccine candidates. Of these, we were able to express 37 full-length protein ectodomains in a mammalian expression system, which has been previously used to express P. falciparum invasion ligands such as PfRH5. To establish whether the expressed proteins were correctly folded, we assessed whether they were recognized by antibodies from Cambodian patients with acute vivax malaria. IgG from these samples showed at least a two-fold change in reactivity over naïve controls in 27 of 34 antigens tested, and the majority showed heat-labile IgG immunoreactivity, suggesting the presence of conformation-sensitive epitopes and native tertiary protein structures. Using a method specifically designed to detect low-affinity, extracellular protein-protein interactions, we confirmed a predicted interaction between P. vivax 6-cysteine proteins P12 and P41, further
On the role of electrostatics on protein-protein interactions
Zhang, Zhe; Witham, Shawn; Alexov, Emil
2011-01-01
The role of electrostatics on protein-protein interactions and binding is reviewed in this article. A brief outline of the computational modeling, in the framework of continuum electrostatics, is presented and basic electrostatic effects occurring upon the formation of the complex are discussed. The role of the salt concentration and pH of the water phase on protein-protein binding free energy is demonstrated and indicates that the increase of the salt concentration tends to weaken the binding, an observation that is attributed to the optimization of the charge-charge interactions across the interface. It is pointed out that the pH-optimum (pH of optimal binding affinity) varies among the protein-protein complexes, and perhaps is a result of their adaptation to particular subcellular compartment. At the end, the similarities and differences between hetero- and homo-complexes are outlined and discussed with respect to the binding mode and charge complementarity. PMID:21572182
Directory of Open Access Journals (Sweden)
Shuxian Li
2018-04-01
Full Text Available Phomopsis longicolla T. W. Hobbs (syn. Diaporthe longicolla is the primary cause of Phomopsis seed decay (PSD in soybean, Glycine max (L. Merrill. This disease results in poor seed quality and is one of the most economically important seed diseases in soybean. The objectives of this study were to infer protein–protein interactions (PPI and to identify conserved global networks and pathogenicity subnetworks in P. longicolla including orthologous pathways for cell signaling and pathogenesis. The interlog method used in the study identified 215,255 unique PPIs among 3,868 proteins. There were 1,414 pathogenicity related genes in P. longicolla identified using the pathogen host interaction (PHI database. Additionally, 149 plant cell wall degrading enzymes (PCWDE were detected. The network captured five different classes of carbohydrate degrading enzymes, including the auxiliary activities, carbohydrate esterases, glycoside hydrolases, glycosyl transferases, and carbohydrate binding molecules. From the PPI analysis, novel interacting partners were determined for each of the PCWDE classes. The most predominant class of PCWDE was a group of 60 glycoside hydrolases proteins. The glycoside hydrolase subnetwork was found to be interacting with 1,442 proteins within the network and was among the largest clusters. The orthologous proteins FUS3, HOG, CYP1, SGE1, and the g5566t.1 gene identified in this study could play an important role in pathogenicity. Therefore, the P. longicolla protein interactome (PiPhom generated in this study can lead to a better understanding of PPIs in soybean pathogens. Furthermore, the PPI may aid in targeting of genes and proteins for further studies of the pathogenicity mechanisms.
Identification of structural protein-protein interactions of herpes simplex virus type 1.
Lee, Jin H; Vittone, Valerio; Diefenbach, Eve; Cunningham, Anthony L; Diefenbach, Russell J
2008-09-01
In this study we have defined protein-protein interactions between the structural proteins of herpes simplex virus type 1 (HSV-1) using a LexA yeast two-hybrid system. The majority of the capsid, tegument and envelope proteins of HSV-1 were screened in a matrix approach. A total of 40 binary interactions were detected including 9 out of 10 previously identified tegument-tegument interactions (Vittone, V., Diefenbach, E., Triffett, D., Douglas, M.W., Cunningham, A.L., and Diefenbach, R.J., 2005. Determination of interactions between tegument proteins of herpes simplex virus type 1. J. Virol. 79, 9566-9571). A total of 12 interactions involving the capsid protein pUL35 (VP26) and 11 interactions involving the tegument protein pUL46 (VP11/12) were identified. The most significant novel interactions detected in this study, which are likely to play a role in viral assembly, include pUL35-pUL37 (capsid-tegument), pUL46-pUL37 (tegument-tegument) and pUL49 (VP22)-pUS9 (tegument-envelope). This information will provide further insights into the pathways of HSV-1 assembly and the identified interactions are potential targets for new antiviral drugs.
Yeast Interacting Proteins Database: YPR103W, YOR047C [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available tein involved in control of glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose sensors...gulated gene expression; interacts with protein kinase Snf1p, glucose sensors Snf
Protein-protein interactions within late pre-40S ribosomes.
Directory of Open Access Journals (Sweden)
Melody G Campbell
2011-01-01
Full Text Available Ribosome assembly in eukaryotic organisms requires more than 200 assembly factors to facilitate and coordinate rRNA transcription, processing, and folding with the binding of the ribosomal proteins. Many of these assembly factors bind and dissociate at defined times giving rise to discrete assembly intermediates, some of which have been partially characterized with regards to their protein and RNA composition. Here, we have analyzed the protein-protein interactions between the seven assembly factors bound to late cytoplasmic pre-40S ribosomes using recombinant proteins in binding assays. Our data show that these factors form two modules: one comprising Enp1 and the export adaptor Ltv1 near the beak structure, and the second comprising the kinase Rio2, the nuclease Nob1, and a regulatory RNA binding protein Dim2/Pno1 on the front of the head. The GTPase-like Tsr1 and the universally conserved methylase Dim1 are also peripherally connected to this second module. Additionally, in an effort to further define the locations for these essential proteins, we have analyzed the interactions between these assembly factors and six ribosomal proteins: Rps0, Rps3, Rps5, Rps14, Rps15 and Rps29. Together, these results and previous RNA-protein crosslinking data allow us to propose a model for the binding sites of these seven assembly factors. Furthermore, our data show that the essential kinase Rio2 is located at the center of the pre-ribosomal particle and interacts, directly or indirectly, with every other assembly factor, as well as three ribosomal proteins required for cytoplasmic 40S maturation. These data suggest that Rio2 could play a central role in regulating cytoplasmic maturation steps.
Exploiting conformational ensembles in modeling protein-protein interactions on the proteome scale
Kuzu, Guray; Gursoy, Attila; Nussinov, Ruth; Keskin, Ozlem
2013-01-01
Cellular functions are performed through protein-protein interactions; therefore, identification of these interactions is crucial for understanding biological processes. Recent studies suggest that knowledge-based approaches are more useful than ‘blind’ docking for modeling at large scales. However, a caveat of knowledge-based approaches is that they treat molecules as rigid structures. The Protein Data Bank (PDB) offers a wealth of conformations. Here, we exploited ensemble of the conformations in predictions by a knowledge-based method, PRISM. We tested ‘difficult’ cases in a docking-benchmark dataset, where the unbound and bound protein forms are structurally different. Considering alternative conformations for each protein, the percentage of successfully predicted interactions increased from ~26% to 66%, and 57% of the interactions were successfully predicted in an ‘unbiased’ scenario, in which data related to the bound forms were not utilized. If the appropriate conformation, or relevant template interface, is unavailable in the PDB, PRISM could not predict the interaction successfully. The pace of the growth of the PDB promises a rapid increase of ensemble conformations emphasizing the merit of such knowledge-based ensemble strategies for higher success rates in protein-protein interaction predictions on an interactome-scale. We constructed the structural network of ERK interacting proteins as a case study. PMID:23590674
Xu, Yu; Wang, Hong; Nussinov, Ruth; Ma, Buyong
2013-01-01
We constructed and simulated a ‘minimal proteome’ model using Langevin dynamics. It contains 206 essential protein types which were compiled from the literature. For comparison, we generated six proteomes with randomized concentrations. We found that the net charges and molecular weights of the proteins in the minimal genome are not random. The net charge of a protein decreases linearly with molecular weight, with small proteins being mostly positively charged and large proteins negatively charged. The protein copy numbers in the minimal genome have the tendency to maximize the number of protein-protein interactions in the network. Negatively charged proteins which tend to have larger sizes can provide large collision cross-section allowing them to interact with other proteins; on the other hand, the smaller positively charged proteins could have higher diffusion speed and are more likely to collide with other proteins. Proteomes with random charge/mass populations form less stable clusters than those with experimental protein copy numbers. Our study suggests that ‘proper’ populations of negatively and positively charged proteins are important for maintaining a protein-protein interaction network in a proteome. It is interesting to note that the minimal genome model based on the charge and mass of E. Coli may have a larger protein-protein interaction network than that based on the lower organism M. pneumoniae. PMID:23420643
Directory of Open Access Journals (Sweden)
Eric A Yen
2014-05-01
Full Text Available Protein complexes are not static, but rather highly dynamic with subunits that undergo 1-dimensional diffusion with respect to each other. Interactions within protein complexes are modulated through regulatory inputs that alter interactions and introduce new components and deplete existing components through exchange. While it is clear that the structure and function of any given protein complex is coupled to its dynamical properties, it remains a challenge to predict the possible conformations that complexes can adopt. Protein-fragment Complementation Assays detect physical interactions between protein pairs constrained to ≤8 nm from each other in living cells. This method has been used to build networks composed of 1000s of pair-wise interactions. Significantly, these networks contain a wealth of dynamic information, as the assay is fully reversible and the proteins are expressed in their natural context. In this study, we describe a method that extracts this valuable information in the form of predicted conformations, allowing the user to explore the conformational landscape, to search for structures that correlate with an activity state, and estimate the abundance of conformations in the living cell. The generator is based on a Markov Chain Monte Carlo simulation that uses the interaction dataset as input and is constrained by the physical resolution of the assay. We applied this method to an 18-member protein complex composed of the seven core proteins of the budding yeast Arp2/3 complex and 11 associated regulators and effector proteins. We generated 20,480 output structures and identified conformational states using principle component analysis. We interrogated the conformation landscape and found evidence of symmetry breaking, a mixture of likely active and inactive conformational states and dynamic exchange of the core protein Arc15 between core and regulatory components. Our method provides a novel tool for prediction and
A new protein-protein interaction sensor based on tripartite split-GFP association.
Cabantous, Stéphanie; Nguyen, Hau B; Pedelacq, Jean-Denis; Koraïchi, Faten; Chaudhary, Anu; Ganguly, Kumkum; Lockard, Meghan A; Favre, Gilles; Terwilliger, Thomas C; Waldo, Geoffrey S
2013-10-04
Monitoring protein-protein interactions in living cells is key to unraveling their roles in numerous cellular processes and various diseases. Previously described split-GFP based sensors suffer from poor folding and/or self-assembly background fluorescence. Here, we have engineered a micro-tagging system to monitor protein-protein interactions in vivo and in vitro. The assay is based on tripartite association between two twenty amino-acids long GFP tags, GFP10 and GFP11, fused to interacting protein partners, and the complementary GFP1-9 detector. When proteins interact, GFP10 and GFP11 self-associate with GFP1-9 to reconstitute a functional GFP. Using coiled-coils and FRB/FKBP12 model systems we characterize the sensor in vitro and in Escherichia coli. We extend the studies to mammalian cells and examine the FK-506 inhibition of the rapamycin-induced association of FRB/FKBP12. The small size of these tags and their minimal effect on fusion protein behavior and solubility should enable new experiments for monitoring protein-protein association by fluorescence.
Cui, Boyu; Wang, Yao; Song, Yunhong; Wang, Tietao; Li, Changfu; Wei, Yahong; Luo, Zhao-Qing; Shen, Xihui
2014-05-20
Protein-protein interactions are important for virtually every biological process, and a number of elegant approaches have been designed to detect and evaluate such interactions. However, few of these methods allow the detection of dynamic and real-time protein-protein interactions in bacteria. Here we describe a bioluminescence resonance energy transfer (BRET) system based on the bacterial luciferase LuxAB. We found that enhanced yellow fluorescent protein (eYFP) accepts the emission from LuxAB and emits yellow fluorescence. Importantly, BRET occurred when LuxAB and eYFP were fused, respectively, to the interacting protein pair FlgM and FliA. Furthermore, we observed sirolimus (i.e., rapamycin)-inducible interactions between FRB and FKBP12 and a dose-dependent abolishment of such interactions by FK506, the ligand of FKBP12. Using this system, we showed that osmotic stress or low pH efficiently induced multimerization of the regulatory protein OmpR and that the multimerization induced by low pH can be reversed by a neutralizing agent, further indicating the usefulness of this system in the measurement of dynamic interactions. This method can be adapted to analyze dynamic protein-protein interactions and the importance of such interactions in bacterial processes such as development and pathogenicity. Real-time measurement of protein-protein interactions in prokaryotes is highly desirable for determining the roles of protein complex in the development or virulence of bacteria, but methods that allow such measurement are not available. Here we describe the development of a bioluminescence resonance energy transfer (BRET) technology that meets this need. The use of endogenous excitation light in this strategy circumvents the requirement for the sophisticated instrument demanded by standard fluorescence resonance energy transfer (FRET). Furthermore, because the LuxAB substrate decanal is membrane permeable, the assay can be performed without lysing the bacterial cells
Directory of Open Access Journals (Sweden)
Ji'an Pan
Full Text Available Analyses of viral protein-protein interactions are an important step to understand viral protein functions and their underlying molecular mechanisms. In this study, we adopted a mammalian two-hybrid system to screen the genome-wide intraviral protein-protein interactions of SARS coronavirus (SARS-CoV and therefrom revealed a number of novel interactions which could be partly confirmed by in vitro biochemical assays. Three pairs of the interactions identified were detected in both directions: non-structural protein (nsp 10 and nsp14, nsp10 and nsp16, and nsp7 and nsp8. The interactions between the multifunctional nsp10 and nsp14 or nsp16, which are the unique proteins found in the members of Nidovirales with large RNA genomes including coronaviruses and toroviruses, may have important implication for the mechanisms of replication/transcription complex assembly and functions of these viruses. Using a SARS-CoV replicon expressing a luciferase reporter under the control of a transcription regulating sequence, it has been shown that several viral proteins (N, X and SUD domains of nsp3, and nsp12 provided in trans stimulated the replicon reporter activity, indicating that these proteins may regulate coronavirus replication and transcription. Collectively, our findings provide a basis and platform for further characterization of the functions and mechanisms of coronavirus proteins.
A Mesoscopic Model for Protein-Protein Interactions in Solution
Lund, Mikael; Jönsson, Bo
2003-01-01
Protein self-association may be detrimental in biological systems, but can be utilized in a controlled fashion for protein crystallization. It is hence of considerable interest to understand how factors like solution conditions prevent or promote aggregation. Here we present a computational model describing interactions between protein molecules in solution. The calculations are based on a molecular description capturing the detailed structure of the protein molecule using x-ray or nuclear ma...
Yeast Interacting Proteins Database: YMR280C, YOR047C [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available olved in control of glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose sensor... glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, an
Yeast Interacting Proteins Database: YOR047C, YKL038W [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available racts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt15p; acts as a...Bait description Protein involved in control of glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose senso...rs Snf3p and Rgt2p, and TATA-binding protein Spt15p; acts as a regulator of the tra
Yeast Interacting Proteins Database: YFR049W, YOR047C [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt15p; acts as a regulator... (0) YOR047C STD1 Protein involved in control of glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose sens...ors Snf3p and Rgt2p, and TATA-binding protein Spt15p; ac
PSAIA – Protein Structure and Interaction Analyzer
Directory of Open Access Journals (Sweden)
Vlahoviček Kristian
2008-04-01
Full Text Available Abstract Background PSAIA (Protein Structure and Interaction Analyzer was developed to compute geometric parameters for large sets of protein structures in order to predict and investigate protein-protein interaction sites. Results In addition to most relevant established algorithms, PSAIA offers a new method PIADA (Protein Interaction Atom Distance Algorithm for the determination of residue interaction pairs. We found that PIADA produced more satisfactory results than comparable algorithms implemented in PSAIA. Particular advantages of PSAIA include its capacity to combine different methods to detect the locations and types of interactions between residues and its ability, without any further automation steps, to handle large numbers of protein structures and complexes. Generally, the integration of a variety of methods enables PSAIA to offer easier automation of analysis and greater reliability of results. PSAIA can be used either via a graphical user interface or from the command-line. Results are generated in either tabular or XML format. Conclusion In a straightforward fashion and for large sets of protein structures, PSAIA enables the calculation of protein geometric parameters and the determination of location and type for protein-protein interaction sites. XML formatted output enables easy conversion of results to various formats suitable for statistic analysis. Results from smaller data sets demonstrated the influence of geometry on protein interaction sites. Comprehensive analysis of properties of large data sets lead to new information useful in the prediction of protein-protein interaction sites.
Yamniuk, Aaron P; Newitt, John A; Doyle, Michael L; Arisaka, Fumio; Giannetti, Anthony M; Hensley, Preston; Myszka, David G; Schwarz, Fred P; Thomson, James A; Eisenstein, Edward
2015-12-01
A significant challenge in the molecular interaction field is to accurately determine the stoichiometry and stepwise binding affinity constants for macromolecules having >1 binding site. The mission of the Molecular Interactions Research Group (MIRG) of the Association of Biomolecular Resource Facilities (ABRF) is to show how biophysical technologies are used to quantitatively characterize molecular interactions, and to educate the ABRF members and scientific community on the utility and limitations of core technologies [such as biosensor, microcalorimetry, or analytic ultracentrifugation (AUC)]. In the present work, the MIRG has developed a robust model protein interaction pair consisting of a bivalent variant of the Bacillus amyloliquefaciens extracellular RNase barnase and a variant of its natural monovalent intracellular inhibitor protein barstar. It is demonstrated that this system can serve as a benchmarking tool for the quantitative analysis of 2-site protein-protein interactions. The protein interaction pair enables determination of precise binding constants for the barstar protein binding to 2 distinct sites on the bivalent barnase binding partner (termed binase), where the 2 binding sites were engineered to possess affinities that differed by 2 orders of magnitude. Multiple MIRG laboratories characterized the interaction using isothermal titration calorimetry (ITC), AUC, and surface plasmon resonance (SPR) methods to evaluate the feasibility of the system as a benchmarking model. Although general agreement was seen for the binding constants measured using solution-based ITC and AUC approaches, weaker affinity was seen for surface-based method SPR, with protein immobilization likely affecting affinity. An analysis of the results from multiple MIRG laboratories suggests that the bivalent barnase-barstar system is a suitable model for benchmarking new approaches for the quantitative characterization of complex biomolecular interactions.
Transcription Factor Functional Protein-Protein Interactions in Plant Defense Responses
Directory of Open Access Journals (Sweden)
Murilo S. Alves
2014-03-01
Full Text Available Responses to biotic stress in plants lead to dramatic reprogramming of gene expression, favoring stress responses at the expense of normal cellular functions. Transcription factors are master regulators of gene expression at the transcriptional level, and controlling the activity of these factors alters the transcriptome of the plant, leading to metabolic and phenotypic changes in response to stress. The functional analysis of interactions between transcription factors and other proteins is very important for elucidating the role of these transcriptional regulators in different signaling cascades. In this review, we present an overview of protein-protein interactions for the six major families of transcription factors involved in plant defense: basic leucine zipper containing domain proteins (bZIP, amino-acid sequence WRKYGQK (WRKY, myelocytomatosis related proteins (MYC, myeloblastosis related proteins (MYB, APETALA2/ ETHYLENE-RESPONSIVE ELEMENT BINDING FACTORS (AP2/EREBP and no apical meristem (NAM, Arabidopsis transcription activation factor (ATAF, and cup-shaped cotyledon (CUC (NAC. We describe the interaction partners of these transcription factors as molecular responses during pathogen attack and the key components of signal transduction pathways that take place during plant defense responses. These interactions determine the activation or repression of response pathways and are crucial to understanding the regulatory networks that modulate plant defense responses.
Protein-protein interaction inference based on semantic similarity of Gene Ontology terms.
Zhang, Shu-Bo; Tang, Qiang-Rong
2016-07-21
Identifying protein-protein interactions is important in molecular biology. Experimental methods to this issue have their limitations, and computational approaches have attracted more and more attentions from the biological community. The semantic similarity derived from the Gene Ontology (GO) annotation has been regarded as one of the most powerful indicators for protein interaction. However, conventional methods based on GO similarity fail to take advantage of the specificity of GO terms in the ontology graph. We proposed a GO-based method to predict protein-protein interaction by integrating different kinds of similarity measures derived from the intrinsic structure of GO graph. We extended five existing methods to derive the semantic similarity measures from the descending part of two GO terms in the GO graph, then adopted a feature integration strategy to combines both the ascending and the descending similarity scores derived from the three sub-ontologies to construct various kinds of features to characterize each protein pair. Support vector machines (SVM) were employed as discriminate classifiers, and five-fold cross validation experiments were conducted on both human and yeast protein-protein interaction datasets to evaluate the performance of different kinds of integrated features, the experimental results suggest the best performance of the feature that combines information from both the ascending and the descending parts of the three ontologies. Our method is appealing for effective prediction of protein-protein interaction. Copyright © 2016 Elsevier Ltd. All rights reserved.
NatalieQ: A web server for protein-protein interaction network querying
El-Kebir, M.; Brandt, B.W.; Heringa, J.; Klau, G.W.
2014-01-01
Background Molecular interactions need to be taken into account to adequately model the complex behavior of biological systems. These interactions are captured by various types of biological networks, such as metabolic, gene-regulatory, signal transduction and protein-protein interaction networks.
Directory of Open Access Journals (Sweden)
Vandepoele Klaas
2009-06-01
Full Text Available Abstract Background Large-scale identification of the interrelationships between different components of the cell, such as the interactions between proteins, has recently gained great interest. However, unraveling large-scale protein-protein interaction maps is laborious and expensive. Moreover, assessing the reliability of the interactions can be cumbersome. Results In this study, we have developed a computational method that exploits the existing knowledge on protein-protein interactions in diverse species through orthologous relations on the one hand, and functional association data on the other hand to predict and filter protein-protein interactions in Arabidopsis thaliana. A highly reliable set of protein-protein interactions is predicted through this integrative approach making use of existing protein-protein interaction data from yeast, human, C. elegans and D. melanogaster. Localization, biological process, and co-expression data are used as powerful indicators for protein-protein interactions. The functional repertoire of the identified interactome reveals interactions between proteins functioning in well-conserved as well as plant-specific biological processes. We observe that although common mechanisms (e.g. actin polymerization and components (e.g. ARPs, actin-related proteins exist between different lineages, they are active in specific processes such as growth, cancer metastasis and trichome development in yeast, human and Arabidopsis, respectively. Conclusion We conclude that the integration of orthology with functional association data is adequate to predict protein-protein interactions. Through this approach, a high number of novel protein-protein interactions with diverse biological roles is discovered. Overall, we have predicted a reliable set of protein-protein interactions suitable for further computational as well as experimental analyses.
DeBlasio, Stacy L; Johnson, Richard; Sweeney, Michelle M; Karasev, Alexander; Gray, Stewart M; MacCoss, Michael J; Cilia, Michelle
2015-06-01
Potato leafroll virus (PLRV) produces a readthrough protein (RTP) via translational readthrough of the coat protein amber stop codon. The RTP functions as a structural component of the virion and as a nonincorporated protein in concert with numerous insect and plant proteins to regulate virus movement/transmission and tissue tropism. Affinity purification coupled to quantitative MS was used to generate protein interaction networks for a PLRV mutant that is unable to produce the read through domain (RTD) and compared to the known wild-type PLRV protein interaction network. By quantifying differences in the protein interaction networks, we identified four distinct classes of PLRV-plant interactions: those plant and nonstructural viral proteins interacting with assembled coat protein (category I); plant proteins in complex with both coat protein and RTD (category II); plant proteins in complex with the RTD (category III); and plant proteins that had higher affinity for virions lacking the RTD (category IV). Proteins identified as interacting with the RTD are potential candidates for regulating viral processes that are mediated by the RTP such as phloem retention and systemic movement and can potentially be useful targets for the development of strategies to prevent infection and/or viral transmission of Luteoviridae species that infect important crop species. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Protein annotation from protein interaction networks and Gene Ontology.
Nguyen, Cao D; Gardiner, Katheleen J; Cios, Krzysztof J
2011-10-01
We introduce a novel method for annotating protein function that combines Naïve Bayes and association rules, and takes advantage of the underlying topology in protein interaction networks and the structure of graphs in the Gene Ontology. We apply our method to proteins from the Human Protein Reference Database (HPRD) and show that, in comparison with other approaches, it predicts protein functions with significantly higher recall with no loss of precision. Specifically, it achieves 51% precision and 60% recall versus 45% and 26% for Majority and 24% and 61% for χ²-statistics, respectively. Copyright © 2011 Elsevier Inc. All rights reserved.
Wang, Hua; Huang, Heng; Ding, Chris; Nie, Feiping
2013-04-01
Protein interactions are central to all the biological processes and structural scaffolds in living organisms, because they orchestrate a number of cellular processes such as metabolic pathways and immunological recognition. Several high-throughput methods, for example, yeast two-hybrid system and mass spectrometry method, can help determine protein interactions, which, however, suffer from high false-positive rates. Moreover, many protein interactions predicted by one method are not supported by another. Therefore, computational methods are necessary and crucial to complete the interactome expeditiously. In this work, we formulate the problem of predicting protein interactions from a new mathematical perspective--sparse matrix completion, and propose a novel nonnegative matrix factorization (NMF)-based matrix completion approach to predict new protein interactions from existing protein interaction networks. Through using manifold regularization, we further develop our method to integrate different biological data sources, such as protein sequences, gene expressions, protein structure information, etc. Extensive experimental results on four species, Saccharomyces cerevisiae, Drosophila melanogaster, Homo sapiens, and Caenorhabditis elegans, have shown that our new methods outperform related state-of-the-art protein interaction prediction methods.
Deciphering peculiar protein-protein interacting modules in Deinococcus radiodurans
Directory of Open Access Journals (Sweden)
Barkallah Insaf
2009-04-01
Full Text Available Abstract Interactomes of proteins under positive selection from ionizing-radiation-resistant bacteria (IRRB might be a part of the answer to the question as to how IRRB, particularly Deinococcus radiodurans R1 (Deira, resist ionizing radiation. Here, using the Database of Interacting Proteins (DIP and the Protein Structural Interactome (PSI-base server for PSI map, we have predicted novel interactions of orthologs of the 58 proteins under positive selection in Deira and other IRRB, but which are absent in IRSB. Among these, 18 domains and their interactomes have been identified in DNA checkpoint and repair; kinases pathways; energy and nucleotide metabolisms were the important biological processes that were found to be involved. This finding provides new clues to the cellular pathways that can to be important for ionizing-radiation resistance in Deira.
Building blocks for protein interaction devices
Grünberg, Raik; Ferrar, Tony S.; van der Sloot, Almer M.; Constante, Marco; Serrano, Luis
2010-01-01
Here, we propose a framework for the design of synthetic protein networks from modular protein–protein or protein–peptide interactions and provide a starter toolkit of protein building blocks. Our proof of concept experiments outline a general work flow for part–based protein systems engineering. We streamlined the iterative BioBrick cloning protocol and assembled 25 synthetic multidomain proteins each from seven standardized DNA fragments. A systematic screen revealed two main factors controlling protein expression in Escherichia coli: obstruction of translation initiation by mRNA secondary structure or toxicity of individual domains. Eventually, 13 proteins were purified for further characterization. Starting from well-established biotechnological tools, two general–purpose interaction input and two readout devices were built and characterized in vitro. Constitutive interaction input was achieved with a pair of synthetic leucine zippers. The second interaction was drug-controlled utilizing the rapamycin-induced binding of FRB(T2098L) to FKBP12. The interaction kinetics of both devices were analyzed by surface plasmon resonance. Readout was based on Förster resonance energy transfer between fluorescent proteins and was quantified for various combinations of input and output devices. Our results demonstrate the feasibility of parts-based protein synthetic biology. Additionally, we identify future challenges and limitations of modular design along with approaches to address them. PMID:20215443
Filtering high-throughput protein-protein interaction data using a combination of genomic features
Directory of Open Access Journals (Sweden)
Patil Ashwini
2005-04-01
Full Text Available Abstract Background Protein-protein interaction data used in the creation or prediction of molecular networks is usually obtained from large scale or high-throughput experiments. This experimental data is liable to contain a large number of spurious interactions. Hence, there is a need to validate the interactions and filter out the incorrect data before using them in prediction studies. Results In this study, we use a combination of 3 genomic features – structurally known interacting Pfam domains, Gene Ontology annotations and sequence homology – as a means to assign reliability to the protein-protein interactions in Saccharomyces cerevisiae determined by high-throughput experiments. Using Bayesian network approaches, we show that protein-protein interactions from high-throughput data supported by one or more genomic features have a higher likelihood ratio and hence are more likely to be real interactions. Our method has a high sensitivity (90% and good specificity (63%. We show that 56% of the interactions from high-throughput experiments in Saccharomyces cerevisiae have high reliability. We use the method to estimate the number of true interactions in the high-throughput protein-protein interaction data sets in Caenorhabditis elegans, Drosophila melanogaster and Homo sapiens to be 27%, 18% and 68% respectively. Our results are available for searching and downloading at http://helix.protein.osaka-u.ac.jp/htp/. Conclusion A combination of genomic features that include sequence, structure and annotation information is a good predictor of true interactions in large and noisy high-throughput data sets. The method has a very high sensitivity and good specificity and can be used to assign a likelihood ratio, corresponding to the reliability, to each interaction.
Manipulating fatty acid biosynthesis in microalgae for biofuel through protein-protein interactions.
Directory of Open Access Journals (Sweden)
Jillian L Blatti
Full Text Available Microalgae are a promising feedstock for renewable fuels, and algal metabolic engineering can lead to crop improvement, thus accelerating the development of commercially viable biodiesel production from algae biomass. We demonstrate that protein-protein interactions between the fatty acid acyl carrier protein (ACP and thioesterase (TE govern fatty acid hydrolysis within the algal chloroplast. Using green microalga Chlamydomonas reinhardtii (Cr as a model, a structural simulation of docking CrACP to CrTE identifies a protein-protein recognition surface between the two domains. A virtual screen reveals plant TEs with similar in silico binding to CrACP. Employing an activity-based crosslinking probe designed to selectively trap transient protein-protein interactions between the TE and ACP, we demonstrate in vitro that CrTE must functionally interact with CrACP to release fatty acids, while TEs of vascular plants show no mechanistic crosslinking to CrACP. This is recapitulated in vivo, where overproduction of the endogenous CrTE increased levels of short-chain fatty acids and engineering plant TEs into the C. reinhardtii chloroplast did not alter the fatty acid profile. These findings highlight the critical role of protein-protein interactions in manipulating fatty acid biosynthesis for algae biofuel engineering as illuminated by activity-based probes.
Peng, Wei; Wang, Jianxin; Zhao, Bihai; Wang, Lusheng
2015-01-01
Protein complexes play a significant role in understanding the underlying mechanism of most cellular functions. Recently, many researchers have explored computational methods to identify protein complexes from protein-protein interaction (PPI) networks. One group of researchers focus on detecting local dense subgraphs which correspond to protein complexes by considering local neighbors. The drawback of this kind of approach is that the global information of the networks is ignored. Some methods such as Markov Clustering algorithm (MCL), PageRank-Nibble are proposed to find protein complexes based on random walk technique which can exploit the global structure of networks. However, these methods ignore the inherent core-attachment structure of protein complexes and treat adjacent node equally. In this paper, we design a weighted PageRank-Nibble algorithm which assigns each adjacent node with different probability, and propose a novel method named WPNCA to detect protein complex from PPI networks by using weighted PageRank-Nibble algorithm and core-attachment structure. Firstly, WPNCA partitions the PPI networks into multiple dense clusters by using weighted PageRank-Nibble algorithm. Then the cores of these clusters are detected and the rest of proteins in the clusters will be selected as attachments to form the final predicted protein complexes. The experiments on yeast data show that WPNCA outperforms the existing methods in terms of both accuracy and p-value. The software for WPNCA is available at "http://netlab.csu.edu.cn/bioinfomatics/weipeng/WPNCA/download.html".
Uncovering Viral Protein-Protein Interactions and their Role in Arenavirus Life Cycle
Directory of Open Access Journals (Sweden)
Nora López
2012-09-01
Full Text Available The Arenaviridae family includes widely distributed pathogens that cause severe hemorrhagic fever in humans. Replication and packaging of their single-stranded RNA genome involve RNA recognition by viral proteins and a number of key protein-protein interactions. Viral RNA synthesis is directed by the virus-encoded RNA dependent-RNA polymerase (L protein and requires viral RNA encapsidation by the Nucleoprotein. In addition to the role that the interaction between L and the Nucleoprotein may have in the replication process, polymerase activity appears to be modulated by the association between L and the small multifunctional Z protein. Z is also a structural component of the virions that plays an essential role in viral morphogenesis. Indeed, interaction of the Z protein with the Nucleoprotein is critical for genome packaging. Furthermore, current evidence suggests that binding between Z and the viral envelope glycoprotein complex is required for virion infectivity, and that Z homo-oligomerization is an essential step for particle assembly and budding. Efforts to understand the molecular basis of arenavirus life cycle have revealed important details on these viral protein-protein interactions that will be reviewed in this article.
Yeast Interacting Proteins Database: YGL127C, YOR047C [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available ith protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt15p; acts as a regula...rotein involved in control of glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose sensors
Directory of Open Access Journals (Sweden)
Shuaizheng Jia
Full Text Available The identification and quantitative analysis of protein-protein interactions are essential to the functional characterization of proteins in the post-proteomics era. The methods currently available are generally time-consuming, technically complicated, insensitive and/or semi-quantitative. The lack of simple, sensitive approaches to precisely quantify protein-protein interactions still prevents our understanding of the functions of many proteins. Here, we develop a novel dual luciferase reporter pull-down assay by combining a biotinylated Firefly luciferase pull-down assay with a dual luciferase reporter assay. The biotinylated Firefly luciferase-tagged protein enables rapid and efficient isolation of a putative Renilla luciferase-tagged binding protein from a relatively small amount of sample. Both of these proteins can be quantitatively detected using the dual luciferase reporter assay system. Protein-protein interactions, including Fos-Jun located in the nucleus; MAVS-TRAF3 in cytoplasm; inducible IRF3 dimerization; viral protein-regulated interactions, such as MAVS-MAVS and MAVS-TRAF3; IRF3 dimerization; and protein interaction domain mapping, are studied using this novel assay system. Herein, we demonstrate that this dual luciferase reporter pull-down assay enables the quantification of the relative amounts of interacting proteins that bind to streptavidin-coupled beads for protein purification. This study provides a simple, rapid, sensitive, and efficient approach to identify and quantify relative protein-protein interactions. Importantly, the dual luciferase reporter pull-down method will facilitate the functional determination of proteins.
NMR Studies of Protein Hydration and Protein-Ligand Interactions
Chong, Yuan
Water on the surface of a protein is called hydration water. Hydration water is known to play a crucial role in a variety of biological processes including protein folding, enzymatic activation, and drug binding. Although the significance of hydration water has been recognized, the underlying mechanism remains far from being understood. This dissertation employs a unique in-situ nuclear magnetic resonance (NMR) technique to study the mechanism of protein hydration and the role of hydration in alcohol-protein interactions. Water isotherms in proteins are measured at different temperatures via the in-situ NMR technique. Water is found to interact differently with hydrophilic and hydrophobic groups on the protein. Water adsorption on hydrophilic groups is hardly affected by the temperature, while water adsorption on hydrophobic groups strongly depends on the temperature around 10 C, below which the adsorption is substantially reduced. This effect is induced by the dramatic decrease in the protein flexibility below 10 C. Furthermore, nanosecond to microsecond protein dynamics and the free energy, enthalpy, and entropy of protein hydration are studied as a function of hydration level and temperature. A crossover at 10 C in protein dynamics and thermodynamics is revealed. The effect of water at hydrophilic groups on protein dynamics and thermodynamics shows little temperature dependence, whereas water at hydrophobic groups has stronger effect above 10 C. In addition, I investigate the role of water in alcohol binding to the protein using the in-situ NMR detection. The isotherms of alcohols are first measured on dry proteins, then on proteins with a series of controlled hydration levels. The free energy, enthalpy, and entropy of alcohol binding are also determined. Two distinct types of alcohol binding are identified. On the one hand, alcohols can directly bind to a few specific sites on the protein. This type of binding is independent of temperature and can be
Predicting the binding patterns of hub proteins: a study using yeast protein interaction networks.
Directory of Open Access Journals (Sweden)
Carson M Andorf
Full Text Available Protein-protein interactions are critical to elucidating the role played by individual proteins in important biological pathways. Of particular interest are hub proteins that can interact with large numbers of partners and often play essential roles in cellular control. Depending on the number of binding sites, protein hubs can be classified at a structural level as singlish-interface hubs (SIH with one or two binding sites, or multiple-interface hubs (MIH with three or more binding sites. In terms of kinetics, hub proteins can be classified as date hubs (i.e., interact with different partners at different times or locations or party hubs (i.e., simultaneously interact with multiple partners.Our approach works in 3 phases: Phase I classifies if a protein is likely to bind with another protein. Phase II determines if a protein-binding (PB protein is a hub. Phase III classifies PB proteins as singlish-interface versus multiple-interface hubs and date versus party hubs. At each stage, we use sequence-based predictors trained using several standard machine learning techniques.Our method is able to predict whether a protein is a protein-binding protein with an accuracy of 94% and a correlation coefficient of 0.87; identify hubs from non-hubs with 100% accuracy for 30% of the data; distinguish date hubs/party hubs with 69% accuracy and area under ROC curve of 0.68; and SIH/MIH with 89% accuracy and area under ROC curve of 0.84. Because our method is based on sequence information alone, it can be used even in settings where reliable protein-protein interaction data or structures of protein-protein complexes are unavailable to obtain useful insights into the functional and evolutionary characteristics of proteins and their interactions.We provide a web server for our three-phase approach: http://hybsvm.gdcb.iastate.edu.
Fragment-based drug discovery and protein–protein interactions
Directory of Open Access Journals (Sweden)
Turnbull AP
2014-09-01
Full Text Available Andrew P Turnbull,1 Susan M Boyd,2 Björn Walse31CRT Discovery Laboratories, Department of Biological Sciences, Birkbeck, University of London, London, UK; 2IOTA Pharmaceuticals Ltd, Cambridge, UK; 3SARomics Biostructures AB, Lund, SwedenAbstract: Protein–protein interactions (PPIs are involved in many biological processes, with an estimated 400,000 PPIs within the human proteome. There is significant interest in exploiting the relatively unexplored potential of these interactions in drug discovery, driven by the need to find new therapeutic targets. Compared with classical drug discovery against targets with well-defined binding sites, developing small-molecule inhibitors against PPIs where the contact surfaces are frequently more extensive and comparatively flat, with most of the binding energy localized in “hot spots”, has proven far more challenging. However, despite the difficulties associated with targeting PPIs, important progress has been made in recent years with fragment-based drug discovery playing a pivotal role in improving their tractability. Computational and empirical approaches can be used to identify hot-spot regions and assess the druggability and ligandability of new targets, whilst fragment screening campaigns can detect low-affinity fragments that either directly or indirectly perturb the PPI. Once fragment hits have been identified and confirmed using biochemical and biophysical approaches, three-dimensional structural data derived from nuclear magnetic resonance or X-ray crystallography can be used to drive medicinal chemistry efforts towards the development of more potent inhibitors. A small-scale comparison presented in this review of “standard” fragments with those targeting PPIs has revealed that the latter tend to be larger, be more lipophilic, and contain more polar (acid/base functionality, whereas three-dimensional descriptor data indicate that there is little difference in their three
Parallel force assay for protein-protein interactions.
Aschenbrenner, Daniela; Pippig, Diana A; Klamecka, Kamila; Limmer, Katja; Leonhardt, Heinrich; Gaub, Hermann E
2014-01-01
Quantitative proteome research is greatly promoted by high-resolution parallel format assays. A characterization of protein complexes based on binding forces offers an unparalleled dynamic range and allows for the effective discrimination of non-specific interactions. Here we present a DNA-based Molecular Force Assay to quantify protein-protein interactions, namely the bond between different variants of GFP and GFP-binding nanobodies. We present different strategies to adjust the maximum sensitivity window of the assay by influencing the binding strength of the DNA reference duplexes. The binding of the nanobody Enhancer to the different GFP constructs is compared at high sensitivity of the assay. Whereas the binding strength to wild type and enhanced GFP are equal within experimental error, stronger binding to superfolder GFP is observed. This difference in binding strength is attributed to alterations in the amino acids that form contacts according to the crystal structure of the initial wild type GFP-Enhancer complex. Moreover, we outline the potential for large-scale parallelization of the assay.
Kinome signaling through regulated protein-protein interactions in normal and cancer cells.
Pawson, Tony; Kofler, Michael
2009-04-01
The flow of molecular information through normal and oncogenic signaling pathways frequently depends on protein phosphorylation, mediated by specific kinases, and the selective binding of the resulting phosphorylation sites to interaction domains present on downstream targets. This physical and functional interplay of catalytic and interaction domains can be clearly seen in cytoplasmic tyrosine kinases such as Src, Abl, Fes, and ZAP-70. Although the kinase and SH2 domains of these proteins possess similar intrinsic properties of phosphorylating tyrosine residues or binding phosphotyrosine sites, they also undergo intramolecular interactions when linked together, in a fashion that varies from protein to protein. These cooperative interactions can have diverse effects on substrate recognition and kinase activity, and provide a variety of mechanisms to link the stimulation of catalytic activity to substrate recognition. Taken together, these data have suggested how protein kinases, and the signaling pathways in which they are embedded, can evolve complex properties through the stepwise linkage of domains within single polypeptides or multi-protein assemblies.
Directory of Open Access Journals (Sweden)
Zheng Huiru
2009-01-01
Full Text Available Abstract Background Information about protein interaction networks is fundamental to understanding protein function and cellular processes. Interaction patterns among proteins can suggest new drug targets and aid in the design of new therapeutic interventions. Efforts have been made to map interactions on a proteomic-wide scale using both experimental and computational techniques. Reference datasets that contain known interacting proteins (positive cases and non-interacting proteins (negative cases are essential to support computational prediction and validation of protein-protein interactions. Information on known interacting and non interacting proteins are usually stored within databases. Extraction of these data can be both complex and time consuming. Although, the automatic construction of reference datasets for classification is a useful resource for researchers no public resource currently exists to perform this task. Results GRIP (Gold Reference dataset constructor from Information on Protein complexes is a web-based system that provides researchers with the functionality to create reference datasets for protein-protein interaction prediction in Saccharomyces cerevisiae. Both positive and negative cases for a reference dataset can be extracted, organised and downloaded by the user. GRIP also provides an upload facility whereby users can submit proteins to determine protein complex membership. A search facility is provided where a user can search for protein complex information in Saccharomyces cerevisiae. Conclusion GRIP is developed to retrieve information on protein complex, cellular localisation, and physical and genetic interactions in Saccharomyces cerevisiae. Manual construction of reference datasets can be a time consuming process requiring programming knowledge. GRIP simplifies and speeds up this process by allowing users to automatically construct reference datasets. GRIP is free to access at http://rosalind.infj.ulst.ac.uk/GRIP/.
Dendrimer-protein interactions versus dendrimer-based nanomedicine.
Shcharbin, Dzmitry; Shcharbina, Natallia; Dzmitruk, Volha; Pedziwiatr-Werbicka, Elzbieta; Ionov, Maksim; Mignani, Serge; de la Mata, F Javier; Gómez, Rafael; Muñoz-Fernández, Maria Angeles; Majoral, Jean-Pierre; Bryszewska, Maria
2017-04-01
Dendrimers are hyperbranched polymers belonging to the huge class of nanomedical devices. Their wide application in biology and medicine requires understanding of the fundamental mechanisms of their interactions with biological systems. Summarizing, electrostatic force plays the predominant role in dendrimer-protein interactions, especially with charged dendrimers. Other kinds of interactions have been proven, such as H-bonding, van der Waals forces, and even hydrophobic interactions. These interactions depend on the characteristics of both participants: flexibility and surface charge of a dendrimer, rigidity of protein structure and the localization of charged amino acids at its surface. pH and ionic strength of solutions can significantly modulate interactions. Ligands and cofactors attached to a protein can also change dendrimer-protein interactions. Binding of dendrimers to a protein can change its secondary structure, conformation, intramolecular mobility and functional activity. However, this strongly depends on rigidity versus flexibility of a protein's structure. In addition, the potential applications of dendrimers to nanomedicine are reviwed related to dendrimer-protein interactions. Copyright © 2017 Elsevier B.V. All rights reserved.
ProteinShop: A tool for interactive protein manipulation and steering
Energy Technology Data Exchange (ETDEWEB)
Crivelli, Silvia; Kreylos, Oliver; Max, Nelson; Hamann, Bernd; Bethel, Wes
2004-05-25
We describe ProteinShop, a new visualization tool that streamlines and simplifies the process of determining optimal protein folds. ProteinShop may be used at different stages of a protein structure prediction process. First, it can create protein configurations containing secondary structures specified by the user. Second, it can interactively manipulate protein fragments to achieve desired folds by adjusting the dihedral angles of selected coil regions using an Inverse Kinematics method. Last, it serves as a visual framework to monitor and steer a protein structure prediction process that may be running on a remote machine. ProteinShop was used to create initial configurations for a protein structure prediction method developed by a team that competed in CASP5. ProteinShop's use accelerated the process of generating initial configurations, reducing the time required from days to hours. This paper describes the structure of ProteinShop and discusses its main features.
Haidar, Malak; de Laté , Perle Latré
2017-01-01
One powerful application of cell penetrating peptides is the delivery into cells of molecules that function as specific competitors or inhibitors of protein-protein interactions. Ablating defined protein-protein interactions is a refined way
International Nuclear Information System (INIS)
D’Agostino, Tommaso; Solana, José Ramón; Emanuele, Antonio
2013-01-01
Highlights: ► We propose a model of effective protein–protein interaction embedding solvent effects. ► A previous square-well model is enhanced by giving to the interaction a free energy character. ► The temperature dependence of the interaction is due to entropic effects of the solvent. ► The validity of the original SW model is extended to entropy driven phase transitions. ► We get good fits for lysozyme and haemoglobin spinodal data taken from literature. - Abstract: Statistical thermodynamics of protein solutions is often studied in terms of simple, microscopic models of particles interacting via pairwise potentials. Such modelling can reproduce the short range structure of protein solutions at equilibrium and predict thermodynamics instabilities of these systems. We introduce a square well model of effective protein–protein interaction that embeds the solvent’s action. We modify an existing model [45] by considering a well depth having an explicit dependence on temperature, i.e. an explicit free energy character, thus encompassing the statistically relevant configurations of solvent molecules around proteins. We choose protein solutions exhibiting demixing upon temperature decrease (lysozyme, enthalpy driven) and upon temperature increase (haemoglobin, entropy driven). We obtain satisfactory fits of spinodal curves for both the two proteins without adding any mean field term, thus extending the validity of the original model. Our results underline the solvent role in modulating or stretching the interaction potential
Directory of Open Access Journals (Sweden)
Thahir Mohamed
2012-11-01
Full Text Available Abstract Background Machine learning approaches for classification learn the pattern of the feature space of different classes, or learn a boundary that separates the feature space into different classes. The features of the data instances are usually available, and it is only the class-labels of the instances that are unavailable. For example, to classify text documents into different topic categories, the words in the documents are features and they are readily available, whereas the topic is what is predicted. However, in some domains obtaining features may be resource-intensive because of which not all features may be available. An example is that of protein-protein interaction prediction, where not only are the labels ('interacting' or 'non-interacting' unavailable, but so are some of the features. It may be possible to obtain at least some of the missing features by carrying out a few experiments as permitted by the available resources. If only a few experiments can be carried out to acquire missing features, which proteins should be studied and which features of those proteins should be determined? From the perspective of machine learning for PPI prediction, it would be desirable that those features be acquired which when used in training the classifier, the accuracy of the classifier is improved the most. That is, the utility of the feature-acquisition is measured in terms of how much acquired features contribute to improving the accuracy of the classifier. Active feature acquisition (AFA is a strategy to preselect such instance-feature combinations (i.e. protein and experiment combinations for maximum utility. The goal of AFA is the creation of optimal training set that would result in the best classifier, and not in determining the best classification model itself. Results We present a heuristic method for active feature acquisition to calculate the utility of acquiring a missing feature. This heuristic takes into account the change in
Characterization of host proteins interacting with the lymphocytic choriomeningitis virus L protein.
Khamina, Kseniya; Lercher, Alexander; Caldera, Michael; Schliehe, Christopher; Vilagos, Bojan; Sahin, Mehmet; Kosack, Lindsay; Bhattacharya, Anannya; Májek, Peter; Stukalov, Alexey; Sacco, Roberto; James, Leo C; Pinschewer, Daniel D; Bennett, Keiryn L; Menche, Jörg; Bergthaler, Andreas
2017-12-01
RNA-dependent RNA polymerases (RdRps) play a key role in the life cycle of RNA viruses and impact their immunobiology. The arenavirus lymphocytic choriomeningitis virus (LCMV) strain Clone 13 provides a benchmark model for studying chronic infection. A major genetic determinant for its ability to persist maps to a single amino acid exchange in the viral L protein, which exhibits RdRp activity, yet its functional consequences remain elusive. To unravel the L protein interactions with the host proteome, we engineered infectious L protein-tagged LCMV virions by reverse genetics. A subsequent mass-spectrometric analysis of L protein pulldowns from infected human cells revealed a comprehensive network of interacting host proteins. The obtained LCMV L protein interactome was bioinformatically integrated with known host protein interactors of RdRps from other RNA viruses, emphasizing interconnected modules of human proteins. Functional characterization of selected interactors highlighted proviral (DDX3X) as well as antiviral (NKRF, TRIM21) host factors. To corroborate these findings, we infected Trim21-/- mice with LCMV and found impaired virus control in chronic infection. These results provide insights into the complex interactions of the arenavirus LCMV and other viral RdRps with the host proteome and contribute to a better molecular understanding of how chronic viruses interact with their host.
Force spectroscopy studies on protein-ligand interactions: a single protein mechanics perspective.
Hu, Xiaotang; Li, Hongbin
2014-10-01
Protein-ligand interactions are ubiquitous and play important roles in almost every biological process. The direct elucidation of the thermodynamic, structural and functional consequences of protein-ligand interactions is thus of critical importance to decipher the mechanism underlying these biological processes. A toolbox containing a variety of powerful techniques has been developed to quantitatively study protein-ligand interactions in vitro as well as in living systems. The development of atomic force microscopy-based single molecule force spectroscopy techniques has expanded this toolbox and made it possible to directly probe the mechanical consequence of ligand binding on proteins. Many recent experiments have revealed how ligand binding affects the mechanical stability and mechanical unfolding dynamics of proteins, and provided mechanistic understanding on these effects. The enhancement effect of mechanical stability by ligand binding has been used to help tune the mechanical stability of proteins in a rational manner and develop novel functional binding assays for protein-ligand interactions. Single molecule force spectroscopy studies have started to shed new lights on the structural and functional consequence of ligand binding on proteins that bear force under their biological settings. Copyright © 2014 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Yeast Interacting Proteins Database: YOR358W, YOR047C [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available ; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt15p; act...rotein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt15p; acts as a regulator o
Interaction of Proteins Identified in Human Thyroid Cells
Pietsch, Jessica; Riwaldt, Stefan; Bauer, Johann; Sickmann, Albert; Weber, Gerhard; Grosse, Jirka; Infanger, Manfred; Eilles, Christoph; Grimm, Daniela
2013-01-01
Influence of gravity forces on the regulation of protein expression by healthy and malignant thyroid cells was studied with the aim to identify protein interactions. Western blot analyses of a limited number of proteins suggested a time-dependent regulation of protein expression by simulated microgravity. After applying free flow isoelectric focusing and mass spectrometry to search for differently expressed proteins by thyroid cells exposed to simulated microgravity for three days, a considerable number of candidates for gravi-sensitive proteins were detected. In order to show how proteins sensitive to microgravity could directly influence other proteins, we investigated all polypeptide chains identified with Mascot scores above 100, looking for groups of interacting proteins. Hence, UniProtKB entry numbers of all detected proteins were entered into the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and processed. The program indicated that we had detected various groups of interacting proteins in each of the three cell lines studied. The major groups of interacting proteins play a role in pathways of carbohydrate and protein metabolism, regulation of cell growth and cell membrane structuring. Analyzing these groups, networks of interaction could be established which show how a punctual influence of simulated microgravity may propagate via various members of interaction chains. PMID:23303277
Interaction of Proteins Identified in Human Thyroid Cells
Directory of Open Access Journals (Sweden)
Jessica Pietsch
2013-01-01
Full Text Available Influence of gravity forces on the regulation of protein expression by healthy and malignant thyroid cells was studied with the aim to identify protein interactions. Western blot analyses of a limited number of proteins suggested a time-dependent regulation of protein expression by simulated microgravity. After applying free flow isoelectric focusing and mass spectrometry to search for differently expressed proteins by thyroid cells exposed to simulated microgravity for three days, a considerable number of candidates for gravi-sensitive proteins were detected. In order to show how proteins sensitive to microgravity could directly influence other proteins, we investigated all polypeptide chains identified with Mascot scores above 100, looking for groups of interacting proteins. Hence, UniProtKB entry numbers of all detected proteins were entered into the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING and processed. The program indicated that we had detected various groups of interacting proteins in each of the three cell lines studied. The major groups of interacting proteins play a role in pathways of carbohydrate and protein metabolism, regulation of cell growth and cell membrane structuring. Analyzing these groups, networks of interaction could be established which show how a punctual influence of simulated microgravity may propagate via various members of interaction chains.
Directory of Open Access Journals (Sweden)
Jaume Torres
2015-06-01
Full Text Available Viroporins are members of a rapidly growing family of channel-forming small polypeptides found in viruses. The present review will be focused on recent structural and protein-protein interaction information involving two viroporins found in enveloped viruses that target the respiratory tract; (i the envelope protein in coronaviruses and (ii the small hydrophobic protein in paramyxoviruses. Deletion of these two viroporins leads to viral attenuation in vivo, whereas data from cell culture shows involvement in the regulation of stress and inflammation. The channel activity and structure of some representative members of these viroporins have been recently characterized in some detail. In addition, searches for protein-protein interactions using yeast-two hybrid techniques have shed light on possible functional roles for their exposed cytoplasmic domains. A deeper analysis of these interactions should not only provide a more complete overview of the multiple functions of these viroporins, but also suggest novel strategies that target protein-protein interactions as much needed antivirals. These should complement current efforts to block viroporin channel activity.
Analysis of Protein-Membrane Interactions
DEFF Research Database (Denmark)
Kemmer, Gerdi Christine
Cellular membranes are complex structures, consisting of hundreds of different lipids and proteins. These membranes act as barriers between distinct environments, constituting hot spots for many essential functions of the cell, including signaling, energy conversion, and transport. These functions....... Discovered interactions were then probed on the level of the membrane using liposome-based assays. In the second part, a transmembrane protein was investigated. Assays to probe activity of the plasma membrane ATPase (Arabidopsis thaliana H+ -ATPase isoform 2 (AHA2)) in single liposomes using both giant...... are implemented by soluble proteins reversibly binding to, as well as by integral membrane proteins embedded in, cellular membranes. The activity and interaction of these proteins is furthermore modulated by the lipids of the membrane. Here, liposomes were used as model membrane systems to investigate...
Li, Tao; Li, Qian-Zhong
2012-11-07
RNA-protein interactions play important roles in various biological processes. The precise detection of RNA-protein interaction sites is very important for understanding essential biological processes and annotating the function of the proteins. In this study, based on various features from amino acid sequence and structure, including evolutionary information, solvent accessible surface area and torsion angles (φ, ψ) in the backbone structure of the polypeptide chain, a computational method for predicting RNA-binding sites in proteins is proposed. When the method is applied to predict RNA-binding sites in three datasets: RBP86 containing 86 protein chains, RBP107 containing 107 proteins chains and RBP109 containing 109 proteins chains, better sensitivities and specificities are obtained compared to previously published methods in five-fold cross-validation tests. In order to make further examination for the efficiency of our method, the RBP107 dataset is used as training set, RBP86 and RBP109 datasets are used as the independent test sets. In addition, as examples of our prediction, RNA-binding sites in a few proteins are presented. The annotated results are consistent with the PDB annotation. These results show that our method is useful for annotating RNA binding sites of novel proteins.
Yeast Interacting Proteins Database: YGL145W, YNL258C [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available ripheral membrane protein required for Golgi-to-ER retrograde traffic; component ... membrane protein required for Golgi-to-ER retrograde traffic; component of the ER target site that interact
Chen, Yan-Kun; Qiao, Lian-Sheng; Huo, Xiao-Qian; Zhang, Xu; Han, Na; Zhang, Yan-Ling
2017-09-01
Pinctada fucata oligopeptide is one of key pharmaceutical effective constituents of P. fucata. It is significant to analyze its pharmacological effect and mechanism. This study aims to discover the potential oligopeptides from P. fucata and analyze the mechanism of P. fucata oligopeptide based on in silico technologies and protein interaction network(PIN). First, main protein sequences of P. fucata were collected, and oligopeptides were obtained using in silico gastrointestinal tract proteolysis. Then, key potential targets of P. fucata oligopeptides were obtained through pharmacophore screening. The protein-protein interaction(PPI) of targets was achieved and implemented to construct PIN and analyze the mechanism of P. fucata oligopeptides. P. fucata oligopeptide database was constructed based on in silico technologies, including 458 oligopeptides. Twelve modules were identified from PIN by a graph theoretic clustering algorithm Molecular Complex Detection(MCODE) and analyzed by Gene ontology(GO) enrichment. The results indicated that P. fucata oligopeptides have an effect in treating neurological diseases, such as Alzheimer's disease. In silico proteolysis could be used to analyze the protein sequences of traditional Chinese medicine(TCM). According to the combination of in silico proteolysis and PIN, the biological activity of oligopeptides could be interpreted rapidly based on the known TCM protein sequence. The study provides the methodology basis for rapidly and efficiently implementing the mechanism analysis of TCM oligopeptides. Copyright© by the Chinese Pharmaceutical Association.
In silico modeling of the yeast protein and protein family interaction network
Goh, K.-I.; Kahng, B.; Kim, D.
2004-03-01
Understanding of how protein interaction networks of living organisms have evolved or are organized can be the first stepping stone in unveiling how life works on a fundamental ground. Here we introduce an in silico ``coevolutionary'' model for the protein interaction network and the protein family network. The essential ingredient of the model includes the protein family identity and its robustness under evolution, as well as the three previously proposed: gene duplication, divergence, and mutation. This model produces a prototypical feature of complex networks in a wide range of parameter space, following the generalized Pareto distribution in connectivity. Moreover, we investigate other structural properties of our model in detail with some specific values of parameters relevant to the yeast Saccharomyces cerevisiae, showing excellent agreement with the empirical data. Our model indicates that the physical constraints encoded via the domain structure of proteins play a crucial role in protein interactions.
Drosophila Protein interaction Map (DPiM)
Guruharsha, K.G.; Obar, Robert A.; Mintseris, Julian; Aishwarya, K.; Krishnan, R.T.; VijayRaghavan, K.; Artavanis-Tsakonas, Spyros
2012-01-01
Proteins perform essential cellular functions as part of protein complexes, often in conjunction with RNA, DNA, metabolites and other small molecules. The genome encodes thousands of proteins but not all of them are expressed in every cell type; and expressed proteins are not active at all times. Such diversity of protein expression and function accounts for the level of biological intricacy seen in nature. Defining protein-protein interactions in protein complexes, and establishing the when,...
Directory of Open Access Journals (Sweden)
Li Min
2012-03-01
Full Text Available Abstract Background Identification of essential proteins is always a challenging task since it requires experimental approaches that are time-consuming and laborious. With the advances in high throughput technologies, a large number of protein-protein interactions are available, which have produced unprecedented opportunities for detecting proteins' essentialities from the network level. There have been a series of computational approaches proposed for predicting essential proteins based on network topologies. However, the network topology-based centrality measures are very sensitive to the robustness of network. Therefore, a new robust essential protein discovery method would be of great value. Results In this paper, we propose a new centrality measure, named PeC, based on the integration of protein-protein interaction and gene expression data. The performance of PeC is validated based on the protein-protein interaction network of Saccharomyces cerevisiae. The experimental results show that the predicted precision of PeC clearly exceeds that of the other fifteen previously proposed centrality measures: Degree Centrality (DC, Betweenness Centrality (BC, Closeness Centrality (CC, Subgraph Centrality (SC, Eigenvector Centrality (EC, Information Centrality (IC, Bottle Neck (BN, Density of Maximum Neighborhood Component (DMNC, Local Average Connectivity-based method (LAC, Sum of ECC (SoECC, Range-Limited Centrality (RL, L-index (LI, Leader Rank (LR, Normalized α-Centrality (NC, and Moduland-Centrality (MC. Especially, the improvement of PeC over the classic centrality measures (BC, CC, SC, EC, and BN is more than 50% when predicting no more than 500 proteins. Conclusions We demonstrate that the integration of protein-protein interaction network and gene expression data can help improve the precision of predicting essential proteins. The new centrality measure, PeC, is an effective essential protein discovery method.
Parallel force assay for protein-protein interactions.
Directory of Open Access Journals (Sweden)
Daniela Aschenbrenner
Full Text Available Quantitative proteome research is greatly promoted by high-resolution parallel format assays. A characterization of protein complexes based on binding forces offers an unparalleled dynamic range and allows for the effective discrimination of non-specific interactions. Here we present a DNA-based Molecular Force Assay to quantify protein-protein interactions, namely the bond between different variants of GFP and GFP-binding nanobodies. We present different strategies to adjust the maximum sensitivity window of the assay by influencing the binding strength of the DNA reference duplexes. The binding of the nanobody Enhancer to the different GFP constructs is compared at high sensitivity of the assay. Whereas the binding strength to wild type and enhanced GFP are equal within experimental error, stronger binding to superfolder GFP is observed. This difference in binding strength is attributed to alterations in the amino acids that form contacts according to the crystal structure of the initial wild type GFP-Enhancer complex. Moreover, we outline the potential for large-scale parallelization of the assay.
Directory of Open Access Journals (Sweden)
Straßer Wolfgang
2006-12-01
Full Text Available In the last decades, biological databases became the major knowledge resource for researchers in the field of molecular biology. The distribution of information among these databases is one of the major problems. An overview about the subject area of data access and representation of protein and protein-protein interaction data within public biological databases is described. For a comprehensive and consistent way of searching and analysing integrated protein and protein-protein interaction data, the InSilico Proteomics (ISP project has been initiated. Its three main objectives are (1 to provide an integrated knowledge pool for data investigation and global network analysis functions for a better understanding of a cell’s interactome, (2 employment of public data for plausibility analysis and validation of in-house experimental data and (3 testing the applicability of Microsoft’s .NET architecture for bioinformatics applications. Data integrated into the ISP database can be queried through the Web portal PRIMOS (PRotein Interaction and MOlecule Search which is freely available at http://biomis.fh-hagenberg.at/isp/primos.
Evidence of probabilistic behaviour in protein interaction networks
Directory of Open Access Journals (Sweden)
Reifman Jaques
2008-01-01
Full Text Available Abstract Background Data from high-throughput experiments of protein-protein interactions are commonly used to probe the nature of biological organization and extract functional relationships between sets of proteins. What has not been appreciated is that the underlying mechanisms involved in assembling these networks may exhibit considerable probabilistic behaviour. Results We find that the probability of an interaction between two proteins is generally proportional to the numerical product of their individual interacting partners, or degrees. The degree-weighted behaviour is manifested throughout the protein-protein interaction networks studied here, except for the high-degree, or hub, interaction areas. However, we find that the probabilities of interaction between the hubs are still high. Further evidence is provided by path length analyses, which show that these hubs are separated by very few links. Conclusion The results suggest that protein-protein interaction networks incorporate probabilistic elements that lead to scale-rich hierarchical architectures. These observations seem to be at odds with a biologically-guided organization. One interpretation of the findings is that we are witnessing the ability of proteins to indiscriminately bind rather than the protein-protein interactions that are actually utilized by the cell in biological processes. Therefore, the topological study of a degree-weighted network requires a more refined methodology to extract biological information about pathways, modules, or other inferred relationships among proteins.
Mapping functional prion-prion protein interaction sites using prion protein based peptide-arrays
Rigter, A.; Priem, J.; Timmers-Parohi, D.; Langeveld, J.; Bossers, A.
2009-01-01
Protein-protein interactions are at the basis of most if not all biological processes in living cells. Therefore, adapting existing techniques or developing new techniques to study interactions between proteins are of importance in elucidating which amino acid sequences contribute to these
Meckes, David G
2014-01-01
The identification and characterization of herpes simplex virus protein interaction complexes are fundamental to understanding the molecular mechanisms governing the replication and pathogenesis of the virus. Recent advances in affinity-based methods, mass spectrometry configurations, and bioinformatics tools have greatly increased the quantity and quality of protein-protein interaction datasets. In this chapter, detailed and reliable methods that can easily be implemented are presented for the identification of protein-protein interactions using cryogenic cell lysis, affinity purification, trypsin digestion, and mass spectrometry.
DeBlasio, Stacy L; Chavez, Juan D; Alexander, Mariko M; Ramsey, John; Eng, Jimmy K; Mahoney, Jaclyn; Gray, Stewart M; Bruce, James E; Cilia, Michelle
2016-02-15
Demonstrating direct interactions between host and virus proteins during infection is a major goal and challenge for the field of virology. Most protein interactions are not binary or easily amenable to structural determination. Using infectious preparations of a polerovirus (Potato leafroll virus [PLRV]) and protein interaction reporter (PIR), a revolutionary technology that couples a mass spectrometric-cleavable chemical cross-linker with high-resolution mass spectrometry, we provide the first report of a host-pathogen protein interaction network that includes data-derived, topological features for every cross-linked site that was identified. We show that PLRV virions have hot spots of protein interaction and multifunctional surface topologies, revealing how these plant viruses maximize their use of binding interfaces. Modeling data, guided by cross-linking constraints, suggest asymmetric packing of the major capsid protein in the virion, which supports previous epitope mapping studies. Protein interaction topologies are conserved with other species in the Luteoviridae and with unrelated viruses in the Herpesviridae and Adenoviridae. Functional analysis of three PLRV-interacting host proteins in planta using a reverse-genetics approach revealed a complex, molecular tug-of-war between host and virus. Structural mimicry and diversifying selection-hallmarks of host-pathogen interactions-were identified within host and viral binding interfaces predicted by our models. These results illuminate the functional diversity of the PLRV-host protein interaction network and demonstrate the usefulness of PIR technology for precision mapping of functional host-pathogen protein interaction topologies. The exterior shape of a plant virus and its interacting host and insect vector proteins determine whether a virus will be transmitted by an insect or infect a specific host. Gaining this information is difficult and requires years of experimentation. We used protein interaction
C2 Domains as Protein-Protein Interaction Modules in the Ciliary Transition Zone
Directory of Open Access Journals (Sweden)
Kim Remans
2014-07-01
Full Text Available RPGR-interacting protein 1 (RPGRIP1 is mutated in the eye disease Leber congenital amaurosis (LCA and its structural homolog, RPGRIP1-like (RPGRIP1L, is mutated in many different ciliopathies. Both are multidomain proteins that are predicted to interact with retinitis pigmentosa G-protein regulator (RPGR. RPGR is mutated in X-linked retinitis pigmentosa and is located in photoreceptors and primary cilia. We solved the crystal structure of the complex between the RPGR-interacting domain (RID of RPGRIP1 and RPGR and demonstrate that RPGRIP1L binds to RPGR similarly. RPGRIP1 binding to RPGR affects the interaction with PDEδ, the cargo shuttling factor for prenylated ciliary proteins. RPGRIP1-RID is a C2 domain with a canonical β sandwich structure that does not bind Ca2+ and/or phospholipids and thus constitutes a unique type of protein-protein interaction module. Judging from the large number of C2 domains in most of the ciliary transition zone proteins identified thus far, the structure presented here seems to constitute a cilia-specific module that is present in multiprotein transition zone complexes.
Evidence for the interaction of the regulatory protein Ki-1/57 with p53 and its interacting proteins
International Nuclear Information System (INIS)
Nery, Flavia C.; Rui, Edmilson; Kuniyoshi, Tais M.; Kobarg, Joerg
2006-01-01
Ki-1/57 is a cytoplasmic and nuclear phospho-protein of 57 kDa and interacts with the adaptor protein RACK1, the transcription factor MEF2C, and the chromatin remodeling factor CHD3, suggesting that it might be involved in the regulation of transcription. Here, we describe yeast two-hybrid studies that identified a total of 11 proteins interacting with Ki-1/57, all of which interact or are functionally associated with p53 or other members of the p53 family of proteins. We further found that Ki-1/57 is able to interact with p53 itself in the yeast two-hybrid system when the interaction was tested directly. This interaction could be confirmed by pull down assays with purified proteins in vitro and by reciprocal co-immunoprecipitation assays from the human Hodgkin analogous lymphoma cell line L540. Furthermore, we found that the phosphorylation of p53 by PKC abolishes its interaction with Ki-1/57 in vitro
Directory of Open Access Journals (Sweden)
Sony Malhotra
Full Text Available Interactions at the molecular level in the cellular environment play a very crucial role in maintaining the physiological functioning of the cell. These molecular interactions exist at varied levels viz. protein-protein interactions, protein-nucleic acid interactions or protein-small molecules interactions. Presently in the field, these interactions and their mechanisms mark intensively studied areas. Molecular interactions can also be studied computationally using the approach named as Molecular Docking. Molecular docking employs search algorithms to predict the possible conformations for interacting partners and then calculates interaction energies. However, docking proposes number of solutions as different docked poses and hence offers a serious challenge to identify the native (or near native structures from the pool of these docked poses. Here, we propose a rigorous scoring scheme called DockScore which can be used to rank the docked poses and identify the best docked pose out of many as proposed by docking algorithm employed. The scoring identifies the optimal interactions between the two protein partners utilising various features of the putative interface like area, short contacts, conservation, spatial clustering and the presence of positively charged and hydrophobic residues. DockScore was first trained on a set of 30 protein-protein complexes to determine the weights for different parameters. Subsequently, we tested the scoring scheme on 30 different protein-protein complexes and native or near-native structure were assigned the top rank from a pool of docked poses in 26 of the tested cases. We tested the ability of DockScore to discriminate likely dimer interactions that differ substantially within a homologous family and also demonstrate that DOCKSCORE can distinguish correct pose for all 10 recent CAPRI targets.
Yates, Christopher M; Sternberg, Michael J E
2013-11-01
Non-synonymous single nucleotide polymorphisms (nsSNPs) are single base changes leading to a change to the amino acid sequence of the encoded protein. Many of these variants are associated with disease, so nsSNPs have been well studied, with studies looking at the effects of nsSNPs on individual proteins, for example, on stability and enzyme active sites. In recent years, the impact of nsSNPs upon protein-protein interactions has also been investigated, giving a greater insight into the mechanisms by which nsSNPs can lead to disease. In this review, we summarize these studies, looking at the various mechanisms by which nsSNPs can affect protein-protein interactions. We focus on structural changes that can impair interaction, changes to disorder, gain of interaction, and post-translational modifications before looking at some examples of nsSNPs at human-pathogen protein-protein interfaces and the analysis of nsSNPs from a network perspective. © 2013.
Berjón-Otero, Mónica; Lechuga, Ana; Mehla, Jitender; Uetz, Peter; Salas, Margarita; Redrejo-Rodríguez, Modesto
2017-07-26
function.Comprehensive protein-protein interaction (PPI) analysis among viral proteins can eventually disclose biological mechanisms and thus provide new insights into protein function unattainable by studying proteins one by one. Here we comprehensively describe intraviral PPIs among tectivirus Bam35 proteins using multi-vector yeast two-hybrid screening that was further supported by co-immunoprecipitation assays and protein structural models. This approach allowed us to propose new functions for known proteins and hypothesize on the biological role localization within the viral particle of some viral ORFan proteins that will be helpful for understanding the biology of Gram-positive tectivirus. Copyright © 2017 American Society for Microbiology.
Directory of Open Access Journals (Sweden)
Aalt D J van Dijk
Full Text Available Protein sequences encompass tertiary structures and contain information about specific molecular interactions, which in turn determine biological functions of proteins. Knowledge about how protein sequences define interaction specificity is largely missing, in particular for paralogous protein families with high sequence similarity, such as the plant MADS domain transcription factor family. In comparison to the situation in mammalian species, this important family of transcription regulators has expanded enormously in plant species and contains over 100 members in the model plant species Arabidopsis thaliana. Here, we provide insight into the mechanisms that determine protein-protein interaction specificity for the Arabidopsis MADS domain transcription factor family, using an integrated computational and experimental approach. Plant MADS proteins have highly similar amino acid sequences, but their dimerization patterns vary substantially. Our computational analysis uncovered small sequence regions that explain observed differences in dimerization patterns with reasonable accuracy. Furthermore, we show the usefulness of the method for prediction of MADS domain transcription factor interaction networks in other plant species. Introduction of mutations in the predicted interaction motifs demonstrated that single amino acid mutations can have a large effect and lead to loss or gain of specific interactions. In addition, various performed bioinformatics analyses shed light on the way evolution has shaped MADS domain transcription factor interaction specificity. Identified protein-protein interaction motifs appeared to be strongly conserved among orthologs, indicating their evolutionary importance. We also provide evidence that mutations in these motifs can be a source for sub- or neo-functionalization. The analyses presented here take us a step forward in understanding protein-protein interactions and the interplay between protein sequences and
Kinjo, Akira R; Nakamura, Haruki
2013-01-01
Protein functions are mediated by interactions between proteins and other molecules. One useful approach to analyze protein functions is to compare and classify the structures of interaction interfaces of proteins. Here, we describe the procedures for compiling a database of interface structures and efficiently comparing the interface structures. To do so requires a good understanding of the data structures of the Protein Data Bank (PDB). Therefore, we also provide a detailed account of the PDB exchange dictionary necessary for extracting data that are relevant for analyzing interaction interfaces and secondary structures. We identify recurring structural motifs by classifying similar interface structures, and we define a coarse-grained representation of supersecondary structures (SSS) which represents a sequence of two or three secondary structure elements including their relative orientations as a string of four to seven letters. By examining the correspondence between structural motifs and SSS strings, we show that no SSS string has particularly high propensity to be found interaction interfaces in general, indicating any SSS can be used as a binding interface. When individual structural motifs are examined, there are some SSS strings that have high propensity for particular groups of structural motifs. In addition, it is shown that while the SSS strings found in particular structural motifs for nonpolymer and protein interfaces are as abundant as in other structural motifs that belong to the same subunit, structural motifs for nucleic acid interfaces exhibit somewhat stronger preference for SSS strings. In regard to protein folds, many motif-specific SSS strings were found across many folds, suggesting that SSS may be a useful description to investigate the universality of ligand binding modes.
The EED protein–protein interaction inhibitor A-395 inactivates the PRC2 complex
Energy Technology Data Exchange (ETDEWEB)
He, Yupeng; Selvaraju, Sujatha; Curtin, Michael L.; Jakob, Clarissa G.; Zhu, Haizhong; Comess, Kenneth M.; Shaw, Bailin; The, Juliana; Lima-Fernandes, Evelyne; Szewczyk, Magdalena M.; Cheng, Dong; Klinge, Kelly L.; Li, Huan-Qiu; Pliushchev, Marina; Algire, Mikkel A.; Maag, David; Guo, Jun; Dietrich, Justin; Panchal, Sanjay C.; Petros, Andrew M.; Sweis, Ramzi F.; Torrent, Maricel; Bigelow, Lance J.; Senisterra, Guillermo; Li, Fengling; Kennedy, Steven; Wu, Qin; Osterling, Donald J.; Lindley, David J.; Gao, Wenqing; Galasinski, Scott; Barsyte-Lovejoy, Dalia; Vedadi, Masoud; Buchanan, Fritz G.; Arrowsmith, Cheryl H.; Chiang, Gary G.; Sun, Chaohong; Pappano , William N. (AbbVie); (Toronto)
2017-01-30
Polycomb repressive complex 2 (PRC2) is a regulator of epigenetic states required for development and homeostasis. PRC2 trimethylates histone H3 at lysine 27 (H3K27me3), which leads to gene silencing, and is dysregulated in many cancers. The embryonic ectoderm development (EED) protein is an essential subunit of PRC2 that has both a scaffolding function and an H3K27me3-binding function. Here we report the identification of A-395, a potent antagonist of the H3K27me3 binding functions of EED. Structural studies demonstrate that A-395 binds to EED in the H3K27me3-binding pocket, thereby preventing allosteric activation of the catalytic activity of PRC2. Phenotypic effects observed in vitro and in vivo are similar to those of known PRC2 enzymatic inhibitors; however, A-395 retains potent activity against cell lines resistant to the catalytic inhibitors. A-395 represents a first-in-class antagonist of PRC2 protein–protein interactions (PPI) for use as a chemical probe to investigate the roles of EED-containing protein complexes.
The simulation approach to lipid-protein interactions.
Paramo, Teresa; Garzón, Diana; Holdbrook, Daniel A; Khalid, Syma; Bond, Peter J
2013-01-01
The interactions between lipids and proteins are crucial for a range of biological processes, from the folding and stability of membrane proteins to signaling and metabolism facilitated by lipid-binding proteins. However, high-resolution structural details concerning functional lipid/protein interactions are scarce due to barriers in both experimental isolation of native lipid-bound complexes and subsequent biophysical characterization. The molecular dynamics (MD) simulation approach provides a means to complement available structural data, yielding dynamic, structural, and thermodynamic data for a protein embedded within a physiologically realistic, modelled lipid environment. In this chapter, we provide a guide to current methods for setting up and running simulations of membrane proteins and soluble, lipid-binding proteins, using standard atomistically detailed representations, as well as simplified, coarse-grained models. In addition, we outline recent studies that illustrate the power of the simulation approach in the context of biologically relevant lipid/protein interactions.
Miryala, Sravan Kumar; Anbarasu, Anand; Ramaiah, Sudha
2018-02-05
Computational analysis of biomolecular interaction networks is now gaining a lot of importance to understand the functions of novel genes/proteins. Gene interaction (GI) network analysis and protein-protein interaction (PPI) network analysis play a major role in predicting the functionality of interacting genes or proteins and gives an insight into the functional relationships and evolutionary conservation of interactions among the genes. An interaction network is a graphical representation of gene/protein interactome, where each gene/protein is a node, and interaction between gene/protein is an edge. In this review, we discuss the popular open source databases that serve as data repositories to search and collect protein/gene interaction data, and also tools available for the generation of interaction network, visualization and network analysis. Also, various network analysis approaches like topological approach and clustering approach to study the network properties and functional enrichment server which illustrates the functions and pathway of the genes and proteins has been discussed. Hence the distinctive attribute mentioned in this review is not only to provide an overview of tools and web servers for gene and protein-protein interaction (PPI) network analysis but also to extract useful and meaningful information from the interaction networks. Copyright © 2017 Elsevier B.V. All rights reserved.
Nussinov, Ruth; Panchenko, Anna R.; Przytycka, Teresa
2011-06-01
Physics approaches focus on uncovering, modeling and quantitating the general principles governing the micro and macro universe. This has always been an important component of biological research, however recent advances in experimental techniques and the accumulation of unprecedented genome-scale experimental data produced by these novel technologies now allow for addressing fundamental questions on a large scale. These relate to molecular interactions, principles of bimolecular recognition, and mechanisms of signal propagation. The functioning of a cell requires a variety of intermolecular interactions including protein-protein, protein-DNA, protein-RNA, hormones, peptides, small molecules, lipids and more. Biomolecules work together to provide specific functions and perturbations in intermolecular communication channels often lead to cellular malfunction and disease. A full understanding of the interactome requires an in-depth grasp of the biophysical principles underlying individual interactions as well as their organization in cellular networks. Phenomena can be described at different levels of abstraction. Computational and systems biology strive to model cellular processes by integrating and analyzing complex data from multiple experimental sources using interdisciplinary tools. As a result, both the causal relationships between the variables and the general features of the system can be discovered, which even without knowing the details of the underlying mechanisms allow for putting forth hypotheses and predicting the behavior of the systems in response to perturbation. And here lies the strength of in silico models which provide control and predictive power. At the same time, the complexity of individual elements and molecules can be addressed by the fields of molecular biophysics, physical biology and structural biology, which focus on the underlying physico-chemical principles and may explain the molecular mechanisms of cellular function. In this issue
Predicting Protein-Protein Interactions Using BiGGER: Case Studies
Directory of Open Access Journals (Sweden)
Rui M. Almeida
2016-08-01
Full Text Available The importance of understanding interactomes makes preeminent the study of protein interactions and protein complexes. Traditionally, protein interactions have been elucidated by experimental methods or, with lower impact, by simulation with protein docking algorithms. This article describes features and applications of the BiGGER docking algorithm, which stands at the interface of these two approaches. BiGGER is a user-friendly docking algorithm that was specifically designed to incorporate experimental data at different stages of the simulation, to either guide the search for correct structures or help evaluate the results, in order to combine the reliability of hard data with the convenience of simulations. Herein, the applications of BiGGER are described by illustrative applications divided in three Case Studies: (Case Study A in which no specific contact data is available; (Case Study B when different experimental data (e.g., site-directed mutagenesis, properties of the complex, NMR chemical shift perturbation mapping, electron tunneling on one of the partners is available; and (Case Study C when experimental data are available for both interacting surfaces, which are used during the search and/or evaluation stage of the docking. This algorithm has been extensively used, evidencing its usefulness in a wide range of different biological research fields.
Bojadzic, Damir; Buchwald, Peter
2018-05-30
Protein-protein interactions (PPIs) that are part of the costimulatory and coinhibitory (immune checkpoint) signaling are critical for adequate T cell response and are important therapeutic targets for immunomodulation. Biologics targeting them have already achieved considerable clinical success in the treatment of autoimmune diseases or transplant recipients (e.g., abatacept, belatacept, and belimumab) as well as cancer (e.g., ipilimumab, nivolumab, pembrolizumab, atezolizumab, durvalumab, and avelumab). In view of such progress, there have been only relatively limited efforts toward developing small-molecule PPI inhibitors (SMPPIIs) targeting these cosignaling interactions, possibly because they, as all other PPIs, are difficult to target by small molecules and were not considered druggable. Nevertheless, substantial progress has been achieved during the last decade. SMPPIIs proving the feasibility of such approaches have been identified through various strategies for a number of cosignaling interactions including CD40-CD40L, OX40-OX40L, BAFFR-BAFF, CD80-CD28, and PD-1-PD-L1s. Here, after an overview of the general aspects and challenges of SMPPII-focused drug discovery, we review them briefly together with relevant structural, immune-signaling, physicochemical, and medicinal chemistry aspects. While so far only a few of these SMPPIIs have shown activity in animal models (DRI-C21045 for CD40-D40L, KR33426 for BAFFR-BAFF) or reached clinical development (RhuDex for CD80-CD28, CA-170 for PD-1-PD-L1), there is proof-of-principle evidence for the feasibility of such approaches in immunomodulation. They can result in products that are easier to develop/manufacture and are less likely to be immunogenic or encounter postmarket safety events than corresponding biologics, and, contrary to them, can even become orally bioavailable. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Energy landscape of all-atom protein-protein interactions revealed by multiscale enhanced sampling.
Directory of Open Access Journals (Sweden)
Kei Moritsugu
2014-10-01
Full Text Available Protein-protein interactions are regulated by a subtle balance of complicated atomic interactions and solvation at the interface. To understand such an elusive phenomenon, it is necessary to thoroughly survey the large configurational space from the stable complex structure to the dissociated states using the all-atom model in explicit solvent and to delineate the energy landscape of protein-protein interactions. In this study, we carried out a multiscale enhanced sampling (MSES simulation of the formation of a barnase-barstar complex, which is a protein complex characterized by an extraordinary tight and fast binding, to determine the energy landscape of atomistic protein-protein interactions. The MSES adopts a multicopy and multiscale scheme to enable for the enhanced sampling of the all-atom model of large proteins including explicit solvent. During the 100-ns MSES simulation of the barnase-barstar system, we observed the association-dissociation processes of the atomistic protein complex in solution several times, which contained not only the native complex structure but also fully non-native configurations. The sampled distributions suggest that a large variety of non-native states went downhill to the stable complex structure, like a fast folding on a funnel-like potential. This funnel landscape is attributed to dominant configurations in the early stage of the association process characterized by near-native orientations, which will accelerate the native inter-molecular interactions. These configurations are guided mostly by the shape complementarity between barnase and barstar, and lead to the fast formation of the final complex structure along the downhill energy landscape.
Islamaj Dogan, Rezarta; Kim, Sun; Chatr-Aryamontri, Andrew; Chang, Christie S; Oughtred, Rose; Rust, Jennifer; Wilbur, W John; Comeau, Donald C; Dolinski, Kara; Tyers, Mike
2017-01-01
A great deal of information on the molecular genetics and biochemistry of model organisms has been reported in the scientific literature. However, this data is typically described in free text form and is not readily amenable to computational analyses. To this end, the BioGRID database systematically curates the biomedical literature for genetic and protein interaction data. This data is provided in a standardized computationally tractable format and includes structured annotation of experimental evidence. BioGRID curation necessarily involves substantial human effort by expert curators who must read each publication to extract the relevant information. Computational text-mining methods offer the potential to augment and accelerate manual curation. To facilitate the development of practical text-mining strategies, a new challenge was organized in BioCreative V for the BioC task, the collaborative Biocurator Assistant Task. This was a non-competitive, cooperative task in which the participants worked together to build BioC-compatible modules into an integrated pipeline to assist BioGRID curators. As an integral part of this task, a test collection of full text articles was developed that contained both biological entity annotations (gene/protein and organism/species) and molecular interaction annotations (protein-protein and genetic interactions (PPIs and GIs)). This collection, which we call the BioC-BioGRID corpus, was annotated by four BioGRID curators over three rounds of annotation and contains 120 full text articles curated in a dataset representing two major model organisms, namely budding yeast and human. The BioC-BioGRID corpus contains annotations for 6409 mentions of genes and their Entrez Gene IDs, 186 mentions of organism names and their NCBI Taxonomy IDs, 1867 mentions of PPIs and 701 annotations of PPI experimental evidence statements, 856 mentions of GIs and 399 annotations of GI evidence statements. The purpose, characteristics and possible future
Qiu, Zhijun; Zhou, Bo; Yuan, Jiangfeng
2017-11-21
Protein-protein interaction site (PPIS) prediction must deal with the diversity of interaction sites that limits their prediction accuracy. Use of proteins with unknown or unidentified interactions can also lead to missing interfaces. Such data errors are often brought into the training dataset. In response to these two problems, we used the minimum covariance determinant (MCD) method to refine the training data to build a predictor with better performance, utilizing its ability of removing outliers. In order to predict test data in practice, a method based on Mahalanobis distance was devised to select proper test data as input for the predictor. With leave-one-validation and independent test, after the Mahalanobis distance screening, our method achieved higher performance according to Matthews correlation coefficient (MCC), although only a part of test data could be predicted. These results indicate that data refinement is an efficient approach to improve protein-protein interaction site prediction. By further optimizing our method, it is hopeful to develop predictors of better performance and wide range of application. Copyright © 2017 Elsevier Ltd. All rights reserved.
Nicholson, Judith; Scherl, Alex; Way, Luke; Blackburn, Elizabeth A; Walkinshaw, Malcolm D; Ball, Kathryn L; Hupp, Ted R
2014-06-01
Linear motifs mediate protein-protein interactions (PPI) that allow expansion of a target protein interactome at a systems level. This study uses a proteomics approach and linear motif sub-stratifications to expand on PPIs of MDM2. MDM2 is a multi-functional protein with over one hundred known binding partners not stratified by hierarchy or function. A new linear motif based on a MDM2 interaction consensus is used to select novel MDM2 interactors based on Nutlin-3 responsiveness in a cell-based proteomics screen. MDM2 binds a subset of peptide motifs corresponding to real proteins with a range of allosteric responses to MDM2 ligands. We validate cyclophilin B as a novel protein with a consensus MDM2 binding motif that is stabilised by Nutlin-3 in vivo, thus identifying one of the few known interactors of MDM2 that is stabilised by Nutlin-3. These data invoke two modes of peptide binding at the MDM2 N-terminus that rely on a consensus core motif to control the equilibrium between MDM2 binding proteins. This approach stratifies MDM2 interacting proteins based on the linear motif feature and provides a new biomarker assay to define clinically relevant Nutlin-3 responsive MDM2 interactors. Copyright © 2014 Elsevier Inc. All rights reserved.
A membrane protein / signaling protein interaction network for Arabidopsis version AMPv2
Directory of Open Access Journals (Sweden)
Sylvie Lalonde
2010-09-01
Full Text Available Interactions between membrane proteins and the soluble fraction are essential for signal transduction and for regulating nutrient transport. To gain insights into the membrane-based interactome, 3,852 open reading frames (ORFs out of a target list of 8,383 representing membrane and signaling proteins from Arabidopsis thaliana were cloned into a Gateway compatible vector. The mating-based split-ubiquitin system was used to screen for potential protein-protein interactions (pPPIs among 490 Arabidopsis ORFs. A binary robotic screen between 142 receptor-like kinases, 72 transporters, 57 soluble protein kinases and phosphatases, 40 glycosyltransferases, 95 proteins of various functions and 89 proteins with unknown function detected 387 out of 90,370 possible PPIs. A secondary screen confirmed 343 (of 387 pPPIs between 179 proteins, yielding a scale-free network (r2=0.863. Eighty of 142 transmembrane receptor-like kinases (RLK tested positive, identifying three homomers, 63 heteromers and 80 pPPIs with other proteins. Thirty-one out of 142 RLK interactors (including RLKs had previously been found to be phosphorylated; thus interactors may be substrates for respective RLKs. None of the pPPIs described here had been reported in the major interactome databases, including potential interactors of G protein-coupled receptors, phospholipase C, and AMT ammonium transporters. Two RLKs found as putative interactors of AMT1;1 were independently confirmed using a split luciferase assay in Arabidopsis protoplasts. These RLKs may be involved in ammonium-dependent phosphorylation of the C-terminus and regulation of ammonium uptake activity. The robotic screening method established here will enable a systematic analysis of membrane protein interactions in fungi, plants and metazoa.
Yeast Interacting Proteins Database: YOR302W, YOR047C [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available rol of glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt...tein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt1
Lifescience Database Archive (English)
Full Text Available List Contact us Yeast Interacting Proteins Database Full Data of Yeast Interacting Proteins Database (Origin...al Version) Data detail Data name Full Data of Yeast Interacting Proteins Database (Original Version) DOI 10....18908/lsdba.nbdc00742-004 Description of data contents The entire data in the Yeast Interacting Proteins Database...eir interactions are required. Several sources including YPD (Yeast Proteome Database, Costanzo, M. C., Hoga...ematic name in the SGD (Saccharomyces Genome Database; http://www.yeastgenome.org /). Bait gene name The gen
A rice kinase-protein interaction map.
Ding, Xiaodong; Richter, Todd; Chen, Mei; Fujii, Hiroaki; Seo, Young Su; Xie, Mingtang; Zheng, Xianwu; Kanrar, Siddhartha; Stevenson, Rebecca A; Dardick, Christopher; Li, Ying; Jiang, Hao; Zhang, Yan; Yu, Fahong; Bartley, Laura E; Chern, Mawsheng; Bart, Rebecca; Chen, Xiuhua; Zhu, Lihuang; Farmerie, William G; Gribskov, Michael; Zhu, Jian-Kang; Fromm, Michael E; Ronald, Pamela C; Song, Wen-Yuan
2009-03-01
Plants uniquely contain large numbers of protein kinases, and for the vast majority of the 1,429 kinases predicted in the rice (Oryza sativa) genome, little is known of their functions. Genetic approaches often fail to produce observable phenotypes; thus, new strategies are needed to delineate kinase function. We previously developed a cost-effective high-throughput yeast two-hybrid system. Using this system, we have generated a protein interaction map of 116 representative rice kinases and 254 of their interacting proteins. Overall, the resulting interaction map supports a large number of known or predicted kinase-protein interactions from both plants and animals and reveals many new functional insights. Notably, we found a potential widespread role for E3 ubiquitin ligases in pathogen defense signaling mediated by receptor-like kinases, particularly by the kinases that may have evolved from recently expanded kinase subfamilies in rice. We anticipate that the data provided here will serve as a foundation for targeted functional studies in rice and other plants. The application of yeast two-hybrid and TAPtag analyses for large-scale plant protein interaction studies is also discussed.
A conserved NAD+ binding pocket that regulates protein-protein interactions during aging.
Li, Jun; Bonkowski, Michael S; Moniot, Sébastien; Zhang, Dapeng; Hubbard, Basil P; Ling, Alvin J Y; Rajman, Luis A; Qin, Bo; Lou, Zhenkun; Gorbunova, Vera; Aravind, L; Steegborn, Clemens; Sinclair, David A
2017-03-24
DNA repair is essential for life, yet its efficiency declines with age for reasons that are unclear. Numerous proteins possess Nudix homology domains (NHDs) that have no known function. We show that NHDs are NAD + (oxidized form of nicotinamide adenine dinucleotide) binding domains that regulate protein-protein interactions. The binding of NAD + to the NHD domain of DBC1 (deleted in breast cancer 1) prevents it from inhibiting PARP1 [poly(adenosine diphosphate-ribose) polymerase], a critical DNA repair protein. As mice age and NAD + concentrations decline, DBC1 is increasingly bound to PARP1, causing DNA damage to accumulate, a process rapidly reversed by restoring the abundance of NAD + Thus, NAD + directly regulates protein-protein interactions, the modulation of which may protect against cancer, radiation, and aging. Copyright © 2017, American Association for the Advancement of Science.
Single-well monitoring of protein-protein interaction and phosphorylation-dephosphorylation events.
Arcand, Mathieu; Roby, Philippe; Bossé, Roger; Lipari, Francesco; Padrós, Jaime; Beaudet, Lucille; Marcil, Alexandre; Dahan, Sophie
2010-04-20
We combined oxygen channeling assays with two distinct chemiluminescent beads to detect simultaneously protein phosphorylation and interaction events that are usually monitored separately. This novel method was tested in the ERK1/2 MAP kinase pathway. It was first used to directly monitor dissociation of MAP kinase ERK2 from MEK1 upon phosphorylation and to evaluate MAP kinase phosphatase (MKP) selectivity and mechanism of action. In addition, MEK1 and ERK2 were probed with an ATP competitor and an allosteric MEK1 inhibitor, which generated distinct phosphorylation-interaction patterns. Simultaneous monitoring of protein-protein interactions and substrate phosphorylation can provide significant mechanistic insight into enzyme activity and small molecule action.
Protein-material interactions: From micro-to-nano scale
International Nuclear Information System (INIS)
Tsapikouni, Theodora S.; Missirlis, Yannis F.
2008-01-01
The article presents a survey on the significance of protein-material interactions, the mechanisms which control them and the techniques used for their study. Protein-surface interactions play a key role in regenerative medicine, drug delivery, biosensor technology and chromatography, while it is related to various undesired effects such as biofouling and bio-prosthetic malfunction. Although the effects of protein-surface interaction concern the micro-scale, being sometimes obvious even with bare eyes, they derive from biophysical events at the nano-scale. The sequential steps for protein adsorption involve events at the single biomolecule level and the forces driving or inhibiting protein adsorption act at the molecular level too. Following the scaling of protein-surface interactions, various techniques have been developed for their study both in the micro- and nano-scale. Protein labelling with radioisotopes or fluorescent probes, colorimetric assays and the quartz crystal microbalance were the first techniques used to monitor protein adsorption isotherms, while the surface force apparatus was used to measure the interaction forces between protein layers at the micro-scale. Recently, more elaborate techniques like total internal reflection fluorescence (TIRF), Fourier transform infrared spectroscopy (FTIR), surface plasmon resonance, Raman spectroscopy, ellipsometry and time of flight secondary ion mass spectrometry (ToF-SIMS) have been applied for the investigation of protein density, structure or orientation at the interfaces. However, a turning point in the study of protein interactions with the surfaces was the invention and the wide-spread use of atomic force microscopy (AFM) which can both image single protein molecules on surfaces and directly measure the interaction force
Paulmurugan, Ramasamy; Gambhir, Sanjiv S
2005-08-15
Networks of protein interactions execute many different intracellular pathways. Small molecules either synthesized within the cell or obtained from the external environment mediate many of these protein-protein interactions. The study of these small molecule-mediated protein-protein interactions is important in understanding abnormal signal transduction pathways in a variety of disorders, as well as in optimizing the process of drug development and validation. In this study, we evaluated the rapamycin-mediated interaction of the human proteins FK506-binding protein (FKBP12) rapamycin-binding domain (FRB) and FKBP12 by constructing a fusion of these proteins with a split-Renilla luciferase or a split enhanced green fluorescent protein (split-EGFP) such that complementation of the reporter fragments occurs in the presence of rapamycin. Different linker peptides in the fusion protein were evaluated for the efficient maintenance of complemented reporter activity. This system was studied in both cell culture and xenografts in living animals. We found that peptide linkers with two or four EAAAR repeat showed higher protein-protein interaction-mediated signal with lower background signal compared with having no linker or linkers with amino acid sequences GGGGSGGGGS, ACGSLSCGSF, and ACGSLSCGSFACGSLSCGSF. A 9 +/- 2-fold increase in signal intensity both in cell culture and in living mice was seen compared with a system that expresses both reporter fragments and the interacting proteins separately. In this fusion system, rapamycin induced heterodimerization of the FRB and FKBP12 moieties occurred rapidly even at very lower concentrations (0.00001 nmol/L) of rapamycin. For a similar fusion system employing split-EGFP, flow cytometry analysis showed significant level of rapamycin-induced complementation.
DEFF Research Database (Denmark)
Nandy, Subir Kumar; Jouhten, Paula; Nielsen, Jens
2010-01-01
proteins. Despite the value of BioGRID for studying protein-protein interactions, there is a need for manual curation of these interactions in order to remove false positives. RESULTS: Here we describe an annotated reconstruction of the protein-protein interactions around four key nutrient......) and for all the interactions between them (edges). The annotated information is readily available utilizing the functionalities of network modelling tools such as Cytoscape and CellDesigner. CONCLUSIONS: The reported fully annotated interaction model serves as a platform for integrated systems biology studies...
A lanthipeptide library used to identify a protein-protein interaction inhibitor.
Yang, Xiao; Lennard, Katherine R; He, Chang; Walker, Mark C; Ball, Andrew T; Doigneaux, Cyrielle; Tavassoli, Ali; van der Donk, Wilfred A
2018-04-01
In this article we describe the production and screening of a genetically encoded library of 10 6 lanthipeptides in Escherichia coli using the substrate-tolerant lanthipeptide synthetase ProcM. This plasmid-encoded library was combined with a bacterial reverse two-hybrid system for the interaction of the HIV p6 protein with the UEV domain of the human TSG101 protein, which is a critical protein-protein interaction for HIV budding from infected cells. Using this approach, we identified an inhibitor of this interaction from the lanthipeptide library, whose activity was verified in vitro and in cell-based virus-like particle-budding assays. Given the variety of lanthipeptide backbone scaffolds that may be produced with ProcM, this method may be used for the generation of genetically encoded libraries of natural product-like lanthipeptides containing substantial structural diversity. Such libraries may be combined with any cell-based assay to identify lanthipeptides with new biological activities.
RAIN: RNA-protein Association and Interaction Networks
DEFF Research Database (Denmark)
Junge, Alexander; Refsgaard, Jan Christian; Garde, Christian
2017-01-01
is challenging due to data heterogeneity. Here, we present a database of ncRNA-RNA and ncRNA-protein interactions and its integration with the STRING database of protein-protein interactions. These ncRNA associations cover four organisms and have been established from curated examples, experimental data...
Fleishman, Sarel
2012-02-01
Molecular recognition underlies all life processes. Design of interactions not seen in nature is a test of our understanding of molecular recognition and could unlock the vast potential of subtle control over molecular interaction networks, allowing the design of novel diagnostics and therapeutics for basic and applied research. We developed the first general method for designing protein interactions. The method starts by computing a region of high affinity interactions between dismembered amino acid residues and the target surface and then identifying proteins that can harbor these residues. Designs are tested experimentally for binding the target surface and successful ones are affinity matured using yeast cell surface display. Applied to the conserved stem region of influenza hemagglutinin we designed two unrelated proteins that, following affinity maturation, bound hemagglutinin at subnanomolar dissociation constants. Co-crystal structures of hemagglutinin bound to the two designed binders were within 1Angstrom RMSd of their models, validating the accuracy of the design strategy. One of the designed proteins inhibits the conformational changes that underlie hemagglutinin's cell-invasion functions and blocks virus infectivity in cell culture, suggesting that such proteins may in future serve as diagnostics and antivirals against a wide range of pathogenic influenza strains. We have used this method to obtain experimentally validated binders of several other target proteins, demonstrating the generality of the approach. We discuss the combination of modeling and high-throughput characterization of design variants which has been key to the success of this approach, as well as how we have used the data obtained in this project to enhance our understanding of molecular recognition. References: Science 332:816 JMB, in press Protein Sci 20:753
Krallinger, Martin; Vazquez, Miguel; Leitner, Florian; Salgado, David; Chatr-Aryamontri, Andrew; Winter, Andrew; Perfetto, Livia; Briganti, Leonardo; Licata, Luana; Iannuccelli, Marta; Castagnoli, Luisa; Cesareni, Gianni; Tyers, Mike; Schneider, Gerold; Rinaldi, Fabio; Leaman, Robert; Gonzalez, Graciela; Matos, Sergio; Kim, Sun; Wilbur, W John; Rocha, Luis; Shatkay, Hagit; Tendulkar, Ashish V; Agarwal, Shashank; Liu, Feifan; Wang, Xinglong; Rak, Rafal; Noto, Keith; Elkan, Charles; Lu, Zhiyong; Dogan, Rezarta Islamaj; Fontaine, Jean-Fred; Andrade-Navarro, Miguel A; Valencia, Alfonso
2011-10-03
Determining usefulness of biomedical text mining systems requires realistic task definition and data selection criteria without artificial constraints, measuring performance aspects that go beyond traditional metrics. The BioCreative III Protein-Protein Interaction (PPI) tasks were motivated by such considerations, trying to address aspects including how the end user would oversee the generated output, for instance by providing ranked results, textual evidence for human interpretation or measuring time savings by using automated systems. Detecting articles describing complex biological events like PPIs was addressed in the Article Classification Task (ACT), where participants were asked to implement tools for detecting PPI-describing abstracts. Therefore the BCIII-ACT corpus was provided, which includes a training, development and test set of over 12,000 PPI relevant and non-relevant PubMed abstracts labeled manually by domain experts and recording also the human classification times. The Interaction Method Task (IMT) went beyond abstracts and required mining for associations between more than 3,500 full text articles and interaction detection method ontology concepts that had been applied to detect the PPIs reported in them. A total of 11 teams participated in at least one of the two PPI tasks (10 in ACT and 8 in the IMT) and a total of 62 persons were involved either as participants or in preparing data sets/evaluating these tasks. Per task, each team was allowed to submit five runs offline and another five online via the BioCreative Meta-Server. From the 52 runs submitted for the ACT, the highest Matthew's Correlation Coefficient (MCC) score measured was 0.55 at an accuracy of 89% and the best AUC iP/R was 68%. Most ACT teams explored machine learning methods, some of them also used lexical resources like MeSH terms, PSI-MI concepts or particular lists of verbs and nouns, some integrated NER approaches. For the IMT, a total of 42 runs were evaluated by comparing
2011-01-01
Background Determining usefulness of biomedical text mining systems requires realistic task definition and data selection criteria without artificial constraints, measuring performance aspects that go beyond traditional metrics. The BioCreative III Protein-Protein Interaction (PPI) tasks were motivated by such considerations, trying to address aspects including how the end user would oversee the generated output, for instance by providing ranked results, textual evidence for human interpretation or measuring time savings by using automated systems. Detecting articles describing complex biological events like PPIs was addressed in the Article Classification Task (ACT), where participants were asked to implement tools for detecting PPI-describing abstracts. Therefore the BCIII-ACT corpus was provided, which includes a training, development and test set of over 12,000 PPI relevant and non-relevant PubMed abstracts labeled manually by domain experts and recording also the human classification times. The Interaction Method Task (IMT) went beyond abstracts and required mining for associations between more than 3,500 full text articles and interaction detection method ontology concepts that had been applied to detect the PPIs reported in them. Results A total of 11 teams participated in at least one of the two PPI tasks (10 in ACT and 8 in the IMT) and a total of 62 persons were involved either as participants or in preparing data sets/evaluating these tasks. Per task, each team was allowed to submit five runs offline and another five online via the BioCreative Meta-Server. From the 52 runs submitted for the ACT, the highest Matthew's Correlation Coefficient (MCC) score measured was 0.55 at an accuracy of 89% and the best AUC iP/R was 68%. Most ACT teams explored machine learning methods, some of them also used lexical resources like MeSH terms, PSI-MI concepts or particular lists of verbs and nouns, some integrated NER approaches. For the IMT, a total of 42 runs were
Topology-function conservation in protein-protein interaction networks.
Davis, Darren; Yaveroğlu, Ömer Nebil; Malod-Dognin, Noël; Stojmirovic, Aleksandar; Pržulj, Nataša
2015-05-15
Proteins underlay the functioning of a cell and the wiring of proteins in protein-protein interaction network (PIN) relates to their biological functions. Proteins with similar wiring in the PIN (topology around them) have been shown to have similar functions. This property has been successfully exploited for predicting protein functions. Topological similarity is also used to guide network alignment algorithms that find similarly wired proteins between PINs of different species; these similarities are used to transfer annotation across PINs, e.g. from model organisms to human. To refine these functional predictions and annotation transfers, we need to gain insight into the variability of the topology-function relationships. For example, a function may be significantly associated with specific topologies, while another function may be weakly associated with several different topologies. Also, the topology-function relationships may differ between different species. To improve our understanding of topology-function relationships and of their conservation among species, we develop a statistical framework that is built upon canonical correlation analysis. Using the graphlet degrees to represent the wiring around proteins in PINs and gene ontology (GO) annotations to describe their functions, our framework: (i) characterizes statistically significant topology-function relationships in a given species, and (ii) uncovers the functions that have conserved topology in PINs of different species, which we term topologically orthologous functions. We apply our framework to PINs of yeast and human, identifying seven biological process and two cellular component GO terms to be topologically orthologous for the two organisms. © The Author 2015. Published by Oxford University Press.
Rackiewicz, Michal; Große-Hovest, Ludger; Alpert, Andrew J; Zarei, Mostafa; Dengjel, Jörn
2017-06-02
Hydrophobic interaction chromatography (HIC) is a robust standard analytical method to purify proteins while preserving their biological activity. It is widely used to study post-translational modifications of proteins and drug-protein interactions. In the current manuscript we employed HIC to separate proteins, followed by bottom-up LC-MS/MS experiments. We used this approach to fractionate antibody species followed by comprehensive peptide mapping as well as to study protein complexes in human cells. HIC-reversed-phase chromatography (RPC)-mass spectrometry (MS) is a powerful alternative to fractionate proteins for bottom-up proteomics experiments making use of their distinct hydrophobic properties.
Directory of Open Access Journals (Sweden)
Srinivasan Narayanaswamy
2010-06-01
Full Text Available Abstract Background Dengue virus along with the other members of the flaviviridae family has reemerged as deadly human pathogens. Understanding the mechanistic details of these infections can be highly rewarding in developing effective antivirals. During maturation of the virus inside the host cell, the coat proteins E and M undergo conformational changes, altering the morphology of the viral coat. However, due to low resolution nature of the available 3-D structures of viral assemblies, the atomic details of these changes are still elusive. Results In the present analysis, starting from Cα positions of low resolution cryo electron microscopic structures the residue level details of protein-protein interaction interfaces of dengue virus coat proteins have been predicted. By comparing the preexisting structures of virus in different phases of life cycle, the changes taking place in these predicted protein-protein interaction interfaces were followed as a function of maturation process of the virus. Besides changing the current notion about the presence of only homodimers in the mature viral coat, the present analysis indicated presence of a proline-rich motif at the protein-protein interaction interface of the coat protein. Investigating the conservation status of these seemingly functionally crucial residues across other members of flaviviridae family enabled dissecting common mechanisms used for infections by these viruses. Conclusions Thus, using computational approach the present analysis has provided better insights into the preexisting low resolution structures of virus assemblies, the findings of which can be made use of in designing effective antivirals against these deadly human pathogens.
The coat protein complex II, COPII, protein Sec13 directly interacts with presenilin-1
International Nuclear Information System (INIS)
Nielsen, Anders Lade
2009-01-01
Mutations in the human gene encoding presenilin-1, PS1, account for most cases of early-onset familial Alzheimer's disease. PS1 has nine transmembrane domains and a large loop orientated towards the cytoplasm. PS1 locates to cellular compartments as endoplasmic reticulum (ER), Golgi apparatus, vesicular structures, and plasma membrane, and is an integral member of γ-secretase, a protein protease complex with specificity for intra-membranous cleavage of substrates such as β-amyloid precursor protein. Here, an interaction between PS1 and the Sec13 protein is described. Sec13 takes part in coat protein complex II, COPII, vesicular trafficking, nuclear pore function, and ER directed protein sequestering and degradation control. The interaction maps to the N-terminal part of the large hydrophilic PS1 loop and the first of the six WD40-repeats present in Sec13. The identified Sec13 interaction to PS1 is a new candidate interaction for linking PS1 to secretory and protein degrading vesicular circuits.
The coat protein complex II, COPII, protein Sec13 directly interacts with presenilin-1
Energy Technology Data Exchange (ETDEWEB)
Nielsen, Anders Lade, E-mail: aln@humgen.au.dk [Department of Human Genetics, The Bartholin Building, University of Aarhus, DK-8000 Aarhus C (Denmark)
2009-10-23
Mutations in the human gene encoding presenilin-1, PS1, account for most cases of early-onset familial Alzheimer's disease. PS1 has nine transmembrane domains and a large loop orientated towards the cytoplasm. PS1 locates to cellular compartments as endoplasmic reticulum (ER), Golgi apparatus, vesicular structures, and plasma membrane, and is an integral member of {gamma}-secretase, a protein protease complex with specificity for intra-membranous cleavage of substrates such as {beta}-amyloid precursor protein. Here, an interaction between PS1 and the Sec13 protein is described. Sec13 takes part in coat protein complex II, COPII, vesicular trafficking, nuclear pore function, and ER directed protein sequestering and degradation control. The interaction maps to the N-terminal part of the large hydrophilic PS1 loop and the first of the six WD40-repeats present in Sec13. The identified Sec13 interaction to PS1 is a new candidate interaction for linking PS1 to secretory and protein degrading vesicular circuits.
Towards a better understanding of the specificity of protein-protein interaction
Czech Academy of Sciences Publication Activity Database
Kysilka, Jiří; Vondrášek, Jiří
2012-01-01
Roč. 25, č. 11 (2012), s. 604-615 ISSN 0952-3499 R&D Projects: GA ČR GAP208/10/0725; GA ČR GAP302/10/0427; GA MŠk(CZ) LH11020 Institutional research plan: CEZ:AV0Z40550506; CEZ:AV0Z50520701 Keywords : protein-protein interaction * molecular recognition * x-ray structure analysis * empirical potentials * side chain-side chain interaction * interaction energy * bioinformatics Subject RIV: CE - Biochemistry Impact factor: 3.006, year: 2012
Predicting and validating protein interactions using network structure.
Directory of Open Access Journals (Sweden)
Pao-Yang Chen
2008-07-01
Full Text Available Protein interactions play a vital part in the function of a cell. As experimental techniques for detection and validation of protein interactions are time consuming, there is a need for computational methods for this task. Protein interactions appear to form a network with a relatively high degree of local clustering. In this paper we exploit this clustering by suggesting a score based on triplets of observed protein interactions. The score utilises both protein characteristics and network properties. Our score based on triplets is shown to complement existing techniques for predicting protein interactions, outperforming them on data sets which display a high degree of clustering. The predicted interactions score highly against test measures for accuracy. Compared to a similar score derived from pairwise interactions only, the triplet score displays higher sensitivity and specificity. By looking at specific examples, we show how an experimental set of interactions can be enriched and validated. As part of this work we also examine the effect of different prior databases upon the accuracy of prediction and find that the interactions from the same kingdom give better results than from across kingdoms, suggesting that there may be fundamental differences between the networks. These results all emphasize that network structure is important and helps in the accurate prediction of protein interactions. The protein interaction data set and the program used in our analysis, and a list of predictions and validations, are available at http://www.stats.ox.ac.uk/bioinfo/resources/PredictingInteractions.
Xie, Zhongqiu; Jia, Yuemeng; Li, Hui
2017-01-01
The study of protein-protein interactions represents a key aspect of biological research. Identifying unknown protein binding partners using mass spectrometry (MS)-based proteomics has evolved into an indispensable strategy in drug discovery. The classic approach of immunoprecipitation with specific antibodies against the proteins of interest has limitations, such as the need for immunoprecipitation-qualified antibody. The biotin AP-tag pull-down system has the advantage of high specificity, ease of use, and no requirement for antibody. It is based on the high specificity, high affinity interaction between biotin and streptavidin. After pulldown, in-gel tryptic digestion and tandem mass spectrometry (MS/MS) analysis of sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) protein bands can be performed. In this work, we provide protocols that can be used for the identification of proteins that interact with FOXM1, a protein that has recently emerged as a potential biomarker and drug target in oncotherapy, as an example. We focus on the pull-down procedure and assess the efficacy of the pulldown with known FOXM1 interactors such as β-catenin. We use a high performance LTQ Orbitrap MSn system that combines rapid LTQ ion trap data acquisition with high mass accuracy Orbitrap analysis to identify the interacting proteins.
Yeast Interacting Proteins Database: YNL258C, YKR022C [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available YNL258C DSL1 Peripheral membrane protein required for Golgi-to-ER retrograde traffi...equired for Golgi-to-ER retrograde traffic; component of the ER target site that interacts with coatomer, th...it ORF YNL258C Bait gene name DSL1 Bait description Peripheral membrane protein r
Haidar, Malak
2017-09-08
One powerful application of cell penetrating peptides is the delivery into cells of molecules that function as specific competitors or inhibitors of protein-protein interactions. Ablating defined protein-protein interactions is a refined way to explore their contribution to a particular cellular phenotype in a given disease context. Cell-penetrating peptides can be synthetically constrained through various chemical modifications that stabilize a given structural fold with the potential to improve competitive binding to specific targets. Theileria-transformed leukocytes display high PKA activity, but PKAis an enzyme that plays key roles in multiple cellular processes; consequently genetic ablation of kinase activity gives rise to a myriad of confounding phenotypes. By contrast, ablation of a specific kinase-substrate interaction has the potential to give more refined information and we illustrate this here by describing how surgically ablating PKA interactions with BAD gives precise information on the type of glycolysis performed by Theileria-transformed leukocytes. In addition, we provide two other examples of how ablating specific protein-protein interactions in Theileria-infected leukocytes leads to precise phenotypes and argue that constrained penetrating peptides have great therapeutic potential to combat infectious diseases in general.
Agrawal, Neeraj J; Helk, Bernhard; Trout, Bernhardt L
2014-01-21
Identifying hot-spot residues - residues that are critical to protein-protein binding - can help to elucidate a protein's function and assist in designing therapeutic molecules to target those residues. We present a novel computational tool, termed spatial-interaction-map (SIM), to predict the hot-spot residues of an evolutionarily conserved protein-protein interaction from the structure of an unbound protein alone. SIM can predict the protein hot-spot residues with an accuracy of 36-57%. Thus, the SIM tool can be used to predict the yet unknown hot-spot residues for many proteins for which the structure of the protein-protein complexes are not available, thereby providing a clue to their functions and an opportunity to design therapeutic molecules to target these proteins. Copyright © 2013 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
P³DB 3.0: From plant phosphorylation sites to protein networks.
Yao, Qiuming; Ge, Huangyi; Wu, Shangquan; Zhang, Ning; Chen, Wei; Xu, Chunhui; Gao, Jianjiong; Thelen, Jay J; Xu, Dong
2014-01-01
In the past few years, the Plant Protein Phosphorylation Database (P(3)DB, http://p3db.org) has become one of the most significant in vivo data resources for studying plant phosphoproteomics. We have substantially updated P(3)DB with respect to format, new datasets and analytic tools. In the P(3)DB 3.0, there are altogether 47 923 phosphosites in 16 477 phosphoproteins curated across nine plant organisms from 32 studies, which have met our multiple quality standards for acquisition of in vivo phosphorylation site data. Centralized by these phosphorylation data, multiple related data and annotations are provided, including protein-protein interaction (PPI), gene ontology, protein tertiary structures, orthologous sequences, kinase/phosphatase classification and Kinase Client Assay (KiC Assay) data--all of which provides context for the phosphorylation event. In addition, P(3)DB 3.0 incorporates multiple network viewers for the above features, such as PPI network, kinase-substrate network, phosphatase-substrate network, and domain co-occurrence network to help study phosphorylation from a systems point of view. Furthermore, the new P(3)DB reflects a community-based design through which users can share datasets and automate data depository processes for publication purposes. Each of these new features supports the goal of making P(3)DB a comprehensive, systematic and interactive platform for phosphoproteomics research.
Ozawa, Motoyasu; Ozawa, Tomonaga; Ueda, Kazuyoshi
2017-06-01
The molecular interactions of inhibitors of bromodomains (BRDs) were investigated. BRDs are protein interaction modules that recognizing ε-N-acetyl-lysine (εAc-Lys) motifs found in histone tails and are promising protein-protein interaction (PPI) targets. First, we analyzed a peptide ligand containing εAc-Lys to evaluate native PPIs. We then analyzed tetrahydroquinazoline-6-yl-benzensulfonamide derivatives found by fragment-based drug design (FBDD) and examined their interactions with the protein compared with the peptide ligand in terms of the inter-fragment interaction energy. In addition, we analyzed benzodiazepine derivatives that are high-affinity ligands for BRDs and examined differences in the CH/π interactions of the amino acid residues. We further surveyed changes in the charges of the amino acid residues among individual ligands, performed pair interaction energy decomposition analysis and estimated the water profile within the ligand binding site. Thus, useful insights for drug design were provided. Through these analyses and considerations, we show that the FMO method is a useful drug design tool to evaluate the process of FBDD and to explore PPI inhibitors. Copyright © 2017 Elsevier Inc. All rights reserved.
Liu, Fan; Heck, Albert J R
2015-01-01
Proteins are involved in almost all processes of the living cell. They are organized through extensive networks of interaction, by tightly bound macromolecular assemblies or more transiently via signaling nodes. Therefore, revealing the architecture of protein complexes and protein interaction
McLaren, Donald G.; Ries, Michele L.; Xu, Guofan; Johnson, Sterling C.
2012-01-01
Functional MRI (fMRI) allows one to study task-related regional responses and task-dependent connectivity analysis using psychophysiological interaction (PPI) methods. The latter affords the additional opportunity to understand how brain regions interact in a task-dependent manner. The current implementation of PPI in Statistical Parametric Mapping (SPM8) is configured primarily to assess connectivity differences between two task conditions, when in practice fMRI tasks frequently employ more than two conditions. Here we evaluate how a generalized form of context-dependent PPI (gPPI; http://www.nitrc.org/projects/gppi), which is configured to automatically accommodate more than two task conditions in the same PPI model by spanning the entire experimental space, compares to the standard implementation in SPM8. These comparisons are made using both simulations and an empirical dataset. In the simulated dataset, we compare the interaction beta estimates to their expected values and model fit using the Akaike Information Criterion (AIC). We found that interaction beta estimates in gPPI were robust to different simulated data models, were not different from the expected beta value, and had better model fits than when using standard PPI (sPPI) methods. In the empirical dataset, we compare the model fit of the gPPI approach to sPPI. We found that the gPPI approach improved model fit compared to sPPI. There were several regions that became non-significant with gPPI. These regions all showed significantly better model fits with gPPI. Also, there were several regions where task-dependent connectivity was only detected using gPPI methods, also with improved model fit. Regions that were detected with all methods had more similar model fits. These results suggest that gPPI may have greater sensitivity and specificity than standard implementation in SPM. This notion is tempered slightly as there is no gold standard; however, data simulations with a known outcome support our
Directory of Open Access Journals (Sweden)
Wang Yong
2011-10-01
Full Text Available Abstract Background With the development of genome-sequencing technologies, protein sequences are readily obtained by translating the measured mRNAs. Therefore predicting protein-protein interactions from the sequences is of great demand. The reason lies in the fact that identifying protein-protein interactions is becoming a bottleneck for eventually understanding the functions of proteins, especially for those organisms barely characterized. Although a few methods have been proposed, the converse problem, if the features used extract sufficient and unbiased information from protein sequences, is almost untouched. Results In this study, we interrogate this problem theoretically by an optimization scheme. Motivated by the theoretical investigation, we find novel encoding methods for both protein sequences and protein pairs. Our new methods exploit sufficiently the information of protein sequences and reduce artificial bias and computational cost. Thus, it significantly outperforms the available methods regarding sensitivity, specificity, precision, and recall with cross-validation evaluation and reaches ~80% and ~90% accuracy in Escherichia coli and Saccharomyces cerevisiae respectively. Our findings here hold important implication for other sequence-based prediction tasks because representation of biological sequence is always the first step in computational biology. Conclusions By considering the converse problem, we propose new representation methods for both protein sequences and protein pairs. The results show that our method significantly improves the accuracy of protein-protein interaction predictions.
Yang, Huiying; Ke, Yuehua; Wang, Jian; Tan, Yafang; Myeni, Sebenzile K; Li, Dong; Shi, Qinghai; Yan, Yanfeng; Chen, Hui; Guo, Zhaobiao; Yuan, Yanzhi; Yang, Xiaoming; Yang, Ruifu; Du, Zongmin
2011-11-01
A Yersinia pestis-human protein interaction network is reported here to improve our understanding of its pathogenesis. Up to 204 interactions between 66 Y. pestis bait proteins and 109 human proteins were identified by yeast two-hybrid assay and then combined with 23 previously published interactions to construct a protein-protein interaction network. Topological analysis of the interaction network revealed that human proteins targeted by Y. pestis were significantly enriched in the proteins that are central in the human protein-protein interaction network. Analysis of this network showed that signaling pathways important for host immune responses were preferentially targeted by Y. pestis, including the pathways involved in focal adhesion, regulation of cytoskeleton, leukocyte transendoepithelial migration, and Toll-like receptor (TLR) and mitogen-activated protein kinase (MAPK) signaling. Cellular pathways targeted by Y. pestis are highly relevant to its pathogenesis. Interactions with host proteins involved in focal adhesion and cytoskeketon regulation pathways could account for resistance of Y. pestis to phagocytosis. Interference with TLR and MAPK signaling pathways by Y. pestis reflects common characteristics of pathogen-host interaction that bacterial pathogens have evolved to evade host innate immune response by interacting with proteins in those signaling pathways. Interestingly, a large portion of human proteins interacting with Y. pestis (16/109) also interacted with viral proteins (Epstein-Barr virus [EBV] and hepatitis C virus [HCV]), suggesting that viral and bacterial pathogens attack common cellular functions to facilitate infections. In addition, we identified vasodilator-stimulated phosphoprotein (VASP) as a novel interaction partner of YpkA and showed that YpkA could inhibit in vitro actin assembly mediated by VASP.
Directory of Open Access Journals (Sweden)
Swapnil R Chhabra
Full Text Available Protein-protein interactions offer an insight into cellular processes beyond what may be obtained by the quantitative functional genomics tools of proteomics and transcriptomics. The aforementioned tools have been extensively applied to study Escherichia coli and other aerobes and more recently to study the stress response behavior of Desulfovibrio vulgaris Hildenborough, a model obligate anaerobe and sulfate reducer and the subject of this study. Here we carried out affinity purification followed by mass spectrometry to reconstruct an interaction network among 12 chromosomally encoded bait and 90 prey proteins based on 134 bait-prey interactions identified to be of high confidence. Protein-protein interaction data are often plagued by the lack of adequate controls and replication analyses necessary to assess confidence in the results, including identification of potential false positives. We addressed these issues through the use of biological replication, exponentially modified protein abundance indices, results from an experimental negative control, and a statistical test to assign confidence to each putative interacting pair applicable to small interaction data studies. We discuss the biological significance of metabolic features of D. vulgaris revealed by these protein-protein interaction data and the observed protein modifications. These include the distinct role of the putative carbon monoxide-induced hydrogenase, unique electron transfer routes associated with different oxidoreductases, and the possible role of methylation in regulating sulfate reduction.
Femtosecond UV-laser pulses to unveil protein-protein interactions in living cells.
Itri, Francesco; Monti, Daria M; Della Ventura, Bartolomeo; Vinciguerra, Roberto; Chino, Marco; Gesuele, Felice; Lombardi, Angelina; Velotta, Raffaele; Altucci, Carlo; Birolo, Leila; Piccoli, Renata; Arciello, Angela
2016-02-01
A hallmark to decipher bioprocesses is to characterize protein-protein interactions in living cells. To do this, the development of innovative methodologies, which do not alter proteins and their natural environment, is particularly needed. Here, we report a method (LUCK, Laser UV Cross-linKing) to in vivo cross-link proteins by UV-laser irradiation of living cells. Upon irradiation of HeLa cells under controlled conditions, cross-linked products of glyceraldehyde-3-phosphate dehydrogenase (GAPDH) were detected, whose yield was found to be a linear function of the total irradiation energy. We demonstrated that stable dimers of GAPDH were formed through intersubunit cross-linking, as also observed when the pure protein was irradiated by UV-laser in vitro. We proposed a defined patch of aromatic residues located at the enzyme subunit interface as the cross-linking sites involved in dimer formation. Hence, by this technique, UV-laser is able to photofix protein surfaces that come in direct contact. Due to the ultra-short time scale of UV-laser-induced cross-linking, this technique could be extended to weld even transient protein interactions in their native context.
Interaction of a non-histone chromatin protein (high-mobility group protein 2) with DNA
International Nuclear Information System (INIS)
Goodwin, G.H.; Shooter, K.V.; Johns, E.W.
1975-01-01
The interaction with DNA of the calf thymus chromatin non-histone protein termed the high-mobility group protein 2 has been studied by sedimentation analysis in the ultracentrifuge and by measuring the binding of the 125 I-labelled protein to DNA. The results have been compared with those obtained previously by us [Eur. J. Biochem. (1974) 47, 263-270] for the interaction of high-mobility group protein 1 with DNA. Although the binding parameters are similar for these two proteins, high-mobility group protein 2 differs from high-mobility group protein 1 in that the former appears to change the shape of the DNA to a more compact form. The molecular weight of high-mobility group protein 2 has been determined by equilibrium sedimentation and a mean value of 26,000 was obtained. A low level of nuclease activity detected in one preparation of high-mobility group protein 2 has been investigated. (orig.) [de
Prediction of localization and interactions of apoptotic proteins
Directory of Open Access Journals (Sweden)
Matula Pavel
2009-07-01
Full Text Available Abstract During apoptosis several mitochondrial proteins are released. Some of them participate in caspase-independent nuclear DNA degradation, especially apoptosis-inducing factor (AIF and endonuclease G (endoG. Another interesting protein, which was expected to act similarly as AIF due to the high sequence homology with AIF is AIF-homologous mitochondrion-associated inducer of death (AMID. We studied the structure, cellular localization, and interactions of several proteins in silico and also in cells using fluorescent microscopy. We found the AMID protein to be cytoplasmic, most probably incorporated into the cytoplasmic side of the lipid membranes. Bioinformatic predictions were conducted to analyze the interactions of the studied proteins with each other and with other possible partners. We conducted molecular modeling of proteins with unknown 3D structures. These models were then refined by MolProbity server and employed in molecular docking simulations of interactions. Our results show data acquired using a combination of modern in silico methods and image analysis to understand the localization, interactions and functions of proteins AMID, AIF, endonuclease G, and other apoptosis-related proteins.
Lifescience Database Archive (English)
Full Text Available y are in the reverse direction. *1 A comprehensive two-hybrid analysis to explore the yeast protein interact...s. 2000 Jan 1;28(1):73-6. *2 The yeast proteome database (YPD) and Caenorhabditis elegans proteome database (WormPD): comprehensive...000 Jan 1;28(1):73-6. *3 A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisia
Analysis of Protein-RNA and Protein-Peptide Interactions in Equine Infectious Anemia
Energy Technology Data Exchange (ETDEWEB)
Lee, Jae-Hyung [Iowa State Univ., Ames, IA (United States)
2007-01-01
Macromolecular interactions are essential for virtually all cellular functions including signal transduction processes, metabolic processes, regulation of gene expression and immune responses. This dissertation focuses on the characterization of two important macromolecular interactions involved in the relationship between Equine Infectious Anemia Virus (EIAV) and its host cell in horse: (1) the interaction between the EIAV Rev protein and its binding site, the Rev-responsive element (RRE) and (2) interactions between equine MHC class I molecules and epitope peptides derived from EIAV proteins. EIAV, one of the most divergent members of the lentivirus family, has a single-stranded RNA genome and carries several regulatory and structural proteins within its viral particle. Rev is an essential EIAV regulatory encoded protein that interacts with the viral RRE, a specific binding site in the viral mRNA. Using a combination of experimental and computational methods, the interactions between EIAV Rev and RRE were characterized in detail. EIAV Rev was shown to have a bipartite RNA binding domain contain two arginine rich motifs (ARMs). The RRE secondary structure was determined and specific structural motifs that act as cis-regulatory elements for EIAV Rev-RRE interaction were identified. Interestingly, a structural motif located in the high affinity Rev binding site is well conserved in several diverse lentiviral genoes, including HIV-1. Macromolecular interactions involved in the immune response of the horse to EIAV infection were investigated by analyzing complexes between MHC class I proteins and epitope peptides derived from EIAV Rev, Env and Gag proteins. Computational modeling results provided a mechanistic explanation for the experimental finding that a single amino acid change in the peptide binding domain of the quine MHC class I molecule differentially affectes the recognitino of specific epitopes by EIAV-specific CTL. Together, the findings in this
Protein-surface interactions on stimuli-responsive polymeric biomaterials.
Cross, Michael C; Toomey, Ryan G; Gallant, Nathan D
2016-03-04
Responsive surfaces: a review of the dependence of protein adsorption on the reversible volume phase transition in stimuli-responsive polymers. Specifically addressed are a widely studied subset: thermoresponsive polymers. Findings are also generalizable to other materials which undergo a similarly reversible volume phase transition. As of 2015, over 100,000 articles have been published on stimuli-responsive polymers and many more on protein-biomaterial interactions. Significantly, fewer than 100 of these have focused specifically on protein interactions with stimuli-responsive polymers. These report a clear trend of increased protein adsorption in the collapsed state compared to the swollen state. This control over protein interactions makes stimuli-responsive polymers highly useful in biomedical applications such as wound repair scaffolds, on-demand drug delivery, and antifouling surfaces. Outstanding questions are whether the protein adsorption is reversible with the volume phase transition and whether there is a time-dependence. A clear understanding of protein interactions with stimuli-responsive polymers will advance theoretical models, experimental results, and biomedical applications.
Banda, Srikanth; Cao, Nan; Tse-Dinh, Yuk-Ching
2017-09-15
We report here a distinct mechanism of interaction between topoisomerase I and RNA polymerase in Mycobacterium tuberculosis and Mycobacterium smegmatis that has evolved independently from the previously characterized interaction between bacterial topoisomerase I and RNA polymerase. Bacterial DNA topoisomerase I is responsible for preventing the hyper-negative supercoiling of genomic DNA. The association of topoisomerase I with RNA polymerase during transcription elongation could efficiently relieve transcription-driven negative supercoiling. Our results demonstrate a direct physical interaction between the C-terminal domains of topoisomerase I (TopoI-CTDs) and the β' subunit of RNA polymerase of M. smegmatis in the absence of DNA. The TopoI-CTDs in mycobacteria are evolutionarily unrelated in amino acid sequence and three-dimensional structure to the TopoI-CTD found in the majority of bacterial species outside Actinobacteria, including Escherichia coli. The functional interaction between topoisomerase I and RNA polymerase has evolved independently in mycobacteria and E. coli, with distinctively different structural elements of TopoI-CTD utilized for this protein-protein interaction. Zinc ribbon motifs in E. coli TopoI-CTD are involved in the interaction with RNA polymerase. For M. smegmatis TopoI-CTD, a 27-amino-acid tail that is rich in basic residues at the C-terminal end is responsible for the interaction with RNA polymerase. Overexpression of recombinant TopoI-CTD in M. smegmatis competed with the endogenous topoisomerase I for protein-protein interactions with RNA polymerase. The TopoI-CTD overexpression resulted in decreased survival following treatment with antibiotics and hydrogen peroxide, supporting the importance of the protein-protein interaction between topoisomerase I and RNA polymerase during stress response of mycobacteria. Copyright © 2017 Elsevier Ltd. All rights reserved.
Tacken, M.G.J.; Rottier, P.J.M.; Gielkens, A.L.J.; Peeters, B.P.H.
2000-01-01
Little is known about the intermolecular interactions between the viral proteins of infectious bursal disease virus (IBDV). By using the yeast two-hybrid system, which allows the detection of protein-protein interactions in vivo, all possible interactions were tested by fusing the viral proteins to
Tacken, M.G.J.; Rottier, P.J.M.; Gielkens, A.L.J.; Peeters, B.P.H.
2000-01-01
Little is known about the intermolecular interactions between the viral proteins of infectious bursal disease virus (IBDV). By using the yeast two-hybrid system, which allows the detection of protein-protein interactions in vivo, all possible interactions were tested by fusing the viral proteins to
Protein interactions in genome maintenance as novel antibacterial targets.
Directory of Open Access Journals (Sweden)
Aimee H Marceau
Full Text Available Antibacterial compounds typically act by directly inhibiting essential bacterial enzyme activities. Although this general mechanism of action has fueled traditional antibiotic discovery efforts for decades, new antibiotic development has not kept pace with the emergence of drug resistant bacterial strains. These limitations have severely restricted the therapeutic tools available for treating bacterial infections. Here we test an alternative antibacterial lead-compound identification strategy in which essential protein-protein interactions are targeted rather than enzymatic activities. Bacterial single-stranded DNA-binding proteins (SSBs form conserved protein interaction "hubs" that are essential for recruiting many DNA replication, recombination, and repair proteins to SSB/DNA nucleoprotein substrates. Three small molecules that block SSB/protein interactions are shown to have antibacterial activity against diverse bacterial species. Consistent with a model in which the compounds target multiple SSB/protein interactions, treatment of Bacillus subtilis cultures with the compounds leads to rapid inhibition of DNA replication and recombination, and ultimately to cell death. The compounds also have unanticipated effects on protein synthesis that could be due to a previously unknown role for SSB/protein interactions in translation or to off-target effects. Our results highlight the potential of targeting protein-protein interactions, particularly those that mediate genome maintenance, as a powerful approach for identifying new antibacterial compounds.
A proteomics strategy to elucidate functional protein-protein interactions applied to EGF signaling
DEFF Research Database (Denmark)
Blagoev, B.; Kratchmarova, I.; Ong, S.E.
2003-01-01
Mass spectrometry-based proteomics can reveal protein-protein interactions on a large scale, but it has been difficult to separate background binding from functionally important interactions and still preserve weak binders. To investigate the epidermal growth factor receptor (EGFR) pathway, we em...
Energetics of the protein-DNA-water interaction
Directory of Open Access Journals (Sweden)
Marabotti Anna
2007-01-01
Full Text Available Abstract Background To understand the energetics of the interaction between protein and DNA we analyzed 39 crystallographically characterized complexes with the HINT (Hydropathic INTeractions computational model. HINT is an empirical free energy force field based on solvent partitioning of small molecules between water and 1-octanol. Our previous studies on protein-ligand complexes demonstrated that free energy predictions were significantly improved by taking into account the energetic contribution of water molecules that form at least one hydrogen bond with each interacting species. Results An initial correlation between the calculated HINT scores and the experimentally determined binding free energies in the protein-DNA system exhibited a relatively poor r2 of 0.21 and standard error of ± 1.71 kcal mol-1. However, the inclusion of 261 waters that bridge protein and DNA improved the HINT score-free energy correlation to an r2 of 0.56 and standard error of ± 1.28 kcal mol-1. Analysis of the water role and energy contributions indicate that 46% of the bridging waters act as linkers between amino acids and nucleotide bases at the protein-DNA interface, while the remaining 54% are largely involved in screening unfavorable electrostatic contacts. Conclusion This study quantifies the key energetic role of bridging waters in protein-DNA associations. In addition, the relevant role of hydrophobic interactions and entropy in driving protein-DNA association is indicated by analyses of interaction character showing that, together, the favorable polar and unfavorable polar/hydrophobic-polar interactions (i.e., desolvation mostly cancel.
Kim, Woo-Yeon; Kang, Sungsoo; Kim, Byoung-Chul; Oh, Jeehyun; Cho, Seongwoong; Bhak, Jong; Choi, Jong-Soon
2008-01-01
Background Cyanobacteria are model organisms for studying photosynthesis, carbon and nitrogen assimilation, evolution of plant plastids, and adaptability to environmental stresses. Despite many studies on cyanobacteria, there is no web-based database of their regulatory and signaling protein-protein interaction networks to date. Description We report a database and website SynechoNET that provides predicted protein-protein interactions. SynechoNET shows cyanobacterial domain-domain interactio...
Garamszegi, Sara; Franzosa, Eric A; Xia, Yu
2013-01-01
A central challenge in host-pathogen systems biology is the elucidation of general, systems-level principles that distinguish host-pathogen interactions from within-host interactions. Current analyses of host-pathogen and within-host protein-protein interaction networks are largely limited by their resolution, treating proteins as nodes and interactions as edges. Here, we construct a domain-resolved map of human-virus and within-human protein-protein interaction networks by annotating protein interactions with high-coverage, high-accuracy, domain-centric interaction mechanisms: (1) domain-domain interactions, in which a domain in one protein binds to a domain in a second protein, and (2) domain-motif interactions, in which a domain in one protein binds to a short, linear peptide motif in a second protein. Analysis of these domain-resolved networks reveals, for the first time, significant mechanistic differences between virus-human and within-human interactions at the resolution of single domains. While human proteins tend to compete with each other for domain binding sites by means of sequence similarity, viral proteins tend to compete with human proteins for domain binding sites in the absence of sequence similarity. Independent of their previously established preference for targeting human protein hubs, viral proteins also preferentially target human proteins containing linear motif-binding domains. Compared to human proteins, viral proteins participate in more domain-motif interactions, target more unique linear motif-binding domains per residue, and contain more unique linear motifs per residue. Together, these results suggest that viruses surmount genome size constraints by convergently evolving multiple short linear motifs in order to effectively mimic, hijack, and manipulate complex host processes for their survival. Our domain-resolved analyses reveal unique signatures of pleiotropy, economy, and convergent evolution in viral-host interactions that are
A protein domain interaction interface database: InterPare
Directory of Open Access Journals (Sweden)
Lee Jungsul
2005-08-01
Full Text Available Abstract Background Most proteins function by interacting with other molecules. Their interaction interfaces are highly conserved throughout evolution to avoid undesirable interactions that lead to fatal disorders in cells. Rational drug discovery includes computational methods to identify the interaction sites of lead compounds to the target molecules. Identifying and classifying protein interaction interfaces on a large scale can help researchers discover drug targets more efficiently. Description We introduce a large-scale protein domain interaction interface database called InterPare http://interpare.net. It contains both inter-chain (between chains interfaces and intra-chain (within chain interfaces. InterPare uses three methods to detect interfaces: 1 the geometric distance method for checking the distance between atoms that belong to different domains, 2 Accessible Surface Area (ASA, a method for detecting the buried region of a protein that is detached from a solvent when forming multimers or complexes, and 3 the Voronoi diagram, a computational geometry method that uses a mathematical definition of interface regions. InterPare includes visualization tools to display protein interior, surface, and interaction interfaces. It also provides statistics such as the amino acid propensities of queried protein according to its interior, surface, and interface region. The atom coordinates that belong to interface, surface, and interior regions can be downloaded from the website. Conclusion InterPare is an open and public database server for protein interaction interface information. It contains the large-scale interface data for proteins whose 3D-structures are known. As of November 2004, there were 10,583 (Geometric distance, 10,431 (ASA, and 11,010 (Voronoi diagram entries in the Protein Data Bank (PDB containing interfaces, according to the above three methods. In the case of the geometric distance method, there are 31,620 inter-chain domain
(S)Pinning down protein interactions by NMR
DEFF Research Database (Denmark)
Teilum, Kaare; Kunze, Micha Ben Achim; Erlendsson, Simon
2017-01-01
Protein molecules are highly diverse communication platforms and their interaction repertoire stretches from atoms over small molecules such as sugars and lipids to macromolecules. An important route to understanding molecular communication is to quantitatively describe their interactions...... all types of protein reactions, which can span orders of magnitudes in affinities, reaction rates and lifetimes of states. As the more versatile technique, solution NMR spectroscopy offers a remarkable catalogue of methods that can be successfully applied to the quantitative as well as qualitative...... descriptions of protein interactions. In this review we provide an easy-access approach to NMR for the non-NMR specialist and describe how and when solution state NMR spectroscopy is the method of choice for addressing protein ligand interaction. We describe very briefly the theoretical background...
He, Jieyue; Li, Chaojun; Ye, Baoliu; Zhong, Wei
2012-06-25
Most computational algorithms mainly focus on detecting highly connected subgraphs in PPI networks as protein complexes but ignore their inherent organization. Furthermore, many of these algorithms are computationally expensive. However, recent analysis indicates that experimentally detected protein complexes generally contain Core/attachment structures. In this paper, a Greedy Search Method based on Core-Attachment structure (GSM-CA) is proposed. The GSM-CA method detects densely connected regions in large protein-protein interaction networks based on the edge weight and two criteria for determining core nodes and attachment nodes. The GSM-CA method improves the prediction accuracy compared to other similar module detection approaches, however it is computationally expensive. Many module detection approaches are based on the traditional hierarchical methods, which is also computationally inefficient because the hierarchical tree structure produced by these approaches cannot provide adequate information to identify whether a network belongs to a module structure or not. In order to speed up the computational process, the Greedy Search Method based on Fast Clustering (GSM-FC) is proposed in this work. The edge weight based GSM-FC method uses a greedy procedure to traverse all edges just once to separate the network into the suitable set of modules. The proposed methods are applied to the protein interaction network of S. cerevisiae. Experimental results indicate that many significant functional modules are detected, most of which match the known complexes. Results also demonstrate that the GSM-FC algorithm is faster and more accurate as compared to other competing algorithms. Based on the new edge weight definition, the proposed algorithm takes advantages of the greedy search procedure to separate the network into the suitable set of modules. Experimental analysis shows that the identified modules are statistically significant. The algorithm can reduce the
Prediction and Dissection of Protein-RNA Interactions by Molecular Descriptors.
Liu, Zhi-Ping; Chen, Luonan
2016-01-01
Protein-RNA interactions play crucial roles in numerous biological processes. However, detecting the interactions and binding sites between protein and RNA by traditional experiments is still time consuming and labor costing. Thus, it is of importance to develop bioinformatics methods for predicting protein-RNA interactions and binding sites. Accurate prediction of protein-RNA interactions and recognitions will highly benefit to decipher the interaction mechanisms between protein and RNA, as well as to improve the RNA-related protein engineering and drug design. In this work, we summarize the current bioinformatics strategies of predicting protein-RNA interactions and dissecting protein-RNA interaction mechanisms from local structure binding motifs. In particular, we focus on the feature-based machine learning methods, in which the molecular descriptors of protein and RNA are extracted and integrated as feature vectors of representing the interaction events and recognition residues. In addition, the available methods are classified and compared comprehensively. The molecular descriptors are expected to elucidate the binding mechanisms of protein-RNA interaction and reveal the functional implications from structural complementary perspective.
Liu, Zhen; Xing, Dong; Su, Qian Peter; Zhu, Yun; Zhang, Jiamei; Kong, Xinyu; Xue, Boxin; Wang, Sheng; Sun, Hao; Tao, Yile; Sun, Yujie
2014-07-01
Imaging the location and dynamics of individual interacting protein pairs is essential but often difficult because of the fluorescent background from other paired and non-paired molecules, particularly in the sub-diffraction cellular space. Here we develop a new method combining bimolecular fluorescence complementation and photoactivated localization microscopy for super-resolution imaging and single-molecule tracking of specific protein-protein interactions. The method is used to study the interaction of two abundant proteins, MreB and EF-Tu, in Escherichia coli cells. The super-resolution imaging shows interesting distribution and domain sizes of interacting MreB-EF-Tu pairs as a subpopulation of total EF-Tu. The single-molecule tracking of MreB, EF-Tu and MreB-EF-Tu pairs reveals intriguing localization-dependent heterogonous dynamics and provides valuable insights to understanding the roles of MreB-EF-Tu interactions.
Fung, David C Y; Wilkins, Marc R; Hart, David; Hong, Seok-Hee
2010-07-01
The force-directed layout is commonly used in computer-generated visualizations of protein-protein interaction networks. While it is good for providing a visual outline of the protein complexes and their interactions, it has two limitations when used as a visual analysis method. The first is poor reproducibility. Repeated running of the algorithm does not necessarily generate the same layout, therefore, demanding cognitive readaptation on the investigator's part. The second limitation is that it does not explicitly display complementary biological information, e.g. Gene Ontology, other than the protein names or gene symbols. Here, we present an alternative layout called the clustered circular layout. Using the human DNA replication protein-protein interaction network as a case study, we compared the two network layouts for their merits and limitations in supporting visual analysis.
Wachsmuth, Leah M; Johnson, Meredith G; Gavenonis, Jason
2017-06-01
Parasitic diseases caused by kinetoplastid parasites of the genera Trypanosoma and Leishmania are an urgent public health crisis in the developing world. These closely related species possess a number of multimeric enzymes in highly conserved pathways involved in vital functions, such as redox homeostasis and nucleotide synthesis. Computational alanine scanning of these protein-protein interfaces has revealed a host of potentially ligandable sites on several established and emerging anti-parasitic drug targets. Analysis of interfaces with multiple clustered hotspots has suggested several potentially inhibitable protein-protein interactions that may have been overlooked by previous large-scale analyses focusing solely on secondary structure. These protein-protein interactions provide a promising lead for the development of new peptide and macrocycle inhibitors of these enzymes.
Directory of Open Access Journals (Sweden)
Ellen C Kammula
Full Text Available HIV-1 Nef protein contributes essentially to the pathology of AIDS by a variety of protein-protein-interactions within the host cell. The versatile functionality of Nef is partially attributed to different conformational states and posttranslational modifications, such as myristoylation. Up to now, many interaction partners of Nef have been identified using classical yeast two-hybrid screens. Such screens rely on transcriptional activation of reporter genes in the nucleus to detect interactions. Thus, the identification of Nef interaction partners that are integral membrane proteins, membrane-associated proteins or other proteins that do not translocate into the nucleus is hampered. In the present study, a split-ubiquitin based yeast two-hybrid screen was used to identify novel membrane-localized interaction partners of Nef. More than 80% of the hereby identified interaction partners of Nef are transmembrane proteins. The identified hits are GPM6B, GPM6A, BAP31, TSPAN7, CYB5B, CD320/TCblR, VSIG4, PMEPA1, OCIAD1, ITGB1, CHN1, PH4, CLDN10, HSPA9, APR-3, PEBP1 and B3GNT, which are involved in diverse cellular processes like signaling, apoptosis, neurogenesis, cell adhesion and protein trafficking or quality control. For a subfraction of the hereby identified proteins we present data supporting their direct interaction with HIV-1 Nef. We discuss the results with respect to many phenotypes observed in HIV infected cells and patients. The identified Nef interaction partners may help to further elucidate the molecular basis of HIV-related diseases.
Specificity and evolvability in eukaryotic protein interaction networks.
Directory of Open Access Journals (Sweden)
Pedro Beltrao
2007-02-01
Full Text Available Progress in uncovering the protein interaction networks of several species has led to questions of what underlying principles might govern their organization. Few studies have tried to determine the impact of protein interaction network evolution on the observed physiological differences between species. Using comparative genomics and structural information, we show here that eukaryotic species have rewired their interactomes at a fast rate of approximately 10(-5 interactions changed per protein pair, per million years of divergence. For Homo sapiens this corresponds to 10(3 interactions changed per million years. Additionally we find that the specificity of binding strongly determines the interaction turnover and that different biological processes show significantly different link dynamics. In particular, human proteins involved in immune response, transport, and establishment of localization show signs of positive selection for change of interactions. Our analysis suggests that a small degree of molecular divergence can give rise to important changes at the network level. We propose that the power law distribution observed in protein interaction networks could be partly explained by the cell's requirement for different degrees of protein binding specificity.
Ye, A; Streicher, C; Singh, H
2011-12-01
Whey protein beverages have been shown to be astringent at low pH. In the present study, the interactions between model whey proteins (β-lactoglobulin and lactoferrin) and human saliva in the pH range from 7 to 2 were investigated using particle size, turbidity, and ζ-potential measurements and sodium dodecyl sulfate-PAGE. The correlation between the sensory results of astringency and the physicochemical data was discussed. Strong interactions between β-lactoglobulin and salivary proteins led to an increase in the particle size and turbidity of mixtures of both unheated and heated β-lactoglobulin and human saliva at pH ∼3.4. However, the large particle size and high turbidity that occurred at pH 2.0 were the result of aggregation of human salivary proteins. The intense astringency in whey protein beverages may result from these increases in particle size and turbidity at these pH values and from the aggregation and precipitation of human salivary proteins alone at pH salivary proteins in the interaction is a key factor in the perception of astringency in whey protein beverages. At any pH, the increases in particle size and turbidity were much smaller in mixtures of lactoferrin and saliva, which suggests that aggregation and precipitation may not be the only mechanism linked to the perception of astringency in whey protein. Copyright © 2011 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
López, Yosvany; Nakai, Kenta; Patil, Ashwini
2015-01-01
HitPredict is a consolidated resource of experimentally identified, physical protein-protein interactions with confidence scores to indicate their reliability. The study of genes and their inter-relationships using methods such as network and pathway analysis requires high quality protein-protein interaction information. Extracting reliable interactions from most of the existing databases is challenging because they either contain only a subset of the available interactions, or a mixture of physical, genetic and predicted interactions. Automated integration of interactions is further complicated by varying levels of accuracy of database content and lack of adherence to standard formats. To address these issues, the latest version of HitPredict provides a manually curated dataset of 398 696 physical associations between 70 808 proteins from 105 species. Manual confirmation was used to resolve all issues encountered during data integration. For improved reliability assessment, this version combines a new score derived from the experimental information of the interactions with the original score based on the features of the interacting proteins. The combined interaction score performs better than either of the individual scores in HitPredict as well as the reliability score of another similar database. HitPredict provides a web interface to search proteins and visualize their interactions, and the data can be downloaded for offline analysis. Data usability has been enhanced by mapping protein identifiers across multiple reference databases. Thus, the latest version of HitPredict provides a significantly larger, more reliable and usable dataset of protein-protein interactions from several species for the study of gene groups. Database URL: http://hintdb.hgc.jp/htp. © The Author(s) 2015. Published by Oxford University Press.
Hamon, Véronique; Bourgeas, Raphael; Ducrot, Pierre; Theret, Isabelle; Xuereb, Laura; Basse, Marie Jeanne; Brunel, Jean Michel; Combes, Sebastien; Morelli, Xavier; Roche, Philippe
2014-01-06
Over the last 10 years, protein-protein interactions (PPIs) have shown increasing potential as new therapeutic targets. As a consequence, PPIs are today the most screened target class in high-throughput screening (HTS). The development of broad chemical libraries dedicated to these particular targets is essential; however, the chemical space associated with this 'high-hanging fruit' is still under debate. Here, we analyse the properties of 40 non-redundant small molecules present in the 2P2I database (http://2p2idb.cnrs-mrs.fr/) to define a general profile of orthosteric inhibitors and propose an original protocol to filter general screening libraries using a support vector machine (SVM) with 11 standard Dragon molecular descriptors. The filtering protocol has been validated using external datasets from PubChem BioAssay and results from in-house screening campaigns. This external blind validation demonstrated the ability of the SVM model to reduce the size of the filtered chemical library by eliminating up to 96% of the compounds as well as enhancing the proportion of active compounds by up to a factor of 8. We believe that the resulting chemical space identified in this paper will provide the scientific community with a concrete support to search for PPI inhibitors during HTS campaigns.
L?pez, Yosvany; Nakai, Kenta; Patil, Ashwini
2015-01-01
HitPredict is a consolidated resource of experimentally identified, physical protein?protein interactions with confidence scores to indicate their reliability. The study of genes and their inter-relationships using methods such as network and pathway analysis requires high quality protein?protein interaction information. Extracting reliable interactions from most of the existing databases is challenging because they either contain only a subset of the available interactions, or a mixture of p...
Yeast Interacting Proteins Database: YDL239C, YDR273W [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p...it as prey (1) YDR273W DON1 Meiosis-specific component of the spindle pole body, part of the leading... edge protein (LEP) coat, forms a ring-like structure at the leading edge of the prospore...ption Protein required for spore wall formation, thought to mediate assembly of a Don1p-containing structure at the leading...description Meiosis-specific component of the spindle pole body, part of the leading edge protein (LEP) coat
Directory of Open Access Journals (Sweden)
Sara Garamszegi
Full Text Available A central challenge in host-pathogen systems biology is the elucidation of general, systems-level principles that distinguish host-pathogen interactions from within-host interactions. Current analyses of host-pathogen and within-host protein-protein interaction networks are largely limited by their resolution, treating proteins as nodes and interactions as edges. Here, we construct a domain-resolved map of human-virus and within-human protein-protein interaction networks by annotating protein interactions with high-coverage, high-accuracy, domain-centric interaction mechanisms: (1 domain-domain interactions, in which a domain in one protein binds to a domain in a second protein, and (2 domain-motif interactions, in which a domain in one protein binds to a short, linear peptide motif in a second protein. Analysis of these domain-resolved networks reveals, for the first time, significant mechanistic differences between virus-human and within-human interactions at the resolution of single domains. While human proteins tend to compete with each other for domain binding sites by means of sequence similarity, viral proteins tend to compete with human proteins for domain binding sites in the absence of sequence similarity. Independent of their previously established preference for targeting human protein hubs, viral proteins also preferentially target human proteins containing linear motif-binding domains. Compared to human proteins, viral proteins participate in more domain-motif interactions, target more unique linear motif-binding domains per residue, and contain more unique linear motifs per residue. Together, these results suggest that viruses surmount genome size constraints by convergently evolving multiple short linear motifs in order to effectively mimic, hijack, and manipulate complex host processes for their survival. Our domain-resolved analyses reveal unique signatures of pleiotropy, economy, and convergent evolution in viral
Lin, Wen-Hsien; Liu, Wei-Chung; Hwang, Ming-Jing
2009-03-11
Human cells of various tissue types differ greatly in morphology despite having the same set of genetic information. Some genes are expressed in all cell types to perform house-keeping functions, while some are selectively expressed to perform tissue-specific functions. In this study, we wished to elucidate how proteins encoded by human house-keeping genes and tissue-specific genes are organized in human protein-protein interaction networks. We constructed protein-protein interaction networks for different tissue types using two gene expression datasets and one protein-protein interaction database. We then calculated three network indices of topological importance, the degree, closeness, and betweenness centralities, to measure the network position of proteins encoded by house-keeping and tissue-specific genes, and quantified their local connectivity structure. Compared to a random selection of proteins, house-keeping gene-encoded proteins tended to have a greater number of directly interacting neighbors and occupy network positions in several shortest paths of interaction between protein pairs, whereas tissue-specific gene-encoded proteins did not. In addition, house-keeping gene-encoded proteins tended to connect with other house-keeping gene-encoded proteins in all tissue types, whereas tissue-specific gene-encoded proteins also tended to connect with other tissue-specific gene-encoded proteins, but only in approximately half of the tissue types examined. Our analysis showed that house-keeping gene-encoded proteins tend to occupy important network positions, while those encoded by tissue-specific genes do not. The biological implications of our findings were discussed and we proposed a hypothesis regarding how cells organize their protein tools in protein-protein interaction networks. Our results led us to speculate that house-keeping gene-encoded proteins might form a core in human protein-protein interaction networks, while clusters of tissue-specific gene
Catching the PEG-induced attractive interaction between proteins.
Vivarès, D; Belloni, L; Tardieu, A; Bonneté, F
2002-09-01
We present the experimental and theoretical background of a method to characterize the protein-protein attractive potential induced by one of the mostly used crystallizing agents in the protein-field, the poly(ethylene glycol) (PEG). This attractive interaction is commonly called, in colloid physics, the depletion interaction. Small-Angle X-ray Scattering experiments and numerical treatments based on liquid-state theories were performed on urate oxidase-PEG mixtures with two different PEGs (3350 Da and 8000 Da). A "two-component" approach was used in which the polymer-polymer, the protein-polymer and the protein-protein pair potentials were determined. The resulting effective protein-protein potential was characterized. This potential is the sum of the free-polymer protein-protein potential and of the PEG-induced depletion potential. The depletion potential was found to be hardly dependent upon the protein concentration but strongly function of the polymer size and concentration. Our results were also compared with two models, which give an analytic expression for the depletion potential.
Interactions between whey proteins and kaolinite surfaces
Energy Technology Data Exchange (ETDEWEB)
Barral, S. [Department of Chemical Engineering and Environmental Technology, University of Oviedo, Julian Claveria 8, 33006 Oviedo (Spain); Villa-Garcia, M.A. [Department of Organic and Inorganic Chemistry, University of Oviedo, Julian Claveria 8, 33006 Oviedo (Spain)], E-mail: mavg@uniovi.es; Rendueles, M. [Project Management Area, University of Oviedo, Independencia 13, 33004 Oviedo (Spain); Diaz, M. [Department of Chemical Engineering and Environmental Technology, University of Oviedo, Julian Claveria 8, 33006 Oviedo (Spain)
2008-07-15
The nature of the interactions between whey proteins and kaolinite surfaces was investigated by adsorption-desorption experiments at room temperature, performed at the isoelectric point (IEP) of the proteins and at pH 7. It was found that kaolinite is a strong adsorbent for proteins, reaching the maximum adsorption capacity at the IEP of each protein. At pH 7.0, the retention capacity decreased considerably. The adsorption isotherms showed typical Langmuir characteristics. X-ray diffraction data for the protein-kaolinite complexes showed that protein molecules were not intercalated in the mineral structure, but immobilized at the external surfaces and the edges of the kaolinite. Fourier transform IR results indicate the absence of hydrogen bonding between kaolinite surfaces and the polypeptide chain. The adsorption patterns appear to be related to electrostatic interactions, although steric effects should be also considered.
Interactions between whey proteins and kaolinite surfaces
International Nuclear Information System (INIS)
Barral, S.; Villa-Garcia, M.A.; Rendueles, M.; Diaz, M.
2008-01-01
The nature of the interactions between whey proteins and kaolinite surfaces was investigated by adsorption-desorption experiments at room temperature, performed at the isoelectric point (IEP) of the proteins and at pH 7. It was found that kaolinite is a strong adsorbent for proteins, reaching the maximum adsorption capacity at the IEP of each protein. At pH 7.0, the retention capacity decreased considerably. The adsorption isotherms showed typical Langmuir characteristics. X-ray diffraction data for the protein-kaolinite complexes showed that protein molecules were not intercalated in the mineral structure, but immobilized at the external surfaces and the edges of the kaolinite. Fourier transform IR results indicate the absence of hydrogen bonding between kaolinite surfaces and the polypeptide chain. The adsorption patterns appear to be related to electrostatic interactions, although steric effects should be also considered
Directory of Open Access Journals (Sweden)
Mo Min
2008-05-01
Full Text Available Abstract Background The polyadenylation of mRNA is one of the critical processing steps during expression of almost all eukaryotic genes. It is tightly integrated with transcription, particularly its termination, as well as other RNA processing events, i.e. capping and splicing. The poly(A tail protects the mRNA from unregulated degradation, and it is required for nuclear export and translation initiation. In recent years, it has been demonstrated that the polyadenylation process is also involved in the regulation of gene expression. The polyadenylation process requires two components, the cis-elements on the mRNA and a group of protein factors that recognize the cis-elements and produce the poly(A tail. Here we report a comprehensive pairwise protein-protein interaction mapping and gene expression profiling of the mRNA polyadenylation protein machinery in Arabidopsis. Results By protein sequence homology search using human and yeast polyadenylation factors, we identified 28 proteins that may be components of Arabidopsis polyadenylation machinery. To elucidate the protein network and their functions, we first tested their protein-protein interaction profiles. Out of 320 pair-wise protein-protein interaction assays done using the yeast two-hybrid system, 56 (~17% showed positive interactions. 15 of these interactions were further tested, and all were confirmed by co-immunoprecipitation and/or in vitro co-purification. These interactions organize into three distinct hubs involving the Arabidopsis polyadenylation factors. These hubs are centered around AtCPSF100, AtCLPS, and AtFIPS. The first two are similar to complexes seen in mammals, while the third one stands out as unique to plants. When comparing the gene expression profiles extracted from publicly available microarray datasets, some of the polyadenylation related genes showed tissue-specific expression, suggestive of potential different polyadenylation complex configurations. Conclusion An
Coevolution of interacting fertilization proteins.
Directory of Open Access Journals (Sweden)
Nathaniel L Clark
2009-07-01
Full Text Available Reproductive proteins are among the fastest evolving in the proteome, often due to the consequences of positive selection, and their rapid evolution is frequently attributed to a coevolutionary process between interacting female and male proteins. Such a process could leave characteristic signatures at coevolving genes. One signature of coevolution, predicted by sexual selection theory, is an association of alleles between the two genes. Another predicted signature is a correlation of evolutionary rates during divergence due to compensatory evolution. We studied female-male coevolution in the abalone by resequencing sperm lysin and its interacting egg coat protein, VERL, in populations of two species. As predicted, we found intergenic linkage disequilibrium between lysin and VERL, despite our demonstration that they are not physically linked. This finding supports a central prediction of sexual selection using actual genotypes, that of an association between a male trait and its female preference locus. We also created a novel likelihood method to show that lysin and VERL have experienced correlated rates of evolution. These two signatures of coevolution can provide statistical rigor to hypotheses of coevolution and could be exploited for identifying coevolving proteins a priori. We also present polymorphism-based evidence for positive selection and implicate recent selective events at the specific structural regions of lysin and VERL responsible for their species-specific interaction. Finally, we observed deep subdivision between VERL alleles in one species, which matches a theoretical prediction of sexual conflict. Thus, abalone fertilization proteins illustrate how coevolution can lead to reproductive barriers and potentially drive speciation.
Directory of Open Access Journals (Sweden)
Wan Li
Full Text Available The cardiomyopathies are a group of heart muscle diseases which can be inherited (familial. Identifying potential disease-related proteins is important to understand mechanisms of cardiomyopathies. Experimental identification of cardiomyophthies is costly and labour-intensive. In contrast, bioinformatics approach has a competitive advantage over experimental method. Based on "guilt by association" analysis, we prioritized candidate proteins involving in human cardiomyopathies. We first built weighted human cardiomyopathy-specific protein-protein interaction networks for three subtypes of cardiomyopathies using the known disease proteins from Online Mendelian Inheritance in Man as seeds. We then developed a method in prioritizing disease candidate proteins to rank candidate proteins in the network based on "guilt by association" analysis. It was found that most candidate proteins with high scores shared disease-related pathways with disease seed proteins. These top ranked candidate proteins were related with the corresponding disease subtypes, and were potential disease-related proteins. Cross-validation and comparison with other methods indicated that our approach could be used for the identification of potentially novel disease proteins, which may provide insights into cardiomyopathy-related mechanisms in a more comprehensive and integrated way.
Energy Technology Data Exchange (ETDEWEB)
Chhabra, S.R.; Joachimiak, M.P.; Petzold, C.J.; Zane, G.M.; Price, M.N.; Gaucher, S.; Reveco, S.A.; Fok, V.; Johanson, A.R.; Batth, T.S.; Singer, M.; Chandonia, J.M.; Joyner, D.; Hazen, T.C.; Arkin, A.P.; Wall, J.D.; Singh, A.K.; Keasling, J.D.
2011-05-01
Protein–protein interactions offer an insight into cellular processes beyond what may be obtained by the quantitative functional genomics tools of proteomics and transcriptomics. The aforementioned tools have been extensively applied to study E. coli and other aerobes and more recently to study the stress response behavior of Desulfovibrio 5 vulgaris Hildenborough, a model anaerobe and sulfate reducer. In this paper we present the first attempt to identify protein-protein interactions in an obligate anaerobic bacterium. We used suicide vector-assisted chromosomal modification of 12 open reading frames encoded by this sulfate reducer to append an eight amino acid affinity tag to the carboxy-terminus of the chosen proteins. Three biological replicates of the 10 ‘pulled-down’ proteins were separated and analyzed using liquid chromatography-mass spectrometry. Replicate agreement ranged between 35% and 69%. An interaction network among 12 bait and 90 prey proteins was reconstructed based on 134 bait-prey interactions computationally identified to be of high confidence. We discuss the biological significance of several unique metabolic features of D. vulgaris revealed by this protein-protein interaction data 15 and protein modifications that were observed. These include the distinct role of the putative carbon monoxide-induced hydrogenase, unique electron transfer routes associated with different oxidoreductases, and the possible role of methylation in regulating sulfate reduction.
Adal, Eda; Sadeghpour, Amin; Connell, Simon; Rappolt, Michael; Ibanoglu, Esra; Sarkar, Anwesha
2017-02-13
Associative electrostatic interactions between two oppositely charged globular proteins, lactoferrin (LF) and pea protein isolate (PPI), the latter being a mixture of vicilin, legumin, and convicilin, was studied with a specific PPI/LF molar ratio at room temperature. Structural aspects of the electrostatic complexes probed at different length scales were investigated as a function of pH by means of different complementary techniques, namely, with dynamic light scattering, small-angle X-ray scattering (SAXS), turbidity measurements, and atomic force microscopy (AFM). Irrespective of the applied techniques, the results consistently displayed that complexation between LF and PPI did occur. In an optimum narrow range of pH 5.0-5.8, a viscous liquid phase of complex coacervate was obtained upon mild centrifugation of the turbid LF-PPI mixture with a maximum R h , turbidity and the ζ-potential being close to zero observed at pH 5.4. In particular, the SAXS data demonstrated that the coacervates were densely assembled with a roughly spherical size distribution exhibiting a maximum extension of ∼80 nm at pH 5.4. Equally, AFM image analysis showed size distributions containing most frequent cluster sizes around 40-80 nm with spherical to elliptical shapes (axis aspect ratio ≤ 2) as well as less frequent elongated to chainlike structures. The most frequently observed compact complexes, we identify as mainly leading to LF-PPI coacervation, whereas for the less frequent chain-like aggregates, we hypothesize that additionally PPI-PPI facilitated complexes exist.
Findeisen, Felix; Campiglio, Marta; Jo, Hyunil; Abderemane-Ali, Fayal; Rumpf, Christine H; Pope, Lianne; Rossen, Nathan D; Flucher, Bernhard E; DeGrado, William F; Minor, Daniel L
2017-06-21
For many voltage-gated ion channels (VGICs), creation of a properly functioning ion channel requires the formation of specific protein-protein interactions between the transmembrane pore-forming subunits and cystoplasmic accessory subunits. Despite the importance of such protein-protein interactions in VGIC function and assembly, their potential as sites for VGIC modulator development has been largely overlooked. Here, we develop meta-xylyl (m-xylyl) stapled peptides that target a prototypic VGIC high affinity protein-protein interaction, the interaction between the voltage-gated calcium channel (Ca V ) pore-forming subunit α-interaction domain (AID) and cytoplasmic β-subunit (Ca V β). We show using circular dichroism spectroscopy, X-ray crystallography, and isothermal titration calorimetry that the m-xylyl staples enhance AID helix formation are structurally compatible with native-like AID:Ca V β interactions and reduce the entropic penalty associated with AID binding to Ca V β. Importantly, electrophysiological studies reveal that stapled AID peptides act as effective inhibitors of the Ca V α 1 :Ca V β interaction that modulate Ca V function in an Ca V β isoform-selective manner. Together, our studies provide a proof-of-concept demonstration of the use of protein-protein interaction inhibitors to control VGIC function and point to strategies for improved AID-based Ca V modulator design.
Directory of Open Access Journals (Sweden)
Du LinFang
2008-11-01
Full Text Available Abstract Background Researchers interested in analysing the expression patterns of functionally related genes usually hope to improve the accuracy of their results beyond the boundaries of currently available experimental data. Gene ontology (GO data provides a novel way to measure the functional relationship between gene products. Many approaches have been reported for calculating the similarities between two GO terms, known as semantic similarities. However, biologists are more interested in the relationship between gene products than in the scores linking the GO terms. To highlight the relationships among genes, recent studies have focused on functional similarities. Results In this study, we evaluated five functional similarity methods using both protein-protein interaction (PPI and expression data of S. cerevisiae. The receiver operating characteristics (ROC and correlation coefficient analysis of these methods showed that the maximum method outperformed the other methods. Statistical comparison of multiple- and single-term annotated proteins in biological process ontology indicated that genes with multiple GO terms may be more reliable for separating true positives from noise. Conclusion This study demonstrated the reliability of current approaches that elevate the similarity of GO terms to the similarity of proteins. Suggestions for further improvements in functional similarity analysis are also provided.
Positive Selection and Centrality in the Yeast and Fly Protein-Protein Interaction Networks
Directory of Open Access Journals (Sweden)
Sandip Chakraborty
2016-01-01
Full Text Available Proteins within a molecular network are expected to be subject to different selective pressures depending on their relative hierarchical positions. However, it is not obvious what genes within a network should be more likely to evolve under positive selection. On one hand, only mutations at genes with a relatively high degree of control over adaptive phenotypes (such as those encoding highly connected proteins are expected to be “seen” by natural selection. On the other hand, a high degree of pleiotropy at these genes is expected to hinder adaptation. Previous analyses of the human protein-protein interaction network have shown that genes under long-term, recurrent positive selection (as inferred from interspecific comparisons tend to act at the periphery of the network. It is unknown, however, whether these trends apply to other organisms. Here, we show that long-term positive selection has preferentially targeted the periphery of the yeast interactome. Conversely, in flies, genes under positive selection encode significantly more connected and central proteins. These observations are not due to covariation of genes’ adaptability and centrality with confounding factors. Therefore, the distribution of proteins encoded by genes under recurrent positive selection across protein-protein interaction networks varies from one species to another.
PIWI Proteins and PIWI-Interacting RNA
DEFF Research Database (Denmark)
Han, Yi Neng; Li, Yuan; Xia, Sheng Qiang
2017-01-01
tissue types as well and play important roles in transposon silencing, epigenetic regulation, gene and protein regulation, genome rearrangement, spermatogenesis and germ stem-cell maintenance. PIWI proteins were first discovered in Drosophila and they play roles in spermatogenesis, germline stem-cell......P-Element induced wimpy testis (PIWI)-interacting RNAs (piRNAs) are a type of noncoding RNAs (ncRNAs) and interact with PIWI proteins. piRNAs were primarily described in the germline, but emerging evidence revealed that piRNAs are expressed in a tissue-specific manner among multiple human somatic...... maintenance, self-renewal, retrotransposons silencing and the male germline mobility control. A growing number of studies have demonstrated that several piRNA and PIWI proteins are aberrantly expressed in various kinds of cancers and may probably serve as a novel biomarker and therapeutic target for cancer...
Itri, Francesco; Monti, Daria Maria; Chino, Marco; Vinciguerra, Roberto; Altucci, Carlo; Lombardi, Angela; Piccoli, Renata; Birolo, Leila; Arciello, Angela
2017-10-07
The identification of protein-protein interaction networks in living cells is becoming increasingly fundamental to elucidate main biological processes and to understand disease molecular bases on a system-wide level. We recently described a method (LUCK, Laser UV Cross-linKing) to cross-link interacting protein surfaces in living cells by UV laser irradiation. By using this innovative methodology, that does not require any protein modification or cell engineering, here we demonstrate that, upon UV laser irradiation of HeLa cells, a direct interaction between GAPDH and alpha-enolase was "frozen" by a cross-linking event. We validated the occurrence of this direct interaction by co-immunoprecipitation and Immuno-FRET analyses. This represents a proof of principle of the LUCK capability to reveal direct protein interactions in their physiological environment. Copyright © 2017 Elsevier Inc. All rights reserved.
Interactions among tobacco sieve element occlusion (SEO) proteins.
Jekat, Stephan B; Ernst, Antonia M; Zielonka, Sascia; Noll, Gundula A; Prüfer, Dirk
2012-12-01
Angiosperms transport their photoassimilates through sieve tubes, which comprise longitudinally-connected sieve elements. In dicots and also some monocots, the sieve elements contain parietal structural proteins known as phloem proteins or P-proteins. Following injury, P proteins disperse and accumulate as viscous plugs at the sieve plates to prevent the loss of valuable transport sugars. Tobacco (Nicotiana tabacum) P-proteins are multimeric complexes comprising subunits encoded by members of the SEO (sieve element occlusion) gene family. The existence of multiple subunits suggests that P-protein assembly involves interactions between SEO proteins, but this process is largely uncharacterized and it is unclear whether the different subunits perform unique roles or are redundant. We therefore extended our analysis of the tobacco P-proteins NtSEO1 and NtSEO2 to investigate potential interactions between them, and found that both proteins can form homomeric and heteromeric complexes in planta.
The nonstructural protein 8 (nsp8) of the SARS coronavirus interacts with its ORF6 accessory protein
International Nuclear Information System (INIS)
Kumar, Purnima; Gunalan, Vithiagaran; Liu Boping; Chow, Vincent T.K.; Druce, Julian; Birch, Chris; Catton, Mike; Fielding, Burtram C.; Tan, Yee-Joo; Lal, Sunil K.
2007-01-01
Severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV) caused a severe outbreak in several regions of the world in 2003. The SARS-CoV genome is predicted to contain 14 functional open reading frames (ORFs). The first ORF (1a and 1b) encodes a large polyprotein that is cleaved into nonstructural proteins (nsp). The other ORFs encode for four structural proteins (spike, membrane, nucleocapsid and envelope) as well as eight SARS-CoV-specific accessory proteins (3a, 3b, 6, 7a, 7b, 8a, 8b and 9b). In this report we have cloned the predicted nsp8 gene and the ORF6 gene of the SARS-CoV and studied their abilities to interact with each other. We expressed the two proteins as fusion proteins in the yeast two-hybrid system to demonstrate protein-protein interactions and tested the same using a yeast genetic cross. Further the strength of the interaction was measured by challenging growth of the positive interaction clones on increasing gradients of 2-amino trizole. The interaction was then verified by expressing both proteins separately in-vitro in a coupled-transcription translation system and by coimmunoprecipitation in mammalian cells. Finally, colocalization experiments were performed in SARS-CoV infected Vero E6 mammalian cells to confirm the nsp8-ORF6 interaction. To the best of our knowledge, this is the first report of the interaction between a SARS-CoV accessory protein and nsp8 and our findings suggest that ORF6 protein may play a role in virus replication
DEFF Research Database (Denmark)
Tchórzewski, M; Boldyreff, B; Issinger, O
2000-01-01
The surface acidic ribosomal proteins (P-proteins), together with ribosomal core protein P0 form a multimeric lateral protuberance on the 60 S ribosomal subunit. This structure, also called stalk, is important for efficient translational activity of the ribosome. In order to shed more light...... forms the 60 S ribosomal stalk: P0-(P1/P2)(2). Additionally, mutual interactions among human and yeast P-proteins were analyzed. Heterodimer formation could be observed between human P2 and yeast P1 proteins....
Holland, David O; Johnson, Margaret E
2018-03-01
Stoichiometric balance, or dosage balance, implies that proteins that are subunits of obligate complexes (e.g. the ribosome) should have copy numbers expressed to match their stoichiometry in that complex. Establishing balance (or imbalance) is an important tool for inferring subunit function and assembly bottlenecks. We show here that these correlations in protein copy numbers can extend beyond complex subunits to larger protein-protein interactions networks (PPIN) involving a range of reversible binding interactions. We develop a simple method for quantifying balance in any interface-resolved PPINs based on network structure and experimentally observed protein copy numbers. By analyzing such a network for the clathrin-mediated endocytosis (CME) system in yeast, we found that the real protein copy numbers were significantly more balanced in relation to their binding partners compared to randomly sampled sets of yeast copy numbers. The observed balance is not perfect, highlighting both under and overexpressed proteins. We evaluate the potential cost and benefits of imbalance using two criteria. First, a potential cost to imbalance is that 'leftover' proteins without remaining functional partners are free to misinteract. We systematically quantify how this misinteraction cost is most dangerous for strong-binding protein interactions and for network topologies observed in biological PPINs. Second, a more direct consequence of imbalance is that the formation of specific functional complexes depends on relative copy numbers. We therefore construct simple kinetic models of two sub-networks in the CME network to assess multi-protein assembly of the ARP2/3 complex and a minimal, nine-protein clathrin-coated vesicle forming module. We find that the observed, imperfectly balanced copy numbers are less effective than balanced copy numbers in producing fast and complete multi-protein assemblies. However, we speculate that strategic imbalance in the vesicle forming module
Exploring the potential of 3D Zernike descriptors and SVM for protein-protein interface prediction.
Daberdaku, Sebastian; Ferrari, Carlo
2018-02-06
The correct determination of protein-protein interaction interfaces is important for understanding disease mechanisms and for rational drug design. To date, several computational methods for the prediction of protein interfaces have been developed, but the interface prediction problem is still not fully understood. Experimental evidence suggests that the location of binding sites is imprinted in the protein structure, but there are major differences among the interfaces of the various protein types: the characterising properties can vary a lot depending on the interaction type and function. The selection of an optimal set of features characterising the protein interface and the development of an effective method to represent and capture the complex protein recognition patterns are of paramount importance for this task. In this work we investigate the potential of a novel local surface descriptor based on 3D Zernike moments for the interface prediction task. Descriptors invariant to roto-translations are extracted from circular patches of the protein surface enriched with physico-chemical properties from the HQI8 amino acid index set, and are used as samples for a binary classification problem. Support Vector Machines are used as a classifier to distinguish interface local surface patches from non-interface ones. The proposed method was validated on 16 classes of proteins extracted from the Protein-Protein Docking Benchmark 5.0 and compared to other state-of-the-art protein interface predictors (SPPIDER, PrISE and NPS-HomPPI). The 3D Zernike descriptors are able to capture the similarity among patterns of physico-chemical and biochemical properties mapped on the protein surface arising from the various spatial arrangements of the underlying residues, and their usage can be easily extended to other sets of amino acid properties. The results suggest that the choice of a proper set of features characterising the protein interface is crucial for the interface prediction
Directory of Open Access Journals (Sweden)
Sebastian J. Nintemann
2017-11-01
Full Text Available Within the cell, biosynthetic pathways are embedded in protein-protein interaction networks. In Arabidopsis, the biosynthetic pathways of aliphatic and indole glucosinolate defense compounds are well-characterized. However, little is known about the spatial orchestration of these enzymes and their interplay with the cellular environment. To address these aspects, we applied two complementary, untargeted approaches—split-ubiquitin yeast 2-hybrid and co-immunoprecipitation screens—to identify proteins interacting with CYP83A1 and CYP83B1, two homologous enzymes specific for aliphatic and indole glucosinolate biosynthesis, respectively. Our analyses reveal distinct functional networks with substantial interconnection among the identified interactors for both pathway-specific markers, and add to our knowledge about how biochemical pathways are connected to cellular processes. Specifically, a group of protein interactors involved in cell death and the hypersensitive response provides a potential link between the glucosinolate defense compounds and defense against biotrophic pathogens, mediated by protein-protein interactions.
Keane, Harriet; Ryan, Brent J.; Jackson, Brendan; Whitmore, Alan; Wade-Martins, Richard
2015-11-01
Neurodegenerative diseases are complex multifactorial disorders characterised by the interplay of many dysregulated physiological processes. As an exemplar, Parkinson’s disease (PD) involves multiple perturbed cellular functions, including mitochondrial dysfunction and autophagic dysregulation in preferentially-sensitive dopamine neurons, a selective pathophysiology recapitulated in vitro using the neurotoxin MPP+. Here we explore a network science approach for the selection of therapeutic protein targets in the cellular MPP+ model. We hypothesised that analysis of protein-protein interaction networks modelling MPP+ toxicity could identify proteins critical for mediating MPP+ toxicity. Analysis of protein-protein interaction networks constructed to model the interplay of mitochondrial dysfunction and autophagic dysregulation (key aspects of MPP+ toxicity) enabled us to identify four proteins predicted to be key for MPP+ toxicity (P62, GABARAP, GBRL1 and GBRL2). Combined, but not individual, knockdown of these proteins increased cellular susceptibility to MPP+ toxicity. Conversely, combined, but not individual, over-expression of the network targets provided rescue of MPP+ toxicity associated with the formation of autophagosome-like structures. We also found that modulation of two distinct proteins in the protein-protein interaction network was necessary and sufficient to mitigate neurotoxicity. Together, these findings validate our network science approach to multi-target identification in complex neurological diseases.
Rossin, Elizabeth J.; Lage, Kasper; Raychaudhuri, Soumya; Xavier, Ramnik J.; Tatar, Diana; Benita, Yair
2011-01-01
Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein–protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than chance expectation. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting the network indicates common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in
PDZ domain-mediated interactions of G protein-coupled receptors with postsynaptic density protein 95
DEFF Research Database (Denmark)
Møller, Thor C; Wirth, Volker F; Roberts, Nina Ingerslev
2013-01-01
G protein-coupled receptors (GPCRs) constitute the largest family of membrane proteins in the human genome. Their signaling is regulated by scaffold proteins containing PDZ domains, but although these interactions are important for GPCR function, they are still poorly understood. We here present...
Quantitative analysis of protein-ligand interactions by NMR.
Furukawa, Ayako; Konuma, Tsuyoshi; Yanaka, Saeko; Sugase, Kenji
2016-08-01
Protein-ligand interactions have been commonly studied through static structures of the protein-ligand complex. Recently, however, there has been increasing interest in investigating the dynamics of protein-ligand interactions both for fundamental understanding of the underlying mechanisms and for drug development. NMR is a versatile and powerful tool, especially because it provides site-specific quantitative information. NMR has widely been used to determine the dissociation constant (KD), in particular, for relatively weak interactions. The simplest NMR method is a chemical-shift titration experiment, in which the chemical-shift changes of a protein in response to ligand titration are measured. There are other quantitative NMR methods, but they mostly apply only to interactions in the fast-exchange regime. These methods derive the dissociation constant from population-averaged NMR quantities of the free and bound states of a protein or ligand. In contrast, the recent advent of new relaxation-based experiments, including R2 relaxation dispersion and ZZ-exchange, has enabled us to obtain kinetic information on protein-ligand interactions in the intermediate- and slow-exchange regimes. Based on R2 dispersion or ZZ-exchange, methods that can determine the association rate, kon, dissociation rate, koff, and KD have been developed. In these approaches, R2 dispersion or ZZ-exchange curves are measured for multiple samples with different protein and/or ligand concentration ratios, and the relaxation data are fitted to theoretical kinetic models. It is critical to choose an appropriate kinetic model, such as the two- or three-state exchange model, to derive the correct kinetic information. The R2 dispersion and ZZ-exchange methods are suitable for the analysis of protein-ligand interactions with a micromolar or sub-micromolar dissociation constant but not for very weak interactions, which are typical in very fast exchange. This contrasts with the NMR methods that are used
Energy Technology Data Exchange (ETDEWEB)
Sakamoto, Hikaru [Department of Bioproduction, Faculty of Bioindustry, Tokyo University of Agriculture, 196 Yasaka, Abashiri-shi, Hokkaido 093-2422 (Japan); Sakata, Keiko; Kusumi, Kensuke [Department of Biology, Faculty of Sciences, Kyushu University, 6-10-1 Hakozaki, Higashi-ku, Fukuoka 812-8581 (Japan); Kojima, Mikiko; Sakakibara, Hitoshi [RIKEN Plant Science Center, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045 (Japan); Iba, Koh, E-mail: koibascb@kyushu-u.org [Department of Biology, Faculty of Sciences, Kyushu University, 6-10-1 Hakozaki, Higashi-ku, Fukuoka 812-8581 (Japan)
2012-06-29
Highlights: Black-Right-Pointing-Pointer ITN1, a plasma membrane ankyrin protein, interacts with a nuclear DNA-binding protein RTV1. Black-Right-Pointing-Pointer The nuclear transport of RTV1 is partially inhibited by interaction with ITN1. Black-Right-Pointing-Pointer RTV1 can promote the nuclear localization of ITN1. Black-Right-Pointing-Pointer Both overexpression of RTV1 and the lack of ITN1 increase salicylic acids sensitivity in plants. -- Abstract: The increased tolerance to NaCl 1 (ITN1) protein is a plasma membrane (PM)-localized protein involved in responses to NaCl stress in Arabidopsis. The predicted structure of ITN1 is composed of multiple transmembrane regions and an ankyrin-repeat domain that is known to mediate protein-protein interactions. To elucidate the molecular functions of ITN1, we searched for interacting partners using a yeast two-hybrid assay, and a nuclear-localized DNA-binding protein, RTV1, was identified as a candidate. Bimolecular fluorescence complementation analysis revealed that RTV1 interacted with ITN1 at the PM and nuclei in vivo. RTV1 tagged with red fluorescent protein localized to nuclei and ITN1 tagged with green fluorescent protein localized to PM; however, both proteins localized to both nuclei and the PM when co-expressed. These findings suggest that RTV1 and ITN1 regulate the subcellular localization of each other.
Regulation of PCNA-protein interactions for genome stability
DEFF Research Database (Denmark)
Mailand, Niels; Gibbs-Seymour, Ian; Bekker-Jensen, Simon
2013-01-01
Proliferating cell nuclear antigen (PCNA) has a central role in promoting faithful DNA replication, providing a molecular platform that facilitates the myriad protein-protein and protein-DNA interactions that occur at the replication fork. Numerous PCNA-associated proteins compete for binding...
The dynamic multisite interactions between two intrinsically disordered proteins
Wu, Shaowen
2017-05-11
Protein interactions involving intrinsically disordered proteins (IDPs) comprise a variety of binding modes, from the well characterized folding upon binding to dynamic fuzzy complex. To date, most studies concern the binding of an IDP to a structured protein, while the Interaction between two IDPs is poorly understood. In this study, we combined NMR, smFRET, and molecular dynamics (MD) simulation to characterize the interaction between two IDPs, the C-terminal domain (CTD) of protein 4.1G and the nuclear mitotic apparatus (NuMA) protein. It is revealed that CTD and NuMA form a fuzzy complex with remaining structural disorder. Multiple binding sites on both proteins were identified by MD and mutagenesis studies. Our study provides an atomic scenario in which two IDPs bearing multiple binding sites interact with each other in dynamic equilibrium. The combined approach employed here could be widely applicable for investigating IDPs and their dynamic interactions.
Pandey, Naresh; Nobles, Christopher L; Zechiedrich, Lynn; Maresso, Anthony W; Silberg, Jonathan J
2015-05-15
Gene fission can convert monomeric proteins into two-piece catalysts, reporters, and transcription factors for systems and synthetic biology. However, some proteins can be challenging to fragment without disrupting function, such as near-infrared fluorescent protein (IFP). We describe a directed evolution strategy that can overcome this challenge by randomly fragmenting proteins and concomitantly fusing the protein fragments to pairs of proteins or peptides that associate. We used this method to create libraries that express fragmented IFP as fusions to a pair of associating peptides (IAAL-E3 and IAAL-K3) and proteins (CheA and CheY) and screened for fragmented IFP with detectable near-infrared fluorescence. Thirteen novel fragmented IFPs were identified, all of which arose from backbone fission proximal to the interdomain linker. Either the IAAL-E3 and IAAL-K3 peptides or CheA and CheY proteins could assist with IFP fragment complementation, although the IAAL-E3 and IAAL-K3 peptides consistently yielded higher fluorescence. These results demonstrate how random gene fission can be coupled to rational gene fusion to create libraries enriched in fragmented proteins with AND gate logic that is dependent upon a protein-protein interaction, and they suggest that these near-infrared fluorescent protein fragments will be suitable as reporters for pairs of promoters and protein-protein interactions within whole animals.
Identification of proteins that may directly interact with human RPA.
Nakaya, Ryou; Takaya, Junichiro; Onuki, Takeshi; Moritani, Mariko; Nozaki, Naohito; Ishimi, Yukio
2010-11-01
RPA, which consisted of three subunits (RPA1, 2 and 3), plays essential roles in DNA transactions. At the DNA replication forks, RPA binds to single-stranded DNA region to stabilize the structure and to assemble other replication proteins. Interactions between RPA and several replication proteins have been reported but the analysis is not comprehensive. We systematically performed the qualitative analysis to identify RPA interaction partners to understand the protein-protein interaction at the replication forks. We expressed in insect cells the three subunits of human RPA, together with one replication protein, which is present at the forks under normal conditions and/or under the replication stress conditions, to examine the interaction. Among 30 proteins examined in total, it was found that at least 14 proteins interacted with RPA. RPA interacted with MCM3-7, MCM-BP and CDC45 proteins among the proteins that play roles in the initiation and the elongation of the DNA replication. RPA bound with TIPIN, CLASPIN and RAD17, which are involved in the DNA replication checkpoint functions. RPA also bound with cyclin-dependent kinases and an amino-terminal fragment of Rb protein that negatively regulates DNA replication. These results suggest that RPA interacts with the specific proteins among those that play roles in the regulation of the replication fork progression.
Multiplex single-molecule interaction profiling of DNA-barcoded proteins.
Gu, Liangcai; Li, Chao; Aach, John; Hill, David E; Vidal, Marc; Church, George M
2014-11-27
In contrast with advances in massively parallel DNA sequencing, high-throughput protein analyses are often limited by ensemble measurements, individual analyte purification and hence compromised quality and cost-effectiveness. Single-molecule protein detection using optical methods is limited by the number of spectrally non-overlapping chromophores. Here we introduce a single-molecular-interaction sequencing (SMI-seq) technology for parallel protein interaction profiling leveraging single-molecule advantages. DNA barcodes are attached to proteins collectively via ribosome display or individually via enzymatic conjugation. Barcoded proteins are assayed en masse in aqueous solution and subsequently immobilized in a polyacrylamide thin film to construct a random single-molecule array, where barcoding DNAs are amplified into in situ polymerase colonies (polonies) and analysed by DNA sequencing. This method allows precise quantification of various proteins with a theoretical maximum array density of over one million polonies per square millimetre. Furthermore, protein interactions can be measured on the basis of the statistics of colocalized polonies arising from barcoding DNAs of interacting proteins. Two demanding applications, G-protein coupled receptor and antibody-binding profiling, are demonstrated. SMI-seq enables 'library versus library' screening in a one-pot assay, simultaneously interrogating molecular binding affinity and specificity.
Pogmore, Justin P; Pemberton, James M; Chi, Xiaoke; Andrews, David W
2016-01-01
The Bcl-2 family of proteins regulates the process of mitochondrial outer membrane permeabilization, causing the release of cytochrome c and committing a cell to apoptosis. The majority of the functional interactions between these proteins occur at, on, or within the mitochondrial outer membrane, complicating structural studies of the proteins and complexes. As a result most in vitro studies of these protein-protein interactions use truncated proteins and/or detergents which can cause artificial interactions. Herein, we describe a detergent-free, fluorescence-based, in vitro technique to study binding between full-length recombinant Bcl-2 family proteins, particularly cleaved BID (cBID) and BCL-XL, on the membranes of purified mitochondria.
Interaction between Vaccinium bracteatum Thunb. leaf pigment and rice proteins.
Wang, Li; Xu, Yuan; Zhou, Sumei; Qian, Haifeng; Zhang, Hui; Qi, Xiguang; Fan, Meihua
2016-03-01
In this study, we investigated the interaction of Vaccinium bracteatum Thunb. leaf (VBTL) pigment and rice proteins. In the presence of rice protein, VBTL pigment antioxidant activity and free polyphenol content decreased by 67.19% and 68.11%, respectively, and L(∗) of the protein-pigment complex decreased significantly over time. L(∗) values of albumin, globulin and glutelin during 60-min pigment exposure decreased by 55.00, 57.14, and 54.30%, respectively, indicating that these proteins had bound to the pigment. A significant difference in protein surface hydrophobicity was observed between rice proteins and pigment-protein complexes, indicating that hydrophobic interaction is a major binding mechanism between VBTL pigment and rice proteins. A significant difference in secondary structures between proteins and protein-pigment complexes was also uncovered, indicating that hydrogen bonding may be another mode of interaction between VBTL pigment and rice proteins. Our results indicate that VBTL pigment can stain rice proteins with hydrophobic and hydrogen interactions. Copyright © 2015 Elsevier Ltd. All rights reserved.
Structural study of surfactant-dependent interaction with protein
Energy Technology Data Exchange (ETDEWEB)
Mehan, Sumit; Aswal, Vinod K., E-mail: vkaswal@barc.gov.in [Solid State Physics Division, Bhabha Atomic Research Centre, Mumbai 400 085 (India); Kohlbrecher, Joachim [Laboratory for Neutron Scattering, Paul Scherrer Institut, CH-5232 PSI Villigen (Switzerland)
2015-06-24
Small-angle neutron scattering (SANS) has been used to study the complex structure of anionic BSA protein with three different (cationic DTAB, anionic SDS and non-ionic C12E10) surfactants. These systems form very different surfactant-dependent complexes. We show that the structure of protein-surfactant complex is initiated by the site-specific electrostatic interaction between the components, followed by the hydrophobic interaction at high surfactant concentrations. It is also found that hydrophobic interaction is preferred over the electrostatic interaction in deciding the resultant structure of protein-surfactant complexes.
Essential protein discovery based on a combination of modularity and conservatism.
Zhao, Bihai; Wang, Jianxin; Li, Xueyong; Wu, Fang-Xiang
2016-11-01
Essential proteins are indispensable for the survival of a living organism and play important roles in the emerging field of synthetic biology. Many computational methods have been proposed to identify essential proteins by using the topological features of interactome networks. However, most of these methods ignored intrinsic biological meaning of proteins. Researches show that essentiality is tied not only to the protein or gene itself, but also to the molecular modules to which that protein belongs. The results of this study reveal the modularity of essential proteins. On the other hand, essential proteins are more evolutionarily conserved than nonessential proteins and frequently bind each other. That is to say, conservatism is another important feature of essential proteins. Multiple networks are constructed by integrating protein-protein interaction (PPI) networks, time course gene expression data and protein domain information. Based on these networks, a new essential protein identification method is proposed based on a combination of modularity and conservatism of proteins. Experimental results show that the proposed method outperforms other essential protein identification methods in terms of a number essential protein out of top ranked candidates. Copyright © 2016. Published by Elsevier Inc.
Directory of Open Access Journals (Sweden)
Pooja Sharma
2018-06-01
Full Text Available Protein complexes are known to play a major role in controlling cellular activity in a living being. Identifying complexes from raw protein protein interactions (PPIs is an important area of research. Earlier work has been limited mostly to yeast. Such protein complex identification methods, when applied to large human PPIs often give poor performance. We introduce a novel method called CSC to detect protein complexes. The method is evaluated in terms of positive predictive value, sensitivity and accuracy using the datasets of the model organism, yeast and humans. CSC outperforms several other competing algorithms for both organisms. Further, we present a framework to establish the usefulness of CSC in analyzing the influence of a given disease gene in a complex topologically as well as biologically considering eight major association factors. Keywords: Protein complex, Connectivity, Semantic similarity, Contribution
BRCA1 interacts directly with the Fanconi anemia protein FANCA.
Folias, Alexandra; Matkovic, Mara; Bruun, Donald; Reid, Sonja; Hejna, James; Grompe, Markus; D'Andrea, Alan; Moses, Robb
2002-10-01
Fanconi anemia (FA) is a rare autosomal recessive disease characterized by skeletal defects, anemia, chromosomal instability and increased risk of leukemia. At the cellular level FA is characterized by increased sensitivity to agents forming interstrand crosslinks (ICL) in DNA. Six FA genes have been cloned and interactions among individual FANC proteins have been found. The FANCD2 protein co-localizes in nuclear foci with the BRCA1 protein following DNA damage and during S-phase, requiring the FANCA, C, E and G proteins to do so. This finding may reflect a direct role for the BRCA1 protein in double strand break (DSB) repair and interaction with the FANC proteins. Therefore interactions between BRCA1 and the FANC proteins were investigated. Among the known FANC proteins, we find evidence for direct interaction only between the FANCA protein and BRCA1. The evidence rests on three different tests: yeast two-hybrid analysis, coimmunoprecipitation from in vitro synthesis, and coimmunoprecipitation from cell extracts. The amino terminal portion of FANCA and the central part (aa 740-1083) of BRCA1 contain the sites of interaction. The interaction does not depend on DNA damage, thus FANCA and BRCA1 are constitutively interacting. The demonstrated interaction directly connects BRCA1 to the FA pathway of DNA repair.
Evidence for the robustness of protein complexes to inter-species hybridization.
Directory of Open Access Journals (Sweden)
Jean-Baptiste Leducq
Full Text Available Despite the tremendous efforts devoted to the identification of genetic incompatibilities underlying hybrid sterility and inviability, little is known about the effect of inter-species hybridization at the protein interactome level. Here, we develop a screening platform for the comparison of protein-protein interactions (PPIs among closely related species and their hybrids. We examine in vivo the architecture of protein complexes in two yeast species (Saccharomyces cerevisiae and Saccharomyces kudriavzevii that diverged 5-20 million years ago and in their F1 hybrids. We focus on 24 proteins of two large complexes: the RNA polymerase II and the nuclear pore complex (NPC, which show contrasting patterns of molecular evolution. We found that, with the exception of one PPI in the NPC sub-complex, PPIs were highly conserved between species, regardless of protein divergence. Unexpectedly, we found that the architecture of the complexes in F1 hybrids could not be distinguished from that of the parental species. Our results suggest that the conservation of PPIs in hybrids likely results from the slow evolution taking place on the very few protein residues involved in the interaction or that protein complexes are inherently robust and may accommodate protein divergence up to the level that is observed among closely related species.
A lock-and-key model for protein–protein interactions
Morrison, Julie L.; Breitling, Rainer; Higham, Desmond J.; Gilbert, David R.
2006-01-01
Motivation: Protein–protein interaction networks are one of the major post-genomic data sources available to molecular biologists. They provide a comprehensive view of the global interaction structure of an organism’s proteome, as well as detailed information on specific interactions. Here we suggest a physical model of protein interactions that can be used to extract additional information at an intermediate level: It enables us to identify proteins which share biological interaction motifs,...
A Physical Interaction Network of Dengue Virus and Human Proteins*
Khadka, Sudip; Vangeloff, Abbey D.; Zhang, Chaoying; Siddavatam, Prasad; Heaton, Nicholas S.; Wang, Ling; Sengupta, Ranjan; Sahasrabudhe, Sudhir; Randall, Glenn; Gribskov, Michael; Kuhn, Richard J.; Perera, Rushika; LaCount, Douglas J.
2011-01-01
Dengue virus (DENV), an emerging mosquito-transmitted pathogen capable of causing severe disease in humans, interacts with host cell factors to create a more favorable environment for replication. However, few interactions between DENV and human proteins have been reported to date. To identify DENV-human protein interactions, we used high-throughput yeast two-hybrid assays to screen the 10 DENV proteins against a human liver activation domain library. From 45 DNA-binding domain clones containing either full-length viral genes or partially overlapping gene fragments, we identified 139 interactions between DENV and human proteins, the vast majority of which are novel. These interactions involved 105 human proteins, including six previously implicated in DENV infection and 45 linked to the replication of other viruses. Human proteins with functions related to the complement and coagulation cascade, the centrosome, and the cytoskeleton were enriched among the DENV interaction partners. To determine if the cellular proteins were required for DENV infection, we used small interfering RNAs to inhibit their expression. Six of 12 proteins targeted (CALR, DDX3X, ERC1, GOLGA2, TRIP11, and UBE2I) caused a significant decrease in the replication of a DENV replicon. We further showed that calreticulin colocalized with viral dsRNA and with the viral NS3 and NS5 proteins in DENV-infected cells, consistent with a direct role for calreticulin in DENV replication. Human proteins that interacted with DENV had significantly higher average degree and betweenness than expected by chance, which provides additional support for the hypothesis that viruses preferentially target cellular proteins that occupy central position in the human protein interaction network. This study provides a valuable starting point for additional investigations into the roles of human proteins in DENV infection. PMID:21911577
A physical interaction network of dengue virus and human proteins.
Khadka, Sudip; Vangeloff, Abbey D; Zhang, Chaoying; Siddavatam, Prasad; Heaton, Nicholas S; Wang, Ling; Sengupta, Ranjan; Sahasrabudhe, Sudhir; Randall, Glenn; Gribskov, Michael; Kuhn, Richard J; Perera, Rushika; LaCount, Douglas J
2011-12-01
Dengue virus (DENV), an emerging mosquito-transmitted pathogen capable of causing severe disease in humans, interacts with host cell factors to create a more favorable environment for replication. However, few interactions between DENV and human proteins have been reported to date. To identify DENV-human protein interactions, we used high-throughput yeast two-hybrid assays to screen the 10 DENV proteins against a human liver activation domain library. From 45 DNA-binding domain clones containing either full-length viral genes or partially overlapping gene fragments, we identified 139 interactions between DENV and human proteins, the vast majority of which are novel. These interactions involved 105 human proteins, including six previously implicated in DENV infection and 45 linked to the replication of other viruses. Human proteins with functions related to the complement and coagulation cascade, the centrosome, and the cytoskeleton were enriched among the DENV interaction partners. To determine if the cellular proteins were required for DENV infection, we used small interfering RNAs to inhibit their expression. Six of 12 proteins targeted (CALR, DDX3X, ERC1, GOLGA2, TRIP11, and UBE2I) caused a significant decrease in the replication of a DENV replicon. We further showed that calreticulin colocalized with viral dsRNA and with the viral NS3 and NS5 proteins in DENV-infected cells, consistent with a direct role for calreticulin in DENV replication. Human proteins that interacted with DENV had significantly higher average degree and betweenness than expected by chance, which provides additional support for the hypothesis that viruses preferentially target cellular proteins that occupy central position in the human protein interaction network. This study provides a valuable starting point for additional investigations into the roles of human proteins in DENV infection.
González-García, Estefanía; Maly, Marek; de la Mata, Francisco Javier; Gómez, Rafael; Marina, María Luisa; García, María Concepción
2016-11-01
Protein sample preparation is a critical and an unsustainable step since it involves the use of tedious methods that usually require high amount of solvents. The development of new materials offers additional opportunities in protein sample preparation. This work explores, for the first time, the potential application of carboxylate-terminated carbosilane dendrimers to the purification/enrichment of proteins. Studies on dendrimer binding to proteins, based on protein fluorescence intensity and emission wavelengths measurements, demonstrated the interaction between carboxylate-terminated carbosilane dendrimers and proteins at all tested pH levels. Interactions were greatly affected by the protein itself, pH, and dendrimer concentration and generation. Especially interesting was the interaction at acidic pH since it resulted in a significant protein precipitation. Dendrimer-protein interactions were modeled observing stable complexes for all proteins. Carboxylate-terminated carbosilane dendrimers at acidic pH were successfully used in the purification/enrichment of proteins extracted from a complex sample. Graphical Abstract Images showing the growing turbidity of solutions containing a mixture of proteins (lysozyme, myoglobin, and BSA) at different protein:dendrimer ratios (1:0, 1:1, 1:8, and 1:20) at acidic pH and SDS-PAGE profiles of the corresponsing supernatants. Comparison of SDS-PAGE profiles for the pellets obtained during the purification of proteins present in a complex sample using a conventional "no-clean" method based on acetone precipitation and the proposed "greener" method using carboxylate-terminated carbosilane dendrimer at a 1:20 protein:dendrimer ratio.
Unveiling protein functions through the dynamics of the interaction network.
Directory of Open Access Journals (Sweden)
Irene Sendiña-Nadal
Full Text Available Protein interaction networks have become a tool to study biological processes, either for predicting molecular functions or for designing proper new drugs to regulate the main biological interactions. Furthermore, such networks are known to be organized in sub-networks of proteins contributing to the same cellular function. However, the protein function prediction is not accurate and each protein has traditionally been assigned to only one function by the network formalism. By considering the network of the physical interactions between proteins of the yeast together with a manual and single functional classification scheme, we introduce a method able to reveal important information on protein function, at both micro- and macro-scale. In particular, the inspection of the properties of oscillatory dynamics on top of the protein interaction network leads to the identification of misclassification problems in protein function assignments, as well as to unveil correct identification of protein functions. We also demonstrate that our approach can give a network representation of the meta-organization of biological processes by unraveling the interactions between different functional classes.
Prediction of protein-protein interaction sites in sequences and 3D structures by random forests.
Directory of Open Access Journals (Sweden)
Mile Sikić
2009-01-01
Full Text Available Identifying interaction sites in proteins provides important clues to the function of a protein and is becoming increasingly relevant in topics such as systems biology and drug discovery. Although there are numerous papers on the prediction of interaction sites using information derived from structure, there are only a few case reports on the prediction of interaction residues based solely on protein sequence. Here, a sliding window approach is combined with the Random Forests method to predict protein interaction sites using (i a combination of sequence- and structure-derived parameters and (ii sequence information alone. For sequence-based prediction we achieved a precision of 84% with a 26% recall and an F-measure of 40%. When combined with structural information, the prediction performance increases to a precision of 76% and a recall of 38% with an F-measure of 51%. We also present an attempt to rationalize the sliding window size and demonstrate that a nine-residue window is the most suitable for predictor construction. Finally, we demonstrate the applicability of our prediction methods by modeling the Ras-Raf complex using predicted interaction sites as target binding interfaces. Our results suggest that it is possible to predict protein interaction sites with quite a high accuracy using only sequence information.
Reciprocal carbonyl-carbonyl interactions in small molecules and proteins.
Rahim, Abdur; Saha, Pinaki; Jha, Kunal Kumar; Sukumar, Nagamani; Sarma, Bani Kanta
2017-07-19
Carbonyl-carbonyl n→π* interactions where a lone pair (n) of the oxygen atom of a carbonyl group is delocalized over the π* orbital of a nearby carbonyl group have attracted a lot of attention in recent years due to their ability to affect the 3D structure of small molecules, polyesters, peptides, and proteins. In this paper, we report the discovery of a "reciprocal" carbonyl-carbonyl interaction with substantial back and forth n→π* and π→π* electron delocalization between neighboring carbonyl groups. We have carried out experimental studies, analyses of crystallographic databases and theoretical calculations to show the presence of this interaction in both small molecules and proteins. In proteins, these interactions are primarily found in polyproline II (PPII) helices. As PPII are the most abundant secondary structures in unfolded proteins, we propose that these local interactions may have implications in protein folding.Carbonyl-carbonyl π* non covalent interactions affect the structure and stability of small molecules and proteins. Here, the authors carry out experimental studies, analyses of crystallographic databases and theoretical calculations to describe an additional type of carbonyl-carbonyl interaction.
Multiplex single-molecule interaction profiling of DNA barcoded proteins
Gu, Liangcai; Li, Chao; Aach, John; Hill, David E.; Vidal, Marc; Church, George M.
2014-01-01
In contrast with advances in massively parallel DNA sequencing1, high-throughput protein analyses2-4 are often limited by ensemble measurements, individual analyte purification and hence compromised quality and cost-effectiveness. Single-molecule (SM) protein detection achieved using optical methods5 is limited by the number of spectrally nonoverlapping chromophores. Here, we introduce a single molecular interaction-sequencing (SMI-Seq) technology for parallel protein interaction profiling leveraging SM advantages. DNA barcodes are attached to proteins collectively via ribosome display6 or individually via enzymatic conjugation. Barcoded proteins are assayed en masse in aqueous solution and subsequently immobilized in a polyacrylamide (PAA) thin film to construct a random SM array, where barcoding DNAs are amplified into in situ polymerase colonies (polonies)7 and analyzed by DNA sequencing. This method allows precise quantification of various proteins with a theoretical maximum array density of over one million polonies per square millimeter. Furthermore, protein interactions can be measured based on the statistics of colocalized polonies arising from barcoding DNAs of interacting proteins. Two demanding applications, G-protein coupled receptor (GPCR) and antibody binding profiling, were demonstrated. SMI-Seq enables “library vs. library” screening in a one-pot assay, simultaneously interrogating molecular binding affinity and specificity. PMID:25252978
Huntingtin interacting proteins are genetic modifiers of neurodegeneration.
Directory of Open Access Journals (Sweden)
Linda S Kaltenbach
2007-05-01
Full Text Available Huntington's disease (HD is a fatal neurodegenerative condition caused by expansion of the polyglutamine tract in the huntingtin (Htt protein. Neuronal toxicity in HD is thought to be, at least in part, a consequence of protein interactions involving mutant Htt. We therefore hypothesized that genetic modifiers of HD neurodegeneration should be enriched among Htt protein interactors. To test this idea, we identified a comprehensive set of Htt interactors using two complementary approaches: high-throughput yeast two-hybrid screening and affinity pull down followed by mass spectrometry. This effort led to the identification of 234 high-confidence Htt-associated proteins, 104 of which were found with the yeast method and 130 with the pull downs. We then tested an arbitrary set of 60 genes encoding interacting proteins for their ability to behave as genetic modifiers of neurodegeneration in a Drosophila model of HD. This high-content validation assay showed that 27 of 60 orthologs tested were high-confidence genetic modifiers, as modification was observed with more than one allele. The 45% hit rate for genetic modifiers seen among the interactors is an order of magnitude higher than the 1%-4% typically observed in unbiased genetic screens. Genetic modifiers were similarly represented among proteins discovered using yeast two-hybrid and pull-down/mass spectrometry methods, supporting the notion that these complementary technologies are equally useful in identifying biologically relevant proteins. Interacting proteins confirmed as modifiers of the neurodegeneration phenotype represent a diverse array of biological functions, including synaptic transmission, cytoskeletal organization, signal transduction, and transcription. Among the modifiers were 17 loss-of-function suppressors of neurodegeneration, which can be considered potential targets for therapeutic intervention. Finally, we show that seven interacting proteins from among 11 tested were able to
Directory of Open Access Journals (Sweden)
Claudia Rodríguez-Almazán
Full Text Available The intracellular concentration of protein may be as high as 400 mg per ml; thus it seems inevitable that within the cell, numerous protein-protein contacts are constantly occurring. A basic biochemical principle states that the equilibrium of an association reaction can be shifted by ligand binding. This indicates that if within the cell many protein-protein interactions are indeed taking place, some fundamental characteristics of proteins would necessarily differ from those observed in traditional biochemical systems. Accordingly, we measured the effect of eight different proteins on the formation of homodimeric triosephosphate isomerase from Trypanosoma brucei (TbTIM from guanidinium chloride unfolded monomers. The eight proteins at concentrations of micrograms per ml induced an important increase on active dimer formation. Studies on the mechanism of this phenomenon showed that the proteins stabilize the dimeric structure of TbTIM, and that this is the driving force that promotes the formation of active dimers. Similar data were obtained with TIM from three other species. The heat changes that occur when TbTIM is mixed with lysozyme were determined by isothermal titration calorimetry; the results provided direct evidence of the weak interaction between apparently unrelated proteins. The data, therefore, are strongly suggestive that the numerous protein-protein interactions that occur in the intracellular space are an additional control factor in the formation and stability of proteins.
Mackenzie, Kimberly D; Lim, Yoon; Duffield, Michael D; Chataway, Timothy; Zhou, Xin-Fu; Keating, Damien J
2017-07-01
Huntingtin-associated protein 1 (HAP1) was initially identified as a binding partner of huntingtin, mutations in which underlie Huntington's disease. Subcellular localization and protein interaction data indicate that HAP1 may be important in vesicle trafficking, cell signalling and receptor internalization. In this study, a proteomics approach was used for the identification of novel HAP1-interacting partners to attempt to shed light on the physiological function of HAP1. Using affinity chromatography with HAP1-GST protein fragments bound to Sepharose columns, this study identified a number of trafficking-related proteins that bind to HAP1. Interestingly, many of the proteins that were identified by mass spectrometry have trafficking-related functions and include the clathrin light chain B and Sec23A, an ER to Golgi trafficking vesicle coat component. Using co-immunoprecipitation and GST-binding assays the association between HAP1 and clathrin light chain B has been validated in vitro. This study also finds that HAP1 co-localizes with clathrin light chain B. In line with a physiological function of the HAP1-clathrin interaction this study detected a dramatic reduction in vesicle retrieval and endocytosis in adrenal chromaffin cells. Furthermore, through examination of transferrin endocytosis in HAP1 -/- cortical neurons, this study has determined that HAP1 regulates neuronal endocytosis. In this study, the interaction between HAP1 and Sec23A was also validated through endogenous co-immunoprecipitation in rat brain homogenate. Through the identification of novel HAP1 binding partners, many of which have putative trafficking roles, this study provides us with new insights into the mechanisms underlying the important physiological function of HAP1 as an intracellular trafficking protein through its protein-protein interactions. Copyright © 2017 Elsevier Inc. All rights reserved.
Cheng, Yuan; Zhou, Yuan; Yang, Yan; Chi, Ying-Jun; Zhou, Jie; Chen, Jian-Ye; Wang, Fei; Fan, Baofang; Shi, Kai; Zhou, Yan-Hong; Yu, Jing-Quan; Chen, Zhixiang
2012-06-01
WRKY transcription factors are encoded by a large gene superfamily with a broad range of roles in plants. Recently, several groups have reported that proteins containing a short VQ (FxxxVQxLTG) motif interact with WRKY proteins. We have recently discovered that two VQ proteins from Arabidopsis (Arabidopsis thaliana), SIGMA FACTOR-INTERACTING PROTEIN1 and SIGMA FACTOR-INTERACTING PROTEIN2, act as coactivators of WRKY33 in plant defense by specifically recognizing the C-terminal WRKY domain and stimulating the DNA-binding activity of WRKY33. In this study, we have analyzed the entire family of 34 structurally divergent VQ proteins from Arabidopsis. Yeast (Saccharomyces cerevisiae) two-hybrid assays showed that Arabidopsis VQ proteins interacted specifically with the C-terminal WRKY domains of group I and the sole WRKY domains of group IIc WRKY proteins. Using site-directed mutagenesis, we identified structural features of these two closely related groups of WRKY domains that are critical for interaction with VQ proteins. Quantitative reverse transcription polymerase chain reaction revealed that expression of a majority of Arabidopsis VQ genes was responsive to pathogen infection and salicylic acid treatment. Functional analysis using both knockout mutants and overexpression lines revealed strong phenotypes in growth, development, and susceptibility to pathogen infection. Altered phenotypes were substantially enhanced through cooverexpression of genes encoding interacting VQ and WRKY proteins. These findings indicate that VQ proteins play an important role in plant growth, development, and response to environmental conditions, most likely by acting as cofactors of group I and IIc WRKY transcription factors.
Targeted in vivo inhibition of specific protein-protein interactions using recombinant antibodies.
Directory of Open Access Journals (Sweden)
Matej Zábrady
Full Text Available With the growing availability of genomic sequence information, there is an increasing need for gene function analysis. Antibody-mediated "silencing" represents an intriguing alternative for the precise inhibition of a particular function of biomolecules. Here, we describe a method for selecting recombinant antibodies with a specific purpose in mind, which is to inhibit intrinsic protein-protein interactions in the cytosol of plant cells. Experimental procedures were designed for conveniently evaluating desired properties of recombinant antibodies in consecutive steps. Our selection method was successfully used to develop a recombinant antibody inhibiting the interaction of ARABIDOPSIS HISTIDINE PHOSPHOTRANSFER PROTEIN 3 with such of its upstream interaction partners as the receiver domain of CYTOKININ INDEPENDENT HISTIDINE KINASE 1. The specific down-regulation of the cytokinin signaling pathway in vivo demonstrates the validity of our approach. This selection method can serve as a prototype for developing unique recombinant antibodies able to interfere with virtually any biomolecule in the living cell.
Quantitative chemogenomics: machine-learning models of protein-ligand interaction.
Andersson, Claes R; Gustafsson, Mats G; Strömbergsson, Helena
2011-01-01
Chemogenomics is an emerging interdisciplinary field that lies in the interface of biology, chemistry, and informatics. Most of the currently used drugs are small molecules that interact with proteins. Understanding protein-ligand interaction is therefore central to drug discovery and design. In the subfield of chemogenomics known as proteochemometrics, protein-ligand-interaction models are induced from data matrices that consist of both protein and ligand information along with some experimentally measured variable. The two general aims of this quantitative multi-structure-property-relationship modeling (QMSPR) approach are to exploit sparse/incomplete information sources and to obtain more general models covering larger parts of the protein-ligand space, than traditional approaches that focuses mainly on specific targets or ligands. The data matrices, usually obtained from multiple sparse/incomplete sources, typically contain series of proteins and ligands together with quantitative information about their interactions. A useful model should ideally be easy to interpret and generalize well to new unseen protein-ligand combinations. Resolving this requires sophisticated machine-learning methods for model induction, combined with adequate validation. This review is intended to provide a guide to methods and data sources suitable for this kind of protein-ligand-interaction modeling. An overview of the modeling process is presented including data collection, protein and ligand descriptor computation, data preprocessing, machine-learning-model induction and validation. Concerns and issues specific for each step in this kind of data-driven modeling will be discussed. © 2011 Bentham Science Publishers
Directory of Open Access Journals (Sweden)
Uchikoga Nobuyuki
2010-05-01
Full Text Available Abstract Background Protein-protein docking for proteins with large conformational changes was analyzed by using interaction fingerprints, one of the scales for measuring similarities among complex structures, utilized especially for searching near-native protein-ligand or protein-protein complex structures. Here, we have proposed a combined method for analyzing protein-protein docking by taking large conformational changes into consideration. This combined method consists of ensemble soft docking with multiple protein structures, refinement of complexes, and cluster analysis using interaction fingerprints and energy profiles. Results To test for the applicability of this combined method, various CaM-ligand complexes were reconstructed from the NMR structures of unbound CaM. For the purpose of reconstruction, we used three known CaM-ligands, namely, the CaM-binding peptides of cyclic nucleotide gateway (CNG, CaM kinase kinase (CaMKK and the plasma membrane Ca2+ ATPase pump (PMCA, and thirty-one structurally diverse CaM conformations. For each ligand, 62000 CaM-ligand complexes were generated in the docking step and the relationship between their energy profiles and structural similarities to the native complex were analyzed using interaction fingerprint and RMSD. Near-native clusters were obtained in the case of CNG and CaMKK. Conclusions The interaction fingerprint method discriminated near-native structures better than the RMSD method in cluster analysis. We showed that a combined method that includes the interaction fingerprint is very useful for protein-protein docking analysis of certain cases.
Directory of Open Access Journals (Sweden)
Stefan M Ivanov
Full Text Available An improved knowledge of protein-protein interactions is essential for better understanding of metabolic and signaling networks, and cellular function. Progress tends to be based on structure determination and predictions using known structures, along with computational methods based on evolutionary information or detailed atomistic descriptions. We hypothesized that for the case of interactions across a common interface, between proteins from a pair of paralogue families or within a family of paralogues, a relatively simple interface description could distinguish between binding and non-binding pairs. Using binding data for several systems, and large-scale comparative modeling based on known template complex structures, it is found that charge-charge interactions (for groups bearing net charge are generally a better discriminant than buried non-polar surface. This is particularly the case for paralogue families that are less divergent, with more reliable comparative modeling. We suggest that electrostatic interactions are major determinants of specificity in such systems, an observation that could be used to predict binding partners.
Ivanov, Stefan M; Cawley, Andrew; Huber, Roland G; Bond, Peter J; Warwicker, Jim
2017-01-01
An improved knowledge of protein-protein interactions is essential for better understanding of metabolic and signaling networks, and cellular function. Progress tends to be based on structure determination and predictions using known structures, along with computational methods based on evolutionary information or detailed atomistic descriptions. We hypothesized that for the case of interactions across a common interface, between proteins from a pair of paralogue families or within a family of paralogues, a relatively simple interface description could distinguish between binding and non-binding pairs. Using binding data for several systems, and large-scale comparative modeling based on known template complex structures, it is found that charge-charge interactions (for groups bearing net charge) are generally a better discriminant than buried non-polar surface. This is particularly the case for paralogue families that are less divergent, with more reliable comparative modeling. We suggest that electrostatic interactions are major determinants of specificity in such systems, an observation that could be used to predict binding partners.
Omana, Dileep A; Pietrasik, Zeb; Betti, Mirko
2012-07-01
The potential of using poultry protein isolate (PPI) as a food ingredient to substitute either soy protein isolate (SPI) or meat protein in turkey bologna was investigated. PPI was prepared from mechanically separated turkey meat using pH-shift technology and the prepared PPI was added to turkey bologna at 2 different concentrations (1.5% and 2% dry weight basis). Product characteristics were compared with those prepared with the addition of 2% SPI, 11% meat protein (control-1), or 13% meat protein (control-2). All the 5 treatments were subjected to sensory analysis to evaluate aroma, appearance, color, flavor, saltiness, juiciness, firmness, and overall acceptability of the turkey bologna samples using 9-point hedonic scales. A turkey bologna control sample with 11% meat protein appeared to be softer compared to other treatments as revealed by texture profile analysis while purge loss during storage in a retail display case was significantly (P Sensory analysis concluded that 1.5% PPI and 2% PPI could be used as substitute of SPI or lean meat and the treatments could be improved by increasing saltiness and decreasing firmness. The study revealed that with slight modifications in saltiness, turkey bologna can be prepared with the addition of poultry protein isolates as an acceptable substitute for soy protein isolate or meat protein. This will help to avoid usage of nonmeat ingredients (as SPI substitute) and to reduce the cost of production (as meat protein substitute) of low-fat turkey bologna. © 2012 Institute of Food Technologists®
Identification of FUSE-binding proteins as interacting partners of TIA proteins
International Nuclear Information System (INIS)
Rothe, Francoise; Gueydan, Cyril; Bellefroid, Eric; Huez, Georges; Kruys, Veronique
2006-01-01
TIA-1 and TIAR are closely related RNA-binding proteins involved in several mechanisms of RNA metabolism, including alternative hnRNA splicing and mRNA translation regulation. In particular, TIA-1 represses tumor necrosis factor (TNF) mRNA translation by binding to the AU-rich element (ARE) present in the mRNA 3' untranslated region. Here, we demonstrate that TIA proteins interact with FUSE-binding proteins (FBPs) and that fbp genes are co-expressed with tia genes during Xenopus embryogenesis. FBPs participate in various steps of RNA processing and degradation. In Cos cells, FBPs co-localize with TIA proteins in the nucleus and migrate into TIA-enriched cytoplasmic granules upon oxidative stress. Overexpression of FBP2-KH3 RNA-binding domain fused to EGFP induces the specific sequestration of TIA proteins in cytoplasmic foci, thereby precluding their nuclear accumulation. In cytosolic RAW 264.7 macrophage extracts, FBPs are found associated in EMSA to the TIA-1/TNF-ARE complex. Together, our results indicate that TIA and FBP proteins may thus be relevant biological involved in common events of RNA metabolism occurring both in the nucleus and the cytoplasm
Richardson, Roy; Denis, Clyde L; Zhang, Chongxu; Nielsen, Maria E O; Chiang, Yueh-Chin; Kierkegaard, Morten; Wang, Xin; Lee, Darren J; Andersen, Jens S; Yao, Gang
2012-09-01
Poly(A) binding protein (PAB1) is involved in a number of RNA metabolic functions in eukaryotic cells and correspondingly is suggested to associate with a number of proteins. We have used mass spectrometric analysis to identify 55 non-ribosomal proteins that specifically interact with PAB1 from Saccharomyces cerevisiae. Because many of these factors may associate only indirectly with PAB1 by being components of the PAB1-mRNP structure, we additionally conducted mass spectrometric analyses on seven metabolically defined PAB1 deletion derivatives to delimit the interactions between these proteins and PAB1. These latter analyses identified 13 proteins whose associations with PAB1 were reduced by deleting one or another of PAB1's defined domains. Included in this list of 13 proteins were the translation initiation factors eIF4G1 and eIF4G2, translation termination factor eRF3, and PBP2, all of whose previously known direct interactions with specific PAB1 domains were either confirmed, delimited, or extended. The remaining nine proteins that interacted through a specific PAB1 domain were CBF5, SLF1, UPF1, CBC1, SSD1, NOP77, yGR250c, NAB6, and GBP2. In further study, UPF1, involved in nonsense-mediated decay, was confirmed to interact with PAB1 through the RRM1 domain. We additionally established that while the RRM1 domain of PAB1 was required for UPF1-induced acceleration of deadenylation during nonsense-mediated decay, it was not required for the more critical step of acceleration of mRNA decapping. These results begin to identify the proteins most likely to interact with PAB1 and the domains of PAB1 through which these contacts are made.
The potential of protein-nanomaterial interaction for advanced drug delivery.
Peng, Qiang; Mu, Huiling
2016-03-10
Nanomaterials, like nanoparticles, micelles, nano-sheets, nanotubes and quantum dots, have great potentials in biomedical fields. However, their delivery is highly limited by the formation of protein corona upon interaction with endogenous proteins. This new identity, instead of nanomaterial itself, would be the real substance the organs and cells firstly encounter. Consequently, the behavior of nanomaterials in vivo is uncontrollable and some undesired effects may occur, like rapid clearance from blood stream; risk of capillary blockage; loss of targeting capacity; and potential toxicity. Therefore, protein-nanomaterial interaction is a great challenge for nanomaterial systems and should be inhibited. However, this interaction can also be used to functionalize nanomaterials by forming a selected protein corona. Unlike other decoration using exogenous molecules, nanomaterials functionalized by selected protein corona using endogenous proteins would have greater promise for clinical use. In this review, we aim to provide a comprehensive understanding of protein-nanomaterial interaction. Importantly, a discussion about how to use such interaction is launched and some possible applications of such interaction for advanced drug delivery are presented. Copyright © 2016 Elsevier B.V. All rights reserved.
Várnai, Csilla; Burkoff, Nikolas S; Wild, David L
2017-01-01
Evolutionary information stored in multiple sequence alignments (MSAs) has been used to identify the interaction interface of protein complexes, by measuring either co-conservation or co-mutation of amino acid residues across the interface. Recently, maximum entropy related correlated mutation measures (CMMs) such as direct information, decoupling direct from indirect interactions, have been developed to identify residue pairs interacting across the protein complex interface. These studies have focussed on carefully selected protein complexes with large, good-quality MSAs. In this work, we study protein complexes with a more typical MSA consisting of fewer than 400 sequences, using a set of 79 intramolecular protein complexes. Using a maximum entropy based CMM at the residue level, we develop an interface level CMM score to be used in re-ranking docking decoys. We demonstrate that our interface level CMM score compares favourably to the complementarity trace score, an evolutionary information-based score measuring co-conservation, when combined with the number of interface residues, a knowledge-based potential and the variability score of individual amino acid sites. We also demonstrate, that, since co-mutation and co-complementarity in the MSA contain orthogonal information, the best prediction performance using evolutionary information can be achieved by combining the co-mutation information of the CMM with co-conservation information of a complementarity trace score, predicting a near-native structure as the top prediction for 41% of the dataset. The method presented is not restricted to small MSAs, and will likely improve interface prediction also for complexes with large and good-quality MSAs.
Directory of Open Access Journals (Sweden)
Cesira de Chiara
Full Text Available A main challenge for structural biologists is to understand the mechanisms that discriminate between molecular interactions and determine function. Here, we show how partner recognition of the AXH domain of the transcriptional co-regulator ataxin-1 is fine-tuned by a subtle balance between self- and hetero-associations. Ataxin-1 is the protein responsible for the hereditary spinocerebellar ataxia type 1, a disease linked to protein aggregation and transcriptional dysregulation. Expansion of a polyglutamine tract is essential for ataxin-1 aggregation, but the sequence-wise distant AXH domain plays an important aggravating role in the process. The AXH domain is also a key element for non-aberrant function as it intervenes in interactions with multiple protein partners. Previous data have shown that AXH is dimeric in solution and forms a dimer of dimers when crystallized. By solving the structure of a complex of AXH with a peptide from the interacting transcriptional repressor CIC, we show that the dimer interface of AXH is displaced by the new interaction and that, when blocked by the CIC peptide AXH aggregation and misfolding are impaired. This is a unique example in which palindromic self- and hetero-interactions within a sequence with chameleon properties discriminate the partner. We propose a drug design strategy for the treatment of SCA1 that is based on the information gained from the AXH/CIC complex.
PCNA Structure and Interactions with Partner Proteins
Oke, Muse; Zaher, Manal S.; Hamdan, Samir
2018-01-01
Proliferating cell nuclear antigen (PCNA) consists of three identical monomers that topologically encircle double-stranded DNA. PCNA stimulates the processivity of DNA polymerase δ and, to a less extent, the intrinsically highly processive DNA polymerase ε. It also functions as a platform that recruits and coordinates the activities of a large number of DNA processing proteins. Emerging structural and biochemical studies suggest that the nature of PCNA-partner proteins interactions is complex. A hydrophobic groove at the front side of PCNA serves as a primary docking site for the consensus PIP box motifs present in many PCNA-binding partners. Sequences that immediately flank the PIP box motif or regions that are distant from it could also interact with the hydrophobic groove and other regions of PCNA. Posttranslational modifications on the backside of PCNA could add another dimension to its interaction with partner proteins. An encounter of PCNA with different DNA structures might also be involved in coordinating its interactions. Finally, the ability of PCNA to bind up to three proteins while topologically linked to DNA suggests that it would be a versatile toolbox in many different DNA processing reactions.
PCNA Structure and Interactions with Partner Proteins
Oke, Muse
2018-01-29
Proliferating cell nuclear antigen (PCNA) consists of three identical monomers that topologically encircle double-stranded DNA. PCNA stimulates the processivity of DNA polymerase δ and, to a less extent, the intrinsically highly processive DNA polymerase ε. It also functions as a platform that recruits and coordinates the activities of a large number of DNA processing proteins. Emerging structural and biochemical studies suggest that the nature of PCNA-partner proteins interactions is complex. A hydrophobic groove at the front side of PCNA serves as a primary docking site for the consensus PIP box motifs present in many PCNA-binding partners. Sequences that immediately flank the PIP box motif or regions that are distant from it could also interact with the hydrophobic groove and other regions of PCNA. Posttranslational modifications on the backside of PCNA could add another dimension to its interaction with partner proteins. An encounter of PCNA with different DNA structures might also be involved in coordinating its interactions. Finally, the ability of PCNA to bind up to three proteins while topologically linked to DNA suggests that it would be a versatile toolbox in many different DNA processing reactions.
Semantic integration to identify overlapping functional modules in protein interaction networks
Directory of Open Access Journals (Sweden)
Ramanathan Murali
2007-07-01
Full Text Available Abstract Background The systematic analysis of protein-protein interactions can enable a better understanding of cellular organization, processes and functions. Functional modules can be identified from the protein interaction networks derived from experimental data sets. However, these analyses are challenging because of the presence of unreliable interactions and the complex connectivity of the network. The integration of protein-protein interactions with the data from other sources can be leveraged for improving the effectiveness of functional module detection algorithms. Results We have developed novel metrics, called semantic similarity and semantic interactivity, which use Gene Ontology (GO annotations to measure the reliability of protein-protein interactions. The protein interaction networks can be converted into a weighted graph representation by assigning the reliability values to each interaction as a weight. We presented a flow-based modularization algorithm to efficiently identify overlapping modules in the weighted interaction networks. The experimental results show that the semantic similarity and semantic interactivity of interacting pairs were positively correlated with functional co-occurrence. The effectiveness of the algorithm for identifying modules was evaluated using functional categories from the MIPS database. We demonstrated that our algorithm had higher accuracy compared to other competing approaches. Conclusion The integration of protein interaction networks with GO annotation data and the capability of detecting overlapping modules substantially improve the accuracy of module identification.
Maier, Richard H; Maier, Christina J; Hintner, Helmut; Bauer, Johann W; Onder, Kamil
2012-12-01
Many functional proteomic experiments make use of high-throughput technologies such as mass spectrometry combined with two-dimensional polyacrylamide gel electrophoresis and the yeast two-hybrid (Y2H) system. Currently there are even automated versions of the Y2H system available that can be used for proteome-wide research. The Y2H system has the capacity to deliver a profusion of Y2H positive colonies from a single library screen. However, subsequent analysis of these numerous primary candidates with complementary methods can be overwhelming. Therefore, a method to select the most promising candidates with strong interaction properties might be useful to reduce the number of candidates requiring further analysis. The method described here offers a new way of quantifying and rating the performance of positive Y2H candidates. The novelty lies in the detection and measurement of mRNA expression instead of proteins or conventional Y2H genetic reporters. This method correlates well with the direct genetic reporter readouts usually used in the Y2H system, and has greater sensitivity for detecting and quantifying protein-protein interactions (PPIs) than the conventional Y2H system, as demonstrated by detection of the Y2H false-negative PPI of RXR/PPARG. Approximately 20% of all proteins are not suitable for the Y2H system, the so-called autoactivators. A further advantage of this method is the possibility to evaluate molecules that usually cannot be analyzed in the Y2H system, exemplified by a VDR-LXXLL motif peptide interaction. Copyright © 2012 Elsevier Inc. All rights reserved.
Identification of NAD interacting residues in proteins
Directory of Open Access Journals (Sweden)
Raghava Gajendra PS
2010-03-01
Full Text Available Abstract Background Small molecular cofactors or ligands play a crucial role in the proper functioning of cells. Accurate annotation of their target proteins and binding sites is required for the complete understanding of reaction mechanisms. Nicotinamide adenine dinucleotide (NAD+ or NAD is one of the most commonly used organic cofactors in living cells, which plays a critical role in cellular metabolism, storage and regulatory processes. In the past, several NAD binding proteins (NADBP have been reported in the literature, which are responsible for a wide-range of activities in the cell. Attempts have been made to derive a rule for the binding of NAD+ to its target proteins. However, so far an efficient model could not be derived due to the time consuming process of structure determination, and limitations of similarity based approaches. Thus a sequence and non-similarity based method is needed to characterize the NAD binding sites to help in the annotation. In this study attempts have been made to predict NAD binding proteins and their interacting residues (NIRs from amino acid sequence using bioinformatics tools. Results We extracted 1556 proteins chains from 555 NAD binding proteins whose structure is available in Protein Data Bank. Then we removed all redundant protein chains and finally obtained 195 non-redundant NAD binding protein chains, where no two chains have more than 40% sequence identity. In this study all models were developed and evaluated using five-fold cross validation technique on the above dataset of 195 NAD binding proteins. While certain type of residues are preferred (e.g. Gly, Tyr, Thr, His in NAD interaction, residues like Ala, Glu, Leu, Lys are not preferred. A support vector machine (SVM based method has been developed using various window lengths of amino acid sequence for predicting NAD interacting residues and obtained maximum Matthew's correlation coefficient (MCC 0.47 with accuracy 74.13% at window length 17
Directory of Open Access Journals (Sweden)
Naofumi Miwa
2015-05-01
Full Text Available Protein-carbohydrate interaction regulates multiple important processes during fertilization, an essential biological event where individual gametes undergo intercellular recognition to fuse and generate a zygote. In the mammalian female reproductive tract, sperm temporarily adhere to the oviductal epithelium via the complementary interaction between carbohydrate-binding proteins on the sperm membrane and carbohydrates on the oviductal cells. After detachment from the oviductal epithelium at the appropriate time point following ovulation, sperm migrate and occasionally bind to the extracellular matrix, called the zona pellucida (ZP, which surrounds the egg, thereafter undergoing the exocytotic acrosomal reaction to penetrate the envelope and to reach the egg plasma membrane. This sperm-ZP interaction also involves the direct interaction between sperm carbohydrate-binding proteins and carbohydrates within the ZP, most of which have been conserved across divergent species from mammals to amphibians and echinoderms. This review focuses on the carbohydrate-mediated interaction of sperm with the female reproductive tract, mainly the interaction between sperm and the ZP, and introduces the fertilization-suppressive action of dicalcin, a Xenopus laevis ZP protein-associated protein. The action of dicalcin correlates significantly with a dicalcin-dependent change in the lectin-staining pattern within the ZP, suggesting a unique role of dicalcin as an inherent protein that is capable of regulating the affinity between the lectin and oligosaccharides attached on its target glycoprotein.
Quantification of protein interaction kinetics in a micro droplet
Energy Technology Data Exchange (ETDEWEB)
Yin, L. L. [Center for Bioelectronics and Biosensors, Biodesign Institute, Arizona State University, Tempe, Arizona 85287 (United States); College of Chemistry and Chemical Engineering, Chongqing University, Chongqing 400044 (China); Wang, S. P., E-mail: shaopeng.wang@asu.edu, E-mail: njtao@asu.edu; Shan, X. N.; Tao, N. J., E-mail: shaopeng.wang@asu.edu, E-mail: njtao@asu.edu [Center for Bioelectronics and Biosensors, Biodesign Institute, Arizona State University, Tempe, Arizona 85287 (United States); Zhang, S. T. [College of Chemistry and Chemical Engineering, Chongqing University, Chongqing 400044 (China)
2015-11-15
Characterization of protein interactions is essential to the discovery of disease biomarkers, the development of diagnostic assays, and the screening for therapeutic drugs. Conventional flow-through kinetic measurements need relative large amount of sample that is not feasible for precious protein samples. We report a novel method to measure protein interaction kinetics in a single droplet with sub microliter or less volume. A droplet in a humidity-controlled environmental chamber is replacing the microfluidic channels as the reactor for the protein interaction. The binding process is monitored by a surface plasmon resonance imaging (SPRi) system. Association curves are obtained from the average SPR image intensity in the center area of the droplet. The washing step required by conventional flow-through SPR method is eliminated in the droplet method. The association and dissociation rate constants and binding affinity of an antigen-antibody interaction are obtained by global fitting of association curves at different concentrations. The result obtained by this method is accurate as validated by conventional flow-through SPR system. This droplet-based method not only allows kinetic studies for proteins with limited supply but also opens the door for high-throughput protein interaction study in a droplet-based microarray format that enables measurement of many to many interactions on a single chip.
International Nuclear Information System (INIS)
Choi, Kyoung-Jae; Lim, Chun-Woo; Yoon, Moon-Young; Ahn, Byung-Yoon; Yu, Yeon Gyu
2004-01-01
Interaction between preformed nucleocapsids and viral envelope proteins is critical for the assembly of virus particles in infected cells. The pre-S1 and pre-S2 and cytosolic regions of the human hepatitis B virus envelope protein had been implicated in the interaction with the core protein of nucleocapsids. The binding affinities of specific subdomains of the envelope protein to the core protein were quantitatively measured by both ELISA and BIAcore assay. While a marginal binding was detected with the pre-S1 or pre-S2, the core protein showed high affinities to pre-S with apparent dissociation constants (K D app ) of 7.3 ± 0.9 and 8.2 ± 0.4 μM by ELISA and BIAcore assay, respectively. The circular dichroism analysis suggested that conformational change occurs in pre-S through interaction with core protein. These results substantiate the importance of specific envelope domains in virion assembly, and demonstrate that the interaction between viral proteins can be quantitatively measured in vitro
Interaction of milk proteins and Binder of Sperm (BSP) proteins from boar, stallion and ram semen.
Plante, Geneviève; Lusignan, Marie-France; Lafleur, Michel; Manjunath, Puttaswamy
2015-08-15
Mammalian semen contains a family of closely related proteins known as Binder of SPerm (BSP proteins) that are added to sperm at ejaculation. BSP proteins extract lipids from the sperm membrane thereby extensively modifying its composition. These changes can ultimately be detrimental to sperm storage. We have demonstrated that bovine BSP proteins interact with major milk proteins and proposed that this interaction could be the basis of sperm protection by milk extenders. In the present study, we investigated if homologous BSP proteins present in boar, stallion and ram seminal plasma display a similar affinity for the milk proteins in order to assess whether the mechanism of sperm protection by milk for these species could be general. Skim milk was incubated with seminal plasma proteins (boar, stallion and ram), chromatographed on a Sepharose CL-4B column and protein fractions were analyzed by immunoblotting. Boar, stallion and ram BSP proteins displayed affinity for a milk protein fraction (F1) mainly composed of α-lactalbumin, β-lactoglobulin, and κ-casein. They also had affinity for another milk protein fraction (F2) composed mostly of casein micelles. However, stallion BSP showed higher affinity for the fraction (F1). These results further extend our view that the association of BSP proteins with milk proteins could be a general feature of the mechanism of mammalian sperm protection by milk to prevent detrimental effect of prolonged exposure of sperm to seminal plasma.
Yeast Interacting Proteins Database: YDR176W, YDL239C [Yeast Interacting Proteins Database
Lifescience Database Archive (English)
Full Text Available a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle pole...ining structure at the leading edge of the prospore membrane via interaction with spindle pole body componen...DY3 Prey description Protein required for spore wall formation, thought to mediate assembly of a Don1p-conta
Computational analysis of RNA-protein interaction interfaces via the Voronoi diagram.
Mahdavi, Sedigheh; Mohades, Ali; Salehzadeh Yazdi, Ali; Jahandideh, Samad; Masoudi-Nejad, Ali
2012-01-21
Cellular functions are mediated by various biological processes including biomolecular interactions, such as protein-protein, DNA-protein and RNA-protein interactions in which RNA-Protein interactions are indispensable for many biological processes like cell development and viral replication. Unlike the protein-protein and protein-DNA interactions, accurate mechanisms and structures of the RNA-Protein complexes are not fully understood. A large amount of theoretical evidence have shown during the past several years that computational geometry is the first pace in understanding the binding profiles and plays a key role in the study of intricate biological structures, interactions and complexes. In this paper, RNA-Protein interaction interface surface is computed via the weighted Voronoi diagram of atoms. Using two filter operations provides a natural definition for interface atoms as classic methods. Unbounded parts of Voronoi facets that are far from the complex are trimmed using modified convex hull of atom centers. This algorithm is implemented to a database with different RNA-Protein complexes extracted from Protein Data Bank (PDB). Afterward, the features of interfaces have been computed and compared with classic method. The results show high correlation coefficients between interface size in the Voronoi model and the classical model based on solvent accessibility, as well as high accuracy and precision in comparison to classical model. Copyright © 2011 Elsevier Ltd. All rights reserved.
A systematic evaluation of protein kinase a-a-kinase anchoring protein interaction motifs
Burgers, Pepijn P|info:eu-repo/dai/nl/341566551; van der Heyden, Marcel A G; Kok, Bart; Heck, Albert J R|info:eu-repo/dai/nl/105189332; Scholten, Arjen|info:eu-repo/dai/nl/313939780
2015-01-01
Protein kinase A (PKA) in vertebrates is localized to specific locations in the cell via A-kinase anchoring proteins (AKAPs). The regulatory subunits of the four PKA isoforms (RIα, RIβ, RIIα, and RIIβ) each form a homodimer, and their dimerization domain interacts with a small helical region present
A systematic evaluation of protein kinase A-A-kinase anchoring protein interaction motifs
Burgers, Pepijn P; van der Heyden, MAG; Kok, Bart; Heck, Albert J R; Scholten, Arjen
2015-01-01
Protein kinase A (PKA) in vertebrates is localized to specific locations in the cell via A-kinase anchoring proteins (AKAPs). The regulatory subunits of the four PKA isoforms (RIα, RIβ, RIIα, and RIIβ) each form a homodimer, and their dimerization domain interacts with a small helical region present
Identification of Redox and Glucose-Dependent Txnip Protein Interactions
Directory of Open Access Journals (Sweden)
Benjamin J. Forred
2016-01-01
Full Text Available Thioredoxin-interacting protein (Txnip acts as a negative regulator of thioredoxin function and is a critical modulator of several diseases including, but not limited to, diabetes, ischemia-reperfusion cardiac injury, and carcinogenesis. Therefore, Txnip has become an attractive therapeutic target to alleviate disease pathologies. Although Txnip has been implicated with numerous cellular processes such as proliferation, fatty acid and glucose metabolism, inflammation, and apoptosis, the molecular mechanisms underlying these processes are largely unknown. The objective of these studies was to identify Txnip interacting proteins using the proximity-based labeling method, BioID, to understand differential regulation of pleiotropic Txnip cellular functions. The BioID transgene fused to Txnip expressed in HEK293 identified 31 interacting proteins. Many protein interactions were redox-dependent and were disrupted through mutation of a previously described reactive cysteine (C247S. Furthermore, we demonstrate that this model can be used to identify dynamic Txnip interactions due to known physiological regulators such as hyperglycemia. These data identify novel Txnip protein interactions and demonstrate dynamic interactions dependent on redox and glucose perturbations, providing clarification to the pleiotropic cellular functions of Txnip.
International Nuclear Information System (INIS)
Mariano, Andrea C.; Andrade, Maxuel O.; Santos, Anesia A.; Carolino, Sonia M.B.; Oliveira, Marli L.; Baracat-Pereira, Maria Cristina; Brommonshenkel, Sergio H.; Fontes, Elizabeth P.B.
2004-01-01
Despite extensive studies in plant virus-host interactions, the molecular mechanisms of geminivirus movement and interactions with host components remain largely unknown. A tomato kinase protein and its soybean homolog were found to interact specifically with the nuclear shuttle protein (NSP) of Tomato golden mosaic virus (TGMV) and Tomato crinkle leaf yellows virus (TCrLYV) through yeast two-hybrid screening and in vitro protein binding assays. These proteins, designated LeNIK (Lycopersicon esculentum NSP-Interacting Kinase) and GmNIK (Glycine max NIK), belong to the LRR-RLK (leucine rich-repeat receptor-like kinase) family that is involved in plant developmental processes and/or resistance response. As such, NIK is structurally organized into characteristic domains, including a serine/threonine kinase domain with a nucleotide binding site at the C-terminal region, an internal transmembrane segment and leucine-rich repeats (LRR) at the N-terminal portion. The potential significance of the NSP-NIK interaction is discussed
Directory of Open Access Journals (Sweden)
Edward C Twomey
Full Text Available Protein conformational changes are commonly associated with the formation of protein complexes. The non-catalytic death effector domains (DEDs mediate protein-protein interactions in a variety of cellular processes, including apoptosis, proliferation and migration, and glucose metabolism. Here, using NMR residual dipolar coupling (RDC data, we report a conformational change in the DED of the phosphoprotein enriched in astrocytes, 15 kDa (PEA-15 protein in the complex with a mitogen-activated protein (MAP kinase, extracellular regulated kinase 2 (ERK2, which is essential in regulating ERK2 cellular distribution and function in cell proliferation and migration. The most significant conformational change in PEA-15 happens at helices α2, α3, and α4, which also possess the highest flexibility among the six-helix bundle of the DED. This crucial conformational change is modulated by the D/E-RxDL charge-triad motif, one of the prominent structural features of DEDs, together with a number of other electrostatic and hydrogen bonding interactions on the protein surface. Charge-triad motif promotes the optimal orientation of key residues and expands the binding interface to accommodate protein-protein interactions. However, the charge-triad residues are not directly involved in the binding interface between PEA-15 and ERK2.
Directory of Open Access Journals (Sweden)
Xiumei Li
Full Text Available Alpinia genus are known generally as ginger-lilies for showy flowers in the ginger family, Zingiberaceae, and their floral morphology diverges from typical monocotyledon flowers. However, little is known about the functions of ginger MADS-box genes in floral identity. In this study, four AP1/AGL9 MADS-box genes were cloned from Alpinia hainanensis, and protein-protein interactions (PPIs and roles of the four genes in floral homeotic conversion and in floral evolution are surveyed for the first time. AhFUL is clustered to the AP1 lineage, AhSEP4 and AhSEP3b to the SEP lineage, and AhAGL6-like to the AGL6 lineage. The four genes showed conserved and divergent expression patterns, and their encoded proteins were localized in the nucleus. Seven combinations of PPI (AhFUL-AhSEP4, AhFUL-AhAGL6-like, AhFUL-AhSEP3b, AhSEP4-AhAGL6-like, AhSEP4-AhSEP3b, AhAGL6-like-AhSEP3b, and AhSEP3b-AhSEP3b were detected, and the PPI patterns in the AP1/AGL9 lineage revealed that five of the 10 possible combinations are conserved and three are variable, while conclusions cannot yet be made regarding the other two. Ectopic expression of AhFUL in Arabidopsis thaliana led to early flowering and floral organ homeotic conversion to sepal-like or leaf-like. Therefore, we conclude that the four A. hainanensis AP1/AGL9 genes show functional conservation and divergence in the floral identity from other MADS-box genes.
Database of Interacting Proteins (DIP)
U.S. Department of Health & Human Services — The DIP database catalogs experimentally determined interactions between proteins. It combines information from a variety of sources to create a single, consistent...
DEFF Research Database (Denmark)
Poulsen, Jon Wriedt; Madsen, Christian Toft; Young, Clifford
2013-01-01
be optimally completed within 30 min with endoprotease Lys-C. No chemical artifacts were introduced when samples were incubated in Gnd-HCl at 95 °C, making Gnd-HCl an appropriate digestion buffer for shotgun proteomics. Current methodologies for investigating protein-protein interactions (PPIs) often require......Protein digestion is an integral part of the "shotgun" proteomics approach and commonly requires overnight incubation prior to mass spectrometry analysis. Quadruplicate "shotgun" proteomic analysis of whole yeast lysate demonstrated that Guanidine-Hydrochloride (Gnd-HCl) protein digestion can....... To validate the Gnd-HCl approach, label-free PPI analysis of several GFP-tagged yeast deubiquitinating enzymes was performed. The identification of known interaction partners demonstrates the utility of the optimized Gnd-HCl protocol that is also scalable to the 96 well-plate format....
Milk proteins interact with goat Binder of SPerm (BSP) proteins and decrease their binding to sperm.
de Menezes, Erika Bezerra; van Tilburg, Mauricio; Plante, Geneviève; de Oliveira, Rodrigo V; Moura, Arlindo A; Manjunath, Puttaswamy
2016-11-01
Seminal plasma Binder of SPerm (BSP) proteins bind to sperm at ejaculation and promote capacitation. When in excess, however, BSP proteins damage the sperm membrane. It has been suggested that milk components of semen extenders associate with BSP proteins, potentially protecting sperm. Thus, this study was conducted to investigate if milk proteins interact with BSP proteins and reduce BSP binding to goat sperm. Using gel filtration chromatography, milk was incubated with goat seminal plasma proteins and loaded onto columns with and without calcium. Milk was also fractionated into parts containing mostly whey proteins or mostly caseins, incubated with seminal plasma proteins and subjected to gel filtration. Eluted fractions were evaluated by immunoblot using anti-goat BSP antibodies, confirming milk protein-BSP protein interactions. As determined by ELISA, milk proteins coated on polystyrene wells bound to increasing of goat BSP proteins. Far-western dot blots confirmed that BSP proteins bound to caseins and β-lactoglobulin in a concentration-dependent manner. Then, cauda epididymal sperm from five goats was incubated with seminal plasma; seminal plasma followed by milk; and milk followed by seminal plasma. Sperm membrane proteins were extracted and evaluated by immunoblotting. The pattern of BSP binding to sperm membrane proteins was reduced by 59.3 % when epididymal sperm were incubated with seminal plasma and then with skimmed milk (p 0.05). In conclusion, goat BSP proteins have an affinity for caseins and whey proteins. Milk reduces BSP binding to goat sperm, depending whether or not sperm had been previously exposed to seminal plasma. Such events may explain the protective effect of milk during goat sperm preservation.
The potential of protein-nanomaterial interaction for advanced drug delivery
DEFF Research Database (Denmark)
Peng, Qiang; Mu, Huiling
2016-01-01
Nanomaterials, like nanoparticles, micelles, nano-sheets, nanotubes and quantum dots, have great potentials in biomedical fields. However, their delivery is highly limited by the formation of protein corona upon interaction with endogenous proteins. This new identity, instead of nanomaterial itself...... of such interaction for advanced drug delivery are presented........ Therefore, protein-nanomaterial interaction is a great challenge for nanomaterial systems and should be inhibited. However, this interaction can also be used to functionalize nanomaterials by forming a selected protein corona. Unlike other decoration using exogenous molecules, nanomaterials functionalized...
Benzekry, Sebastian; Tuszynski, Jack A; Rietman, Edward A; Lakka Klement, Giannoula
2015-05-28
The ever-increasing expanse of online bioinformatics data is enabling new ways to, not only explore the visualization of these data, but also to apply novel mathematical methods to extract meaningful information for clinically relevant analysis of pathways and treatment decisions. One of the methods used for computing topological characteristics of a space at different spatial resolutions is persistent homology. This concept can also be applied to network theory, and more specifically to protein-protein interaction networks, where the number of rings in an individual cancer network represents a measure of complexity. We observed a linear correlation of R = -0.55 between persistent homology and 5-year survival of patients with a variety of cancers. This relationship was used to predict the proteins within a protein-protein interaction network with the most impact on cancer progression. By re-computing the persistent homology after computationally removing an individual node (protein) from the protein-protein interaction network, we were able to evaluate whether such an inhibition would lead to improvement in patient survival. The power of this approach lied in its ability to identify the effects of inhibition of multiple proteins and in the ability to expose whether the effect of a single inhibition may be amplified by inhibition of other proteins. More importantly, we illustrate specific examples of persistent homology calculations, which correctly predict the survival benefit observed effects in clinical trials using inhibitors of the identified molecular target. We propose that computational approaches such as persistent homology may be used in the future for selection of molecular therapies in clinic. The technique uses a mathematical algorithm to evaluate the node (protein) whose inhibition has the highest potential to reduce network complexity. The greater the drop in persistent homology, the greater reduction in network complexity, and thus a larger
Boosting compound-protein interaction prediction by deep learning.
Tian, Kai; Shao, Mingyu; Wang, Yang; Guan, Jihong; Zhou, Shuigeng
2016-11-01
The identification of interactions between compounds and proteins plays an important role in network pharmacology and drug discovery. However, experimentally identifying compound-protein interactions (CPIs) is generally expensive and time-consuming, computational approaches are thus introduced. Among these, machine-learning based methods have achieved a considerable success. However, due to the nonlinear and imbalanced nature of biological data, many machine learning approaches have their own limitations. Recently, deep learning techniques show advantages over many state-of-the-art machine learning methods in some applications. In this study, we aim at improving the performance of CPI prediction based on deep learning, and propose a method called DL-CPI (the abbreviation of Deep Learning for Compound-Protein Interactions prediction), which employs deep neural network (DNN) to effectively learn the representations of compound-protein pairs. Extensive experiments show that DL-CPI can learn useful features of compound-protein pairs by a layerwise abstraction, and thus achieves better prediction performance than existing methods on both balanced and imbalanced datasets. Copyright © 2016 Elsevier Inc. All rights reserved.
Identification of in planta protein–protein interactions using IP-MS
Jamge, Suraj; Angenent, Gerco; Bemer, Marian
2018-01-01
Gene regulation by transcription factors involves complex protein interaction networks, which include chromatin remodeling and modifying proteins as an integral part. Decoding these protein interactions is crucial for our understanding of chromatin-mediated gene regulation. Here, we describe a
Characterization of interactions between inclusion membrane proteins from Chlamydia trachomatis
Directory of Open Access Journals (Sweden)
Emilie eGauliard
2015-02-01
Full Text Available Chlamydiae are obligate intracellular pathogens of eukaryotes. The bacteria grow in an intracellular vesicle called an inclusion, the membrane of which is heavily modified by chlamydial proteins called Incs (Inclusion membrane proteins. Incs represent 7-10% of the genomes of Chlamydia and, given their localization at the interface between the host and the pathogen, likely play a key role in the development and pathogenesis of the bacterium. However, their functions remain largely unknown. Here, we characterized the interaction properties between various Inc proteins of C. trachomatis, using a bacterial two-hybrid (BACTH method suitable for detecting interactions between integral membrane proteins. To validate this approach, we first examined the oligomerization properties of the well-characterized IncA protein and showed that both the cytoplasmic domain and the transmembrane region independently contribute to IncA oligomerization. We then analyzed a set of Inc proteins and identified novel interactions between these components. Two small Incs, IncF and Ct222, were found here to interact with many other Inc proteins and may thus represent interaction nodes within the inclusion membrane. Our data suggest that the Inc proteins may assemble in the membrane of the inclusion to form specific multi-molecular complexes in an hierarchical and temporal manner. These studies will help to better define the putative functions of the Inc proteins in the infectious process of Chlamydia.
DEFF Research Database (Denmark)
Alibes, A.; Nadra, A.; De Masi, Federico
2010-01-01
diseases such as aniridia. The validity of FoldX to deal with protein-DNA interactions was demonstrated by showing that high levels of accuracy can be achieved for mutations affecting these interactions. Also we showed that protein-design algorithms can accurately reproduce experimental DNA-binding logos......Quite often a single or a combination of protein mutations is linked to specific diseases. However, distinguishing from sequence information which mutations have real effects in the protein's function is not trivial. Protein design tools are commonly used to explain mutations that affect protein...... stability, or protein-protein interaction, but not for mutations that could affect protein-DNA binding. Here, we used the protein design algorithm FoldX to model all known missense mutations in the paired box domain of Pax6, a highly conserved transcription factor involved in eye development and in several...
DUF581 is plant specific FCS-like zinc finger involved in protein-protein interaction.
Directory of Open Access Journals (Sweden)
Muhammed Jamsheer K
Full Text Available Zinc fingers are a ubiquitous class of protein domain with considerable variation in structure and function. Zf-FCS is a highly diverged group of C2-C2 zinc finger which is present in animals, prokaryotes and viruses, but not in plants. In this study we identified that a plant specific domain of unknown function, DUF581 is a zf-FCS type zinc finger. Based on HMM-HMM comparison and signature motif similarity we named this domain as FCS-Like Zinc finger (FLZ domain. A genome wide survey identified that FLZ domain containing genes are bryophytic in origin and this gene family is expanded in spermatophytes. Expression analysis of selected FLZ gene family members of A. thaliana identified an overlapping expression pattern suggesting a possible redundancy in their function. Unlike the zf-FCS domain, the FLZ domain found to be highly conserved in sequence and structure. Using a combination of bioinformatic and protein-protein interaction tools, we identified that FLZ domain is involved in protein-protein interaction.
International Nuclear Information System (INIS)
Zencir, Sevil; Ovee, Mohiuddin; Dobson, Melanie J.; Banerjee, Monimoy; Topcu, Zeki; Mohanty, Smita
2011-01-01
Highlights: → Brain-specific angiogenesis inhibitor 2 (BAI2) is a new partner protein for GIP. → BAI2 interaction with GIP was revealed by yeast two-hybrid assay. → Binding of BAI2 to GIP was characterized by NMR, CD and fluorescence. → BAI2 and GIP binding was mediated through the C-terminus of BAI2. -- Abstract: The vast majority of physiological processes in living cells are mediated by protein-protein interactions often specified by particular protein sequence motifs. PDZ domains, composed of 80-100 amino acid residues, are an important class of interaction motif. Among the PDZ-containing proteins, glutaminase interacting protein (GIP), also known as Tax Interacting Protein TIP-1, is unique in being composed almost exclusively of a single PDZ domain. GIP has important roles in cellular signaling, protein scaffolding and modulation of tumor growth and interacts with a number of physiological partner proteins, including Glutaminase L, β-Catenin, FAS, HTLV-1 Tax, HPV16 E6, Rhotekin and Kir 2.3. To identify the network of proteins that interact with GIP, a human fetal brain cDNA library was screened using a yeast two-hybrid assay with GIP as bait. We identified brain-specific angiogenesis inhibitor 2 (BAI2), a member of the adhesion-G protein-coupled receptors (GPCRs), as a new partner of GIP. BAI2 is expressed primarily in neurons, further expanding GIP cellular functions. The interaction between GIP and the carboxy-terminus of BAI2 was characterized using fluorescence, circular dichroism (CD) and nuclear magnetic resonance (NMR) spectroscopy assays. These biophysical analyses support the interaction identified in the yeast two-hybrid assay. This is the first study reporting BAI2 as an interaction partner of GIP.
Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces.
Xia, Zheng; Wu, Ling-Yun; Zhou, Xiaobo; Wong, Stephen T C
2010-09-13
Predicting drug-protein interactions from heterogeneous biological data sources is a key step for in silico drug discovery. The difficulty of this prediction task lies in the rarity of known drug-protein interactions and myriad unknown interactions to be predicted. To meet this challenge, a manifold regularization semi-supervised learning method is presented to tackle this issue by using labeled and unlabeled information which often generates better results than using the labeled data alone. Furthermore, our semi-supervised learning method integrates known drug-protein interaction network information as well as chemical structure and genomic sequence data. Using the proposed method, we predicted certain drug-protein interactions on the enzyme, ion channel, GPCRs, and nuclear receptor data sets. Some of them are confirmed by the latest publicly available drug targets databases such as KEGG. We report encouraging results of using our method for drug-protein interaction network reconstruction which may shed light on the molecular interaction inference and new uses of marketed drugs.
Interaction between -Synuclein and Other Proteins in Neurodegenerative Disorders
Directory of Open Access Journals (Sweden)
Kurt A. Jellinger
2011-01-01
Full Text Available Protein aggregation is a common characteristic of many neurodegenerative disorders, and the interaction between pathological/toxic proteins to cause neurodegeneration is a hot topic of current neuroscience research. Despite clinical, genetic, and experimental differences, evidence increasingly indicates considerable overlap between synucleinopathies and tauopathies or other protein-misfolding diseases. Inclusions, characteristics of these disorders, also occurring in other neurodegenerative diseases, suggest interactions of pathological proteins engaging common downstream pathways. Novel findings that have shifted our understanding in the role of pathologic proteins in the pathogenesis of Parkinson and Alzheimer diseases have confirmed correlations/overlaps between these and other neurodegenerative disorders. The synergistic effects of α-synuclein, hyperphosphorylated tau, amyloid-β, and other pathologic proteins, and the underlying molecular pathogenic mechanisms, including induction and spread of protein aggregates, are critically reviewed, suggesting a dualism or triad of neurodegeneration in protein-misfolding disorders, although the etiology of most of these processes is still mysterious.
Perederina, Anna; Nevskaya, Natalia; Nikonov, Oleg; Nikulin, Alexei; Dumas, Philippe; Yao, Min; Tanaka, Isao; Garber, Maria; Gongadze, George; Nikonov, Stanislav
2002-12-01
The crystal structure of ribosomal protein L5 from Thermus thermophilus complexed with a 34-nt fragment comprising helix III and loop C of Escherichia coli 5S rRNA has been determined at 2.5 A resolution. The protein specifically interacts with the bulged nucleotides at the top of loop C of 5S rRNA. The rRNA and protein contact surfaces are strongly stabilized by intramolecular interactions. Charged and polar atoms forming the network of conserved intermolecular hydrogen bonds are located in two narrow planar parallel layers belonging to the protein and rRNA, respectively. The regions, including these atoms conserved in Bacteria and Archaea, can be considered an RNA-protein recognition module. Comparison of the T. thermophilus L5 structure in the RNA-bound form with the isolated Bacillus stearothermophilus L5 structure shows that the RNA-recognition module on the protein surface does not undergo significant changes upon RNA binding. In the crystal of the complex, the protein interacts with another RNA molecule in the asymmetric unit through the beta-sheet concave surface. This protein/RNA interface simulates the interaction of L5 with 23S rRNA observed in the Haloarcula marismortui 50S ribosomal subunit.
Extraction of Protein-Protein Interaction from Scientific Articles by Predicting Dominant Keywords.
Koyabu, Shun; Phan, Thi Thanh Thuy; Ohkawa, Takenao
2015-01-01
For the automatic extraction of protein-protein interaction information from scientific articles, a machine learning approach is useful. The classifier is generated from training data represented using several features to decide whether a protein pair in each sentence has an interaction. Such a specific keyword that is directly related to interaction as "bind" or "interact" plays an important role for training classifiers. We call it a dominant keyword that affects the capability of the classifier. Although it is important to identify the dominant keywords, whether a keyword is dominant depends on the context in which it occurs. Therefore, we propose a method for predicting whether a keyword is dominant for each instance. In this method, a keyword that derives imbalanced classification results is tentatively assumed to be a dominant keyword initially. Then the classifiers are separately trained from the instance with and without the assumed dominant keywords. The validity of the assumed dominant keyword is evaluated based on the classification results of the generated classifiers. The assumption is updated by the evaluation result. Repeating this process increases the prediction accuracy of the dominant keyword. Our experimental results using five corpora show the effectiveness of our proposed method with dominant keyword prediction.
HippDB: a database of readily targeted helical protein-protein interactions.
Bergey, Christina M; Watkins, Andrew M; Arora, Paramjit S
2013-11-01
HippDB catalogs every protein-protein interaction whose structure is available in the Protein Data Bank and which exhibits one or more helices at the interface. The Web site accepts queries on variables such as helix length and sequence, and it provides computational alanine scanning and change in solvent-accessible surface area values for every interfacial residue. HippDB is intended to serve as a starting point for structure-based small molecule and peptidomimetic drug development. HippDB is freely available on the web at http://www.nyu.edu/projects/arora/hippdb. The Web site is implemented in PHP, MySQL and Apache. Source code freely available for download at http://code.google.com/p/helidb, implemented in Perl and supported on Linux. arora@nyu.edu.
Prastowo, S.; Widyas, N.
2018-03-01
AMP-activated protein kinase (AMPK) is cellular energy censor which works based on ATP and AMP concentration. This protein interacts with mitochondria in determine its activity to generate energy for cell metabolism purposes. For that, this paper aims to compare the protein to protein interaction of AMPK and mitochondrial activity genes in the metabolism of known animal farm (domesticated) that are cattle (Bos taurus), pig (Sus scrofa) and chicken (Gallus gallus). In silico study was done using STRING V.10 as prominent protein interaction database, followed with biological function comparison in KEGG PATHWAY database. Set of genes (12 in total) were used as input analysis that are PRKAA1, PRKAA2, PRKAB1, PRKAB2, PRKAG1, PRKAG2, PRKAG3, PPARGC1, ACC, CPT1B, NRF2 and SOD. The first 7 genes belong to gene in AMPK family, while the last 5 belong to mitochondrial activity genes. The protein interaction result shows 11, 8 and 5 metabolism pathways in Bos taurus, Sus scrofa and Gallus gallus, respectively. The top pathway in Bos taurus is AMPK signaling pathway (10 genes), Sus scrofa is Adipocytokine signaling pathway (8 genes) and Gallus gallus is FoxO signaling pathway (5 genes). Moreover, the common pathways found in those 3 species are Adipocytokine signaling pathway, Insulin signaling pathway and FoxO signaling pathway. Genes clustered in Adipocytokine and Insulin signaling pathway are PRKAA2, PPARGC1A, PRKAB1 and PRKAG2. While, in FoxO signaling pathway are PRKAA2, PRKAB1, PRKAG2. According to that, we found PRKAA2, PRKAB1 and PRKAG2 are the common genes. Based on the bioinformatics analysis, we can demonstrate that protein to protein interaction shows distinct different of metabolism in different species. However, further validation is needed to give a clear explanation.
Cai, X.; Chang, D.; Rottinghaus, S.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)
1994-01-01
Recombinant polyomavirus VP2 protein was expressed in Escherichia coli (RK1448), using the recombinant expression system pFPYV2. Recombinant VP2 was purified to near homogeneity by sodium dodecyl sulfate-polyacrylamide gel electrophoresis, electroelution, and Extracti-Gel chromatography. Polyclonal serum to this protein which reacted specifically with recombinant VP2 as well as polyomavirus virion VP2 and VP3 on Western blots (immunoblots) was produced. Purified VP2 was used to establish an in vitro protein-protein interaction assay with polyomavirus structural proteins and purified recombinant VP1. Recombinant VP2 interacted with recombinant VP1, virion VP1, and the four virion histones. Recombinant VP1 coimmunoprecipitated with recombinant VP2 or truncated VP2 (delta C12VP2), which lacked the carboxy-terminal 12 amino acids. These experiments confirmed the interaction between VP1 and VP2 and revealed that the carboxyterminal 12 amino acids of VP2 and VP3 were not necessary for formation of this interaction. In vivo VP1-VP2 interaction study accomplished by cotransfection of COS-7 cells with VP2 and truncated VP1 (delta N11VP1) lacking the nuclear localization signal demonstrated that VP2 was capable of translocating delta N11VP1 into the nucleus. These studies suggest that complexes of VP1 and VP2 may be formed in the cytoplasm and cotransported to the nucleus for virion assembly to occur.
N-way FRET microscopy of multiple protein-protein interactions in live cells.
Directory of Open Access Journals (Sweden)
Adam D Hoppe
Full Text Available Fluorescence Resonance Energy Transfer (FRET microscopy has emerged as a powerful tool to visualize nanoscale protein-protein interactions while capturing their microscale organization and millisecond dynamics. Recently, FRET microscopy was extended to imaging of multiple donor-acceptor pairs, thereby enabling visualization of multiple biochemical events within a single living cell. These methods require numerous equations that must be defined on a case-by-case basis. Here, we present a universal multispectral microscopy method (N-Way FRET to enable quantitative imaging for any number of interacting and non-interacting FRET pairs. This approach redefines linear unmixing to incorporate the excitation and emission couplings created by FRET, which cannot be accounted for in conventional linear unmixing. Experiments on a three-fluorophore system using blue, yellow and red fluorescent proteins validate the method in living cells. In addition, we propose a simple linear algebra scheme for error propagation from input data to estimate the uncertainty in the computed FRET images. We demonstrate the strength of this approach by monitoring the oligomerization of three FP-tagged HIV Gag proteins whose tight association in the viral capsid is readily observed. Replacement of one FP-Gag molecule with a lipid raft-targeted FP allowed direct observation of Gag oligomerization with no association between FP-Gag and raft-targeted FP. The N-Way FRET method provides a new toolbox for capturing multiple molecular processes with high spatial and temporal resolution in living cells.
Directory of Open Access Journals (Sweden)
Jan Mani
Full Text Available Trypanosoma brucei and related pathogens transcribe most genes as polycistronic arrays that are subsequently processed into monocistronic mRNAs. Expression is frequently regulated post-transcriptionally by cis-acting elements in the untranslated regions (UTRs. GPEET and EP procyclins are the major surface proteins of procyclic (insect midgut forms of T. brucei. Three regulatory elements common to the 3' UTRs of both mRNAs regulate mRNA turnover and translation. The glycerol-responsive element (GRE is unique to the GPEET 3' UTR and regulates its expression independently from EP. A synthetic RNA encompassing the GRE showed robust sequence-specific interactions with cytoplasmic proteins in electromobility shift assays. This, combined with column chromatography, led to the identification of 3 Alba-domain proteins. RNAi against Alba3 caused a growth phenotype and reduced the levels of Alba1 and Alba2 proteins, indicative of interactions between family members. Tandem-affinity purification and co-immunoprecipitation verified these interactions and also identified Alba4 in sub-stoichiometric amounts. Alba proteins are cytoplasmic and are recruited to starvation granules together with poly(A RNA. Concomitant depletion of all four Alba proteins by RNAi specifically reduced translation of a reporter transcript flanked by the GPEET 3' UTR. Pulldown of tagged Alba proteins confirmed interactions with poly(A binding proteins, ribosomal protein P0 and, in the case of Alba3, the cap-binding protein eIF4E4. In addition, Alba2 and Alba3 partially cosediment with polyribosomes in sucrose gradients. Alba-domain proteins seem to have exhibited great functional plasticity in the course of evolution. First identified as DNA-binding proteins in Archaea, then in association with nuclear RNase MRP/P in yeast and mammalian cells, they were recently described as components of a translationally silent complex containing stage-regulated mRNAs in Plasmodium. Our results are
Foot-printing of Protein Interactions by Tritium Labeling
International Nuclear Information System (INIS)
Mousseau, Guillaume; Thomas, Olivier P.; Agez, Morgane; Thai, Robert; Cintrat, Jean-Christophe; Rousseau, Bernard; Raffy, Quentin; Renault, Jean Philippe; Pin, Serge; Ochsenbein, Francoise
2010-01-01
A new foot-printing method for mapping protein interactions has been developed, using tritium as a radioactive label. As residues involved in an interaction are less labeled when the complex is formed, they can be identified via comparison of the tritium incorporation of each residue of the bound protein with that of the unbound one. Application of this foot-printing method to the complex formed by the histone H3 fragment H3 122-135 and the protein hAsflA 1-156 afforded data in good agreement with NMR results. (authors)
Evaluation of clustering algorithms for protein-protein interaction networks
Directory of Open Access Journals (Sweden)
van Helden Jacques
2006-11-01
Full Text Available Abstract Background Protein interactions are crucial components of all cellular processes. Recently, high-throughput methods have been developed to obtain a global description of the interactome (the whole network of protein interactions for a given organism. In 2002, the yeast interactome was estimated to contain up to 80,000 potential interactions. This estimate is based on the integration of data sets obtained by various methods (mass spectrometry, two-hybrid methods, genetic studies. High-throughput methods are known, however, to yield a non-negligible rate of false positives, and to miss a fraction of existing interactions. The interactome can be represented as a graph where nodes correspond with proteins and edges with pairwise interactions. In recent years clustering methods have been developed and applied in order to extract relevant modules from such graphs. These algorithms require the specification of parameters that may drastically affect the results. In this paper we present a comparative assessment of four algorithms: Markov Clustering (MCL, Restricted Neighborhood Search Clustering (RNSC, Super Paramagnetic Clustering (SPC, and Molecular Complex Detection (MCODE. Results A test graph was built on the basis of 220 complexes annotated in the MIPS database. To evaluate the robustness to false positives and false negatives, we derived 41 altered graphs by randomly removing edges from or adding edges to the test graph in various proportions. Each clustering algorithm was applied to these graphs with various parameter settings, and the clusters were compared with the annotated complexes. We analyzed the sensitivity of the algorithms to the parameters and determined their optimal parameter values. We also evaluated their robustness to alterations of the test graph. We then applied the four algorithms to six graphs obtained from high-throughput experiments and compared the resulting clusters with the annotated complexes. Conclusion This
KFC Server: interactive forecasting of protein interaction hot spots.
Darnell, Steven J; LeGault, Laura; Mitchell, Julie C
2008-07-01
The KFC Server is a web-based implementation of the KFC (Knowledge-based FADE and Contacts) model-a machine learning approach for the prediction of binding hot spots, or the subset of residues that account for most of a protein interface's; binding free energy. The server facilitates the automated analysis of a user submitted protein-protein or protein-DNA interface and the visualization of its hot spot predictions. For each residue in the interface, the KFC Server characterizes its local structural environment, compares that environment to the environments of experimentally determined hot spots and predicts if the interface residue is a hot spot. After the computational analysis, the user can visualize the results using an interactive job viewer able to quickly highlight predicted hot spots and surrounding structural features within the protein structure. The KFC Server is accessible at http://kfc.mitchell-lab.org.
The Protein Interaction Network of Bacteriophage Lambda with Its Host, Escherichia coli
Blasche, Sonja; Wuchty, Stefan; Rajagopala, Seesandra V.
2013-01-01
Although most of the 73 open reading frames (ORFs) in bacteriophage λ have been investigated intensively, the function of many genes in host-phage interactions remains poorly understood. Using yeast two-hybrid screens of all lambda ORFs for interactions with its host Escherichia coli, we determined a raw data set of 631 host-phage interactions resulting in a set of 62 high-confidence interactions after multiple rounds of retesting. These links suggest novel regulatory interactions between the E. coli transcriptional network and lambda proteins. Targeted host proteins and genes required for lambda infection are enriched among highly connected proteins, suggesting that bacteriophages resemble interaction patterns of human viruses. Lambda tail proteins interact with both bacterial fimbrial proteins and E. coli proteins homologous to other phage proteins. Lambda appears to dramatically differ from other phages, such as T7, because of its unusually large number of modified and processed proteins, which reduces the number of host-virus interactions detectable by yeast two-hybrid screens. PMID:24049175
Wu, Min; Kwoh, Chee-Keong; Li, Xiaoli; Zheng, Jie
2014-09-11
The regulatory mechanism of recombination is one of the most fundamental problems in genomics, with wide applications in genome wide association studies (GWAS), birth-defect diseases, molecular evolution, cancer research, etc. Recombination events cluster into short genomic regions called "recombination hotspots". Recently, a zinc finger protein PRDM9 was reported to regulate recombination hotspots in human and mouse genomes. In addition, a 13-mer motif contained in the binding sites of PRDM9 is found to be enriched in human hotspots. However, this 13-mer motif only covers a fraction of hotspots, indicating that PRDM9 is not the only regulator of recombination hotspots. Therefore, the challenge of discovering other regulators of recombination hotspots becomes significant. Furthermore, recombination is a complex process. Hence, multiple proteins acting as machinery, rather than individual proteins, are more likely to carry out this process in a precise and stable manner. Therefore, the extension of the prediction of individual trans-regulators to protein complexes is also highly desired. In this paper, we introduce a pipeline to identify genes and protein complexes associated with recombination hotspots. First, we prioritize proteins associated with hotspots based on their preference of binding to hotspots and coldspots. Second, using the above identified genes as seeds, we apply the Random Walk with Restart algorithm (RWR) to propagate their influences to other proteins in protein-protein interaction (PPI) networks. Hence, many proteins without DNA-binding information will also be assigned a score to implicate their roles in recombination hotspots. Third, we construct sub-PPI networks induced by top genes ranked by RWR for various species (e.g., yeast, human and mouse) and detect protein complexes in those sub-PPI networks. The GO term analysis show that our prioritizing methods and the RWR algorithm are capable of identifying novel genes associated with
Nagao, Chioko; Izako, Nozomi; Soga, Shinji; Khan, Samia Haseeb; Kawabata, Shigeki; Shirai, Hiroki; Mizuguchi, Kenji
2012-10-01
Proteins interact with different partners to perform different functions and it is important to elucidate the determinants of partner specificity in protein complex formation. Although methods for detecting specificity determining positions have been developed previously, direct experimental evidence for these amino acid residues is scarce, and the lack of information has prevented further computational studies. In this article, we constructed a dataset that is likely to exhibit specificity in protein complex formation, based on available crystal structures and several intuitive ideas about interaction profiles and functional subclasses. We then defined a "structure-based specificity determining position (sbSDP)" as a set of equivalent residues in a protein family showing a large variation in their interaction energy with different partners. We investigated sequence and structural features of sbSDPs and demonstrated that their amino acid propensities significantly differed from those of other interacting residues and that the importance of many of these residues for determining specificity had been verified experimentally. Copyright © 2012 Wiley Periodicals, Inc.
Cao, HuanHuan; Zhang, YuHang; Zhao, Jia; Zhu, Liucun; Wang, Yi; Li, JiaRui; Feng, Yuan-Ming; Zhang, Ning
2017-01-01
Ebola hemorrhagic fever (EHF) is caused by Ebola virus (EBOV). It is reported that human could be infected by EBOV with a high fatality rate. However, association factors between EBOV and host still tend to be ambiguous. According to the "guilt by association" (GBA) principle, proteins interacting with each other are very likely to function similarly or the same. Based on this assumption, we tried to obtain EBOV infection-related human genes in a protein-protein interaction network using Dijkstra algorithm. We hope it could contribute to the discovery of novel effective treatments. Finally, 15 genes were selected as potential EBOV infection-related human genes. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Protein prenylation: a new mode of host-pathogen interaction.
Amaya, Moushimi; Baranova, Ancha; van Hoek, Monique L
2011-12-09
Post translational modifications are required for proteins to be fully functional. The three step process, prenylation, leads to farnesylation or geranylgeranylation, which increase the hydrophobicity of the prenylated protein for efficient anchoring into plasma membranes and/or organellar membranes. Prenylated proteins function in a number of signaling and regulatory pathways that are responsible for basic cell operations. Well characterized prenylated proteins include Ras, Rac and Rho. Recently, pathogenic prokaryotic proteins, such as SifA and AnkB, have been shown to be prenylated by eukaryotic host cell machinery, but their functions remain elusive. The identification of other bacterial proteins undergoing this type of host-directed post-translational modification shows promise in elucidating host-pathogen interactions to develop new therapeutics. This review incorporates new advances in the study of protein prenylation into a broader aspect of biology with a focus on host-pathogen interaction. Copyright © 2011 Elsevier Inc. All rights reserved.
Timasheff, Serge N
2002-07-23
Solvent additives (cosolvents, osmolytes) modulate biochemical reactions if, during the course of the reaction, there is a change in preferential interactions of solvent components with the reacting system. Preferential interactions can be expressed in terms of preferential binding of the cosolvent or its preferential exclusion (preferential hydration). The driving force is the perturbation by the protein of the chemical potential of the cosolvent. It is shown that the measured change of the amount of water in contact with protein during the course of the reaction modulated by an osmolyte is a change in preferential hydration that is strictly a measure of the cosolvent chemical potential perturbation by the protein in the ternary water-protein-cosolvent system. It is not equal to the change in water of hydration, because water of hydration is a reflection strictly of protein-water forces in a binary system. There is no direct relation between water of preferential hydration and water of hydration.
Visualization and targeted disruption of protein interactions in living cells
Herce, Henry D.; Deng, Wen; Helma, Jonas; Leonhardt, Heinrich; Cardoso, M. Cristina
2013-01-01
Protein–protein interactions are the basis of all processes in living cells, but most studies of these interactions rely on biochemical in vitro assays. Here we present a simple and versatile fluorescent-three-hybrid (F3H) strategy to visualize and target protein–protein interactions. A high-affinity nanobody anchors a GFP-fusion protein of interest at a defined cellular structure and the enrichment of red-labelled interacting proteins is measured at these sites. With this approach, we visualize the p53–HDM2 interaction in living cells and directly monitor the disruption of this interaction by Nutlin 3, a drug developed to boost p53 activity in cancer therapy. We further use this approach to develop a cell-permeable vector that releases a highly specific peptide disrupting the p53 and HDM2 interaction. The availability of multiple anchor sites and the simple optical readout of this nanobody-based capture assay enable systematic and versatile analyses of protein–protein interactions in practically any cell type and species. PMID:24154492
Determination and Quantification of Molecular Interactions in Protein Films: A Review
Directory of Open Access Journals (Sweden)
Felicia Hammann
2014-12-01
Full Text Available Protein based films are nowadays also prepared with the aim of replacing expensive, crude oil-based polymers as environmentally friendly and renewable alternatives. The protein structure determines the ability of protein chains to form intra- and intermolecular bonds, whereas the degree of cross-linking depends on the amino acid composition and molecular weight of the protein, besides the conditions used in film preparation and processing. The functionality varies significantly depending on the type of protein and affects the resulting film quality and properties. This paper reviews the methods used in examination of molecular interactions in protein films and discusses how these intermolecular interactions can be quantified. The qualitative determination methods can be distinguished by structural analysis of solutions (electrophoretic analysis, size exclusion chromatography and analysis of solid films (spectroscopy techniques, X-ray scattering methods. To quantify molecular interactions involved, two methods were found to be the most suitable: protein film swelling and solubility. The importance of non-covalent and covalent interactions in protein films can be investigated using different solvents. The research was focused on whey protein, whereas soy protein and wheat gluten were included as further examples of proteins.
Determination and Quantification of Molecular Interactions in Protein Films: A Review.
Hammann, Felicia; Schmid, Markus
2014-12-10
Protein based films are nowadays also prepared with the aim of replacing expensive, crude oil-based polymers as environmentally friendly and renewable alternatives. The protein structure determines the ability of protein chains to form intra- and intermolecular bonds, whereas the degree of cross-linking depends on the amino acid composition and molecular weight of the protein, besides the conditions used in film preparation and processing. The functionality varies significantly depending on the type of protein and affects the resulting film quality and properties. This paper reviews the methods used in examination of molecular interactions in protein films and discusses how these intermolecular interactions can be quantified. The qualitative determination methods can be distinguished by structural analysis of solutions (electrophoretic analysis, size exclusion chromatography) and analysis of solid films (spectroscopy techniques, X-ray scattering methods). To quantify molecular interactions involved, two methods were found to be the most suitable: protein film swelling and solubility. The importance of non-covalent and covalent interactions in protein films can be investigated using different solvents. The research was focused on whey protein, whereas soy protein and wheat gluten were included as further examples of proteins.
Beghein, Els; Gettemans, Jan
2017-01-01
Over the last two decades, nanobodies or single-domain antibodies have found their way in research, diagnostics, and therapy. These antigen-binding fragments, derived from Camelid heavy chain only antibodies, possess remarkable characteristics that favor their use over conventional antibodies or fragments thereof, in selected areas of research. In this review, we assess the current status of nanobodies as research tools in diverse aspects of fundamental research. We discuss the use of nanobodies as detection reagents in fluorescence microscopy and focus on recent advances in super-resolution microscopy. Second, application of nanobody technology in investigating protein-protein interactions is reviewed, with emphasis on possible uses in mass spectrometry. Finally, we discuss the potential value of nanobodies in studying protein function, and we focus on their recently reported application in targeted protein degradation. Throughout the review, we highlight state-of-the-art engineering strategies that could expand nanobody versatility and we suggest future applications of the technology in the selected areas of fundamental research.
Directory of Open Access Journals (Sweden)
Arnout Voet
Full Text Available One of the underlying principles in drug discovery is that a biologically active compound is complimentary in shape and molecular recognition features to its receptor. This principle infers that molecules binding to the same receptor may share some common features. Here, we have investigated whether the electrostatic similarity can be used for the discovery of small molecule protein-protein interaction inhibitors (SMPPIIs. We have developed a method that can be used to evaluate the similarity of electrostatic potentials between small molecules and known protein ligands. This method was implemented in a software called EleKit. Analyses of all available (at the time of research SMPPII structures indicate that SMPPIIs bear some similarities of electrostatic potential with the ligand proteins of the same receptor. This is especially true for the more polar SMPPIIs. Retrospective analysis of several successful SMPPIIs has shown the applicability of EleKit in the design of new SMPPIIs.
THERMODYNAMICS OF PROTEIN-LIGAND INTERACTIONS AND THEIR ANALYSIS
Directory of Open Access Journals (Sweden)
Rummi Devi Saini
2017-11-01
Full Text Available Physiological processes are controlled mainly by intermolecular recognition mechanisms which involve protein–protein and protein–ligand interactions with a high specificity and affinity to form a specific complex. Proteins being an important class of macromolecules in biological systems, it is important to understand their actions through binding to other molecules of proteins or ligands. In fact, the binding of low molecular weight ligands to proteins plays a significant role in regulating biological processes such as cellular metabolism and signal transmission. Therefore knowledge of the protein–ligand interactions and the knowledge of the mechanisms involved in the protein-ligand recognition and binding are key in understanding biology at molecular level which will facilitate the discovery, design, and development of drugs. In this review, the mechanisms involved in protein–ligand binding, the binding kinetics, thermodynamic concepts and binding driving forces are discussed. Thermodynamic mechanisms involved in a few important protein-ligand binding are described. Various spectroscopic, non-spectroscopic and computational method for analysis of protein–ligand binding are also discussed.
Miura, Toyokazu; Takeo, Satoru; Ntege, Edward H; Otsuki, Hitoshi; Sawasaki, Tatsuya; Ishino, Tomoko; Takashima, Eizo; Tsuboi, Takafumi
2018-06-02
Malaria merozoite apical organelles; microneme and rhoptry secreted proteins play functional roles during and following invasion of host erythrocytes. Among numerous proteins, the rhoptries discharge high molecular weight proteins known as RhopH complex. Recent reports suggest that the RhopH complex is essential for growth and survival of the malaria parasite within erythrocytes. However, an in-depth understanding of the host-parasite molecular interactions is indispensable. Here we utilized a comprehensive mouse erythrocyte protein library consisting of 443 proteins produced by a wheat germ cell-free system, combined with AlphaScreen technology to identify mouse erythrocyte calmyrin as an interacting molecule of the rodent malaria parasite Plasmodium yoelii RhopH complex (PyRhopH). The PyRhopH interaction was dependent on the calmyrin N-terminus and divalent cation capacity. The finding unveils a recommendable and invaluable usefulness of our comprehensive mouse erythrocyte protein library together with the AlphaScreen technology in investigating a wide-range of host-parasite molecular interactions. Copyright © 2018 Elsevier Inc. All rights reserved.
Can understanding the packing of side chains improve the design of protein-protein interactions?
Zhou, Alice; O'Hern, Corey; Regan, Lynne
2011-03-01
With the long-term goal to improve the design of protein-protein interactions, we have begun extensive computational studies to understand how side-chains of key residues of binding partners geometrically fit together at protein-peptide interfaces, e.g. the tetratrico-peptide repeat protein and its cognate peptide). We describe simple atomic-scale models of hydrophobic dipeptides, which include hard-core repulsion, bond length and angle constraints, and Van der Waals attraction. By completely enumerating all minimal energy structures in these systems, we are able to reproduce important features of the probability distributions of side chain dihedral angles of hydrophic residues in the protein data bank. These results are the crucial first step in developing computational models that can predict the side chain conformations of residues at protein-peptide interfaces. CSO acknowledges support from NSF grant no. CMMT-1006527.
[Interaction of protein with charged colloidal particles].
Durdenko, E V; Kuznetsova, S M; Basova, L V; Tikhonenko, S A; Saburova, E A
2011-01-01
The functional state of three proteins of different molecular weight (urease, lactate dehydrogenase, and hemoglobin) in the presence of the linear polyelectrolytes poly(allylamine hydrochloride) (PAA) and sodium poly(styrenesulfonate) (PSS) in the dissolved state and of the same polyelectrolytes bound to the surface of microspheres has been investigated. Microspheres were prepared by consecutive absorption of oppositely charged polyelectrolytes so that the outer layer of the shell was PAA for the acidic protein urease, and PSS for the alkaline proteins LDH and hemoglobin. It was shown that the dissolved polyelectrolyte completely inactivates all three proteins within one minute with a slight difference in the time constant. (By Hb inactivation are conventionally meant changes in the heme environment observed from the spectrum in the Soret band.) In the presence of microspheres, the proteins were adsorbed on their surface; in this case, more than 95% of the activity was retained within two hours. The proportion of the protein adsorbed on microspheres accounted for about 98% for urease, 72% for Hb, and 35% for LDH, as determined from the tryptophan fluorescence data. The interaction of hemoglobin with another type of charged colloidal particles, phospholipid vesicles, leads to the destruction of the tertiary structure of the protein, which made itself evident in the optical absorption spectra in the Soret band, as well as the spectra of tryptophan fluorescence and circular dichroism. In this case, according to circular dichroism, the percentage of alpha-helical structure of Hb was maintained. The differences in the physical and chemical mechanisms of interaction of proteins with these two types of charged colloidal particles that leads to differences in the degree of denaturing effects are discussed.
Waadt, Rainer; Manalansan, Bianca; Rauniyar, Navin; Munemasa, Shintaro; Booker, Matthew A.; Brandt, Benjamin; Waadt, Christian; Nusinow, Dmitri A.; Kay, Steve A.; Kunz, Hans-Henning; Schumacher, Karin; DeLong, Alison; Yates, John R.; Schroeder, Julian I.
2015-01-01
The plant hormone abscisic acid (ABA) controls growth and development and regulates plant water status through an established signaling pathway. In the presence of ABA, pyrabactin resistance/regulatory component of ABA receptor proteins inhibit type 2C protein phosphatases (PP2Cs). This, in turn, enables the activation of Sucrose Nonfermenting1-Related Protein Kinases2 (SnRK2). Open Stomata1 (OST1)/SnRK2.6/SRK2E is a major SnRK2-type protein kinase responsible for mediating ABA responses. Arabidopsis (Arabidopsis thaliana) expressing an epitope-tagged OST1 in the recessive ost1-3 mutant background was used for the copurification and identification of OST1-interacting proteins after osmotic stress and ABA treatments. These analyses, which were confirmed using bimolecular fluorescence complementation and coimmunoprecipitation, unexpectedly revealed homo- and heteromerization of OST1 with SnRK2.2, SnRK2.3, OST1, and SnRK2.8. Furthermore, several OST1-complexed proteins were identified as type 2A protein phosphatase (PP2A) subunits and as proteins involved in lipid and galactolipid metabolism. More detailed analyses suggested an interaction network between ABA-activated SnRK2-type protein kinases and several PP2A-type protein phosphatase regulatory subunits. pp2a double mutants exhibited a reduced sensitivity to ABA during seed germination and stomatal closure and an enhanced ABA sensitivity in root growth regulation. These analyses add PP2A-type protein phosphatases as another class of protein phosphatases to the interaction network of SnRK2-type protein kinases. PMID:26175513
Notable Aspects of Glycan-Protein Interactions
Directory of Open Access Journals (Sweden)
Miriam Cohen
2015-09-01
Full Text Available This mini review highlights several interesting aspects of glycan-mediated interactions that are common between cells, bacteria, and viruses. Glycans are ubiquitously found on all living cells, and in the extracellular milieu of multicellular organisms. They are known to mediate initial binding and recognition events of both immune cells and pathogens with their target cells or tissues. The host target tissues are hidden under a layer of secreted glycosylated decoy targets. In addition, pathogens can utilize and display host glycans to prevent identification as foreign by the host’s immune system (molecular mimicry. Both the host and pathogens continually evolve. The host evolves to prevent infection and the pathogens evolve to evade host defenses. Many pathogens express both glycan-binding proteins and glycosidases. Interestingly, these proteins are often located at the tip of elongated protrusions in bacteria, or in the leading edge of the cell. Glycan-protein interactions have low affinity and, as a result, multivalent interactions are often required to achieve biologically relevant binding. These enable dynamic forms of adhesion mechanisms, reviewed here, and include rolling (cells, stick and roll (bacteria or surfacing (viruses.
Felgueiras, Juliana; Silva, Joana Vieira; Fardilha, Margarida
2018-01-16
"A man is known by the company he keeps" is a popular expression that perfectly fits proteins. A common approach to characterize the function of a target protein is to identify its interacting partners and thus infer its roles based on the known functions of the interactors. Protein-protein interaction networks (PPINs) have been created for several organisms, including humans, primarily as results of high-throughput screenings, such as yeast two-hybrid (Y2H). Their unequivocal use to understand events underlying human pathophysiology is promising in identifying genes and proteins associated with diseases. Therefore, numerous opportunities have emerged for PPINs as tools for clinical management of diseases: network-based disease classification systems, discovery of biomarkers and identification of therapeutic targets. Despite the great advantages of PPINs, their use is still unrecognised by several researchers who generate high-throughput data to generally characterize interactions in a certain model or to select an interaction to study in detail. We strongly believe that both approaches are not exclusive and that we can use PPINs as a complementary methodology and rich-source of information to the initial study proposal. Here, we suggest a pipeline to deal with Y2H results using bioinformatics tools freely available for academics. Yeast two-hybrid is widely-used to identify protein-protein interactions. Conventionally, the positive clones that result from a yeast two-hybrid screening are sequenced to identify the interactors of the protein of interest (also known as bait protein), and few interactions, thought as potentially relevant for the model in study, are selected for further validation using biochemical methods (e.g. co-immunoprecipitation and co-localization). The huge amount of data that is potentially lost during this conservative approach motivated us to write this tutorial-like review, so that researchers feel encouraged to take advantage of
Directory of Open Access Journals (Sweden)
Nagesh R Aragam
Full Text Available Circumsporozoite protein (CS is a leading vaccine antigen for falciparum malaria, but is highly polymorphic in natural parasite populations. The factors driving this diversity are unclear, but non-random assortment of the T cell epitopes TH2 and TH3 has been observed in a Kenyan parasite population. The recent publication of the crystal structure of the variable C terminal region of the protein allows the assessment of the impact of diversity on protein structure and T cell epitope assortment. Using data from the Gambia (55 isolates and Malawi (235 isolates, we evaluated the patterns of diversity within and between epitopes in these two distantly-separated populations. Only non-synonymous mutations were observed with the vast majority in both populations at similar frequencies suggesting strong selection on this region. A non-random pattern of T cell epitope assortment was seen in Malawi and in the Gambia, but structural analysis indicates no intramolecular spatial interactions. Using the information from these parasite populations, structural analysis reveals that polymorphic amino acids within TH2 and TH3 colocalize to one side of the protein, surround, but do not involve, the hydrophobic pocket in CS, and predominately involve charge switches. In addition, free energy analysis suggests residues forming and behind the novel pocket within CS are tightly constrained and well conserved in all alleles. In addition, free energy analysis shows polymorphic residues tend to be populated by energetically unfavorable amino acids. In combination, these findings suggest the diversity of T cell epitopes in CS may be primarily an evolutionary response to intermolecular interactions at the surface of the protein potentially counteracting antibody-mediated immune recognition or evolving host receptor diversity.
International Nuclear Information System (INIS)
Duchene, Johan; Chauhan, Sharmila D.; Lopez, Frederic; Pecher, Christiane; Esteve, Jean-Pierre; Girolami, Jean-Pierre; Bascands, Jean-Loup; Schanstra, Joost P.
2005-01-01
Recently, we have described a novel protein-protein interaction between the G-protein coupled bradykinin B2 receptor and tyrosine phosphatase SHP-2 via an immunoreceptor tyrosine-based inhibition motif (ITIM) sequence located in the C-terminal part of the B2 receptor and the Src homology (SH2) domains of SHP-2. Here we show that phospholipase C (PLC)γ1, another SH2 domain containing protein, can also interact with this ITIM sequence. Using surface plasmon resonance analysis, we observed that PLCγ1 interacted with a peptide containing the phosphorylated form of the bradykinin B2 receptor ITIM sequence. In CHO cells expressing the wild-type B2 receptor, bradykinin-induced transient recruitment and activation of PLCγ1. Interestingly, this interaction was only observed in quiescent and not in proliferating cells. Mutation of the key ITIM residue abolished this interaction with and activation of PLCγ1. Finally we also identified bradykinin-induced PLCγ1 recruitment and activation in primary culture renal mesangial cells
International Nuclear Information System (INIS)
Galvão, C.W.; Souza, E.M.; Etto, R.M.; Pedrosa, F.O.; Chubatsu, L.S.; Yates, M.G.; Schumacher, J.; Buck, M.; Steffens, M.B.R.
2012-01-01
DNA repair is crucial to the survival of all organisms. The bacterial RecA protein is a central component in the SOS response and in recombinational and SOS DNA repairs. The RecX protein has been characterized as a negative modulator of RecA activity in many bacteria. The recA and recX genes of Herbaspirillum seropedicae constitute a single operon, and evidence suggests that RecX participates in SOS repair. In the present study, we show that the H. seropedicae RecX protein (RecX Hs ) can interact with the H. seropedicae RecA protein (RecA Hs ) and that RecA Hs possesses ATP binding, ATP hydrolyzing and DNA strand exchange activities. RecX Hs inhibited 90% of the RecA Hs DNA strand exchange activity even when present in a 50-fold lower molar concentration than RecA Hs . RecA Hs ATP binding was not affected by the addition of RecX, but the ATPase activity was reduced. When RecX Hs was present before the formation of RecA filaments (RecA-ssDNA), inhibition of ATPase activity was substantially reduced and excess ssDNA also partially suppressed this inhibition. The results suggest that the RecX Hs protein negatively modulates the RecA Hs activities by protein-protein interactions and also by DNA-protein interactions
Directory of Open Access Journals (Sweden)
C.W. Galvão
2012-12-01
Full Text Available DNA repair is crucial to the survival of all organisms. The bacterial RecA protein is a central component in the SOS response and in recombinational and SOS DNA repairs. The RecX protein has been characterized as a negative modulator of RecA activity in many bacteria. The recA and recX genes of Herbaspirillum seropedicae constitute a single operon, and evidence suggests that RecX participates in SOS repair. In the present study, we show that the H. seropedicae RecX protein (RecX Hs can interact with the H. seropedicaeRecA protein (RecA Hs and that RecA Hs possesses ATP binding, ATP hydrolyzing and DNA strand exchange activities. RecX Hs inhibited 90% of the RecA Hs DNA strand exchange activity even when present in a 50-fold lower molar concentration than RecA Hs. RecA Hs ATP binding was not affected by the addition of RecX, but the ATPase activity was reduced. When RecX Hs was present before the formation of RecA filaments (RecA-ssDNA, inhibition of ATPase activity was substantially reduced and excess ssDNA also partially suppressed this inhibition. The results suggest that the RecX Hs protein negatively modulates the RecA Hs activities by protein-protein interactions and also by DNA-protein interactions.
Galvão, C W; Souza, E M; Etto, R M; Pedrosa, F O; Chubatsu, L S; Yates, M G; Schumacher, J; Buck, M; Steffens, M B R
2012-12-01
DNA repair is crucial to the survival of all organisms. The bacterial RecA protein is a central component in the SOS response and in recombinational and SOS DNA repairs. The RecX protein has been characterized as a negative modulator of RecA activity in many bacteria. The recA and recX genes of Herbaspirillum seropedicae constitute a single operon, and evidence suggests that RecX participates in SOS repair. In the present study, we show that the H. seropedicae RecX protein (RecX Hs) can interact with the H. seropedicaeRecA protein (RecA Hs) and that RecA Hs possesses ATP binding, ATP hydrolyzing and DNA strand exchange activities. RecX Hs inhibited 90% of the RecA Hs DNA strand exchange activity even when present in a 50-fold lower molar concentration than RecA Hs. RecA Hs ATP binding was not affected by the addition of RecX, but the ATPase activity was reduced. When RecX Hs was present before the formation of RecA filaments (RecA-ssDNA), inhibition of ATPase activity was substantially reduced and excess ssDNA also partially suppressed this inhibition. The results suggest that the RecX Hs protein negatively modulates the RecA Hs activities by protein-protein interactions and also by DNA-protein interactions.
Marsh, Lorraine
2015-01-01
Many systems in biology rely on binding of ligands to target proteins in a single high-affinity conformation with a favorable ΔG. Alternatively, interactions of ligands with protein regions that allow diffuse binding, distributed over multiple sites and conformations, can exhibit favorable ΔG because of their higher entropy. Diffuse binding may be biologically important for multidrug transporters and carrier proteins. A fine-grained computational method for numerical integration of total binding ΔG arising from diffuse regional interaction of a ligand in multiple conformations using a Markov Chain Monte Carlo (MCMC) approach is presented. This method yields a metric that quantifies the influence on overall ligand affinity of ligand binding to multiple, distinct sites within a protein binding region. This metric is essentially a measure of dispersion in equilibrium ligand binding and depends on both the number of potential sites of interaction and the distribution of their individual predicted affinities. Analysis of test cases indicates that, for some ligand/protein pairs involving transporters and carrier proteins, diffuse binding contributes greatly to total affinity, whereas in other cases the influence is modest. This approach may be useful for studying situations where "nonspecific" interactions contribute to biological function.