WorldWideScience

Sample records for comparative protein modeling

  1. Exploration of freely available web-interfaces for comparative homology modelling of microbial proteins.

    Science.gov (United States)

    Nema, Vijay; Pal, Sudhir Kumar

    2013-01-01

    This study was conducted to find the best suited freely available software for modelling of proteins by taking a few sample proteins. The proteins used were small to big in size with available crystal structures for the purpose of benchmarking. Key players like Phyre2, Swiss-Model, CPHmodels-3.0, Homer, (PS)2, (PS)(2)-V(2), Modweb were used for the comparison and model generation. Benchmarking process was done for four proteins, Icl, InhA, and KatG of Mycobacterium tuberculosis and RpoB of Thermus Thermophilus to get the most suited software. Parameters compared during analysis gave relatively better values for Phyre2 and Swiss-Model. This comparative study gave the information that Phyre2 and Swiss-Model make good models of small and large proteins as compared to other screened software. Other software was also good but is often not very efficient in providing full-length and properly folded structure.

  2. @TOME-2: a new pipeline for comparative modeling of protein-ligand complexes.

    Science.gov (United States)

    Pons, Jean-Luc; Labesse, Gilles

    2009-07-01

    @TOME 2.0 is new web pipeline dedicated to protein structure modeling and small ligand docking based on comparative analyses. @TOME 2.0 allows fold recognition, template selection, structural alignment editing, structure comparisons, 3D-model building and evaluation. These tasks are routinely used in sequence analyses for structure prediction. In our pipeline the necessary software is efficiently interconnected in an original manner to accelerate all the processes. Furthermore, we have also connected comparative docking of small ligands that is performed using protein-protein superposition. The input is a simple protein sequence in one-letter code with no comment. The resulting 3D model, protein-ligand complexes and structural alignments can be visualized through dedicated Web interfaces or can be downloaded for further studies. These original features will aid in the functional annotation of proteins and the selection of templates for molecular modeling and virtual screening. Several examples are described to highlight some of the new functionalities provided by this pipeline. The server and its documentation are freely available at http://abcis.cbs.cnrs.fr/AT2/

  3. Automated protein structure modeling with SWISS-MODEL Workspace and the Protein Model Portal.

    Science.gov (United States)

    Bordoli, Lorenza; Schwede, Torsten

    2012-01-01

    Comparative protein structure modeling is a computational approach to build three-dimensional structural models for proteins using experimental structures of related protein family members as templates. Regular blind assessments of modeling accuracy have demonstrated that comparative protein structure modeling is currently the most reliable technique to model protein structures. Homology models are often sufficiently accurate to substitute for experimental structures in a wide variety of applications. Since the usefulness of a model for specific application is determined by its accuracy, model quality estimation is an essential component of protein structure prediction. Comparative protein modeling has become a routine approach in many areas of life science research since fully automated modeling systems allow also nonexperts to build reliable models. In this chapter, we describe practical approaches for automated protein structure modeling with SWISS-MODEL Workspace and the Protein Model Portal.

  4. Automated Protein Structure Modeling with SWISS-MODEL Workspace and the Protein Model Portal

    OpenAIRE

    Bordoli, Lorenza; Schwede, Torsten

    2012-01-01

    Comparative protein structure modeling is a computational approach to build three-dimensional structural models for proteins using experimental structures of related protein family members as templates. Regular blind assessments of modeling accuracy have demonstrated that comparative protein structure modeling is currently the most reliable technique to model protein structures. Homology models are often sufficiently accurate to substitute for experimental structures in a wide variety of appl...

  5. Comparative Study of Elastic Network Model and Protein Contact Network for Protein Complexes: The Hemoglobin Case

    Directory of Open Access Journals (Sweden)

    Guang Hu

    2017-01-01

    Full Text Available The overall topology and interfacial interactions play key roles in understanding structural and functional principles of protein complexes. Elastic Network Model (ENM and Protein Contact Network (PCN are two widely used methods for high throughput investigation of structures and interactions within protein complexes. In this work, the comparative analysis of ENM and PCN relative to hemoglobin (Hb was taken as case study. We examine four types of structural and dynamical paradigms, namely, conformational change between different states of Hbs, modular analysis, allosteric mechanisms studies, and interface characterization of an Hb. The comparative study shows that ENM has an advantage in studying dynamical properties and protein-protein interfaces, while PCN is better for describing protein structures quantitatively both from local and from global levels. We suggest that the integration of ENM and PCN would give a potential but powerful tool in structural systems biology.

  6. Comparing side chain packing in soluble proteins, protein-protein interfaces, and transmembrane proteins.

    Science.gov (United States)

    Gaines, J C; Acebes, S; Virrueta, A; Butler, M; Regan, L; O'Hern, C S

    2018-05-01

    We compare side chain prediction and packing of core and non-core regions of soluble proteins, protein-protein interfaces, and transmembrane proteins. We first identified or created comparable databases of high-resolution crystal structures of these 3 protein classes. We show that the solvent-inaccessible cores of the 3 classes of proteins are equally densely packed. As a result, the side chains of core residues at protein-protein interfaces and in the membrane-exposed regions of transmembrane proteins can be predicted by the hard-sphere plus stereochemical constraint model with the same high prediction accuracies (>90%) as core residues in soluble proteins. We also find that for all 3 classes of proteins, as one moves away from the solvent-inaccessible core, the packing fraction decreases as the solvent accessibility increases. However, the side chain predictability remains high (80% within 30°) up to a relative solvent accessibility, rSASA≲0.3, for all 3 protein classes. Our results show that ≈40% of the interface regions in protein complexes are "core", that is, densely packed with side chain conformations that can be accurately predicted using the hard-sphere model. We propose packing fraction as a metric that can be used to distinguish real protein-protein interactions from designed, non-binding, decoys. Our results also show that cores of membrane proteins are the same as cores of soluble proteins. Thus, the computational methods we are developing for the analysis of the effect of hydrophobic core mutations in soluble proteins will be equally applicable to analyses of mutations in membrane proteins. © 2018 Wiley Periodicals, Inc.

  7. Assessment and Challenges of Ligand Docking into Comparative Models of G-Protein Coupled Receptors

    DEFF Research Database (Denmark)

    Nguyen, E.D.; Meiler, J.; Norn, C.

    2013-01-01

    screening and to design and optimize drug candidates. However, low sequence identity between receptors, conformational flexibility, and chemical diversity of ligands present an enormous challenge to molecular modeling approaches. It is our hypothesis that rapid Monte-Carlo sampling of protein backbone...... extracellular loop. Furthermore, these models are consistently correlated with low Rosetta energy score. To predict their binding modes, ligand conformers of the 14 ligands co-crystalized with the GPCRs were docked against the top ranked comparative models. In contrast to the comparative models themselves...

  8. The Protein Model Portal

    OpenAIRE

    Arnold, Konstantin; Kiefer, Florian; Kopp, J?rgen; Battey, James N. D.; Podvinec, Michael; Westbrook, John D.; Berman, Helen M.; Bordoli, Lorenza; Schwede, Torsten

    2008-01-01

    Structural Genomics has been successful in determining the structures of many unique proteins in a high throughput manner. Still, the number of known protein sequences is much larger than the number of experimentally solved protein structures. Homology (or comparative) modeling methods make use of experimental protein structures to build models for evolutionary related proteins. Thereby, experimental structure determination efforts and homology modeling complement each other in the exploratio...

  9. Effective comparative analysis of protein-protein interaction networks by measuring the steady-state network flow using a Markov model.

    Science.gov (United States)

    Jeong, Hyundoo; Qian, Xiaoning; Yoon, Byung-Jun

    2016-10-06

    Comparative analysis of protein-protein interaction (PPI) networks provides an effective means of detecting conserved functional network modules across different species. Such modules typically consist of orthologous proteins with conserved interactions, which can be exploited to computationally predict the modules through network comparison. In this work, we propose a novel probabilistic framework for comparing PPI networks and effectively predicting the correspondence between proteins, represented as network nodes, that belong to conserved functional modules across the given PPI networks. The basic idea is to estimate the steady-state network flow between nodes that belong to different PPI networks based on a Markov random walk model. The random walker is designed to make random moves to adjacent nodes within a PPI network as well as cross-network moves between potential orthologous nodes with high sequence similarity. Based on this Markov random walk model, we estimate the steady-state network flow - or the long-term relative frequency of the transitions that the random walker makes - between nodes in different PPI networks, which can be used as a probabilistic score measuring their potential correspondence. Subsequently, the estimated scores can be used for detecting orthologous proteins in conserved functional modules through network alignment. Through evaluations based on multiple real PPI networks, we demonstrate that the proposed scheme leads to improved alignment results that are biologically more meaningful at reduced computational cost, outperforming the current state-of-the-art algorithms. The source code and datasets can be downloaded from http://www.ece.tamu.edu/~bjyoon/CUFID .

  10. The utility of comparative models and the local model quality for protein crystal structure determination by Molecular Replacement

    Directory of Open Access Journals (Sweden)

    Pawlowski Marcin

    2012-11-01

    Full Text Available Abstract Background Computational models of protein structures were proved to be useful as search models in Molecular Replacement (MR, a common method to solve the phase problem faced by macromolecular crystallography. The success of MR depends on the accuracy of a search model. Unfortunately, this parameter remains unknown until the final structure of the target protein is determined. During the last few years, several Model Quality Assessment Programs (MQAPs that predict the local accuracy of theoretical models have been developed. In this article, we analyze whether the application of MQAPs improves the utility of theoretical models in MR. Results For our dataset of 615 search models, the real local accuracy of a model increases the MR success ratio by 101% compared to corresponding polyalanine templates. On the contrary, when local model quality is not utilized in MR, the computational models solved only 4.5% more MR searches than polyalanine templates. For the same dataset of the 615 models, a workflow combining MR with predicted local accuracy of a model found 45% more correct solution than polyalanine templates. To predict such accuracy MetaMQAPclust, a “clustering MQAP” was used. Conclusions Using comparative models only marginally increases the MR success ratio in comparison to polyalanine structures of templates. However, the situation changes dramatically once comparative models are used together with their predicted local accuracy. A new functionality was added to the GeneSilico Fold Prediction Metaserver in order to build models that are more useful for MR searches. Additionally, we have developed a simple method, AmIgoMR (Am I good for MR?, to predict if an MR search with a template-based model for a given template is likely to find the correct solution.

  11. The Protein Model Portal.

    Science.gov (United States)

    Arnold, Konstantin; Kiefer, Florian; Kopp, Jürgen; Battey, James N D; Podvinec, Michael; Westbrook, John D; Berman, Helen M; Bordoli, Lorenza; Schwede, Torsten

    2009-03-01

    Structural Genomics has been successful in determining the structures of many unique proteins in a high throughput manner. Still, the number of known protein sequences is much larger than the number of experimentally solved protein structures. Homology (or comparative) modeling methods make use of experimental protein structures to build models for evolutionary related proteins. Thereby, experimental structure determination efforts and homology modeling complement each other in the exploration of the protein structure space. One of the challenges in using model information effectively has been to access all models available for a specific protein in heterogeneous formats at different sites using various incompatible accession code systems. Often, structure models for hundreds of proteins can be derived from a given experimentally determined structure, using a variety of established methods. This has been done by all of the PSI centers, and by various independent modeling groups. The goal of the Protein Model Portal (PMP) is to provide a single portal which gives access to the various models that can be leveraged from PSI targets and other experimental protein structures. A single interface allows all existing pre-computed models across these various sites to be queried simultaneously, and provides links to interactive services for template selection, target-template alignment, model building, and quality assessment. The current release of the portal consists of 7.6 million model structures provided by different partner resources (CSMP, JCSG, MCSG, NESG, NYSGXRC, JCMM, ModBase, SWISS-MODEL Repository). The PMP is available at http://www.proteinmodelportal.org and from the PSI Structural Genomics Knowledgebase.

  12. Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

    Science.gov (United States)

    Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

    2014-11-01

    As many structures of protein-DNA complexes have been known in the past years, several computational methods have been developed to predict DNA-binding sites in proteins. However, its inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure 66.3% and a MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and a MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and a MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. To the best of

  13. New tips for structure prediction by comparative modeling

    Science.gov (United States)

    Rayan, Anwar

    2009-01-01

    Comparative modelling is utilized to predict the 3-dimensional conformation of a given protein (target) based on its sequence alignment to experimentally determined protein structure (template). The use of such technique is already rewarding and increasingly widespread in biological research and drug development. The accuracy of the predictions as commonly accepted depends on the score of sequence identity of the target protein to the template. To assess the relationship between sequence identity and model quality, we carried out an analysis of a set of 4753 sequence and structure alignments. Throughout this research, the model accuracy was measured by root mean square deviations of Cα atoms of the target-template structures. Surprisingly, the results show that sequence identity of the target protein to the template is not a good descriptor to predict the accuracy of the 3-D structure model. However, in a large number of cases, comparative modelling with lower sequence identity of target to template proteins led to more accurate 3-D structure model. As a consequence of this study, we suggest new tips for improving the quality of omparative models, particularly for models whose target-template sequence identity is below 50%. PMID:19255646

  14. ProteinHistorian: tools for the comparative analysis of eukaryote protein origin.

    Directory of Open Access Journals (Sweden)

    John A Capra

    Full Text Available The evolutionary history of a protein reflects the functional history of its ancestors. Recent phylogenetic studies identified distinct evolutionary signatures that characterize proteins involved in cancer, Mendelian disease, and different ontogenic stages. Despite the potential to yield insight into the cellular functions and interactions of proteins, such comparative phylogenetic analyses are rarely performed, because they require custom algorithms. We developed ProteinHistorian to make tools for performing analyses of protein origins widely available. Given a list of proteins of interest, ProteinHistorian estimates the phylogenetic age of each protein, quantifies enrichment for proteins of specific ages, and compares variation in protein age with other protein attributes. ProteinHistorian allows flexibility in the definition of protein age by including several algorithms for estimating ages from different databases of evolutionary relationships. We illustrate the use of ProteinHistorian with three example analyses. First, we demonstrate that proteins with high expression in human, compared to chimpanzee and rhesus macaque, are significantly younger than those with human-specific low expression. Next, we show that human proteins with annotated regulatory functions are significantly younger than proteins with catalytic functions. Finally, we compare protein length and age in many eukaryotic species and, as expected from previous studies, find a positive, though often weak, correlation between protein age and length. ProteinHistorian is available through a web server with an intuitive interface and as a set of command line tools; this allows biologists and bioinformaticians alike to integrate these approaches into their analysis pipelines. ProteinHistorian's modular, extensible design facilitates the integration of new datasets and algorithms. The ProteinHistorian web server, source code, and pre-computed ages for 32 eukaryotic genomes are

  15. Modeling complexes of modeled proteins.

    Science.gov (United States)

    Anishchenko, Ivan; Kundrotas, Petras J; Vakser, Ilya A

    2017-03-01

    Structural characterization of proteins is essential for understanding life processes at the molecular level. However, only a fraction of known proteins have experimentally determined structures. This fraction is even smaller for protein-protein complexes. Thus, structural modeling of protein-protein interactions (docking) primarily has to rely on modeled structures of the individual proteins, which typically are less accurate than the experimentally determined ones. Such "double" modeling is the Grand Challenge of structural reconstruction of the interactome. Yet it remains so far largely untested in a systematic way. We present a comprehensive validation of template-based and free docking on a set of 165 complexes, where each protein model has six levels of structural accuracy, from 1 to 6 Å C α RMSD. Many template-based docking predictions fall into acceptable quality category, according to the CAPRI criteria, even for highly inaccurate proteins (5-6 Å RMSD), although the number of such models (and, consequently, the docking success rate) drops significantly for models with RMSD > 4 Å. The results show that the existing docking methodologies can be successfully applied to protein models with a broad range of structural accuracy, and the template-based docking is much less sensitive to inaccuracies of protein models than the free docking. Proteins 2017; 85:470-478. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  16. Comparative assessment of fluorescent proteins for in vivo imaging in an animal model system.

    Science.gov (United States)

    Heppert, Jennifer K; Dickinson, Daniel J; Pani, Ariel M; Higgins, Christopher D; Steward, Annette; Ahringer, Julie; Kuhn, Jeffrey R; Goldstein, Bob

    2016-11-07

    Fluorescent protein tags are fundamental tools used to visualize gene products and analyze their dynamics in vivo. Recent advances in genome editing have expedited the precise insertion of fluorescent protein tags into the genomes of diverse organisms. These advances expand the potential of in vivo imaging experiments and facilitate experimentation with new, bright, photostable fluorescent proteins. Most quantitative comparisons of the brightness and photostability of different fluorescent proteins have been made in vitro, removed from biological variables that govern their performance in cells or organisms. To address the gap, we quantitatively assessed fluorescent protein properties in vivo in an animal model system. We generated transgenic Caenorhabditis elegans strains expressing green, yellow, or red fluorescent proteins in embryos and imaged embryos expressing different fluorescent proteins under the same conditions for direct comparison. We found that mNeonGreen was not as bright in vivo as predicted based on in vitro data but is a better tag than GFP for specific kinds of experiments, and we report on optimal red fluorescent proteins. These results identify ideal fluorescent proteins for imaging in vivo in C. elegans embryos and suggest good candidate fluorescent proteins to test in other animal model systems for in vivo imaging experiments. © 2016 Heppert et al. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).

  17. Coarse-grain modelling of protein-protein interactions

    NARCIS (Netherlands)

    Baaden, Marc; Marrink, Siewert J.

    2013-01-01

    Here, we review recent advances towards the modelling of protein-protein interactions (PPI) at the coarse-grained (CG) level, a technique that is now widely used to understand protein affinity, aggregation and self-assembly behaviour. PPI models of soluble proteins and membrane proteins are

  18. WEBnm@ v2.0: Web server and services for comparing protein flexibility.

    Science.gov (United States)

    Tiwari, Sandhya P; Fuglebakk, Edvin; Hollup, Siv M; Skjærven, Lars; Cragnolini, Tristan; Grindhaug, Svenn H; Tekle, Kidane M; Reuter, Nathalie

    2014-12-30

    Normal mode analysis (NMA) using elastic network models is a reliable and cost-effective computational method to characterise protein flexibility and by extension, their dynamics. Further insight into the dynamics-function relationship can be gained by comparing protein motions between protein homologs and functional classifications. This can be achieved by comparing normal modes obtained from sets of evolutionary related proteins. We have developed an automated tool for comparative NMA of a set of pre-aligned protein structures. The user can submit a sequence alignment in the FASTA format and the corresponding coordinate files in the Protein Data Bank (PDB) format. The computed normalised squared atomic fluctuations and atomic deformation energies of the submitted structures can be easily compared on graphs provided by the web user interface. The web server provides pairwise comparison of the dynamics of all proteins included in the submitted set using two measures: the Root Mean Squared Inner Product and the Bhattacharyya Coefficient. The Comparative Analysis has been implemented on our web server for NMA, WEBnm@, which also provides recently upgraded functionality for NMA of single protein structures. This includes new visualisations of protein motion, visualisation of inter-residue correlations and the analysis of conformational change using the overlap analysis. In addition, programmatic access to WEBnm@ is now available through a SOAP-based web service. Webnm@ is available at http://apps.cbu.uib.no/webnma . WEBnm@ v2.0 is an online tool offering unique capability for comparative NMA on multiple protein structures. Along with a convenient web interface, powerful computing resources, and several methods for mode analyses, WEBnm@ facilitates the assessment of protein flexibility within protein families and superfamilies. These analyses can give a good view of how the structures move and how the flexibility is conserved over the different structures.

  19. Compare local pocket and global protein structure models by small structure patterns

    KAUST Repository

    Cui, Xuefeng

    2015-09-09

    Researchers proposed several criteria to assess the quality of predicted protein structures because it is one of the essential tasks in the Critical Assessment of Techniques for Protein Structure Prediction (CASP) competitions. Popular criteria include root mean squared deviation (RMSD), MaxSub score, TM-score, GDT-TS and GDT-HA scores. All these criteria require calculation of rigid transformations to superimpose the the predicted protein structure to the native protein structure. Yet, how to obtain the rigid transformations is unknown or with high time complexity, and, hence, heuristic algorithms were proposed. In this work, we carefully design various small structure patterns, including the ones specifically tuned for local pockets. Such structure patterns are biologically meaningful, and address the issue of relying on a sufficient number of backbone residue fragments for existing methods. We sample the rigid transformations from these small structure patterns; and the optimal superpositions yield by these small structures are refined and reported. As a result, among 11; 669 pairs of predicted and native local protein pocket models from the CASP10 dataset, the GDT-TS scores calculated by our method are significantly higher than those calculated by LGA. Moreover, our program is computationally much more efficient. Source codes and executables are publicly available at http://www.cbrc.kaust.edu.sa/prosta/

  20. Comparative Proteomics Identifies Host Immune System Proteins Affected by Infection with Mycobacterium bovis.

    Directory of Open Access Journals (Sweden)

    Vladimir López

    2016-03-01

    Full Text Available Mycobacteria of the Mycobacterium tuberculosis complex (MTBC greatly impact human and animal health worldwide. The mycobacterial life cycle is complex, and the mechanisms resulting in pathogen infection and survival in host cells are not fully understood. Eurasian wild boar (Sus scrofa are natural reservoir hosts for MTBC and a model for mycobacterial infection and tuberculosis (TB. In the wild boar TB model, mycobacterial infection affects the expression of innate and adaptive immune response genes in mandibular lymph nodes and oropharyngeal tonsils, and biomarkers have been proposed as correlates with resistance to natural infection. However, the mechanisms used by mycobacteria to manipulate host immune response are not fully characterized. Our hypothesis is that the immune system proteins under-represented in infected animals, when compared to uninfected controls, are used by mycobacteria to guarantee pathogen infection and transmission. To address this hypothesis, a comparative proteomics approach was used to compare host response between uninfected (TB- and M. bovis-infected young (TB+ and adult animals with different infection status [TB lesions localized in the head (TB+ or affecting multiple organs (TB++]. The results identified host immune system proteins that play an important role in host response to mycobacteria. Calcium binding protein A9, Heme peroxidase, Lactotransferrin, Cathelicidin and Peptidoglycan-recognition protein were under-represented in TB+ animals when compared to uninfected TB- controls, but protein levels were higher as infection progressed in TB++ animals when compared to TB- and/or TB+ adult wild boar. MHCI was the only protein over-represented in TB+ adult wild boar when compared to uninfected TB- controls. The results reported here suggest that M. bovis manipulates host immune response by reducing the production of immune system proteins. However, as infection progresses, wild boar immune response recovers to

  1. Comparative Proteomics Identifies Host Immune System Proteins Affected by Infection with Mycobacterium bovis.

    Science.gov (United States)

    López, Vladimir; Villar, Margarita; Queirós, João; Vicente, Joaquín; Mateos-Hernández, Lourdes; Díez-Delgado, Iratxe; Contreras, Marinela; Alves, Paulo C; Alberdi, Pilar; Gortázar, Christian; de la Fuente, José

    2016-03-01

    Mycobacteria of the Mycobacterium tuberculosis complex (MTBC) greatly impact human and animal health worldwide. The mycobacterial life cycle is complex, and the mechanisms resulting in pathogen infection and survival in host cells are not fully understood. Eurasian wild boar (Sus scrofa) are natural reservoir hosts for MTBC and a model for mycobacterial infection and tuberculosis (TB). In the wild boar TB model, mycobacterial infection affects the expression of innate and adaptive immune response genes in mandibular lymph nodes and oropharyngeal tonsils, and biomarkers have been proposed as correlates with resistance to natural infection. However, the mechanisms used by mycobacteria to manipulate host immune response are not fully characterized. Our hypothesis is that the immune system proteins under-represented in infected animals, when compared to uninfected controls, are used by mycobacteria to guarantee pathogen infection and transmission. To address this hypothesis, a comparative proteomics approach was used to compare host response between uninfected (TB-) and M. bovis-infected young (TB+) and adult animals with different infection status [TB lesions localized in the head (TB+) or affecting multiple organs (TB++)]. The results identified host immune system proteins that play an important role in host response to mycobacteria. Calcium binding protein A9, Heme peroxidase, Lactotransferrin, Cathelicidin and Peptidoglycan-recognition protein were under-represented in TB+ animals when compared to uninfected TB- controls, but protein levels were higher as infection progressed in TB++ animals when compared to TB- and/or TB+ adult wild boar. MHCI was the only protein over-represented in TB+ adult wild boar when compared to uninfected TB- controls. The results reported here suggest that M. bovis manipulates host immune response by reducing the production of immune system proteins. However, as infection progresses, wild boar immune response recovers to limit pathogen

  2. Structural modelling and comparative analysis of homologous, analogous and specific proteins from Trypanosoma cruzi versus Homo sapiens: putative drug targets for chagas' disease treatment.

    Science.gov (United States)

    Capriles, Priscila V S Z; Guimarães, Ana C R; Otto, Thomas D; Miranda, Antonio B; Dardenne, Laurent E; Degrave, Wim M

    2010-10-29

    Trypanosoma cruzi is the etiological agent of Chagas' disease, an endemic infection that causes thousands of deaths every year in Latin America. Therapeutic options remain inefficient, demanding the search for new drugs and/or new molecular targets. Such efforts can focus on proteins that are specific to the parasite, but analogous enzymes and enzymes with a three-dimensional (3D) structure sufficiently different from the corresponding host proteins may represent equally interesting targets. In order to find these targets we used the workflows MHOLline and AnEnΠ obtaining 3D models from homologous, analogous and specific proteins of Trypanosoma cruzi versus Homo sapiens. We applied genome wide comparative modelling techniques to obtain 3D models for 3,286 predicted proteins of T. cruzi. In combination with comparative genome analysis to Homo sapiens, we were able to identify a subset of 397 enzyme sequences, of which 356 are homologous, 3 analogous and 38 specific to the parasite. In this work, we present a set of 397 enzyme models of T. cruzi that can constitute potential structure-based drug targets to be investigated for the development of new strategies to fight Chagas' disease. The strategies presented here support the concept of structural analysis in conjunction with protein functional analysis as an interesting computational methodology to detect potential targets for structure-based rational drug design. For example, 2,4-dienoyl-CoA reductase (EC 1.3.1.34) and triacylglycerol lipase (EC 3.1.1.3), classified as analogous proteins in relation to H. sapiens enzymes, were identified as new potential molecular targets.

  3. New tips for structure prediction by comparative modeling

    OpenAIRE

    Rayan, Anwar

    2009-01-01

    Comparative modelling is utilized to predict the 3-dimensional conformation of a given protein (target) based on its sequence alignment to experimentally determined protein structure (template). The use of such technique is already rewarding and increasingly widespread in biological research and drug development. The accuracy of the predictions as commonly accepted depends on the score of sequence identity of the target protein to the template. To assess the relationship between sequence iden...

  4. Are animal models predictive for human postmortem muscle protein degradation?

    Science.gov (United States)

    Ehrenfellner, Bianca; Zissler, Angela; Steinbacher, Peter; Monticelli, Fabio C; Pittner, Stefan

    2017-11-01

    A most precise determination of the postmortem interval (PMI) is a crucial aspect in forensic casework. Although there are diverse approaches available to date, the high heterogeneity of cases together with the respective postmortal changes often limit the validity and sufficiency of many methods. Recently, a novel approach for time since death estimation by the analysis of postmortal changes of muscle proteins was proposed. It is however necessary to improve the reliability and accuracy, especially by analysis of possible influencing factors on protein degradation. This is ideally investigated on standardized animal models that, however, require legitimization by a comparison of human and animal tissue, and in this specific case of protein degradation profiles. Only if protein degradation events occur in comparable fashion within different species, respective findings can sufficiently be transferred from the animal model to application in humans. Therefor samples from two frequently used animal models (mouse and pig), as well as forensic cases with representative protein profiles of highly differing PMIs were analyzed. Despite physical and physiological differences between species, western blot analysis revealed similar patterns in most of the investigated proteins. Even most degradation events occurred in comparable fashion. In some other aspects, however, human and animal profiles depicted distinct differences. The results of this experimental series clearly indicate the huge importance of comparative studies, whenever animal models are considered. Although animal models could be shown to reflect the basic principles of protein degradation processes in humans, we also gained insight in the difficulties and limitations of the applicability of the developed methodology in different mammalian species regarding protein specificity and methodic functionality.

  5. Comparative proteomics analysis of oral cancer cell lines: identification of cancer associated proteins

    Science.gov (United States)

    2014-01-01

    Background A limiting factor in performing proteomics analysis on cancerous cells is the difficulty in obtaining sufficient amounts of starting material. Cell lines can be used as a simplified model system for studying changes that accompany tumorigenesis. This study used two-dimensional gel electrophoresis (2DE) to compare the whole cell proteome of oral cancer cell lines vs normal cells in an attempt to identify cancer associated proteins. Results Three primary cell cultures of normal cells with a limited lifespan without hTERT immortalization have been successfully established. 2DE was used to compare the whole cell proteome of these cells with that of three oral cancer cell lines. Twenty four protein spots were found to have changed in abundance. MALDI TOF/TOF was then used to determine the identity of these proteins. Identified proteins were classified into seven functional categories – structural proteins, enzymes, regulatory proteins, chaperones and others. IPA core analysis predicted that 18 proteins were related to cancer with involvements in hyperplasia, metastasis, invasion, growth and tumorigenesis. The mRNA expressions of two proteins – 14-3-3 protein sigma and Stress-induced-phosphoprotein 1 – were found to correlate with the corresponding proteins’ abundance. Conclusions The outcome of this analysis demonstrated that a comparative study of whole cell proteome of cancer versus normal cell lines can be used to identify cancer associated proteins. PMID:24422745

  6. Comparative Proteomic Analysis of Two Uveitis Models in Lewis Rats.

    Science.gov (United States)

    Pepple, Kathryn L; Rotkis, Lauren; Wilson, Leslie; Sandt, Angela; Van Gelder, Russell N

    2015-12-01

    Inflammation generates changes in the protein constituents of the aqueous humor. Proteins that change in multiple models of uveitis may be good biomarkers of disease or targets for therapeutic intervention. The present study was conducted to identify differentially-expressed proteins in the inflamed aqueous humor. Two models of uveitis were induced in Lewis rats: experimental autoimmune uveitis (EAU) and primed mycobacterial uveitis (PMU). Differential gel electrophoresis was used to compare naïve and inflamed aqueous humor. Differentially-expressed proteins were separated by using 2-D gel electrophoresis and excised for identification with matrix-assisted laser desorption/ionization-time of flight (MALDI-TOF). Expression of select proteins was verified by Western blot analysis in both the aqueous and vitreous. The inflamed aqueous from both models demonstrated an increase in total protein concentration when compared to naïve aqueous. Calprotectin, a heterodimer of S100A8 and S100A9, was increased in the aqueous in both PMU and EAU. In the vitreous, S100A8 and S100A9 were preferentially elevated in PMU. Apolipoprotein E was elevated in the aqueous of both uveitis models but was preferentially elevated in EAU. Beta-B2-crystallin levels decreased in the aqueous and vitreous of EAU but not PMU. The proinflammatory molecules S100A8 and S100A9 were elevated in both models of uveitis but may play a more significant role in PMU than EAU. The neuroprotective protein β-B2-crystallin was found to decline in EAU. Therapies to modulate these proteins in vivo may be good targets in the treatment of ocular inflammation.

  7. Comparing pharmacophore models derived from crystallography and NMR ensembles

    Science.gov (United States)

    Ghanakota, Phani; Carlson, Heather A.

    2017-11-01

    NMR and X-ray crystallography are the two most widely used methods for determining protein structures. Our previous study examining NMR versus X-Ray sources of protein conformations showed improved performance with NMR structures when used in our Multiple Protein Structures (MPS) method for receptor-based pharmacophores (Damm, Carlson, J Am Chem Soc 129:8225-8235, 2007). However, that work was based on a single test case, HIV-1 protease, because of the rich data available for that system. New data for more systems are available now, which calls for further examination of the effect of different sources of protein conformations. The MPS technique was applied to Growth factor receptor bound protein 2 (Grb2), Src SH2 homology domain (Src-SH2), FK506-binding protein 1A (FKBP12), and Peroxisome proliferator-activated receptor-γ (PPAR-γ). Pharmacophore models from both crystal and NMR ensembles were able to discriminate between high-affinity, low-affinity, and decoy molecules. As we found in our original study, NMR models showed optimal performance when all elements were used. The crystal models had more pharmacophore elements compared to their NMR counterparts. The crystal-based models exhibited optimum performance only when pharmacophore elements were dropped. This supports our assertion that the higher flexibility in NMR ensembles helps focus the models on the most essential interactions with the protein. Our studies suggest that the "extra" pharmacophore elements seen at the periphery in X-ray models arise as a result of decreased protein flexibility and make very little contribution to model performance.

  8. The Protein Model Portal--a comprehensive resource for protein structure and model information.

    Science.gov (United States)

    Haas, Juergen; Roth, Steven; Arnold, Konstantin; Kiefer, Florian; Schmidt, Tobias; Bordoli, Lorenza; Schwede, Torsten

    2013-01-01

    The Protein Model Portal (PMP) has been developed to foster effective use of 3D molecular models in biomedical research by providing convenient and comprehensive access to structural information for proteins. Both experimental structures and theoretical models for a given protein can be searched simultaneously and analyzed for structural variability. By providing a comprehensive view on structural information, PMP offers the opportunity to apply consistent assessment and validation criteria to the complete set of structural models available for proteins. PMP is an open project so that new methods developed by the community can contribute to PMP, for example, new modeling servers for creating homology models and model quality estimation servers for model validation. The accuracy of participating modeling servers is continuously evaluated by the Continuous Automated Model EvaluatiOn (CAMEO) project. The PMP offers a unique interface to visualize structural coverage of a protein combining both theoretical models and experimental structures, allowing straightforward assessment of the model quality and hence their utility. The portal is updated regularly and actively developed to include latest methods in the field of computational structural biology. Database URL: http://www.proteinmodelportal.org.

  9. The Protein Model Portal—a comprehensive resource for protein structure and model information

    Science.gov (United States)

    Haas, Juergen; Roth, Steven; Arnold, Konstantin; Kiefer, Florian; Schmidt, Tobias; Bordoli, Lorenza; Schwede, Torsten

    2013-01-01

    The Protein Model Portal (PMP) has been developed to foster effective use of 3D molecular models in biomedical research by providing convenient and comprehensive access to structural information for proteins. Both experimental structures and theoretical models for a given protein can be searched simultaneously and analyzed for structural variability. By providing a comprehensive view on structural information, PMP offers the opportunity to apply consistent assessment and validation criteria to the complete set of structural models available for proteins. PMP is an open project so that new methods developed by the community can contribute to PMP, for example, new modeling servers for creating homology models and model quality estimation servers for model validation. The accuracy of participating modeling servers is continuously evaluated by the Continuous Automated Model EvaluatiOn (CAMEO) project. The PMP offers a unique interface to visualize structural coverage of a protein combining both theoretical models and experimental structures, allowing straightforward assessment of the model quality and hence their utility. The portal is updated regularly and actively developed to include latest methods in the field of computational structural biology. Database URL: http://www.proteinmodelportal.org PMID:23624946

  10. Interpretation of protein quantitation using the Bradford assay: comparison with two calculation models.

    Science.gov (United States)

    Ku, Hyung-Keun; Lim, Hyuk-Min; Oh, Kyong-Hwa; Yang, Hyo-Jin; Jeong, Ji-Seon; Kim, Sook-Kyung

    2013-03-01

    The Bradford assay is a simple method for protein quantitation, but variation in the results between proteins is a matter of concern. In this study, we compared and normalized quantitative values from two models for protein quantitation, where the residues in the protein that bind to anionic Coomassie Brilliant Blue G-250 comprise either Arg and Lys (Method 1, M1) or Arg, Lys, and His (Method 2, M2). Use of the M2 model yielded much more consistent quantitation values compared with use of the M1 model, which exhibited marked overestimations against protein standards. Copyright © 2012 Elsevier Inc. All rights reserved.

  11. Predictive and comparative analysis of Ebolavirus proteins

    Science.gov (United States)

    Cong, Qian; Pei, Jimin; Grishin, Nick V

    2015-01-01

    Ebolavirus is the pathogen for Ebola Hemorrhagic Fever (EHF). This disease exhibits a high fatality rate and has recently reached a historically epidemic proportion in West Africa. Out of the 5 known Ebolavirus species, only Reston ebolavirus has lost human pathogenicity, while retaining the ability to cause EHF in long-tailed macaque. Significant efforts have been spent to determine the three-dimensional (3D) structures of Ebolavirus proteins, to study their interaction with host proteins, and to identify the functional motifs in these viral proteins. Here, in light of these experimental results, we apply computational analysis to predict the 3D structures and functional sites for Ebolavirus protein domains with unknown structure, including a zinc-finger domain of VP30, the RNA-dependent RNA polymerase catalytic domain and a methyltransferase domain of protein L. In addition, we compare sequences of proteins that interact with Ebolavirus proteins from RESTV-resistant primates with those from RESTV-susceptible monkeys. The host proteins that interact with GP and VP35 show an elevated level of sequence divergence between the RESTV-resistant and RESTV-susceptible species, suggesting that they may be responsible for host specificity. Meanwhile, we detect variable positions in protein sequences that are likely associated with the loss of human pathogenicity in RESTV, map them onto the 3D structures and compare their positions to known functional sites. VP35 and VP30 are significantly enriched in these potential pathogenicity determinants and the clustering of such positions on the surfaces of VP35 and GP suggests possible uncharacterized interaction sites with host proteins that contribute to the virulence of Ebolavirus. PMID:26158395

  12. Predictive and comparative analysis of Ebolavirus proteins.

    Science.gov (United States)

    Cong, Qian; Pei, Jimin; Grishin, Nick V

    2015-01-01

    Ebolavirus is the pathogen for Ebola Hemorrhagic Fever (EHF). This disease exhibits a high fatality rate and has recently reached a historically epidemic proportion in West Africa. Out of the 5 known Ebolavirus species, only Reston ebolavirus has lost human pathogenicity, while retaining the ability to cause EHF in long-tailed macaque. Significant efforts have been spent to determine the three-dimensional (3D) structures of Ebolavirus proteins, to study their interaction with host proteins, and to identify the functional motifs in these viral proteins. Here, in light of these experimental results, we apply computational analysis to predict the 3D structures and functional sites for Ebolavirus protein domains with unknown structure, including a zinc-finger domain of VP30, the RNA-dependent RNA polymerase catalytic domain and a methyltransferase domain of protein L. In addition, we compare sequences of proteins that interact with Ebolavirus proteins from RESTV-resistant primates with those from RESTV-susceptible monkeys. The host proteins that interact with GP and VP35 show an elevated level of sequence divergence between the RESTV-resistant and RESTV-susceptible species, suggesting that they may be responsible for host specificity. Meanwhile, we detect variable positions in protein sequences that are likely associated with the loss of human pathogenicity in RESTV, map them onto the 3D structures and compare their positions to known functional sites. VP35 and VP30 are significantly enriched in these potential pathogenicity determinants and the clustering of such positions on the surfaces of VP35 and GP suggests possible uncharacterized interaction sites with host proteins that contribute to the virulence of Ebolavirus.

  13. The PMDB Protein Model Database

    Science.gov (United States)

    Castrignanò, Tiziana; De Meo, Paolo D'Onorio; Cozzetto, Domenico; Talamo, Ivano Giuseppe; Tramontano, Anna

    2006-01-01

    The Protein Model Database (PMDB) is a public resource aimed at storing manually built 3D models of proteins. The database is designed to provide access to models published in the scientific literature, together with validating experimental data. It is a relational database and it currently contains >74 000 models for ∼240 proteins. The system is accessible at and allows predictors to submit models along with related supporting evidence and users to download them through a simple and intuitive interface. Users can navigate in the database and retrieve models referring to the same target protein or to different regions of the same protein. Each model is assigned a unique identifier that allows interested users to directly access the data. PMID:16381873

  14. Comparative characteristic of the methods of protein antigens epitope mapping

    Directory of Open Access Journals (Sweden)

    O. Yu. Galkin

    2014-08-01

    Full Text Available Comparative analysis of experimental methods of epitope mapping of protein antigens has been carried out. The vast majority of known techniques are involved in immunochemical study of the interaction of protein molecules or peptides with antibodies of corresponding specifici­ty. The most effective and widely applicable metho­dological techniques are those that use synthetic and genetically engineered peptides. Over the past 30 years, these groups of methods have travelled a notable evolutionary path up to the maximum automation and the detection of antigenic determinants of various types (linear and conformational epitopes, and mimotopes. Most of epitope searching algorithms were integrated into a computer program, which greatly facilitates the analysis of experimental data and makes it possible to create spatial models. It is possible to use comparative epitope mapping for solving the applied problems; this less time-consuming method is based on the analysis of competition between different antibodies interactions with the same antigen. The physical method of antigenic structure study is X-ray analysis of antigen-antibody complexes, which may be applied only to crystallizing­ proteins, and nuclear magnetic resonance.

  15. Identify High-Quality Protein Structural Models by Enhanced K-Means.

    Science.gov (United States)

    Wu, Hongjie; Li, Haiou; Jiang, Min; Chen, Cheng; Lv, Qiang; Wu, Chuang

    2017-01-01

    Background. One critical issue in protein three-dimensional structure prediction using either ab initio or comparative modeling involves identification of high-quality protein structural models from generated decoys. Currently, clustering algorithms are widely used to identify near-native models; however, their performance is dependent upon different conformational decoys, and, for some algorithms, the accuracy declines when the decoy population increases. Results. Here, we proposed two enhanced K -means clustering algorithms capable of robustly identifying high-quality protein structural models. The first one employs the clustering algorithm SPICKER to determine the initial centroids for basic K -means clustering ( SK -means), whereas the other employs squared distance to optimize the initial centroids ( K -means++). Our results showed that SK -means and K -means++ were more robust as compared with SPICKER alone, detecting 33 (59%) and 42 (75%) of 56 targets, respectively, with template modeling scores better than or equal to those of SPICKER. Conclusions. We observed that the classic K -means algorithm showed a similar performance to that of SPICKER, which is a widely used algorithm for protein-structure identification. Both SK -means and K -means++ demonstrated substantial improvements relative to results from SPICKER and classical K -means.

  16. Folding 19 proteins to their native state and stability of large proteins from a coarse-grained model.

    Science.gov (United States)

    Kapoor, Abhijeet; Travesset, Alex

    2014-03-01

    We develop an intermediate resolution model, where the backbone is modeled with atomic resolution but the side chain with a single bead, by extending our previous model (Proteins (2013) DOI: 10.1002/prot.24269) to properly include proline, preproline residues and backbone rigidity. Starting from random configurations, the model properly folds 19 proteins (including a mutant 2A3D sequence) into native states containing β sheet, α helix, and mixed α/β. As a further test, the stability of H-RAS (a 169 residue protein, critical in many signaling pathways) is investigated: The protein is stable, with excellent agreement with experimental B-factors. Despite that proteins containing only α helices fold to their native state at lower backbone rigidity, and other limitations, which we discuss thoroughly, the model provides a reliable description of the dynamics as compared with all atom simulations, but does not constrain secondary structures as it is typically the case in more coarse-grained models. Further implications are described. Copyright © 2013 Wiley Periodicals, Inc.

  17. @TOME-2: a new pipeline for comparative modeling of protein–ligand complexes

    Science.gov (United States)

    Pons, Jean-Luc; Labesse, Gilles

    2009-01-01

    @TOME 2.0 is new web pipeline dedicated to protein structure modeling and small ligand docking based on comparative analyses. @TOME 2.0 allows fold recognition, template selection, structural alignment editing, structure comparisons, 3D-model building and evaluation. These tasks are routinely used in sequence analyses for structure prediction. In our pipeline the necessary software is efficiently interconnected in an original manner to accelerate all the processes. Furthermore, we have also connected comparative docking of small ligands that is performed using protein–protein superposition. The input is a simple protein sequence in one-letter code with no comment. The resulting 3D model, protein–ligand complexes and structural alignments can be visualized through dedicated Web interfaces or can be downloaded for further studies. These original features will aid in the functional annotation of proteins and the selection of templates for molecular modeling and virtual screening. Several examples are described to highlight some of the new functionalities provided by this pipeline. The server and its documentation are freely available at http://abcis.cbs.cnrs.fr/AT2/ PMID:19443448

  18. Comparative sequence and structural analyses of G-protein-coupled receptor crystal structures and implications for molecular models.

    Directory of Open Access Journals (Sweden)

    Catherine L Worth

    Full Text Available BACKGROUND: Up until recently the only available experimental (high resolution structure of a G-protein-coupled receptor (GPCR was that of bovine rhodopsin. In the past few years the determination of GPCR structures has accelerated with three new receptors, as well as squid rhodopsin, being successfully crystallized. All share a common molecular architecture of seven transmembrane helices and can therefore serve as templates for building molecular models of homologous GPCRs. However, despite the common general architecture of these structures key differences do exist between them. The choice of which experimental GPCR structure(s to use for building a comparative model of a particular GPCR is unclear and without detailed structural and sequence analyses, could be arbitrary. The aim of this study is therefore to perform a systematic and detailed analysis of sequence-structure relationships of known GPCR structures. METHODOLOGY: We analyzed in detail conserved and unique sequence motifs and structural features in experimentally-determined GPCR structures. Deeper insight into specific and important structural features of GPCRs as well as valuable information for template selection has been gained. Using key features a workflow has been formulated for identifying the most appropriate template(s for building homology models of GPCRs of unknown structure. This workflow was applied to a set of 14 human family A GPCRs suggesting for each the most appropriate template(s for building a comparative molecular model. CONCLUSIONS: The available crystal structures represent only a subset of all possible structural variation in family A GPCRs. Some GPCRs have structural features that are distributed over different crystal structures or which are not present in the templates suggesting that homology models should be built using multiple templates. This study provides a systematic analysis of GPCR crystal structures and a consistent method for identifying

  19. Comparative sequence and structural analyses of G-protein-coupled receptor crystal structures and implications for molecular models.

    Science.gov (United States)

    Worth, Catherine L; Kleinau, Gunnar; Krause, Gerd

    2009-09-16

    Up until recently the only available experimental (high resolution) structure of a G-protein-coupled receptor (GPCR) was that of bovine rhodopsin. In the past few years the determination of GPCR structures has accelerated with three new receptors, as well as squid rhodopsin, being successfully crystallized. All share a common molecular architecture of seven transmembrane helices and can therefore serve as templates for building molecular models of homologous GPCRs. However, despite the common general architecture of these structures key differences do exist between them. The choice of which experimental GPCR structure(s) to use for building a comparative model of a particular GPCR is unclear and without detailed structural and sequence analyses, could be arbitrary. The aim of this study is therefore to perform a systematic and detailed analysis of sequence-structure relationships of known GPCR structures. We analyzed in detail conserved and unique sequence motifs and structural features in experimentally-determined GPCR structures. Deeper insight into specific and important structural features of GPCRs as well as valuable information for template selection has been gained. Using key features a workflow has been formulated for identifying the most appropriate template(s) for building homology models of GPCRs of unknown structure. This workflow was applied to a set of 14 human family A GPCRs suggesting for each the most appropriate template(s) for building a comparative molecular model. The available crystal structures represent only a subset of all possible structural variation in family A GPCRs. Some GPCRs have structural features that are distributed over different crystal structures or which are not present in the templates suggesting that homology models should be built using multiple templates. This study provides a systematic analysis of GPCR crystal structures and a consistent method for identifying suitable templates for GPCR homology modelling that will

  20. Predicting nucleic acid binding interfaces from structural models of proteins.

    Science.gov (United States)

    Dror, Iris; Shazman, Shula; Mukherjee, Srayanta; Zhang, Yang; Glaser, Fabian; Mandel-Gutfreund, Yael

    2012-02-01

    The function of DNA- and RNA-binding proteins can be inferred from the characterization and accurate prediction of their binding interfaces. However, the main pitfall of various structure-based methods for predicting nucleic acid binding function is that they are all limited to a relatively small number of proteins for which high-resolution three-dimensional structures are available. In this study, we developed a pipeline for extracting functional electrostatic patches from surfaces of protein structural models, obtained using the I-TASSER protein structure predictor. The largest positive patches are extracted from the protein surface using the patchfinder algorithm. We show that functional electrostatic patches extracted from an ensemble of structural models highly overlap the patches extracted from high-resolution structures. Furthermore, by testing our pipeline on a set of 55 known nucleic acid binding proteins for which I-TASSER produces high-quality models, we show that the method accurately identifies the nucleic acids binding interface on structural models of proteins. Employing a combined patch approach we show that patches extracted from an ensemble of models better predicts the real nucleic acid binding interfaces compared with patches extracted from independent models. Overall, these results suggest that combining information from a collection of low-resolution structural models could be a valuable approach for functional annotation. We suggest that our method will be further applicable for predicting other functional surfaces of proteins with unknown structure. Copyright © 2011 Wiley Periodicals, Inc.

  1. Mean protein evolutionary distance: a method for comparative protein evolution and its application.

    Directory of Open Access Journals (Sweden)

    Michael J Wise

    Full Text Available Proteins are under tight evolutionary constraints, so if a protein changes it can only do so in ways that do not compromise its function. In addition, the proteins in an organism evolve at different rates. Leveraging the history of patristic distance methods, a new method for analysing comparative protein evolution, called Mean Protein Evolutionary Distance (MeaPED, measures differential resistance to evolutionary pressure across viral proteomes and is thereby able to point to the proteins' roles. Different species' proteomes can also be compared because the results, consistent across virus subtypes, concisely reflect the very different lifestyles of the viruses. The MeaPED method is here applied to influenza A virus, hepatitis C virus, human immunodeficiency virus (HIV, dengue virus, rotavirus A, polyomavirus BK and measles, which span the positive and negative single-stranded, doubled-stranded and reverse transcribing RNA viruses, and double-stranded DNA viruses. From this analysis, host interaction proteins including hemagglutinin (influenza, and viroporins agnoprotein (polyomavirus, p7 (hepatitis C and VPU (HIV emerge as evolutionary hot-spots. By contrast, RNA-directed RNA polymerase proteins including L (measles, PB1/PB2 (influenza and VP1 (rotavirus, and internal serine proteases such as NS3 (dengue and hepatitis C virus emerge as evolutionary cold-spots. The hot spot influenza hemagglutinin protein is contrasted with the related cold spot H protein from measles. It is proposed that evolutionary cold-spot proteins can become significant targets for second-line anti-viral therapeutics, in cases where front-line vaccines are not available or have become ineffective due to mutations in the hot-spot, generally more antigenically exposed proteins. The MeaPED package is available from www.pam1.bcs.uwa.edu.au/~michaelw/ftp/src/meaped.tar.gz.

  2. Mean protein evolutionary distance: a method for comparative protein evolution and its application.

    Science.gov (United States)

    Wise, Michael J

    2013-01-01

    Proteins are under tight evolutionary constraints, so if a protein changes it can only do so in ways that do not compromise its function. In addition, the proteins in an organism evolve at different rates. Leveraging the history of patristic distance methods, a new method for analysing comparative protein evolution, called Mean Protein Evolutionary Distance (MeaPED), measures differential resistance to evolutionary pressure across viral proteomes and is thereby able to point to the proteins' roles. Different species' proteomes can also be compared because the results, consistent across virus subtypes, concisely reflect the very different lifestyles of the viruses. The MeaPED method is here applied to influenza A virus, hepatitis C virus, human immunodeficiency virus (HIV), dengue virus, rotavirus A, polyomavirus BK and measles, which span the positive and negative single-stranded, doubled-stranded and reverse transcribing RNA viruses, and double-stranded DNA viruses. From this analysis, host interaction proteins including hemagglutinin (influenza), and viroporins agnoprotein (polyomavirus), p7 (hepatitis C) and VPU (HIV) emerge as evolutionary hot-spots. By contrast, RNA-directed RNA polymerase proteins including L (measles), PB1/PB2 (influenza) and VP1 (rotavirus), and internal serine proteases such as NS3 (dengue and hepatitis C virus) emerge as evolutionary cold-spots. The hot spot influenza hemagglutinin protein is contrasted with the related cold spot H protein from measles. It is proposed that evolutionary cold-spot proteins can become significant targets for second-line anti-viral therapeutics, in cases where front-line vaccines are not available or have become ineffective due to mutations in the hot-spot, generally more antigenically exposed proteins. The MeaPED package is available from www.pam1.bcs.uwa.edu.au/~michaelw/ftp/src/meaped.tar.gz.

  3. Scoring predictive models using a reduced representation of proteins: model and energy definition.

    Science.gov (United States)

    Fogolari, Federico; Pieri, Lidia; Dovier, Agostino; Bortolussi, Luca; Giugliarelli, Gilberto; Corazza, Alessandra; Esposito, Gennaro; Viglino, Paolo

    2007-03-23

    Reduced representations of proteins have been playing a keyrole in the study of protein folding. Many such models are available, with different representation detail. Although the usefulness of many such models for structural bioinformatics applications has been demonstrated in recent years, there are few intermediate resolution models endowed with an energy model capable, for instance, of detecting native or native-like structures among decoy sets. The aim of the present work is to provide a discrete empirical potential for a reduced protein model termed here PC2CA, because it employs a PseudoCovalent structure with only 2 Centers of interactions per Amino acid, suitable for protein model quality assessment. All protein structures in the set top500H have been converted in reduced form. The distribution of pseudobonds, pseudoangle, pseudodihedrals and distances between centers of interactions have been converted into potentials of mean force. A suitable reference distribution has been defined for non-bonded interactions which takes into account excluded volume effects and protein finite size. The correlation between adjacent main chain pseudodihedrals has been converted in an additional energetic term which is able to account for cooperative effects in secondary structure elements. Local energy surface exploration is performed in order to increase the robustness of the energy function. The model and the energy definition proposed have been tested on all the multiple decoys' sets in the Decoys'R'us database. The energetic model is able to recognize, for almost all sets, native-like structures (RMSD less than 2.0 A). These results and those obtained in the blind CASP7 quality assessment experiment suggest that the model compares well with scoring potentials with finer granularity and could be useful for fast exploration of conformational space. Parameters are available at the url: http://www.dstb.uniud.it/~ffogolari/download/.

  4. N-terminal protein processing: A comparative proteogenomic analysis

    Energy Technology Data Exchange (ETDEWEB)

    Bonissone, Stefano; Gupta, Nitin; Romine, Margaret F.; Bradshaw, Ralph A.; Pevzner, Pavel A.

    2013-01-01

    N-Terminal Methionine Excision (NME) is a universally conserved mechanism with the same specificity across all life forms that removes the first Methionine in proteins when the second residue is Gly, Ala, Ser, Cys, Thr, Pro, or Val. In spite of its necessity for proper cell functioning, the functional role of NME remains unclear. In 1988, Arfin and Bradshaw connected NME with the N-end protein degradation rule and postulated that the role of NME is to expose the stabilizing residues with the goal to resist protein degradation. While this explanation (that treats 7 stabilizing residues in the same manner) has become the de facto dogma of NME, comparative proteogenomics analysis of NME tells a different story. We suggest that the primary role of NME is to expose only two (rather than seven) amino acids Ala and Ser for post-translational modifications (e.g., acetylation) rather than to regulate protein degradation. We argue that, contrary to the existing view, NME is not crucially important for proteins with 5 other stabilizing residue at the 2nd positions that are merely bystanders (their function is not affected by NME) that become exposed to NME because their sizes are comparable or smaller than the size of Ala and Ser.

  5. Metastasis-related plasma membrane proteins of human breast cancer cells identified by comparative quantitative mass spectrometry

    DEFF Research Database (Denmark)

    Leth-Larsen, Rikke; Lund, Rikke; Hansen, Helle V

    2009-01-01

    The spread of cancer cells from a primary tumor to form metastasis at distant sites is a complex multi-step process. The cancer cell proteins, and plasma membrane proteins in particular, involved in this process are poorly defined and a study of the very early events of the metastatic process using...... clinical samples or in vitro assays is not feasible. We have used a unique model system consisting of two isogenic human breast cancer cell lines that are equally tumorigenic in mice, but while one gives rise to metastasis, the other disseminates single cells that remain dormant at distant organs. Membrane...... purification and comparative quantitative LC-MS/MS proteomic analysis identified 13 membrane proteins that were expressed at higher levels and 3 that were under-expressed in the metastatic compared to the non-metastatic cell line from a total of 1919 identified protein entries. Among the proteins were ecto-5...

  6. Completion of autobuilt protein models using a database of protein fragments

    International Nuclear Information System (INIS)

    Cowtan, Kevin

    2012-01-01

    Two developments in the process of automated protein model building in the Buccaneer software are described: the use of a database of protein fragments in improving the model completeness and the assembly of disconnected chain fragments into complete molecules. Two developments in the process of automated protein model building in the Buccaneer software are presented. A general-purpose library for protein fragments of arbitrary size is described, with a highly optimized search method allowing the use of a larger database than in previous work. The problem of assembling an autobuilt model into complete chains is discussed. This involves the assembly of disconnected chain fragments into complete molecules and the use of the database of protein fragments in improving the model completeness. Assembly of fragments into molecules is a standard step in existing model-building software, but the methods have not received detailed discussion in the literature

  7. An Efficient Null Model for Conformational Fluctuations in Proteins

    DEFF Research Database (Denmark)

    Harder, Tim Philipp; Borg, Mikael; Bottaro, Sandro

    2012-01-01

    Protein dynamics play a crucial role in function, catalytic activity, and pathogenesis. Consequently, there is great interest in computational methods that probe the conformational fluctuations of a protein. However, molecular dynamics simulations are computationally costly and therefore are often...... limited to comparatively short timescales. TYPHON is a probabilistic method to explore the conformational space of proteins under the guidance of a sophisticated probabilistic model of local structure and a given set of restraints that represent nonlocal interactions, such as hydrogen bonds or disulfide...... on conformational fluctuations that is in correspondence with experimental measurements. TYPHON provides a flexible, yet computationally efficient, method to explore possible conformational fluctuations in proteins....

  8. A semi-nonparametric mixture model for selecting functionally consistent proteins.

    Science.gov (United States)

    Yu, Lianbo; Doerge, Rw

    2010-09-28

    High-throughput technologies have led to a new era of proteomics. Although protein microarray experiments are becoming more common place there are a variety of experimental and statistical issues that have yet to be addressed, and that will carry over to new high-throughput technologies unless they are investigated. One of the largest of these challenges is the selection of functionally consistent proteins. We present a novel semi-nonparametric mixture model for classifying proteins as consistent or inconsistent while controlling the false discovery rate and the false non-discovery rate. The performance of the proposed approach is compared to current methods via simulation under a variety of experimental conditions. We provide a statistical method for selecting functionally consistent proteins in the context of protein microarray experiments, but the proposed semi-nonparametric mixture model method can certainly be generalized to solve other mixture data problems. The main advantage of this approach is that it provides the posterior probability of consistency for each protein.

  9. DROIDS 1.20: A GUI-Based Pipeline for GPU-Accelerated Comparative Protein Dynamics.

    Science.gov (United States)

    Babbitt, Gregory A; Mortensen, Jamie S; Coppola, Erin E; Adams, Lily E; Liao, Justin K

    2018-03-13

    Traditional informatics in comparative genomics work only with static representations of biomolecules (i.e., sequence and structure), thereby ignoring the molecular dynamics (MD) of proteins that define function in the cell. A comparative approach applied to MD would connect this very short timescale process, defined in femtoseconds, to one of the longest in the universe: molecular evolution measured in millions of years. Here, we leverage advances in graphics-processing-unit-accelerated MD simulation software to develop a comparative method of MD analysis and visualization that can be applied to any two homologous Protein Data Bank structures. Our open-source pipeline, DROIDS (Detecting Relative Outlier Impacts in Dynamic Simulations), works in conjunction with existing molecular modeling software to convert any Linux gaming personal computer into a "comparative computational microscope" for observing the biophysical effects of mutations and other chemical changes in proteins. DROIDS implements structural alignment and Benjamini-Hochberg-corrected Kolmogorov-Smirnov statistics to compare nanosecond-scale atom bond fluctuations on the protein backbone, color mapping the significant differences identified in protein MD with single-amino-acid resolution. DROIDS is simple to use, incorporating graphical user interface control for Amber16 MD simulations, cpptraj analysis, and the final statistical and visual representations in R graphics and UCSF Chimera. We demonstrate that DROIDS can be utilized to visually investigate molecular evolution and disease-related functional changes in MD due to genetic mutation and epigenetic modification. DROIDS can also be used to potentially investigate binding interactions of pharmaceuticals, toxins, or other biomolecules in a functional evolutionary context as well. Copyright © 2018 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  10. Denatured state is critical in determining the properties of model proteins designed on different folds

    DEFF Research Database (Denmark)

    Amatori, Andrea; Ferkinghoff-Borg, Jesper; Tiana, Guido

    2008-01-01

    The thermodynamics of proteins designed on three common folds (SH3, chymotrypsin inhibitor 2 [CI2], and protein G) is studied with a simplified C alpha, model and compared with the thermodynamics of proteins designed on random-generated folds. The model allows to design sequences to fold within a...

  11. Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions

    KAUST Repository

    Najibi, Seyed Morteza; Maadooliat, Mehdi; Zhou, Lan; Huang, Jianhua Z.; Gao, Xin

    2017-01-01

    Recently, the study of protein structures using angular representations has attracted much attention among structural biologists. The main challenge is how to efficiently model the continuous conformational space of the protein structures based on the differences and similarities between different Ramachandran plots. Despite the presence of statistical methods for modeling angular data of proteins, there is still a substantial need for more sophisticated and faster statistical tools to model the large-scale circular datasets. To address this need, we have developed a nonparametric method for collective estimation of multiple bivariate density functions for a collection of populations of protein backbone angles. The proposed method takes into account the circular nature of the angular data using trigonometric spline which is more efficient compared to existing methods. This collective density estimation approach is widely applicable when there is a need to estimate multiple density functions from different populations with common features. Moreover, the coefficients of adaptive basis expansion for the fitted densities provide a low-dimensional representation that is useful for visualization, clustering, and classification of the densities. The proposed method provides a novel and unique perspective to two important and challenging problems in protein structure research: structure-based protein classification and angular-sampling-based protein loop structure prediction.

  12. Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions

    KAUST Repository

    Najibi, Seyed Morteza

    2017-02-08

    Recently, the study of protein structures using angular representations has attracted much attention among structural biologists. The main challenge is how to efficiently model the continuous conformational space of the protein structures based on the differences and similarities between different Ramachandran plots. Despite the presence of statistical methods for modeling angular data of proteins, there is still a substantial need for more sophisticated and faster statistical tools to model the large-scale circular datasets. To address this need, we have developed a nonparametric method for collective estimation of multiple bivariate density functions for a collection of populations of protein backbone angles. The proposed method takes into account the circular nature of the angular data using trigonometric spline which is more efficient compared to existing methods. This collective density estimation approach is widely applicable when there is a need to estimate multiple density functions from different populations with common features. Moreover, the coefficients of adaptive basis expansion for the fitted densities provide a low-dimensional representation that is useful for visualization, clustering, and classification of the densities. The proposed method provides a novel and unique perspective to two important and challenging problems in protein structure research: structure-based protein classification and angular-sampling-based protein loop structure prediction.

  13. In silico modeling of the yeast protein and protein family interaction network

    Science.gov (United States)

    Goh, K.-I.; Kahng, B.; Kim, D.

    2004-03-01

    Understanding of how protein interaction networks of living organisms have evolved or are organized can be the first stepping stone in unveiling how life works on a fundamental ground. Here we introduce an in silico ``coevolutionary'' model for the protein interaction network and the protein family network. The essential ingredient of the model includes the protein family identity and its robustness under evolution, as well as the three previously proposed: gene duplication, divergence, and mutation. This model produces a prototypical feature of complex networks in a wide range of parameter space, following the generalized Pareto distribution in connectivity. Moreover, we investigate other structural properties of our model in detail with some specific values of parameters relevant to the yeast Saccharomyces cerevisiae, showing excellent agreement with the empirical data. Our model indicates that the physical constraints encoded via the domain structure of proteins play a crucial role in protein interactions.

  14. HIV-specific probabilistic models of protein evolution.

    Directory of Open Access Journals (Sweden)

    David C Nickle

    2007-06-01

    Full Text Available Comparative sequence analyses, including such fundamental bioinformatics techniques as similarity searching, sequence alignment and phylogenetic inference, have become a mainstay for researchers studying type 1 Human Immunodeficiency Virus (HIV-1 genome structure and evolution. Implicit in comparative analyses is an underlying model of evolution, and the chosen model can significantly affect the results. In general, evolutionary models describe the probabilities of replacing one amino acid character with another over a period of time. Most widely used evolutionary models for protein sequences have been derived from curated alignments of hundreds of proteins, usually based on mammalian genomes. It is unclear to what extent these empirical models are generalizable to a very different organism, such as HIV-1-the most extensively sequenced organism in existence. We developed a maximum likelihood model fitting procedure to a collection of HIV-1 alignments sampled from different viral genes, and inferred two empirical substitution models, suitable for describing between-and within-host evolution. Our procedure pools the information from multiple sequence alignments, and provided software implementation can be run efficiently in parallel on a computer cluster. We describe how the inferred substitution models can be used to generate scoring matrices suitable for alignment and similarity searches. Our models had a consistently superior fit relative to the best existing models and to parameter-rich data-driven models when benchmarked on independent HIV-1 alignments, demonstrating evolutionary biases in amino-acid substitution that are unique to HIV, and that are not captured by the existing models. The scoring matrices derived from the models showed a marked difference from common amino-acid scoring matrices. The use of an appropriate evolutionary model recovered a known viral transmission history, whereas a poorly chosen model introduced phylogenetic

  15. Quality assessment of protein model-structures based on structural and functional similarities.

    Science.gov (United States)

    Konopka, Bogumil M; Nebel, Jean-Christophe; Kotulska, Malgorzata

    2012-09-21

    one of CASP9, compared to the contest participants. Consequently, GOBA offers a novel single model quality assessment program that addresses the practical needs of biologists. In conjunction with other Model Quality Assessment Programs (MQAPs), it would prove useful for the evaluation of single protein models.

  16. A Mesoscopic Model for Protein-Protein Interactions in Solution

    OpenAIRE

    Lund, Mikael; Jönsson, Bo

    2003-01-01

    Protein self-association may be detrimental in biological systems, but can be utilized in a controlled fashion for protein crystallization. It is hence of considerable interest to understand how factors like solution conditions prevent or promote aggregation. Here we present a computational model describing interactions between protein molecules in solution. The calculations are based on a molecular description capturing the detailed structure of the protein molecule using x-ray or nuclear ma...

  17. Compare local pocket and global protein structure models by small structure patterns

    KAUST Repository

    Cui, Xuefeng; Kuwahara, Hiroyuki; Li, Shuai Cheng; Gao, Xin

    2015-01-01

    Researchers proposed several criteria to assess the quality of predicted protein structures because it is one of the essential tasks in the Critical Assessment of Techniques for Protein Structure Prediction (CASP) competitions. Popular criteria

  18. Validation of protein models by a neural network approach

    Directory of Open Access Journals (Sweden)

    Fantucci Piercarlo

    2008-01-01

    Full Text Available Abstract Background The development and improvement of reliable computational methods designed to evaluate the quality of protein models is relevant in the context of protein structure refinement, which has been recently identified as one of the bottlenecks limiting the quality and usefulness of protein structure prediction. Results In this contribution, we present a computational method (Artificial Intelligence Decoys Evaluator: AIDE which is able to consistently discriminate between correct and incorrect protein models. In particular, the method is based on neural networks that use as input 15 structural parameters, which include energy, solvent accessible surface, hydrophobic contacts and secondary structure content. The results obtained with AIDE on a set of decoy structures were evaluated using statistical indicators such as Pearson correlation coefficients, Znat, fraction enrichment, as well as ROC plots. It turned out that AIDE performances are comparable and often complementary to available state-of-the-art learning-based methods. Conclusion In light of the results obtained with AIDE, as well as its comparison with available learning-based methods, it can be concluded that AIDE can be successfully used to evaluate the quality of protein structures. The use of AIDE in combination with other evaluation tools is expected to further enhance protein refinement efforts.

  19. Prediction of protein-protein interactions between viruses and human by an SVM model

    Directory of Open Access Journals (Sweden)

    Cui Guangyu

    2012-05-01

    Full Text Available Abstract Background Several computational methods have been developed to predict protein-protein interactions from amino acid sequences, but most of those methods are intended for the interactions within a species rather than for interactions across different species. Methods for predicting interactions between homogeneous proteins are not appropriate for finding those between heterogeneous proteins since they do not distinguish the interactions between proteins of the same species from those of different species. Results We developed a new method for representing a protein sequence of variable length in a frequency vector of fixed length, which encodes the relative frequency of three consecutive amino acids of a sequence. We built a support vector machine (SVM model to predict human proteins that interact with virus proteins. In two types of viruses, human papillomaviruses (HPV and hepatitis C virus (HCV, our SVM model achieved an average accuracy above 80%, which is higher than that of another SVM model with a different representation scheme. Using the SVM model and Gene Ontology (GO annotations of proteins, we predicted new interactions between virus proteins and human proteins. Conclusions Encoding the relative frequency of amino acid triplets of a protein sequence is a simple yet powerful representation method for predicting protein-protein interactions across different species. The representation method has several advantages: (1 it enables a prediction model to achieve a better performance than other representations, (2 it generates feature vectors of fixed length regardless of the sequence length, and (3 the same representation is applicable to different types of proteins.

  20. Comparative Molecular Dynamics Simulations of Mitogen-Activated Protein Kinase-Activated Protein Kinase 5

    Directory of Open Access Journals (Sweden)

    Inger Lindin

    2014-03-01

    Full Text Available The mitogen-activated protein kinase-activated protein kinase MK5 is a substrate of the mitogen-activated protein kinases p38, ERK3 and ERK4. Cell culture and animal studies have demonstrated that MK5 is involved in tumour suppression and promotion, embryogenesis, anxiety, cell motility and cell cycle regulation. In the present study, homology models of MK5 were used for molecular dynamics (MD simulations of: (1 MK5 alone; (2 MK5 in complex with an inhibitor; and (3 MK5 in complex with the interaction partner p38α. The calculations showed that the inhibitor occupied the active site and disrupted the intramolecular network of amino acids. However, intramolecular interactions consistent with an inactive protein kinase fold were not formed. MD with p38α showed that not only the p38 docking region, but also amino acids in the activation segment, αH helix, P-loop, regulatory phosphorylation region and the C-terminal of MK5 may be involved in forming a very stable MK5-p38α complex, and that p38α binding decreases the residual fluctuation of the MK5 model. Electrostatic Potential Surface (EPS calculations of MK5 and p38α showed that electrostatic interactions are important for recognition and binding.

  1. Modeling disordered regions in proteins using Rosetta.

    Directory of Open Access Journals (Sweden)

    Ray Yu-Ruei Wang

    Full Text Available Protein structure prediction methods such as Rosetta search for the lowest energy conformation of the polypeptide chain. However, the experimentally observed native state is at a minimum of the free energy, rather than the energy. The neglect of the missing configurational entropy contribution to the free energy can be partially justified by the assumption that the entropies of alternative folded states, while very much less than unfolded states, are not too different from one another, and hence can be to a first approximation neglected when searching for the lowest free energy state. The shortcomings of current structure prediction methods may be due in part to the breakdown of this assumption. Particularly problematic are proteins with significant disordered regions which do not populate single low energy conformations even in the native state. We describe two approaches within the Rosetta structure modeling methodology for treating such regions. The first does not require advance knowledge of the regions likely to be disordered; instead these are identified by minimizing a simple free energy function used previously to model protein folding landscapes and transition states. In this model, residues can be either completely ordered or completely disordered; they are considered disordered if the gain in entropy outweighs the loss of favorable energetic interactions with the rest of the protein chain. The second approach requires identification in advance of the disordered regions either from sequence alone using for example the DISOPRED server or from experimental data such as NMR chemical shifts. During Rosetta structure prediction calculations the disordered regions make only unfavorable repulsive contributions to the total energy. We find that the second approach has greater practical utility and illustrate this with examples from de novo structure prediction, NMR structure calculation, and comparative modeling.

  2. Mechanical strength of 17,134 model proteins and cysteine slipknots.

    Directory of Open Access Journals (Sweden)

    Mateusz Sikora

    2009-10-01

    Full Text Available A new theoretical survey of proteins' resistance to constant speed stretching is performed for a set of 17,134 proteins as described by a structure-based model. The proteins selected have no gaps in their structure determination and consist of no more than 250 amino acids. Our previous studies have dealt with 7510 proteins of no more than 150 amino acids. The proteins are ranked according to the strength of the resistance. Most of the predicted top-strength proteins have not yet been studied experimentally. Architectures and folds which are likely to yield large forces are identified. New types of potent force clamps are discovered. They involve disulphide bridges and, in particular, cysteine slipknots. An effective energy parameter of the model is estimated by comparing the theoretical data on characteristic forces to the corresponding experimental values combined with an extrapolation of the theoretical data to the experimental pulling speeds. These studies provide guidance for future experiments on single molecule manipulation and should lead to selection of proteins for applications. A new class of proteins, involving cysteine slipknots, is identified as one that is expected to lead to the strongest force clamps known. This class is characterized through molecular dynamics simulations.

  3. Comparative proteomic analysis of plasma membrane proteins between human osteosarcoma and normal osteoblastic cell lines

    International Nuclear Information System (INIS)

    Zhang, Zhiyu; Ma, Fang; Cai, Zhengdong; Zhang, Lijun; Hua, Yingqi; Jia, Xiaofang; Li, Jian; Hu, Shuo; Peng, Xia; Yang, Pengyuan; Sun, Mengxiong

    2010-01-01

    Osteosarcoma (OS) is the most common primary malignant tumor of bone in children and adolescents. However, the knowledge in diagnostic modalities has progressed less. To identify new biomarkers for the early diagnosis of OS as well as for potential novel therapeutic candidates, we performed a sub-cellular comparative proteomic research. An osteosarcoma cell line (MG-63) and human osteoblastic cells (hFOB1.19) were used as our comparative model. Plasma membrane (PM) was obtained by aqueous two-phase partition. Proteins were analyzed through iTRAQ-based quantitative differential LC/MS/MS. The location and function of differential proteins were analyzed through GO database. Protein-protein interaction was examined through String software. One of differentially expressed proteins was verified by immunohistochemistry. 342 non-redundant proteins were identified, 68 of which were differentially expressed with 1.5-fold difference, with 25 up-regulated and 43 down-regulated. Among those differential proteins, 69% ware plasma membrane, which are related to the biological processes of binding, cell structure, signal transduction, cell adhesion, etc., and interaction with each other. One protein--CD151 located in net nodes was verified to be over-expressed in osteosarcoma tissue by immunohistochemistry. It is the first time to use plasma membrane proteomics for studying the OS membrane proteins according to our knowledge. We generated preliminary but comprehensive data about membrane protein of osteosarcoma. Among these, CD151 was further validated in patient samples, and this small molecule membrane might be a new target for OS research. The plasma membrane proteins identified in this study may provide new insight into osteosarcoma biology and potential diagnostic and therapeutic biomarkers

  4. A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

    KAUST Repository

    Wong, Ka-Chun; Li, Yue; Peng, Chengbin; Wong, Hau-San

    2015-01-01

    Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k=810). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.

  5. A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

    KAUST Repository

    Wong, Ka-Chun

    2015-06-11

    Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k=810). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.

  6. Comparative nutritional value of Jatropha curcas protein isolate and soy protein isolate in common carp.

    Science.gov (United States)

    Nepal, Sunil; Kumar, Vikas; Makkar, Harinder P S; Stadtlander, Timo; Romano, Nicholas; Becker, Klaus

    2018-02-01

    Jatropha seed cake (JSC) is an excellent source of protein but does contain some antinutritional factors (ANF) that can act as toxins and thus negatively affect the growth and health status of fish. While this can limit the use of JSC, detoxified Jatropha protein isolate (DJPI) may be a better option. An 8-week study was performed to evaluate dietary DJPI to common carp Cyprinus carpio. Five iso-nitrogenous diets (crude protein of 38%) were formulated that consisted of a C ontrol (fish meal (FM) based protein), J 50 or J 75 (50 and 75% of FM protein replaced by DJPI), and S 50 or S 75 (50 and 75% of FM protein replaced by soy protein isolate, SPI) and fed to triplicate groups of 75 carp fingerlings (75; av. wt. ± SD; 11.4 ± 0.25 g). The growth, feeding efficiencies, digestibility, plasma biochemistry, and intestinal enzymes were measured. Results showed that growth performance of fish fed the S 75 - or DJPI-based diets were not significantly different from those fed the C ontrol diet, while carp fed the S 50 had significantly better growth than the J 75 diet. Fish fed the J 75 diet had significantly lower protein and lipid digestibility as well as significantly lower intestinal amylase and protease activities than all other groups. However, all plant protein-based diets led to significantly higher crude protein, crude lipid, and gross energy in the body of common carp compared to the control treatment. Plasma cholesterol and creatinine significantly decreased in the plant protein fed groups, although plasma triglyceride as well as the red blood cells count, hematocrit, albumin, globulin, total plasma protein, and lysozyme activity were higher in plant protein fed groups compared to FM fed group. White blood cells, hemoglobulin concentration, alkaline phosphatase and alanine transaminase activities, and glucose level in blood did not differ significantly among treatments. The results suggest that the DJPI is non-toxic to carp and can be used to replace FM in

  7. Protein folding simulations: from coarse-grained model to all-atom model.

    Science.gov (United States)

    Zhang, Jian; Li, Wenfei; Wang, Jun; Qin, Meng; Wu, Lei; Yan, Zhiqiang; Xu, Weixin; Zuo, Guanghong; Wang, Wei

    2009-06-01

    Protein folding is an important and challenging problem in molecular biology. During the last two decades, molecular dynamics (MD) simulation has proved to be a paramount tool and was widely used to study protein structures, folding kinetics and thermodynamics, and structure-stability-function relationship. It was also used to help engineering and designing new proteins, and to answer even more general questions such as the minimal number of amino acid or the evolution principle of protein families. Nowadays, the MD simulation is still undergoing rapid developments. The first trend is to toward developing new coarse-grained models and studying larger and more complex molecular systems such as protein-protein complex and their assembling process, amyloid related aggregations, and structure and motion of chaperons, motors, channels and virus capsides; the second trend is toward building high resolution models and explore more detailed and accurate pictures of protein folding and the associated processes, such as the coordination bond or disulfide bond involved folding, the polarization, charge transfer and protonate/deprotonate process involved in metal coupled folding, and the ion permeation and its coupling with the kinetics of channels. On these new territories, MD simulations have given many promising results and will continue to offer exciting views. Here, we review several new subjects investigated by using MD simulations as well as the corresponding developments of appropriate protein models. These include but are not limited to the attempt to go beyond the topology based Gō-like model and characterize the energetic factors in protein structures and dynamics, the study of the thermodynamics and kinetics of disulfide bond involved protein folding, the modeling of the interactions between chaperonin and the encapsulated protein and the protein folding under this circumstance, the effort to clarify the important yet still elusive folding mechanism of protein BBL

  8. Comparative architecture of octahedral protein cages. I. Indexed enclosing forms

    Science.gov (United States)

    Janner, A.

    2008-07-01

    The architecture of four protein cages (bacterio ferritin, human mitochondrial ferritin, sulfur oxygenase reductase and small heat-shock protein) are compared top-to-bottom, starting from polyhedra with vertices at cubic lattice points enclosing the cage down to indexed polyhedral forms of single monomers.

  9. DockQ: A Quality Measure for Protein-Protein Docking Models.

    Directory of Open Access Journals (Sweden)

    Sankar Basu

    Full Text Available The state-of-the-art to assess the structural quality of docking models is currently based on three related yet independent quality measures: Fnat, LRMS, and iRMS as proposed and standardized by CAPRI. These quality measures quantify different aspects of the quality of a particular docking model and need to be viewed together to reveal the true quality, e.g. a model with relatively poor LRMS (>10Å might still qualify as 'acceptable' with a descent Fnat (>0.50 and iRMS (<3.0Å. This is also the reason why the so called CAPRI criteria for assessing the quality of docking models is defined by applying various ad-hoc cutoffs on these measures to classify a docking model into the four classes: Incorrect, Acceptable, Medium, or High quality. This classification has been useful in CAPRI, but since models are grouped in only four bins it is also rather limiting, making it difficult to rank models, correlate with scoring functions or use it as target function in machine learning algorithms. Here, we present DockQ, a continuous protein-protein docking model quality measure derived by combining Fnat, LRMS, and iRMS to a single score in the range [0, 1] that can be used to assess the quality of protein docking models. By using DockQ on CAPRI models it is possible to almost completely reproduce the original CAPRI classification into Incorrect, Acceptable, Medium and High quality. An average PPV of 94% at 90% Recall demonstrating that there is no need to apply predefined ad-hoc cutoffs to classify docking models. Since DockQ recapitulates the CAPRI classification almost perfectly, it can be viewed as a higher resolution version of the CAPRI classification, making it possible to estimate model quality in a more quantitative way using Z-scores or sum of top ranked models, which has been so valuable for the CASP community. The possibility to directly correlate a quality measure to a scoring function has been crucial for the development of scoring functions for

  10. DeepQA: improving the estimation of single protein model quality with deep belief networks.

    Science.gov (United States)

    Cao, Renzhi; Bhattacharya, Debswapna; Hou, Jie; Cheng, Jianlin

    2016-12-05

    Protein quality assessment (QA) useful for ranking and selecting protein models has long been viewed as one of the major challenges for protein tertiary structure prediction. Especially, estimating the quality of a single protein model, which is important for selecting a few good models out of a large model pool consisting of mostly low-quality models, is still a largely unsolved problem. We introduce a novel single-model quality assessment method DeepQA based on deep belief network that utilizes a number of selected features describing the quality of a model from different perspectives, such as energy, physio-chemical characteristics, and structural information. The deep belief network is trained on several large datasets consisting of models from the Critical Assessment of Protein Structure Prediction (CASP) experiments, several publicly available datasets, and models generated by our in-house ab initio method. Our experiments demonstrate that deep belief network has better performance compared to Support Vector Machines and Neural Networks on the protein model quality assessment problem, and our method DeepQA achieves the state-of-the-art performance on CASP11 dataset. It also outperformed two well-established methods in selecting good outlier models from a large set of models of mostly low quality generated by ab initio modeling methods. DeepQA is a useful deep learning tool for protein single model quality assessment and protein structure prediction. The source code, executable, document and training/test datasets of DeepQA for Linux is freely available to non-commercial users at http://cactus.rnet.missouri.edu/DeepQA/ .

  11. Using an alignment of fragment strings for comparing protein structures

    DEFF Research Database (Denmark)

    Friedberg, Iddo; Harder, Tim; Kolodny, Rachel

    2007-01-01

    . RESULTS: Here we describe the use of a particular structure fragment library, denoted here as KL-strings, for the 1D representation of protein structure. Using KL-strings, we develop an infrastructure for comparing protein structures with a 1D representation. This study focuses on the added value gained...

  12. Refinement of protein termini in template-based modeling using conformational space annealing.

    Science.gov (United States)

    Park, Hahnbeom; Ko, Junsu; Joo, Keehyoung; Lee, Julian; Seok, Chaok; Lee, Jooyoung

    2011-09-01

    The rapid increase in the number of experimentally determined protein structures in recent years enables us to obtain more reliable protein tertiary structure models than ever by template-based modeling. However, refinement of template-based models beyond the limit available from the best templates is still needed for understanding protein function in atomic detail. In this work, we develop a new method for protein terminus modeling that can be applied to refinement of models with unreliable terminus structures. The energy function for terminus modeling consists of both physics-based and knowledge-based potential terms with carefully optimized relative weights. Effective sampling of both the framework and terminus is performed using the conformational space annealing technique. This method has been tested on a set of termini derived from a nonredundant structure database and two sets of termini from the CASP8 targets. The performance of the terminus modeling method is significantly improved over our previous method that does not employ terminus refinement. It is also comparable or superior to the best server methods tested in CASP8. The success of the current approach suggests that similar strategy may be applied to other types of refinement problems such as loop modeling or secondary structure rearrangement. Copyright © 2011 Wiley-Liss, Inc.

  13. Modelling of proteins in membranes

    DEFF Research Database (Denmark)

    Sperotto, Maria Maddalena; May, S.; Baumgaertner, A.

    2006-01-01

    This review describes some recent theories and simulations of mesoscopic and microscopic models of lipid membranes with embedded or attached proteins. We summarize results supporting our understanding of phenomena for which the activities of proteins in membranes are expected to be significantly ...

  14. Evaluating, Comparing, and Interpreting Protein Domain Hierarchies

    Science.gov (United States)

    2014-01-01

    Abstract Arranging protein domain sequences hierarchically into evolutionarily divergent subgroups is important for investigating evolutionary history, for speeding up web-based similarity searches, for identifying sequence determinants of protein function, and for genome annotation. However, whether or not a particular hierarchy is optimal is often unclear, and independently constructed hierarchies for the same domain can often differ significantly. This article describes methods for statistically evaluating specific aspects of a hierarchy, for probing the criteria underlying its construction and for direct comparisons between hierarchies. Information theoretical notions are used to quantify the contributions of specific hierarchical features to the underlying statistical model. Such features include subhierarchies, sequence subgroups, individual sequences, and subgroup-associated signature patterns. Underlying properties are graphically displayed in plots of each specific feature's contributions, in heat maps of pattern residue conservation, in “contrast alignments,” and through cross-mapping of subgroups between hierarchies. Together, these approaches provide a deeper understanding of protein domain functional divergence, reveal uncertainties caused by inconsistent patterns of sequence conservation, and help resolve conflicts between competing hierarchies. PMID:24559108

  15. Mixing Energy Models in Genetic Algorithms for On-Lattice Protein Structure Prediction

    Directory of Open Access Journals (Sweden)

    Mahmood A. Rashid

    2013-01-01

    Full Text Available Protein structure prediction (PSP is computationally a very challenging problem. The challenge largely comes from the fact that the energy function that needs to be minimised in order to obtain the native structure of a given protein is not clearly known. A high resolution 20×20 energy model could better capture the behaviour of the actual energy function than a low resolution energy model such as hydrophobic polar. However, the fine grained details of the high resolution interaction energy matrix are often not very informative for guiding the search. In contrast, a low resolution energy model could effectively bias the search towards certain promising directions. In this paper, we develop a genetic algorithm that mainly uses a high resolution energy model for protein structure evaluation but uses a low resolution HP energy model in focussing the search towards exploring structures that have hydrophobic cores. We experimentally show that this mixing of energy models leads to significant lower energy structures compared to the state-of-the-art results.

  16. Protein adsorption on nanoparticles: model development using computer simulation

    International Nuclear Information System (INIS)

    Shao, Qing; Hall, Carol K

    2016-01-01

    The adsorption of proteins on nanoparticles results in the formation of the protein corona, the composition of which determines how nanoparticles influence their biological surroundings. We seek to better understand corona formation by developing models that describe protein adsorption on nanoparticles using computer simulation results as data. Using a coarse-grained protein model, discontinuous molecular dynamics simulations are conducted to investigate the adsorption of two small proteins (Trp-cage and WW domain) on a model nanoparticle of diameter 10.0 nm at protein concentrations ranging from 0.5 to 5 mM. The resulting adsorption isotherms are well described by the Langmuir, Freundlich, Temkin and Kiselev models, but not by the Elovich, Fowler–Guggenheim and Hill–de Boer models. We also try to develop a generalized model that can describe protein adsorption equilibrium on nanoparticles of different diameters in terms of dimensionless size parameters. The simulation results for three proteins (Trp-cage, WW domain, and GB3) on four nanoparticles (diameter  =  5.0, 10.0, 15.0, and 20.0 nm) illustrate both the promise and the challenge associated with developing generalized models of protein adsorption on nanoparticles. (paper)

  17. Comparative proteomics of cucurbit phloem indicates both unique and shared sets of proteins.

    Science.gov (United States)

    Lopez-Cobollo, Rosa M; Filippis, Ioannis; Bennett, Mark H; Turnbull, Colin G N

    2016-11-01

    Cucurbits are well-studied models for phloem biology but unusually possess both fascicular phloem (FP) within vascular bundles and additional extrafascicular phloem (EFP). Although the functional differences between the two systems are not yet clear, sugar analysis and limited protein profiling have established that FP and EFP have divergent compositions. Here we report a detailed comparative proteomics study of FP and EFP in two cucurbits, pumpkin and cucumber. We re-examined the sites of exudation by video microscopy, and confirmed that in both species, the spontaneous exudate following tissue cutting derives almost exclusively from EFP. Comparative gel electrophoresis and mass spectrometry-based proteomics of exudates, sieve element contents and microdissected stem tissues established that EFP and FP profiles are highly dissimilar, and that there are also species differences. Searches against cucurbit databases enabled identification of more than 300 FP proteins from each species. Few of the detected proteins (about 10%) were shared between the sieve element contents of FP and EFP, and enriched Gene Ontology categories also differed. To explore quantitative differences in the proteomes, we developed multiple reaction monitoring methods for cucumber proteins that are representative markers for FP or EFP and assessed exudate composition at different times after tissue cutting. Based on failure to detect FP markers in exudate samples, we conclude that FP is blocked very rapidly and therefore makes a minimal contribution to the exudates. Overall, the highly divergent contents of FP and EFP indicate that they are substantially independent vascular compartments. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  18. Overproduction of a Model Sec- and Tat-Dependent Secretory Protein Elicits Different Cellular Responses in Streptomyces lividans.

    Directory of Open Access Journals (Sweden)

    Sonia Gullón

    Full Text Available Streptomyces lividans is considered an efficient host for the secretory production of homologous and heterologous proteins. To identify possible bottlenecks in the protein production process, a comparative transcriptomic approach was adopted to study cellular responses during the overproduction of a Sec-dependent model protein (alpha-amylase and a Tat-dependent model protein (agarase in Streptomyces lividans. The overproduction of the model secretory proteins via the Sec or the Tat route in S. lividans does elicit a different major cell response in the bacterium. The stringent response is a bacterial response to nutrients' depletion, which naturally occurs at late times of the bacterial cell growth. While the induction of the stringent response at the exponential phase of growth may limit overall productivity in the case of the Tat route, the induction of that response does not take place in the case of the Sec route, which comparatively is an advantage in secretory protein production processes. Hence, this study identifies a potential major drawback in the secretory protein production process depending on the secretory route, and provides clues to improving S. lividans as a protein production host.

  19. The Ising model for prediction of disordered residues from protein sequence alone

    International Nuclear Information System (INIS)

    Lobanov, Michail Yu; Galzitskaya, Oxana V

    2011-01-01

    Intrinsically disordered regions serve as molecular recognition elements, which play an important role in the control of many cellular processes and signaling pathways. It is useful to be able to predict positions of disordered residues and disordered regions in protein chains using protein sequence alone. A new method (IsUnstruct) based on the Ising model for prediction of disordered residues from protein sequence alone has been developed. According to this model, each residue can be in one of two states: ordered or disordered. The model is an approximation of the Ising model in which the interaction term between neighbors has been replaced by a penalty for changing between states (the energy of border). The IsUnstruct has been compared with other available methods and found to perform well. The method correctly finds 77% of disordered residues as well as 87% of ordered residues in the CASP8 database, and 72% of disordered residues as well as 85% of ordered residues in the DisProt database

  20. Protein Comparability Assessments and Potential Applicability of High Throughput Biophysical Methods and Data Visualization Tools to Compare Physical Stability Profiles

    Directory of Open Access Journals (Sweden)

    Mohammad A. Alsenaidy

    2014-03-01

    Full Text Available In this review, some of the challenges and opportunities encountered during protein comparability assessments are summarized with an emphasis on developing new analytical approaches to better monitor higher-order protein structures. Several case studies are presented using high throughput biophysical methods to collect protein physical stability data as function of temperature, agitation, ionic strength and/or solution pH. These large data sets were then used to construct empirical phase diagrams (EPDs, radar charts, and comparative signature diagrams (CSDs for data visualization and structural comparisons between the different proteins. Protein samples with different sizes, post-translational modifications, and inherent stability are presented: acidic fibroblast growth factor (FGF-1 mutants, different glycoforms of an IgG1 mAb prepared by deglycosylation, as well as comparisons of different formulations of an IgG1 mAb and granulocyte colony stimulating factor (GCSF. Using this approach, differences in structural integrity and conformational stability profiles were detected under stress conditions that could not be resolved by using the same techniques under ambient conditions (i.e., no stress. Thus, an evaluation of conformational stability differences may serve as an effective surrogate to monitor differences in higher-order structure between protein samples. These case studies are discussed in the context of potential utility in protein comparability studies.

  1. Protein comparability assessments and potential applicability of high throughput biophysical methods and data visualization tools to compare physical stability profiles.

    Science.gov (United States)

    Alsenaidy, Mohammad A; Jain, Nishant K; Kim, Jae H; Middaugh, C Russell; Volkin, David B

    2014-01-01

    In this review, some of the challenges and opportunities encountered during protein comparability assessments are summarized with an emphasis on developing new analytical approaches to better monitor higher-order protein structures. Several case studies are presented using high throughput biophysical methods to collect protein physical stability data as function of temperature, agitation, ionic strength and/or solution pH. These large data sets were then used to construct empirical phase diagrams (EPDs), radar charts, and comparative signature diagrams (CSDs) for data visualization and structural comparisons between the different proteins. Protein samples with different sizes, post-translational modifications, and inherent stability are presented: acidic fibroblast growth factor (FGF-1) mutants, different glycoforms of an IgG1 mAb prepared by deglycosylation, as well as comparisons of different formulations of an IgG1 mAb and granulocyte colony stimulating factor (GCSF). Using this approach, differences in structural integrity and conformational stability profiles were detected under stress conditions that could not be resolved by using the same techniques under ambient conditions (i.e., no stress). Thus, an evaluation of conformational stability differences may serve as an effective surrogate to monitor differences in higher-order structure between protein samples. These case studies are discussed in the context of potential utility in protein comparability studies.

  2. Feature-Based and String-Based Models for Predicting RNA-Protein Interaction

    Directory of Open Access Journals (Sweden)

    Donald Adjeroh

    2018-03-01

    Full Text Available In this work, we study two approaches for the problem of RNA-Protein Interaction (RPI. In the first approach, we use a feature-based technique by combining extracted features from both sequences and secondary structures. The feature-based approach enhanced the prediction accuracy as it included much more available information about the RNA-protein pairs. In the second approach, we apply search algorithms and data structures to extract effective string patterns for prediction of RPI, using both sequence information (protein and RNA sequences, and structure information (protein and RNA secondary structures. This led to different string-based models for predicting interacting RNA-protein pairs. We show results that demonstrate the effectiveness of the proposed approaches, including comparative results against leading state-of-the-art methods.

  3. Quantitative chemogenomics: machine-learning models of protein-ligand interaction.

    Science.gov (United States)

    Andersson, Claes R; Gustafsson, Mats G; Strömbergsson, Helena

    2011-01-01

    Chemogenomics is an emerging interdisciplinary field that lies in the interface of biology, chemistry, and informatics. Most of the currently used drugs are small molecules that interact with proteins. Understanding protein-ligand interaction is therefore central to drug discovery and design. In the subfield of chemogenomics known as proteochemometrics, protein-ligand-interaction models are induced from data matrices that consist of both protein and ligand information along with some experimentally measured variable. The two general aims of this quantitative multi-structure-property-relationship modeling (QMSPR) approach are to exploit sparse/incomplete information sources and to obtain more general models covering larger parts of the protein-ligand space, than traditional approaches that focuses mainly on specific targets or ligands. The data matrices, usually obtained from multiple sparse/incomplete sources, typically contain series of proteins and ligands together with quantitative information about their interactions. A useful model should ideally be easy to interpret and generalize well to new unseen protein-ligand combinations. Resolving this requires sophisticated machine-learning methods for model induction, combined with adequate validation. This review is intended to provide a guide to methods and data sources suitable for this kind of protein-ligand-interaction modeling. An overview of the modeling process is presented including data collection, protein and ligand descriptor computation, data preprocessing, machine-learning-model induction and validation. Concerns and issues specific for each step in this kind of data-driven modeling will be discussed. © 2011 Bentham Science Publishers

  4. Accuracy comparison of several common implicit solvent models and their implementations in the context of protein-ligand binding.

    Science.gov (United States)

    Katkova, E V; Onufriev, A V; Aguilar, B; Sulimov, V B

    2017-03-01

    In this study several commonly used implicit solvent models are compared with respect to their accuracy of estimating solvation energies of small molecules and proteins, as well as desolvation penalty in protein-ligand binding. The test set consists of 19 small proteins, 104 small molecules, and 15 protein-ligand complexes. We compared predicted hydration energies of small molecules with their experimental values; the results of the solvation and desolvation energy calculations for small molecules, proteins and protein-ligand complexes in water were also compared with Thermodynamic Integration calculations based on TIP3P water model and Amber12 force field. The following implicit solvent (water) models considered here are: PCM (Polarized Continuum Model implemented in DISOLV and MCBHSOLV programs), GB (Generalized Born method implemented in DISOLV program, S-GB, and GBNSR6 stand-alone version), COSMO (COnductor-like Screening Model implemented in the DISOLV program and the MOPAC package) and the Poisson-Boltzmann model (implemented in the APBS program). Different parameterizations of the molecules were examined: we compared MMFF94 force field, Amber12 force field and the quantum-chemical semi-empirical PM7 method implemented in the MOPAC package. For small molecules, all of the implicit solvent models tested here yield high correlation coefficients (0.87-0.93) between the calculated solvation energies and the experimental values of hydration energies. For small molecules high correlation (0.82-0.97) with the explicit solvent energies is seen as well. On the other hand, estimated protein solvation energies and protein-ligand binding desolvation energies show substantial discrepancy (up to 10kcal/mol) with the explicit solvent reference. The correlation of polar protein solvation energies and protein-ligand desolvation energies with the corresponding explicit solvent results is 0.65-0.99 and 0.76-0.96 respectively, though this difference in correlations is caused

  5. Physiologically Based Pharmacokinetic Modeling of Therapeutic Proteins.

    Science.gov (United States)

    Wong, Harvey; Chow, Timothy W

    2017-09-01

    Biologics or therapeutic proteins are becoming increasingly important as treatments for disease. The most common class of biologics are monoclonal antibodies (mAbs). Recently, there has been an increase in the use of physiologically based pharmacokinetic (PBPK) modeling in the pharmaceutical industry in drug development. We review PBPK models for therapeutic proteins with an emphasis on mAbs. Due to their size and similarity to endogenous antibodies, there are distinct differences between PBPK models for small molecules and mAbs. The high-level organization of a typical mAb PBPK model consists of a whole-body PBPK model with organ compartments interconnected by both blood and lymph flows. The whole-body PBPK model is coupled with tissue-level submodels used to describe key mechanisms governing mAb disposition including tissue efflux via the lymphatic system, elimination by catabolism, protection from catabolism binding to the neonatal Fc (FcRn) receptor, and nonlinear binding to specific pharmacological targets of interest. The use of PBPK modeling in the development of therapeutic proteins is still in its infancy. Further application of PBPK modeling for therapeutic proteins will help to define its developing role in drug discovery and development. Copyright © 2017 American Pharmacists Association®. Published by Elsevier Inc. All rights reserved.

  6. A hidden markov model derived structural alphabet for proteins.

    Science.gov (United States)

    Camproux, A C; Gautier, R; Tufféry, P

    2004-06-04

    Understanding and predicting protein structures depends on the complexity and the accuracy of the models used to represent them. We have set up a hidden Markov model that discretizes protein backbone conformation as series of overlapping fragments (states) of four residues length. This approach learns simultaneously the geometry of the states and their connections. We obtain, using a statistical criterion, an optimal systematic decomposition of the conformational variability of the protein peptidic chain in 27 states with strong connection logic. This result is stable over different protein sets. Our model fits well the previous knowledge related to protein architecture organisation and seems able to grab some subtle details of protein organisation, such as helix sub-level organisation schemes. Taking into account the dependence between the states results in a description of local protein structure of low complexity. On an average, the model makes use of only 8.3 states among 27 to describe each position of a protein structure. Although we use short fragments, the learning process on entire protein conformations captures the logic of the assembly on a larger scale. Using such a model, the structure of proteins can be reconstructed with an average accuracy close to 1.1A root-mean-square deviation and for a complexity of only 3. Finally, we also observe that sequence specificity increases with the number of states of the structural alphabet. Such models can constitute a very relevant approach to the analysis of protein architecture in particular for protein structure prediction.

  7. A modeling strategy for G-protein coupled receptors

    Directory of Open Access Journals (Sweden)

    Anna Kahler

    2016-03-01

    Full Text Available Cell responses can be triggered via G-protein coupled receptors (GPCRs that interact with small molecules, peptides or proteins and transmit the signal over the membrane via structural changes to activate intracellular pathways. GPCRs are characterized by a rather low sequence similarity and exhibit structural differences even for functionally closely related GPCRs. An accurate structure prediction for GPCRs is therefore not straightforward. We propose a computational approach that relies on the generation of several independent models based on different template structures, which are subsequently refined by molecular dynamics simulations. A comparison of their conformational stability and the agreement with GPCR-typical structural features is then used to select a favorable model. This strategy was applied to predict the structure of the herpesviral chemokine receptor US28 by generating three independent models based on the known structures of the chemokine receptors CXCR1, CXCR4, and CCR5. Model refinement and evaluation suggested that the model based on CCR5 exhibits the most favorable structural properties. In particular, the GPCR-typical structural features, such as a conserved water cluster or conserved non-covalent contacts, are present to a larger extent in the model based on CCR5 compared to the other models. A final model validation based on the recently published US28 crystal structure confirms that the CCR5-based model is the most accurate and exhibits 80.8% correctly modeled residues within the transmembrane helices. The structural agreement between the selected model and the crystal structure suggests that our modeling strategy may also be more generally applicable to other GPCRs of unknown structure.

  8. DockQ: A Quality Measure for Protein-Protein Docking Models

    Science.gov (United States)

    Basu, Sankar

    2016-01-01

    The state-of-the-art to assess the structural quality of docking models is currently based on three related yet independent quality measures: Fnat, LRMS, and iRMS as proposed and standardized by CAPRI. These quality measures quantify different aspects of the quality of a particular docking model and need to be viewed together to reveal the true quality, e.g. a model with relatively poor LRMS (>10Å) might still qualify as 'acceptable' with a descent Fnat (>0.50) and iRMS (iRMS to a single score in the range [0, 1] that can be used to assess the quality of protein docking models. By using DockQ on CAPRI models it is possible to almost completely reproduce the original CAPRI classification into Incorrect, Acceptable, Medium and High quality. An average PPV of 94% at 90% Recall demonstrating that there is no need to apply predefined ad-hoc cutoffs to classify docking models. Since DockQ recapitulates the CAPRI classification almost perfectly, it can be viewed as a higher resolution version of the CAPRI classification, making it possible to estimate model quality in a more quantitative way using Z-scores or sum of top ranked models, which has been so valuable for the CASP community. The possibility to directly correlate a quality measure to a scoring function has been crucial for the development of scoring functions for protein structure prediction, and DockQ should be useful in a similar development in the protein docking field. DockQ is available at http://github.com/bjornwallner/DockQ/ PMID:27560519

  9. Comparative genome analysis of entomopathogenic fungi reveals a complex set of secreted proteins.

    Science.gov (United States)

    Staats, Charley Christian; Junges, Angela; Guedes, Rafael Lucas Muniz; Thompson, Claudia Elizabeth; de Morais, Guilherme Loss; Boldo, Juliano Tomazzoni; de Almeida, Luiz Gonzaga Paula; Andreis, Fábio Carrer; Gerber, Alexandra Lehmkuhl; Sbaraini, Nicolau; da Paixão, Rana Louise de Andrade; Broetto, Leonardo; Landell, Melissa; Santi, Lucélia; Beys-da-Silva, Walter Orlando; Silveira, Carolina Pereira; Serrano, Thaiane Rispoli; de Oliveira, Eder Silva; Kmetzsch, Lívia; Vainstein, Marilene Henning; de Vasconcelos, Ana Tereza Ribeiro; Schrank, Augusto

    2014-09-29

    Metarhizium anisopliae is an entomopathogenic fungus used in the biological control of some agricultural insect pests, and efforts are underway to use this fungus in the control of insect-borne human diseases. A large repertoire of proteins must be secreted by M. anisopliae to cope with the various available nutrients as this fungus switches through different lifestyles, i.e., from a saprophytic, to an infectious, to a plant endophytic stage. To further evaluate the predicted secretome of M. anisopliae, we employed genomic and transcriptomic analyses, coupled with phylogenomic analysis, focusing on the identification and characterization of secreted proteins. We determined the M. anisopliae E6 genome sequence and compared this sequence to other entomopathogenic fungi genomes. A robust pipeline was generated to evaluate the predicted secretomes of M. anisopliae and 15 other filamentous fungi, leading to the identification of a core of secreted proteins. Transcriptomic analysis using the tick Rhipicephalus microplus cuticle as an infection model during two periods of infection (48 and 144 h) allowed the identification of several differentially expressed genes. This analysis concluded that a large proportion of the predicted secretome coding genes contained altered transcript levels in the conditions analyzed in this study. In addition, some specific secreted proteins from Metarhizium have an evolutionary history similar to orthologs found in Beauveria/Cordyceps. This similarity suggests that a set of secreted proteins has evolved to participate in entomopathogenicity. The data presented represents an important step to the characterization of the role of secreted proteins in the virulence and pathogenicity of M. anisopliae.

  10. Comparative Genomics and Disorder Prediction Identify Biologically Relevant SH3 Protein Interactions.

    Directory of Open Access Journals (Sweden)

    2005-08-01

    Full Text Available Protein interaction networks are an important part of the post-genomic effort to integrate a part-list view of the cell into system-level understanding. Using a set of 11 yeast genomes we show that combining comparative genomics and secondary structure information greatly increases consensus-based prediction of SH3 targets. Benchmarking of our method against positive and negative standards gave 83% accuracy with 26% coverage. The concept of an optimal divergence time for effective comparative genomics studies was analyzed, demonstrating that genomes of species that diverged very recently from Saccharomyces cerevisiae(S. mikatae, S. bayanus, and S. paradoxus, or a long time ago (Neurospora crassa and Schizosaccharomyces pombe, contain less information for accurate prediction of SH3 targets than species within the optimal divergence time proposed. We also show here that intrinsically disordered SH3 domain targets are more probable sites of interaction than equivalent sites within ordered regions. Our findings highlight several novel S. cerevisiae SH3 protein interactions, the value of selection of optimal divergence times in comparative genomics studies, and the importance of intrinsic disorder for protein interactions. Based on our results we propose novel roles for the S. cerevisiae proteins Abp1p in endocytosis and Hse1p in endosome protein sorting.

  11. Comparative genomics and disorder prediction identify biologically relevant SH3 protein interactions.

    Directory of Open Access Journals (Sweden)

    Pedro Beltrao

    2005-08-01

    Full Text Available Protein interaction networks are an important part of the post-genomic effort to integrate a part-list view of the cell into system-level understanding. Using a set of 11 yeast genomes we show that combining comparative genomics and secondary structure information greatly increases consensus-based prediction of SH3 targets. Benchmarking of our method against positive and negative standards gave 83% accuracy with 26% coverage. The concept of an optimal divergence time for effective comparative genomics studies was analyzed, demonstrating that genomes of species that diverged very recently from Saccharomyces cerevisiae(S. mikatae, S. bayanus, and S. paradoxus, or a long time ago (Neurospora crassa and Schizosaccharomyces pombe, contain less information for accurate prediction of SH3 targets than species within the optimal divergence time proposed. We also show here that intrinsically disordered SH3 domain targets are more probable sites of interaction than equivalent sites within ordered regions. Our findings highlight several novel S. cerevisiae SH3 protein interactions, the value of selection of optimal divergence times in comparative genomics studies, and the importance of intrinsic disorder for protein interactions. Based on our results we propose novel roles for the S. cerevisiae proteins Abp1p in endocytosis and Hse1p in endosome protein sorting.

  12. Mathematical modeling and comparison of protein size distribution in different plant, animal, fungal and microbial species reveals a negative correlation between protein size and protein number, thus providing insight into the evolution of proteomes

    Directory of Open Access Journals (Sweden)

    Tiessen Axel

    2012-02-01

    Full Text Available Abstract Background The sizes of proteins are relevant to their biochemical structure and for their biological function. The statistical distribution of protein lengths across a diverse set of taxa can provide hints about the evolution of proteomes. Results Using the full genomic sequences of over 1,302 prokaryotic and 140 eukaryotic species two datasets containing 1.2 and 6.1 million proteins were generated and analyzed statistically. The lengthwise distribution of proteins can be roughly described with a gamma type or log-normal model, depending on the species. However the shape parameter of the gamma model has not a fixed value of 2, as previously suggested, but varies between 1.5 and 3 in different species. A gamma model with unrestricted shape parameter described best the distributions in ~48% of the species, whereas the log-normal distribution described better the observed protein sizes in 42% of the species. The gamma restricted function and the sum of exponentials distribution had a better fitting in only ~5% of the species. Eukaryotic proteins have an average size of 472 aa, whereas bacterial (320 aa and archaeal (283 aa proteins are significantly smaller (33-40% on average. Average protein sizes in different phylogenetic groups were: Alveolata (628 aa, Amoebozoa (533 aa, Fornicata (543 aa, Placozoa (453 aa, Eumetazoa (486 aa, Fungi (487 aa, Stramenopila (486 aa, Viridiplantae (392 aa. Amino acid composition is biased according to protein size. Protein length correlated negatively with %C, %M, %K, %F, %R, %W, %Y and positively with %D, %E, %Q, %S and %T. Prokaryotic proteins had a different protein size bias for %E, %G, %K and %M as compared to eukaryotes. Conclusions Mathematical modeling of protein length empirical distributions can be used to asses the quality of small ORFs annotation in genomic releases (detection of too many false positive small ORFs. There is a negative correlation between average protein size and total number of

  13. Emergence of modularity and disassortativity in protein-protein interaction networks.

    Science.gov (United States)

    Wan, Xi; Cai, Shuiming; Zhou, Jin; Liu, Zengrong

    2010-12-01

    In this paper, we present a simple evolution model of protein-protein interaction networks by introducing a rule of small-preference duplication of a node, meaning that the probability of a node chosen to duplicate is inversely proportional to its degree, and subsequent divergence plus nonuniform heterodimerization based on some plausible mechanisms in biology. We show that our model cannot only reproduce scale-free connectivity and small-world pattern, but also exhibit hierarchical modularity and disassortativity. After comparing the features of our model with those of real protein-protein interaction networks, we believe that our model can provide relevant insights into the mechanism underlying the evolution of protein-protein interaction networks. © 2010 American Institute of Physics.

  14. Models of protein-ligand crystal structures: trust, but verify.

    Science.gov (United States)

    Deller, Marc C; Rupp, Bernhard

    2015-09-01

    X-ray crystallography provides the most accurate models of protein-ligand structures. These models serve as the foundation of many computational methods including structure prediction, molecular modelling, and structure-based drug design. The success of these computational methods ultimately depends on the quality of the underlying protein-ligand models. X-ray crystallography offers the unparalleled advantage of a clear mathematical formalism relating the experimental data to the protein-ligand model. In the case of X-ray crystallography, the primary experimental evidence is the electron density of the molecules forming the crystal. The first step in the generation of an accurate and precise crystallographic model is the interpretation of the electron density of the crystal, typically carried out by construction of an atomic model. The atomic model must then be validated for fit to the experimental electron density and also for agreement with prior expectations of stereochemistry. Stringent validation of protein-ligand models has become possible as a result of the mandatory deposition of primary diffraction data, and many computational tools are now available to aid in the validation process. Validation of protein-ligand complexes has revealed some instances of overenthusiastic interpretation of ligand density. Fundamental concepts and metrics of protein-ligand quality validation are discussed and we highlight software tools to assist in this process. It is essential that end users select high quality protein-ligand models for their computational and biological studies, and we provide an overview of how this can be achieved.

  15. Markov dynamic models for long-timescale protein motion.

    KAUST Repository

    Chiang, Tsung-Han

    2010-06-01

    Molecular dynamics (MD) simulation is a well-established method for studying protein motion at the atomic scale. However, it is computationally intensive and generates massive amounts of data. One way of addressing the dual challenges of computation efficiency and data analysis is to construct simplified models of long-timescale protein motion from MD simulation data. In this direction, we propose to use Markov models with hidden states, in which the Markovian states represent potentially overlapping probabilistic distributions over protein conformations. We also propose a principled criterion for evaluating the quality of a model by its ability to predict long-timescale protein motions. Our method was tested on 2D synthetic energy landscapes and two extensively studied peptides, alanine dipeptide and the villin headpiece subdomain (HP-35 NleNle). One interesting finding is that although a widely accepted model of alanine dipeptide contains six states, a simpler model with only three states is equally good for predicting long-timescale motions. We also used the constructed Markov models to estimate important kinetic and dynamic quantities for protein folding, in particular, mean first-passage time. The results are consistent with available experimental measurements.

  16. Markov dynamic models for long-timescale protein motion.

    KAUST Repository

    Chiang, Tsung-Han; Hsu, David; Latombe, Jean-Claude

    2010-01-01

    Molecular dynamics (MD) simulation is a well-established method for studying protein motion at the atomic scale. However, it is computationally intensive and generates massive amounts of data. One way of addressing the dual challenges of computation efficiency and data analysis is to construct simplified models of long-timescale protein motion from MD simulation data. In this direction, we propose to use Markov models with hidden states, in which the Markovian states represent potentially overlapping probabilistic distributions over protein conformations. We also propose a principled criterion for evaluating the quality of a model by its ability to predict long-timescale protein motions. Our method was tested on 2D synthetic energy landscapes and two extensively studied peptides, alanine dipeptide and the villin headpiece subdomain (HP-35 NleNle). One interesting finding is that although a widely accepted model of alanine dipeptide contains six states, a simpler model with only three states is equally good for predicting long-timescale motions. We also used the constructed Markov models to estimate important kinetic and dynamic quantities for protein folding, in particular, mean first-passage time. The results are consistent with available experimental measurements.

  17. Fragment-based modelling of single stranded RNA bound to RNA recognition motif containing proteins

    Science.gov (United States)

    de Beauchene, Isaure Chauvot; de Vries, Sjoerd J.; Zacharias, Martin

    2016-01-01

    Abstract Protein-RNA complexes are important for many biological processes. However, structural modeling of such complexes is hampered by the high flexibility of RNA. Particularly challenging is the docking of single-stranded RNA (ssRNA). We have developed a fragment-based approach to model the structure of ssRNA bound to a protein, based on only the protein structure, the RNA sequence and conserved contacts. The conformational diversity of each RNA fragment is sampled by an exhaustive library of trinucleotides extracted from all known experimental protein–RNA complexes. The method was applied to ssRNA with up to 12 nucleotides which bind to dimers of the RNA recognition motifs (RRMs), a highly abundant eukaryotic RNA-binding domain. The fragment based docking allows a precise de novo atomic modeling of protein-bound ssRNA chains. On a benchmark of seven experimental ssRNA–RRM complexes, near-native models (with a mean heavy-atom deviation of <3 Å from experiment) were generated for six out of seven bound RNA chains, and even more precise models (deviation < 2 Å) were obtained for five out of seven cases, a significant improvement compared to the state of the art. The method is not restricted to RRMs but was also successfully applied to Pumilio RNA binding proteins. PMID:27131381

  18. Evolving Transcription Factor Binding Site Models From Protein Binding Microarray Data

    KAUST Repository

    Wong, Ka-Chun; Peng, Chengbin; Li, Yue

    2016-01-01

    Protein binding microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner. In this paper, we describe the PBM motif model building problem. We apply several evolutionary computation methods and compare their performance with the interior point method, demonstrating their performance advantages. In addition, given the PBM domain knowledge, we propose and describe a novel method called kmerGA which makes domain-specific assumptions to exploit PBM data properties to build more accurate models than the other models built. The effectiveness and robustness of kmerGA is supported by comprehensive performance benchmarking on more than 200 datasets, time complexity analysis, convergence analysis, parameter analysis, and case studies. To demonstrate its utility further, kmerGA is applied to two real world applications: 1) PBM rotation testing and 2) ChIP-Seq peak sequence prediction. The results support the biological relevance of the models learned by kmerGA, and thus its real world applicability.

  19. Evolving Transcription Factor Binding Site Models From Protein Binding Microarray Data

    KAUST Repository

    Wong, Ka-Chun

    2016-02-02

    Protein binding microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner. In this paper, we describe the PBM motif model building problem. We apply several evolutionary computation methods and compare their performance with the interior point method, demonstrating their performance advantages. In addition, given the PBM domain knowledge, we propose and describe a novel method called kmerGA which makes domain-specific assumptions to exploit PBM data properties to build more accurate models than the other models built. The effectiveness and robustness of kmerGA is supported by comprehensive performance benchmarking on more than 200 datasets, time complexity analysis, convergence analysis, parameter analysis, and case studies. To demonstrate its utility further, kmerGA is applied to two real world applications: 1) PBM rotation testing and 2) ChIP-Seq peak sequence prediction. The results support the biological relevance of the models learned by kmerGA, and thus its real world applicability.

  20. Comparative genome analysis reveals a conserved family of actin-like proteins in apicomplexan parasites

    Directory of Open Access Journals (Sweden)

    Sibley L David

    2005-12-01

    Full Text Available Abstract Background The phylum Apicomplexa is an early-branching eukaryotic lineage that contains a number of important human and animal pathogens. Their complex life cycles and unique cytoskeletal features distinguish them from other model eukaryotes. Apicomplexans rely on actin-based motility for cell invasion, yet the regulation of this system remains largely unknown. Consequently, we focused our efforts on identifying actin-related proteins in the recently completed genomes of Toxoplasma gondii, Plasmodium spp., Cryptosporidium spp., and Theileria spp. Results Comparative genomic and phylogenetic studies of apicomplexan genomes reveals that most contain only a single conventional actin and yet they each have 8–10 additional actin-related proteins. Among these are a highly conserved Arp1 protein (likely part of a conserved dynactin complex, and Arp4 and Arp6 homologues (subunits of the chromatin-remodeling machinery. In contrast, apicomplexans lack canonical Arp2 or Arp3 proteins, suggesting they lost the Arp2/3 actin polymerization complex on their evolutionary path towards intracellular parasitism. Seven of these actin-like proteins (ALPs are novel to apicomplexans. They show no phylogenetic associations to the known Arp groups and likely serve functions specific to this important group of intracellular parasites. Conclusion The large diversity of actin-like proteins in apicomplexans suggests that the actin protein family has diverged to fulfill various roles in the unique biology of intracellular parasites. Conserved Arps likely participate in vesicular transport and gene expression, while apicomplexan-specific ALPs may control unique biological traits such as actin-based gliding motility.

  1. Electrostatic interactions in protein adsorption probed by comparing lysozyme and succinylated lysozyme

    NARCIS (Netherlands)

    Veen, van der M.; Norde, W.; Cohen Stuart, M.A.

    2004-01-01

    The influence of electrostatic interactions on protein adsorption was studied by comparing the adsorption of lysozyme and succinylated lysozyme at silica surfaces. The succinylation affects the charge of the protein, but also the stability. Although changes in stability can have an influence on

  2. Comparative analysis of the prion protein gene sequences in African lion.

    Science.gov (United States)

    Wu, Chang-De; Pang, Wan-Yong; Zhao, De-Ming

    2006-10-01

    The prion protein gene of African lion (Panthera Leo) was first cloned and polymorphisms screened. The results suggest that the prion protein gene of eight African lions is highly homogenous. The amino acid sequences of the prion protein (PrP) of all samples tested were identical. Four single nucleotide polymorphisms (C42T, C81A, C420T, T600C) in the prion protein gene (Prnp) of African lion were found, but no amino acid substitutions. Sequence analysis showed that the higher homology is observed to felis catus AF003087 (96.7%) and to sheep number M31313.1 (96.2%) Genbank accessed. With respect to all the mammalian prion protein sequences compared, the African lion prion protein sequence has three amino acid substitutions. The homology might in turn affect the potential intermolecular interactions critical for cross species transmission of prion disease.

  3. Modelling Transcapillary Transport of Fluid and Proteins in Hemodialysis Patients.

    Directory of Open Access Journals (Sweden)

    Mauro Pietribiasi

    Full Text Available The kinetics of protein transport to and from the vascular compartment play a major role in the determination of fluid balance and plasma refilling during hemodialysis (HD sessions. In this study we propose a whole-body mathematical model describing water and protein shifts across the capillary membrane during HD and compare its output to clinical data while evaluating the impact of choosing specific values for selected parameters.The model follows a two-compartment structure (vascular and interstitial space and is based on balance equations of protein mass and water volume in each compartment. The capillary membrane was described according to the three-pore theory. Two transport parameters, the fractional contribution of large pores (αLP and the total hydraulic conductivity (LpS of the capillary membrane, were estimated from patient data. Changes in the intensity and direction of individual fluid and solute flows through each part of the transport system were analyzed in relation to the choice of different values of small pores radius and fractional conductivity, lymphatic sensitivity to hydraulic pressure, and steady-state interstitial-to-plasma protein concentration ratio.The estimated values of LpS and αLP were respectively 10.0 ± 8.4 mL/min/mmHg (mean ± standard deviation and 0.062 ± 0.041. The model was able to predict with good accuracy the profiles of plasma volume and serum total protein concentration in most of the patients (average root-mean-square deviation < 2% of the measured value.The applied model provides a mechanistic interpretation of fluid transport processes induced by ultrafiltration during HD, using a minimum of tuned parameters and assumptions. The simulated values of individual flows through each kind of pore and lymphatic absorption rate yielded by the model may suggest answers to unsolved questions on the relative impact of these not-measurable quantities on total vascular refilling and fluid balance.

  4. Binding free energy analysis of protein-protein docking model structures by evERdock.

    Science.gov (United States)

    Takemura, Kazuhiro; Matubayasi, Nobuyuki; Kitao, Akio

    2018-03-14

    To aid the evaluation of protein-protein complex model structures generated by protein docking prediction (decoys), we previously developed a method to calculate the binding free energies for complexes. The method combines a short (2 ns) all-atom molecular dynamics simulation with explicit solvent and solution theory in the energy representation (ER). We showed that this method successfully selected structures similar to the native complex structure (near-native decoys) as the lowest binding free energy structures. In our current work, we applied this method (evERdock) to 100 or 300 model structures of four protein-protein complexes. The crystal structures and the near-native decoys showed the lowest binding free energy of all the examined structures, indicating that evERdock can successfully evaluate decoys. Several decoys that show low interface root-mean-square distance but relatively high binding free energy were also identified. Analysis of the fraction of native contacts, hydrogen bonds, and salt bridges at the protein-protein interface indicated that these decoys were insufficiently optimized at the interface. After optimizing the interactions around the interface by including interfacial water molecules, the binding free energies of these decoys were improved. We also investigated the effect of solute entropy on binding free energy and found that consideration of the entropy term does not necessarily improve the evaluations of decoys using the normal model analysis for entropy calculation.

  5. Comparative proteomics of milk fat globule membrane proteins from transgenic cloned cattle.

    Directory of Open Access Journals (Sweden)

    Shunchao Sui

    Full Text Available The use of transgenic livestock is providing new methods for obtaining pharmaceutically useful proteins. However, the protein expression profiles of the transgenic animals, including expression of milk fat globule membrane (MFGM proteins, have not been well characterized. In this study, we compared the MFGM protein expression profile of the colostrum and mature milk from three lines of transgenic cloned (TC cattle, i.e., expressing recombinant human α-lactalbumin (TC-LA, lactoferrin (TC-LF or lysozyme (TC-LZ in the mammary gland, with those from cloned non-transgenic (C and conventionally bred normal animals (N. We identified 1, 225 proteins in milk MFGM, 166 of which were specifically expressed only in the TC-LA group, 265 only in the TC-LF group, and 184 only in the TC-LZ group. There were 43 proteins expressed only in the transgenic cloned animals, but the concentrations of these proteins were below the detection limit of silver staining. Functional analysis also showed that the 43 proteins had no obvious influence on the bovine mammary gland. Quantitative comparison revealed that MFGM proteins were up- or down-regulated more than twofold in the TC and C groups compared to N group: 126 in colostrum and 77 in mature milk of the TC-LA group; 157 in colostrum and 222 in mature milk of the TC-LF group; 49 in colostrum and 98 in mature milk of the TC-LZ group; 98 in colostrum and 132 in mature milk in the C group. These up- and down-regulated proteins in the transgenic animals were not associated with a particular biological function or pathway, which appears that expression of certain exogenous proteins has no general deleterious effects on the cattle mammary gland.

  6. Comparative changes in monthly blood urea nitrogen, total protein ...

    African Journals Online (AJOL)

    The objective of this study was to determine the comparative changes in the monthly blood urea nitrogen (BUN) concentration, total protein (TP) concentration in blood serum and the body condition score of Nguni cows and heifers raised on sweetveld. Twenty-four clinically healthy animals in different parities, namely Parity ...

  7. Model for calculation of electrostatic contribution into protein stability

    Science.gov (United States)

    Kundrotas, Petras; Karshikoff, Andrey

    2003-03-01

    Existing models of the denatured state of proteins consider only one possible spatial distribution of protein charges and therefore are applicable to a limited number of cases. In this presentation a more general framework for the modeling of the denatured state is proposed. It is based on the assumption that the titratable groups of an unfolded protein can adopt a quasi-random distribution, restricted by the protein sequence. The model was tested on two proteins, barnase and N-terminal domain of the ribosomal protein L9. The calculated free energy of denaturation, Δ G( pH), reproduces the experimental data essentially better than the commonly used null approximation (NA). It was demonstrated that the seemingly good agreement with experimental data obtained by NA originates from the compensatory effect between the pair-wise electrostatic interactions and the desolvation energy of the individual sites. It was also found that the ionization properties of denatured proteins are influenced by the protein sequence.

  8. Protein loop modeling using a new hybrid energy function and its application to modeling in inaccurate structural environments.

    Directory of Open Access Journals (Sweden)

    Hahnbeom Park

    Full Text Available Protein loop modeling is a tool for predicting protein local structures of particular interest, providing opportunities for applications involving protein structure prediction and de novo protein design. Until recently, the majority of loop modeling methods have been developed and tested by reconstructing loops in frameworks of experimentally resolved structures. In many practical applications, however, the protein loops to be modeled are located in inaccurate structural environments. These include loops in model structures, low-resolution experimental structures, or experimental structures of different functional forms. Accordingly, discrepancies in the accuracy of the structural environment assumed in development of the method and that in practical applications present additional challenges to modern loop modeling methods. This study demonstrates a new strategy for employing a hybrid energy function combining physics-based and knowledge-based components to help tackle this challenge. The hybrid energy function is designed to combine the strengths of each energy component, simultaneously maintaining accurate loop structure prediction in a high-resolution framework structure and tolerating minor environmental errors in low-resolution structures. A loop modeling method based on global optimization of this new energy function is tested on loop targets situated in different levels of environmental errors, ranging from experimental structures to structures perturbed in backbone as well as side chains and template-based model structures. The new method performs comparably to force field-based approaches in loop reconstruction in crystal structures and better in loop prediction in inaccurate framework structures. This result suggests that higher-accuracy predictions would be possible for a broader range of applications. The web server for this method is available at http://galaxy.seoklab.org/loop with the PS2 option for the scoring function.

  9. Benchmarking of protein descriptor sets in proteochemometric modeling (part 2): modeling performance of 13 amino acid descriptor sets

    Science.gov (United States)

    2013-01-01

    Background While a large body of work exists on comparing and benchmarking descriptors of molecular structures, a similar comparison of protein descriptor sets is lacking. Hence, in the current work a total of 13 amino acid descriptor sets have been benchmarked with respect to their ability of establishing bioactivity models. The descriptor sets included in the study are Z-scales (3 variants), VHSE, T-scales, ST-scales, MS-WHIM, FASGAI, BLOSUM, a novel protein descriptor set (termed ProtFP (4 variants)), and in addition we created and benchmarked three pairs of descriptor combinations. Prediction performance was evaluated in seven structure-activity benchmarks which comprise Angiotensin Converting Enzyme (ACE) dipeptidic inhibitor data, and three proteochemometric data sets, namely (1) GPCR ligands modeled against a GPCR panel, (2) enzyme inhibitors (NNRTIs) with associated bioactivities against a set of HIV enzyme mutants, and (3) enzyme inhibitors (PIs) with associated bioactivities on a large set of HIV enzyme mutants. Results The amino acid descriptor sets compared here show similar performance (set differences ( > 0.3 log units RMSE difference and >0.7 difference in MCC). Combining different descriptor sets generally leads to better modeling performance than utilizing individual sets. The best performers were Z-scales (3) combined with ProtFP (Feature), or Z-Scales (3) combined with an average Z-Scale value for each target, while ProtFP (PCA8), ST-Scales, and ProtFP (Feature) rank last. Conclusions While amino acid descriptor sets capture different aspects of amino acids their ability to be used for bioactivity modeling is still – on average – surprisingly similar. Still, combining sets describing complementary information consistently leads to small but consistent improvement in modeling performance (average MCC 0.01 better, average RMSE 0.01 log units lower). Finally, performance differences exist between the targets compared thereby underlining that

  10. Discrete persistent-chain model for protein binding on DNA.

    Science.gov (United States)

    Lam, Pui-Man; Zhen, Yi

    2011-04-01

    We describe and solve a discrete persistent-chain model of protein binding on DNA, involving an extra σ(i) at a site i of the DNA. This variable takes the value 1 or 0, depending on whether or not the site is occupied by a protein. In addition, if the site is occupied by a protein, there is an extra energy cost ɛ. For a small force, we obtain analytic expressions for the force-extension curve and the fraction of bound protein on the DNA. For higher forces, the model can be solved numerically to obtain force-extension curves and the average fraction of bound proteins as a function of applied force. Our model can be used to analyze experimental force-extension curves of protein binding on DNA, and hence deduce the number of bound proteins in the case of nonspecific binding. ©2011 American Physical Society

  11. Modeling Mercury in Proteins

    Energy Technology Data Exchange (ETDEWEB)

    Smith, Jeremy C [ORNL; Parks, Jerry M [ORNL

    2016-01-01

    Mercury (Hg) is a naturally occurring element that is released into the biosphere both by natural processes and anthropogenic activities. Although its reduced, elemental form Hg(0) is relatively non-toxic, other forms such as Hg2+ and, in particular, its methylated form, methylmercury, are toxic, with deleterious effects on both ecosystems and humans. Microorganisms play important roles in the transformation of mercury in the environment. Inorganic Hg2+ can be methylated by certain bacteria and archaea to form methylmercury. Conversely, bacteria also demethylate methylmercury and reduce Hg2+ to relatively inert Hg(0). Transformations and toxicity occur as a result of mercury interacting with various proteins. Clearly, then, understanding the toxic effects of mercury and its cycling in the environment requires characterization of these interactions. Computational approaches are ideally suited to studies of mercury in proteins because they can provide a detailed picture and circumvent issues associated with toxicity. Here we describe computational methods for investigating and characterizing how mercury binds to proteins, how inter- and intra-protein transfer of mercury is orchestrated in biological systems, and how chemical reactions in proteins transform the metal. We describe quantum chemical analyses of aqueous Hg(II), which reveal critical factors that determine ligand binding propensities. We then provide a perspective on how we used chemical reasoning to discover how microorganisms methylate mercury. We also highlight our combined computational and experimental studies of the proteins and enzymes of the mer operon, a suite of genes that confers mercury resistance in many bacteria. Lastly, we place work on mercury in proteins in the context of what is needed for a comprehensive multi-scale model of environmental mercury cycling.

  12. Comparative Study of Lectin Domains in Model Species: New Insights into Evolutionary Dynamics

    Directory of Open Access Journals (Sweden)

    Sofie Van Holle

    2017-05-01

    Full Text Available Lectins are present throughout the plant kingdom and are reported to be involved in diverse biological processes. In this study, we provide a comparative analysis of the lectin families from model species in a phylogenetic framework. The analysis focuses on the different plant lectin domains identified in five representative core angiosperm genomes (Arabidopsis thaliana, Glycine max, Cucumis sativus, Oryza sativa ssp. japonica and Oryza sativa ssp. indica. The genomes were screened for genes encoding lectin domains using a combination of Basic Local Alignment Search Tool (BLAST, hidden Markov models, and InterProScan analysis. Additionally, phylogenetic relationships were investigated by constructing maximum likelihood phylogenetic trees. The results demonstrate that the majority of the lectin families are present in each of the species under study. Domain organization analysis showed that most identified proteins are multi-domain proteins, owing to the modular rearrangement of protein domains during evolution. Most of these multi-domain proteins are widespread, while others display a lineage-specific distribution. Furthermore, the phylogenetic analyses reveal that some lectin families evolved to be similar to the phylogeny of the plant species, while others share a closer evolutionary history based on the corresponding protein domain architecture. Our results yield insights into the evolutionary relationships and functional divergence of plant lectins.

  13. Protein Folding: Search for Basic Physical Models

    Directory of Open Access Journals (Sweden)

    Ivan Y. Torshin

    2003-01-01

    Full Text Available How a unique three-dimensional structure is rapidly formed from the linear sequence of a polypeptide is one of the important questions in contemporary science. Apart from biological context of in vivo protein folding (which has been studied only for a few proteins, the roles of the fundamental physical forces in the in vitro folding remain largely unstudied. Despite a degree of success in using descriptions based on statistical and/or thermodynamic approaches, few of the current models explicitly include more basic physical forces (such as electrostatics and Van Der Waals forces. Moreover, the present-day models rarely take into account that the protein folding is, essentially, a rapid process that produces a highly specific architecture. This review considers several physical models that may provide more direct links between sequence and tertiary structure in terms of the physical forces. In particular, elaboration of such simple models is likely to produce extremely effective computational techniques with value for modern genomics.

  14. Models of crk adaptor proteins in cancer.

    Science.gov (United States)

    Bell, Emily S; Park, Morag

    2012-05-01

    The Crk family of adaptor proteins (CrkI, CrkII, and CrkL), originally discovered as the oncogene fusion product, v-Crk, of the CT10 chicken retrovirus, lacks catalytic activity but engages with multiple signaling pathways through their SH2 and SH3 domains. Crk proteins link upstream tyrosine kinase and integrin-dependent signals to downstream effectors, acting as adaptors in diverse signaling pathways and cellular processes. Crk proteins are now recognized to play a role in the malignancy of many human cancers, stimulating renewed interest in their mechanism of action in cancer progression. The contribution of Crk signaling to malignancy has been predominantly studied in fibroblasts and in hematopoietic models and more recently in epithelial models. A mechanistic understanding of Crk proteins in cancer progression in vivo is still poorly understood in part due to the highly pleiotropic nature of Crk signaling. Recent advances in the structural organization of Crk domains, new roles in kinase regulation, and increased knowledge of the mechanisms and frequency of Crk overexpression in human cancers have provided an incentive for further study in in vivo models. An understanding of the mechanisms through which Crk proteins act as oncogenic drivers could have important implications in therapeutic targeting.

  15. Modelling Protein Dynamics on the Microsecond Time Scale

    DEFF Research Database (Denmark)

    Siuda, Iwona Anna

    Recent years have shown an increase in coarse-grained (CG) molecular dynamics simulations, providing structural and dynamic details of large proteins and enabling studies of self-assembly of biological materials. It is not easy to acquire such data experimentally, and access is also still limited...... in atomistic simulations. During her PhD studies, Iwona Siuda used MARTINI CG models to study the dynamics of different globular and membrane proteins. In several cases, the MARTINI model was sufficient to study conformational changes of small, purely alpha-helical proteins. However, in studies of larger......ELNEDIN was therefore proposed as part of the work. Iwona Siuda’s results from the CG simulations had biological implications that provide insights into possible mechanisms of the periplasmic leucine-binding protein, the sarco(endo)plasmic reticulum calcium pump, and several proteins from the saposin-like proteins...

  16. Immunogenicity of therapeutic proteins: the use of animal models.

    Science.gov (United States)

    Brinks, Vera; Jiskoot, Wim; Schellekens, Huub

    2011-10-01

    Immunogenicity of therapeutic proteins lowers patient well-being and drastically increases therapeutic costs. Preventing immunogenicity is an important issue to consider when developing novel therapeutic proteins and applying them in the clinic. Animal models are increasingly used to study immunogenicity of therapeutic proteins. They are employed as predictive tools to assess different aspects of immunogenicity during drug development and have become vital in studying the mechanisms underlying immunogenicity of therapeutic proteins. However, the use of animal models needs critical evaluation. Because of species differences, predictive value of such models is limited, and mechanistic studies can be restricted. This review addresses the suitability of animal models for immunogenicity prediction and summarizes the insights in immunogenicity that they have given so far.

  17. Protein-protein interaction networks identify targets which rescue the MPP+ cellular model of Parkinson’s disease

    Science.gov (United States)

    Keane, Harriet; Ryan, Brent J.; Jackson, Brendan; Whitmore, Alan; Wade-Martins, Richard

    2015-11-01

    Neurodegenerative diseases are complex multifactorial disorders characterised by the interplay of many dysregulated physiological processes. As an exemplar, Parkinson’s disease (PD) involves multiple perturbed cellular functions, including mitochondrial dysfunction and autophagic dysregulation in preferentially-sensitive dopamine neurons, a selective pathophysiology recapitulated in vitro using the neurotoxin MPP+. Here we explore a network science approach for the selection of therapeutic protein targets in the cellular MPP+ model. We hypothesised that analysis of protein-protein interaction networks modelling MPP+ toxicity could identify proteins critical for mediating MPP+ toxicity. Analysis of protein-protein interaction networks constructed to model the interplay of mitochondrial dysfunction and autophagic dysregulation (key aspects of MPP+ toxicity) enabled us to identify four proteins predicted to be key for MPP+ toxicity (P62, GABARAP, GBRL1 and GBRL2). Combined, but not individual, knockdown of these proteins increased cellular susceptibility to MPP+ toxicity. Conversely, combined, but not individual, over-expression of the network targets provided rescue of MPP+ toxicity associated with the formation of autophagosome-like structures. We also found that modulation of two distinct proteins in the protein-protein interaction network was necessary and sufficient to mitigate neurotoxicity. Together, these findings validate our network science approach to multi-target identification in complex neurological diseases.

  18. In Silico Characterization and Structural Modeling of Dermacentor andersoni p36 Immunosuppressive Protein

    Directory of Open Access Journals (Sweden)

    Martin Omulindi Oyugi

    2018-01-01

    Full Text Available Ticks cause approximately $17–19 billion economic losses to the livestock industry globally. Development of recombinant antitick vaccine is greatly hindered by insufficient knowledge and understanding of proteins expressed by ticks. Ticks secrete immunosuppressant proteins that modulate the host’s immune system during blood feeding; these molecules could be a target for antivector vaccine development. Recombinant p36, a 36 kDa immunosuppressor from the saliva of female Dermacentor andersoni, suppresses T-lymphocytes proliferation in vitro. To identify potential unique structural and dynamic properties responsible for the immunosuppressive function of p36 proteins, this study utilized bioinformatic tool to characterize and model structure of D. andersoni p36 protein. Evaluation of p36 protein family as suitable vaccine antigens predicted a p36 homolog in Rhipicephalus appendiculatus, the tick vector of East Coast fever, with an antigenicity score of 0.7701 that compares well with that of Bm86 (0.7681, the protein antigen that constitute commercial tick vaccine Tickgard™. Ab initio modeling of the D. andersoni p36 protein yielded a 3D structure that predicted conserved antigenic region, which has potential of binding immunomodulating ligands including glycerol and lactose, found located within exposed loop, suggesting a likely role in immunosuppressive function of tick p36 proteins. Laboratory confirmation of these preliminary results is necessary in future studies.

  19. A Machine Learning Approach for Hot-Spot Detection at Protein-Protein Interfaces

    NARCIS (Netherlands)

    Melo, Rita; Fieldhouse, Robert; Melo, André; Correia, João D G; Cordeiro, Maria Natália D S; Gümüş, Zeynep H; Costa, Joaquim; Bonvin, Alexandre M J J; de Sousa Moreira, Irina

    2016-01-01

    Understanding protein-protein interactions is a key challenge in biochemistry. In this work, we describe a more accurate methodology to predict Hot-Spots (HS) in protein-protein interfaces from their native complex structure compared to previous published Machine Learning (ML) techniques. Our model

  20. Comparative sensitivity of 125I-protein A and enzyme-conjugated antibodies for detection of immunoblotted proteins

    International Nuclear Information System (INIS)

    Bernstein, J.M.; Stokes, C.E.; Fernie, B.

    1987-01-01

    Immunoblotting is a powerful technique for the detection of small amounts of immunologically interesting proteins in unpurified preparations. Iodinated protein A (PA) has been widely used as a second antibody for detection of proteins; however, it does not bind equally well to immunoglobulins from different species nor does it bind to all subclasses of immunoglobulin G (IgG). We compared the sensitivity of [ 125 I]PA with those of both horseradish peroxidase-conjugated second antibodies (HRP) and glucose oxidase-anti-glucose oxidase (GAG) soluble complexes for visualizing bovine serum albumin, human IgG, or human C3 which was either dot blotted or electroblotted to nitrocellulose. [ 125 I]PA was uniformly 10- to 100-fold less sensitive than either HRP or GAG. GAG was more sensitive than HRP except for C3 (electroblotting) and bovine serum albumin and IgG (dot blotting), in which they were equivalent. In general, dot blotting was 10- to 1000-fold more sensitive than electroblotting. Although relative sensitivities varied depending on the proteins analyzed and the antisera used, GAG appeared to be superior to [ 125 I]PA and HRP for detection of immunoblotted proteins

  1. Comparative study on the effects of whey protein isolate and isolated soy protein on healthy adults

    International Nuclear Information System (INIS)

    Ezzal-arab, A.

    2003-01-01

    The objective of study was to investigate the effects of whey protein isolate (WPI) and isolated soy protein (ISP) on total serum levels of amino acids (glutathione, methionine and cystine), triglycerides serum cholesterol, HDL-cholesterol, LDL-cholesterol, thyroxine (T 4 ) and estradiol hormone (E 2 ). The results revealed that the WPI group showed significant increased glutathione, methionine and cystine levels while the SPI group exhibited only significant decreased cystine level. The WPI group did not show significant change in T 4 thyroid activity, whereas the SPI group had significant decreased T 4 thyroid hormone. Females ingested the WPI showed significant decrease in estradiol levels compared to those in the SPI group. The data showed significant decreases in total cholesterol, triglycerides and LDL-cholesterol regardless of ingesting whey protein or soy protein while HDL-cholesterol did not show any change with both proteins. The results of this study support the hypothesis that WPI may be more conductive to good health than SPI because it can increase the levels of some amino acids which responsible for the life of the cell, enhance immunity and promote health in general

  2. PCVMZM: Using the Probabilistic Classification Vector Machines Model Combined with a Zernike Moments Descriptor to Predict Protein-Protein Interactions from Protein Sequences.

    Science.gov (United States)

    Wang, Yanbin; You, Zhuhong; Li, Xiao; Chen, Xing; Jiang, Tonghai; Zhang, Jingting

    2017-05-11

    Protein-protein interactions (PPIs) are essential for most living organisms' process. Thus, detecting PPIs is extremely important to understand the molecular mechanisms of biological systems. Although many PPIs data have been generated by high-throughput technologies for a variety of organisms, the whole interatom is still far from complete. In addition, the high-throughput technologies for detecting PPIs has some unavoidable defects, including time consumption, high cost, and high error rate. In recent years, with the development of machine learning, computational methods have been broadly used to predict PPIs, and can achieve good prediction rate. In this paper, we present here PCVMZM, a computational method based on a Probabilistic Classification Vector Machines (PCVM) model and Zernike moments (ZM) descriptor for predicting the PPIs from protein amino acids sequences. Specifically, a Zernike moments (ZM) descriptor is used to extract protein evolutionary information from Position-Specific Scoring Matrix (PSSM) generated by Position-Specific Iterated Basic Local Alignment Search Tool (PSI-BLAST). Then, PCVM classifier is used to infer the interactions among protein. When performed on PPIs datasets of Yeast and H. Pylori , the proposed method can achieve the average prediction accuracy of 94.48% and 91.25%, respectively. In order to further evaluate the performance of the proposed method, the state-of-the-art support vector machines (SVM) classifier is used and compares with the PCVM model. Experimental results on the Yeast dataset show that the performance of PCVM classifier is better than that of SVM classifier. The experimental results indicate that our proposed method is robust, powerful and feasible, which can be used as a helpful tool for proteomics research.

  3. A comparative study of ribosomal proteins: linkage between amino acid distribution and ribosomal assembly

    International Nuclear Information System (INIS)

    Lott, Brittany Burton; Wang, Yongmei; Nakazato, Takuya

    2013-01-01

    Assembly of the ribosome from its protein and RNA constituents must occur quickly and efficiently in order to synthesize the proteins necessary for all cellular activity. Since the early 1960’s, certain characteristics of possible assembly pathways have been elucidated, yet the mechanisms that govern the precise recognition events remain unclear. We utilize a comparative analysis to investigate the amino acid composition of ribosomal proteins (r-proteins) with respect to their role in the assembly process. We compared small subunit (30S) r-protein sequences to those of other housekeeping proteins from 560 bacterial species and searched for correlations between r-protein amino acid content and factors such as assembly binding order, environmental growth temperature, protein size, and contact with ribosomal RNA (rRNA) in the 30S complex. We find r-proteins have a significantly high percent of positive residues, which are highly represented at rRNA contact sites. An inverse correlation between the percent of positive residues and r-protein size was identified and is mainly due to the content of Lysine residues, rather than Arginine. Nearly all r-proteins carry a net positive charge, but no statistical correlation between the net charge and the binding order was detected. Thermophilic (high-temperature) r-proteins contain increased Arginine, Isoleucine, and Tyrosine, and decreased Serine and Threonine compared to mesophilic (lower-temperature), reflecting a known distinction between thermophiles and mesophiles, possibly to account for protein thermostability. However, this difference in amino acid content does not extend to rRNA contact sites, as the proportions of thermophilic and mesophilic contact residues are not significantly different. Given the significantly higher level of positively charged residues in r-proteins and at contact sites, we conclude that ribosome assembly relies heavily on an electrostatic component of interaction. However, the binding order of

  4. In silico sequence analysis and homology modeling of predicted beta-amylase 7-like protein in Brachypodium distachyon L.

    Directory of Open Access Journals (Sweden)

    ERTUĞRUL FILIZ

    2014-04-01

    Full Text Available Beta-amylase (β-amylase, EC 3.2.1.2 is an enzyme that catalyses hydrolysis of glucosidic bonds in polysaccharides. In this study, we analyzed protein sequence of predicted beta-amylase 7-like protein in Brachypodium distachyon. pI (isoelectric point value was found as 5.23 in acidic character, while the instability index (II was found as 50.28 with accepted unstable protein. The prediction of subcellular localization was revealed that the protein may reside in chloroplast by using CELLO v.2.5. The 3D structure of protein was performed using comparative homology modeling with SWISS-MODEL. The accuracy of the predicted 3D structure was checked using Ramachandran plot analysis showed that 95.4% in favored region. The results of our study contribute to understanding of β-amylase protein structure in grass species and will be scientific base for 3D modeling of beta-amylase proteins in further studies.

  5. A study of quality measures for protein threading models

    Directory of Open Access Journals (Sweden)

    Rychlewski Leszek

    2001-08-01

    Full Text Available Abstract Background Prediction of protein structures is one of the fundamental challenges in biology today. To fully understand how well different prediction methods perform, it is necessary to use measures that evaluate their performance. Every two years, starting in 1994, the CASP (Critical Assessment of protein Structure Prediction process has been organized to evaluate the ability of different predictors to blindly predict the structure of proteins. To capture different features of the models, several measures have been developed during the CASP processes. However, these measures have not been examined in detail before. In an attempt to develop fully automatic measures that can be used in CASP, as well as in other type of benchmarking experiments, we have compared twenty-one measures. These measures include the measures used in CASP3 and CASP2 as well as have measures introduced later. We have studied their ability to distinguish between the better and worse models submitted to CASP3 and the correlation between them. Results Using a small set of 1340 models for 23 different targets we show that most methods correlate with each other. Most pairs of measures show a correlation coefficient of about 0.5. The correlation is slightly higher for measures of similar types. We found that a significant problem when developing automatic measures is how to deal with proteins of different length. Also the comparisons between different measures is complicated as many measures are dependent on the size of the target. We show that the manual assessment can be reproduced to about 70% using automatic measures. Alignment independent measures, detects slightly more of the models with the correct fold, while alignment dependent measures agree better when selecting the best models for each target. Finally we show that using automatic measures would, to a large extent, reproduce the assessors ranking of the predictors at CASP3. Conclusions We show that given a

  6. Soy compared with milk protein in a Western diet changes fecal microbiota and decreases hepatic steatosis in obese OLETF rats.

    Science.gov (United States)

    Panasevich, Matthew R; Schuster, Colin M; Phillips, Kathryn E; Meers, Grace M; Chintapalli, Sree V; Wankhade, Umesh D; Shankar, Kartik; Butteiger, Dustie N; Krul, Elaine S; Thyfault, John P; Rector, R Scott

    2017-08-01

    Soy protein is effective at preventing hepatic steatosis; however, the mechanisms are poorly understood. We tested the hypothesis that soy vs. dairy protein-based diet would alter microbiota and attenuate hepatic steatosis in hyperphagic Otsuka Long-Evans Tokushima fatty (OLETF) rats. Male OLETF rats were randomized to "Western" diets containing milk protein isolate (MPI), soy protein isolate (SPI) or 50:50 MPI/SPI (MS) (n=9-10/group; 21% kcal protein) for 16 weeks. SPI attenuated (Pcontent, and hepatic 16:1 n-7 and 18:1 n-7 PUFA concentrations) (Pbacterial 16S rRNA analysis revealed SPI-intake elicited increases (P<.05) in Lactobacillus and decreases (P<.05) in Blautia and Lachnospiraceae suggesting decreases in fecal secondary bile acids in SPI rats. SPI and MS exhibited greater (P<.05) hepatic Fxr, Fgfr4, Hnf4a, HmgCoA reductase and synthase mRNA expression compared with MPI. Overall, dietary SPI compared with MPI decreased hepatic steatosis and diacylglycerols, changed microbiota populations and altered bile acid signaling and cholesterol homeostasis in a rodent model of obesity. Copyright © 2017 Elsevier Inc. All rights reserved.

  7. Fast loop modeling for protein structures

    Science.gov (United States)

    Zhang, Jiong; Nguyen, Son; Shang, Yi; Xu, Dong; Kosztin, Ioan

    2015-03-01

    X-ray crystallography is the main method for determining 3D protein structures. In many cases, however, flexible loop regions of proteins cannot be resolved by this approach. This leads to incomplete structures in the protein data bank, preventing further computational study and analysis of these proteins. For instance, all-atom molecular dynamics (MD) simulation studies of structure-function relationship require complete protein structures. To address this shortcoming, we have developed and implemented an efficient computational method for building missing protein loops. The method is database driven and uses deep learning and multi-dimensional scaling algorithms. We have implemented the method as a simple stand-alone program, which can also be used as a plugin in existing molecular modeling software, e.g., VMD. The quality and stability of the generated structures are assessed and tested via energy scoring functions and by equilibrium MD simulations. The proposed method can also be used in template-based protein structure prediction. Work supported by the National Institutes of Health [R01 GM100701]. Computer time was provided by the University of Missouri Bioinformatics Consortium.

  8. Protein secondary structure prediction for a single-sequence using hidden semi-Markov models

    Directory of Open Access Journals (Sweden)

    Borodovsky Mark

    2006-03-01

    Full Text Available Abstract Background The accuracy of protein secondary structure prediction has been improving steadily towards the 88% estimated theoretical limit. There are two types of prediction algorithms: Single-sequence prediction algorithms imply that information about other (homologous proteins is not available, while algorithms of the second type imply that information about homologous proteins is available, and use it intensively. The single-sequence algorithms could make an important contribution to studies of proteins with no detected homologs, however the accuracy of protein secondary structure prediction from a single-sequence is not as high as when the additional evolutionary information is present. Results In this paper, we further refine and extend the hidden semi-Markov model (HSMM initially considered in the BSPSS algorithm. We introduce an improved residue dependency model by considering the patterns of statistically significant amino acid correlation at structural segment borders. We also derive models that specialize on different sections of the dependency structure and incorporate them into HSMM. In addition, we implement an iterative training method to refine estimates of HSMM parameters. The three-state-per-residue accuracy and other accuracy measures of the new method, IPSSP, are shown to be comparable or better than ones for BSPSS as well as for PSIPRED, tested under the single-sequence condition. Conclusions We have shown that new dependency models and training methods bring further improvements to single-sequence protein secondary structure prediction. The results are obtained under cross-validation conditions using a dataset with no pair of sequences having significant sequence similarity. As new sequences are added to the database it is possible to augment the dependency structure and obtain even higher accuracy. Current and future advances should contribute to the improvement of function prediction for orphan proteins inscrutable

  9. Gastric Emptying and Gastrointestinal Transit Compared among Native and Hydrolyzed Whey and Casein Milk Proteins in an Aged Rat Model.

    Science.gov (United States)

    Dalziel, Julie E; Young, Wayne; McKenzie, Catherine M; Haggarty, Neill W; Roy, Nicole C

    2017-12-13

    Little is known about how milk proteins affect gastrointestinal (GI) transit, particularly for the elderly, in whom digestion has been observed to be slowed. We tested the hypothesis that GI transit is faster for whey than for casein and that this effect is accentuated with hydrolysates, similar to soy. Adult male rats (18 months old) were fed native whey or casein, hydrolyzed whey (WPH) or casein (CPH), hydrolyzed blend (HB; 60% whey:40% casein), or hydrolyzed soy for 14 days then treated with loperamide, prucalopride, or vehicle-control for 7 days. X-ray imaging tracked bead-transit for: gastric emptying (GE; 4 h), small intestine (SI) transit (9 h), and large intestine (LI) transit (12 h). GE for whey was 33 ± 12% faster than that for either casein or CPH. SI transit was decreased by 37 ± 9% for casein and 24 ± 6% for whey compared with hydrolyzed soy, and persisted for casein at 12 h. Although CPH and WPH did not alter transit compared with their respective intact counterparts, fecal output was increased by WPH. Slowed transit by casein was reversed by prucalopride (9-h), but not loperamide. However, rapid GE and slower SI transit for the HB compared with intact forms were inhibited by loperamide. The expected slower GI transit for casein relative to soy provided a comparative benchmark, and opioid receptor involvement was corroborated. Our findings provide new evidence that whey slowed SI transit compared with soy, independent of GE. Increased GI transit from stomach to colon for the HB compared with casein suggests that including hydrolyzed milk proteins in foods may benefit those with slowed intestinal transit.

  10. Modeling protein structures: construction and their applications.

    Science.gov (United States)

    Ring, C S; Cohen, F E

    1993-06-01

    Although no general solution to the protein folding problem exists, the three-dimensional structures of proteins are being successfully predicted when experimentally derived constraints are used in conjunction with heuristic methods. In the case of interleukin-4, mutagenesis data and CD spectroscopy were instrumental in the accurate assignment of secondary structure. In addition, the tertiary structure was highly constrained by six cysteines separated by many residues that formed three disulfide bridges. Although the correct structure was a member of a short list of plausible structures, the "best" structure was the topological enantiomer of the experimentally determined conformation. For many proteases, other experimentally derived structures can be used as templates to identify the secondary structure elements. In a procedure called modeling by homology, the structure of a known protein is used as a scaffold to predict the structure of another related protein. This method has been used to model a serine and a cysteine protease that are important in the schistosome and malarial life cycles, respectively. The model structures were then used to identify putative small molecule enzyme inhibitors computationally. Experiments confirm that some of these nonpeptidic compounds are active at concentrations of less than 10 microM.

  11. A computational model of the LGI1 protein suggests a common binding site for ADAM proteins.

    Directory of Open Access Journals (Sweden)

    Emanuela Leonardi

    Full Text Available Mutations of human leucine-rich glioma inactivated (LGI1 gene encoding the epitempin protein cause autosomal dominant temporal lateral epilepsy (ADTLE, a rare familial partial epileptic syndrome. The LGI1 gene seems to have a role on the transmission of neuronal messages but the exact molecular mechanism remains unclear. In contrast to other genes involved in epileptic disorders, epitempin shows no homology with known ion channel genes but contains two domains, composed of repeated structural units, known to mediate protein-protein interactions.A three dimensional in silico model of the two epitempin domains was built to predict the structure-function relationship and propose a functional model integrating previous experimental findings. Conserved and electrostatic charged regions of the model surface suggest a possible arrangement between the two domains and identifies a possible ADAM protein binding site in the β-propeller domain and another protein binding site in the leucine-rich repeat domain. The functional model indicates that epitempin could mediate the interaction between proteins localized to different synaptic sides in a static way, by forming a dimer, or in a dynamic way, by binding proteins at different times.The model was also used to predict effects of known disease-causing missense mutations. Most of the variants are predicted to alter protein folding while several other map to functional surface regions. In agreement with experimental evidence, this suggests that non-secreted LGI1 mutants could be retained within the cell by quality control mechanisms or by altering interactions required for the secretion process.

  12. Scalable rule-based modelling of allosteric proteins and biochemical networks.

    Directory of Open Access Journals (Sweden)

    Julien F Ollivier

    2010-11-01

    Full Text Available Much of the complexity of biochemical networks comes from the information-processing abilities of allosteric proteins, be they receptors, ion-channels, signalling molecules or transcription factors. An allosteric protein can be uniquely regulated by each combination of input molecules that it binds. This "regulatory complexity" causes a combinatorial increase in the number of parameters required to fit experimental data as the number of protein interactions increases. It therefore challenges the creation, updating, and re-use of biochemical models. Here, we propose a rule-based modelling framework that exploits the intrinsic modularity of protein structure to address regulatory complexity. Rather than treating proteins as "black boxes", we model their hierarchical structure and, as conformational changes, internal dynamics. By modelling the regulation of allosteric proteins through these conformational changes, we often decrease the number of parameters required to fit data, and so reduce over-fitting and improve the predictive power of a model. Our method is thermodynamically grounded, imposes detailed balance, and also includes molecular cross-talk and the background activity of enzymes. We use our Allosteric Network Compiler to examine how allostery can facilitate macromolecular assembly and how competitive ligands can change the observed cooperativity of an allosteric protein. We also develop a parsimonious model of G protein-coupled receptors that explains functional selectivity and can predict the rank order of potency of agonists acting through a receptor. Our methodology should provide a basis for scalable, modular and executable modelling of biochemical networks in systems and synthetic biology.

  13. The AnnoLite and AnnoLyze programs for comparative annotation of protein structures

    Directory of Open Access Journals (Sweden)

    Dopazo Joaquín

    2007-05-01

    Full Text Available Abstract Background Advances in structural biology, including structural genomics, have resulted in a rapid increase in the number of experimentally determined protein structures. However, about half of the structures deposited by the structural genomics consortia have little or no information about their biological function. Therefore, there is a need for tools for automatically and comprehensively annotating the function of protein structures. We aim to provide such tools by applying comparative protein structure annotation that relies on detectable relationships between protein structures to transfer functional annotations. Here we introduce two programs, AnnoLite and AnnoLyze, which use the structural alignments deposited in the DBAli database. Description AnnoLite predicts the SCOP, CATH, EC, InterPro, PfamA, and GO terms with an average sensitivity of ~90% and average precision of ~80%. AnnoLyze predicts ligand binding site and domain interaction patches with an average sensitivity of ~70% and average precision of ~30%, correctly localizing binding sites for small molecules in ~95% of its predictions. Conclusion The AnnoLite and AnnoLyze programs for comparative annotation of protein structures can reliably and automatically annotate new protein structures. The programs are fully accessible via the Internet as part of the DBAli suite of tools at http://salilab.org/DBAli/.

  14. C-terminal motif prediction in eukaryotic proteomes using comparative genomics and statistical over-representation across protein families

    Directory of Open Access Journals (Sweden)

    Cutler Sean R

    2007-06-01

    Full Text Available Abstract Background The carboxy termini of proteins are a frequent site of activity for a variety of biologically important functions, ranging from post-translational modification to protein targeting. Several short peptide motifs involved in protein sorting roles and dependent upon their proximity to the C-terminus for proper function have already been characterized. As a limited number of such motifs have been identified, the potential exists for genome-wide statistical analysis and comparative genomics to reveal novel peptide signatures functioning in a C-terminal dependent manner. We have applied a novel methodology to the prediction of C-terminal-anchored peptide motifs involving a simple z-statistic and several techniques for improving the signal-to-noise ratio. Results We examined the statistical over-representation of position-specific C-terminal tripeptides in 7 eukaryotic proteomes. Sequence randomization models and simple-sequence masking were applied to the successful reduction of background noise. Similarly, as C-terminal homology among members of large protein families may artificially inflate tripeptide counts in an irrelevant and obfuscating manner, gene-family clustering was performed prior to the analysis in order to assess tripeptide over-representation across protein families as opposed to across all proteins. Finally, comparative genomics was used to identify tripeptides significantly occurring in multiple species. This approach has been able to predict, to our knowledge, all C-terminally anchored targeting motifs present in the literature. These include the PTS1 peroxisomal targeting signal (SKL*, the ER-retention signal (K/HDEL*, the ER-retrieval signal for membrane bound proteins (KKxx*, the prenylation signal (CC* and the CaaX box prenylation motif. In addition to a high statistical over-representation of these known motifs, a collection of significant tripeptides with a high propensity for biological function exists

  15. Simulated x-ray scattering of protein solutions using explicit-solvent models

    International Nuclear Information System (INIS)

    Park, Sanghyun; Bardhan, Jaydeep P.; Makowski, Lee; Roux, Benoit

    2009-01-01

    X-ray solution scattering shows new promise for the study of protein structures, complementing crystallography and nuclear magnetic resonance. In order to realize the full potential of solution scattering, it is necessary to not only improve experimental techniques but also develop accurate and efficient computational schemes to relate atomistic models to measurements. Previous computational methods, based on continuum models of water, have been unable to calculate scattering patterns accurately, especially in the wide-angle regime which contains most of the information on the secondary, tertiary, and quaternary structures. Here we present a novel formulation based on the atomistic description of water, in which scattering patterns are calculated from atomic coordinates of protein and water. Without any empirical adjustments, this method produces scattering patterns of unprecedented accuracy in the length scale between 5 and 100 A, as we demonstrate by comparing simulated and observed scattering patterns for myoglobin and lysozyme.

  16. Trade-off between positive and negative design of protein stability: from lattice models to real proteins.

    Directory of Open Access Journals (Sweden)

    Orly Noivirt-Brik

    2009-12-01

    Full Text Available Two different strategies for stabilizing proteins are (i positive design in which the native state is stabilized and (ii negative design in which competing non-native conformations are destabilized. Here, the circumstances under which one strategy might be favored over the other are explored in the case of lattice models of proteins and then generalized and discussed with regard to real proteins. The balance between positive and negative design of proteins is found to be determined by their average "contact-frequency", a property that corresponds to the fraction of states in the conformational ensemble of the sequence in which a pair of residues is in contact. Lattice model proteins with a high average contact-frequency are found to use negative design more than model proteins with a low average contact-frequency. A mathematical derivation of this result indicates that it is general and likely to hold also for real proteins. Comparison of the results of correlated mutation analysis for real proteins with typical contact-frequencies to those of proteins likely to have high contact-frequencies (such as disordered proteins and proteins that are dependent on chaperonins for their folding indicates that the latter tend to have stronger interactions between residues that are not in contact in their native conformation. Hence, our work indicates that negative design is employed when insufficient stabilization is achieved via positive design owing to high contact-frequencies.

  17. Roles of beta-turns in protein folding: from peptide models to protein engineering.

    Science.gov (United States)

    Marcelino, Anna Marie C; Gierasch, Lila M

    2008-05-01

    Reverse turns are a major class of protein secondary structure; they represent sites of chain reversal and thus sites where the globular character of a protein is created. It has been speculated for many years that turns may nucleate the formation of structure in protein folding, as their propensity to occur will favor the approximation of their flanking regions and their general tendency to be hydrophilic will favor their disposition at the solvent-accessible surface. Reverse turns are local features, and it is therefore not surprising that their structural properties have been extensively studied using peptide models. In this article, we review research on peptide models of turns to test the hypothesis that the propensities of turns to form in short peptides will relate to the roles of corresponding sequences in protein folding. Turns with significant stability as isolated entities should actively promote the folding of a protein, and by contrast, turn sequences that merely allow the chain to adopt conformations required for chain reversal are predicted to be passive in the folding mechanism. We discuss results of protein engineering studies of the roles of turn residues in folding mechanisms. Factors that correlate with the importance of turns in folding indeed include their intrinsic stability, as well as their topological context and their participation in hydrophobic networks within the protein's structure.

  18. An Integrated Framework Advancing Membrane Protein Modeling and Design.

    Directory of Open Access Journals (Sweden)

    Rebecca F Alford

    2015-09-01

    Full Text Available Membrane proteins are critical functional molecules in the human body, constituting more than 30% of open reading frames in the human genome. Unfortunately, a myriad of difficulties in overexpression and reconstitution into membrane mimetics severely limit our ability to determine their structures. Computational tools are therefore instrumental to membrane protein structure prediction, consequently increasing our understanding of membrane protein function and their role in disease. Here, we describe a general framework facilitating membrane protein modeling and design that combines the scientific principles for membrane protein modeling with the flexible software architecture of Rosetta3. This new framework, called RosettaMP, provides a general membrane representation that interfaces with scoring, conformational sampling, and mutation routines that can be easily combined to create new protocols. To demonstrate the capabilities of this implementation, we developed four proof-of-concept applications for (1 prediction of free energy changes upon mutation; (2 high-resolution structural refinement; (3 protein-protein docking; and (4 assembly of symmetric protein complexes, all in the membrane environment. Preliminary data show that these algorithms can produce meaningful scores and structures. The data also suggest needed improvements to both sampling routines and score functions. Importantly, the applications collectively demonstrate the potential of combining the flexible nature of RosettaMP with the power of Rosetta algorithms to facilitate membrane protein modeling and design.

  19. Protein (multi-)location prediction: utilizing interdependencies via a generative model.

    Science.gov (United States)

    Simha, Ramanuja; Briesemeister, Sebastian; Kohlbacher, Oliver; Shatkay, Hagit

    2015-06-15

    Proteins are responsible for a multitude of vital tasks in all living organisms. Given that a protein's function and role are strongly related to its subcellular location, protein location prediction is an important research area. While proteins move from one location to another and can localize to multiple locations, most existing location prediction systems assign only a single location per protein. A few recent systems attempt to predict multiple locations for proteins, however, their performance leaves much room for improvement. Moreover, such systems do not capture dependencies among locations and usually consider locations as independent. We hypothesize that a multi-location predictor that captures location inter-dependencies can improve location predictions for proteins. We introduce a probabilistic generative model for protein localization, and develop a system based on it-which we call MDLoc-that utilizes inter-dependencies among locations to predict multiple locations for proteins. The model captures location inter-dependencies using Bayesian networks and represents dependency between features and locations using a mixture model. We use iterative processes for learning model parameters and for estimating protein locations. We evaluate our classifier MDLoc, on a dataset of single- and multi-localized proteins derived from the DBMLoc dataset, which is the most comprehensive protein multi-localization dataset currently available. Our results, obtained by using MDLoc, significantly improve upon results obtained by an initial simpler classifier, as well as on results reported by other top systems. MDLoc is available at: http://www.eecis.udel.edu/∼compbio/mdloc. © The Author 2015. Published by Oxford University Press.

  20. Mathematical model of the binding of allosteric effectors to the Escherichia coli PII signal transduction protein GlnB.

    Science.gov (United States)

    da Rocha, Ricardo Alves; Weschenfelder, Thiago André; de Castilhos, Fernanda; de Souza, Emanuel Maltempi; Huergo, Luciano Fernandes; Mitchell, David Alexander

    2013-04-16

    PII proteins are important regulators of nitrogen metabolism in a wide variety of organisms: the binding of the allosteric effectors ATP, ADP, and 2-oxoglutarate (2-OG) to PII proteins affects their ability to interact with target proteins. We modeled the simultaneous binding of ATP, ADP, and 2-OG to one PII protein, namely GlnB of Escherichia coli, using a modeling approach that allows the prediction of the proportions of individual binding states. Four models with different binding rules were compared. We selected one of these models (that assumes that the binding of the first nucleotide to GlnB makes it harder for subsequent nucleotides to bind) and used it to explore how physiological concentrations of ATP, ADP, and 2-OG would affect the proportions of those states of GlnB that interact with the target proteins ATase and NtrB. Our simulations indicate that GlnB can, as suggested by previous researchers, act as a sensor of both 2-OG and the ATP:ADP ratio. We conclude that our modeling approach will be an important tool in future studies concerning the PII binding states and their interactions with target proteins.

  1. Comparative proteome analysis between C . briggsae embryos and larvae reveals a role of chromatin modification proteins in embryonic cell division

    KAUST Repository

    An, Xiaomeng

    2017-06-21

    Caenorhabditis briggsae has emerged as a model for comparative biology against model organism C. elegans. Most of its cell fate specifications are completed during embryogenesis whereas its cell growth is achieved mainly in larval stages. The molecular mechanism underlying the drastic developmental changes is poorly understood. To gain insights into the molecular changes between the two stages, we compared the proteomes between the two stages using iTRAQ. We identified a total of 2,791 proteins in the C. briggsae embryos and larvae, 247 of which undergo up- or down-regulation between the two stages. The proteins that are upregulated in the larval stages are enriched in the Gene Ontology categories of energy production, protein translation, and cytoskeleton; whereas those upregulated in the embryonic stage are enriched in the categories of chromatin dynamics and posttranslational modification, suggesting a more active chromatin modification in the embryos than in the larva. Perturbation of a subset of chromatin modifiers followed by cell lineage analysis suggests their roles in controlling cell division pace. Taken together, we demonstrate a general molecular switch from chromatin modification to metabolism during the transition from C. briggsae embryonic to its larval stages using iTRAQ approach. The switch might be conserved across metazoans.

  2. Mechanical Modeling and Computer Simulation of Protein Folding

    Science.gov (United States)

    Prigozhin, Maxim B.; Scott, Gregory E.; Denos, Sharlene

    2014-01-01

    In this activity, science education and modern technology are bridged to teach students at the high school and undergraduate levels about protein folding and to strengthen their model building skills. Students are guided from a textbook picture of a protein as a rigid crystal structure to a more realistic view: proteins are highly dynamic…

  3. Change detection in the dynamics of an intracellular protein synthesis model using nonlinear Kalman filtering.

    Science.gov (United States)

    Rigatos, Gerasimos G; Rigatou, Efthymia G; Djida, Jean Daniel

    2015-10-01

    A method for early diagnosis of parametric changes in intracellular protein synthesis models (e.g. the p53 protein - mdm2 inhibitor model) is developed with the use of a nonlinear Kalman Filtering approach (Derivative-free nonlinear Kalman Filter) and of statistical change detection methods. The intracellular protein synthesis dynamic model is described by a set of coupled nonlinear differential equations. It is shown that such a dynamical system satisfies differential flatness properties and this allows to transform it, through a change of variables (diffeomorphism), to the so-called linear canonical form. For the linearized equivalent of the dynamical system, state estimation can be performed using the Kalman Filter recursion. Moreover, by applying an inverse transformation based on the previous diffeomorphism it becomes also possible to obtain estimates of the state variables of the initial nonlinear model. By comparing the output of the Kalman Filter (which is assumed to correspond to the undistorted dynamical model) with measurements obtained from the monitored protein synthesis system, a sequence of differences (residuals) is obtained. The statistical processing of the residuals with the use of x2 change detection tests, can provide indication within specific confidence intervals about parametric changes in the considered biological system and consequently indications about the appearance of specific diseases (e.g. malignancies).

  4. Modeling curvature-dependent subcellular localization of a small sporulation protein in Bacillus subtilis

    Science.gov (United States)

    Wasnik, Vaibhav; Wingreen, Ned; Mukhopadhyay, Ranjan

    2012-02-01

    Recent experiments suggest that in the bacterium, B. subtilis, the cue for the localization of small sporulation protein, SpoVM, that plays a central role in spore coat formation, is curvature of the bacterial plasma membrane. This curvature-dependent localization is puzzling given the orders of magnitude difference in lengthscale of an individual protein and radius of curvature of the membrane. Here we develop a minimal model to study the relationship between curvature-dependent membrane absorption of SpoVM and clustering of membrane-associated SpoVM and compare our results with experiments.

  5. Molecular modeling of protein materials: case study of elastin

    International Nuclear Information System (INIS)

    Tarakanova, Anna; Buehler, Markus J

    2013-01-01

    Molecular modeling of protein materials is a quickly growing area of research that has produced numerous contributions in fields ranging from structural engineering to medicine and biology. We review here the history and methods commonly employed in molecular modeling of protein materials, emphasizing the advantages for using modeling as a complement to experimental work. We then consider a case study of the protein elastin, a critically important ‘mechanical protein’ to exemplify the approach in an area where molecular modeling has made a significant impact. We outline the progression of computational modeling studies that have considerably enhanced our understanding of this important protein which endows elasticity and recoil to the tissues it is found in, including the skin, lungs, arteries and the heart. A vast collection of literature has been directed at studying the structure and function of this protein for over half a century, the first molecular dynamics study of elastin being reported in the 1980s. We review the pivotal computational works that have considerably enhanced our fundamental understanding of elastin's atomistic structure and its extraordinary qualities—focusing on two in particular: elastin's superb elasticity and the inverse temperature transition—the remarkable ability of elastin to take on a more structured conformation at higher temperatures, suggesting its effectiveness as a biomolecular switch. Our hope is to showcase these methods as both complementary and enriching to experimental approaches that have thus far dominated the study of most protein-based materials. (topical review)

  6. Monte Carlo simulation of a statistical mechanical model of multiple protein sequence alignment.

    Science.gov (United States)

    Kinjo, Akira R

    2017-01-01

    A grand canonical Monte Carlo (MC) algorithm is presented for studying the lattice gas model (LGM) of multiple protein sequence alignment, which coherently combines long-range interactions and variable-length insertions. MC simulations are used for both parameter optimization of the model and production runs to explore the sequence subspace around a given protein family. In this Note, I describe the details of the MC algorithm as well as some preliminary results of MC simulations with various temperatures and chemical potentials, and compare them with the mean-field approximation. The existence of a two-state transition in the sequence space is suggested for the SH3 domain family, and inappropriateness of the mean-field approximation for the LGM is demonstrated.

  7. Protein (multi-)location prediction: utilizing interdependencies via a generative model

    Science.gov (United States)

    Shatkay, Hagit

    2015-01-01

    Motivation: Proteins are responsible for a multitude of vital tasks in all living organisms. Given that a protein’s function and role are strongly related to its subcellular location, protein location prediction is an important research area. While proteins move from one location to another and can localize to multiple locations, most existing location prediction systems assign only a single location per protein. A few recent systems attempt to predict multiple locations for proteins, however, their performance leaves much room for improvement. Moreover, such systems do not capture dependencies among locations and usually consider locations as independent. We hypothesize that a multi-location predictor that captures location inter-dependencies can improve location predictions for proteins. Results: We introduce a probabilistic generative model for protein localization, and develop a system based on it—which we call MDLoc—that utilizes inter-dependencies among locations to predict multiple locations for proteins. The model captures location inter-dependencies using Bayesian networks and represents dependency between features and locations using a mixture model. We use iterative processes for learning model parameters and for estimating protein locations. We evaluate our classifier MDLoc, on a dataset of single- and multi-localized proteins derived from the DBMLoc dataset, which is the most comprehensive protein multi-localization dataset currently available. Our results, obtained by using MDLoc, significantly improve upon results obtained by an initial simpler classifier, as well as on results reported by other top systems. Availability and implementation: MDLoc is available at: http://www.eecis.udel.edu/∼compbio/mdloc. Contact: shatkay@udel.edu. PMID:26072505

  8. In silico modelling and validation of differential expressed proteins in lung cancer

    Directory of Open Access Journals (Sweden)

    Bhagavathi S

    2012-05-01

    Full Text Available Objective: The present study aims predict the three dimensional structure of three major proteins responsible for causing Lung cancer. Methods: These are the differentially expressed proteins in lung cancer dataset. Initially, the structural template for these proteins is identified from structural database using homology search and perform homology modelling approach to predict its native 3D structure. Three-dimensional model obtained was validated using Ramachandran plot analysis to find the reliability of the model. Results: Four proteins were differentially expressed and were significant proteins in causing lung cancer. Among the four proteins, Matrixmetallo proteinase (P39900 had a known 3D structure and hence was not considered for modelling. The remaining proteins Polo like kinase I Q58A51, Trophinin B1AKF1, Thrombomodulin P07204 were modelled and validated. Conclusions: The three dimensional structure of proteins provides insights about the functional aspect and regulatory aspect of the protein. Thus, this study will be a breakthrough for further lung cancer related studies.

  9. Protein single-model quality assessment by feature-based probability density functions.

    Science.gov (United States)

    Cao, Renzhi; Cheng, Jianlin

    2016-04-04

    Protein quality assessment (QA) has played an important role in protein structure prediction. We developed a novel single-model quality assessment method-Qprob. Qprob calculates the absolute error for each protein feature value against the true quality scores (i.e. GDT-TS scores) of protein structural models, and uses them to estimate its probability density distribution for quality assessment. Qprob has been blindly tested on the 11th Critical Assessment of Techniques for Protein Structure Prediction (CASP11) as MULTICOM-NOVEL server. The official CASP result shows that Qprob ranks as one of the top single-model QA methods. In addition, Qprob makes contributions to our protein tertiary structure predictor MULTICOM, which is officially ranked 3rd out of 143 predictors. The good performance shows that Qprob is good at assessing the quality of models of hard targets. These results demonstrate that this new probability density distribution based method is effective for protein single-model quality assessment and is useful for protein structure prediction. The webserver of Qprob is available at: http://calla.rnet.missouri.edu/qprob/. The software is now freely available in the web server of Qprob.

  10. Simulation studies of protein-induced bilayer deformations, and lipid-induced protein tilting, on a mesoscopic model for lipid bilayers with embedded proteins

    DEFF Research Database (Denmark)

    Venturoli, M.; Smit, B.; Sperotto, Maria Maddalena

    2005-01-01

    membranes. Here we present a mesoscopic model for lipid bilayers with embedded proteins, which we have studied with the help of the dissipative particle dynamics simulation technique. Because hydrophobic matching is believed to be one of the main physical mechanisms regulating lipid-protein interactions......-induced protein tilt, with the hydrophobic mismatch ( positive and negative) between the protein hydrophobic length and the pure lipid bilayer hydrophobic thickness. The protein-induced bilayer perturbation was quantified in terms of a coherence length, xi(P), of the lipid bilayer hydrophobic thickness pro. le...... for positive values of mismatch; a dependence on the protein size appears as well. In the case of large model proteins experiencing extreme mismatch conditions, in the region next to the so-called lipid annulus, there appears an undershooting ( or overshooting) region where the bilayer hydrophobic thickness...

  11. SynechoNET: integrated protein-protein interaction database of a model cyanobacterium Synechocystis sp. PCC 6803

    OpenAIRE

    Kim, Woo-Yeon; Kang, Sungsoo; Kim, Byoung-Chul; Oh, Jeehyun; Cho, Seongwoong; Bhak, Jong; Choi, Jong-Soon

    2008-01-01

    Background Cyanobacteria are model organisms for studying photosynthesis, carbon and nitrogen assimilation, evolution of plant plastids, and adaptability to environmental stresses. Despite many studies on cyanobacteria, there is no web-based database of their regulatory and signaling protein-protein interaction networks to date. Description We report a database and website SynechoNET that provides predicted protein-protein interactions. SynechoNET shows cyanobacterial domain-domain interactio...

  12. Probing the interaction of brain fatty acid binding protein (B-FABP with model membranes.

    Directory of Open Access Journals (Sweden)

    Fábio Dyszy

    Full Text Available Brain fatty acid-binding protein (B-FABP interacts with biological membranes and delivers polyunsaturated fatty acids (FAs via a collisional mechanism. The binding of FAs in the protein and the interaction with membranes involve a motif called "portal region", formed by two small α-helices, A1 and A2, connected by a loop. We used a combination of site-directed mutagenesis and electron spin resonance to probe the changes in the protein and in the membrane model induced by their interaction. Spin labeled B-FABP mutants and lipidic spin probes incorporated into a membrane model confirmed that B-FABP interacts with micelles through the portal region and led to structural changes in the protein as well in the micelles. These changes were greater in the presence of LPG when compared to the LPC models. ESR spectra of B-FABP labeled mutants showed the presence of two groups of residues that responded to the presence of micelles in opposite ways. In the presence of lysophospholipids, group I of residues, whose side chains point outwards from the contact region between the helices, had their mobility decreased in an environment of lower polarity when compared to the same residues in solution. The second group, composed by residues with side chains situated at the interface between the α-helices, experienced an increase in mobility in the presence of the model membranes. These modifications in the ESR spectra of B-FABP mutants are compatible with a less ordered structure of the portal region inner residues (group II that is likely to facilitate the delivery of FAs to target membranes. On the other hand, residues in group I and micelle components have their mobilities decreased probably as a result of the formation of a collisional complex. Our results bring new insights for the understanding of the gating and delivery mechanisms of FABPs.

  13. Intrinsically disordered proteins--relation to general model expressing the active role of the water environment.

    Science.gov (United States)

    Kalinowska, Barbara; Banach, Mateusz; Konieczny, Leszek; Marchewka, Damian; Roterman, Irena

    2014-01-01

    This work discusses the role of unstructured polypeptide chain fragments in shaping the protein's hydrophobic core. Based on the "fuzzy oil drop" model, which assumes an idealized distribution of hydrophobicity density described by the 3D Gaussian, we can determine which fragments make up the core and pinpoint residues whose location conflicts with theoretical predictions. We show that the structural influence of the water environment determines the positions of disordered fragments, leading to the formation of a hydrophobic core overlaid by a hydrophilic mantle. This phenomenon is further described by studying selected proteins which are known to be unstable and contain intrinsically disordered fragments. Their properties are established quantitatively, explaining the causative relation between the protein's structure and function and facilitating further comparative analyses of various structural models. © 2014 Elsevier Inc. All rights reserved.

  14. Comparative analysis of twin-arginine (Tat)-dependent protein secretion of a heterologous model protein (GFP) in three different Gram-positive bacteria

    NARCIS (Netherlands)

    Meissner, Daniel; Vollstedt, Angela; van Dijl, Jan Maarten; Freudl, Roland

    In contrast to the general protein secretion (Sec) system, the twin-arginine translocation (Tat) export pathway allows the translocation of proteins across the bacterial plasma membrane in a fully folded conformation. Due to this feature, the Tat pathway provides an attractive alternative to the

  15. Comparative molecular modeling study of Arabidopsis NADPH-dependent thioredoxin reductase and its hybrid protein.

    Directory of Open Access Journals (Sweden)

    Yuno Lee

    Full Text Available 2-Cys peroxiredoxins (Prxs play important roles in the protection of chloroplast proteins from oxidative damage. Arabidopsis NADPH-dependent thioredoxin reductase isotype C (AtNTRC was identified as efficient electron donor for chloroplastic 2-Cys Prx-A. There are three isotypes (A, B, and C of thioredoxin reductase (TrxR in Arabidopsis. AtNTRA contains only TrxR domain, but AtNTRC consists of N-terminal TrxR and C-terminal thioredoxin (Trx domains. AtNTRC has various oligomer structures, and Trx domain is important for chaperone activity. Our previous experimental study has reported that the hybrid protein (AtNTRA-(Trx-D, which was a fusion of AtNTRA and Trx domain from AtNTRC, has formed variety of structures and shown strong chaperone activity. But, electron transfer mechanism was not detected at all. To find out the reason of this problem with structural basis, we performed two different molecular dynamics (MD simulations on AtNTRC and AtNTRA-(Trx-D proteins with same cofactors such as NADPH and flavin adenine dinucleotide (FAD for 50 ns. Structural difference has found from superimposition of two structures that were taken relatively close to average structure. The main reason that AtNTRA-(Trx-D cannot transfer the electron from TrxR domain to Trx domain is due to the difference of key catalytic residues in active site. The long distance between TrxR C153 and disulfide bond of Trx C387-C390 has been observed in AtNTRA-(Trx-D because of following reasons: i unstable and unfavorable interaction of the linker region, ii shifted Trx domain, and iii different or weak interface interaction of Trx domains. This study is one of the good examples for understanding the relationship between structure formation and reaction activity in hybrid protein. In addition, this study would be helpful for further study on the mechanism of electron transfer reaction in NADPH-dependent thioredoxin reductase proteins.

  16. Using the Relevance Vector Machine Model Combined with Local Phase Quantization to Predict Protein-Protein Interactions from Protein Sequences

    Directory of Open Access Journals (Sweden)

    Ji-Yong An

    2016-01-01

    Full Text Available We propose a novel computational method known as RVM-LPQ that combines the Relevance Vector Machine (RVM model and Local Phase Quantization (LPQ to predict PPIs from protein sequences. The main improvements are the results of representing protein sequences using the LPQ feature representation on a Position Specific Scoring Matrix (PSSM, reducing the influence of noise using a Principal Component Analysis (PCA, and using a Relevance Vector Machine (RVM based classifier. We perform 5-fold cross-validation experiments on Yeast and Human datasets, and we achieve very high accuracies of 92.65% and 97.62%, respectively, which is significantly better than previous works. To further evaluate the proposed method, we compare it with the state-of-the-art support vector machine (SVM classifier on the Yeast dataset. The experimental results demonstrate that our RVM-LPQ method is obviously better than the SVM-based method. The promising experimental results show the efficiency and simplicity of the proposed method, which can be an automatic decision support tool for future proteomics research.

  17. Differences in postprandial protein handling after beef compared with milk ingestion during postexercise recovery: a randomized controlled trial.

    Science.gov (United States)

    Burd, Nicholas A; Gorissen, Stefan H; van Vliet, Stephan; Snijders, Tim; van Loon, Luc Jc

    2015-10-01

    Protein consumed after resistance exercise increases postexercise muscle protein synthesis rates. To date, dairy protein has been studied extensively, with little known about the capacity of other protein-dense foods to augment postexercise muscle protein synthesis rates. We aimed to compare protein digestion and absorption kinetics, postprandial amino acid availability, anabolic signaling, and the subsequent myofibrillar protein synthetic response after the ingestion of milk compared with beef during recovery from resistance-type exercise. In crossover trials, 12 healthy young men performed a single bout of resistance exercise. Immediately after cessation of exercise, participants ingested 30 g protein by consuming isonitrogenous amounts of intrinsically l-[1-(13)C]phenylalanine-labeled beef or milk. Blood and muscle biopsy samples were collected at rest and after exercise during primed continuous infusions of l-[ring-(2)H5]phenylalanine and l-[ring-3,5-(2)H2]tyrosine to assess protein digestion and absorption kinetics, plasma amino acid availability, anabolic signaling, and subsequent myofibrillar protein synthesis rates in vivo in young men. Beef protein-derived phenylalanine appeared more rapidly in circulation compared with milk ingestion (P Nutrition.

  18. A comparative molecular dynamics study on thermostability of human and chicken prion proteins

    International Nuclear Information System (INIS)

    Ji, Hong-Fang; Zhang, Hong-Yu

    2007-01-01

    To compare the thermostabilities of human and chicken normal cellular prion proteins (HuPrP C and CkPrP C ), molecular dynamics (MD) simulations were performed for both proteins at an ensemble level (10 parallel simulations at 400 K and 5 parallel simulations at 300 K as a control). It is found that the thermostability of HuPrP C is comparable with that of CkPrP C , which implicates that the non-occurrence of prion diseases in non-mammals cannot be completely attributed to the thermodynamic properties of non-mammalian PrP C

  19. Hydration dynamics near a model protein surface

    International Nuclear Information System (INIS)

    Russo, Daniela; Hura, Greg; Head-Gordon, Teresa

    2003-01-01

    The evolution of water dynamics from dilute to very high concentration solutions of a prototypical hydrophobic amino acid with its polar backbone, N-acetyl-leucine-methylamide (NALMA), is studied by quasi-elastic neutron scattering and molecular dynamics simulation for both the completely deuterated and completely hydrogenated leucine monomer. We observe several unexpected features in the dynamics of these biological solutions under ambient conditions. The NALMA dynamics shows evidence of de Gennes narrowing, an indication of coherent long timescale structural relaxation dynamics. The translational water dynamics are analyzed in a first approximation with a jump diffusion model. At the highest solute concentrations, the hydration water dynamics is significantly suppressed and characterized by a long residential time and a slow diffusion coefficient. The analysis of the more dilute concentration solutions takes into account the results of the 2.0M solution as a model of the first hydration shell. Subtracting the first hydration layer based on the 2.0M spectra, the translational diffusion dynamics is still suppressed, although the rotational relaxation time and residential time are converged to bulk-water values. Molecular dynamics analysis shows spatially heterogeneous dynamics at high concentration that becomes homogeneous at more dilute concentrations. We discuss the hydration dynamics results of this model protein system in the context of glassy systems, protein function, and protein-protein interfaces

  20. Statistical-mechanical lattice models for protein-DNA binding in chromatin

    International Nuclear Information System (INIS)

    Teif, Vladimir B; Rippe, Karsten

    2010-01-01

    Statistical-mechanical lattice models for protein-DNA binding are well established as a method to describe complex ligand binding equilibria measured in vitro with purified DNA and protein components. Recently, a new field of applications has opened up for this approach since it has become possible to experimentally quantify genome-wide protein occupancies in relation to the DNA sequence. In particular, the organization of the eukaryotic genome by histone proteins into a nucleoprotein complex termed chromatin has been recognized as a key parameter that controls the access of transcription factors to the DNA sequence. New approaches have to be developed to derive statistical-mechanical lattice descriptions of chromatin-associated protein-DNA interactions. Here, we present the theoretical framework for lattice models of histone-DNA interactions in chromatin and investigate the (competitive) DNA binding of other chromosomal proteins and transcription factors. The results have a number of applications for quantitative models for the regulation of gene expression.

  1. Insight into the intermolecular recognition mechanism between Keap1 and IKKβ combining homology modelling, protein-protein docking, molecular dynamics simulations and virtual alanine mutation.

    Directory of Open Access Journals (Sweden)

    Zheng-Yu Jiang

    Full Text Available Degradation of certain proteins through the ubiquitin-proteasome pathway is a common strategy taken by the key modulators responsible for stress responses. Kelch-like ECH-associated protein-1(Keap1, a substrate adaptor component of the Cullin3 (Cul3-based ubiquitin E3 ligase complex, mediates the ubiquitination of two key modulators, NF-E2-related factor 2 (Nrf2 and IκB kinase β (IKKβ, which are involved in the redox control of gene transcription. However, compared to the Keap1-Nrf2 protein-protein interaction (PPI, the intermolecular recognition mechanism of Keap1 and IKKβ has been poorly investigated. In order to explore the binding pattern between Keap1 and IKKβ, the PPI model of Keap1 and IKKβ was investigated. The structure of human IKKβ was constructed by means of the homology modeling method and using reported crystal structure of Xenopus laevis IKKβ as the template. A protein-protein docking method was applied to develop the Keap1-IKKβ complex model. After the refinement and visual analysis of docked proteins, the chosen pose was further optimized through molecular dynamics simulations. The resulting structure was utilized to conduct the virtual alanine mutation for the exploration of hot-spots significant for the intermolecular interaction. Overall, our results provided structural insights into the PPI model of Keap1-IKKβ and suggest that the substrate specificity of Keap1 depend on the interaction with the key tyrosines, namely Tyr525, Tyr574 and Tyr334. The study presented in the current project may be useful to design molecules that selectively modulate Keap1. The selective recognition mechanism of Keap1 with IKKβ or Nrf2 will be helpful to further know the crosstalk between NF-κB and Nrf2 signaling.

  2. Exploiting conformational ensembles in modeling protein-protein interactions on the proteome scale

    Science.gov (United States)

    Kuzu, Guray; Gursoy, Attila; Nussinov, Ruth; Keskin, Ozlem

    2013-01-01

    Cellular functions are performed through protein-protein interactions; therefore, identification of these interactions is crucial for understanding biological processes. Recent studies suggest that knowledge-based approaches are more useful than ‘blind’ docking for modeling at large scales. However, a caveat of knowledge-based approaches is that they treat molecules as rigid structures. The Protein Data Bank (PDB) offers a wealth of conformations. Here, we exploited ensemble of the conformations in predictions by a knowledge-based method, PRISM. We tested ‘difficult’ cases in a docking-benchmark dataset, where the unbound and bound protein forms are structurally different. Considering alternative conformations for each protein, the percentage of successfully predicted interactions increased from ~26% to 66%, and 57% of the interactions were successfully predicted in an ‘unbiased’ scenario, in which data related to the bound forms were not utilized. If the appropriate conformation, or relevant template interface, is unavailable in the PDB, PRISM could not predict the interaction successfully. The pace of the growth of the PDB promises a rapid increase of ensemble conformations emphasizing the merit of such knowledge-based ensemble strategies for higher success rates in protein-protein interaction predictions on an interactome-scale. We constructed the structural network of ERK interacting proteins as a case study. PMID:23590674

  3. Stochastic lattice model of synaptic membrane protein domains.

    Science.gov (United States)

    Li, Yiwei; Kahraman, Osman; Haselwandter, Christoph A

    2017-05-01

    Neurotransmitter receptor molecules, concentrated in synaptic membrane domains along with scaffolds and other kinds of proteins, are crucial for signal transmission across chemical synapses. In common with other membrane protein domains, synaptic domains are characterized by low protein copy numbers and protein crowding, with rapid stochastic turnover of individual molecules. We study here in detail a stochastic lattice model of the receptor-scaffold reaction-diffusion dynamics at synaptic domains that was found previously to capture, at the mean-field level, the self-assembly, stability, and characteristic size of synaptic domains observed in experiments. We show that our stochastic lattice model yields quantitative agreement with mean-field models of nonlinear diffusion in crowded membranes. Through a combination of analytic and numerical solutions of the master equation governing the reaction dynamics at synaptic domains, together with kinetic Monte Carlo simulations, we find substantial discrepancies between mean-field and stochastic models for the reaction dynamics at synaptic domains. Based on the reaction and diffusion properties of synaptic receptors and scaffolds suggested by previous experiments and mean-field calculations, we show that the stochastic reaction-diffusion dynamics of synaptic receptors and scaffolds provide a simple physical mechanism for collective fluctuations in synaptic domains, the molecular turnover observed at synaptic domains, key features of the observed single-molecule trajectories, and spatial heterogeneity in the effective rates at which receptors and scaffolds are recycled at the cell membrane. Our work sheds light on the physical mechanisms and principles linking the collective properties of membrane protein domains to the stochastic dynamics that rule their molecular components.

  4. CONFOLD2: improved contact-driven ab initio protein structure modeling.

    Science.gov (United States)

    Adhikari, Badri; Cheng, Jianlin

    2018-01-25

    Contact-guided protein structure prediction methods are becoming more and more successful because of the latest advances in residue-residue contact prediction. To support contact-driven structure prediction, effective tools that can quickly build tertiary structural models of good quality from predicted contacts need to be developed. We develop an improved contact-driven protein modelling method, CONFOLD2, and study how it may be effectively used for ab initio protein structure prediction with predicted contacts as input. It builds models using various subsets of input contacts to explore the fold space under the guidance of a soft square energy function, and then clusters the models to obtain the top five models. CONFOLD2 obtains an average reconstruction accuracy of 0.57 TM-score for the 150 proteins in the PSICOV contact prediction dataset. When benchmarked on the CASP11 contacts predicted using CONSIP2 and CASP12 contacts predicted using Raptor-X, CONFOLD2 achieves a mean TM-score of 0.41 on both datasets. CONFOLD2 allows to quickly generate top five structural models for a protein sequence when its secondary structures and contacts predictions at hand. The source code of CONFOLD2 is publicly available at https://github.com/multicom-toolbox/CONFOLD2/ .

  5. Employing proteomic analysis to compare Paracoccidioides lutzii yeast and mycelium cell wall proteins.

    Science.gov (United States)

    Araújo, Danielle Silva; de Sousa Lima, Patrícia; Baeza, Lilian Cristiane; Parente, Ana Flávia Alves; Melo Bailão, Alexandre; Borges, Clayton Luiz; de Almeida Soares, Célia Maria

    2017-11-01

    Paracoccidioidomycosis is an important systemic mycosis caused by thermodimorphic fungi of the Paracoccidioides genus. During the infective process, the cell wall acts at the interface between the fungus and the host. In this way, the cell wall has a key role in growth, environment sensing and interaction, as well as morphogenesis of the fungus. Since the cell wall is absent in mammals, it may present molecules that are described as target sites for new antifungal drugs. Despite its importance, up to now few studies have been conducted employing proteomics in for the identification of cell wall proteins in Paracoccidioides spp. Here, a detailed proteomic approach, including cell wall-fractionation coupled to NanoUPLC-MS E , was used to study and compare the cell wall fractions from Paracoccidioides lutzii mycelia and yeast cells. The analyzed samples consisted of cell wall proteins extracted by hot SDS followed by extraction by mild alkali. In summary, 512 proteins constituting different cell wall fractions were identified, including 7 predicted GPI-dependent cell wall proteins that are potentially involved in cell wall metabolism. Adhesins previously described in Paracoccidioides spp. such as enolase, glyceraldehyde-3-phosphate dehydrogenase were identified. Comparing the proteins in mycelium and yeast cells, we detected some that are common to both fungal phases, such as Ecm33, and some specific proteins, as glucanase Crf1. All of those proteins were described in the metabolism of cell wall. Our study provides an important elucidation of cell wall composition of fractions in Paracoccidioides, opening a way to understand the fungus cell wall architecture. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. Hidden Markov model-derived structural alphabet for proteins: the learning of protein local shapes captures sequence specificity.

    Science.gov (United States)

    Camproux, A C; Tufféry, P

    2005-08-05

    Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabet. Such a model learns simultaneously the shape of representative structural letters describing the local conformation and the logic of their connections, i.e. the transition matrix between the letters. Here, we move one step further and report some evidence that such a model of protein local architecture also captures some accurate amino acid features. All the letters have specific and distinct amino acid distributions. Moreover, we show that words of amino acids can have significant propensities for some letters. Perspectives point towards the prediction of the series of letters describing the structure of a protein from its amino acid sequence.

  7. Reovirus FAST Protein Enhances Vesicular Stomatitis Virus Oncolytic Virotherapy in Primary and Metastatic Tumor Models

    Directory of Open Access Journals (Sweden)

    Fabrice Le Boeuf

    2017-09-01

    Full Text Available The reovirus fusion-associated small transmembrane (FAST proteins are the smallest known viral fusogens (∼100–150 amino acids and efficiently induce cell-cell fusion and syncytium formation in multiple cell types. Syncytium formation enhances cell-cell virus transmission and may also induce immunogenic cell death, a form of apoptosis that stimulates immune recognition of tumor cells. These properties suggest that FAST proteins might serve to enhance oncolytic virotherapy. The oncolytic activity of recombinant VSVΔM51 (an interferon-sensitive vesicular stomatitis virus [VSV] mutant encoding the p14 FAST protein (VSV-p14 was compared with a similar construct encoding GFP (VSV-GFP in cell culture and syngeneic BALB/c tumor models. Compared with VSV-GFP, VSV-p14 exhibited increased oncolytic activity against MCF-7 and 4T1 breast cancer spheroids in culture and reduced primary 4T1 breast tumor growth in vivo. VSV-p14 prolonged survival in both primary and metastatic 4T1 breast cancer models, and in a CT26 metastatic colon cancer model. As with VSV-GFP, VSV-p14 preferentially replicated in vivo in tumors and was cleared rapidly from other sites. Furthermore, VSV-p14 increased the numbers of activated splenic CD4, CD8, natural killer (NK, and natural killer T (NKT cells, and increased the number of activated CD4 and CD8 cells in tumors. FAST proteins may therefore provide a multi-pronged approach to improving oncolytic virotherapy via syncytium formation and enhanced immune stimulation.

  8. Hidden markov model for the prediction of transmembrane proteins using MATLAB.

    Science.gov (United States)

    Chaturvedi, Navaneet; Shanker, Sudhanshu; Singh, Vinay Kumar; Sinha, Dhiraj; Pandey, Paras Nath

    2011-01-01

    Since membranous proteins play a key role in drug targeting therefore transmembrane proteins prediction is active and challenging area of biological sciences. Location based prediction of transmembrane proteins are significant for functional annotation of protein sequences. Hidden markov model based method was widely applied for transmembrane topology prediction. Here we have presented a revised and a better understanding model than an existing one for transmembrane protein prediction. Scripting on MATLAB was built and compiled for parameter estimation of model and applied this model on amino acid sequence to know the transmembrane and its adjacent locations. Estimated model of transmembrane topology was based on TMHMM model architecture. Only 7 super states are defined in the given dataset, which were converted to 96 states on the basis of their length in sequence. Accuracy of the prediction of model was observed about 74 %, is a good enough in the area of transmembrane topology prediction. Therefore we have concluded the hidden markov model plays crucial role in transmembrane helices prediction on MATLAB platform and it could also be useful for drug discovery strategy. The database is available for free at bioinfonavneet@gmail.comvinaysingh@bhu.ac.in.

  9. Randomized, double-blind, controlled, comparative trial of formula food containing soy protein vs. milk protein in visceral fat obesity. FLAVO study

    International Nuclear Information System (INIS)

    Takahira, Masaya; Noda, Keita; Fukushima, Mikio; Zhang, Bo; Mitsutake, Ryoko; Uehara, Yoshinari; Ogawa, Masahiro; Saku, Keijiro; Kakuma, Tatsuyuki

    2011-01-01

    The purpose of the present study was to clarify the efficacy of soy at reducing visceral fat. A randomized, double-blind, controlled, comparative trial was carried out to compare formula food containing soy protein (SP) to the same food in which soy was replaced with milk protein (MP). Forty-eight participants were enrolled for the treatment of visceral fat obesity (visceral fat area >100 cm 2 on computed tomography). The SP formula contained 12 g of SP, 9 g of MP, and other nutrients, and was given for 20 weeks in the morning, while in the MP formula SP was replaced with MP. During the 20 weeks of the trial period, visceral fat area and subcutaneous fat area in the MP group were significantly reduced, while those in the SP group did not change as assessed on analysis of covariance. Although waist circumference was reduced in both the SP and MP groups, body weight and body mass index were significantly reduced only in the MP group. Based on a mixed-effects model, the difference in log-transformed visceral fat profiles between the 2 groups was statistically significant (P<0.05), while a negative relationship was observed between the changes in visceral fat and adiponectin in the MP group (P<0.001), but not in the SP group. Formula food containing MP is superior to that containing SP for reducing visceral and subcutaneous fat. (author)

  10. Comparative temporospatial expression profiling of murine amelotin protein during amelogenesis.

    Science.gov (United States)

    Somogyi-Ganss, Eszter; Nakayama, Yohei; Iwasaki, Kengo; Nakano, Yukiko; Stolf, Daiana; McKee, Marc D; Ganss, Bernhard

    2012-01-01

    Tooth enamel is formed in a typical biomineralization process under the guidance of specific organic components. Amelotin (AMTN) is a recently identified, secreted protein that is transcribed predominantly during the maturation stage of enamel formation, but its protein expression profile throughout amelogenesis has not been described in detail. The main objective of this study was to define the spatiotemporal expression profile of AMTN during tooth development in comparison with other known enamel proteins. A peptide antibody against AMTN was raised in rabbits, affinity purified and used for immunohistochemical analyses on sagittal and transverse paraffin sections of decalcified mouse hemimandibles. The localization of AMTN was compared to that of known enamel proteins amelogenin, ameloblastin, enamelin, odontogenic ameloblast-associated/amyloid in Pindborg tumors and kallikrein 4. Three-dimensional images of AMTN localization in molars at selected ages were reconstructed from serial stained sections, and transmission electron microscopy was used for ultrastructural localization of AMTN. AMTN was detected in ameloblasts of molars in a transient fashion, declining at the time of tooth eruption. Prominent expression in maturation stage ameloblasts of the continuously erupting incisor persisted into adulthood. In contrast, amelogenin, ameloblastin and enamelin were predominantly found during the early secretory stage, while odontogenic ameloblast-associated/amyloid in Pindborg tumors and kallikrein 4 expression in maturation stage ameloblasts paralleled that of AMTN. Secreted AMTN was detected at the interface between ameloblasts and the mineralized enamel. Recombinant AMTN protein did not mediate cell attachment in vitro. These results suggest a primary role for AMTN in the late stages of enamel mineralization. Copyright © 2011 S. Karger AG, Basel.

  11. An approach to creating a more realistic working model from a protein data bank entry.

    Science.gov (United States)

    Brandon, Christopher J; Martin, Benjamin P; McGee, Kelly J; Stewart, James J P; Braun-Sand, Sonja B

    2015-01-01

    An accurate model of three-dimensional protein structure is important in a variety of fields such as structure-based drug design and mechanistic studies of enzymatic reactions. While the entries in the Protein Data Bank ( http://www.pdb.org ) provide valuable information about protein structures, a small fraction of the PDB structures were found to contain anomalies not reported in the PDB file. The semiempirical PM7 method in MOPAC2012 was used for identifying anomalously short hydrogen bonds, C-H⋯O/C-H⋯N interactions, non-bonding close contacts, and unrealistic covalent bond lengths in recently published Protein Data Bank files. It was also used to generate new structures with these faults removed. When the semiempirical models were compared to those of PDB_REDO (http://www.cmbi.ru.nl/pdb_redo/), the clashscores, as defined by MolProbity ( http://molprobity.biochem.duke.edu/), were better in about 50% of the structures. The semiempirical models also had a lower root-mean-square-deviation value in nearly all cases than those from PDB_REDO, indicative of a better conservation of the tertiary structure. Finally, the semiempirical models were found to have lower clashscores than the initial PDB file in all but one case. Because this approach maintains as much of the original tertiary structure as possible while improving anomalous interactions, it should be useful to theoreticians, experimentalists, and crystallographers investigating the structure and function of proteins.

  12. A Physiologically Based Pharmacokinetic Model to Predict the Pharmacokinetics of Highly Protein-Bound Drugs and Impact of Errors in Plasma Protein Binding

    Science.gov (United States)

    Ye, Min; Nagar, Swati; Korzekwa, Ken

    2015-01-01

    Predicting the pharmacokinetics of highly protein-bound drugs is difficult. Also, since historical plasma protein binding data was often collected using unbuffered plasma, the resulting inaccurate binding data could contribute to incorrect predictions. This study uses a generic physiologically based pharmacokinetic (PBPK) model to predict human plasma concentration-time profiles for 22 highly protein-bound drugs. Tissue distribution was estimated from in vitro drug lipophilicity data, plasma protein binding, and blood: plasma ratio. Clearance was predicted with a well-stirred liver model. Underestimated hepatic clearance for acidic and neutral compounds was corrected by an empirical scaling factor. Predicted values (pharmacokinetic parameters, plasma concentration-time profile) were compared with observed data to evaluate model accuracy. Of the 22 drugs, less than a 2-fold error was obtained for terminal elimination half-life (t1/2, 100% of drugs), peak plasma concentration (Cmax, 100%), area under the plasma concentration-time curve (AUC0–t, 95.4%), clearance (CLh, 95.4%), mean retention time (MRT, 95.4%), and steady state volume (Vss, 90.9%). The impact of fup errors on CLh and Vss prediction was evaluated. Errors in fup resulted in proportional errors in clearance prediction for low-clearance compounds, and in Vss prediction for high-volume neutral drugs. For high-volume basic drugs, errors in fup did not propagate to errors in Vss prediction. This is due to the cancellation of errors in the calculations for tissue partitioning of basic drugs. Overall, plasma profiles were well simulated with the present PBPK model. PMID:26531057

  13. Modeling of axonal endoplasmic reticulum network by spastic paraplegia proteins.

    Science.gov (United States)

    Yalçın, Belgin; Zhao, Lu; Stofanko, Martin; O'Sullivan, Niamh C; Kang, Zi Han; Roost, Annika; Thomas, Matthew R; Zaessinger, Sophie; Blard, Olivier; Patto, Alex L; Sohail, Anood; Baena, Valentina; Terasaki, Mark; O'Kane, Cahir J

    2017-07-25

    Axons contain a smooth tubular endoplasmic reticulum (ER) network that is thought to be continuous with ER throughout the neuron; the mechanisms that form this axonal network are unknown. Mutations affecting reticulon or REEP proteins, with intramembrane hairpin domains that model ER membranes, cause an axon degenerative disease, hereditary spastic paraplegia (HSP). We show that Drosophila axons have a dynamic axonal ER network, which these proteins help to model. Loss of HSP hairpin proteins causes ER sheet expansion, partial loss of ER from distal motor axons, and occasional discontinuities in axonal ER. Ultrastructural analysis reveals an extensive ER network in axons, which shows larger and fewer tubules in larvae that lack reticulon and REEP proteins, consistent with loss of membrane curvature. Therefore HSP hairpin-containing proteins are required for shaping and continuity of axonal ER, thus suggesting roles for ER modeling in axon maintenance and function.

  14. A model in which heat shock protein 90 targets protein-folding clefts: rationale for a new approach to neuroprotective treatment of protein folding diseases.

    Science.gov (United States)

    Pratt, William B; Morishima, Yoshihiro; Gestwicki, Jason E; Lieberman, Andrew P; Osawa, Yoichi

    2014-11-01

    In an EBM Minireview published in 2010, we proposed that the heat shock protein (Hsp)90/Hsp70-based chaperone machinery played a major role in determining the selection of proteins that have undergone oxidative or other toxic damage for ubiquitination and proteasomal degradation. The proposal was based on a model in which the Hsp90 chaperone machinery regulates signaling by modulating ligand-binding clefts. The model provides a framework for thinking about the development of neuroprotective therapies for protein-folding diseases like Alzheimer's disease (AD), Parkinson's disease (PD), and the polyglutamine expansion disorders, such as Huntington's disease (HD) and spinal and bulbar muscular atrophy (SBMA). Major aberrant proteins that misfold and accumulate in these diseases are "client" proteins of the abundant and ubiquitous stress chaperone Hsp90. These Hsp90 client proteins include tau (AD), α-synuclein (PD), huntingtin (HD), and the expanded glutamine androgen receptor (polyQ AR) (SBMA). In this Minireview, we update our model in which Hsp90 acts on protein-folding clefts and show how it forms a rational basis for developing drugs that promote the targeted elimination of these aberrant proteins. © 2014 by the Society for Experimental Biology and Medicine.

  15. Models of protein and amino acid requirements for cattle

    Directory of Open Access Journals (Sweden)

    Luis Orlindo Tedeschi

    2015-03-01

    Full Text Available Protein supply and requirements by ruminants have been studied for more than a century. These studies led to the accumulation of lots of scientific information about digestion and metabolism of protein by ruminants as well as the characterization of the dietary protein in order to maximize animal performance. During the 1980s and 1990s, when computers became more accessible and powerful, scientists began to conceptualize and develop mathematical nutrition models, and to program them into computers to assist with ration balancing and formulation for domesticated ruminants, specifically dairy and beef cattle. The most commonly known nutrition models developed during this period were the National Research Council (NRC in the United States, Agricultural Research Council (ARC in the United Kingdom, Institut National de la Recherche Agronomique (INRA in France, and the Commonwealth Scientific and Industrial Research Organization (CSIRO in Australia. Others were derivative works from these models with different degrees of modifications in the supply or requirement calculations, and the modeling nature (e.g., static or dynamic, mechanistic, or deterministic. Circa 1990s, most models adopted the metabolizable protein (MP system over the crude protein (CP and digestible CP systems to estimate supply of MP and the factorial system to calculate MP required by the animal. The MP system included two portions of protein (i.e., the rumen-undegraded dietary CP - RUP - and the contributions of microbial CP - MCP as the main sources of MP for the animal. Some models would explicitly account for the impact of dry matter intake (DMI on the MP required for maintenance (MPm; e.g., Cornell Net Carbohydrate and Protein System - CNCPS, the Dutch system - DVE/OEB, while others would simply account for scurf, urinary, metabolic fecal, and endogenous contributions independently of DMI. All models included milk yield and its components in estimating MP required for lactation

  16. A resource for benchmarking the usefulness of protein structure models.

    Science.gov (United States)

    Carbajo, Daniel; Tramontano, Anna

    2012-08-02

    Increasingly, biologists and biochemists use computational tools to design experiments to probe the function of proteins and/or to engineer them for a variety of different purposes. The most effective strategies rely on the knowledge of the three-dimensional structure of the protein of interest. However it is often the case that an experimental structure is not available and that models of different quality are used instead. On the other hand, the relationship between the quality of a model and its appropriate use is not easy to derive in general, and so far it has been analyzed in detail only for specific application. This paper describes a database and related software tools that allow testing of a given structure based method on models of a protein representing different levels of accuracy. The comparison of the results of a computational experiment on the experimental structure and on a set of its decoy models will allow developers and users to assess which is the specific threshold of accuracy required to perform the task effectively. The ModelDB server automatically builds decoy models of different accuracy for a given protein of known structure and provides a set of useful tools for their analysis. Pre-computed data for a non-redundant set of deposited protein structures are available for analysis and download in the ModelDB database. IMPLEMENTATION, AVAILABILITY AND REQUIREMENTS: Project name: A resource for benchmarking the usefulness of protein structure models. Project home page: http://bl210.caspur.it/MODEL-DB/MODEL-DB_web/MODindex.php.Operating system(s): Platform independent. Programming language: Perl-BioPerl (program); mySQL, Perl DBI and DBD modules (database); php, JavaScript, Jmol scripting (web server). Other requirements: Java Runtime Environment v1.4 or later, Perl, BioPerl, CPAN modules, HHsearch, Modeller, LGA, NCBI Blast package, DSSP, Speedfill (Surfnet) and PSAIA. License: Free. Any restrictions to use by non-academics: No.

  17. A resource for benchmarking the usefulness of protein structure models

    Directory of Open Access Journals (Sweden)

    Carbajo Daniel

    2012-08-01

    Full Text Available Abstract Background Increasingly, biologists and biochemists use computational tools to design experiments to probe the function of proteins and/or to engineer them for a variety of different purposes. The most effective strategies rely on the knowledge of the three-dimensional structure of the protein of interest. However it is often the case that an experimental structure is not available and that models of different quality are used instead. On the other hand, the relationship between the quality of a model and its appropriate use is not easy to derive in general, and so far it has been analyzed in detail only for specific application. Results This paper describes a database and related software tools that allow testing of a given structure based method on models of a protein representing different levels of accuracy. The comparison of the results of a computational experiment on the experimental structure and on a set of its decoy models will allow developers and users to assess which is the specific threshold of accuracy required to perform the task effectively. Conclusions The ModelDB server automatically builds decoy models of different accuracy for a given protein of known structure and provides a set of useful tools for their analysis. Pre-computed data for a non-redundant set of deposited protein structures are available for analysis and download in the ModelDB database. Implementation, availability and requirements Project name: A resource for benchmarking the usefulness of protein structure models. Project home page: http://bl210.caspur.it/MODEL-DB/MODEL-DB_web/MODindex.php. Operating system(s: Platform independent. Programming language: Perl-BioPerl (program; mySQL, Perl DBI and DBD modules (database; php, JavaScript, Jmol scripting (web server. Other requirements: Java Runtime Environment v1.4 or later, Perl, BioPerl, CPAN modules, HHsearch, Modeller, LGA, NCBI Blast package, DSSP, Speedfill (Surfnet and PSAIA. License: Free. Any

  18. A resource for benchmarking the usefulness of protein structure models.

    KAUST Repository

    Carbajo, Daniel

    2012-08-02

    BACKGROUND: Increasingly, biologists and biochemists use computational tools to design experiments to probe the function of proteins and/or to engineer them for a variety of different purposes. The most effective strategies rely on the knowledge of the three-dimensional structure of the protein of interest. However it is often the case that an experimental structure is not available and that models of different quality are used instead. On the other hand, the relationship between the quality of a model and its appropriate use is not easy to derive in general, and so far it has been analyzed in detail only for specific application. RESULTS: This paper describes a database and related software tools that allow testing of a given structure based method on models of a protein representing different levels of accuracy. The comparison of the results of a computational experiment on the experimental structure and on a set of its decoy models will allow developers and users to assess which is the specific threshold of accuracy required to perform the task effectively. CONCLUSIONS: The ModelDB server automatically builds decoy models of different accuracy for a given protein of known structure and provides a set of useful tools for their analysis. Pre-computed data for a non-redundant set of deposited protein structures are available for analysis and download in the ModelDB database. IMPLEMENTATION, AVAILABILITY AND REQUIREMENTS: Project name: A resource for benchmarking the usefulness of protein structure models. Project home page: http://bl210.caspur.it/MODEL-DB/MODEL-DB_web/MODindex.php.Operating system(s): Platform independent. Programming language: Perl-BioPerl (program); mySQL, Perl DBI and DBD modules (database); php, JavaScript, Jmol scripting (web server). Other requirements: Java Runtime Environment v1.4 or later, Perl, BioPerl, CPAN modules, HHsearch, Modeller, LGA, NCBI Blast package, DSSP, Speedfill (Surfnet) and PSAIA. License: Free. Any restrictions to use by

  19. A resource for benchmarking the usefulness of protein structure models.

    KAUST Repository

    Carbajo, Daniel; Tramontano, Anna

    2012-01-01

    BACKGROUND: Increasingly, biologists and biochemists use computational tools to design experiments to probe the function of proteins and/or to engineer them for a variety of different purposes. The most effective strategies rely on the knowledge of the three-dimensional structure of the protein of interest. However it is often the case that an experimental structure is not available and that models of different quality are used instead. On the other hand, the relationship between the quality of a model and its appropriate use is not easy to derive in general, and so far it has been analyzed in detail only for specific application. RESULTS: This paper describes a database and related software tools that allow testing of a given structure based method on models of a protein representing different levels of accuracy. The comparison of the results of a computational experiment on the experimental structure and on a set of its decoy models will allow developers and users to assess which is the specific threshold of accuracy required to perform the task effectively. CONCLUSIONS: The ModelDB server automatically builds decoy models of different accuracy for a given protein of known structure and provides a set of useful tools for their analysis. Pre-computed data for a non-redundant set of deposited protein structures are available for analysis and download in the ModelDB database. IMPLEMENTATION, AVAILABILITY AND REQUIREMENTS: Project name: A resource for benchmarking the usefulness of protein structure models. Project home page: http://bl210.caspur.it/MODEL-DB/MODEL-DB_web/MODindex.php.Operating system(s): Platform independent. Programming language: Perl-BioPerl (program); mySQL, Perl DBI and DBD modules (database); php, JavaScript, Jmol scripting (web server). Other requirements: Java Runtime Environment v1.4 or later, Perl, BioPerl, CPAN modules, HHsearch, Modeller, LGA, NCBI Blast package, DSSP, Speedfill (Surfnet) and PSAIA. License: Free. Any restrictions to use by

  20. Hagfish slime threads as a biomimetic model for high performance protein fibres

    International Nuclear Information System (INIS)

    Fudge, Douglas S; Hillis, Sonja; Levy, Nimrod; Gosline, John M

    2010-01-01

    Textile manufacturing is one of the largest industries in the world, and synthetic fibres represent two-thirds of the global textile market. Synthetic fibres are manufactured from petroleum-based feedstocks, which are becoming increasingly expensive as demand for finite petroleum reserves continues to rise. For the last three decades, spider silks have been held up as a model that could inspire the production of protein fibres exhibiting high performance and ecological sustainability, but unfortunately, artificial spider silks have yet to fulfil this promise. Previous work on the biomechanics of protein fibres from the slime of hagfishes suggests that these fibres might be a superior biomimetic model to spider silks. Based on the fact that the proteins within these 'slime threads' adopt conformations that are similar to those in spider silks when they are stretched, we hypothesized that draw processing of slime threads should yield fibres that are comparable to spider dragline silk in their mechanical performance. Here we show that draw-processed slime threads are indeed exceptionally strong and tough. We also show that post-drawing steps such as annealing, dehydration and covalent cross-linking can dramatically improve the long-term dimensional stability of the threads. The data presented here suggest that hagfish slime threads are a model that should be pursued in the quest to produce fibres that are ecologically sustainable and economically viable.

  1. Preclinical models used for immunogenicity prediction of therapeutic proteins.

    Science.gov (United States)

    Brinks, Vera; Weinbuch, Daniel; Baker, Matthew; Dean, Yann; Stas, Philippe; Kostense, Stefan; Rup, Bonita; Jiskoot, Wim

    2013-07-01

    All therapeutic proteins are potentially immunogenic. Antibodies formed against these drugs can decrease efficacy, leading to drastically increased therapeutic costs and in rare cases to serious and sometimes life threatening side-effects. Many efforts are therefore undertaken to develop therapeutic proteins with minimal immunogenicity. For this, immunogenicity prediction of candidate drugs during early drug development is essential. Several in silico, in vitro and in vivo models are used to predict immunogenicity of drug leads, to modify potentially immunogenic properties and to continue development of drug candidates with expected low immunogenicity. Despite the extensive use of these predictive models, their actual predictive value varies. Important reasons for this uncertainty are the limited/insufficient knowledge on the immune mechanisms underlying immunogenicity of therapeutic proteins, the fact that different predictive models explore different components of the immune system and the lack of an integrated clinical validation. In this review, we discuss the predictive models in use, summarize aspects of immunogenicity that these models predict and explore the merits and the limitations of each of the models.

  2. Model test on the relationship feed energy and protein ratio to the production and quality of milk protein

    Science.gov (United States)

    Hartanto, R.; Jantra, M. A. C.; Santosa, S. A. B.; Purnomoadi, A.

    2018-01-01

    The purpose of this research was to find an appropriate relationship model between the feed energy and protein ratio with the amount of production and quality of milk proteins. This research was conducted at Getasan Sub-district, Semarang Regency, Central Java Province, Indonesia using 40 samples (Holstein Friesian cattle, lactation period II-III and lactation month 3-4). Data were analyzed using linear and quadratic regressions, to predict the production and quality of milk protein from feed energy and protein ratio that describe the diet. The significance of model was tested using analysis of variance. Coefficient of determination (R2), residual variance (RV) and root mean square prediction error (RMSPE) were reported for the developed equations as an indicator of the goodness of model fit. The results showed no relationship in milk protein (kg), milk casein (%), milk casein (kg) and milk urea N (mg/dl) as function of CP/TDN. The significant relationship was observed in milk production (L or kg) and milk protein (%) as function of CP/TDN, both in linear and quadratic models. In addition, a quadratic change in milk production (L) (P = 0.003), milk production (kg) (P = 0.003) and milk protein concentration (%) (P = 0.026) were observed with increase of CP/TDN. It can be concluded that quadratic equation was the good fitting model for this research, because quadratic equation has larger R2, smaller RV and smaller RMSPE than those of linear equation.

  3. Probing the role of interfacial waters in protein-DNA recognition using a hybrid implicit/explicit solvation model

    Science.gov (United States)

    Li, Shen; Bradley, Philip

    2013-01-01

    When proteins bind to their DNA target sites, ordered water molecules are often present at the protein-DNA interface bridging protein and DNA through hydrogen bonds. What is the role of these ordered interfacial waters? Are they important determinants of the specificity of DNA sequence recognition, or do they act in binding in a primarily non-specific manner, by improving packing of the interface, shielding unfavorable electrostatic interactions, and solvating unsatisfied polar groups that are inaccessible to bulk solvent? When modeling details of structure and binding preferences, can fully implicit solvent models be fruitfully applied to protein-DNA interfaces, or must the individualistic properties of these interfacial waters be accounted for? To address these questions, we have developed a hybrid implicit/explicit solvation model that specifically accounts for the locations and orientations of small numbers of DNA-bound water molecules while treating the majority of the solvent implicitly. Comparing the performance of this model to its fully implicit counterpart, we find that explicit treatment of interfacial waters results in a modest but significant improvement in protein sidechain placement and DNA sequence recovery. Base-by-base comparison of the performance of the two models highlights DNA sequence positions whose recognition may be dependent on interfacial water. Our study offers large-scale statistical evidence for the role of ordered water for protein DNA recognition, together with detailed examination of several well-characterized systems. In addition, our approach provides a template for modeling explicit water molecules at interfaces that should be extensible to other systems. PMID:23444044

  4. Protein structure analysis using the resonant recognition model and wavelet transforms

    International Nuclear Information System (INIS)

    Fang, Q.; Cosic, I.

    1998-01-01

    An approach based on the resonant recognition model and the discrete wavelet transform is introduced here for characterising proteins' biological function. The protein sequence is converted into a numerical series by assigning the electron-ion interaction potential to each amino acid from N-terminal to C-terminal. A set of peaks is found after performing a wavelet transform onto a numerical series representing a group of homologous proteins. These peaks are related to protein structural and functional properties and named characteristic vector of that protein group. Further more, the amino acids contributing mostly to a protein's biological functions, the so-called 'hot spots' amino acids, are predicted by the continuous wavelet transform. It is found that the hot spots are clustered around the protein's cleft structure. The wavelets approach provides a novel methods for amino acid sequence analysis as well as an expansion for the newly established macromolecular interaction model: the resonant recognition model. Copyright (1998) Australasian Physical and Engineering Sciences in Medicine

  5. In silico modeling and experimental evidence of coagulant protein interaction with precursors for nanoparticle functionalization.

    Science.gov (United States)

    Okoli, Chuka; Sengottaiyan, Selvaraj; Arul Murugan, N; Pavankumar, Asalapuram R; Agren, Hans; Kuttuva Rajarao, Gunaratna

    2013-10-01

    The design of novel protein-nanoparticle hybrid systems has applications in many fields of science ranging from biomedicine, catalysis, water treatment, etc. The main barrier in devising such tool is lack of adequate information or poor understanding of protein-ligand chemistry. Here, we establish a new strategy based on computational modeling for protein and precursor linkers that can decorate the nanoparticles. Moringa oleifera (MO2.1) seed protein that has coagulation and antimicrobial properties was used. Superparamagnetic nanoparticles (SPION) with precursor ligands were used for the protein-ligand interaction studies. The molecular docking studies reveal that there are two binding sites, one is located at the core binding site; tetraethoxysilane (TEOS) or 3-aminopropyl trimethoxysilane (APTES) binds to this site while the other one is located at the side chain residues where trisodium citrate (TSC) or Si60 binds to this site. The protein-ligand distance profile analysis explains the differences in functional activity of the decorated SPION. Experimentally, TSC-coated nanoparticles showed higher coagulation activity as compared to TEOS- and APTES-coated SPION. To our knowledge, this is the first report on in vitro experimental data, which endorses the computational modeling studies as a powerful tool to design novel precursors for functionalization of nanomaterials; and develop interface hybrid systems for various applications.

  6. A generative, probabilistic model of local protein structure

    DEFF Research Database (Denmark)

    Boomsma, Wouter; Mardia, Kanti V.; Taylor, Charles C.

    2008-01-01

    Despite significant progress in recent years, protein structure prediction maintains its status as one of the prime unsolved problems in computational biology. One of the key remaining challenges is an efficient probabilistic exploration of the structural space that correctly reflects the relative...... conformational stabilities. Here, we present a fully probabilistic, continuous model of local protein structure in atomic detail. The generative model makes efficient conformational sampling possible and provides a framework for the rigorous analysis of local sequence-structure correlations in the native state...

  7. Biophysics of protein evolution and evolutionary protein biophysics

    Science.gov (United States)

    Sikosek, Tobias; Chan, Hue Sun

    2014-01-01

    The study of molecular evolution at the level of protein-coding genes often entails comparing large datasets of sequences to infer their evolutionary relationships. Despite the importance of a protein's structure and conformational dynamics to its function and thus its fitness, common phylogenetic methods embody minimal biophysical knowledge of proteins. To underscore the biophysical constraints on natural selection, we survey effects of protein mutations, highlighting the physical basis for marginal stability of natural globular proteins and how requirement for kinetic stability and avoidance of misfolding and misinteractions might have affected protein evolution. The biophysical underpinnings of these effects have been addressed by models with an explicit coarse-grained spatial representation of the polypeptide chain. Sequence–structure mappings based on such models are powerful conceptual tools that rationalize mutational robustness, evolvability, epistasis, promiscuous function performed by ‘hidden’ conformational states, resolution of adaptive conflicts and conformational switches in the evolution from one protein fold to another. Recently, protein biophysics has been applied to derive more accurate evolutionary accounts of sequence data. Methods have also been developed to exploit sequence-based evolutionary information to predict biophysical behaviours of proteins. The success of these approaches demonstrates a deep synergy between the fields of protein biophysics and protein evolution. PMID:25165599

  8. Dynameomics: data-driven methods and models for utilizing large-scale protein structure repositories for improving fragment-based loop prediction.

    Science.gov (United States)

    Rysavy, Steven J; Beck, David A C; Daggett, Valerie

    2014-11-01

    Protein function is intimately linked to protein structure and dynamics yet experimentally determined structures frequently omit regions within a protein due to indeterminate data, which is often due protein dynamics. We propose that atomistic molecular dynamics simulations provide a diverse sampling of biologically relevant structures for these missing segments (and beyond) to improve structural modeling and structure prediction. Here we make use of the Dynameomics data warehouse, which contains simulations of representatives of essentially all known protein folds. We developed novel computational methods to efficiently identify, rank and retrieve small peptide structures, or fragments, from this database. We also created a novel data model to analyze and compare large repositories of structural data, such as contained within the Protein Data Bank and the Dynameomics data warehouse. Our evaluation compares these structural repositories for improving loop predictions and analyzes the utility of our methods and models. Using a standard set of loop structures, containing 510 loops, 30 for each loop length from 4 to 20 residues, we find that the inclusion of Dynameomics structures in fragment-based methods improves the quality of the loop predictions without being dependent on sequence homology. Depending on loop length, ∼ 25-75% of the best predictions came from the Dynameomics set, resulting in lower main chain root-mean-square deviations for all fragment lengths using the combined fragment library. We also provide specific cases where Dynameomics fragments provide better predictions for NMR loop structures than fragments from crystal structures. Online access to these fragment libraries is available at http://www.dynameomics.org/fragments. © 2014 The Protein Society.

  9. Thermodynamics of protein folding using a modified Wako-Saitô-Muñoz-Eaton model.

    Science.gov (United States)

    Tsai, Min-Yeh; Yuan, Jian-Min; Teranishi, Yoshiaki; Lin, Sheng Hsien

    2012-09-01

    Herein, we propose a modified version of the Wako-Saitô-Muñoz-Eaton (WSME) model. The proposed model introduces an empirical temperature parameter for the hypothetical structural units (i.e., foldons) in proteins to include site-dependent thermodynamic behavior. The thermodynamics for both our proposed model and the original WSME model were investigated. For a system with beta-hairpin topology, a mathematical treatment (contact-pair treatment) to facilitate the calculation of its partition function was developed. The results show that the proposed model provides better insight into the site-dependent thermodynamic behavior of the system, compared with the original WSME model. From this site-dependent point of view, the relationship between probe-dependent experimental results and model's thermodynamic predictions can be explained. The model allows for suggesting a general principle to identify foldon behavior. We also find that the backbone hydrogen bonds may play a role of structural constraints in modulating the cooperative system. Thus, our study may contribute to the understanding of the fundamental principles for the thermodynamics of protein folding.

  10. Conformational Sampling in Template-Free Protein Loop Structure Modeling: An Overview

    OpenAIRE

    Li, Yaohang

    2013-01-01

    Accurately modeling protein loops is an important step to predict three-dimensional structures as well as to understand functions of many proteins. Because of their high flexibility, modeling the three-dimensional structures of loops is difficult and is usually treated as a “mini protein folding problem” under geometric constraints. In the past decade, there has been remarkable progress in template-free loop structure modeling due to advances of computational methods as well as stably increas...

  11. Comparative proteome analysis of cryopreserved flagella and head plasma membrane proteins from sea bream spermatozoa: effect of antifreeze proteins.

    Science.gov (United States)

    Zilli, Loredana; Beirão, José; Schiavone, Roberta; Herraez, Maria Paz; Gnoni, Antonio; Vilella, Sebastiano

    2014-01-01

    Cryopreservation induces injuries to fish spermatozoa that in turn affect sperm quality in terms of fertilization ability, motility, DNA and protein integrity and larval survival. To reduce the loss of sperm quality due to freezing-thawing, it is necessary to improve these procedures. In the present study we investigated the ability of two antifreeze proteins (AFPI and AFPIII) to reduce the loss of quality of sea bream spermatozoa due to cryopreservation. To do so, we compared viability, motility, straight-line velocity and curvilinear velocity of fresh and (AFPs)-cryopreserved spermatozoa. AFPIII addition to cryopreservation medium improved viability, motility and straight-line velocity with respect to DMSO or DMSO plus AFPI. To clarify the molecular mechanism(s) underlying these findings, the protein profile of two different cryopreserved sperm domains, flagella and head plasma membranes, was analysed. The protein profiles differed between fresh and frozen-thawed semen and results of the image analysis demonstrated that, after cryopreservation, out of 270 proteins 12 were decreased and 7 were increased in isolated flagella, and out of 150 proteins 6 showed a significant decrease and 4 showed a significant increase in head membranes. Mass spectrometry analysis identified 6 proteins (4 from isolated flagella and 2 present both in flagella and head plasma membranes) within the protein spots affected by the freezing-thawing procedure. 3 out of 4 proteins from isolated flagella were involved in the sperm bioenergetic system. Our results indicate that the ability of AFPIII to protect sea bream sperm quality can be, at least in part, ascribed to reducing changes in the sperm protein profile occurring during the freezing-thawing procedure. Our results clearly demonstrated that AFPIII addition to cryopreservation medium improved the protection against freezing respect to DMSO or DMSO plus AFPI. In addition we propose specific proteins of spermatozoa as markers related to

  12. Protein folding optimization based on 3D off-lattice model via an improved artificial bee colony algorithm.

    Science.gov (United States)

    Li, Bai; Lin, Mu; Liu, Qiao; Li, Ya; Zhou, Changjun

    2015-10-01

    Protein folding is a fundamental topic in molecular biology. Conventional experimental techniques for protein structure identification or protein folding recognition require strict laboratory requirements and heavy operating burdens, which have largely limited their applications. Alternatively, computer-aided techniques have been developed to optimize protein structures or to predict the protein folding process. In this paper, we utilize a 3D off-lattice model to describe the original protein folding scheme as a simplified energy-optimal numerical problem, where all types of amino acid residues are binarized into hydrophobic and hydrophilic ones. We apply a balance-evolution artificial bee colony (BE-ABC) algorithm as the minimization solver, which is featured by the adaptive adjustment of search intensity to cater for the varying needs during the entire optimization process. In this work, we establish a benchmark case set with 13 real protein sequences from the Protein Data Bank database and evaluate the convergence performance of BE-ABC algorithm through strict comparisons with several state-of-the-art ABC variants in short-term numerical experiments. Besides that, our obtained best-so-far protein structures are compared to the ones in comprehensive previous literature. This study also provides preliminary insights into how artificial intelligence techniques can be applied to reveal the dynamics of protein folding. Graphical Abstract Protein folding optimization using 3D off-lattice model and advanced optimization techniques.

  13. AptRank: an adaptive PageRank model for protein function prediction on   bi-relational graphs.

    Science.gov (United States)

    Jiang, Biaobin; Kloster, Kyle; Gleich, David F; Gribskov, Michael

    2017-06-15

    Diffusion-based network models are widely used for protein function prediction using protein network data and have been shown to outperform neighborhood-based and module-based methods. Recent studies have shown that integrating the hierarchical structure of the Gene Ontology (GO) data dramatically improves prediction accuracy. However, previous methods usually either used the GO hierarchy to refine the prediction results of multiple classifiers, or flattened the hierarchy into a function-function similarity kernel. No study has taken the GO hierarchy into account together with the protein network as a two-layer network model. We first construct a Bi-relational graph (Birg) model comprised of both protein-protein association and function-function hierarchical networks. We then propose two diffusion-based methods, BirgRank and AptRank, both of which use PageRank to diffuse information on this two-layer graph model. BirgRank is a direct application of traditional PageRank with fixed decay parameters. In contrast, AptRank utilizes an adaptive diffusion mechanism to improve the performance of BirgRank. We evaluate the ability of both methods to predict protein function on yeast, fly and human protein datasets, and compare with four previous methods: GeneMANIA, TMC, ProteinRank and clusDCA. We design four different validation strategies: missing function prediction, de novo function prediction, guided function prediction and newly discovered function prediction to comprehensively evaluate predictability of all six methods. We find that both BirgRank and AptRank outperform the previous methods, especially in missing function prediction when using only 10% of the data for training. The MATLAB code is available at https://github.rcac.purdue.edu/mgribsko/aptrank . gribskov@purdue.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  14. Comparative genomics evidence that only protein toxins are tagging bad bugs

    Directory of Open Access Journals (Sweden)

    Kalliopi eGeorgiades

    2011-10-01

    Full Text Available The term toxin was introduced by Roux and Yersin and describes macromolecular substances that, when produced during infection or when introduced parenterally or orally, cause an impairment of physiological functions that lead to disease or to the death of the infected organism. Long after the discovery of toxins, early genetic studies on bacterial virulence demonstrated that removing a certain number of genes from pathogenic bacteria decreases their capacity to infect hosts. Each of the removed factors was therefore referred to as a virulence factor, and it was speculated that non-pathogenic bacteria lack such supplementary factors. However, many recent comparative studies demonstrate that the specialization of bacteria to eukaryotic hosts is associated with massive gene loss. We recently demonstrated that the only features that seem to characterize 12 epidemic bacteria are toxin-antitoxin (TA modules, which are addiction molecules in host bacteria. In this study, we investigated if protein toxins are indeed the only molecules specific to pathogenic bacteria by comparing 14 epidemic bacterial killers (bad bugs with their 14 closest non-epidemic relatives (controls. We found protein toxins in significantly more elevated numbers in all of the bad bugs. For the first time, statistical principal components analysis, including genome size, GC%, TA modules, restriction enzymes and toxins, revealed that toxins are the only proteins other than TA modules that are correlated with the pathogenic character of bacteria. Moreover, intracellular toxins appear to be more correlated with the pathogenic character of bacteria than secreted toxins. In conclusion, we hypothesize that the only truly identifiable phenomena, witnessing the convergent evolution of the most pathogenic bacteria for humans are the loss of metabolic activities, i.e., the outcome of the loss of regulatory and transcription factors and the presence of protein toxins, alone or coupled as TA

  15. Model of a DNA-protein complex of the architectural monomeric protein MC1 from Euryarchaea.

    Directory of Open Access Journals (Sweden)

    Françoise Paquet

    Full Text Available In Archaea the two major modes of DNA packaging are wrapping by histone proteins or bending by architectural non-histone proteins. To supplement our knowledge about the binding mode of the different DNA-bending proteins observed across the three domains of life, we present here the first model of a complex in which the monomeric Methanogen Chromosomal protein 1 (MC1 from Euryarchaea binds to the concave side of a strongly bent DNA. In laboratory growth conditions MC1 is the most abundant architectural protein present in Methanosarcina thermophila CHTI55. Like most proteins that strongly bend DNA, MC1 is known to bind in the minor groove. Interaction areas for MC1 and DNA were mapped by Nuclear Magnetic Resonance (NMR data. The polarity of protein binding was determined using paramagnetic probes attached to the DNA. The first structural model of the DNA-MC1 complex we propose here was obtained by two complementary docking approaches and is in good agreement with the experimental data previously provided by electron microscopy and biochemistry. Residues essential to DNA-binding and -bending were highlighted and confirmed by site-directed mutagenesis. It was found that the Arg25 side-chain was essential to neutralize the negative charge of two phosphates that come very close in response to a dramatic curvature of the DNA.

  16. Protein homology model refinement by large-scale energy optimization.

    Science.gov (United States)

    Park, Hahnbeom; Ovchinnikov, Sergey; Kim, David E; DiMaio, Frank; Baker, David

    2018-03-20

    Proteins fold to their lowest free-energy structures, and hence the most straightforward way to increase the accuracy of a partially incorrect protein structure model is to search for the lowest-energy nearby structure. This direct approach has met with little success for two reasons: first, energy function inaccuracies can lead to false energy minima, resulting in model degradation rather than improvement; and second, even with an accurate energy function, the search problem is formidable because the energy only drops considerably in the immediate vicinity of the global minimum, and there are a very large number of degrees of freedom. Here we describe a large-scale energy optimization-based refinement method that incorporates advances in both search and energy function accuracy that can substantially improve the accuracy of low-resolution homology models. The method refined low-resolution homology models into correct folds for 50 of 84 diverse protein families and generated improved models in recent blind structure prediction experiments. Analyses of the basis for these improvements reveal contributions from both the improvements in conformational sampling techniques and the energy function.

  17. Collective estimation of multiple bivariate density functions with application to angular-sampling-based protein loop modeling

    KAUST Repository

    Maadooliat, Mehdi

    2015-10-21

    This paper develops a method for simultaneous estimation of density functions for a collection of populations of protein backbone angle pairs using a data-driven, shared basis that is constructed by bivariate spline functions defined on a triangulation of the bivariate domain. The circular nature of angular data is taken into account by imposing appropriate smoothness constraints across boundaries of the triangles. Maximum penalized likelihood is used to fit the model and an alternating blockwise Newton-type algorithm is developed for computation. A simulation study shows that the collective estimation approach is statistically more efficient than estimating the densities individually. The proposed method was used to estimate neighbor-dependent distributions of protein backbone dihedral angles (i.e., Ramachandran distributions). The estimated distributions were applied to protein loop modeling, one of the most challenging open problems in protein structure prediction, by feeding them into an angular-sampling-based loop structure prediction framework. Our estimated distributions compared favorably to the Ramachandran distributions estimated by fitting a hierarchical Dirichlet process model; and in particular, our distributions showed significant improvements on the hard cases where existing methods do not work well.

  18. Collective estimation of multiple bivariate density functions with application to angular-sampling-based protein loop modeling

    KAUST Repository

    Maadooliat, Mehdi; Zhou, Lan; Najibi, Seyed Morteza; Gao, Xin; Huang, Jianhua Z.

    2015-01-01

    This paper develops a method for simultaneous estimation of density functions for a collection of populations of protein backbone angle pairs using a data-driven, shared basis that is constructed by bivariate spline functions defined on a triangulation of the bivariate domain. The circular nature of angular data is taken into account by imposing appropriate smoothness constraints across boundaries of the triangles. Maximum penalized likelihood is used to fit the model and an alternating blockwise Newton-type algorithm is developed for computation. A simulation study shows that the collective estimation approach is statistically more efficient than estimating the densities individually. The proposed method was used to estimate neighbor-dependent distributions of protein backbone dihedral angles (i.e., Ramachandran distributions). The estimated distributions were applied to protein loop modeling, one of the most challenging open problems in protein structure prediction, by feeding them into an angular-sampling-based loop structure prediction framework. Our estimated distributions compared favorably to the Ramachandran distributions estimated by fitting a hierarchical Dirichlet process model; and in particular, our distributions showed significant improvements on the hard cases where existing methods do not work well.

  19. Modeling on-column reduction of trisulfide bonds in monoclonal antibodies during protein A chromatography.

    Science.gov (United States)

    Ghose, Sanchayita; Rajshekaran, Rupshika; Labanca, Marisa; Conley, Lynn

    2017-01-06

    Trisulfides can be a common post-translational modification in many recombinant monoclonal antibodies. These are a source of product heterogeneity that add to the complexity of product characterization and hence, need to be reduced for consistent product quality. Trisulfide bonds can be converted to the regular disulfide bonds by incorporating a novel cysteine wash step during Protein A affinity chromatography. An empirical model is developed for this on-column reduction reaction to compare the reaction rates as a function of typical operating parameters such as temperature, cysteine concentration, reaction time and starting level of trisulfides. The model presented here is anticipated to assist in the development of optimal wash conditions for the Protein A step to effectively reduce trisulfides to desired levels. Copyright © 2016 Elsevier B.V. All rights reserved.

  20. The research methods and model of protein turnover in animal

    International Nuclear Information System (INIS)

    Wu Xilin; Yang Feng

    2002-01-01

    The author discussed the concept and research methods of protein turnover in animal body. The existing problems and the research results of animal protein turnover in recent years were presented. Meanwhile, the measures to improve the models of animal protein turnover were analyzed

  1. A simple quantitative model of macromolecular crowding effects on protein folding: Application to the murine prion protein(121-231)

    Science.gov (United States)

    Bergasa-Caceres, Fernando; Rabitz, Herschel A.

    2013-06-01

    A model of protein folding kinetics is applied to study the effects of macromolecular crowding on protein folding rate and stability. Macromolecular crowding is found to promote a decrease of the entropic cost of folding of proteins that produces an increase of both the stability and the folding rate. The acceleration of the folding rate due to macromolecular crowding is shown to be a topology-dependent effect. The model is applied to the folding dynamics of the murine prion protein (121-231). The differential effect of macromolecular crowding as a function of protein topology suffices to make non-native configurations relatively more accessible.

  2. Possibilities of microscopic detection of isolated porcine proteins in model meat products

    Directory of Open Access Journals (Sweden)

    Michaela Petrášová

    2016-05-01

    Full Text Available In recent years, various protein additives intended for manufacture of meat products have increasing importance in the food industry. These ingredients include both, plant-origin as well as animal-origin proteins. Among animal proteins, blood plasma, milk protein or collagen are used most commonly. Collagen is obtained from pork, beef, and poultry or fish skin. Collagen does not contain all the essential amino acids, thus it is not a full protein in terms of essential amino acids supply for one's organism. However, it is rather rich in amino acids of glycine, hydroxyproline and proline which are almost absent in other proteins and their synthesis is very energy intensive. Collagen, which is added to the soft and small meat products in the form of isolated porcine protein, significantly affects the organoleptic properties of these products. This work focused on detection of isolated porcine protein in model meat products where detection of isolated porcine protein was verified by histological staining and light microscopy. Seven model meat products from poultry meat and 7 model meat products from beef and pork in the ratio of 1:1, which contained 2.5% concentration of various commercially produced isolated porcine proteins, were examined. These model meat products were histologically processed by means of cryosections and stained with hematoxylin-eosin staining, toluidine blue staining and Calleja. For the validation phase, Calleja was utilized. To determine the sensitivity and specificity, five model meat products containing the addition of isolated porcine protein and five model meat products free of it were used. The sensitivity was determined for isolated porcine protein at 1.00 and specificity was determined at 1.00. The detection limit of the method was at the level of 0.001% addition. Repeatability of the method was carried out using products with addition as well as without addition of isolated porcine protein and detection was repeated

  3. A linear programming model for protein inference problem in shotgun proteomics.

    Science.gov (United States)

    Huang, Ting; He, Zengyou

    2012-11-15

    Assembling peptides identified from tandem mass spectra into a list of proteins, referred to as protein inference, is an important issue in shotgun proteomics. The objective of protein inference is to find a subset of proteins that are truly present in the sample. Although many methods have been proposed for protein inference, several issues such as peptide degeneracy still remain unsolved. In this article, we present a linear programming model for protein inference. In this model, we use a transformation of the joint probability that each peptide/protein pair is present in the sample as the variable. Then, both the peptide probability and protein probability can be expressed as a formula in terms of the linear combination of these variables. Based on this simple fact, the protein inference problem is formulated as an optimization problem: minimize the number of proteins with non-zero probabilities under the constraint that the difference between the calculated peptide probability and the peptide probability generated from peptide identification algorithms should be less than some threshold. This model addresses the peptide degeneracy issue by forcing some joint probability variables involving degenerate peptides to be zero in a rigorous manner. The corresponding inference algorithm is named as ProteinLP. We test the performance of ProteinLP on six datasets. Experimental results show that our method is competitive with the state-of-the-art protein inference algorithms. The source code of our algorithm is available at: https://sourceforge.net/projects/prolp/. zyhe@dlut.edu.cn. Supplementary data are available at Bioinformatics Online.

  4. Development of a Model Protein Interaction Pair as a Benchmarking Tool for the Quantitative Analysis of 2-Site Protein-Protein Interactions.

    Science.gov (United States)

    Yamniuk, Aaron P; Newitt, John A; Doyle, Michael L; Arisaka, Fumio; Giannetti, Anthony M; Hensley, Preston; Myszka, David G; Schwarz, Fred P; Thomson, James A; Eisenstein, Edward

    2015-12-01

    A significant challenge in the molecular interaction field is to accurately determine the stoichiometry and stepwise binding affinity constants for macromolecules having >1 binding site. The mission of the Molecular Interactions Research Group (MIRG) of the Association of Biomolecular Resource Facilities (ABRF) is to show how biophysical technologies are used to quantitatively characterize molecular interactions, and to educate the ABRF members and scientific community on the utility and limitations of core technologies [such as biosensor, microcalorimetry, or analytic ultracentrifugation (AUC)]. In the present work, the MIRG has developed a robust model protein interaction pair consisting of a bivalent variant of the Bacillus amyloliquefaciens extracellular RNase barnase and a variant of its natural monovalent intracellular inhibitor protein barstar. It is demonstrated that this system can serve as a benchmarking tool for the quantitative analysis of 2-site protein-protein interactions. The protein interaction pair enables determination of precise binding constants for the barstar protein binding to 2 distinct sites on the bivalent barnase binding partner (termed binase), where the 2 binding sites were engineered to possess affinities that differed by 2 orders of magnitude. Multiple MIRG laboratories characterized the interaction using isothermal titration calorimetry (ITC), AUC, and surface plasmon resonance (SPR) methods to evaluate the feasibility of the system as a benchmarking model. Although general agreement was seen for the binding constants measured using solution-based ITC and AUC approaches, weaker affinity was seen for surface-based method SPR, with protein immobilization likely affecting affinity. An analysis of the results from multiple MIRG laboratories suggests that the bivalent barnase-barstar system is a suitable model for benchmarking new approaches for the quantitative characterization of complex biomolecular interactions.

  5. Exploring the function of protein kinases in schistosomes: perspectives from the laboratory and from comparative genomics

    Directory of Open Access Journals (Sweden)

    Anthony John Walker

    2014-07-01

    Full Text Available Eukaryotic protein kinases are well conserved through evolution. The genome of Schistosoma mansoni, which causes intestinal schistosomiasis, encodes over 250 putative protein kinases with all of the main eukaryotic groups represented. However, unraveling functional roles for these kinases is a considerable endeavour, particularly as protein kinases regulate multiple and sometimes overlapping cell and tissue functions in organisms. In this article, elucidating protein kinase signal transduction and function in schistosomes is considered from the perspective of the state-of-the-art methodologies used and comparative organismal biology, with a focus on current advances and future directions. Using the free-living nematode Caenorhabditis elegans as a comparator we predict roles for various schistosome protein kinases in processes vital for host invasion and successful parasitism such as sensory behaviour, growth and development. It is anticipated that the characterization of schistosome protein kinases in the context of parasite function will catalyze cutting edge research into host-parasite interactions and will reveal new targets for developing drug interventions against human schistosomiasis.

  6. Extraction of Protein Interaction Data: A Comparative Analysis of Methods in Use

    Directory of Open Access Journals (Sweden)

    Jose Hena

    2007-01-01

    Full Text Available Several natural language processing tools, both commercial and freely available, are used to extract protein interactions from publications. Methods used by these tools include pattern matching to dynamic programming with individual recall and precision rates. A methodical survey of these tools, keeping in mind the minimum interaction information a researcher would need, in comparison to manual analysis has not been carried out. We compared data generated using some of the selected NLP tools with manually curated protein interaction data (PathArt and IMaps to comparatively determine the recall and precision rate. The rates were found to be lower than the published scores when a normalized definition for interaction is considered. Each data point captured wrongly or not picked up by the tool was analyzed. Our evaluation brings forth critical failures of NLP tools and provides pointers for the development of an ideal NLP tool.

  7. Calculation of accurate small angle X-ray scattering curves from coarse-grained protein models

    Directory of Open Access Journals (Sweden)

    Stovgaard Kasper

    2010-08-01

    Full Text Available Abstract Background Genome sequencing projects have expanded the gap between the amount of known protein sequences and structures. The limitations of current high resolution structure determination methods make it unlikely that this gap will disappear in the near future. Small angle X-ray scattering (SAXS is an established low resolution method for routinely determining the structure of proteins in solution. The purpose of this study is to develop a method for the efficient calculation of accurate SAXS curves from coarse-grained protein models. Such a method can for example be used to construct a likelihood function, which is paramount for structure determination based on statistical inference. Results We present a method for the efficient calculation of accurate SAXS curves based on the Debye formula and a set of scattering form factors for dummy atom representations of amino acids. Such a method avoids the computationally costly iteration over all atoms. We estimated the form factors using generated data from a set of high quality protein structures. No ad hoc scaling or correction factors are applied in the calculation of the curves. Two coarse-grained representations of protein structure were investigated; two scattering bodies per amino acid led to significantly better results than a single scattering body. Conclusion We show that the obtained point estimates allow the calculation of accurate SAXS curves from coarse-grained protein models. The resulting curves are on par with the current state-of-the-art program CRYSOL, which requires full atomic detail. Our method was also comparable to CRYSOL in recognizing native structures among native-like decoys. As a proof-of-concept, we combined the coarse-grained Debye calculation with a previously described probabilistic model of protein structure, TorusDBN. This resulted in a significant improvement in the decoy recognition performance. In conclusion, the presented method shows great promise for

  8. NASCENT: an automatic protein interaction network generation tool for non-model organisms.

    Science.gov (United States)

    Banky, Daniel; Ordog, Rafael; Grolmusz, Vince

    2009-04-24

    Large quantity of reliable protein interaction data are available for model organisms in public depositories (e.g., MINT, DIP, HPRD, INTERACT). Most data correspond to experiments with the proteins of Saccharomyces cerevisiae, Drosophila melanogaster, Homo sapiens, Caenorhabditis elegans, Escherichia coli and Mus musculus. For other important organisms the data availability is poor or non-existent. Here we present NASCENT, a completely automatic web-based tool and also a downloadable Java program, capable of modeling and generating protein interaction networks even for non-model organisms. The tool performs protein interaction network modeling through gene-name mapping, and outputs the resulting network in graphical form and also in computer-readable graph-forms, directly applicable by popular network modeling software. http://nascent.pitgroup.org.

  9. Dynameomics: Data-driven methods and models for utilizing large-scale protein structure repositories for improving fragment-based loop prediction

    Science.gov (United States)

    Rysavy, Steven J; Beck, David AC; Daggett, Valerie

    2014-01-01

    Protein function is intimately linked to protein structure and dynamics yet experimentally determined structures frequently omit regions within a protein due to indeterminate data, which is often due protein dynamics. We propose that atomistic molecular dynamics simulations provide a diverse sampling of biologically relevant structures for these missing segments (and beyond) to improve structural modeling and structure prediction. Here we make use of the Dynameomics data warehouse, which contains simulations of representatives of essentially all known protein folds. We developed novel computational methods to efficiently identify, rank and retrieve small peptide structures, or fragments, from this database. We also created a novel data model to analyze and compare large repositories of structural data, such as contained within the Protein Data Bank and the Dynameomics data warehouse. Our evaluation compares these structural repositories for improving loop predictions and analyzes the utility of our methods and models. Using a standard set of loop structures, containing 510 loops, 30 for each loop length from 4 to 20 residues, we find that the inclusion of Dynameomics structures in fragment-based methods improves the quality of the loop predictions without being dependent on sequence homology. Depending on loop length, ∼25–75% of the best predictions came from the Dynameomics set, resulting in lower main chain root-mean-square deviations for all fragment lengths using the combined fragment library. We also provide specific cases where Dynameomics fragments provide better predictions for NMR loop structures than fragments from crystal structures. Online access to these fragment libraries is available at http://www.dynameomics.org/fragments. PMID:25142412

  10. DeepQA: Improving the estimation of single protein model quality with deep belief networks

    OpenAIRE

    Cao, Renzhi; Bhattacharya, Debswapna; Hou, Jie; Cheng, Jianlin

    2016-01-01

    Background Protein quality assessment (QA) useful for ranking and selecting protein models has long been viewed as one of the major challenges for protein tertiary structure prediction. Especially, estimating the quality of a single protein model, which is important for selecting a few good models out of a large model pool consisting of mostly low-quality models, is still a largely unsolved problem. Results We introduce a novel single-model quality assessment method DeepQA based on deep belie...

  11. Fast Proton Titration Scheme for Multiscale Modeling of Protein Solutions.

    Science.gov (United States)

    Teixeira, Andre Azevedo Reis; Lund, Mikael; da Silva, Fernando Luís Barroso

    2010-10-12

    Proton exchange between titratable amino acid residues and the surrounding solution gives rise to exciting electric processes in proteins. We present a proton titration scheme for studying acid-base equilibria in Metropolis Monte Carlo simulations where salt is treated at the Debye-Hückel level. The method, rooted in the Kirkwood model of impenetrable spheres, is applied on the three milk proteins α-lactalbumin, β-lactoglobulin, and lactoferrin, for which we investigate the net-charge, molecular dipole moment, and charge capacitance. Over a wide range of pH and salt conditions, excellent agreement is found with more elaborate simulations where salt is explicitly included. The implicit salt scheme is orders of magnitude faster than the explicit analog and allows for transparent interpretation of physical mechanisms. It is shown how the method can be expanded to multiscale modeling of aqueous salt solutions of many biomolecules with nonstatic charge distributions. Important examples are protein-protein aggregation, protein-polyelectrolyte complexation, and protein-membrane association.

  12. ProDis-ContSHC: learning protein dissimilarity measures and hierarchical context coherently for protein-protein comparison in protein database retrieval.

    Science.gov (United States)

    Wang, Jingyan; Gao, Xin; Wang, Quanquan; Li, Yongping

    2012-05-08

    The need to retrieve or classify protein molecules using structure or sequence-based similarity measures underlies a wide range of biomedical applications. Traditional protein search methods rely on a pairwise dissimilarity/similarity measure for comparing a pair of proteins. This kind of pairwise measures suffer from the limitation of neglecting the distribution of other proteins and thus cannot satisfy the need for high accuracy of the retrieval systems. Recent work in the machine learning community has shown that exploiting the global structure of the database and learning the contextual dissimilarity/similarity measures can improve the retrieval performance significantly. However, most existing contextual dissimilarity/similarity learning algorithms work in an unsupervised manner, which does not utilize the information of the known class labels of proteins in the database. In this paper, we propose a novel protein-protein dissimilarity learning algorithm, ProDis-ContSHC. ProDis-ContSHC regularizes an existing dissimilarity measure dij by considering the contextual information of the proteins. The context of a protein is defined by its neighboring proteins. The basic idea is, for a pair of proteins (i, j), if their context N(i) and N(j) is similar to each other, the two proteins should also have a high similarity. We implement this idea by regularizing dij by a factor learned from the context N(i) and N(j).Moreover, we divide the context to hierarchial sub-context and get the contextual dissimilarity vector for each protein pair. Using the class label information of the proteins, we select the relevant (a pair of proteins that has the same class labels) and irrelevant (with different labels) protein pairs, and train an SVM model to distinguish between their contextual dissimilarity vectors. The SVM model is further used to learn a supervised regularizing factor. Finally, with the new Supervised learned Dissimilarity measure, we update the Protein Hierarchial

  13. Genome-scale modeling of the protein secretory machinery in yeast

    DEFF Research Database (Denmark)

    Feizi, Amir; Österlund, Tobias; Petranovic, Dina

    2013-01-01

    The protein secretory machinery in Eukarya is involved in post-translational modification (PTMs) and sorting of the secretory and many transmembrane proteins. While the secretory machinery has been well-studied using classic reductionist approaches, a holistic view of its complex nature is lacking....... Here, we present the first genome-scale model for the yeast secretory machinery which captures the knowledge generated through more than 50 years of research. The model is based on the concept of a Protein Specific Information Matrix (PSIM: characterized by seven PTMs features). An algorithm...

  14. An attempt to understand kidney's protein handling function by comparing plasma and urine proteomes.

    Directory of Open Access Journals (Sweden)

    Lulu Jia

    Full Text Available BACKGROUND: With the help of proteomics technology, the human plasma and urine proteomes, which closely represent the protein compositions of the input and output of the kidney, respectively, have been profiled in much greater detail by different research teams. Many datasets have been accumulated to form "reference profiles" of the plasma and urine proteomes. Comparing these two proteomes may help us understand the protein handling aspect of kidney function in a way, however, which has been unavailable until the recent advances in proteomics technology. METHODOLOGY/PRINCIPAL FINDINGS: After removing secreted proteins downstream of the kidney, 2611 proteins in plasma and 1522 in urine were identified with high confidence and compared based on available proteomic data to generate three subproteomes, the plasma-only subproteome, the plasma-and-urine subproteome, and the urine-only subproteome, and they correspond to three groups of proteins that are handled in three different ways by the kidney. The available experimental molecular weights of the proteins in the three subproteomes were collected and analyzed. Since the functions of the overrepresented proteins in the plasma-and-urine subproteome are probably the major functions that can be routinely regulated by excretion from the kidney in physiological conditions, Gene Ontology term enrichment in the plasma-and-urine subproteome versus the whole plasma proteome was analyzed. Protease activity, calcium and growth factor binding proteins, and coagulation and immune response-related proteins were found to be enriched. CONCLUSION/SIGNIFICANCE: The comparison method described in this paper provides an illustration of a new approach for studying organ functions with a proteomics methodology. Because of its distinctive input (plasma and output (urine, it is reasonable to predict that the kidney will be the first organ whose functions are further elucidated by proteomic methods in the near future. It

  15. A Systematic Review of the Effects of Plant Compared with Animal Protein Sources on Features of Metabolic Syndrome.

    Science.gov (United States)

    Chalvon-Demersay, Tristan; Azzout-Marniche, Dalila; Arfsten, Judith; Egli, Léonie; Gaudichon, Claire; Karagounis, Leonidas G; Tomé, Daniel

    2017-03-01

    Dietary protein may play an important role in the prevention of metabolic dysfunctions. However, the way in which the protein source affects these dysfunctions has not been clearly established. The aim of the current systematic review was to compare the impact of plant- and animal-sourced dietary proteins on several features of metabolic syndrome in humans. The PubMed database was searched for both chronic and acute interventional studies, as well as observational studies, in healthy humans or those with metabolic dysfunctions, in which the impact of animal and plant protein intake was compared while using the following variables: cholesterolemia and triglyceridemia, blood pressure, glucose homeostasis, and body composition. Based on data extraction, we observed that soy protein consumption (with isoflavones), but not soy protein alone (without isoflavones) or other plant proteins (pea and lupine proteins, wheat gluten), leads to a 3% greater decrease in both total and LDL cholesterol compared with animal-sourced protein ingestion, especially in individuals with high fasting cholesterol concentrations. This observation was made when animal proteins were provided as a whole diet rather than given supplementally. Some observational studies reported an inverse association between plant protein intake and systolic and diastolic blood pressure, but this was not confirmed by intervention studies. Moreover, plant protein (wheat gluten, soy protein) intake as part of a mixed meal resulted in a lower postprandial insulin response than did whey. This systematic review provides some evidence that the intake of soy protein associated with isoflavones may prevent the onset of risk factors associated with cardiovascular disease, i.e., hypercholesterolemia and hypertension, in humans. However, we were not able to draw any further conclusions from the present work on the positive effects of plant proteins relating to glucose homeostasis and body composition. © 2017 American

  16. [Comparative investigation of the non-histone proteins of chromatin from pigeon erythroblasts and erythrocytes].

    Science.gov (United States)

    Fedina, A B; Gazarian, G G

    1976-01-01

    Chromosomal non-histone proteins are obtained from nuclei of two types of pigeon erythroid cells: erythroblasts (cells active in RNA synthesis) and erythrocytes (cells with repressed RNA synthesis). They are well soluble in solutions of low ionic strength. Electrophoretic separation of the obtained non-histone proteins in polyacrylamide gels with urea and SDS shows the presence of qualitative differences in the pattern of non-histone proteins of chromatine from erythroblasts and erythrocytes. By electrophoresis in urea some protein bands of non-histone proteins of chromatine from erythroblasts were found which disappear with the aging of cells. At the same time two protein fractions were observed in chromatine from erythrocytes which were absent in that of erythroblasts. Disappearance of some high molecular weight protein fractions from erythrocyte chromatine as compared to erythroblasts was observed by separation of the non-histone proteins in the presence of SDS. These fractions of the non-histone proteins disappearing during aging of cells are well extractable from erythroblast chromatine by 0.35 M NaCl solution. In the in vitro system with E. coli RNA polymerase addition of non-histone proteins of chromatine from erythroblasts to chromatine from erythrocytes increases RNA synthesis 2--3 times. At the same time addition of non-histone proteins from erythrocytes is either without any influence on this process or somewhat inhibiting.

  17. The composition and functional properties of whey protein concentrates produced from buttermilk are comparable with those of whey protein concentrates produced from skimmed milk.

    Science.gov (United States)

    Svanborg, Sigrid; Johansen, Anne-Grethe; Abrahamsen, Roger K; Skeie, Siv B

    2015-09-01

    The demand for whey protein is increasing in the food industry. Traditionally, whey protein concentrates (WPC) and isolates are produced from cheese whey. At present, microfiltration (MF) enables the utilization of whey from skim milk (SM) through milk protein fractionation. This study demonstrates that buttermilk (BM) can be a potential source for the production of a WPC with a comparable composition and functional properties to a WPC obtained by MF of SM. Through the production of WPC powder and a casein- and phospholipid (PL)-rich fraction by the MF of BM, sweet BM may be used in a more optimal and economical way. Sweet cream BM from industrial churning was skimmed before MF with 0.2-µm ceramic membranes at 55 to 58°C. The fractionations of BM and SM were performed under the same conditions using the same process, and the whey protein fractions from BM and SM were concentrated by ultrafiltration and diafiltration. The ultrafiltration and diafiltration was performed at 50°C using pasteurized tap water and a membrane with a 20-kDa cut-off to retain as little lactose as possible in the final WPC powders. The ultrafiltrates were subsequently spray dried, and their functional properties and chemical compositions were compared. The amounts of whey protein and PL in the WPC powder from BM (BMWPC) were comparable to the amounts found in the WPC from SM (SMWPC); however, the composition of the PL classes differed. The BMWPC contained less total protein, casein, and lactose compared with SMWPC, as well as higher contents of fat and citric acid. No difference in protein solubility was observed at pH values of 4.6 and 7.0, and the overrun was the same for BMWPC and SMWPC; however, the BMWPC made less stable foam than SMWPC. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  18. Comparative transcriptional and translational analysis of leptospiral outer membrane protein expression in response to temperature.

    Science.gov (United States)

    Lo, Miranda; Cordwell, Stuart J; Bulach, Dieter M; Adler, Ben

    2009-12-08

    Leptospirosis is a global zoonosis affecting millions of people annually. Transcriptional changes in response to temperature were previously investigated using microarrays to identify genes potentially expressed upon host entry. Past studies found that various leptospiral outer membrane proteins are differentially expressed at different temperatures. However, our microarray studies highlighted a divergence between protein abundance and transcript levels for some proteins. Given the abundance of post-transcriptional expression control mechanisms, this finding highlighted the importance of global protein analysis systems. To complement our previous transcription study, we evaluated differences in the proteins of the leptospiral outer membrane fraction in response to temperature upshift. Outer membrane protein-enriched fractions from Leptospira interrogans grown at 30 degrees C or overnight upshift to 37 degrees C were isolated and the relative abundance of each protein was determined by iTRAQ analysis coupled with two-dimensional liquid chromatography and tandem mass spectrometry (2-DLC/MS-MS). We identified 1026 proteins with 99% confidence; 27 and 66 were present at elevated and reduced abundance respectively. Protein abundance changes were compared with transcriptional differences determined from the microarray studies. While there was some correlation between the microarray and iTRAQ data, a subset of genes that showed no differential expression by microarray was found to encode temperature-regulated proteins. This set of genes is of particular interest as it is likely that regulation of their expression occurs post-transcriptionally, providing an opportunity to develop hypotheses about the molecular dynamics of the outer membrane of Leptospira in response to changing environments. This is the first study to compare transcriptional and translational responses to temperature shift in L. interrogans. The results thus provide an insight into the mechanisms used by L

  19. Comparative transcriptional and translational analysis of leptospiral outer membrane protein expression in response to temperature.

    Directory of Open Access Journals (Sweden)

    Miranda Lo

    Full Text Available BACKGROUND: Leptospirosis is a global zoonosis affecting millions of people annually. Transcriptional changes in response to temperature were previously investigated using microarrays to identify genes potentially expressed upon host entry. Past studies found that various leptospiral outer membrane proteins are differentially expressed at different temperatures. However, our microarray studies highlighted a divergence between protein abundance and transcript levels for some proteins. Given the abundance of post-transcriptional expression control mechanisms, this finding highlighted the importance of global protein analysis systems. METHODOLOGY/PRINCIPAL FINDINGS: To complement our previous transcription study, we evaluated differences in the proteins of the leptospiral outer membrane fraction in response to temperature upshift. Outer membrane protein-enriched fractions from Leptospira interrogans grown at 30 degrees C or overnight upshift to 37 degrees C were isolated and the relative abundance of each protein was determined by iTRAQ analysis coupled with two-dimensional liquid chromatography and tandem mass spectrometry (2-DLC/MS-MS. We identified 1026 proteins with 99% confidence; 27 and 66 were present at elevated and reduced abundance respectively. Protein abundance changes were compared with transcriptional differences determined from the microarray studies. While there was some correlation between the microarray and iTRAQ data, a subset of genes that showed no differential expression by microarray was found to encode temperature-regulated proteins. This set of genes is of particular interest as it is likely that regulation of their expression occurs post-transcriptionally, providing an opportunity to develop hypotheses about the molecular dynamics of the outer membrane of Leptospira in response to changing environments. CONCLUSIONS/SIGNIFICANCE: This is the first study to compare transcriptional and translational responses to temperature

  20. Comparative aspects of basic chromatin proteins in dinoflagellates.

    Science.gov (United States)

    Rizzo, P J

    1981-01-01

    Previous work on histone-like proteins in dinoflagellates is summarized, together with some new data to give an overview of basic proteins in these algae. The first two dinoflagellates studied were both found to contain one major acid-soluble protein that migrated to the same position in acidic-urea gels. When several other genera were studied however, it became apparent that the histone-like proteins from different dinoflagellates were similar but not identical. In view of the great diversity of living dinoflagellates it is speculated that further differences in dinoflagellate basic chromatin proteins will be revealed. Electrophoretic data from the eukaryotic (endosymbiont) nucleus of Peridinium balticum showed the presence of five major components. It is speculated that two of these proteins represent an H1-like doublet and two others correspond to the highly conserved histones H3 and H4. The fifth component is a new histone that may substitute for H2A and H2B in the nucleosome. Because histones and nucleosomes are present in all higher organisms but completely lacking in procaryotes, studies on basic proteins in dinoflagellates will provides insights into the evolution of histones and eucaryotic chromatin organization.

  1. Recombinant expression and purification of the RNA-binding LARP6 proteins from fish genetic model organisms.

    Science.gov (United States)

    Castro, José M; Horn, Daniel A; Pu, Xinzhu; Lewis, Karen A

    2017-06-01

    The RNA-binding proteins that comprise the La-related protein (LARP) superfamily have been implicated in a wide range of cellular functions, from tRNA maturation to regulation of protein synthesis. To more expansively characterize the biological function of the LARP6 subfamily, we have recombinantly expressed the full-length LARP6 proteins from two teleost fish, platyfish (Xiphophorus maculatus) and zebrafish (Danio rerio). The yields of the recombinant proteins were enhanced to >2 mg/L using a tandem approach of an N-terminal His 6 -SUMO tag and an iterative solubility screening assay to identify structurally stabilizing buffer components. The domain topologies of the purified fish proteins were probed with limited proteolysis. The fish proteins contain an internal, protease-resistant 40 kDa domain, which is considerably more stable than the comparable domain from the human LARP6 protein. The fish proteins are therefore a lucrative model system in which to study both the evolutionary divergence of this family of La-related proteins and the structure and conformational dynamics of the domains that comprise the LARP6 protein. Copyright © 2017 Elsevier Inc. All rights reserved.

  2. Assessment of higher order structure comparability in therapeutic proteins using nuclear magnetic resonance spectroscopy.

    Science.gov (United States)

    Amezcua, Carlos A; Szabo, Christina M

    2013-06-01

    In this work, we applied nuclear magnetic resonance (NMR) spectroscopy to rapidly assess higher order structure (HOS) comparability in protein samples. Using a variation of the NMR fingerprinting approach described by Panjwani et al. [2010. J Pharm Sci 99(8):3334-3342], three nonglycosylated proteins spanning a molecular weight range of 6.5-67 kDa were analyzed. A simple statistical method termed easy comparability of HOS by NMR (ECHOS-NMR) was developed. In this method, HOS similarity between two samples is measured via the correlation coefficient derived from linear regression analysis of binned NMR spectra. Applications of this method include HOS comparability assessment during new product development, manufacturing process changes, supplier changes, next-generation products, and the development of biosimilars to name just a few. We foresee ECHOS-NMR becoming a routine technique applied to comparability exercises used to complement data from other analytical techniques. Copyright © 2013 Wiley Periodicals, Inc.

  3. The Phyre2 web portal for protein modeling, prediction and analysis.

    Science.gov (United States)

    Kelley, Lawrence A; Mezulis, Stefans; Yates, Christopher M; Wass, Mark N; Sternberg, Michael J E

    2015-06-01

    Phyre2 is a suite of tools available on the web to predict and analyze protein structure, function and mutations. The focus of Phyre2 is to provide biologists with a simple and intuitive interface to state-of-the-art protein bioinformatics tools. Phyre2 replaces Phyre, the original version of the server for which we previously published a paper in Nature Protocols. In this updated protocol, we describe Phyre2, which uses advanced remote homology detection methods to build 3D models, predict ligand binding sites and analyze the effect of amino acid variants (e.g., nonsynonymous SNPs (nsSNPs)) for a user's protein sequence. Users are guided through results by a simple interface at a level of detail they determine. This protocol will guide users from submitting a protein sequence to interpreting the secondary and tertiary structure of their models, their domain composition and model quality. A range of additional available tools is described to find a protein structure in a genome, to submit large number of sequences at once and to automatically run weekly searches for proteins that are difficult to model. The server is available at http://www.sbg.bio.ic.ac.uk/phyre2. A typical structure prediction will be returned between 30 min and 2 h after submission.

  4. Comparative Analyses of Tomato yellow leaf curl virus C4 Protein-Interacting Host Proteins in Healthy and Infected Tomato Tissues

    Directory of Open Access Journals (Sweden)

    Namgyu Kim

    2016-10-01

    Full Text Available Tomato yellow leaf curl virus (TYLCV, a member of the genus Begomovirus, is one of the most important viruses of cultivated tomatoes worldwide, mainly causing yellowing and curling of leaves with stunting in plants. TYLCV causes severe problems in sub-tropical and tropical countries, as well as in Korea. However, the mechanism of TYLCV infection remains unclear, although the function of each viral component has been identified. TYLCV C4 codes for a small protein involved in various cellular functions, including symptom determination, gene silencing, viral movement, and induction of the plant defense response. In this study, through yeast-two hybrid screenings, we identified TYLCV C4-interacting host proteins from both healthy and symptom-exhibiting tomato tissues, to determine the role of TYLCV C4 proteins in the infection processes. Comparative analyses of 28 proteins from healthy tissues and 36 from infected tissues showing interactions with TYLCV C4 indicated that TYLCV C4 mainly interacts with host proteins involved in translation, ubiquitination, and plant defense, and most interacting proteins differed between the two tissues but belong to similar molecular functional categories. Four proteins—two ribosomal proteins, S-adenosyl-L-homocysteine hydrolase, and 14-3-3 family protein—were detected in both tissues. Furthermore, the identified proteins in symptom-exhibiting tissues showed greater involvement in plant defenses. Some are key regulators, such as receptor-like kinases and pathogenesis-related proteins, of plant defenses. Thus, TYLCV C4 may contribute to the suppression of host defense during TYLCV infection and be involved in ubiquitination for viral infection.

  5. Post-Exercise Muscle Protein Synthesis in Rats after Ingestion of Acidified Bovine Milk Compared with Skim Milk.

    Science.gov (United States)

    Nakayama, Kyosuke; Kanda, Atsushi; Tagawa, Ryoichi; Sanbongi, Chiaki; Ikegami, Shuji; Itoh, Hiroyuki

    2017-09-27

    Bovine milk proteins have a low absorption rate due to gastric acid-induced coagulation. Acidified milk remains liquid under acidic conditions; therefore, the absorption rate of its protein may differ from that of untreated milk. To investigate how this would affect muscle protein synthesis (MPS), we compared MPS after ingestion of acidified versus skim milk in rats. Male Sprague-Dawley rats swam for 2 h and were immediately administered acidified or skim milk, then euthanized at 30, 60, 90, and 120 min afterwards. Triceps muscle samples were excised for assessing fractional synthetic rate (FSR), plasma components, intramuscular free amino acids and mTOR signaling. The FSR in the acidified milk group was significantly higher than in the skim milk group throughout the post-ingestive period. Plasma essential amino acids, leucine, and insulin levels were significantly increased in the acidified milk group at 30 min after administration compared to the skim milk group. In addition, acidified milk ingestion was associated with greater phosphorylation of protein kinase B (Akt) and ribosomal protein S6 kinase (S6K1), and sustained phosphorylation of 4E-binding protein 1 (4E-BP1). These results indicate that compared with untreated milk, acidified milk ingestion is associated with greater stimulation of post-exercise MPS.

  6. Post-Exercise Muscle Protein Synthesis in Rats after Ingestion of Acidified Bovine Milk Compared with Skim Milk

    Directory of Open Access Journals (Sweden)

    Kyosuke Nakayama

    2017-09-01

    Full Text Available Bovine milk proteins have a low absorption rate due to gastric acid-induced coagulation. Acidified milk remains liquid under acidic conditions; therefore, the absorption rate of its protein may differ from that of untreated milk. To investigate how this would affect muscle protein synthesis (MPS, we compared MPS after ingestion of acidified versus skim milk in rats. Male Sprague-Dawley rats swam for 2 h and were immediately administered acidified or skim milk, then euthanized at 30, 60, 90, and 120 min afterwards. Triceps muscle samples were excised for assessing fractional synthetic rate (FSR, plasma components, intramuscular free amino acids and mTOR signaling. The FSR in the acidified milk group was significantly higher than in the skim milk group throughout the post-ingestive period. Plasma essential amino acids, leucine, and insulin levels were significantly increased in the acidified milk group at 30 min after administration compared to the skim milk group. In addition, acidified milk ingestion was associated with greater phosphorylation of protein kinase B (Akt and ribosomal protein S6 kinase (S6K1, and sustained phosphorylation of 4E-binding protein 1 (4E-BP1. These results indicate that compared with untreated milk, acidified milk ingestion is associated with greater stimulation of post-exercise MPS.

  7. A physiologically based pharmacokinetic model to predict the pharmacokinetics of highly protein-bound drugs and the impact of errors in plasma protein binding.

    Science.gov (United States)

    Ye, Min; Nagar, Swati; Korzekwa, Ken

    2016-04-01

    Predicting the pharmacokinetics of highly protein-bound drugs is difficult. Also, since historical plasma protein binding data were often collected using unbuffered plasma, the resulting inaccurate binding data could contribute to incorrect predictions. This study uses a generic physiologically based pharmacokinetic (PBPK) model to predict human plasma concentration-time profiles for 22 highly protein-bound drugs. Tissue distribution was estimated from in vitro drug lipophilicity data, plasma protein binding and the blood: plasma ratio. Clearance was predicted with a well-stirred liver model. Underestimated hepatic clearance for acidic and neutral compounds was corrected by an empirical scaling factor. Predicted values (pharmacokinetic parameters, plasma concentration-time profile) were compared with observed data to evaluate the model accuracy. Of the 22 drugs, less than a 2-fold error was obtained for the terminal elimination half-life (t1/2 , 100% of drugs), peak plasma concentration (Cmax , 100%), area under the plasma concentration-time curve (AUC0-t , 95.4%), clearance (CLh , 95.4%), mean residence time (MRT, 95.4%) and steady state volume (Vss , 90.9%). The impact of fup errors on CLh and Vss prediction was evaluated. Errors in fup resulted in proportional errors in clearance prediction for low-clearance compounds, and in Vss prediction for high-volume neutral drugs. For high-volume basic drugs, errors in fup did not propagate to errors in Vss prediction. This is due to the cancellation of errors in the calculations for tissue partitioning of basic drugs. Overall, plasma profiles were well simulated with the present PBPK model. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

  8. On the characterization and software implementation of general protein lattice models.

    Directory of Open Access Journals (Sweden)

    Alessio Bechini

    Full Text Available models of proteins have been widely used as a practical means to computationally investigate general properties of the system. In lattice models any sterically feasible conformation is represented as a self-avoiding walk on a lattice, and residue types are limited in number. So far, only two- or three-dimensional lattices have been used. The inspection of the neighborhood of alpha carbons in the core of real proteins reveals that also lattices with higher coordination numbers, possibly in higher dimensional spaces, can be adopted. In this paper, a new general parametric lattice model for simplified protein conformations is proposed and investigated. It is shown how the supporting software can be consistently designed to let algorithms that operate on protein structures be implemented in a lattice-agnostic way. The necessary theoretical foundations are developed and organically presented, pinpointing the role of the concept of main directions in lattice-agnostic model handling. Subsequently, the model features across dimensions and lattice types are explored in tests performed on benchmark protein sequences, using a Python implementation. Simulations give insights on the use of square and triangular lattices in a range of dimensions. The trend of potential minimum for sequences of different lengths, varying the lattice dimension, is uncovered. Moreover, an extensive quantitative characterization of the usage of the so-called "move types" is reported for the first time. The proposed general framework for the development of lattice models is simple yet complete, and an object-oriented architecture can be proficiently employed for the supporting software, by designing ad-hoc classes. The proposed framework represents a new general viewpoint that potentially subsumes a number of solutions previously studied. The adoption of the described model pushes to look at protein structure issues from a more general and essential perspective, making

  9. Automated de novo phasing and model building of coiled-coil proteins.

    Science.gov (United States)

    Rämisch, Sebastian; Lizatović, Robert; André, Ingemar

    2015-03-01

    Models generated by de novo structure prediction can be very useful starting points for molecular replacement for systems where suitable structural homologues cannot be readily identified. Protein-protein complexes and de novo-designed proteins are examples of systems that can be challenging to phase. In this study, the potential of de novo models of protein complexes for use as starting points for molecular replacement is investigated. The approach is demonstrated using homomeric coiled-coil proteins, which are excellent model systems for oligomeric systems. Despite the stereotypical fold of coiled coils, initial phase estimation can be difficult and many structures have to be solved with experimental phasing. A method was developed for automatic structure determination of homomeric coiled coils from X-ray diffraction data. In a benchmark set of 24 coiled coils, ranging from dimers to pentamers with resolutions down to 2.5 Å, 22 systems were automatically solved, 11 of which had previously been solved by experimental phasing. The generated models contained 71-103% of the residues present in the deposited structures, had the correct sequence and had free R values that deviated on average by 0.01 from those of the respective reference structures. The electron-density maps were of sufficient quality that only minor manual editing was necessary to produce final structures. The method, named CCsolve, combines methods for de novo structure prediction, initial phase estimation and automated model building into one pipeline. CCsolve is robust against errors in the initial models and can readily be modified to make use of alternative crystallographic software. The results demonstrate the feasibility of de novo phasing of protein-protein complexes, an approach that could also be employed for other small systems beyond coiled coils.

  10. Generic framework for mining cellular automata models on protein-folding simulations.

    Science.gov (United States)

    Diaz, N; Tischer, I

    2016-05-13

    Cellular automata model identification is an important way of building simplified simulation models. In this study, we describe a generic architectural framework to ease the development process of new metaheuristic-based algorithms for cellular automata model identification in protein-folding trajectories. Our framework was developed by a methodology based on design patterns that allow an improved experience for new algorithms development. The usefulness of the proposed framework is demonstrated by the implementation of four algorithms, able to obtain extremely precise cellular automata models of the protein-folding process with a protein contact map representation. Dynamic rules obtained by the proposed approach are discussed, and future use for the new tool is outlined.

  11. SURF'S UP! – Protein classification by surface comparisons

    Indian Academy of Sciences (India)

    Prakash

    encounter large protein families with only a few members of ... server for analysis of functional relationships in protein families, as inferred from protein surface maps comparison ... features, SURF'S UP! can work with models obtained from comparative modelling. ... 1997) or, if the user is confident in the quality of automated.

  12. Structure modeling of all identified G protein-coupled receptors in the human genome.

    Science.gov (United States)

    Zhang, Yang; Devries, Mark E; Skolnick, Jeffrey

    2006-02-01

    G protein-coupled receptors (GPCRs), encoded by about 5% of human genes, comprise the largest family of integral membrane proteins and act as cell surface receptors responsible for the transduction of endogenous signal into a cellular response. Although tertiary structural information is crucial for function annotation and drug design, there are few experimentally determined GPCR structures. To address this issue, we employ the recently developed threading assembly refinement (TASSER) method to generate structure predictions for all 907 putative GPCRs in the human genome. Unlike traditional homology modeling approaches, TASSER modeling does not require solved homologous template structures; moreover, it often refines the structures closer to native. These features are essential for the comprehensive modeling of all human GPCRs when close homologous templates are absent. Based on a benchmarked confidence score, approximately 820 predicted models should have the correct folds. The majority of GPCR models share the characteristic seven-transmembrane helix topology, but 45 ORFs are predicted to have different structures. This is due to GPCR fragments that are predominantly from extracellular or intracellular domains as well as database annotation errors. Our preliminary validation includes the automated modeling of bovine rhodopsin, the only solved GPCR in the Protein Data Bank. With homologous templates excluded, the final model built by TASSER has a global C(alpha) root-mean-squared deviation from native of 4.6 angstroms, with a root-mean-squared deviation in the transmembrane helix region of 2.1 angstroms. Models of several representative GPCRs are compared with mutagenesis and affinity labeling data, and consistent agreement is demonstrated. Structure clustering of the predicted models shows that GPCRs with similar structures tend to belong to a similar functional class even when their sequences are diverse. These results demonstrate the usefulness and robustness

  13. Structure modeling of all identified G protein-coupled receptors in the human genome.

    Directory of Open Access Journals (Sweden)

    Yang Zhang

    2006-02-01

    Full Text Available G protein-coupled receptors (GPCRs, encoded by about 5% of human genes, comprise the largest family of integral membrane proteins and act as cell surface receptors responsible for the transduction of endogenous signal into a cellular response. Although tertiary structural information is crucial for function annotation and drug design, there are few experimentally determined GPCR structures. To address this issue, we employ the recently developed threading assembly refinement (TASSER method to generate structure predictions for all 907 putative GPCRs in the human genome. Unlike traditional homology modeling approaches, TASSER modeling does not require solved homologous template structures; moreover, it often refines the structures closer to native. These features are essential for the comprehensive modeling of all human GPCRs when close homologous templates are absent. Based on a benchmarked confidence score, approximately 820 predicted models should have the correct folds. The majority of GPCR models share the characteristic seven-transmembrane helix topology, but 45 ORFs are predicted to have different structures. This is due to GPCR fragments that are predominantly from extracellular or intracellular domains as well as database annotation errors. Our preliminary validation includes the automated modeling of bovine rhodopsin, the only solved GPCR in the Protein Data Bank. With homologous templates excluded, the final model built by TASSER has a global C(alpha root-mean-squared deviation from native of 4.6 angstroms, with a root-mean-squared deviation in the transmembrane helix region of 2.1 angstroms. Models of several representative GPCRs are compared with mutagenesis and affinity labeling data, and consistent agreement is demonstrated. Structure clustering of the predicted models shows that GPCRs with similar structures tend to belong to a similar functional class even when their sequences are diverse. These results demonstrate the usefulness

  14. Modeling Protein Structures in Feed and Seed Tissues Using Novel Synchrotron-Based Analytical Technique

    International Nuclear Information System (INIS)

    Yu, P.

    2008-01-01

    Traditional 'wet' chemical analyses usually looks for a specific known component (such as protein) through homogenization and separation of the components of interest from the complex tissue matrix. Traditional 'wet' chemical analyses rely heavily on the use of harsh chemicals and derivatization, therefore altering the native feed protein structures and possibly generating artifacts. The objective of this study was to introduce a novel and non-destructive method to estimate protein structures in feed and seeds within intact tissues using advanced synchrotron-based infrared microspectroscopy (SFTIRM). The experiments were performed at the National Synchrotron Light Source in Brookhaven National Laboratory (US Dept. of Energy, NY). The results show that with synchrotron-based SFTIRM, we are able to localize relatively 'pure' protein without destructions of the feed and seed tissues and qualify protein internal structures in terms of the proportions and ratios of a-helix, β-sheet, random coil and β-turns on a relative basis using multi-peak modeling procedures. These protein structure profile (a-helix, β-sheet, etc.) may influence protein quality and availability in animals. Several examples of feed and seeds were provided. The implications of this study are that we can use this new method to compare internal protein structures between feeds and between seed verities. We can also use this method to detect heat-induced the structural changes of protein in feeds.

  15. A lock-and-key model for protein–protein interactions

    OpenAIRE

    Morrison, Julie L.; Breitling, Rainer; Higham, Desmond J.; Gilbert, David R.

    2006-01-01

    Motivation: Protein–protein interaction networks are one of the major post-genomic data sources available to molecular biologists. They provide a comprehensive view of the global interaction structure of an organism’s proteome, as well as detailed information on specific interactions. Here we suggest a physical model of protein interactions that can be used to extract additional information at an intermediate level: It enables us to identify proteins which share biological interaction motifs,...

  16. Protein buffering in model systems and in whole human saliva.

    Directory of Open Access Journals (Sweden)

    Andreas Lamanda

    Full Text Available The aim of this study was to quantify the buffer attributes (value, power, range and optimum of two model systems for whole human resting saliva, the purified proteins from whole human resting saliva and single proteins. Two model systems, the first containing amyloglucosidase and lysozyme, and the second containing amyloglucosidase and alpha-amylase, were shown to provide, in combination with hydrogencarbonate and di-hydrogenphosphate, almost identical buffer attributes as whole human resting saliva. It was further demonstrated that changes in the protein concentration as small as 0.1% may change the buffer value of a buffer solution up to 15 times. Additionally, it was shown that there was a protein concentration change in the same range (0.16% between saliva samples collected at the time periods of 13:00 and others collected at 9:00 am and 17:00. The mode of the protein expression changed between these samples corresponded to the change in basic buffer power and the change of the buffer value at pH 6.7. Finally, SDS Page and Ruthenium II tris (bathophenantroline disulfonate staining unveiled a constant protein expression in all samples except for one 50 kDa protein band. As the change in the expression pattern of that 50 kDa protein band corresponded to the change in basic buffer power and the buffer value at pH 6.7, it was reasonable to conclude that this 50 kDa protein band may contain the protein(s belonging to the protein buffer system of human saliva.

  17. Analysis and Ranking of Protein-Protein Docking Models Using Inter-Residue Contacts and Inter-Molecular Contact Maps

    KAUST Repository

    Oliva, Romina; Chermak, Edrisse; Cavallo, Luigi

    2015-01-01

    In view of the increasing interest both in inhibitors of protein-protein interactions and in protein drugs themselves, analysis of the three-dimensional structure of protein-protein complexes is assuming greater relevance in drug design. In the many cases where an experimental structure is not available, protein-protein docking becomes the method of choice for predicting the arrangement of the complex. However, reliably scoring protein-protein docking poses is still an unsolved problem. As a consequence, the screening of many docking models is usually required in the analysis step, to possibly single out the correct ones. Here, making use of exemplary cases, we review our recently introduced methods for the analysis of protein complex structures and for the scoring of protein docking poses, based on the use of inter-residue contacts and their visualization in inter-molecular contact maps. We also show that the ensemble of tools we developed can be used in the context of rational drug design targeting protein-protein interactions.

  18. Analysis and Ranking of Protein-Protein Docking Models Using Inter-Residue Contacts and Inter-Molecular Contact Maps

    KAUST Repository

    Oliva, Romina

    2015-07-01

    In view of the increasing interest both in inhibitors of protein-protein interactions and in protein drugs themselves, analysis of the three-dimensional structure of protein-protein complexes is assuming greater relevance in drug design. In the many cases where an experimental structure is not available, protein-protein docking becomes the method of choice for predicting the arrangement of the complex. However, reliably scoring protein-protein docking poses is still an unsolved problem. As a consequence, the screening of many docking models is usually required in the analysis step, to possibly single out the correct ones. Here, making use of exemplary cases, we review our recently introduced methods for the analysis of protein complex structures and for the scoring of protein docking poses, based on the use of inter-residue contacts and their visualization in inter-molecular contact maps. We also show that the ensemble of tools we developed can be used in the context of rational drug design targeting protein-protein interactions.

  19. Anomalous diffusion in neutral evolution of model proteins

    Science.gov (United States)

    Nelson, Erik D.; Grishin, Nick V.

    2015-06-01

    Protein evolution is frequently explored using minimalist polymer models, however, little attention has been given to the problem of structural drift, or diffusion. Here, we study neutral evolution of small protein motifs using an off-lattice heteropolymer model in which individual monomers interact as low-resolution amino acids. In contrast to most earlier models, both the length and folded structure of the polymers are permitted to change. To describe structural change, we compute the mean-square distance (MSD) between monomers in homologous folds separated by n neutral mutations. We find that structural change is episodic, and, averaged over lineages (for example, those extending from a single sequence), exhibits a power-law dependence on n . We show that this exponent depends on the alignment method used, and we analyze the distribution of waiting times between neutral mutations. The latter are more disperse than for models required to maintain a specific fold, but exhibit a similar power-law tail.

  20. Conformational sampling in template-free protein loop structure modeling: an overview.

    Science.gov (United States)

    Li, Yaohang

    2013-01-01

    Accurately modeling protein loops is an important step to predict three-dimensional structures as well as to understand functions of many proteins. Because of their high flexibility, modeling the three-dimensional structures of loops is difficult and is usually treated as a "mini protein folding problem" under geometric constraints. In the past decade, there has been remarkable progress in template-free loop structure modeling due to advances of computational methods as well as stably increasing number of known structures available in PDB. This mini review provides an overview on the recent computational approaches for loop structure modeling. In particular, we focus on the approaches of sampling loop conformation space, which is a critical step to obtain high resolution models in template-free methods. We review the potential energy functions for loop modeling, loop buildup mechanisms to satisfy geometric constraints, and loop conformation sampling algorithms. The recent loop modeling results are also summarized.

  1. CONFORMATIONAL SAMPLING IN TEMPLATE-FREE PROTEIN LOOP STRUCTURE MODELING: AN OVERVIEW

    Directory of Open Access Journals (Sweden)

    Yaohang Li

    2013-02-01

    Full Text Available Accurately modeling protein loops is an important step to predict three-dimensional structures as well as to understand functions of many proteins. Because of their high flexibility, modeling the three-dimensional structures of loops is difficult and is usually treated as a “mini protein folding problem” under geometric constraints. In the past decade, there has been remarkable progress in template-free loop structure modeling due to advances of computational methods as well as stably increasing number of known structures available in PDB. This mini review provides an overview on the recent computational approaches for loop structure modeling. In particular, we focus on the approaches of sampling loop conformation space, which is a critical step to obtain high resolution models in template-free methods. We review the potential energy functions for loop modeling, loop buildup mechanisms to satisfy geometric constraints, and loop conformation sampling algorithms. The recent loop modeling results are also summarized.

  2. Direct and indirect radioiodination of protein: comparative study of chemotactic peptide labeling

    International Nuclear Information System (INIS)

    Lavinas, Tatiana

    2004-01-01

    The development of simple methods for protein radioiodination have stimulated the use of radioiodinated peptides in vivo. There are two basic methods for labeling proteins with radioiodine: direct labeling, reaction of an electrophilic radioiodine with functional activated groups on protein, like the phenol ring in the tyrosine residue, and the conjugation of a previously radioiodinated molecule to the protein, referred as indirect method. The great problem related to the direct radioiodination of proteins is the in vivo dehalogenation. This problem can be minimized if a non-phenolic prosthetic group is used in the indirect radioiodination of the peptide. The ATE prosthetic group, N-succinimidyl 3-(tri-n-butylstannyl) benzoate, when radioiodinated by electrophilic iododestannilation produces N-succinimidyl 3-[ 123 l/ 131 l] iodine benzoate (SIB) that is subsequently conjugated to the protein by the acylation of the lysine group. There are many radiopharmaceuticals employed in scintigraphic images of infection and inflammation used with some limitations. These limitations stimulated the improvement of a new class of radiopharmaceuticals, the receptor-specific related labeled peptides, as the mediators of the inflammatory response, that presents high affinity by receptors expressed in the inflammation process, and fast clearance from blood and non-target tissues. One of these molecules is the synthetic chemotactic peptide fNleLFNIeYK that presents potent chemotaxis for leukocytes, with high affinity by the receptors presented in polymorphonuclear leukocytes and mononuclear phagocytes. The objective of this work included the synthesis of ATE prosthetic group and comparative radioiodination of the chemotactic peptide fNleLFNIeYK by direct and indirect methods, with radiochemical purity determination and evaluation of in vivo and in vitro stability of the compounds. This work presented an original contribution in the comparative biological distribution studies of the

  3. ProDis-ContSHC: Learning protein dissimilarity measures and hierarchical context coherently for protein-protein comparison in protein database retrieval

    KAUST Repository

    Wang, Jim Jing-Yan

    2012-05-08

    Background: The need to retrieve or classify protein molecules using structure or sequence-based similarity measures underlies a wide range of biomedical applications. Traditional protein search methods rely on a pairwise dissimilarity/similarity measure for comparing a pair of proteins. This kind of pairwise measures suffer from the limitation of neglecting the distribution of other proteins and thus cannot satisfy the need for high accuracy of the retrieval systems. Recent work in the machine learning community has shown that exploiting the global structure of the database and learning the contextual dissimilarity/similarity measures can improve the retrieval performance significantly. However, most existing contextual dissimilarity/similarity learning algorithms work in an unsupervised manner, which does not utilize the information of the known class labels of proteins in the database.Results: In this paper, we propose a novel protein-protein dissimilarity learning algorithm, ProDis-ContSHC. ProDis-ContSHC regularizes an existing dissimilarity measure dij by considering the contextual information of the proteins. The context of a protein is defined by its neighboring proteins. The basic idea is, for a pair of proteins (i, j), if their context N (i) and N (j) is similar to each other, the two proteins should also have a high similarity. We implement this idea by regularizing dij by a factor learned from the context N (i) and N (j). Moreover, we divide the context to hierarchial sub-context and get the contextual dissimilarity vector for each protein pair. Using the class label information of the proteins, we select the relevant (a pair of proteins that has the same class labels) and irrelevant (with different labels) protein pairs, and train an SVM model to distinguish between their contextual dissimilarity vectors. The SVM model is further used to learn a supervised regularizing factor. Finally, with the new Supervised learned Dissimilarity measure, we update

  4. Comparative proteomics reveals key proteins recruited at the nucleoid of Deinococcus after irradiation-induced DNA damage

    International Nuclear Information System (INIS)

    Bouthier de la Tour, Claire; Passot, Fanny Marie; Toueille, Magali; Servant, Pascale; Sommer, Suzanne; Mirabella, Boris; Blanchard, Laurence; Groot, Arjan de; Guerin, Philippe; Armengaud, Jean

    2013-01-01

    The nucleoids of radiation-resistant Deinococcus species show a high degree of compaction maintained after ionizing irradiation. We identified proteins recruited after irradiation in nucleoids of Deinococcus radiodurans and Deinococcus deserti by means of comparative proteomics. Proteins in nucleoid-enriched fractions from unirradiated and irradiated Deinococcus were identified and semi quantified by shotgun proteomics. The ssDNA-binding protein SSB, DNA gyrase subunits GyrA and GyrB, DNA topoisomerase I, RecA recombinase, UvrA excinuclease, RecQ helicase, DdrA, DdrB, and DdrD proteins were found in significantly higher amounts in irradiated nucleoids of both Deinococcus species. We observed, by immunofluorescence microscopy, the subcellular localization of these proteins in D. radiodurans, showing for the first time the recruitment of the DdrD protein into the D. radiodurans nucleoid. We specifically followed the kinetics of recruitment of RecA, DdrA, and DdrD to the nucleoid after irradiation. Remarkably, RecA proteins formed irregular filament-like structures 1 h after irradiation, before being redistributed throughout the cells by 3 h post-irradiation. Comparable dynamics of DdrD localization were observed, suggesting a possible functional interaction between RecA and DdrD. Several proteins involved in nucleotide synthesis were also seen in higher quantities in the nucleoids of irradiated cells, indicative of the existence of a mechanism for orchestrating the presence of proteins involved in DNA metabolism in nucleoids in response to massive DNA damage. All MS data have been deposited in the ProteomeXchange with identifier PXD00196. (authors)

  5. The evolution of the protein synthesis system. I - A model of a primitive protein synthesis system

    Science.gov (United States)

    Mizutani, H.; Ponnamperuma, C.

    1977-01-01

    A model is developed to describe the evolution of the protein synthesis system. The model is comprised of two independent autocatalytic systems, one including one gene (A-gene) and two activated amino acid polymerases (O and A-polymerases), and the other including the addition of another gene (N-gene) and a nucleotide polymerase. Simulation results have suggested that even a small enzymic activity and polymerase specificity could lead the system to the most accurate protein synthesis, as far as permitted by transitions to systems with higher accuracy.

  6. Comparative proteomics analysis of proteins expressed in the I-1 and I-2 internodes of strawberry stolons

    Directory of Open Access Journals (Sweden)

    Lai Wenguo

    2011-05-01

    Full Text Available Abstract Background Strawberries (Fragaria ananassa reproduce asexually through stolons, which have strong tendencies to form adventitious roots at their second node. Understanding how the development of the proximal (I-1 and distal (I-2 internodes of stolons differ should facilitate nursery cultivation of strawberries. Results Herein, we compared the proteomic profiles of the strawberry stolon I-1 and I-2 internodes. Proteins extracted from the internodes were separated by two-dimensional gel electrophoresis, and 164 I-1 protein spots and 200 I-2 protein spots were examined further. Using mass spectrometry and database searches, 38 I-1 and 52 I-2 proteins were identified and categorized (8 and 10 groups, respectively according to their cellular compartmentalization and functionality. Many of the identified proteins are enzymes necessary for carbohydrate metabolism and photosynthesis. Furthermore, identification of proteins that interact revealed that many of the I-2 proteins form a dynamic network during development. Finally, given our results, we present a mechanistic scheme for adventitious root formation of new clonal plants at the second node. Conclusions Comparative proteomic analysis of I-1 and I-2 proteins revealed that the ubiquitin-proteasome pathway and sugar-hormone pathways might be important during adventitious root formation at the second node of new clonal plants.

  7. Membrane Compartmentalization Reducing the Mobility of Lipids and Proteins within a Model Plasma Membrane.

    Science.gov (United States)

    Koldsø, Heidi; Reddy, Tyler; Fowler, Philip W; Duncan, Anna L; Sansom, Mark S P

    2016-09-01

    The cytoskeleton underlying cell membranes may influence the dynamic organization of proteins and lipids within the bilayer by immobilizing certain transmembrane (TM) proteins and forming corrals within the membrane. Here, we present coarse-grained resolution simulations of a biologically realistic membrane model of asymmetrically organized lipids and TM proteins. We determine the effects of a model of cytoskeletal immobilization of selected membrane proteins using long time scale coarse-grained molecular dynamics simulations. By introducing compartments with varying degrees of restraints within the membrane models, we are able to reveal how compartmentalization caused by cytoskeletal immobilization leads to reduced and anomalous diffusional mobility of both proteins and lipids. This in turn results in a reduced rate of protein dimerization within the membrane and of hopping of membrane proteins between compartments. These simulations provide a molecular realization of hierarchical models often invoked to explain single-molecule imaging studies of membrane proteins.

  8. RCK: accurate and efficient inference of sequence- and structure-based protein-RNA binding models from RNAcompete data.

    Science.gov (United States)

    Orenstein, Yaron; Wang, Yuhao; Berger, Bonnie

    2016-06-15

    Protein-RNA interactions, which play vital roles in many processes, are mediated through both RNA sequence and structure. CLIP-based methods, which measure protein-RNA binding in vivo, suffer from experimental noise and systematic biases, whereas in vitro experiments capture a clearer signal of protein RNA-binding. Among them, RNAcompete provides binding affinities of a specific protein to more than 240 000 unstructured RNA probes in one experiment. The computational challenge is to infer RNA structure- and sequence-based binding models from these data. The state-of-the-art in sequence models, Deepbind, does not model structural preferences. RNAcontext models both sequence and structure preferences, but is outperformed by GraphProt. Unfortunately, GraphProt cannot detect structural preferences from RNAcompete data due to the unstructured nature of the data, as noted by its developers, nor can it be tractably run on the full RNACompete dataset. We develop RCK, an efficient, scalable algorithm that infers both sequence and structure preferences based on a new k-mer based model. Remarkably, even though RNAcompete data is designed to be unstructured, RCK can still learn structural preferences from it. RCK significantly outperforms both RNAcontext and Deepbind in in vitro binding prediction for 244 RNAcompete experiments. Moreover, RCK is also faster and uses less memory, which enables scalability. While currently on par with existing methods in in vivo binding prediction on a small scale test, we demonstrate that RCK will increasingly benefit from experimentally measured RNA structure profiles as compared to computationally predicted ones. By running RCK on the entire RNAcompete dataset, we generate and provide as a resource a set of protein-RNA structure-based models on an unprecedented scale. Software and models are freely available at http://rck.csail.mit.edu/ bab@mit.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by

  9. Efficient use of unlabeled data for protein sequence classification: a comparative study.

    Science.gov (United States)

    Kuksa, Pavel; Huang, Pai-Hsi; Pavlovic, Vladimir

    2009-04-29

    Recent studies in computational primary protein sequence analysis have leveraged the power of unlabeled data. For example, predictive models based on string kernels trained on sequences known to belong to particular folds or superfamilies, the so-called labeled data set, can attain significantly improved accuracy if this data is supplemented with protein sequences that lack any class tags-the unlabeled data. In this study, we present a principled and biologically motivated computational framework that more effectively exploits the unlabeled data by only using the sequence regions that are more likely to be biologically relevant for better prediction accuracy. As overly-represented sequences in large uncurated databases may bias the estimation of computational models that rely on unlabeled data, we also propose a method to remove this bias and improve performance of the resulting classifiers. Combined with state-of-the-art string kernels, our proposed computational framework achieves very accurate semi-supervised protein remote fold and homology detection on three large unlabeled databases. It outperforms current state-of-the-art methods and exhibits significant reduction in running time. The unlabeled sequences used under the semi-supervised setting resemble the unpolished gemstones; when used as-is, they may carry unnecessary features and hence compromise the classification accuracy but once cut and polished, they improve the accuracy of the classifiers considerably.

  10. Model for Stress-induced Protein Degradation in Lemna minor1

    Science.gov (United States)

    Cooke, Robert J.; Roberts, Keith; Davies, David D.

    1980-01-01

    Transfer of Lemna minor fronds to adverse or stress conditions produces a large increase in the rate of protein degradation. Cycloheximide partially inhibits stress-induced protein degradation and also partially inhibits the protein degradation which occurs in the absence of stress. The increased protein degradation does not appear to be due to an increase in activity of soluble proteolytic enzymes. Biochemical evidence indicates that stress, perhaps acting via hormones, affects the permeability of certain membranes, particularly the tonoplast. A general model for stress-induced protein degradation is presented in which changes in membrane properties allow vacuolar proteolytic enzymes increased access to cytoplasmic proteins. PMID:16661588

  11. The role of Rad 51 protein in radioresistance of spheroid model of Du 145 prostate carcinoma cell line

    International Nuclear Information System (INIS)

    Taghizadeh, M.; Khoei, S.; Nikoofar, A. R.; Ghamsari, L.; Goliaei, B.

    2009-01-01

    Rad 51 is a protein with critical role in double strand break repair. Down-regulation of this protein has a significant effect in radiosensitivity of some cell lines like prostate carcinoma. Compared to monolayer cell culture model, the spheroids are more resistant to radiation. The aim of the current study was to determine the Rad 51 protein level in Du 145 spheroids, and monolayer cells before and after exposure to gamma irradiation. Materials and Methods: In the present study, western blot was used to determine the level of Rad 51 protein in Du 145 cell line grown as monolayer and spheroid. Results: Western blot analysis showed that in the spheroid cells, Rad 51 had an elevated level before and after radiation in comparison with monolayer cells. Higher doses of radiation induced elevated expression of Rad 51 protein in both culture models.The level of at protein after exposure to gamma rays had been time-dependent. Conclusion: Rad 51 might act as a mediator of radiation resistance in tumor cells. Repression of Rad 51 activity could be a prominent strategy to overcome radiation resistance of tumors.

  12. Functional and conformational properties of phaseolin (Phaseolus vulgris L.) and kidney bean protein isolate: a comparative study.

    Science.gov (United States)

    Yin, Shou-Wei; Tang, Chuan-He; Wen, Qi-Biao; Yang, Xiao-Quan

    2010-03-15

    Kidney bean (Phaseolus vulgris L.) seed is an underutilised plant protein source with good potential to be applied in the food industry. Phaseolin (also named G1 globulin) represents about 50 g kg(-1) of total storage protein in the seed. The aim of the present study was to characterise physicochemical, functional and conformational properties of phaseolin, and to compare these properties with those of kidney bean protein isolate (KPI). Compared with kidney bean protein isolate (KPI), the acid-extracted phaseolin-rich protein product (PRP) had much lower protein recovery of 320 g kg(-1) (dry weight basis) but higher phaseolin purity (over 950 g kg(-1)). PRP contained much lower sulfhydryl (SH) and disulfide bond contents than KPI. Differential scanning calorimetry analyses showed that the phaseolin in PRP was less denatured than in KPI. Thermal analyses in the presence or absence of dithiothreitol, in combination with SH and SS content analyses showed the contributions of SS to the thermal stability of KPI. The analyses of near-UV circular dichroism and intrinsic fluorescence spectra indicated more compacted tertiary conformation of the proteins in PRP than in KPI. PRP exhibited much better protein solubility, emulsifying activity index, and gel-forming ability than KPI. The relatively poor functional properties of KPI may be associated with protein denaturation/unfolding, with subsequent protein aggregation. The results presented here suggest the potential for acid-extracted PRP to be applied in food formulations, in view of its functional properties.

  13. Predicting Protein Secondary Structure with Markov Models

    DEFF Research Database (Denmark)

    Fischer, Paul; Larsen, Simon; Thomsen, Claus

    2004-01-01

    we are considering here, is to predict the secondary structure from the primary one. To this end we train a Markov model on training data and then use it to classify parts of unknown protein sequences as sheets, helices or coils. We show how to exploit the directional information contained...... in the Markov model for this task. Classifications that are purely based on statistical models might not always be biologically meaningful. We present combinatorial methods to incorporate biological background knowledge to enhance the prediction performance....

  14. A KINETIC MODEL FOR MONO-LAYER GLOBULAR PROTEIN ADSORPTION ON SOLID/LIQUID INTERFACES

    Directory of Open Access Journals (Sweden)

    Kamal I. M. Al-Malah

    2012-12-01

    Full Text Available A kinetic model was derived for globular protein adsorption. The model takes into account the three possible scenarios of a protein molecule in solution, being exposed to an interface: adsorption step from the solution to the interface; the possible desorption back into the solution; and the surface-induced unfolding or spreading of the protein unto the substrate surface. A globular protein molecule is visualized as a sphere with radius D. In addition to the general case of protein adsorption, which portrays either the surface coverage (Theta or surface concentration (� as a function of the adsorption time, special cases, like equilibrium condition, lowsurface coverage, irreversible, and Langmuirian were also presented and treated in light of the derived model. The general model was simplified for each of the subset cases. The irreversibility versus reversibility of protein adsorption was discussed. The substrate surface energetics or effects are accounted for via the proposition of the percent relative change in D/V ratio for the adsorbing protein, called (D/VPRC parameter. (D/VPRC is calculated with respect to the monolayer surface concentration of protein, where the latter is given by D/Vratio. This can be used as a landmark to protein adsorption isotherms or even kinetics. This is visualized as an indicator for solid substrate effects on the adsorbing proteins. (D/VPRC can be zero (fresh monolayer, negative (aged monolayer, or positive (multi-layer. The reference surface concentration is reported for some selected proteins.

  15. Biology of the Heat Shock Response and Protein Chaperones: Budding Yeast (Saccharomyces cerevisiae) as a Model System

    Science.gov (United States)

    Verghese, Jacob; Abrams, Jennifer; Wang, Yanyu

    2012-01-01

    Summary: The eukaryotic heat shock response is an ancient and highly conserved transcriptional program that results in the immediate synthesis of a battery of cytoprotective genes in the presence of thermal and other environmental stresses. Many of these genes encode molecular chaperones, powerful protein remodelers with the capacity to shield, fold, or unfold substrates in a context-dependent manner. The budding yeast Saccharomyces cerevisiae continues to be an invaluable model for driving the discovery of regulatory features of this fundamental stress response. In addition, budding yeast has been an outstanding model system to elucidate the cell biology of protein chaperones and their organization into functional networks. In this review, we evaluate our understanding of the multifaceted response to heat shock. In addition, the chaperone complement of the cytosol is compared to those of mitochondria and the endoplasmic reticulum, organelles with their own unique protein homeostasis milieus. Finally, we examine recent advances in the understanding of the roles of protein chaperones and the heat shock response in pathogenic fungi, which is being accelerated by the wealth of information gained for budding yeast. PMID:22688810

  16. Comparative evolutionary analysis of protein complexes in E. coli and yeast

    Directory of Open Access Journals (Sweden)

    Ranea Juan AG

    2010-02-01

    Full Text Available Abstract Background Proteins do not act in isolation; they frequently act together in protein complexes to carry out concerted cellular functions. The evolution of complexes is poorly understood, especially in organisms other than yeast, where little experimental data has been available. Results We generated accurate, high coverage datasets of protein complexes for E. coli and yeast in order to study differences in the evolution of complexes between these two species. We show that substantial differences exist in how complexes have evolved between these organisms. A previously proposed model of complex evolution identified complexes with cores of interacting homologues. We support findings of the relative importance of this mode of evolution in yeast, but find that it is much less common in E. coli. Additionally it is shown that those homologues which do cluster in complexes are involved in eukaryote-specific functions. Furthermore we identify correlated pairs of non-homologous domains which occur in multiple protein complexes. These were identified in both yeast and E. coli and we present evidence that these too may represent complex cores in yeast but not those of E. coli. Conclusions Our results suggest that there are differences in the way protein complexes have evolved in E. coli and yeast. Whereas some yeast complexes have evolved by recruiting paralogues, this is not apparent in E. coli. Furthermore, such complexes are involved in eukaryotic-specific functions. This implies that the increase in gene family sizes seen in eukaryotes in part reflects multiple family members being used within complexes. However, in general, in both E. coli and yeast, homologous domains are used in different complexes.

  17. Genome-scale metabolic model of Pichia pastoris with native and humanized glycosylation of recombinant proteins.

    Science.gov (United States)

    Irani, Zahra Azimzadeh; Kerkhoven, Eduard J; Shojaosadati, Seyed Abbas; Nielsen, Jens

    2016-05-01

    Pichia pastoris is used for commercial production of human therapeutic proteins, and genome-scale models of P. pastoris metabolism have been generated in the past to study the metabolism and associated protein production by this yeast. A major challenge with clinical usage of recombinant proteins produced by P. pastoris is the difference in N-glycosylation of proteins produced by humans and this yeast. However, through metabolic engineering, a P. pastoris strain capable of producing humanized N-glycosylated proteins was constructed. The current genome-scale models of P. pastoris do not address native nor humanized N-glycosylation, and we therefore developed ihGlycopastoris, an extension to the iLC915 model with both native and humanized N-glycosylation for recombinant protein production, but also an estimation of N-glycosylation of P. pastoris native proteins. This new model gives a better prediction of protein yield, demonstrates the effect of the different types of N-glycosylation of protein yield, and can be used to predict potential targets for strain improvement. The model represents a step towards a more complete description of protein production in P. pastoris, which is required for using these models to understand and optimize protein production processes. © 2015 Wiley Periodicals, Inc.

  18. A knowledge-based approach for identification of drugs against vivapain-2 protein of Plasmodium vivax through pharmacophore-based virtual screening with comparative modelling.

    Science.gov (United States)

    Yadav, Manoj Kumar; Singh, Amisha; Swati, D

    2014-08-01

    Malaria is one of the most infectious diseases in the world. Plasmodium vivax, the pathogen causing endemic malaria in humans worldwide, is responsible for extensive disease morbidity. Due to the emergence of resistance to common anti-malarial drugs, there is a continuous need to develop a new class of drugs for this pathogen. P. vivax cysteine protease, also known as vivapain-2, plays an important role in haemoglobin hydrolysis and is considered essential for the survival of the parasite. The three-dimensional (3D) structure of vivapain-2 is not predicted experimentally, so its structure is modelled by using comparative modelling approach and further validated by Qualitative Model Energy Analysis (QMEAN) and RAMPAGE tools. The potential binding site of selected vivapain-2 structure has been detected by grid-based function prediction method. Drug targets and their respective drugs similar to vivapain-2 have been identified using three publicly available databases: STITCH 3.1, DrugBank and Therapeutic Target Database (TTD). The second approach of this work focuses on docking study of selected drug E-64 against vivapain-2 protein. Docking reveals crucial information about key residues (Asn281, Cys283, Val396 and Asp398) that are responsible for holding the ligand in the active site. The similarity-search criterion is used for the preparation of our in-house database of drugs, obtained from filtering the drugs from the DrugBank database. A five-point 3D pharmacophore model is generated for the docked complex of vivapain-2 with E-64. This study of 3D pharmacophore-based virtual screening results in identifying three new drugs, amongst which one is approved and the other two are experimentally proved. The ADMET properties of these drugs are found to be in the desired range. These drugs with novel scaffolds may act as potent drugs for treating malaria caused by P. vivax.

  19. Comparative transcriptional analysis of Bacillus subtilis cells overproducing either secreted proteins, lipoproteins or membrane proteins

    Directory of Open Access Journals (Sweden)

    Marciniak Bogumiła C

    2012-05-01

    Full Text Available Abstract Background Bacillus subtilis is a favorable host for the production of industrially relevant proteins because of its capacity of secreting proteins into the medium to high levels, its GRAS (Generally Recognized As Safe status, its genetic accessibility and its capacity to grow in large fermentations. However, production of heterologous proteins still faces limitations. Results This study aimed at the identification of bottlenecks in secretory protein production by analyzing the response of B. subtilis at the transcriptome level to overproduction of eight secretory proteins of endogenous and heterologous origin and with different subcellular or extracellular destination: secreted proteins (NprE and XynA of B. subtilis, Usp45 of Lactococcus lactis, TEM-1 β-lactamase of Escherichia coli, membrane proteins (LmrA of L. lactis and XylP of Lactobacillus pentosus and lipoproteins (MntA and YcdH of B. subtilis. Responses specific for proteins with a common localization as well as more general stress responses were observed. The latter include upregulation of genes encoding intracellular stress proteins (groES/EL, CtsR regulated genes. Specific responses include upregulation of the liaIHGFSR operon under Usp45 and TEM-1 β-lactamase overproduction; cssRS, htrA and htrB under all secreted proteins overproduction; sigW and SigW-regulated genes mainly under membrane proteins overproduction; and ykrL (encoding an HtpX homologue specifically under membrane proteins overproduction. Conclusions The results give better insights into B. subtilis responses to protein overproduction stress and provide potential targets for genetic engineering in order to further improve B. subtilis as a protein production host.

  20. SiteBinder: an improved approach for comparing multiple protein structural motifs.

    Science.gov (United States)

    Sehnal, David; Vařeková, Radka Svobodová; Huber, Heinrich J; Geidl, Stanislav; Ionescu, Crina-Maria; Wimmerová, Michaela; Koča, Jaroslav

    2012-02-27

    There is a paramount need to develop new techniques and tools that will extract as much information as possible from the ever growing repository of protein 3D structures. We report here on the development of a software tool for the multiple superimposition of large sets of protein structural motifs. Our superimposition methodology performs a systematic search for the atom pairing that provides the best fit. During this search, the RMSD values for all chemically relevant pairings are calculated by quaternion algebra. The number of evaluated pairings is markedly decreased by using PDB annotations for atoms. This approach guarantees that the best fit will be found and can be applied even when sequence similarity is low or does not exist at all. We have implemented this methodology in the Web application SiteBinder, which is able to process up to thousands of protein structural motifs in a very short time, and which provides an intuitive and user-friendly interface. Our benchmarking analysis has shown the robustness, efficiency, and versatility of our methodology and its implementation by the successful superimposition of 1000 experimentally determined structures for each of 32 eukaryotic linear motifs. We also demonstrate the applicability of SiteBinder using three case studies. We first compared the structures of 61 PA-IIL sugar binding sites containing nine different sugars, and we found that the sugar binding sites of PA-IIL and its mutants have a conserved structure despite their binding different sugars. We then superimposed over 300 zinc finger central motifs and revealed that the molecular structure in the vicinity of the Zn atom is highly conserved. Finally, we superimposed 12 BH3 domains from pro-apoptotic proteins. Our findings come to support the hypothesis that there is a structural basis for the functional segregation of BH3-only proteins into activators and enablers.

  1. Plant G-Proteins Come of Age: Breaking the Bond with Animal Models.

    Science.gov (United States)

    Trusov, Yuri; Botella, José R

    2016-01-01

    G-proteins are universal signal transducers mediating many cellular responses. Plant G-protein signaling has been modeled on the well-established animal paradigm but accumulated experimental evidence indicates that G-protein-dependent signaling in plants has taken a very different evolutionary path. Here we review the differences between plant and animal G-proteins reported over past two decades. Most importantly, while in animal systems the G-protein signaling cycle is activated by seven transmembrane-spanning G-protein coupled receptors, the existence of these type of receptors in plants is highly controversial. Instead plant G-proteins have been proven to be functionally associated with atypical receptors such as the Arabidopsis RGS1 and a number of receptor-like kinases. We propose that, instead of the GTP/GDP cycle used in animals, plant G-proteins are activated/de-activated by phosphorylation/de-phosphorylation. We discuss the need of a fresh new look at these signaling molecules and provide a hypothetical model that departs from the accepted animal paradigm.

  2. Saccharomyces cerevisiae as a model organism: a comparative study.

    Directory of Open Access Journals (Sweden)

    Hiren Karathia

    Full Text Available BACKGROUND: Model organisms are used for research because they provide a framework on which to develop and optimize methods that facilitate and standardize analysis. Such organisms should be representative of the living beings for which they are to serve as proxy. However, in practice, a model organism is often selected ad hoc, and without considering its representativeness, because a systematic and rational method to include this consideration in the selection process is still lacking. METHODOLOGY/PRINCIPAL FINDINGS: In this work we propose such a method and apply it in a pilot study of strengths and limitations of Saccharomyces cerevisiae as a model organism. The method relies on the functional classification of proteins into different biological pathways and processes and on full proteome comparisons between the putative model organism and other organisms for which we would like to extrapolate results. Here we compare S. cerevisiae to 704 other organisms from various phyla. For each organism, our results identify the pathways and processes for which S. cerevisiae is predicted to be a good model to extrapolate from. We find that animals in general and Homo sapiens in particular are some of the non-fungal organisms for which S. cerevisiae is likely to be a good model in which to study a significant fraction of common biological processes. We validate our approach by correctly predicting which organisms are phenotypically more distant from S. cerevisiae with respect to several different biological processes. CONCLUSIONS/SIGNIFICANCE: The method we propose could be used to choose appropriate substitute model organisms for the study of biological processes in other species that are harder to study. For example, one could identify appropriate models to study either pathologies in humans or specific biological processes in species with a long development time, such as plants.

  3. Information assessment on predicting protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Gerstein Mark

    2004-10-01

    Full Text Available Abstract Background Identifying protein-protein interactions is fundamental for understanding the molecular machinery of the cell. Proteome-wide studies of protein-protein interactions are of significant value, but the high-throughput experimental technologies suffer from high rates of both false positive and false negative predictions. In addition to high-throughput experimental data, many diverse types of genomic data can help predict protein-protein interactions, such as mRNA expression, localization, essentiality, and functional annotation. Evaluations of the information contributions from different evidences help to establish more parsimonious models with comparable or better prediction accuracy, and to obtain biological insights of the relationships between protein-protein interactions and other genomic information. Results Our assessment is based on the genomic features used in a Bayesian network approach to predict protein-protein interactions genome-wide in yeast. In the special case, when one does not have any missing information about any of the features, our analysis shows that there is a larger information contribution from the functional-classification than from expression correlations or essentiality. We also show that in this case alternative models, such as logistic regression and random forest, may be more effective than Bayesian networks for predicting interactions. Conclusions In the restricted problem posed by the complete-information subset, we identified that the MIPS and Gene Ontology (GO functional similarity datasets as the dominating information contributors for predicting the protein-protein interactions under the framework proposed by Jansen et al. Random forests based on the MIPS and GO information alone can give highly accurate classifications. In this particular subset of complete information, adding other genomic data does little for improving predictions. We also found that the data discretizations used in the

  4. Model systems for understanding absorption tuning by opsin proteins

    DEFF Research Database (Denmark)

    Nielsen, Mogens Brøndsted

    2009-01-01

    This tutorial review reports on model systems that have been synthesised and investigated for elucidating how opsin proteins tune the absorption of the protonated retinal Schiff base chromophore. In particular, the importance of the counteranion is highlighted. In addition, the review advocates...... is avoided, and it becomes clear that opsin proteins induce blueshifts in the chromophore absorption rather than redshifts....

  5. Accurate protein structure modeling using sparse NMR data and homologous structure information.

    Science.gov (United States)

    Thompson, James M; Sgourakis, Nikolaos G; Liu, Gaohua; Rossi, Paolo; Tang, Yuefeng; Mills, Jeffrey L; Szyperski, Thomas; Montelione, Gaetano T; Baker, David

    2012-06-19

    While information from homologous structures plays a central role in X-ray structure determination by molecular replacement, such information is rarely used in NMR structure determination because it can be incorrect, both locally and globally, when evolutionary relationships are inferred incorrectly or there has been considerable evolutionary structural divergence. Here we describe a method that allows robust modeling of protein structures of up to 225 residues by combining (1)H(N), (13)C, and (15)N backbone and (13)Cβ chemical shift data, distance restraints derived from homologous structures, and a physically realistic all-atom energy function. Accurate models are distinguished from inaccurate models generated using incorrect sequence alignments by requiring that (i) the all-atom energies of models generated using the restraints are lower than models generated in unrestrained calculations and (ii) the low-energy structures converge to within 2.0 Å backbone rmsd over 75% of the protein. Benchmark calculations on known structures and blind targets show that the method can accurately model protein structures, even with very remote homology information, to a backbone rmsd of 1.2-1.9 Å relative to the conventional determined NMR ensembles and of 0.9-1.6 Å relative to X-ray structures for well-defined regions of the protein structures. This approach facilitates the accurate modeling of protein structures using backbone chemical shift data without need for side-chain resonance assignments and extensive analysis of NOESY cross-peak assignments.

  6. The Colonic Microbiome and Epithelial Transcriptome Are Altered in Rats Fed a High-Protein Diet Compared with a Normal-Protein Diet.

    Science.gov (United States)

    Mu, Chunlong; Yang, Yuxiang; Luo, Zhen; Guan, Leluo; Zhu, Weiyun

    2016-03-01

    A high-protein diet (HPD) can produce hazardous compounds and reduce butyrate-producing bacteria in feces, which may be detrimental to gut health. However, information on whether HPD affects intestinal function is limited. The aim of this study was to determine the impact of an HPD on the microbiota, microbial metabolites, and epithelial transcriptome in the colons of rats. Adult male Wistar rats were fed either a normal-protein diet (20% protein, 56% carbohydrate) or an HPD (45% protein, 30% carbohydrate) for 6 wk (n = 10 rats per group, individually fed). After 6 wk, the colonic microbiome, microbial metabolites, and epithelial transcriptome were determined. Compared with the normal-protein diet, the HPD adversely altered the colonic microbiota by increasing (P 0.7, P < 0.05) with genes and metabolites generally regarded as being involved in disease pathogenesis, suggesting these bacteria may mediate the detrimental effects of HPDs on colonic health. Our findings suggest that the HPD altered the colonic microbial community, shifted the metabolic profile, and affected the host response in the colons of rats toward an increased risk of colonic disease. © 2016 American Society for Nutrition.

  7. Construction of a biodynamic model for Cry protein production studies.

    Science.gov (United States)

    Navarro-Mtz, Ana Karin; Pérez-Guevara, Fermín

    2014-12-01

    Mathematical models have been used from growth kinetic simulation to gen regulatory networks prediction for B. thuringiensis culture. However, this culture is a time dependent dynamic process where cells physiology suffers several changes depending on the changes in the cell environment. Therefore, through its culture, B. thuringiensis presents three phases related with the predominance of three major metabolic pathways: vegetative growth (Embded-Meyerhof-Parnas pathway), transition (γ-aminobutiric cycle) and sporulation (tricarboxylic acid cycle). There is not available a mathematical model that relates the different stages of cultivation with the metabolic pathway active on each one of them. Therefore, in the present study, and based on published data, a biodynamic model was generated to describe the dynamic of the three different phases based on their major metabolic pathways. The biodynamic model is used to study the interrelation between the different culture phases and their relationship with the Cry protein production. The model consists of three interconnected modules where each module represents one culture phase and its principal metabolic pathway. For model validation four new fermentations were done showing that the model constructed describes reasonably well the dynamic of the three phases. The main results of this model imply that poly-β-hydroxybutyrate is crucial for endospore and Cry protein production. According to the yields of dipicolinic acid and Cry from poly-β-hydroxybutyrate, calculated with the model, the endospore and Cry protein production are not just simultaneous and parallel processes they are also competitive processes.

  8. Comparative immunological evaluation of recombinant Salmonella Typhimurium strains expressing model antigens as live oral vaccines.

    Science.gov (United States)

    Zheng, Song-yue; Yu, Bin; Zhang, Ke; Chen, Min; Hua, Yan-Hong; Yuan, Shuofeng; Watt, Rory M; Zheng, Bo-Jian; Yuen, Kwok-Yung; Huang, Jian-Dong

    2012-09-26

    Despite the development of various systems to generate live recombinant Salmonella Typhimurium vaccine strains, little work has been performed to systematically evaluate and compare their relative immunogenicity. Such information would provide invaluable guidance for the future rational design of live recombinant Salmonella oral vaccines. To compare vaccine strains encoded with different antigen delivery and expression strategies, a series of recombinant Salmonella Typhimurium strains were constructed that expressed either the enhanced green fluorescent protein (EGFP) or a fragment of the hemagglutinin (HA) protein from the H5N1 influenza virus, as model antigens. The antigens were expressed from the chromosome, from high or low-copy plasmids, or encoded on a eukaryotic expression plasmid. Antigens were targeted for expression in either the cytoplasm or the outer membrane. Combinations of strategies were employed to evaluate the efficacy of combined delivery/expression approaches. After investigating in vitro and in vivo antigen expression, growth and infection abilities; the immunogenicity of the constructed recombinant Salmonella strains was evaluated in mice. Using the soluble model antigen EGFP, our results indicated that vaccine strains with high and stable antigen expression exhibited high B cell responses, whilst eukaryotic expression or colonization with good construct stability was critical for T cell responses. For the insoluble model antigen HA, an outer membrane expression strategy induced better B cell and T cell responses than a cytoplasmic strategy. Most notably, the combination of two different expression strategies did not increase the immune response elicited. Through systematically evaluating and comparing the immunogenicity of the constructed recombinant Salmonella strains in mice, we identified their respective advantages and deleterious or synergistic effects. Different construction strategies were optimally-required for soluble versus

  9. Modeling structure of G protein-coupled receptors in huan genome

    KAUST Repository

    Zhang, Yang

    2016-01-26

    G protein-coupled receptors (or GPCRs) are integral transmembrane proteins responsible to various cellular signal transductions. Human GPCR proteins are encoded by 5% of human genes but account for the targets of 40% of the FDA approved drugs. Due to difficulties in crystallization, experimental structure determination remains extremely difficult for human GPCRs, which have been a major barrier in modern structure-based drug discovery. We proposed a new hybrid protocol, GPCR-I-TASSER, to construct GPCR structure models by integrating experimental mutagenesis data with ab initio transmembrane-helix assembly simulations, assisted by the predicted transmembrane-helix interaction networks. The method was tested in recent community-wide GPCRDock experiments and constructed models with a root mean square deviation 1.26 Å for Dopamine-3 and 2.08 Å for Chemokine-4 receptors in the transmembrane domain regions, which were significantly closer to the native than the best templates available in the PDB. GPCR-I-TASSER has been applied to model all 1,026 putative GPCRs in the human genome, where 923 are found to have correct folds based on the confidence score analysis and mutagenesis data comparison. The successfully modeled GPCRs contain many pharmaceutically important families that do not have previously solved structures, including Trace amine, Prostanoids, Releasing hormones, Melanocortins, Vasopressin and Neuropeptide Y receptors. All the human GPCR models have been made publicly available through the GPCR-HGmod database at http://zhanglab.ccmb.med.umich.edu/GPCR-HGmod/ The results demonstrate new progress on genome-wide structure modeling of transmembrane proteins which should bring useful impact on the effort of GPCR-targeted drug discovery.

  10. Electrostatics of cysteine residues in proteins: Parameterization and validation of a simple model

    Science.gov (United States)

    Salsbury, Freddie R.; Poole, Leslie B.; Fetrow, Jacquelyn S.

    2013-01-01

    One of the most popular and simple models for the calculation of pKas from a protein structure is the semi-macroscopic electrostatic model MEAD. This model requires empirical parameters for each residue to calculate pKas. Analysis of current, widely used empirical parameters for cysteine residues showed that they did not reproduce expected cysteine pKas; thus, we set out to identify parameters consistent with the CHARMM27 force field that capture both the behavior of typical cysteines in proteins and the behavior of cysteines which have perturbed pKas. The new parameters were validated in three ways: (1) calculation across a large set of typical cysteines in proteins (where the calculations are expected to reproduce expected ensemble behavior); (2) calculation across a set of perturbed cysteines in proteins (where the calculations are expected to reproduce the shifted ensemble behavior); and (3) comparison to experimentally determined pKa values (where the calculation should reproduce the pKa within experimental error). Both the general behavior of cysteines in proteins and the perturbed pKa in some proteins can be predicted reasonably well using the newly determined empirical parameters within the MEAD model for protein electrostatics. This study provides the first general analysis of the electrostatics of cysteines in proteins, with specific attention paid to capturing both the behavior of typical cysteines in a protein and the behavior of cysteines whose pKa should be shifted, and validation of force field parameters for cysteine residues. PMID:22777874

  11. Comparative genomic analysis identified a mutation related to enhanced heterologous protein production in the filamentous fungus Aspergillus oryzae.

    Science.gov (United States)

    Jin, Feng-Jie; Katayama, Takuya; Maruyama, Jun-Ichi; Kitamoto, Katsuhiko

    2016-11-01

    Genomic mapping of mutations using next-generation sequencing technologies has facilitated the identification of genes contributing to fundamental biological processes, including human diseases. However, few studies have used this approach to identify mutations contributing to heterologous protein production in industrial strains of filamentous fungi, such as Aspergillus oryzae. In a screening of A. oryzae strains that hyper-produce human lysozyme (HLY), we previously isolated an AUT1 mutant that showed higher production of various heterologous proteins; however, the underlying factors contributing to the increased heterologous protein production remained unclear. Here, using a comparative genomic approach performed with whole-genome sequences, we attempted to identify the genes responsible for the high-level production of heterologous proteins in the AUT1 mutant. The comparative sequence analysis led to the detection of a gene (AO090120000003), designated autA, which was predicted to encode an unknown cytoplasmic protein containing an alpha/beta-hydrolase fold domain. Mutation or deletion of autA was associated with higher production levels of HLY. Specifically, the HLY yields of the autA mutant and deletion strains were twofold higher than that of the control strain during the early stages of cultivation. Taken together, these results indicate that combining classical mutagenesis approaches with comparative genomic analysis facilitates the identification of novel genes involved in heterologous protein production in filamentous fungi.

  12. Hidden Markov model approach for identifying the modular framework of the protein backbone.

    Science.gov (United States)

    Camproux, A C; Tuffery, P; Chevrolat, J P; Boisvieux, J F; Hazout, S

    1999-12-01

    The hidden Markov model (HMM) was used to identify recurrent short 3D structural building blocks (SBBs) describing protein backbones, independently of any a priori knowledge. Polypeptide chains are decomposed into a series of short segments defined by their inter-alpha-carbon distances. Basically, the model takes into account the sequentiality of the observed segments and assumes that each one corresponds to one of several possible SBBs. Fitting the model to a database of non-redundant proteins allowed us to decode proteins in terms of 12 distinct SBBs with different roles in protein structure. Some SBBs correspond to classical regular secondary structures. Others correspond to a significant subdivision of their bounding regions previously considered to be a single pattern. The major contribution of the HMM is that this model implicitly takes into account the sequential connections between SBBs and thus describes the most probable pathways by which the blocks are connected to form the framework of the protein structures. Validation of the SBBs code was performed by extracting SBB series repeated in recoding proteins and examining their structural similarities. Preliminary results on the sequence specificity of SBBs suggest promising perspectives for the prediction of SBBs or series of SBBs from the protein sequences.

  13. Comparative vesicle proteomics reveals selective regulation of protein expression in chestnut blight fungus by a hypovirus.

    Science.gov (United States)

    Wang, Jinzi; Wang, Fangzhen; Feng, Youjun; Mi, Ke; Chen, Qi; Shang, Jinjie; Chen, Baoshan

    2013-01-14

    The chestnut blight fungus (Cryphonectria parasitica) and hypovirus constitute a model system to study fungal pathogenesis and mycovirus-host interaction. Knowledge in this field has been gained largely from investigations at gene transcription level so far. Here we report a systematic analysis of the vesicle proteins of the host fungus with/without hypovirus infection. Thirty-three differentially expressed protein spots were identified in the purified vesicle protein samples by two-dimensional electrophoresis and mass spectrometry. Down-regulated proteins were mostly cargo proteins involved in primary metabolism and energy generation and up-regulated proteins were mostly vesicle associated proteins and ABC transporter. A virus-encoded protein p48 was found to have four forms with different molecular mass in vesicles from the virus-infected strain. While a few of the randomly selected differentially expressed proteins were in accordance with their transcription profiles, majority were not in agreement with their mRNA accumulation patterns, suggesting that an extensive post-transcriptional regulation may have occurred in the host fungus upon a hypovirus infection. Copyright © 2012 Elsevier B.V. All rights reserved.

  14. 3D modelling of the pathogenic Leptospira protein LipL32: A bioinformatics approach.

    Science.gov (United States)

    Kumaran, Sharmilah Kumari; Bakar, Mohd Faizal Abu; Mohd-Padil, Hirzahida; Mat-Sharani, Shuhaila; Sakinah, S; Poorani, K; Alsaeedy, Hiba; Peli, Amira; Wei, Teh Seoh; Ling, Mok Pooi; Hamat, Rukman Awang; Neela, Vasantha Kumari; Higuchi, Akon; Alarfaj, Abdullah A; Rajan, Mariappan; Benelli, Giovanni; Arulselvan, Palanisamy; Kumar, S Suresh

    2017-12-01

    Leptospirosis is a widespread zoonotic disease caused by pathogenic Leptospira species (Leptospiraceae). LipL32 is an abundant lipoprotein from the outer membrane proteins (OMPs) group, highly conserved among pathogenic and intermediate Leptospira species. Several studies used LipL32 as a specific gene to identify the presence of leptospires. This research was aimed to study the characteristics of LipL32 protein gene code, to fill the knowledge gap concerning the most appropriate gene that can be used as antigen to detect the Leptospira. Here, we investigated the features of LipL32 in fourteen Leptospira pathogenic strains based on comparative analyses of their primary, secondary structures and 3D modeling using a bioinformatics approach. Furthermore, the physicochemical properties of LipL32 in different strains were studied, shedding light on the identity of signal peptides, as well as on the secondary and tertiary structure of the LipL32 protein, supported by 3D modelling assays. The results showed that the LipL32 gene was present in all the fourteen pathogenic Leptospira strains used in this study, with limited diversity in terms of sequence conservation, hydrophobic group, hydrophilic group and number of turns (random coil). Overall, these results add basic knowledge to the characteristics of LipL32 protein, contributing to the identification of potential antigen candidates in future research, in order to ensure prompt and reliable detection of pathogenic Leptospira species. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. A Novel Approach for Protein-Named Entity Recognition and Protein-Protein Interaction Extraction

    Directory of Open Access Journals (Sweden)

    Meijing Li

    2015-01-01

    Full Text Available Many researchers focus on developing protein-named entity recognition (Protein-NER or PPI extraction systems. However, the studies about these two topics cannot be merged well; then existing PPI extraction systems’ Protein-NER still needs to improve. In this paper, we developed the protein-protein interaction extraction system named PPIMiner based on Support Vector Machine (SVM and parsing tree. PPIMiner consists of three main models: natural language processing (NLP model, Protein-NER model, and PPI discovery model. The Protein-NER model, which is named ProNER, identifies the protein names based on two methods: dictionary-based method and machine learning-based method. ProNER is capable of identifying more proteins than dictionary-based Protein-NER model in other existing systems. The final discovered PPIs extracted via PPI discovery model are represented in detail because we showed the protein interaction types and the occurrence frequency through two different methods. In the experiments, the result shows that the performances achieved by our ProNER and PPI discovery model are better than other existing tools. PPIMiner applied this protein-named entity recognition approach and parsing tree based PPI extraction method to improve the performance of PPI extraction. We also provide an easy-to-use interface to access PPIs database and an online system for PPIs extraction and Protein-NER.

  16. Comparative differential gene expression analysis of nucleus-encoded proteins for Rafflesia cantleyi against Arabidopsis thaliana

    Science.gov (United States)

    Ng, Siuk-Mun; Lee, Xin-Wei; Wan, Kiew-Lian; Firdaus-Raih, Mohd

    2015-09-01

    Regulation of functional nucleus-encoded proteins targeting the plastidial functions was comparatively studied for a plant parasite, Rafflesia cantleyi versus a photosynthetic plant, Arabidopsis thaliana. This study involved two species of different feeding modes and different developmental stages. A total of 30 nucleus-encoded proteins were found to be differentially-regulated during two stages in the parasite; whereas 17 nucleus-encoded proteins were differentially-expressed during two developmental stages in Arabidopsis thaliana. One notable finding observed for the two plants was the identification of genes involved in the regulation of photosynthesis-related processes where these processes, as expected, seem to be present only in the autotroph.

  17. Predicting turns in proteins with a unified model.

    Directory of Open Access Journals (Sweden)

    Qi Song

    Full Text Available MOTIVATION: Turns are a critical element of the structure of a protein; turns play a crucial role in loops, folds, and interactions. Current prediction methods are well developed for the prediction of individual turn types, including α-turn, β-turn, and γ-turn, etc. However, for further protein structure and function prediction it is necessary to develop a uniform model that can accurately predict all types of turns simultaneously. RESULTS: In this study, we present a novel approach, TurnP, which offers the ability to investigate all the turns in a protein based on a unified model. The main characteristics of TurnP are: (i using newly exploited features of structural evolution information (secondary structure and shape string of protein based on structure homologies, (ii considering all types of turns in a unified model, and (iii practical capability of accurate prediction of all turns simultaneously for a query. TurnP utilizes predicted secondary structures and predicted shape strings, both of which have greater accuracy, based on innovative technologies which were both developed by our group. Then, sequence and structural evolution features, which are profile of sequence, profile of secondary structures and profile of shape strings are generated by sequence and structure alignment. When TurnP was validated on a non-redundant dataset (4,107 entries by five-fold cross-validation, we achieved an accuracy of 88.8% and a sensitivity of 71.8%, which exceeded the most state-of-the-art predictors of certain type of turn. Newly determined sequences, the EVA and CASP9 datasets were used as independent tests and the results we achieved were outstanding for turn predictions and confirmed the good performance of TurnP for practical applications.

  18. Rosetta comparative modeling for library design: Engineering alternative inducer specificity in a transcription factor.

    Science.gov (United States)

    Jha, Ramesh K; Chakraborti, Subhendu; Kern, Theresa L; Fox, David T; Strauss, Charlie E M

    2015-07-01

    Structure-based rational mutagenesis for engineering protein functionality has been limited by the scarcity and difficulty of obtaining crystal structures of desired proteins. On the other hand, when high-throughput selection is possible, directed evolution-based approaches for gaining protein functionalities have been random and fortuitous with limited rationalization. We combine comparative modeling of dimer structures, ab initio loop reconstruction, and ligand docking to select positions for mutagenesis to create a library focused on the ligand-contacting residues. The rationally reduced library requirement enabled conservative control of the substitutions by oligonucleotide synthesis and bounding its size within practical transformation efficiencies (∼ 10(7) variants). This rational approach was successfully applied on an inducer-binding domain of an Acinetobacter transcription factor (TF), pobR, which shows high specificity for natural effector molecule, 4-hydroxy benzoate (4HB), but no native response to 3,4-dihydroxy benzoate (34DHB). Selection for mutants with high transcriptional induction by 34DHB was carried out at the single-cell level under flow cytometry (via green fluorescent protein expression under the control of pobR promoter). Critically, this selection protocol allows both selection for induction and rejection of constitutively active mutants. In addition to gain-of-function for 34DHB induction, the selected mutants also showed enhanced sensitivity and response for 4HB (native inducer) while no sensitivity was observed for a non-targeted but chemically similar molecule, 2-hydroxy benzoate (2HB). This is unique application of the Rosetta modeling protocols for library design to engineer a TF. Our approach extends applicability of the Rosetta redesign protocol into regimes without a priori precision structural information. © 2015 Wiley Periodicals, Inc.

  19. Prediction of beta-turns in proteins using the first-order Markov models.

    Science.gov (United States)

    Lin, Thy-Hou; Wang, Ging-Ming; Wang, Yen-Tseng

    2002-01-01

    We present a method based on the first-order Markov models for predicting simple beta-turns and loops containing multiple turns in proteins. Sequences of 338 proteins in a database are divided using the published turn criteria into the following three regions, namely, the turn, the boundary, and the nonturn ones. A transition probability matrix is constructed for either the turn or the nonturn region using the weighted transition probabilities computed for dipeptides identified from each region. There are two such matrices constructed for the boundary region since the transition probabilities for dipeptides immediately preceding or following a turn are different. The window used for scanning a protein sequence from amino (N-) to carboxyl (C-) terminal is a hexapeptide since the transition probability computed for a turn tetrapeptide is capped at both the N- and C- termini with a boundary transition probability indexed respectively from the two boundary transition matrices. A sum of the averaged product of the transition probabilities of all the hexapeptides involving each residue is computed. This is then weighted with a probability computed from assuming that all the hexapeptides are from the nonturn region to give the final prediction quantity. Both simple beta-turns and loops containing multiple turns in a protein are then identified by the rising of the prediction quantity computed. The performance of the prediction scheme or the percentage (%) of correct prediction is evaluated through computation of Matthews correlation coefficients for each protein predicted. It is found that the prediction method is capable of giving prediction results with better correlation between the percent of correct prediction and the Matthews correlation coefficients for a group of test proteins as compared with those predicted using some secondary structural prediction methods. The prediction accuracy for about 40% of proteins in the database or 50% of proteins in the test set is

  20. Electronic transport on the spatial structure of the protein: Three-dimensional lattice model

    International Nuclear Information System (INIS)

    Sarmento, R.G.; Frazão, N.F.; Macedo-Filho, A.

    2017-01-01

    Highlights: • The electronic transport on the structure of the three-dimensional lattice model of the protein is studied. • The signing of the current–voltage is directly affected by permutations of the weak bonds in the structure. • Semiconductor behave of the proteins suggest a potential application in the development of novel biosensors. - Abstract: We report a numerical analysis of the electronic transport in protein chain consisting of thirty-six standard amino acids. The protein chains studied have three-dimensional structure, which can present itself in three distinct conformations and the difference consist in the presence or absence of thirteen hydrogen-bondings. Our theoretical method uses an electronic tight-binding Hamiltonian model, appropriate to describe the protein segments modeled by the amino acid chain. We note that the presence and the permutations between weak bonds in the structure of proteins are directly related to the signing of the current–voltage. Furthermore, the electronic transport depends on the effect of temperature. In addition, we have found a semiconductor behave in the models investigated and it suggest a potential application in the development of novel biosensors for molecular diagnostics.

  1. Electronic transport on the spatial structure of the protein: Three-dimensional lattice model

    Energy Technology Data Exchange (ETDEWEB)

    Sarmento, R.G. [Departamento de Ciências Biológicas, Universidade Federal do Piauí, 64800-000 Floriano, PI (Brazil); Frazão, N.F. [Centro de Educação e Saúde, Universidade Federal de Campina Grande, 581750-000 Cuité, PB (Brazil); Macedo-Filho, A., E-mail: amfilho@gmail.com [Campus Prof. Antonio Geovanne Alves de Sousa, Universidade Estadual do Piauí, 64260-000 Piripiri, PI (Brazil)

    2017-01-30

    Highlights: • The electronic transport on the structure of the three-dimensional lattice model of the protein is studied. • The signing of the current–voltage is directly affected by permutations of the weak bonds in the structure. • Semiconductor behave of the proteins suggest a potential application in the development of novel biosensors. - Abstract: We report a numerical analysis of the electronic transport in protein chain consisting of thirty-six standard amino acids. The protein chains studied have three-dimensional structure, which can present itself in three distinct conformations and the difference consist in the presence or absence of thirteen hydrogen-bondings. Our theoretical method uses an electronic tight-binding Hamiltonian model, appropriate to describe the protein segments modeled by the amino acid chain. We note that the presence and the permutations between weak bonds in the structure of proteins are directly related to the signing of the current–voltage. Furthermore, the electronic transport depends on the effect of temperature. In addition, we have found a semiconductor behave in the models investigated and it suggest a potential application in the development of novel biosensors for molecular diagnostics.

  2. Predicted cycloartenol synthase protein from Kandelia obovata and Rhizophora stylosa using online software of Phyre2 and Swiss-model

    Science.gov (United States)

    Basyuni, M.; Sulistiyono, N.; Wati, R.; Sumardi; Oku, H.; Baba, S.; Sagami, H.

    2018-03-01

    Cloning of Kandelia obovata KcCAS gene (previously known as Kandelia candel) and Rhizophora stylosa RsCAS have already have been reported and encoded cycloartenol synthases. In this study, the predicted KcCAS and RsCAS protein were analyzed using online software of Phyre2 and Swiss-model. The protein modelling for KcCAS and RsCAS cycloartenol synthases was determined using Pyre2 had similar results with slightly different in sequence identity. By contrast, the Swiss-model for KcCAS slightly had higher sequence identity (47.31%) and Qmean (0.70) compared to RsCAS. No difference of ligands binding site which is considered as modulators for both cycloartenol synthases. The range of predicted protein derived from 91-757 amino acid residues with coverage sequence similarities 0.86, respectively from template model of lanosterol synthase from the human. Homology modelling revealed that 706 residues (93% of the amino acid sequence) had been modelled with 100.0% confidence by the single highest scoring template for both KcCAS and RsCAS using Phyre2. This coverage was more elevated than swiss-model predicted (86%). The present study suggested that both genes are responsible for the genesis of cycloartenol in these mangrove plants.

  3. Modelling responses of broiler chickens to dietary balanced protein

    NARCIS (Netherlands)

    Eits, R.M.

    2004-01-01

    Protein is an important nutrient for growing broiler chickens, as it affects broiler performance, feed cost as well as nitrogen excretion. The objective of this dissertation was to develop a growth model for broiler chickens that could be easily used by practical nutritionists. The model should

  4. Mathematical Modeling of Protein Misfolding Mechanisms in Neurological Diseases: A Historical Overview.

    Science.gov (United States)

    Carbonell, Felix; Iturria-Medina, Yasser; Evans, Alan C

    2018-01-01

    Protein misfolding refers to a process where proteins become structurally abnormal and lose their specific 3-dimensional spatial configuration. The histopathological presence of misfolded protein (MP) aggregates has been associated as the primary evidence of multiple neurological diseases, including Prion diseases, Alzheimer's disease, Parkinson's disease, and Creutzfeldt-Jacob disease. However, the exact mechanisms of MP aggregation and propagation, as well as their impact in the long-term patient's clinical condition are still not well understood. With this aim, a variety of mathematical models has been proposed for a better insight into the kinetic rate laws that govern the microscopic processes of protein aggregation. Complementary, another class of large-scale models rely on modern molecular imaging techniques for describing the phenomenological effects of MP propagation over the whole brain. Unfortunately, those neuroimaging-based studies do not take full advantage of the tremendous capabilities offered by the chemical kinetics modeling approach. Actually, it has been barely acknowledged that the vast majority of large-scale models have foundations on previous mathematical approaches that describe the chemical kinetics of protein replication and propagation. The purpose of the current manuscript is to present a historical review about the development of mathematical models for describing both microscopic processes that occur during the MP aggregation and large-scale events that characterize the progression of neurodegenerative MP-mediated diseases.

  5. MOLECULAR MODELING INDICATES THAT HOMOCYSTEINE INDUCES CONFORMATIONAL CHANGES IN THE STRUCTURE OF PUTATIVE TARGET PROTEINS

    Directory of Open Access Journals (Sweden)

    Yumnam Silla

    2015-09-01

    Full Text Available An elevated level of homocysteine, a reactive thiol containing amino acid is associated with a multitude of complex diseases. A majority (>80% of homocysteine in circulation is bound to protein cysteine residues. Although, till date only 21 proteins have been experimentally shown to bind with homocysteine, using an insilico approach we had earlier identified several potential target proteins that could bind with homocysteine. Shomocysteinylation of proteins could potentially alter the structure and/or function of the protein. Earlier studies have shown that binding of homocysteine to protein alters its function. However, the effect of homocysteine on the target protein structure has not yet been documented. In the present work, we assess conformational or structural changes if any due to protein homocysteinylation using two proteins, granzyme B (GRAB and junctional adhesion molecule 1 (JAM1, which could potentially bind to homocysteine. We, for the first time, constructed computational models of homocysteine bound to target proteins and monitored their structural changes using explicit solvent molecular dynamic (MD simulation. Analysis of homocysteine bound trajectories revealed higher flexibility of the active site residues and local structural perturbations compared to the unbound native structure’s simulation, which could affect the stability of the protein. In addition, secondary structure analysis of homocysteine bound trajectories also revealed disappearance of â-helix within the G-helix and linker region that connects between the domain regions (as defined in the crystal structure. Our study thus captures the conformational transitions induced by homocysteine and we suggest these structural alterations might have implications for hyperhomocysteinemia induced pathologies.

  6. Model system-guided protein interaction mapping for virus isolated from phloem tissue

    Science.gov (United States)

    Potato leafroll virus (PLRV) is an agriculturally important phloem-limited pathogen that causes significant yield loss in potato (Solanum tuberosum) and a model virus in the Luteoviridae. Encoding only a small repertoire of viral proteins, PLRV relies on carefully orchestrated protein-protein intera...

  7. SCT: a suite of programs for comparing atomistic models with small-angle scattering data.

    Science.gov (United States)

    Wright, David W; Perkins, Stephen J

    2015-06-01

    Small-angle X-ray and neutron scattering techniques characterize proteins in solution and complement high-resolution structural studies. They are of particular utility when large proteins cannot be crystallized or when the structure is altered by solution conditions. Atomistic models of the averaged structure can be generated through constrained modelling, a technique in which known domain or subunit structures are combined with linker models to produce candidate global conformations. By randomizing the configuration adopted by the different elements of the model, thousands of candidate structures are produced. Next, theoretical scattering curves are generated for each model for trial-and-error fits to the experimental data. From these, a small family of best-fit models is identified. In order to facilitate both the computation of theoretical scattering curves from atomistic models and their comparison with experiment, the SCT suite of tools was developed. SCT also includes programs that provide sequence-based estimates of protein volume (either incorporating hydration or not) and add a hydration layer to models for X-ray scattering modelling. The original SCT software, written in Fortran, resulted in the first atomistic scattering structures to be deposited in the Protein Data Bank, and 77 structures for antibodies, complement proteins and anionic oligosaccharides were determined between 1998 and 2014. For the first time, this software is publicly available, alongside an easier-to-use reimplementation of the same algorithms in Python. Both versions of SCT have been released as open-source software under the Apache 2 license and are available for download from https://github.com/dww100/sct.

  8. Markov state models of protein misfolding

    Science.gov (United States)

    Sirur, Anshul; De Sancho, David; Best, Robert B.

    2016-02-01

    Markov state models (MSMs) are an extremely useful tool for understanding the conformational dynamics of macromolecules and for analyzing MD simulations in a quantitative fashion. They have been extensively used for peptide and protein folding, for small molecule binding, and for the study of native ensemble dynamics. Here, we adapt the MSM methodology to gain insight into the dynamics of misfolded states. To overcome possible flaws in root-mean-square deviation (RMSD)-based metrics, we introduce a novel discretization approach, based on coarse-grained contact maps. In addition, we extend the MSM methodology to include "sink" states in order to account for the irreversibility (on simulation time scales) of processes like protein misfolding. We apply this method to analyze the mechanism of misfolding of tandem repeats of titin domains, and how it is influenced by confinement in a chaperonin-like cavity.

  9. A Comparative of business process modelling techniques

    Science.gov (United States)

    Tangkawarow, I. R. H. T.; Waworuntu, J.

    2016-04-01

    In this era, there is a lot of business process modeling techniques. This article is the research about differences of business process modeling techniques. For each technique will explain about the definition and the structure. This paper presents a comparative analysis of some popular business process modelling techniques. The comparative framework is based on 2 criteria: notation and how it works when implemented in Somerleyton Animal Park. Each technique will end with the advantages and disadvantages. The final conclusion will give recommend of business process modeling techniques that easy to use and serve the basis for evaluating further modelling techniques.

  10. The Princeton Protein Orthology Database (P-POD): a comparative genomics analysis tool for biologists.

    OpenAIRE

    Sven Heinicke; Michael S Livstone; Charles Lu; Rose Oughtred; Fan Kang; Samuel V Angiuoli; Owen White; David Botstein; Kara Dolinski

    2007-01-01

    Many biological databases that provide comparative genomics information and tools are now available on the internet. While certainly quite useful, to our knowledge none of the existing databases combine results from multiple comparative genomics methods with manually curated information from the literature. Here we describe the Princeton Protein Orthology Database (P-POD, http://ortholog.princeton.edu), a user-friendly database system that allows users to find and visualize the phylogenetic r...

  11. Electrostatics of cysteine residues in proteins: parameterization and validation of a simple model.

    Science.gov (United States)

    Salsbury, Freddie R; Poole, Leslie B; Fetrow, Jacquelyn S

    2012-11-01

    One of the most popular and simple models for the calculation of pK(a) s from a protein structure is the semi-macroscopic electrostatic model MEAD. This model requires empirical parameters for each residue to calculate pK(a) s. Analysis of current, widely used empirical parameters for cysteine residues showed that they did not reproduce expected cysteine pK(a) s; thus, we set out to identify parameters consistent with the CHARMM27 force field that capture both the behavior of typical cysteines in proteins and the behavior of cysteines which have perturbed pK(a) s. The new parameters were validated in three ways: (1) calculation across a large set of typical cysteines in proteins (where the calculations are expected to reproduce expected ensemble behavior); (2) calculation across a set of perturbed cysteines in proteins (where the calculations are expected to reproduce the shifted ensemble behavior); and (3) comparison to experimentally determined pK(a) values (where the calculation should reproduce the pK(a) within experimental error). Both the general behavior of cysteines in proteins and the perturbed pK(a) in some proteins can be predicted reasonably well using the newly determined empirical parameters within the MEAD model for protein electrostatics. This study provides the first general analysis of the electrostatics of cysteines in proteins, with specific attention paid to capturing both the behavior of typical cysteines in a protein and the behavior of cysteines whose pK(a) should be shifted, and validation of force field parameters for cysteine residues. Copyright © 2012 Wiley Periodicals, Inc.

  12. Protein-protein interactions in paralogues: Electrostatics modulates specificity on a conserved steric scaffold.

    Directory of Open Access Journals (Sweden)

    Stefan M Ivanov

    Full Text Available An improved knowledge of protein-protein interactions is essential for better understanding of metabolic and signaling networks, and cellular function. Progress tends to be based on structure determination and predictions using known structures, along with computational methods based on evolutionary information or detailed atomistic descriptions. We hypothesized that for the case of interactions across a common interface, between proteins from a pair of paralogue families or within a family of paralogues, a relatively simple interface description could distinguish between binding and non-binding pairs. Using binding data for several systems, and large-scale comparative modeling based on known template complex structures, it is found that charge-charge interactions (for groups bearing net charge are generally a better discriminant than buried non-polar surface. This is particularly the case for paralogue families that are less divergent, with more reliable comparative modeling. We suggest that electrostatic interactions are major determinants of specificity in such systems, an observation that could be used to predict binding partners.

  13. Protein-protein interactions in paralogues: Electrostatics modulates specificity on a conserved steric scaffold.

    Science.gov (United States)

    Ivanov, Stefan M; Cawley, Andrew; Huber, Roland G; Bond, Peter J; Warwicker, Jim

    2017-01-01

    An improved knowledge of protein-protein interactions is essential for better understanding of metabolic and signaling networks, and cellular function. Progress tends to be based on structure determination and predictions using known structures, along with computational methods based on evolutionary information or detailed atomistic descriptions. We hypothesized that for the case of interactions across a common interface, between proteins from a pair of paralogue families or within a family of paralogues, a relatively simple interface description could distinguish between binding and non-binding pairs. Using binding data for several systems, and large-scale comparative modeling based on known template complex structures, it is found that charge-charge interactions (for groups bearing net charge) are generally a better discriminant than buried non-polar surface. This is particularly the case for paralogue families that are less divergent, with more reliable comparative modeling. We suggest that electrostatic interactions are major determinants of specificity in such systems, an observation that could be used to predict binding partners.

  14. Putative drug and vaccine target protein identification using comparative genomic analysis of KEGG annotated metabolic pathways of Mycoplasma hyopneumoniae.

    Science.gov (United States)

    Damte, Dereje; Suh, Joo-Won; Lee, Seung-Jin; Yohannes, Sileshi Belew; Hossain, Md Akil; Park, Seung-Chun

    2013-07-01

    In the present study, a computational comparative and subtractive genomic/proteomic analysis aimed at the identification of putative therapeutic target and vaccine candidate proteins from Kyoto Encyclopedia of Genes and Genomes (KEGG) annotated metabolic pathways of Mycoplasma hyopneumoniae was performed for drug design and vaccine production pipelines against M.hyopneumoniae. The employed comparative genomic and metabolic pathway analysis with a predefined computational systemic workflow extracted a total of 41 annotated metabolic pathways from KEGG among which five were unique to M. hyopneumoniae. A total of 234 proteins were identified to be involved in these metabolic pathways. Although 125 non homologous and predicted essential proteins were found from the total that could serve as potential drug targets and vaccine candidates, additional prioritizing parameters characterize 21 proteins as vaccine candidate while druggability of each of the identified proteins evaluated by the DrugBank database prioritized 42 proteins suitable for drug targets. Copyright © 2013 Elsevier Inc. All rights reserved.

  15. A Comparative Analysis of the Mechanism of Toll-Like Receptor-Disruption by TIR-Containing Protein C from Uropathogenic Escherichia coli

    Directory of Open Access Journals (Sweden)

    Anna Waldhuber

    2016-02-01

    Full Text Available The TIR-containing protein C (TcpC of uropathogenic Escherichia coli strains is a powerful virulence factor by impairing the signaling cascade of Toll-like receptors (TLRs. Several other bacterial pathogens like Salmonella, Yersinia, Staphylococcus aureus but also non-pathogens express similar proteins. We discuss here the pathogenic potential of TcpC and its interaction with TLRs and TLR-adapter proteins on the molecular level and compare its activity with the activity of other bacterial TIR-containing proteins. Finally, we analyze and compare the structure of bacterial TIR-domains with the TIR-domains of TLRs and TLR-adapters.

  16. Bayesian Proteoform Modeling Improves Protein Quantification of Global Proteomic Measurements

    Energy Technology Data Exchange (ETDEWEB)

    Webb-Robertson, Bobbie-Jo M.; Matzke, Melissa M.; Datta, Susmita; Payne, Samuel H.; Kang, Jiyun; Bramer, Lisa M.; Nicora, Carrie D.; Shukla, Anil K.; Metz, Thomas O.; Rodland, Karin D.; Smith, Richard D.; Tardiff, Mark F.; McDermott, Jason E.; Pounds, Joel G.; Waters, Katrina M.

    2014-12-01

    As the capability of mass spectrometry-based proteomics has matured, tens of thousands of peptides can be measured simultaneously, which has the benefit of offering a systems view of protein expression. However, a major challenge is that with an increase in throughput, protein quantification estimation from the native measured peptides has become a computational task. A limitation to existing computationally-driven protein quantification methods is that most ignore protein variation, such as alternate splicing of the RNA transcript and post-translational modifications or other possible proteoforms, which will affect a significant fraction of the proteome. The consequence of this assumption is that statistical inference at the protein level, and consequently downstream analyses, such as network and pathway modeling, have only limited power for biomarker discovery. Here, we describe a Bayesian model (BP-Quant) that uses statistically derived peptides signatures to identify peptides that are outside the dominant pattern, or the existence of multiple over-expressed patterns to improve relative protein abundance estimates. It is a research-driven approach that utilizes the objectives of the experiment, defined in the context of a standard statistical hypothesis, to identify a set of peptides exhibiting similar statistical behavior relating to a protein. This approach infers that changes in relative protein abundance can be used as a surrogate for changes in function, without necessarily taking into account the effect of differential post-translational modifications, processing, or splicing in altering protein function. We verify the approach using a dilution study from mouse plasma samples and demonstrate that BP-Quant achieves similar accuracy as the current state-of-the-art methods at proteoform identification with significantly better specificity. BP-Quant is available as a MatLab ® and R packages at https://github.com/PNNL-Comp-Mass-Spec/BP-Quant.

  17. Protein modelling of triterpene synthase genes from mangrove plants using Phyre2 and Swiss-model

    Science.gov (United States)

    Basyuni, M.; Wati, R.; Sulistiyono, N.; Hayati, R.; Sumardi; Oku, H.; Baba, S.; Sagami, H.

    2018-03-01

    Molecular cloning of five oxidosqualene cyclases (OSC) genes from Bruguiera gymnorrhiza, Kandelia candel, and Rhizophora stylosa had previously been cloned, characterized, and encoded mono and -multi triterpene synthases. The present study analyzed protein modelling of triterpene synthase genes from mangrove using Phyre2 and Swiss-model. The diversity was noted within protein modelling of triterpene synthases using Phyre2 from sequence identity (38-43%) and residue (696-703). RsM2 was distinguishable from others for template structure; it used lanosterol synthase as a template (PDB ID: w6j.1.A). By contrast, other genes used human lanosterol synthase (1w6k.1.A). The predicted bind sites were correlated with the product of triterpene synthase, the product of BgbAS was β-amyrin, while RsM1 contained a significant amount of β-amyrin. Similarly BgLUS and KcMS, both main products was lupeol, on the other hand, RsM2 with the outcome of taraxerol. Homology modelling revealed that 696 residues of BgbAS, BgLUS, RsM1, and RsM2 (91-92% of the amino acid sequence) had been modelled with 100% confidence by the single highest scoring template using Phyre2. This coverage was higher than Swiss-model (85-90%). The present study suggested that molecular cloning of triterpene genes provides useful tools for studying the protein modelling related regulation of isoprenoids biosynthesis in mangrove forests.

  18. A feature-based approach to modeling protein-protein interaction hot spots.

    Science.gov (United States)

    Cho, Kyu-il; Kim, Dongsup; Lee, Doheon

    2009-05-01

    Identifying features that effectively represent the energetic contribution of an individual interface residue to the interactions between proteins remains problematic. Here, we present several new features and show that they are more effective than conventional features. By combining the proposed features with conventional features, we develop a predictive model for interaction hot spots. Initially, 54 multifaceted features, composed of different levels of information including structure, sequence and molecular interaction information, are quantified. Then, to identify the best subset of features for predicting hot spots, feature selection is performed using a decision tree. Based on the selected features, a predictive model for hot spots is created using support vector machine (SVM) and tested on an independent test set. Our model shows better overall predictive accuracy than previous methods such as the alanine scanning methods Robetta and FOLDEF, and the knowledge-based method KFC. Subsequent analysis yields several findings about hot spots. As expected, hot spots have a larger relative surface area burial and are more hydrophobic than other residues. Unexpectedly, however, residue conservation displays a rather complicated tendency depending on the types of protein complexes, indicating that this feature is not good for identifying hot spots. Of the selected features, the weighted atomic packing density, relative surface area burial and weighted hydrophobicity are the top 3, with the weighted atomic packing density proving to be the most effective feature for predicting hot spots. Notably, we find that hot spots are closely related to pi-related interactions, especially pi . . . pi interactions.

  19. Comparing human-Salmonella with plant-Salmonella protein-protein interaction predictions

    Directory of Open Access Journals (Sweden)

    Sylvia eSchleker

    2015-01-01

    Full Text Available Salmonellosis is the most frequent food-borne disease world-wide and can be transmitted to humans by a variety of routes, especially via animal and plant products. Salmonella bacteria are believed to use not only animal and human but also plant hosts despite their evolutionary distance. This raises the question if Salmonella employs similar mechanisms in infection of these diverse hosts. Given that most of our understanding comes from its interaction with human hosts, we investigate here to what degree knowledge of Salmonella-human interactions can be transferred to the Salmonella-plant system. Reviewed are recent publications on analysis and prediction of Salmonella-host interactomes. Putative protein-protein interactions (PPIs between Salmonella and its human and Arabidopsis hosts were retrieved utilizing purely interolog-based approaches in which predictions were inferred based on available sequence and domain information of known PPIs, and machine learning approaches that integrate a larger set of useful information from different sources. Transfer learning is an especially suitable machine learning technique to predict plant host targets from the knowledge of human host targets. A comparison of the prediction results with transcriptomic data shows a clear overlap between the host proteins predicted to be targeted by PPIs and their gene ontology enrichment in both host species and regulation of gene expression. In particular, the cellular processes Salmonella interferes with in plants and humans are catabolic processes. The details of how these processes are targeted, however, are quite different between the two organisms, as expected based on their evolutionary and habitat differences. Possible implications of this observation on evolution of host-pathogen communication are discussed.

  20. A Mathematical Model of the Effect of Immunogenicity on Therapeutic Protein Pharmacokinetics

    OpenAIRE

    Chen, Xiaoying; Hickling, Timothy; Kraynov, Eugenia; Kuang, Bing; Parng, Chuenlei; Vicini, Paolo

    2013-01-01

    A mathematical pharmacokinetic/anti-drug-antibody (PK/ADA) model was constructed for quantitatively assessing immunogenicity for therapeutic proteins. The model is inspired by traditional pharmacokinetic/pharmacodynamic (PK/PD) models, and is based on the observed impact of ADA on protein drug clearance. The hypothesis for this work is that altered drug PK contains information about the extent and timing of ADA generation. By fitting drug PK profiles while accounting for ADA-mediated drug cle...

  1. Comparative proteomic analyses reveal that the regulators of G-protein signaling proteins regulate amino acid metabolism of the rice blast fungus Magnaporthe oryzae.

    Science.gov (United States)

    Zhang, Haifeng; Ma, Hongyu; Xie, Xin; Ji, Jun; Dong, Yanhan; Du, Yan; Tang, Wei; Zheng, Xiaobo; Wang, Ping; Zhang, Zhengguang

    2014-11-01

    The rice blast fungus Magnaporthe oryzae encodes eight regulators of G-protein (GTP-binding protein) signaling (RGS) proteins MoRgs1-MoRgs8 that orchestrate the growth, asexual/sexual production, appressorium differentiation, and pathogenicity. To address the mechanisms by which MoRgs proteins function, we conducted a 2DE proteome study and identified 82 differentially expressed proteins by comparing five ∆Morgs mutants with wild-type Guy11 strain. We found that the abundances of eight amino acid (AA) biosynthesis or degradation associated proteins were markedly altered in five ∆Morgs mutants, indicating one of the main collective roles for the MoRgs proteins is to influence AA metabolism. We showed that MoRgs proteins have distinct roles in AA metabolism and nutrient responses from growth assays. In addition, we characterized MoLys20 (Lys is lysine), a homocitrate synthase, whose abundance was significantly decreased in the ∆Morgs mutants. The ∆Molys20 mutant is auxotrophic for lys and exogenous lys could partially rescue its auxotrophic defects. Deletion of MoLYS20 resulted in defects in conidiation and infection, as well as pathogenicity on rice. Overall, our results indicate that one of the critical roles for MoRgs proteins is to regulate AA metabolism, and that MoLys20 may be directly or indirectly regulated by MoRgs and participated in lys biosynthesis, thereby affecting fungal development and pathogenicity. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  2. Identification of proteins in hyperglycemia and stroke animal models.

    Science.gov (United States)

    Sung, Jin-Hee; Shah, Fawad-Ali; Gim, Sang-Ah; Koh, Phil-Ok

    2016-01-01

    Stroke is a major cause of disability and death in adults. Diabetes mellitus is a metabolic disorder that strongly increases the risk of severe vascular diseases. This study compared changes in proteins of the cerebral cortex during ischemic brain injury between nondiabetic and diabetic animals. Adult male rats were injected with streptozotocin (40 mg/kg) via the intraperitoneal route to induce diabetes and underwent surgical middle cerebral artery occlusion (MCAO) 4 wk after streptozotocin treatment. Cerebral cortex tissues were collected 24 h after MCAO and cerebral cortex proteins were analyzed by two-dimensional gel electrophoresis and mass spectrometry. Several proteins were identified as differentially expressed between nondiabetic and diabetic animals. Among the identified proteins, we focused on the following metabolism-related enzymes: isocitrate dehydrogenase, glyceraldehyde-3-phosphate dehydrogenase, adenosylhomocysteinase, pyruvate kinase, and glucose-6-phosphate isomerase (neuroleukin). Expression of these proteins was decreased in animals that underwent MCAO. Moreover, protein expression was reduced to a greater extent in diabetic animals than in nondiabetic animals. Reverse transcription-polymerase chain reaction analysis confirmed that the diabetic condition exacerbates the decrease in expression of metabolism-related proteins after MCAO. These results suggest that the diabetic condition may exacerbate brain damage during focal cerebral ischemia through the downregulation of metabolism-related proteins. Copyright © 2016 Elsevier Inc. All rights reserved.

  3. From the Protein's Perspective: The Benefits and Challenges of Protein Structure-Based Pharmacophore Modeling

    NARCIS (Netherlands)

    Sanders, M.P.A.; McGuire, R; Roumen, L.; de Esch, I.J.P.; de Vlieg, J; Klomp, J.P.G; de Graaf, C.

    2011-01-01

    A pharmacophore describes the arrangement of molecular features a ligand must contain to efficaciously bind a receptor. Pharmacophore models are developed to improve molecular understanding of ligand-protein interactions, and can be used as a tool to identify novel compounds that fulfil the

  4. Analysing the origin of long-range interactions in proteins using lattice models

    Directory of Open Access Journals (Sweden)

    Unger Ron

    2009-01-01

    Full Text Available Abstract Background Long-range communication is very common in proteins but the physical basis of this phenomenon remains unclear. In order to gain insight into this problem, we decided to explore whether long-range interactions exist in lattice models of proteins. Lattice models of proteins have proven to capture some of the basic properties of real proteins and, thus, can be used for elucidating general principles of protein stability and folding. Results Using a computational version of double-mutant cycle analysis, we show that long-range interactions emerge in lattice models even though they are not an input feature of them. The coupling energy of both short- and long-range pairwise interactions is found to become more positive (destabilizing in a linear fashion with increasing 'contact-frequency', an entropic term that corresponds to the fraction of states in the conformational ensemble of the sequence in which the pair of residues is in contact. A mathematical derivation of the linear dependence of the coupling energy on 'contact-frequency' is provided. Conclusion Our work shows how 'contact-frequency' should be taken into account in attempts to stabilize proteins by introducing (or stabilizing contacts in the native state and/or through 'negative design' of non-native contacts.

  5. Pathogenicity of Vibrio anguillarum serogroup O1 strains compared to plasmids, outer membrane protein profiles and siderophore production

    DEFF Research Database (Denmark)

    Pedersen, K.; Gram, Lone; Austin, D.A.

    1997-01-01

    The virulence of 18 strains of Vibrio anguillarum serogroup 01 was compared to plasmid content, expression of siderophores and outer membrane proteins. All strains, irrespective of plasmid content, produced siderophores and inducible outer membrane proteins under iron-limited conditions. Only str...

  6. Comparative analyses of transport proteins encoded within the genomes of Leptospira species.

    Science.gov (United States)

    Buyuktimkin, Bora; Saier, Milton H

    2016-09-01

    Select species of the bacterial genus Leptospira are causative agents of leptospirosis, an emerging global zoonosis affecting nearly one million people worldwide annually. We examined two Leptospira pathogens, Leptospira interrogans serovar Lai str. 56601 and Leptospira borgpetersenii serovar Hardjo-bovis str. L550, as well as the free-living leptospiral saprophyte, Leptospira biflexa serovar Patoc str. 'Patoc 1 (Ames)'. The transport proteins of these leptospires were identified and compared using bioinformatics to gain an appreciation for which proteins may be related to pathogenesis and saprophytism. L. biflexa possesses a disproportionately high number of secondary carriers for metabolite uptake and environmental adaptability as well as an increased number of inorganic cation transporters providing ionic homeostasis and effective osmoregulation in a rapidly changing environment. L. interrogans and L. borgpetersenii possess far fewer transporters, but those that they all have are remarkably similar, with near-equivalent representation in most transporter families. These two Leptospira pathogens also possess intact sphingomyelinases, holins, and virulence-related outer membrane porins. These virulence-related factors, in conjunction with decreased transporter substrate versatility, indicate that pathogenicity arose in Leptospira correlating to progressively narrowing ecological niches and the emergence of a limited set of proteins responsible for host invasion. The variability of host tropism and mortality rates by infectious leptospires suggests that small differences in individual sets of proteins play important physiological and pathological roles. Copyright © 2016. Published by Elsevier Ltd.

  7. Protein model discrimination using mutational sensitivity derived from deep sequencing.

    Science.gov (United States)

    Adkar, Bharat V; Tripathi, Arti; Sahoo, Anusmita; Bajaj, Kanika; Goswami, Devrishi; Chakrabarti, Purbani; Swarnkar, Mohit K; Gokhale, Rajesh S; Varadarajan, Raghavan

    2012-02-08

    A major bottleneck in protein structure prediction is the selection of correct models from a pool of decoys. Relative activities of ∼1,200 individual single-site mutants in a saturation library of the bacterial toxin CcdB were estimated by determining their relative populations using deep sequencing. This phenotypic information was used to define an empirical score for each residue (RankScore), which correlated with the residue depth, and identify active-site residues. Using these correlations, ∼98% of correct models of CcdB (RMSD ≤ 4Å) were identified from a large set of decoys. The model-discrimination methodology was further validated on eleven different monomeric proteins using simulated RankScore values. The methodology is also a rapid, accurate way to obtain relative activities of each mutant in a large pool and derive sequence-structure-function relationships without protein isolation or characterization. It can be applied to any system in which mutational effects can be monitored by a phenotypic readout. Copyright © 2012 Elsevier Ltd. All rights reserved.

  8. Toward a rigorous network of protein-protein interactions of the model sulfate reducer Desulfovibrio vulgaris Hildenborough

    Energy Technology Data Exchange (ETDEWEB)

    Chhabra, S.R.; Joachimiak, M.P.; Petzold, C.J.; Zane, G.M.; Price, M.N.; Gaucher, S.; Reveco, S.A.; Fok, V.; Johanson, A.R.; Batth, T.S.; Singer, M.; Chandonia, J.M.; Joyner, D.; Hazen, T.C.; Arkin, A.P.; Wall, J.D.; Singh, A.K.; Keasling, J.D.

    2011-05-01

    Protein–protein interactions offer an insight into cellular processes beyond what may be obtained by the quantitative functional genomics tools of proteomics and transcriptomics. The aforementioned tools have been extensively applied to study E. coli and other aerobes and more recently to study the stress response behavior of Desulfovibrio 5 vulgaris Hildenborough, a model anaerobe and sulfate reducer. In this paper we present the first attempt to identify protein-protein interactions in an obligate anaerobic bacterium. We used suicide vector-assisted chromosomal modification of 12 open reading frames encoded by this sulfate reducer to append an eight amino acid affinity tag to the carboxy-terminus of the chosen proteins. Three biological replicates of the 10 ‘pulled-down’ proteins were separated and analyzed using liquid chromatography-mass spectrometry. Replicate agreement ranged between 35% and 69%. An interaction network among 12 bait and 90 prey proteins was reconstructed based on 134 bait-prey interactions computationally identified to be of high confidence. We discuss the biological significance of several unique metabolic features of D. vulgaris revealed by this protein-protein interaction data 15 and protein modifications that were observed. These include the distinct role of the putative carbon monoxide-induced hydrogenase, unique electron transfer routes associated with different oxidoreductases, and the possible role of methylation in regulating sulfate reduction.

  9. Towards a rigorous network of protein-protein interactions of the model sulfate reducer Desulfovibrio vulgaris Hildenborough.

    Directory of Open Access Journals (Sweden)

    Swapnil R Chhabra

    Full Text Available Protein-protein interactions offer an insight into cellular processes beyond what may be obtained by the quantitative functional genomics tools of proteomics and transcriptomics. The aforementioned tools have been extensively applied to study Escherichia coli and other aerobes and more recently to study the stress response behavior of Desulfovibrio vulgaris Hildenborough, a model obligate anaerobe and sulfate reducer and the subject of this study. Here we carried out affinity purification followed by mass spectrometry to reconstruct an interaction network among 12 chromosomally encoded bait and 90 prey proteins based on 134 bait-prey interactions identified to be of high confidence. Protein-protein interaction data are often plagued by the lack of adequate controls and replication analyses necessary to assess confidence in the results, including identification of potential false positives. We addressed these issues through the use of biological replication, exponentially modified protein abundance indices, results from an experimental negative control, and a statistical test to assign confidence to each putative interacting pair applicable to small interaction data studies. We discuss the biological significance of metabolic features of D. vulgaris revealed by these protein-protein interaction data and the observed protein modifications. These include the distinct role of the putative carbon monoxide-induced hydrogenase, unique electron transfer routes associated with different oxidoreductases, and the possible role of methylation in regulating sulfate reduction.

  10. Modelling of microbial polyhydroxyalkanoate surface binding protein PhaP for rational mutagenesis.

    Science.gov (United States)

    Zhao, Hongyu; Yao, Zhenyu; Chen, Xiangbin; Wang, Xinquan; Chen, Guo-Qiang

    2017-11-01

    Phasins are unusual amphiphilic proteins that bind to microbial polyhydroxyalkanoate (PHA) granules in nature and show great potential for various applications in biotechnology and medicine. Despite their remarkable diversity, only the crystal structure of PhaP A h from Aeromonas hydrophila has been solved to date. Based on the structure of PhaP A h , homology models of PhaP A z from Azotobacter sp. FA-8 and PhaP TD from Halomonas bluephagenesis TD were successfully established, allowing rational mutagenesis to be conducted to enhance the stability and surfactant properties of these proteins. PhaP A z mutants, including PhaP A z Q38L and PhaP A z Q78L, as well as PhaP TD mutants, including PhaP TD Q38M and PhaP TD Q72M, showed better emulsification properties and improved thermostability (6-10°C higher melting temperatures) compared with their wild-type homologues under the same conditions. Importantly, the established PhaP homology-modelling approach, based on the high-resolution structure of PhaP A h , can be generalized to facilitate the study of other PhaP members. © 2017 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.

  11. Modelling small-angle scattering data from complex protein-lipid systems

    DEFF Research Database (Denmark)

    Kynde, Søren Andreas Røssell

    This thesis consists of two parts. The rst part is divided into five chapters. Chapter 1 gives a general introduction to the bio-molecular systems that have been studied. These are membrane proteins and their lipid environments in the form of phospholipid nanodiscs. Membrane proteins...... the techniques very well suited for the study of the nanodisc system. Chapter 3 explains two different modelling approaches that can be used in the analysis of small-angle scattering data from lipid-protein complexes. These are the continuous approach where the system of interest is modelled as a few regular...... combine the bene ts of each of the methods and give unique structural information about relevant bio-molecular complexes in solution. Chapter 4 describes the work behind a proposal of a small-angle neutron scattering instrument for the European Spallation Source under construction in Lund. The instrument...

  12. Lessons from Animal Models of Cytoplasmic Intermediate Filament Proteins.

    Science.gov (United States)

    Bouameur, Jamal-Eddine; Magin, Thomas M

    Cytoplasmic intermediate filaments (IFs) represent a major cytoskeletal network contributing to cell shape, adhesion and migration as well as to tissue resilience and renewal in numerous bilaterians, including mammals. The observation that IFs are dispensable in cultured mammalian cells, but cause tissue-specific, life-threatening disorders, has pushed the need to investigate their function in vivo. In keeping with human disease, the deletion or mutation of murine IF genes resulted in highly specific pathologies. Epidermal keratins, together with desmin, are essential to protect corresponding tissues against mechanical force but also participate in stabilizing cell adhesion and in inflammatory signalling. Surprisingly, other IF proteins contribute to tissue integrity to a much lesser extent than anticipated, pointing towards their role in stress situations. In support, the overexpression of small chaperones or the interference with inflammatory signalling in several settings has been shown to rescue severe tissue pathologies that resulted from the expression of mutant IF proteins. It stills remains an open issue whether the wide range of IF disorders share similar pathomechanisms. Moreover, we lack an understanding how IF proteins participate in signalling processes. Now, with a large number of mouse models in hand, the next challenge will be to develop organotypic cell culture models to dissect pathomechanisms at the molecular level, to employ Crispr/Cas-mediated genome engineering to optimize models and, finally, to combine available animal models with medicinal chemistry for the development of molecular therapies.

  13. Dynamical modeling of microRNA action on the protein translation process.

    Science.gov (United States)

    Zinovyev, Andrei; Morozova, Nadya; Nonne, Nora; Barillot, Emmanuel; Harel-Bellan, Annick; Gorban, Alexander N

    2010-02-24

    Protein translation is a multistep process which can be represented as a cascade of biochemical reactions (initiation, ribosome assembly, elongation, etc.), the rate of which can be regulated by small non-coding microRNAs through multiple mechanisms. It remains unclear what mechanisms of microRNA action are the most dominant: moreover, many experimental reports deliver controversial messages on what is the concrete mechanism actually observed in the experiment. Nissan and Parker have recently demonstrated that it might be impossible to distinguish alternative biological hypotheses using the steady state data on the rate of protein synthesis. For their analysis they used two simple kinetic models of protein translation. In contrary to the study by Nissan and Parker, we show that dynamical data allow discriminating some of the mechanisms of microRNA action. We demonstrate this using the same models as developed by Nissan and Parker for the sake of comparison but the methods developed (asymptotology of biochemical networks) can be used for other models. We formulate a hypothesis that the effect of microRNA action is measurable and observable only if it affects the dominant system (generalization of the limiting step notion for complex networks) of the protein translation machinery. The dominant system can vary in different experimental conditions that can partially explain the existing controversy of some of the experimental data. Our analysis of the transient protein translation dynamics shows that it gives enough information to verify or reject a hypothesis about a particular molecular mechanism of microRNA action on protein translation. For multiscale systems only that action of microRNA is distinguishable which affects the parameters of dominant system (critical parameters), or changes the dominant system itself. Dominant systems generalize and further develop the old and very popular idea of limiting step. Algorithms for identifying dominant systems in multiscale

  14. Dynamical modeling of microRNA action on the protein translation process

    Directory of Open Access Journals (Sweden)

    Barillot Emmanuel

    2010-02-01

    Full Text Available Abstract Background Protein translation is a multistep process which can be represented as a cascade of biochemical reactions (initiation, ribosome assembly, elongation, etc., the rate of which can be regulated by small non-coding microRNAs through multiple mechanisms. It remains unclear what mechanisms of microRNA action are the most dominant: moreover, many experimental reports deliver controversial messages on what is the concrete mechanism actually observed in the experiment. Nissan and Parker have recently demonstrated that it might be impossible to distinguish alternative biological hypotheses using the steady state data on the rate of protein synthesis. For their analysis they used two simple kinetic models of protein translation. Results In contrary to the study by Nissan and Parker, we show that dynamical data allow discriminating some of the mechanisms of microRNA action. We demonstrate this using the same models as developed by Nissan and Parker for the sake of comparison but the methods developed (asymptotology of biochemical networks can be used for other models. We formulate a hypothesis that the effect of microRNA action is measurable and observable only if it affects the dominant system (generalization of the limiting step notion for complex networks of the protein translation machinery. The dominant system can vary in different experimental conditions that can partially explain the existing controversy of some of the experimental data. Conclusions Our analysis of the transient protein translation dynamics shows that it gives enough information to verify or reject a hypothesis about a particular molecular mechanism of microRNA action on protein translation. For multiscale systems only that action of microRNA is distinguishable which affects the parameters of dominant system (critical parameters, or changes the dominant system itself. Dominant systems generalize and further develop the old and very popular idea of limiting step

  15. Global optimization of proteins using a dynamical lattice model: Ground states and energy landscapes

    OpenAIRE

    Dressel, F.; Kobe, S.

    2004-01-01

    A simple approach is proposed to investigate the protein structure. Using a low complexity model, a simple pairwise interaction and the concept of global optimization, we are able to calculate ground states of proteins, which are in agreement with experimental data. All possible model structures of small proteins are available below a certain energy threshold. The exact lowenergy landscapes for the trp cage protein (1L2Y) is presented showing the connectivity of all states and energy barriers.

  16. Protein structure modelling and evaluation based on a 4-distance description of side-chain interactions

    Directory of Open Access Journals (Sweden)

    Inbar Yuval

    2010-07-01

    Full Text Available Abstract Background Accurate evaluation and modelling of residue-residue interactions within and between proteins is a key aspect of computational structure prediction including homology modelling, protein-protein docking, refinement of low-resolution structures, and computational protein design. Results Here we introduce a method for accurate protein structure modelling and evaluation based on a novel 4-distance description of residue-residue interaction geometry. Statistical 4-distance preferences were extracted from high-resolution protein structures and were used as a basis for a knowledge-based potential, called Hunter. We demonstrate that 4-distance description of side chain interactions can be used reliably to discriminate the native structure from a set of decoys. Hunter ranked the native structure as the top one in 217 out of 220 high-resolution decoy sets, in 25 out of 28 "Decoys 'R' Us" decoy sets and in 24 out of 27 high-resolution CASP7/8 decoy sets. The same concept was applied to side chain modelling in protein structures. On a set of very high-resolution protein structures the average RMSD was 1.47 Å for all residues and 0.73 Å for buried residues, which is in the range of attainable accuracy for a model. Finally, we show that Hunter performs as good or better than other top methods in homology modelling based on results from the CASP7 experiment. The supporting web site http://bioinfo.weizmann.ac.il/hunter/ was developed to enable the use of Hunter and for visualization and interactive exploration of 4-distance distributions. Conclusions Our results suggest that Hunter can be used as a tool for evaluation and for accurate modelling of residue-residue interactions in protein structures. The same methodology is applicable to other areas involving high-resolution modelling of biomolecules.

  17. Modeling of allergen proteins found in sea food products

    Directory of Open Access Journals (Sweden)

    Nataly Galán-Freyle

    2012-06-01

    Full Text Available Shellfish are a source of food allergens, and their consumption is the cause of severe allergic reactions in humans. Tropomyosins, a family of muscle proteins, have been identified as the major allergens in shellfish and mollusks species. Nevertheless, few experimentally determined three-dimensional structures are available in the Protein Data Base (PDB. In this study, 3D models of several homologous of tropomyosins present in marine shellfish and mollusk species (Chaf 1, Met e1, Hom a1, Per v1, and Pen a1 were constructed, validated, and their immunoglobulin E binding epitopes were identified using bioinformatics tools. All protein models for these allergens consisted of long alpha-helices. Chaf 1, Met e1, and Hom a1 had six conserved regions with sequence similarities to known epitopes, whereas Per v1 and Pen a1 contained only one. Lipophilic potentials of identified epitopes revealed a high propensity of hydrophobic amino acids in the immunoglobulin E binding site. This information could be useful to design tropomyosin-specific immunotherapy for sea food allergies.

  18. Photobinding of tiaprofenic acid and suprofen to proteins and cells: a combined study using radiolabeling, antibodies and laser flash photolysis of model bichromophores.

    Science.gov (United States)

    Castell, J V; Hernández, D; Gómez-Lechón, M J; Lahoz, A; Miranda, M A; Morera, I M; Pérez-Prieto, J; Sarabia, Z

    1998-11-01

    Drug photoallergy is a matter of current concern. It involves the formation of drug-protein photoadducts (photoantigens) that may ultimately trigger an immunological response. Tyrosine residues appear to be key binding sites in proteins. The present work has investigated the photobinding of tiaprofenic and (TPA) and the closely related isomer suprofen (SUP) to proteins and cells by means of radioactive labelling and drug-directed antibodies. To ascertain whether preassociation with the protein may play a role in photoreactivity, two model bichromophoric compounds (TPA-Tyr and SUP-Tyr) have been prepared and studied by laser flash photolysis. The results of this work show that (a) TPA and SUP photobind to proteins with similar efficiencies, (b) both drugs form photoadducts that share a basic common structure, as they are recognized by the same antibody and (c) drug-protein preassociation must play a key role in photoreactivity, as indicated by the dramatic decrease in the triplet state lifetimes of the model bichromophores compared to the parent drugs.

  19. The Yeast Saccharomyces cerevisiae as a Model for Understanding RAS Proteins and Their Role in Human Tumorigenesis

    Science.gov (United States)

    Cazzanelli, Giulia; Francisco, Rita; Azevedo, Luísa; Carvalho, Patrícia Dias; Almeida, Ana; Côrte-Real, Manuela; Oliveira, Maria José; Lucas, Cândida; Sousa, Maria João

    2018-01-01

    The exploitation of the yeast Saccharomyces cerevisiae as a biological model for the investigation of complex molecular processes conserved in multicellular organisms, such as humans, has allowed fundamental biological discoveries. When comparing yeast and human proteins, it is clear that both amino acid sequences and protein functions are often very well conserved. One example of the high degree of conservation between human and yeast proteins is highlighted by the members of the RAS family. Indeed, the study of the signaling pathways regulated by RAS in yeast cells led to the discovery of properties that were often found interchangeable with RAS proto-oncogenes in human pathways, and vice versa. In this work, we performed an updated critical literature review on human and yeast RAS pathways, specifically highlighting the similarities and differences between them. Moreover, we emphasized the contribution of studying yeast RAS pathways for the understanding of human RAS and how this model organism can contribute to unveil the roles of RAS oncoproteins in the regulation of mechanisms important in the tumorigenic process, like autophagy. PMID:29463063

  20. Effects of Whey Protein Hydrolysate Ingestion on Postprandial Aminoacidemia Compared with a Free Amino Acid Mixture in Young Men

    Directory of Open Access Journals (Sweden)

    Kyosuke Nakayama

    2018-04-01

    Full Text Available To stimulate muscle protein synthesis, it is important to increase the plasma levels of essential amino acids (EAA, especially leucine, by ingesting proteins. Protein hydrolysate ingestion can induce postprandial hyperaminoacidemia; however, it is unclear whether protein hydrolysate is associated with higher levels of aminoacidemia compared with a free amino acid mixture when both are ingested orally. We assessed the effects of whey protein hydrolysate (WPH ingestion on postprandial aminoacidemia, especially plasma leucine levels, compared to ingestion of a free amino acid mixture. This study was an open-label, randomized, 4 × 4 Latin square design. After 12–15 h of fasting, 11 healthy young men ingested the WPH (3.3, 5.0, or 7.5 g of protein or the EAA mixture (2.5 g. Blood samples were collected before ingestion and at time points from 10 to 120 min after ingestion, and amino acids, insulin, glucose and insulin-like growth factor-1 (IGF-1 concentrations in plasma were measured. Even though the EAA mixture and 5.0 g of the WPH contained similar amounts of EAA and leucine, the WPH was associated with significantly higher plasma EAA and leucine levels. These results suggest that the WPH can induce a higher level of aminoacidemia compared with a free amino acid mixture when both are ingested orally.

  1. Model to predict inhomogeneous protein-sugar distribution in powders prepared by spray drying

    NARCIS (Netherlands)

    Grasmeijer, Niels; Frijlink, Henderik W.; Hinrichs, Wouter L. J.

    2016-01-01

    A protein can be stabilized by spray drying an aqueous solution of the protein and a sugar, thereby incorporating the protein into a glassy sugar matrix. For optimal stability, the protein should be homogeneously distributed inside the sugar matrix. The aim of this study was to develop a model that

  2. On the application of the MARTINI coarse-grained model to immersion of a protein in a phospholipid bilayer

    Energy Technology Data Exchange (ETDEWEB)

    Mustafa, Ghulam, E-mail: Ghulam.Mustafa@h-its.org, E-mail: rebecca.wade@h-its.org; Nandekar, Prajwal P.; Yu, Xiaofeng [Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS), Schloß-Wolfsbrunnenweg 35, 69118 Heidelberg (Germany); Wade, Rebecca C., E-mail: Ghulam.Mustafa@h-its.org, E-mail: rebecca.wade@h-its.org [Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS), Schloß-Wolfsbrunnenweg 35, 69118 Heidelberg (Germany); Zentrum für Molekulare Biologie der Universität Heidelberg, DKFZ-ZMBH Alliance, INF 282, 69120 Heidelberg (Germany); Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University, INF 368, 69120 Heidelberg (Germany)

    2015-12-28

    An important step in the simulation of a membrane protein in a phospholipid bilayer is the correct immersion of the protein in the bilayer. Crystal structures are determined without the bilayer. Particularly for proteins with monotopic domains, it can be unclear how deeply and in which orientation the protein is being inserted in the membrane. We have previously developed a procedure combining coarse-grain (CG) with all-atom (AA) molecular dynamics (MD) simulations to insert and simulate a cytochrome P450 (CYP) possessing an N-terminal transmembrane helix connected by a flexible linker region to a globular domain that dips into the membrane. The CG simulations provide a computationally efficient means to explore different orientations and conformations of the CYP in the membrane. Converged configurations obtained in the CG simulations are then refined in AA simulations. Here, we tested different variants of the MARTINI CG model, differing in the water model, the treatment of long-range non-bonded interactions, and the implementation (GROMACS 4.5.5 vs 5.0.4), for this purpose. We examined the behavior of the models for simulating a 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC) bilayer in water and for the immersion of CYP3A4 in a POPC bilayer, and compared the CG-MD results with the previously reported experimental and simulation results. We also tested the methodology on a set of four other CYPs. Finally, we propose an optimized protocol for modeling such protein-membrane systems that provides the most plausible configurations and is computationally efficient; this incorporates the standard non-polar water model and the GROMACS 5.0.4 implementation with a reaction field treatment of long-range interactions.

  3. On the application of the MARTINI coarse-grained model to immersion of a protein in a phospholipid bilayer

    International Nuclear Information System (INIS)

    Mustafa, Ghulam; Nandekar, Prajwal P.; Yu, Xiaofeng; Wade, Rebecca C.

    2015-01-01

    An important step in the simulation of a membrane protein in a phospholipid bilayer is the correct immersion of the protein in the bilayer. Crystal structures are determined without the bilayer. Particularly for proteins with monotopic domains, it can be unclear how deeply and in which orientation the protein is being inserted in the membrane. We have previously developed a procedure combining coarse-grain (CG) with all-atom (AA) molecular dynamics (MD) simulations to insert and simulate a cytochrome P450 (CYP) possessing an N-terminal transmembrane helix connected by a flexible linker region to a globular domain that dips into the membrane. The CG simulations provide a computationally efficient means to explore different orientations and conformations of the CYP in the membrane. Converged configurations obtained in the CG simulations are then refined in AA simulations. Here, we tested different variants of the MARTINI CG model, differing in the water model, the treatment of long-range non-bonded interactions, and the implementation (GROMACS 4.5.5 vs 5.0.4), for this purpose. We examined the behavior of the models for simulating a 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC) bilayer in water and for the immersion of CYP3A4 in a POPC bilayer, and compared the CG-MD results with the previously reported experimental and simulation results. We also tested the methodology on a set of four other CYPs. Finally, we propose an optimized protocol for modeling such protein-membrane systems that provides the most plausible configurations and is computationally efficient; this incorporates the standard non-polar water model and the GROMACS 5.0.4 implementation with a reaction field treatment of long-range interactions

  4. Animal Models of Congenital Cardiomyopathies Associated With Mutations in Z-Line Proteins.

    Science.gov (United States)

    Bang, Marie-Louise

    2017-01-01

    The cardiac Z-line at the boundary between sarcomeres is a multiprotein complex connecting the contractile apparatus with the cytoskeleton and the extracellular matrix. The Z-line is important for efficient force generation and transmission as well as the maintenance of structural stability and integrity. Furthermore, it is a nodal point for intracellular signaling, in particular mechanosensing and mechanotransduction. Mutations in various genes encoding Z-line proteins have been associated with different cardiomyopathies, including dilated cardiomyopathy, hypertrophic cardiomyopathy, arrhythmogenic right ventricular cardiomyopathy, restrictive cardiomyopathy, and left ventricular noncompaction, and mutations even within the same gene can cause widely different pathologies. Animal models have contributed to a great advancement in the understanding of the physiological function of Z-line proteins and the pathways leading from mutations in Z-line proteins to cardiomyopathy, although genotype-phenotype prediction remains a great challenge. This review presents an overview of the currently available animal models for Z-line and Z-line associated proteins involved in human cardiomyopathies with special emphasis on knock-in and transgenic mouse models recapitulating the clinical phenotypes of human cardiomyopathy patients carrying mutations in Z-line proteins. Pros and cons of mouse models will be discussed and a future outlook will be given. J. Cell. Physiol. 232: 38-52, 2017. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  5. Oral sensitization to food proteins: A Brown Norway rat model

    NARCIS (Netherlands)

    Knippels, L.M.J.; Penninks, A.H.; Spanhaak, S.; Houben, G.F.

    1998-01-01

    Background: Although several in vivo antigenicity assays using parenteral immunization are operational, no adequate enteral sensitization models are available to study food allergy and allergenicity of food proteins. Objective: This paper describes the development of an enteral model for food

  6. A comparative modeling and molecular docking study on Mycobacterium tuberculosis targets involved in peptidoglycan biosynthesis.

    Science.gov (United States)

    Fakhar, Zeynab; Naiker, Suhashni; Alves, Claudio N; Govender, Thavendran; Maguire, Glenn E M; Lameira, Jeronimo; Lamichhane, Gyanu; Kruger, Hendrik G; Honarparvar, Bahareh

    2016-11-01

    An alarming rise of multidrug-resistant Mycobacterium tuberculosis strains and the continuous high global morbidity of tuberculosis have reinvigorated the need to identify novel targets to combat the disease. The enzymes that catalyze the biosynthesis of peptidoglycan in M. tuberculosis are essential and noteworthy therapeutic targets. In this study, the biochemical function and homology modeling of MurI, MurG, MraY, DapE, DapA, Alr, and Ddl enzymes of the CDC1551 M. tuberculosis strain involved in the biosynthesis of peptidoglycan cell wall are reported. Generation of the 3D structures was achieved with Modeller 9.13. To assess the structural quality of the obtained homology modeled targets, the models were validated using PROCHECK, PDBsum, QMEAN, and ERRAT scores. Molecular dynamics simulations were performed to calculate root mean square deviation (RMSD) and radius of gyration (Rg) of MurI and MurG target proteins and their corresponding templates. For further model validation, RMSD and Rg for selected targets/templates were investigated to compare the close proximity of their dynamic behavior in terms of protein stability and average distances. To identify the potential binding mode required for molecular docking, binding site information of all modeled targets was obtained using two prediction algorithms. A docking study was performed for MurI to determine the potential mode of interaction between the inhibitor and the active site residues. This study presents the first accounts of the 3D structural information for the selected M. tuberculosis targets involved in peptidoglycan biosynthesis.

  7. Incorporating water-release and lateral protein interactions in modeling equilibrium adsorption for ion-exchange chromatography.

    Science.gov (United States)

    Thrash, Marvin E; Pinto, Neville G

    2006-09-08

    The equilibrium adsorption of two albumin proteins on a commercial ion exchanger has been studied using a colloidal model. The model accounts for electrostatic and van der Waals forces between proteins and the ion exchanger surface, the energy of interaction between adsorbed proteins, and the contribution of entropy from water-release accompanying protein adsorption. Protein-surface interactions were calculated using methods previously reported in the literature. Lateral interactions between adsorbed proteins were experimentally measured with microcalorimetry. Water-release was estimated by applying the preferential interaction approach to chromatographic retention data. The adsorption of ovalbumin and bovine serum albumin on an anion exchanger at solution pH>pI of protein was measured. The experimental isotherms have been modeled from the linear region to saturation, and the influence of three modulating alkali chlorides on capacity has been evaluated. The heat of adsorption is endothermic for all cases studied, despite the fact that the net charge on the protein is opposite that of the adsorbing surface. Strong repulsive forces between adsorbed proteins underlie the endothermic heat of adsorption, and these forces intensify with protein loading. It was found that the driving force for adsorption is the entropy increase due to the release of water from the protein and adsorbent surfaces. It is shown that the colloidal model predicts protein adsorption capacity in both the linear and non-linear isotherm regions, and can account for the effects of modulating salt.

  8. Postprandial lipemia detects the effect of soy protein on cardiovascular disease risk compared with the fasting lipid profile.

    Science.gov (United States)

    Santo, Antonio S; Santo, Ariana M; Browne, Richard W; Burton, Harold; Leddy, John J; Horvath, Steven M; Horvath, Peter J

    2010-12-01

    Studies examining the effect of soy protein on cardiovascular disease (CVD) risk factors have not taken advantage of the postprandial state as an adjunct to the fasting lipid profile. The American Heart Association has acknowledged the efficacy of soy protein in reducing CVD risk factors to be limited. We hypothesized that the postprandial state would be more sensitive to any favorable changes associated with consuming soy protein compared with the fasting lipid profile. Furthermore, the presence of isoflavones in soy would enhance this effect. Thirty sedentary males aged 18-30 years were randomly assigned to milk protein (Milk), isoflavone-poor soy (Soy-), or isoflavone-rich soy (Soy+). Usual diets were supplemented with 25 g/day of protein for 28 days. Serum samples were collected before and after supplementation in a fasted state and postprandially at 30, 60, 120, 240, and 360 min after a high-fat, 1,000 kcal shake. Triacylglycerol (TAG), total cholesterol, non-esterified fatty acids, apolipoproteins B-100 and A-I and glucose concentrations were quantified. Fasting concentrations were not different after any protein supplementation. Postprandial TAG and TAG AUC increased after Soy-consumption supporting the postprandial state as a more sensitive indicator of soy ingestion effects on CVD risk factors compared with the fasting lipid profile. Furthermore, the absence of isoflavones in soy protein may have deleterious consequences on purported cardio-protective effects.

  9. Building alternate protein structures using the elastic network model.

    Science.gov (United States)

    Yang, Qingyi; Sharp, Kim A

    2009-02-15

    We describe a method for efficiently generating ensembles of alternate, all-atom protein structures that (a) differ significantly from the starting structure, (b) have good stereochemistry (bonded geometry), and (c) have good steric properties (absence of atomic overlap). The method uses reconstruction from a series of backbone framework structures that are obtained from a modified elastic network model (ENM) by perturbation along low-frequency normal modes. To ensure good quality backbone frameworks, the single force parameter ENM is modified by introducing two more force parameters to characterize the interaction between the consecutive carbon alphas and those within the same secondary structure domain. The relative stiffness of the three parameters is parameterized to reproduce B-factors, while maintaining good bonded geometry. After parameterization, violations of experimental Calpha-Calpha distances and Calpha-Calpha-Calpha pseudo angles along the backbone are reduced to less than 1%. Simultaneously, the average B-factor correlation coefficient improves to R = 0.77. Two applications illustrate the potential of the approach. (1) 102,051 protein backbones spanning a conformational space of 15 A root mean square deviation were generated from 148 nonredundant proteins in the PDB database, and all-atom models with minimal bonded and nonbonded violations were produced from this ensemble of backbone structures using the SCWRL side chain building program. (2) Improved backbone templates for homology modeling. Fifteen query sequences were each modeled on two targets. For each of the 30 target frameworks, dozens of improved templates could be produced In all cases, improved full atom homology models resulted, of which 50% could be identified blind using the D-Fire statistical potential. (c) 2008 Wiley-Liss, Inc.

  10. Comparative proteomic analysis of differentially expressed proteins in the urine of reservoir hosts of leptospirosis.

    Directory of Open Access Journals (Sweden)

    Jarlath E Nally

    Full Text Available Rattus norvegicus is a natural reservoir host for pathogenic species of Leptospira. Experimentally infected rats remain clinically normal, yet persistently excrete large numbers of leptospires from colonized renal tubules via urine, despite a specific host immune response. Whilst persistent renal colonization and shedding is facilitated in part by differential antigen expression by leptospires to evade host immune responses, there is limited understanding of kidney and urinary proteins expressed by the host that facilitates such biological equilibrium. Urine pellets were collected from experimentally infected rats shedding leptospires and compared to urine from non-infected controls spiked with in vitro cultivated leptospires for analysis by 2-D DIGE. Differentially expressed host proteins include membrane metallo endopeptidase, napsin A aspartic peptidase, vacuolar H+ATPase, kidney aminopeptidase and immunoglobulin G and A. Loa22, a virulence factor of Leptospira, as well as the GroEL, were increased in leptospires excreted in urine compared to in vitro cultivated leptospires. Urinary IgG from infected rats was specific for leptospires. Results confirm differential protein expression by both host and pathogen during chronic disease and include markers of kidney function and immunoglobulin which are potential biomarkers of infection.

  11. Refining comparative proteomics by spectral counting to account for shared peptides and multiple search engines.

    Science.gov (United States)

    Chen, Yao-Yi; Dasari, Surendra; Ma, Ze-Qiang; Vega-Montoto, Lorenzo J; Li, Ming; Tabb, David L

    2012-09-01

    Spectral counting has become a widely used approach for measuring and comparing protein abundance in label-free shotgun proteomics. However, when analyzing complex samples, the ambiguity of matching between peptides and proteins greatly affects the assessment of peptide and protein inventories, differentiation, and quantification. Meanwhile, the configuration of database searching algorithms that assign peptides to MS/MS spectra may produce different results in comparative proteomic analysis. Here, we present three strategies to improve comparative proteomics through spectral counting. We show that comparing spectral counts for peptide groups rather than for protein groups forestalls problems introduced by shared peptides. We demonstrate the advantage and flexibility of this new method in two datasets. We present four models to combine four popular search engines that lead to significant gains in spectral counting differentiation. Among these models, we demonstrate a powerful vote counting model that scales well for multiple search engines. We also show that semi-tryptic searching outperforms tryptic searching for comparative proteomics. Overall, these techniques considerably improve protein differentiation on the basis of spectral count tables.

  12. Comparative Study of Bancruptcy Prediction Models

    Directory of Open Access Journals (Sweden)

    Isye Arieshanti

    2013-09-01

    Full Text Available Early indication of bancruptcy is important for a company. If companies aware of  potency of their bancruptcy, they can take a preventive action to anticipate the bancruptcy. In order to detect the potency of a bancruptcy, a company can utilize a a model of bancruptcy prediction. The prediction model can be built using a machine learning methods. However, the choice of machine learning methods should be performed carefully. Because the suitability of a model depends on the problem specifically. Therefore, in this paper we perform a comparative study of several machine leaning methods for bancruptcy prediction. According to the comparative study, the performance of several models that based on machine learning methods (k-NN, fuzzy k-NN, SVM, Bagging Nearest Neighbour SVM, Multilayer Perceptron(MLP, Hybrid of MLP + Multiple Linear Regression, it can be showed that fuzzy k-NN method achieve the best performance with accuracy 77.5%

  13. PocketMatch: A new algorithm to compare binding sites in protein structures

    Directory of Open Access Journals (Sweden)

    Chandra Nagasuma

    2008-12-01

    Full Text Available Abstract Background Recognizing similarities and deriving relationships among protein molecules is a fundamental requirement in present-day biology. Similarities can be present at various levels which can be detected through comparison of protein sequences or their structural folds. In some cases similarities obscure at these levels could be present merely in the substructures at their binding sites. Inferring functional similarities between protein molecules by comparing their binding sites is still largely exploratory and not as yet a routine protocol. One of the main reasons for this is the limitation in the choice of appropriate analytical tools that can compare binding sites with high sensitivity. To benefit from the enormous amount of structural data that is being rapidly accumulated, it is essential to have high throughput tools that enable large scale binding site comparison. Results Here we present a new algorithm PocketMatch for comparison of binding sites in a frame invariant manner. Each binding site is represented by 90 lists of sorted distances capturing shape and chemical nature of the site. The sorted arrays are then aligned using an incremental alignment method and scored to obtain PMScores for pairs of sites. A comprehensive sensitivity analysis and an extensive validation of the algorithm have been carried out. A comparison with other site matching algorithms is also presented. Perturbation studies where the geometry of a given site was retained but the residue types were changed randomly, indicated that chance similarities were virtually non-existent. Our analysis also demonstrates that shape information alone is insufficient to discriminate between diverse binding sites, unless combined with chemical nature of amino acids. Conclusion A new algorithm has been developed to compare binding sites in accurate, efficient and high-throughput manner. Though the representation used is conceptually simplistic, we demonstrate that

  14. Comparative Investigation of Normal Modes and Molecular Dynamics of Hepatitis C NS5B Protein

    International Nuclear Information System (INIS)

    Asafi, M S; Tekpinar, M; Yildirim, A

    2016-01-01

    Understanding dynamics of proteins has many practical implications in terms of finding a cure for many protein related diseases. Normal mode analysis and molecular dynamics methods are widely used physics-based computational methods for investigating dynamics of proteins. In this work, we studied dynamics of Hepatitis C NS5B protein with molecular dynamics and normal mode analysis. Principal components obtained from a 100 nanoseconds molecular dynamics simulation show good overlaps with normal modes calculated with a coarse-grained elastic network model. Coarse-grained normal mode analysis takes at least an order of magnitude shorter time. Encouraged by this good overlaps and short computation times, we analyzed further low frequency normal modes of Hepatitis C NS5B. Motion directions and average spatial fluctuations have been analyzed in detail. Finally, biological implications of these motions in drug design efforts against Hepatitis C infections have been elaborated. (paper)

  15. Application of model bread baking in the examination of arabinoxylan-protein complexes in rye bread.

    Science.gov (United States)

    Buksa, Krzysztof

    2016-09-05

    The changes in molecular mass of arabinoxylan (AX) and protein caused by bread baking process were examined using a model rye bread. Instead of the normal flour, the dough contained starch, water-extractable AX and protein which were isolated from rye wholemeal. From the crumb of selected model breads, starch was removed releasing AX-protein complexes, which were further examined by size exclusion chromatography. On the basis of the research, it was concluded that optimum model mix can be composed of 3-6% AX and 3-6% rye protein isolate at 94-88% of rye starch meaning with the most similar properties to low extraction rye flour. Application of model rye bread allowed to examine the interactions between AX and proteins. Bread baked with a share of AX, rye protein and starch, from which the complexes of the highest molar mass were isolated, was characterized by the strongest structure of the bread crumb. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. Homology modeling and protein engineering of alkane monooxygenase in Burkholderia thailandensis MSMB121: in silico insights.

    Science.gov (United States)

    Jain, Chakresh Kumar; Gupta, Money; Prasad, Yamuna; Wadhwa, Gulshan; Sharma, Sanjeev Kumar

    2014-07-01

    The degradation of hydrocarbons plays an important role in the eco-balancing of petroleum products, pesticides and other toxic products in the environment. The degradation of hydrocarbons by microbes such as Geobacillus thermodenitrificans, Burkhulderia, Gordonia sp. and Acinetobacter sp. has been studied intensively in the literature. The present study focused on the in silico protein engineering of alkane monooxygenase (ladA)-a protein involved in the alkane degradation pathway. We demonstrated the improvement in substrate binding energy with engineered ladA in Burkholderia thailandensis MSMB121. We identified an ortholog of ladA monooxygenase found in B. thailandensis MSMB121, and showed it to be an enzyme involved in an alkane degradation pathway studied extensively in Geobacillus thermodenitrificans. Homology modeling of the three-dimensional structure of ladA was performed with a crystal structure (protein databank ID: 3B9N) as a template in MODELLER 9v11, and further validated using PROCHECK, VERIFY-3D and WHATIF tools. Specific amino acids were substituted in the region corresponding to amino acids 305-370 of ladA protein, resulting in an enhancement of binding energy in different alkane chain molecules as compared to wild protein structures in the docking experiments. The substrate binding energy with the protein was calculated using Vina (Implemented in VEGAZZ). Molecular dynamics simulations were performed to study the dynamics of different alkane chain molecules inside the binding pockets of wild and mutated ladA. Here, we hypothesize an improvement in binding energies and accessibility of substrates towards engineered ladA enzyme, which could be further facilitated for wet laboratory-based experiments for validation of the alkane degradation pathway in this organism.

  17. Proteomic Identification of Altered Cerebral Proteins in the Complex Regional Pain Syndrome Animal Model

    Directory of Open Access Journals (Sweden)

    Francis Sahngun Nahm

    2014-01-01

    Full Text Available Background. Complex regional pain syndrome (CRPS is a rare but debilitating pain disorder. Although the exact pathophysiology of CRPS is not fully understood, central and peripheral mechanisms might be involved in the development of this disorder. To reveal the central mechanism of CRPS, we conducted a proteomic analysis of rat cerebrum using the chronic postischemia pain (CPIP model, a novel experimental model of CRPS. Materials and Methods. After generating the CPIP animal model, we performed a proteomic analysis of the rat cerebrum using a multidimensional protein identification technology, and screened the proteins differentially expressed between the CPIP and control groups. Results. A total of 155 proteins were differentially expressed between the CPIP and control groups: 125 increased and 30 decreased; expressions of proteins related to cell signaling, synaptic plasticity, regulation of cell proliferation, and cytoskeletal formation were increased in the CPIP group. However, proenkephalin A, cereblon, and neuroserpin were decreased in CPIP group. Conclusion. Altered expression of cerebral proteins in the CPIP model indicates cerebral involvement in the pathogenesis of CRPS. Further study is required to elucidate the roles of these proteins in the development and maintenance of CRPS.

  18. Proteomic identification of altered cerebral proteins in the complex regional pain syndrome animal model.

    Science.gov (United States)

    Nahm, Francis Sahngun; Park, Zee-Yong; Nahm, Sang-Soep; Kim, Yong Chul; Lee, Pyung Bok

    2014-01-01

    Complex regional pain syndrome (CRPS) is a rare but debilitating pain disorder. Although the exact pathophysiology of CRPS is not fully understood, central and peripheral mechanisms might be involved in the development of this disorder. To reveal the central mechanism of CRPS, we conducted a proteomic analysis of rat cerebrum using the chronic postischemia pain (CPIP) model, a novel experimental model of CRPS. After generating the CPIP animal model, we performed a proteomic analysis of the rat cerebrum using a multidimensional protein identification technology, and screened the proteins differentially expressed between the CPIP and control groups. Results. A total of 155 proteins were differentially expressed between the CPIP and control groups: 125 increased and 30 decreased; expressions of proteins related to cell signaling, synaptic plasticity, regulation of cell proliferation, and cytoskeletal formation were increased in the CPIP group. However, proenkephalin A, cereblon, and neuroserpin were decreased in CPIP group. Altered expression of cerebral proteins in the CPIP model indicates cerebral involvement in the pathogenesis of CRPS. Further study is required to elucidate the roles of these proteins in the development and maintenance of CRPS.

  19. COMPARATIVE ANALYSIS OF SOFTWARE DEVELOPMENT MODELS

    OpenAIRE

    Sandeep Kaur*

    2017-01-01

    No geek is unfamiliar with the concept of software development life cycle (SDLC). This research deals with the various SDLC models covering waterfall, spiral, and iterative, agile, V-shaped, prototype model. In the modern era, all the software systems are fallible as they can’t stand with certainty. So, it is tried to compare all aspects of the various models, their pros and cons so that it could be easy to choose a particular model at the time of need

  20. Protein identification and quantification from riverbank grape, Vitis riparia: Comparing SDS-PAGE and FASP-GPF techniques for shotgun proteomic analysis.

    Science.gov (United States)

    George, Iniga S; Fennell, Anne Y; Haynes, Paul A

    2015-09-01

    Protein sample preparation optimisation is critical for establishing reproducible high throughput proteomic analysis. In this study, two different fractionation sample preparation techniques (in-gel digestion and in-solution digestion) for shotgun proteomics were used to quantitatively compare proteins identified in Vitis riparia leaf samples. The total number of proteins and peptides identified were compared between filter aided sample preparation (FASP) coupled with gas phase fractionation (GPF) and SDS-PAGE methods. There was a 24% increase in the total number of reproducibly identified proteins when FASP-GPF was used. FASP-GPF is more reproducible, less expensive and a better method than SDS-PAGE for shotgun proteomics of grapevine samples as it significantly increases protein identification across biological replicates. Total peptide and protein information from the two fractionation techniques is available in PRIDE with the identifier PXD001399 (http://proteomecentral.proteomexchange.org/dataset/PXD001399). © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  1. Protein supplements after weight loss do not improve weight maintenance compared with recommended dietary protein intake despite beneficial effects on appetite sensation and energy expenditure

    DEFF Research Database (Denmark)

    Kjølbæk, Louise; Sørensen, Lone Brinkmann; Søndertoft, Nadja Buus

    2017-01-01

    Background: High-protein diets increase weight loss (WL) during energy restriction; therefore, it has been suggested that additional protein intake may improve weight maintenance (WM) after WL.Objective: We investigated the effect of protein supplements from either whey with or without calcium...... were performed to investigate diet-induced-thermogenesis (DIT) and appetite sensation. Compliance was tested by 24-h urinary nitrogen excretion.Results: A total of 151 participants completed the WM period. The control and 3 protein supplements did not result in different mean ± SD weight regains (whey.......58 ± 1.4 kg; and control: 1.74 ± 1.4 kg; P = 0.50) during WM. Changes in blood pressure and blood biochemistry were not different between groups. Compared with the control, protein supplementation resulted in higher DIT (∼30 kJ/2.5 h) and resting energy expenditure (243 kJ/d) and an anorexigenic appetite...

  2. Protein Simulation Data in the Relational Model.

    Science.gov (United States)

    Simms, Andrew M; Daggett, Valerie

    2012-10-01

    High performance computing is leading to unprecedented volumes of data. Relational databases offer a robust and scalable model for storing and analyzing scientific data. However, these features do not come without a cost-significant design effort is required to build a functional and efficient repository. Modeling protein simulation data in a relational database presents several challenges: the data captured from individual simulations are large, multi-dimensional, and must integrate with both simulation software and external data sites. Here we present the dimensional design and relational implementation of a comprehensive data warehouse for storing and analyzing molecular dynamics simulations using SQL Server.

  3. Comparative immunoblot analysis with 10 different, partially overlapping recombinant fusion proteins derived from 5 different cytomegalovirus proteins

    NARCIS (Netherlands)

    van Zanten, J.; LAZZAROTTO, T; CAMPISI, B; VORNHAGEN, R; JAHN, G; LANDINI, MP; The, T. Hauw

    Ten fusion proteins derived from five various CMV encoded proteins were used for the detection of specific antibody response by immunoblot technique in sera from renal transplant recipients. The fusion proteins were derived from the following CMV specific proteins: the assembly protein ppUL80a with

  4. SNP2Structure: A Public and Versatile Resource for Mapping and Three-Dimensional Modeling of Missense SNPs on Human Protein Structures

    Directory of Open Access Journals (Sweden)

    Difei Wang

    2015-01-01

    Full Text Available One of the long-standing challenges in biology is to understand how non-synonymous single nucleotide polymorphisms (nsSNPs change protein structure and further affect their function. While it is impractical to solve all the mutated protein structures experimentally, it is quite feasible to model the mutated structures in silico. Toward this goal, we built a publicly available structure database resource (SNP2Structure, https://apps.icbi.georgetown.edu/snp2structure focusing on missense mutations, msSNP. Compared with web portals with similar aims, SNP2Structure has the following major advantages. First, our portal offers direct comparison of two related 3D structures. Second, the protein models include all interacting molecules in the original PDB structures, so users are able to determine regions of potential interaction changes when a protein mutation occurs. Third, the mutated structures are available to download locally for further structural and functional analysis. Fourth, we used Jsmol package to display the protein structure that has no system compatibility issue. SNP2Structure provides reliable, high quality mapping of nsSNPs to 3D protein structures enabling researchers to explore the likely functional impact of human disease-causing mutations.

  5. Structure Based Thermostability Prediction Models for Protein Single Point Mutations with Machine Learning Tools.

    Directory of Open Access Journals (Sweden)

    Lei Jia

    Full Text Available Thermostability issue of protein point mutations is a common occurrence in protein engineering. An application which predicts the thermostability of mutants can be helpful for guiding decision making process in protein design via mutagenesis. An in silico point mutation scanning method is frequently used to find "hot spots" in proteins for focused mutagenesis. ProTherm (http://gibk26.bio.kyutech.ac.jp/jouhou/Protherm/protherm.html is a public database that consists of thousands of protein mutants' experimentally measured thermostability. Two data sets based on two differently measured thermostability properties of protein single point mutations, namely the unfolding free energy change (ddG and melting temperature change (dTm were obtained from this database. Folding free energy change calculation from Rosetta, structural information of the point mutations as well as amino acid physical properties were obtained for building thermostability prediction models with informatics modeling tools. Five supervised machine learning methods (support vector machine, random forests, artificial neural network, naïve Bayes classifier, K nearest neighbor and partial least squares regression are used for building the prediction models. Binary and ternary classifications as well as regression models were built and evaluated. Data set redundancy and balancing, the reverse mutations technique, feature selection, and comparison to other published methods were discussed. Rosetta calculated folding free energy change ranked as the most influential features in all prediction models. Other descriptors also made significant contributions to increasing the accuracy of the prediction models.

  6. Illustrating and homology modeling the proteins of the Zika virus [version 2; referees: 2 approved

    Directory of Open Access Journals (Sweden)

    Sean Ekins

    2016-09-01

    Full Text Available The Zika virus (ZIKV is a flavivirus of the family Flaviviridae, which is similar to dengue virus, yellow fever and West Nile virus. Recent outbreaks in South America, Latin America, the Caribbean and in particular Brazil have led to concern for the spread of the disease and potential to cause Guillain-Barré syndrome and microcephaly. Although ZIKV has been known of for over 60 years there is very little in the way of knowledge of the virus with few publications and no crystal structures. No antivirals have been tested against it either in vitro or in vivo. ZIKV therefore epitomizes a neglected disease. Several suggested steps have been proposed which could be taken to initiate ZIKV antiviral drug discovery using both high throughput screens as well as structure-based design based on homology models for the key proteins. We now describe preliminary homology models created for NS5, FtsJ, NS4B, NS4A, HELICc, DEXDc, peptidase S7, NS2B, NS2A, NS1, E stem, glycoprotein M, propeptide, capsid and glycoprotein E using SWISS-MODEL. Eleven out of 15 models pass our model quality criteria for their further use. While a ZIKV glycoprotein E homology model was initially described in the immature conformation as a trimer, we now describe the mature dimer conformer which allowed the construction of an illustration of the complete virion. By comparing illustrations of ZIKV based on this new homology model and the dengue virus crystal structure we propose potential differences that could be exploited for antiviral and vaccine design. The prediction of sites for glycosylation on this protein may also be useful in this regard. While we await a cryo-EM structure of ZIKV and eventual crystal structures of the individual proteins, these homology models provide the community with a starting point for structure-based design of drugs and vaccines as well as a for computational virtual screening.

  7. Comparative genomic analyses of transport proteins encoded within the genomes of Leptospira species.

    Science.gov (United States)

    Buyuktimkin, Bora; Saier, Milton H

    2015-11-01

    Select species of the bacterial genus Leptospira are causative agents of leptospirosis, an emerging global zoonosis affecting nearly one million people worldwide annually. We examined two Leptospira pathogens, Leptospira interrogans serovar Lai str. 56601 and Leptospira borgpetersenii serovar Hardjo-bovis str. L550, as well as the free-living leptospiral saprophyte, Leptospira biflexa serovar Patoc str. 'Patoc 1 (Ames)'. The transport proteins of these leptospires were identified and compared using bioinformatics to gain an appreciation for which proteins may be related to pathogenesis and saprophytism. L. biflexa possesses a disproportionately high number of secondary carriers for metabolite uptake and environmental adaptability as well as an increased number of inorganic cation transporters providing ionic homeostasis and effective osmoregulation in a rapidly changing environment. L. interrogans and L. borgpetersenii possess far fewer transporters, but those that they have are remarkably similar, with near-equivalent representation in most transporter families. These two Leptospira pathogens also possess intact sphingomyelinases, holins, and virulence-related outer membrane porins. These virulence-related factors, in conjunction with decreased transporter substrate versatility, indicate that pathogenicity was accompanied by progressively narrowing ecological niches and the emergence of a limited set of proteins responsible for host invasion. The variability of host tropism and mortality rates by infectious leptospires suggests that small differences in individual sets of proteins play important physiological and pathological roles. Copyright © 2015 Elsevier Ltd. All rights reserved.

  8. Virtual Screening of M3 Protein Antagonists for Finding a Model to Study the Gammaherpesvirus Damaged Immune System and Chemokine Related Diseases

    Directory of Open Access Journals (Sweden)

    Ibrahim Torktaz

    2013-12-01

    Full Text Available Introduction: M3 protein is a chemokine decoy receptor involved in pathogenesis of persistent infection with gammaherpesvirus and complications related to the latency of this pathogen. We proposed that antagonists of the M3 would provide a unique opportunity for studying new therapeutic strategies in disordered immune system, immune-deficient states and role of chemokines in pathogenesis development. Methods: Comparative modeling and fold recognition algorithms have been used for prediction of M3 protein 3-D model. Evaluation of the models using Q-mean and ProSA-web score, has led to choosing predicted model by fold recognition algorithm as the best model which was minimized regarding energy level using Molegro Virtual Docker 2011.4.3.0 (MVD software. Pockets and active sites of model were recognized using MVD cavity detection, and MetaPocket algorithms. Ten thousand compounds accessible on KEGG database were screened; MVD was used for computer simulated docking study; MolDock SE was selected as docking scoring function and final results were evaluated based on MolDock and Re-rank score. Results: Docking data suggested that prilocaine, which is generally applied as a topical anesthetic, binds strongly to 3-D model of M3 protein. Conclusion: This study proposes that prilocaine is a potential inhibitor of M3 protein and possibly has immune enhancing properties.

  9. Can Natural Proteins Designed with ‘Inverted’ Peptide Sequences Adopt Native-Like Protein Folds?

    Science.gov (United States)

    Sridhar, Settu; Guruprasad, Kunchur

    2014-01-01

    We have carried out a systematic computational analysis on a representative dataset of proteins of known three-dimensional structure, in order to evaluate whether it would possible to ‘swap’ certain short peptide sequences in naturally occurring proteins with their corresponding ‘inverted’ peptides and generate ‘artificial’ proteins that are predicted to retain native-like protein fold. The analysis of 3,967 representative proteins from the Protein Data Bank revealed 102,677 unique identical inverted peptide sequence pairs that vary in sequence length between 5–12 and 18 amino acid residues. Our analysis illustrates with examples that such ‘artificial’ proteins may be generated by identifying peptides with ‘similar structural environment’ and by using comparative protein modeling and validation studies. Our analysis suggests that natural proteins may be tolerant to accommodating such peptides. PMID:25210740

  10. Improved contact prediction in proteins: Using pseudolikelihoods to infer Potts models

    Science.gov (United States)

    Ekeberg, Magnus; Lövkvist, Cecilia; Lan, Yueheng; Weigt, Martin; Aurell, Erik

    2013-01-01

    Spatially proximate amino acids in a protein tend to coevolve. A protein's three-dimensional (3D) structure hence leaves an echo of correlations in the evolutionary record. Reverse engineering 3D structures from such correlations is an open problem in structural biology, pursued with increasing vigor as more and more protein sequences continue to fill the data banks. Within this task lies a statistical inference problem, rooted in the following: correlation between two sites in a protein sequence can arise from firsthand interaction but can also be network-propagated via intermediate sites; observed correlation is not enough to guarantee proximity. To separate direct from indirect interactions is an instance of the general problem of inverse statistical mechanics, where the task is to learn model parameters (fields, couplings) from observables (magnetizations, correlations, samples) in large systems. In the context of protein sequences, the approach has been referred to as direct-coupling analysis. Here we show that the pseudolikelihood method, applied to 21-state Potts models describing the statistical properties of families of evolutionarily related proteins, significantly outperforms existing approaches to the direct-coupling analysis, the latter being based on standard mean-field techniques. This improved performance also relies on a modified score for the coupling strength. The results are verified using known crystal structures of specific sequence instances of various protein families. Code implementing the new method can be found at http://plmdca.csc.kth.se/.

  11. Nucleation phenomena in protein folding: the modulating role of protein sequence

    International Nuclear Information System (INIS)

    Travasso, Rui D M; FaIsca, Patricia F N; Gama, Margarida M Telo da

    2007-01-01

    For the vast majority of naturally occurring, small, single-domain proteins, folding is often described as a two-state process that lacks detectable intermediates. This observation has often been rationalized on the basis of a nucleation mechanism for protein folding whose basic premise is the idea that, after completion of a specific set of contacts forming the so-called folding nucleus, the native state is achieved promptly. Here we propose a methodology to identify folding nuclei in small lattice polymers and apply it to the study of protein molecules with a chain length of N = 48. To investigate the extent to which protein topology is a robust determinant of the nucleation mechanism, we compare the nucleation scenario of a native-centric model with that of a sequence-specific model sharing the same native fold. To evaluate the impact of the sequence's finer details in the nucleation mechanism, we consider the folding of two non-homologous sequences. We conclude that, in a sequence-specific model, the folding nucleus is, to some extent, formed by the most stable contacts in the protein and that the less stable linkages in the folding nucleus are solely determined by the fold's topology. We have also found that, independently of the protein sequence, the folding nucleus performs the same 'topological' function. This unifying feature of the nucleation mechanism results from the residues forming the folding nucleus being distributed along the protein chain in a similar and well-defined manner that is determined by the fold's topological features

  12. The ModFOLD4 server for the quality assessment of 3D protein models

    OpenAIRE

    McGuffin, Liam J.; Buenavista, Maria T.; Roche, Daniel B.

    2013-01-01

    Once you have generated a 3D model of a protein,\\ud how do you know whether it bears any resemblance\\ud to the actual structure? To determine the usefulness\\ud of 3D models of proteins, they must be assessed in\\ud terms of their quality by methods that predict their\\ud similarity to the native structure. The ModFOLD4\\ud server is the latest version of our leading independent\\ud server for the estimation of both the global and\\ud local (per-residue) quality of 3D protein models. The\\ud server ...

  13. Overexpression of retinal degeneration slow (RDS protein adversely affects rods in the rd7 model of enhanced S-cone syndrome.

    Directory of Open Access Journals (Sweden)

    Dibyendu Chakraborty

    Full Text Available The nuclear receptor NR2E3 promotes expression of rod photoreceptor genes while repressing cone genes. Mice lacking NR2E3 (Nr2e3(rd7/rd7 referred to here as rd7 are a model for enhanced S-cone syndrome, a disease associated with increased sensitivity to blue light and night blindness. Rd7 retinas have reduced levels of the outer segment (OS structural protein retinal degeneration slow (RDS. We test the hypothesis that increasing RDS levels would improve the Rd7 phenotype. Transgenic mice over-expressing normal mouse peripherin/RDS (NMP in rods and cones were crossed onto the rd7 background. Disease phenotypes were assessed in NMP/rd7 eyes and compared to wild-type (WT and rd7 eyes at postnatal day 30. NMP/rd7 retinas expressed total RDS (transgenic and endogenous message at WT levels, and NMP protein was correctly localized to the OS. NMP/rd7 retinas have shorter OSs compared to rd7 and WT and significantly reduced number of rosettes. NMP/rd7 mice also exhibited significant deficits in scotopic ERG amplitudes compared to rd7 while photopic amplitudes remained unaffected. Protein levels of rhodopsin, RDS, and the RDS homologue ROM-1 were significantly reduced in the NMP/rd7 retinas compared to rd7. We show that correcting the levels of RDS gene expression does not improve the phenotype of the rd7 suggesting that RDS deficiency is not responsible for the defect in this model. We suggest that the specific rod defect in the NMP/rd7 is likely associated with ongoing problems in the rd7 that are related to the expression of cone genes in rod cells, a characteristic of the model.

  14. Exact protein distributions for stochastic models of gene expression using partitioning of Poisson processes.

    Science.gov (United States)

    Pendar, Hodjat; Platini, Thierry; Kulkarni, Rahul V

    2013-04-01

    Stochasticity in gene expression gives rise to fluctuations in protein levels across a population of genetically identical cells. Such fluctuations can lead to phenotypic variation in clonal populations; hence, there is considerable interest in quantifying noise in gene expression using stochastic models. However, obtaining exact analytical results for protein distributions has been an intractable task for all but the simplest models. Here, we invoke the partitioning property of Poisson processes to develop a mapping that significantly simplifies the analysis of stochastic models of gene expression. The mapping leads to exact protein distributions using results for mRNA distributions in models with promoter-based regulation. Using this approach, we derive exact analytical results for steady-state and time-dependent distributions for the basic two-stage model of gene expression. Furthermore, we show how the mapping leads to exact protein distributions for extensions of the basic model that include the effects of posttranscriptional and posttranslational regulation. The approach developed in this work is widely applicable and can contribute to a quantitative understanding of stochasticity in gene expression and its regulation.

  15. Exact protein distributions for stochastic models of gene expression using partitioning of Poisson processes

    Science.gov (United States)

    Pendar, Hodjat; Platini, Thierry; Kulkarni, Rahul V.

    2013-04-01

    Stochasticity in gene expression gives rise to fluctuations in protein levels across a population of genetically identical cells. Such fluctuations can lead to phenotypic variation in clonal populations; hence, there is considerable interest in quantifying noise in gene expression using stochastic models. However, obtaining exact analytical results for protein distributions has been an intractable task for all but the simplest models. Here, we invoke the partitioning property of Poisson processes to develop a mapping that significantly simplifies the analysis of stochastic models of gene expression. The mapping leads to exact protein distributions using results for mRNA distributions in models with promoter-based regulation. Using this approach, we derive exact analytical results for steady-state and time-dependent distributions for the basic two-stage model of gene expression. Furthermore, we show how the mapping leads to exact protein distributions for extensions of the basic model that include the effects of posttranscriptional and posttranslational regulation. The approach developed in this work is widely applicable and can contribute to a quantitative understanding of stochasticity in gene expression and its regulation.

  16. A 3D model of the membrane protein complex formed by the white spot syndrome virus structural proteins.

    Directory of Open Access Journals (Sweden)

    Yun-Shiang Chang

    Full Text Available BACKGROUND: Outbreaks of white spot disease have had a large negative economic impact on cultured shrimp worldwide. However, the pathogenesis of the causative virus, WSSV (whit spot syndrome virus, is not yet well understood. WSSV is a large enveloped virus. The WSSV virion has three structural layers surrounding its core DNA: an outer envelope, a tegument and a nucleocapsid. In this study, we investigated the protein-protein interactions of the major WSSV structural proteins, including several envelope and tegument proteins that are known to be involved in the infection process. PRINCIPAL FINDINGS: In the present report, we used coimmunoprecipitation and yeast two-hybrid assays to elucidate and/or confirm all the interactions that occur among the WSSV structural (envelope and tegument proteins VP51A, VP19, VP24, VP26 and VP28. We found that VP51A interacted directly not only with VP26 but also with VP19 and VP24. VP51A, VP19 and VP24 were also shown to have an affinity for self-interaction. Chemical cross-linking assays showed that these three self-interacting proteins could occur as dimers. CONCLUSIONS: From our present results in conjunction with other previously established interactions we construct a 3D model in which VP24 acts as a core protein that directly associates with VP26, VP28, VP38A, VP51A and WSV010 to form a membrane-associated protein complex. VP19 and VP37 are attached to this complex via association with VP51A and VP28, respectively. Through the VP26-VP51C interaction this envelope complex is anchored to the nucleocapsid, which is made of layers of rings formed by VP664. A 3D model of the nucleocapsid and the surrounding outer membrane is presented.

  17. Protein carbonylation, protein aggregation and neuronal cell death in a murine model of multiple sclerosis

    Science.gov (United States)

    Dasgupta, Anushka

    Many studies have suggested that oxidative stress plays an important role in the pathophysiology of both multiple sclerosis (MS) and its animal model experimental autoimmune encephalomyelitis (EAE). Yet, the mechanism by which oxidative stress leads to tissue damage in these disorders is unclear. Recent work from our laboratory has revealed that protein carbonylation, a major oxidative modification caused by severe and/or chronic oxidative stress conditions, is elevated in MS and EAE. Furthermore, protein carbonylation has been shown to alter protein structure leading to misfolding/aggregation. These findings prompted me to hypothesize that carbonylated proteins, formed as a consequence of oxidative stress and/or decreased proteasomal activity, promote protein aggregation to mediate neuronal apoptosis in vitro and in EAE. To test this novel hypothesis, I first characterized protein carbonylation, protein aggregation and apoptosis along the spinal cord during the course of myelin-oligodendrocyte glycoprotein (MOG)35-55 peptide-induced EAE in C57BL/6 mice [Chapter 2]. The results show that carbonylated proteins accumulate throughout the course of the disease, albeit by different mechanisms: increased oxidative stress in acute EAE and decreased proteasomal activity in chronic EAE. I discovered not only that there is a temporal correlation between protein carbonylation and apoptosis but also that carbonyl levels are significantly higher in apoptotic cells. A high number of juxta-nuclear and cytoplasmic protein aggregates containing the majority of the oxidized proteins are also present during the course of EAE, which seems to be due to reduced autophagy. In chapter 3, I show that when gluthathione levels are reduced to those in EAE spinal cord, both neuron-like PC12 (nPC12) cells and primary neuronal cultures accumulate carbonylated proteins and undergo cell death (both by necrosis and apoptosis). Immunocytochemical and biochemical studies also revealed a temporal

  18. Bayesian network model for identification of pathways by integrating protein interaction with genetic interaction data.

    Science.gov (United States)

    Fu, Changhe; Deng, Su; Jin, Guangxu; Wang, Xinxin; Yu, Zu-Guo

    2017-09-21

    Molecular interaction data at proteomic and genetic levels provide physical and functional insights into a molecular biosystem and are helpful for the construction of pathway structures complementarily. Despite advances in inferring biological pathways using genetic interaction data, there still exists weakness in developed models, such as, activity pathway networks (APN), when integrating the data from proteomic and genetic levels. It is necessary to develop new methods to infer pathway structure by both of interaction data. We utilized probabilistic graphical model to develop a new method that integrates genetic interaction and protein interaction data and infers exquisitely detailed pathway structure. We modeled the pathway network as Bayesian network and applied this model to infer pathways for the coherent subsets of the global genetic interaction profiles, and the available data set of endoplasmic reticulum genes. The protein interaction data were derived from the BioGRID database. Our method can accurately reconstruct known cellular pathway structures, including SWR complex, ER-Associated Degradation (ERAD) pathway, N-Glycan biosynthesis pathway, Elongator complex, Retromer complex, and Urmylation pathway. By comparing N-Glycan biosynthesis pathway and Urmylation pathway identified from our approach with that from APN, we found that our method is able to overcome its weakness (certain edges are inexplicable). According to underlying protein interaction network, we defined a simple scoring function that only adopts genetic interaction information to avoid the balance difficulty in the APN. Using the effective stochastic simulation algorithm, the performance of our proposed method is significantly high. We developed a new method based on Bayesian network to infer detailed pathway structures from interaction data at proteomic and genetic levels. The results indicate that the developed method performs better in predicting signaling pathways than previously

  19. Lipoprotein(a) and dietary proteins: casein lowers lipoprotein(a) concentrations as compared with soy protein1-3

    DEFF Research Database (Denmark)

    Nilausen, Karin Johanne; Meinertz, H.

    1999-01-01

    Lipoprotein(a), plasma lipoproteins, dietary proteins, soy protein, casein, liquid-formula, coronary artery disease, men, Denmark......Lipoprotein(a), plasma lipoproteins, dietary proteins, soy protein, casein, liquid-formula, coronary artery disease, men, Denmark...

  20. Determining and comparing protein function in Bacterial genome sequences

    DEFF Research Database (Denmark)

    Vesth, Tammi Camilla

    of this class have very little homology to other known genomes making functional annotation based on sequence similarity very difficult. Inspired in part by this analysis, an approach for comparative functional annotation was created based public sequenced genomes, CMGfunc. Functionally related groups......In November 2013, there was around 21.000 different prokaryotic genomes sequenced and publicly available, and the number is growing daily with another 20.000 or more genomes expected to be sequenced and deposited by the end of 2014. An important part of the analysis of this data is the functional...... annotation of genes – the descriptions assigned to genes that describe the likely function of the encoded proteins. This process is limited by several factors, including the definition of a function which can be more or less specific as well as how many genes can actually be assigned a function based...

  1. A mathematical model for generating bipartite graphs and its application to protein networks

    International Nuclear Information System (INIS)

    Nacher, J C; Ochiai, T; Hayashida, M; Akutsu, T

    2009-01-01

    Complex systems arise in many different contexts from large communication systems and transportation infrastructures to molecular biology. Most of these systems can be organized into networks composed of nodes and interacting edges. Here, we present a theoretical model that constructs bipartite networks with the particular feature that the degree distribution can be tuned depending on the probability rate of fundamental processes. We then use this model to investigate protein-domain networks. A protein can be composed of up to hundreds of domains. Each domain represents a conserved sequence segment with specific functional tasks. We analyze the distribution of domains in Homo sapiens and Arabidopsis thaliana organisms and the statistical analysis shows that while (a) the number of domain types shared by k proteins exhibits a power-law distribution, (b) the number of proteins composed of k types of domains decays as an exponential distribution. The proposed mathematical model generates bipartite graphs and predicts the emergence of this mixing of (a) power-law and (b) exponential distributions. Our theoretical and computational results show that this model requires (1) growth process and (2) copy mechanism.

  2. A mathematical model for generating bipartite graphs and its application to protein networks

    Science.gov (United States)

    Nacher, J. C.; Ochiai, T.; Hayashida, M.; Akutsu, T.

    2009-12-01

    Complex systems arise in many different contexts from large communication systems and transportation infrastructures to molecular biology. Most of these systems can be organized into networks composed of nodes and interacting edges. Here, we present a theoretical model that constructs bipartite networks with the particular feature that the degree distribution can be tuned depending on the probability rate of fundamental processes. We then use this model to investigate protein-domain networks. A protein can be composed of up to hundreds of domains. Each domain represents a conserved sequence segment with specific functional tasks. We analyze the distribution of domains in Homo sapiens and Arabidopsis thaliana organisms and the statistical analysis shows that while (a) the number of domain types shared by k proteins exhibits a power-law distribution, (b) the number of proteins composed of k types of domains decays as an exponential distribution. The proposed mathematical model generates bipartite graphs and predicts the emergence of this mixing of (a) power-law and (b) exponential distributions. Our theoretical and computational results show that this model requires (1) growth process and (2) copy mechanism.

  3. A mathematical model for generating bipartite graphs and its application to protein networks

    Energy Technology Data Exchange (ETDEWEB)

    Nacher, J C [Department of Complex Systems, Future University-Hakodate (Japan); Ochiai, T [Faculty of Engineering, Toyama Prefectural University (Japan); Hayashida, M; Akutsu, T [Bioinformatics Center, Institute for Chemical Research, Kyoto University (Japan)

    2009-12-04

    Complex systems arise in many different contexts from large communication systems and transportation infrastructures to molecular biology. Most of these systems can be organized into networks composed of nodes and interacting edges. Here, we present a theoretical model that constructs bipartite networks with the particular feature that the degree distribution can be tuned depending on the probability rate of fundamental processes. We then use this model to investigate protein-domain networks. A protein can be composed of up to hundreds of domains. Each domain represents a conserved sequence segment with specific functional tasks. We analyze the distribution of domains in Homo sapiens and Arabidopsis thaliana organisms and the statistical analysis shows that while (a) the number of domain types shared by k proteins exhibits a power-law distribution, (b) the number of proteins composed of k types of domains decays as an exponential distribution. The proposed mathematical model generates bipartite graphs and predicts the emergence of this mixing of (a) power-law and (b) exponential distributions. Our theoretical and computational results show that this model requires (1) growth process and (2) copy mechanism.

  4. Formulation of probabilistic models of protein structure in atomic detail using the reference ratio method

    DEFF Research Database (Denmark)

    Valentin, Jan B.; Andreetta, Christian; Boomsma, Wouter

    2014-01-01

    We propose a method to formulate probabilistic models of protein structure in atomic detail, for a given amino acid sequence, based on Bayesian principles, while retaining a close link to physics. We start from two previously developed probabilistic models of protein structure on a local length s....... The results indicate that the proposed method and the probabilistic models show considerable promise for probabilistic protein structure prediction and related applications. © 2013 Wiley Periodicals, Inc....

  5. The unfolded protein response has a protective role in yeast models of classic galactosemia

    Directory of Open Access Journals (Sweden)

    Evandro A. De-Souza

    2014-01-01

    Full Text Available Classic galactosemia is a human autosomal recessive disorder caused by mutations in the GALT gene (GAL7 in yeast, which encodes the enzyme galactose-1-phosphate uridyltransferase. Here we show that the unfolded protein response pathway is triggered by galactose in two yeast models of galactosemia: lithium-treated cells and the gal7Δ mutant. The synthesis of galactose-1-phosphate is essential to trigger the unfolded protein response under these conditions because the deletion of the galactokinase-encoding gene GAL1 completely abolishes unfolded protein response activation and galactose toxicity. Impairment of the unfolded protein response in both yeast models makes cells even more sensitive to galactose, unmasking its cytotoxic effect. These results indicate that endoplasmic reticulum stress is induced under galactosemic conditions and underscores the importance of the unfolded protein response pathway to cellular adaptation in these models of classic galactosemia.

  6. ESI-MS studies of the reactions of novel platinum(II) complexes containing O,O'-chelated acetylacetonate and sulfur ligands with selected model proteins.

    Science.gov (United States)

    Marzo, Tiziano; De Pascali, Sandra A; Gabbiani, Chiara; Fanizzi, Francesco P; Messori, Luigi; Pratesi, Alessandro

    2017-08-01

    A group of mixed-ligand Pt(II) complexes bearing acetylacetonate and sulphur ligands were recently developed in the University of Lecce as a new class of prospective anticancer agents that manifested promising pharma-cological properties in preliminary in vitro and in vivo tests. Though modelled on the basis of cisplatin, these Pt(II) complexes turned out to exhibit a profoundly distinct mode of action as they were found to act mainly on non-genomic targets rather than on DNA. Accordingly, we have explored here their reactions with two representative model proteins through an established ESI-MS procedure with the aim to describe their general interaction mechanism with protein targets. A pronounced reactivity with the tested proteins was indeed documented; the nature of the resulting metallodrug-protein interactions could be characterised in depth in the various cases. Preferential binding to protein targets compared to DNA is supported by independent ICP-OES measurements. The implications of these findings are discussed.

  7. A discriminative method for family-based protein remote homology detection that combines inductive logic programming and propositional models.

    Science.gov (United States)

    Bernardes, Juliana S; Carbone, Alessandra; Zaverucha, Gerson

    2011-03-23

    Remote homology detection is a hard computational problem. Most approaches have trained computational models by using either full protein sequences or multiple sequence alignments (MSA), including all positions. However, when we deal with proteins in the "twilight zone" we can observe that only some segments of sequences (motifs) are conserved. We introduce a novel logical representation that allows us to represent physico-chemical properties of sequences, conserved amino acid positions and conserved physico-chemical positions in the MSA. From this, Inductive Logic Programming (ILP) finds the most frequent patterns (motifs) and uses them to train propositional models, such as decision trees and support vector machines (SVM). We use the SCOP database to perform our experiments by evaluating protein recognition within the same superfamily. Our results show that our methodology when using SVM performs significantly better than some of the state of the art methods, and comparable to other. However, our method provides a comprehensible set of logical rules that can help to understand what determines a protein function. The strategy of selecting only the most frequent patterns is effective for the remote homology detection. This is possible through a suitable first-order logical representation of homologous properties, and through a set of frequent patterns, found by an ILP system, that summarizes essential features of protein functions.

  8. Comparative studies of human and chicken retinol-binding proteins and prealbumins.

    Science.gov (United States)

    Kopelman, M; Mokady, S; Cogan, U

    1976-08-09

    Microheterogeneity of retinol-binding proteins of human plasma and urine, and of chicken plasma was studied by polyacrylamide gel electrophoresis. All three protein systems were found microheterogenous. Incorporation of retinol into the protein preparations on the one hand, and depletion of these proteins from retinol on the other hand, enabled us to clarify the extent to which the presence or absence of the ligand affects the apparent heterogeneity. Upon electrophoresis, each of the native proteins displayed two pairs of protein zones. It appeared that within each pair the fast moving band corresponded to aporetinol-binding protein which upon binding of retinol was converted to a holoprotein with a slightly lower mobility. However, it did not seem that proteins of one pair were converted to proteins of the second pair upon binding of retinol, substantiating ghe microheterogenous character of this protein system. A rapid, two step procedure for isolation of prealbumins from plasma is described. The method which consists of DEAE-cellulose chromatography follwed by preparative electrophoresis was utilized to separate human and chicken prealbumins. Routine dodecyl sulphate electrophoresis resulted in partial dissociation of human prealbumin but in no dissociation of the chicken protein. More drastic treatments prior to electrophoresis were needed to effect complete disruption of both proteins into subunits.

  9. Network based approaches reveal clustering in protein point patterns

    Science.gov (United States)

    Parker, Joshua; Barr, Valarie; Aldridge, Joshua; Samelson, Lawrence E.; Losert, Wolfgang

    2014-03-01

    Recent advances in super-resolution imaging have allowed for the sub-diffraction measurement of the spatial location of proteins on the surfaces of T-cells. The challenge is to connect these complex point patterns to the internal processes and interactions, both protein-protein and protein-membrane. We begin analyzing these patterns by forming a geometric network amongst the proteins and looking at network measures, such the degree distribution. This allows us to compare experimentally observed patterns to models. Specifically, we find that the experimental patterns differ from heterogeneous Poisson processes, highlighting an internal clustering structure. Further work will be to compare our results to simulated protein-protein interactions to determine clustering mechanisms.

  10. A generic whole body physiologically based pharmacokinetic model for therapeutic proteins in PK-Sim.

    Science.gov (United States)

    Niederalt, Christoph; Kuepfer, Lars; Solodenko, Juri; Eissing, Thomas; Siegmund, Hans-Ulrich; Block, Michael; Willmann, Stefan; Lippert, Jörg

    2018-04-01

    Proteins are an increasingly important class of drugs used as therapeutic as well as diagnostic agents. A generic physiologically based pharmacokinetic (PBPK) model was developed in order to represent at whole body level the fundamental mechanisms driving the distribution and clearance of large molecules like therapeutic proteins. The model was built as an extension of the PK-Sim model for small molecules incorporating (i) the two-pore formalism for drug extravasation from blood plasma to interstitial space, (ii) lymph flow, (iii) endosomal clearance and (iv) protection from endosomal clearance by neonatal Fc receptor (FcRn) mediated recycling as especially relevant for antibodies. For model development and evaluation, PK data was used for compounds with a wide range of solute radii. The model supports the integration of knowledge gained during all development phases of therapeutic proteins, enables translation from pre-clinical species to human and allows predictions of tissue concentration profiles which are of relevance for the analysis of on-target pharmacodynamic effects as well as off-target toxicity. The current implementation of the model replaces the generic protein PBPK model available in PK-Sim since version 4.2 and becomes part of the Open Systems Pharmacology Suite.

  11. Comparative study of C-Reactive Protein and other biochemical ...

    African Journals Online (AJOL)

    Serum levels of C-reactive proteins (CRP), Alanine aminotransferase (ALT), Aspartate aminotransferase (AST), total protein, albumin and globulins were investigated using high sensitivity Immunoturbidometric and colorimetric techniques in individuals with hepatitis (n=50), Malaria (n=50) and 40 control subjects in age ...

  12. eMatchSite: sequence order-independent structure alignments of ligand binding pockets in protein models.

    Directory of Open Access Journals (Sweden)

    Michal Brylinski

    2014-09-01

    Full Text Available Detecting similarities between ligand binding sites in the absence of global homology between target proteins has been recognized as one of the critical components of modern drug discovery. Local binding site alignments can be constructed using sequence order-independent techniques, however, to achieve a high accuracy, many current algorithms for binding site comparison require high-quality experimental protein structures, preferably in the bound conformational state. This, in turn, complicates proteome scale applications, where only various quality structure models are available for the majority of gene products. To improve the state-of-the-art, we developed eMatchSite, a new method for constructing sequence order-independent alignments of ligand binding sites in protein models. Large-scale benchmarking calculations using adenine-binding pockets in crystal structures demonstrate that eMatchSite generates accurate alignments for almost three times more protein pairs than SOIPPA. More importantly, eMatchSite offers a high tolerance to structural distortions in ligand binding regions in protein models. For example, the percentage of correctly aligned pairs of adenine-binding sites in weakly homologous protein models is only 4-9% lower than those aligned using crystal structures. This represents a significant improvement over other algorithms, e.g. the performance of eMatchSite in recognizing similar binding sites is 6% and 13% higher than that of SiteEngine using high- and moderate-quality protein models, respectively. Constructing biologically correct alignments using predicted ligand binding sites in protein models opens up the possibility to investigate drug-protein interaction networks for complete proteomes with prospective systems-level applications in polypharmacology and rational drug repositioning. eMatchSite is freely available to the academic community as a web-server and a stand-alone software distribution at http://www.brylinski.org/ematchsite.

  13. A Machine Learning Approach for Hot-Spot Detection at Protein-Protein Interfaces

    Directory of Open Access Journals (Sweden)

    Rita Melo

    2016-07-01

    Full Text Available Understanding protein-protein interactions is a key challenge in biochemistry. In this work, we describe a more accurate methodology to predict Hot-Spots (HS in protein-protein interfaces from their native complex structure compared to previous published Machine Learning (ML techniques. Our model is trained on a large number of complexes and on a significantly larger number of different structural- and evolutionary sequence-based features. In particular, we added interface size, type of interaction between residues at the interface of the complex, number of different types of residues at the interface and the Position-Specific Scoring Matrix (PSSM, for a total of 79 features. We used twenty-seven algorithms from a simple linear-based function to support-vector machine models with different cost functions. The best model was achieved by the use of the conditional inference random forest (c-forest algorithm with a dataset pre-processed by the normalization of features and with up-sampling of the minor class. The method has an overall accuracy of 0.80, an F1-score of 0.73, a sensitivity of 0.76 and a specificity of 0.82 for the independent test set.

  14. Associations of Dietary Protein and Energy Intakes With Protein-Energy Wasting Syndrome in Hemodialysis Patients.

    Science.gov (United States)

    Beddhu, Srinivasan; Wei, Guo; Chen, Xiaorui; Boucher, Robert; Kiani, Rabia; Raj, Dominic; Chonchol, Michel; Greene, Tom; Murtaugh, Maureen A

    2017-09-01

    The associations of dietary protein and/or energy intakes with protein or energy wasting in patients on maintenance hemodialysis are controversial. We examined these in the Hemodialysis (HEMO) Study. In 1487 participants in the HEMO Study, baseline dietary protein intake (grams per kilogram per day) and dietary energy intake (kilocalories per kilograms per day) were related to the presence of the protein-energy wasting (PEW) syndrome at month 12 (defined as the presence of at least 1 criteria in 2 of the 3 categories of low serum chemistry, low body mass, and low muscle mass) in logistic regression models. In additional separate models, protein intake estimated from equilibrated normalized protein catabolic rate (enPCR) was also related to the PEW syndrome. Compared with the lowest quartile, the highest quartile of baseline dietary protein intake was paradoxically associated with increased risk of the PEW syndrome at month 12 (odds ratio [OR]: 4.11; 95% confidence interval [CI]: 2.79-6.05). This relationship was completely attenuated (OR: 1.35; 95% CI: 0.88-2.06) with adjustment for baseline body weight, which suggested mathematical coupling. Results were similar for dietary energy intake. Compared with the lowest quartile of baseline enPCR, the highest quartile was not associated with the PEW syndrome at 12 months (OR: 0.78; 95% CI: 0.54-1.12). These data do not support the use of dietary protein intake or dietary energy intake criteria in the definition of the PEW syndrome in patients on maintenance hemodialysis.

  15. Poly lactic acid based injectable delivery systems for controlled release of a model protein, lysozyme.

    Science.gov (United States)

    Al-Tahami, Khaled; Meyer, Amanda; Singh, Jagdish

    2006-02-01

    The objective of this study was to evaluate the critical formulation parameters (i.e., polymer concentration, polymer molecular weight, and solvent nature) affecting the controlled delivery of a model protein, lysozyme, from injectable polymeric implants. The conformational stability and biological activity of the released lysozyme were also investigated. Three formulations containing 10%, 20%, and 30% (w/v) poly lactic acid (PLA) in triacetin were investigated. It was found that increasing polymer concentration in the formulations led to a lower burst effect and a slower release rate. Formulation with a high molecular weight polymer showed a greater burst effect as compared to those containing low molecular weight. Conformational stability and biological activity of released samples were studied by differential scanning calorimeter and enzyme activity assay, respectively. The released samples had significantly (P solution kept at same conditions). Increasing polymer concentration increased both the conformational stability and the biological activity of released lysozyme. In conclusion, phase sensitive polymer-based delivery systems were able to deliver a model protein, lysozyme, in a conformationally stable and biologically active form at a controlled rate over an extended period.

  16. A Minimalistic Resource Allocation Model to Explain Ubiquitous Increase in Protein Expression with Growth Rate.

    Directory of Open Access Journals (Sweden)

    Uri Barenholz

    Full Text Available Most proteins show changes in level across growth conditions. Many of these changes seem to be coordinated with the specific growth rate rather than the growth environment or the protein function. Although cellular growth rates, gene expression levels and gene regulation have been at the center of biological research for decades, there are only a few models giving a base line prediction of the dependence of the proteome fraction occupied by a gene with the specific growth rate. We present a simple model that predicts a widely coordinated increase in the fraction of many proteins out of the proteome, proportionally with the growth rate. The model reveals how passive redistribution of resources, due to active regulation of only a few proteins, can have proteome wide effects that are quantitatively predictable. Our model provides a potential explanation for why and how such a coordinated response of a large fraction of the proteome to the specific growth rate arises under different environmental conditions. The simplicity of our model can also be useful by serving as a baseline null hypothesis in the search for active regulation. We exemplify the usage of the model by analyzing the relationship between growth rate and proteome composition for the model microorganism E.coli as reflected in recent proteomics data sets spanning various growth conditions. We find that the fraction out of the proteome of a large number of proteins, and from different cellular processes, increases proportionally with the growth rate. Notably, ribosomal proteins, which have been previously reported to increase in fraction with growth rate, are only a small part of this group of proteins. We suggest that, although the fractions of many proteins change with the growth rate, such changes may be partially driven by a global effect, not necessarily requiring specific cellular control mechanisms.

  17. Twisting, supercoiling and stretching in protein bound DNA

    Science.gov (United States)

    Lam, Pui-Man; Zhen, Yi

    2018-04-01

    We have calculated theoretical results for the torque and slope of the twisted DNA, with various proteins bound on it, using the Neukirch-Marko model, in the regime where plectonemes exist. We found that the torque in the protein bound DNA decreases compared to that in the bare DNA. This is caused by the decrease in the free energy g(f) , and hence the smaller persistence lengths, in the case of protein bound DNA. We hope our results will encourage experimental investigations of supercoiling in protein bound DNA, which can provide further tests of the Neukirch-Marko model.

  18. Green Fluorescent Protein as a Model for Protein Crystal Growth Studies

    Science.gov (United States)

    Agena, Sabine; Smith, Lori; Karr, Laurel; Pusey, Marc

    1998-01-01

    Green fluorescent protein (GFP) from jellyfish Aequorea Victoria has become a popular marker for e.g. mutagenesis work. Its fluorescent property, which originates from a chromophore located in the center of the molecule, makes it widely applicable as a research too]. GFP clones have been produced with a variety of spectral properties, such as blue and yellow emitting species. The protein is a single chain of molecular weight 27 kDa and its structure has been determined at 1.9 Angstrom resolution. The combination of GFP's fluorescent property, the knowledge of its several crystallization conditions, and its increasing use in biophysical and biochemical studies, all led us to consider it as a model material for macromolecular crystal growth studies. Initial preparations of GFP were from E.coli with yields of approximately 5 mg/L of culture media. Current yields are now in the 50 - 120 mg/L range, and we hope to further increase this by expression of the GFP gene in the Pichia system. The results of these efforts and of preliminary crystal growth studies will be presented.

  19. Comparative study of two models of combined pulmonary fibrosis and emphysema in mice.

    Science.gov (United States)

    Zhang, Wan-Guang; Wu, Si-Si; He, Li; Yang, Qun; Feng, Yi-Kuan; Chen, Yue-Tao; Zhen, Guo-Hua; Xu, Yong-Jian; Zhang, Zhen-Xiang; Zhao, Jian-Ping; Zhang, Hui-Lan

    2017-04-01

    Combined pulmonary fibrosis and emphysema (CPFE) is an "umbrella term" encompassing emphysema and pulmonary fibrosis, but its pathogenesis is not known. We established two models of CPFE in mice using tracheal instillation with bleomycin (BLM) or murine gammaherpesvirus 68 (MHV-68). Experimental mice were divided randomly into four groups: A (normal control, n=6), B (emphysema, n=6), C (emphysema+MHV-68, n=24), D (emphysema+BLM, n=6). Group C was subdivided into four groups: C1 (sacrificed on day 367, 7 days after tracheal instillation of MHV-68); C2 (day 374; 14days); C3 (day 381; 21days); C4 (day 388; 28days). Conspicuous emphysema and interstitial fibrosis were observed in BLM and MHV-68 CPFE mouse models. However, BLM induced diffuse pulmonary interstitial fibrosis with severely diffuse pulmonary inflammation; MHV-68 induced relatively modest inflammation and fibrosis, and the inflammation and fibrosis were not diffuse, but instead around bronchioles. Inflammation and fibrosis were detectable in the day-7 subgroup and reached a peak in the day-28 subgroup in the emphysema + MHV-68 group. Levels of macrophage chemoattractant protein-1, macrophage inflammatory protein-1α, interleukin-13, and transforming growth factor-β1 in bronchoalveolar lavage fluid were increased significantly in both models. Percentage of apoptotic type-2 lung epithelial cells was significantly higher; however, all four types of cytokine and number of macrophages were significantly lower in the emphysema+MHV-68 group compared with the emphysema +BLM group. The different changes in pathology between BLM and MHV-68 mice models demonstrated different pathology subtypes of CPFE: macrophage infiltration and apoptosis of type-II lung epithelial cells increased with increasing pathology score for pulmonary fibrosis. Copyright © 2017 Elsevier GmbH. All rights reserved.

  20. COMPARATIVE STUDY ON ANGIOTENSIN CONVERTING ENZYME INHIBITORY ACTIVITY OF HYDROLYSATE OF MEAT PROTEIN OF INDONESIAN LOCAL LIVESTOCKS

    Directory of Open Access Journals (Sweden)

    J. Jamhari

    2014-10-01

    Full Text Available The experiment was conducted to investigate the angiotensin converting enzyme (ACE inhibitoryactivity of hydrolysate in meat protein of Bali cattle, Kacang goat, native chicken, and local duck. Themeats of Bali cattle, Kacang goat, native chicken, and local duck were used in this study. The meatswere ground using food processor added with aquadest to obtain meat extract. The meat extracts werethen hydrolyzed using protease enzymes to obtain hydrolysate of meat protein. Protein concentration ofmeat extract and hydrolysate of meat protein were determined, and were confirmed by sodium dodecylsulfate - poly acrylamide gel electrophoresis (SDS-PAGE. ACE inhibitory activity of hydrolysate ofmeat protein derived from Bali cattle, Kacang goat, native chicken, and local duck was also determined.The results showed that protein concentration of hydrolysate of meat protein of Bali cattle, Kacang goat,native chicken, and local duck meat was significantly higher than their meat extracts. SDS-PAGEanalysis indicated that hydrolysate of meat protein of Bali cattle, Kacang goat, native chicken, and localduck had more peptides with lower molecular weight, compared to their meat extracts. Hydrolysate ofmeat protein of Bali cattle, Kacang goat, native chicken, and local duck had potencies in inhibiting ACEactivity, so it will potentially reduce blood pressure.

  1. Formulation of probabilistic models of protein structure in atomic detail using the reference ratio method.

    Science.gov (United States)

    Valentin, Jan B; Andreetta, Christian; Boomsma, Wouter; Bottaro, Sandro; Ferkinghoff-Borg, Jesper; Frellsen, Jes; Mardia, Kanti V; Tian, Pengfei; Hamelryck, Thomas

    2014-02-01

    We propose a method to formulate probabilistic models of protein structure in atomic detail, for a given amino acid sequence, based on Bayesian principles, while retaining a close link to physics. We start from two previously developed probabilistic models of protein structure on a local length scale, which concern the dihedral angles in main chain and side chains, respectively. Conceptually, this constitutes a probabilistic and continuous alternative to the use of discrete fragment and rotamer libraries. The local model is combined with a nonlocal model that involves a small number of energy terms according to a physical force field, and some information on the overall secondary structure content. In this initial study we focus on the formulation of the joint model and the evaluation of the use of an energy vector as a descriptor of a protein's nonlocal structure; hence, we derive the parameters of the nonlocal model from the native structure without loss of generality. The local and nonlocal models are combined using the reference ratio method, which is a well-justified probabilistic construction. For evaluation, we use the resulting joint models to predict the structure of four proteins. The results indicate that the proposed method and the probabilistic models show considerable promise for probabilistic protein structure prediction and related applications. Copyright © 2013 Wiley Periodicals, Inc.

  2. A comparative evaluation of the ultrastructure and protein yields of ...

    African Journals Online (AJOL)

    ... efficient as the detergent solubilized technique. Additionally some axial filaments were also released. The protein yield of L. hardjo from the two techniques of leptospire protein preparation was significantly lower than the other test serovars (P<0.01). Overall, the protein yield of antigens produced by the SDS solubilization ...

  3. Combining NMR ensembles and molecular dynamics simulations provides more realistic models of protein structures in solution and leads to better chemical shift prediction

    International Nuclear Information System (INIS)

    Lehtivarjo, Juuso; Tuppurainen, Kari; Hassinen, Tommi; Laatikainen, Reino; Peräkylä, Mikael

    2012-01-01

    While chemical shifts are invaluable for obtaining structural information from proteins, they also offer one of the rare ways to obtain information about protein dynamics. A necessary tool in transforming chemical shifts into structural and dynamic information is chemical shift prediction. In our previous work we developed a method for 4D prediction of protein 1 H chemical shifts in which molecular motions, the 4th dimension, were modeled using molecular dynamics (MD) simulations. Although the approach clearly improved the prediction, the X-ray structures and single NMR conformers used in the model cannot be considered fully realistic models of protein in solution. In this work, NMR ensembles (NMRE) were used to expand the conformational space of proteins (e.g. side chains, flexible loops, termini), followed by MD simulations for each conformer to map the local fluctuations. Compared with the non-dynamic model, the NMRE+MD model gave 6–17% lower root-mean-square (RMS) errors for different backbone nuclei. The improved prediction indicates that NMR ensembles with MD simulations can be used to obtain a more realistic picture of protein structures in solutions and moreover underlines the importance of short and long time-scale dynamics for the prediction. The RMS errors of the NMRE+MD model were 0.24, 0.43, 0.98, 1.03, 1.16 and 2.39 ppm for 1 Hα, 1 HN, 13 Cα, 13 Cβ, 13 CO and backbone 15 N chemical shifts, respectively. The model is implemented in the prediction program 4DSPOT, available at http://www.uef.fi/4dspothttp://www.uef.fi/4dspot.

  4. Combining NMR ensembles and molecular dynamics simulations provides more realistic models of protein structures in solution and leads to better chemical shift prediction

    Energy Technology Data Exchange (ETDEWEB)

    Lehtivarjo, Juuso, E-mail: juuso.lehtivarjo@uef.fi; Tuppurainen, Kari; Hassinen, Tommi; Laatikainen, Reino [University of Eastern Finland, School of Pharmacy (Finland); Peraekylae, Mikael [University of Eastern Finland, Institute of Biomedicine (Finland)

    2012-03-15

    While chemical shifts are invaluable for obtaining structural information from proteins, they also offer one of the rare ways to obtain information about protein dynamics. A necessary tool in transforming chemical shifts into structural and dynamic information is chemical shift prediction. In our previous work we developed a method for 4D prediction of protein {sup 1}H chemical shifts in which molecular motions, the 4th dimension, were modeled using molecular dynamics (MD) simulations. Although the approach clearly improved the prediction, the X-ray structures and single NMR conformers used in the model cannot be considered fully realistic models of protein in solution. In this work, NMR ensembles (NMRE) were used to expand the conformational space of proteins (e.g. side chains, flexible loops, termini), followed by MD simulations for each conformer to map the local fluctuations. Compared with the non-dynamic model, the NMRE+MD model gave 6-17% lower root-mean-square (RMS) errors for different backbone nuclei. The improved prediction indicates that NMR ensembles with MD simulations can be used to obtain a more realistic picture of protein structures in solutions and moreover underlines the importance of short and long time-scale dynamics for the prediction. The RMS errors of the NMRE+MD model were 0.24, 0.43, 0.98, 1.03, 1.16 and 2.39 ppm for {sup 1}H{alpha}, {sup 1}HN, {sup 13}C{alpha}, {sup 13}C{beta}, {sup 13}CO and backbone {sup 15}N chemical shifts, respectively. The model is implemented in the prediction program 4DSPOT, available at http://www.uef.fi/4dspothttp://www.uef.fi/4dspot.

  5. Comparative method of protein expression and isolation of EBV epitope in E.coli DH5α

    Science.gov (United States)

    Anyndita, Nadya V. M.; Dluha, Nurul; Himmah, Karimatul; Rifa'i, Muhaimin; Widodo

    2017-11-01

    Epstein-Barr Virus (EBV) or human herpes virus 4 (HHV-4) is a virus that infects human B cell and leads to nasopharyngeal carcinoma (NPC). The prevention of this disease remains unsuccessful since the vaccine has not been discovered. The objective of this study is to over-produce EBV gp350/220 epitope using several methods in E.coli DH5α. EBV epitope sequences were inserted into pMAL-p5x vector, then transformed into DH5α E.coli and over-produced using 0.3, 1 and 2 mM IPTG. Plasmid transformation was validated using AflIII restriction enzyme in 0.8% agarose. Periplasmic protein was isolated using 2 comparative methods and then analyzed using SDS-PAGE. Method A produced a protein band around 50 kDa and appeared only at transformant. Method B failed to isolate the protein, indicated by no protein band appearing. In addition, any variations in IPTG concentration didn't give a different result. Thus it can be concluded that even the lowest IPTG concentration is able to induce protein expression.

  6. Modeling the time evolution of the nanoparticle-protein corona in a body fluid.

    Directory of Open Access Journals (Sweden)

    Daniele Dell'Orco

    Full Text Available BACKGROUND: Nanoparticles in contact with biological fluids interact with proteins and other biomolecules, thus forming a dynamic corona whose composition varies over time due to continuous protein association and dissociation events. Eventually equilibrium is reached, at which point the continued exchange will not affect the composition of the corona. RESULTS: We developed a simple and effective dynamic model of the nanoparticle protein corona in a body fluid, namely human plasma. The model predicts the time evolution and equilibrium composition of the corona based on affinities, stoichiometries and rate constants. An application to the interaction of human serum albumin, high density lipoprotein (HDL and fibrinogen with 70 nm N-iso-propylacrylamide/N-tert-butylacrylamide copolymer nanoparticles is presented, including novel experimental data for HDL. CONCLUSIONS: The simple model presented here can easily be modified to mimic the interaction of the nanoparticle protein corona with a novel biological fluid or compartment once new data will be available, thus opening novel applications in nanotoxicity and nanomedicine.

  7. Interaction of sucralose with whey protein: Experimental and molecular modeling studies

    Science.gov (United States)

    Zhang, Hongmei; Sun, Shixin; Wang, Yanqing; Cao, Jian

    2017-12-01

    The objective of this research was to study the interactions of sucralose with whey protein isolate (WPI) by using the three-dimensional fluorescence spectroscopy, circular dichroism spectroscopy and molecular modeling. The results showed that the peptide strands structure of WPI had been changed by sucralose. Sucralose binding induced the secondary structural changes and increased content of aperiodic structure of WPI. Sucralose decreased the thermal stability of WPI and acted as a structure destabilizer during the thermal unfolding process of protein. In addition, the existence of sucralose decreased the reversibility of the unfolding of WPI. Nonetheless, sucralose-WPI complex was less stable than protein alone. The molecular modeling result showed that van der Waals and hydrogen bonding interactions contribute to the complexation free binding energy. There are more than one possible binding sites of WPI with sucralose by surface binding mode.

  8. A restraint molecular dynamics and simulated annealing approach for protein homology modeling utilizing mean angles

    Directory of Open Access Journals (Sweden)

    Maurer Till

    2005-04-01

    Full Text Available Abstract Background We have developed the program PERMOL for semi-automated homology modeling of proteins. It is based on restrained molecular dynamics using a simulated annealing protocol in torsion angle space. As main restraints defining the optimal local geometry of the structure weighted mean dihedral angles and their standard deviations are used which are calculated with an algorithm described earlier by Döker et al. (1999, BBRC, 257, 348–350. The overall long-range contacts are established via a small number of distance restraints between atoms involved in hydrogen bonds and backbone atoms of conserved residues. Employing the restraints generated by PERMOL three-dimensional structures are obtained using standard molecular dynamics programs such as DYANA or CNS. Results To test this modeling approach it has been used for predicting the structure of the histidine-containing phosphocarrier protein HPr from E. coli and the structure of the human peroxisome proliferator activated receptor γ (Ppar γ. The divergence between the modeled HPr and the previously determined X-ray structure was comparable to the divergence between the X-ray structure and the published NMR structure. The modeled structure of Ppar γ was also very close to the previously solved X-ray structure with an RMSD of 0.262 nm for the backbone atoms. Conclusion In summary, we present a new method for homology modeling capable of producing high-quality structure models. An advantage of the method is that it can be used in combination with incomplete NMR data to obtain reasonable structure models in accordance with the experimental data.

  9. Hyperglycemia decreases expression of 14-3-3 proteins in an animal model of stroke.

    Science.gov (United States)

    Jeon, Seong-Jun; Sung, Jin-Hee; Koh, Phil-Ok

    2016-07-28

    Diabetes is a severe metabolic disorder and a major risk factor for stroke. Stroke severity is worse in patients with diabetes compared to the non-diabetic population. The 14-3-3 proteins are a family of conserved acidic proteins that are ubiquitously expressed in cells and tissues. These proteins are involved in many cellular processes including metabolic pathways, signal transduction, protein trafficking, protein synthesis, and cell cycle control. This study investigated 14-3-3 proteins expression in the cerebral cortex of animals with diabetes, cerebral ischemic injury and a combination of both diabetes and cerebral ischemic injury. Diabetes was induced by intraperitoneal injection of streptozotocin (40mg/kg) in adult male rats. After 4 weeks of treatment, middle cerebral artery occlusion (MCAO) was performed for the induction of focal cerebral ischemia and cerebral cortex tissue was collected 24h after MCAO. We confirmed that diabetes increases infarct volume following MCAO compared to non-diabetic animals. In diabetic animals with MCAO injury, reduction of 14-3-3 β/α, 14-3-3 ζ/δ, 14-3-3 γ, and 14-3-3 ε isoforms was detected. The expression of these proteins was significantly decreased in diabetic animals with MCAO injury compared to diabetic-only and MCAO-only animals. Moreover, Western blot analysis ascertained the decreased expression of 14-3-3 family proteins in diabetic animals with MCAO injury, including β/α, ζ/δ, γ, ε, τ, and η isoforms. These results show the changes of 14-3-3 proteins expression in streptozotocin-induced diabetic animals with MCAO injury. Thus, these findings suggest that decreases in 14-3-3 proteins might be involved in the regulation of 14-3-3 proteins under the presence of diabetes following MCAO. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  10. Proteomic Profiling Comparing the Effects of Different Heat Treatments on Camel (Camelus dromedarius) Milk Whey Proteins.

    Science.gov (United States)

    Benabdelkamel, Hicham; Masood, Afshan; Alanazi, Ibrahim O; Alzahrani, Dunia A; Alrabiah, Deema K; AlYahya, Sami A; Alfadda, Assim A

    2017-03-28

    Camel milk is consumed in the Middle East because of its high nutritional value. Traditional heating methods and the duration of heating affect the protein content and nutritional quality of the milk. We examined the denaturation of whey proteins in camel milk by assessing the effects of temperature on the whey protein profile at room temperature (RT), moderate heating at 63 °C, and at 98 °C, for 1 h. The qualitative and quantitative variations in the whey proteins before and after heat treatments were determined using quantitative 2D-difference in gel electrophoresis (DIGE)-mass spectrometry. Qualitative gel image analysis revealed a similar spot distribution between samples at RT and those heated at 63 °C, while the spot distribution between RT and samples heated at 98 °C differed. One hundred sixteen protein spots were determined to be significantly different ( p protein spots were decreased in common in both the heat-treated samples and an additional 25 spots were further decreased in the 98 °C sample. The proteins with decreased abundance included serum albumin, lactadherin, fibrinogen β and γ chain, lactotransferrin, active receptor type-2A, arginase-1, glutathione peroxidase-1 and, thiopurine S, etc. Eight protein spots were increased in common to both the samples when compared to RT and included α-lactalbumin, a glycosylation-dependent cell adhesion molecule. Whey proteins present in camel milk were less affected by heating at 63 °C than at 98 °C. This experimental study showed that denaturation increased significantly as the temperature increased from 63 to 98 °C.

  11. Subcellular localization for Gram positive and Gram negative bacterial proteins using linear interpolation smoothing model.

    Science.gov (United States)

    Saini, Harsh; Raicar, Gaurav; Dehzangi, Abdollah; Lal, Sunil; Sharma, Alok

    2015-12-07

    Protein subcellular localization is an important topic in proteomics since it is related to a protein׳s overall function, helps in the understanding of metabolic pathways, and in drug design and discovery. In this paper, a basic approximation technique from natural language processing called the linear interpolation smoothing model is applied for predicting protein subcellular localizations. The proposed approach extracts features from syntactical information in protein sequences to build probabilistic profiles using dependency models, which are used in linear interpolation to determine how likely is a sequence to belong to a particular subcellular location. This technique builds a statistical model based on maximum likelihood. It is able to deal effectively with high dimensionality that hinders other traditional classifiers such as Support Vector Machines or k-Nearest Neighbours without sacrificing performance. This approach has been evaluated by predicting subcellular localizations of Gram positive and Gram negative bacterial proteins. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. A model for variable phytoplankton stoichiometry based on cell protein regulation

    Directory of Open Access Journals (Sweden)

    J. A. Bonachela

    2013-06-01

    Full Text Available The elemental ratios of marine phytoplankton emerge from complex interactions between the biotic and abiotic components of the ocean, and reflect the plastic response of individuals to changes in their environment. The stoichiometry of phytoplankton is, thus, dynamic and dependent on the physiological state of the cell. We present a theoretical model for the dynamics of the carbon, nitrogen and phosphorus contents of a phytoplankton population. By representing the regulatory processes controlling nutrient uptake, and focusing on the relation between nutrient content and protein synthesis, our model qualitatively replicates existing experimental observations for nutrient content and ratios. The population described by our model takes up nutrients in proportions that match the input ratios for a broad range of growth conditions. In addition, there are two zones of single-nutrient limitation separated by a wide zone of co-limitation. Within the co-limitation zone, a single point can be identified where nutrients are supplied in an optimal ratio. When different species compete, the existence of a wide co-limitation zone implies a more complex pattern of coexistence and exclusion compared to previous model predictions. However, additional comprehensive laboratory experiments are needed to test our predictions. Our model contributes to the understanding of the global cycles of oceanic nitrogen and phosphorus, as well as the elemental ratios of these nutrients in phytoplankton populations.

  13. Sorting protein lists with nwCompare: a simple and fast algorithm for n-way comparison of proteomic data files.

    Science.gov (United States)

    Pont, Frédéric; Fournié, Jean Jacques

    2010-03-01

    MS, the reference technology for proteomics, routinely produces large numbers of protein lists whose fast comparison would prove very useful. Unfortunately, most softwares only allow comparisons of two to three lists at once. We introduce here nwCompare, a simple tool for n-way comparison of several protein lists without any query language, and exemplify its use with differential and shared cancer cell proteomes. As the software compares character strings, it can be applied to any type of data mining, such as genomic or metabolomic datalists.

  14. Systematic identification of yeast proteins extracted into model wine during aging on the yeast lees.

    Science.gov (United States)

    Rowe, Jeffrey D; Harbertson, James F; Osborne, James P; Freitag, Michael; Lim, Juyun; Bakalinsky, Alan T

    2010-02-24

    Total protein and protein-associated mannan concentrations were measured, and individual proteins were identified during extraction into model wines over 9 months of aging on the yeast lees following completion of fermentations by seven wine strains of Saccharomyces cerevisiae. In aged wines, protein-associated mannan increased about 6-fold (+/-66%), while total protein only increased 2-fold (+/-20%), which resulted in a significantly greater protein-associated mannan/total protein ratio for three strains. A total of 219 proteins were identified among all wine samples taken over the entire time course. Of the 17 "long-lived" proteins detected in all 9 month samples, 13 were cell wall mannoproteins, and four were glycolytic enzymes. Most cytosolic proteins were not detected after 6 months. Native mannosylated yeast invertase was assayed for binding to wine tannin and was found to have a 10-fold lower affinity than nonglycosylated bovine serum albumin. Enrichment of mannoproteins in the aged model wines implies greater solution stability than other yeast proteins and the possibility that their contributions to wine quality may persist long after bottling.

  15. Comparing coefficients of nested nonlinear probability models

    DEFF Research Database (Denmark)

    Kohler, Ulrich; Karlson, Kristian Bernt; Holm, Anders

    2011-01-01

    In a series of recent articles, Karlson, Holm and Breen have developed a method for comparing the estimated coeffcients of two nested nonlinear probability models. This article describes this method and the user-written program khb that implements the method. The KHB-method is a general decomposi......In a series of recent articles, Karlson, Holm and Breen have developed a method for comparing the estimated coeffcients of two nested nonlinear probability models. This article describes this method and the user-written program khb that implements the method. The KHB-method is a general...... decomposition method that is unaffected by the rescaling or attenuation bias that arise in cross-model comparisons in nonlinear models. It recovers the degree to which a control variable, Z, mediates or explains the relationship between X and a latent outcome variable, Y*, underlying the nonlinear probability...

  16. Protein design using continuous rotamers.

    Directory of Open Access Journals (Sweden)

    Pablo Gainza

    2012-01-01

    Full Text Available Optimizing amino acid conformation and identity is a central problem in computational protein design. Protein design algorithms must allow realistic protein flexibility to occur during this optimization, or they may fail to find the best sequence with the lowest energy. Most design algorithms implement side-chain flexibility by allowing the side chains to move between a small set of discrete, low-energy states, which we call rigid rotamers. In this work we show that allowing continuous side-chain flexibility (which we call continuous rotamers greatly improves protein flexibility modeling. We present a large-scale study that compares the sequences and best energy conformations in 69 protein-core redesigns using a rigid-rotamer model versus a continuous-rotamer model. We show that in nearly all of our redesigns the sequence found by the continuous-rotamer model is different and has a lower energy than the one found by the rigid-rotamer model. Moreover, the sequences found by the continuous-rotamer model are more similar to the native sequences. We then show that the seemingly easy solution of sampling more rigid rotamers within the continuous region is not a practical alternative to a continuous-rotamer model: at computationally feasible resolutions, using more rigid rotamers was never better than a continuous-rotamer model and almost always resulted in higher energies. Finally, we present a new protein design algorithm based on the dead-end elimination (DEE algorithm, which we call iMinDEE, that makes the use of continuous rotamers feasible in larger systems. iMinDEE guarantees finding the optimal answer while pruning the search space with close to the same efficiency of DEE.Software is available under the Lesser GNU Public License v3. Contact the authors for source code.

  17. Digestion of cooked meat proteins is slightly affected by age as assessed using the dynamic gastrointestinal TIM model and mass spectrometry.

    Science.gov (United States)

    Denis, S; Sayd, T; Georges, A; Chambon, C; Chalancon, S; Santé-Lhoutellier, V; Blanquet-Diot, S

    2016-06-15

    In humans, meat ensures the supply of proteins with high nutritional value and indispensable amino acids. The main goal of the present study was to compare the degradation of meat proteins in adult and elderly digestive conditions. Cooked meat was subjected to in vitro digestion in the dynamic multi-compartmental TIM (TNO gastroIntestinal Model) system. Digestibility and bioaccessibility were determined using nitrogen balance and digestion products were identified using mass spectrometry. The TIM model was adapted according to in vivo data to mimic the specific digestive conditions of elderly people. Meat protein digestibility and bioaccessibility were around 96 and 60% respectively and were not influenced by age (P > 0.05). As much as 800 peptides were identified in the duodenal and jejunal compartments issued from 50 meat proteins with a percentage of coverage varying from 13 to 69%. Six proteins, mainly from the cytosol, were differentially hydrolyzed under the adult and elderly digestive conditions. Pyruvate kinase was the only protein clearly showing a delay in its degradation under elderly digestive conditions. This study provides significant insights into the understanding of meat protein dynamic digestion. Such data will be helpful to design in vivo studies aiming to evaluate dietary strategies that can attenuate muscle mass loss and more generally maintain a better quality of life in the elderly population.

  18. Selective Advantage of Recombination in Evolving Protein Populations:. a Lattice Model Study

    Science.gov (United States)

    Williams, Paul D.; Pollock, David D.; Goldstein, Richard A.

    Recent research has attempted to clarify the contributions of several mutational processes, such as substitutions or homologous recombination. Simplistic, tractable protein models, which determine the compact native structure phenotype from the sequence genotype, are well-suited to such studies. In this paper, we use a lattice-protein model to examine the effects of point mutation and homologous recombination on evolving populations of proteins. We find that while the majority of mutation and recombination events are neutral or deleterious, recombination is far more likely to be beneficial. This results in a faster increase in fitness during evolution, although the final fitness level is not significantly changed. This transient advantage provides an evolutionary advantage to subpopulations that undergo recombination, allowing fixation of recombination to occur in the population.

  19. Visualization and Dissemination of Multidimensional Proteomics Data Comparing Protein Abundance During Caenorhabditis elegans Development

    Science.gov (United States)

    Riffle, Michael; Merrihew, Gennifer E.; Jaschob, Daniel; Sharma, Vagisha; Davis, Trisha N.; Noble, William S.; MacCoss, Michael J.

    2015-11-01

    Regulation of protein abundance is a critical aspect of cellular function, organism development, and aging. Alternative splicing may give rise to multiple possible proteoforms of gene products where the abundance of each proteoform is independently regulated. Understanding how the abundances of these distinct gene products change is essential to understanding the underlying mechanisms of many biological processes. Bottom-up proteomics mass spectrometry techniques may be used to estimate protein abundance indirectly by sequencing and quantifying peptides that are later mapped to proteins based on sequence. However, quantifying the abundance of distinct gene products is routinely confounded by peptides that map to multiple possible proteoforms. In this work, we describe a technique that may be used to help mitigate the effects of confounding ambiguous peptides and multiple proteoforms when quantifying proteins. We have applied this technique to visualize the distribution of distinct gene products for the whole proteome across 11 developmental stages of the model organism Caenorhabditis elegans. The result is a large multidimensional dataset for which web-based tools were developed for visualizing how translated gene products change during development and identifying possible proteoforms. The underlying instrument raw files and tandem mass spectra may also be downloaded. The data resource is freely available on the web at http://www.yeastrc.org/wormpes/.

  20. Fragile X mental retardation protein expression in Alzheimer’s disease

    Directory of Open Access Journals (Sweden)

    Abigail J Renoux

    2014-10-01

    Full Text Available The FMR1 protein product, FMRP, is an mRNA binding protein associated with translational inhibition of target transcripts. One FMRP target is the amyloid precursor protein (APP mRNA, and APP levels are elevated in Fmr1 KO mice. Given that elevated APP protein expression can elicit Alzheimer’s disease (AD in patients and model systems, we evaluated whether FMRP expression might be altered in Alzheimer’s autopsy brain samples and mouse models compared to controls. In a double transgenic mouse model of AD (APP/PS1, we found no difference in FMRP expression in aged AD model mice compared to littermate controls. FMRP expression was also similar in AD and control patient frontal cortex and cerebellum samples. Fragile X-associated tremor/ataxia syndrome (FXTAS is an age related neurodegenerative disorder caused by expanded CGG repeats in the 5’UTR of the FMR1 gene. Patients experience cognitive impairment and dementia in addition to motor symptoms. In parallel studies, we measured FMRP expression in cortex and cerebellum from three FXTAS patients and found reduced expression compared to both controls and Alzheimer’s patient brains, consistent with animal models. We also find increased APP levels in cerebellar, but not cortical, samples of FXTAS patients compared to controls. Taken together, these data suggest that a decrease in FMRP expression is unlikely to be a primary contributor to Alzheimer’s disease pathogenesis.

  1. Comparative study of void fraction models

    International Nuclear Information System (INIS)

    Borges, R.C.; Freitas, R.L.

    1985-01-01

    Some models for the calculation of void fraction in water in sub-cooled boiling and saturated vertical upward flow with forced convection have been selected and compared with experimental results in the pressure range of 1 to 150 bar. In order to know the void fraction axial distribution it is necessary to determine the net generation of vapour and the fluid temperature distribution in the slightly sub-cooled boiling region. It was verified that the net generation of vapour was well represented by the Saha-Zuber model. The selected models for the void fraction calculation present adequate results but with a tendency to super-estimate the experimental results, in particular the homogeneous models. The drift flux model is recommended, followed by the Armand and Smith models. (F.E.) [pt

  2. Comparing the Discrete and Continuous Logistic Models

    Science.gov (United States)

    Gordon, Sheldon P.

    2008-01-01

    The solutions of the discrete logistic growth model based on a difference equation and the continuous logistic growth model based on a differential equation are compared and contrasted. The investigation is conducted using a dynamic interactive spreadsheet. (Contains 5 figures.)

  3. Protein permeation through an electrically tunable membrane

    International Nuclear Information System (INIS)

    Jou, Ining A; Melnikov, Dmitriy V; Gracheva, Maria E

    2016-01-01

    Protein filtration is important in many fields of science and technology such as medicine, biology, chemistry, and engineering. Recently, protein separation and filtering with nanoporous membranes has attracted interest due to the possibility of fast separation and high throughput volume. This, however, requires understanding of the protein’s dynamics inside and in the vicinity of the nanopore. In this work, we utilize a Brownian dynamics approach to study the motion of the model protein insulin in the membrane–electrolyte electrostatic potential. We compare the results of the atomic model of the protein with the results of a coarse-grained and a single-bead model, and find that the coarse-grained representation of protein strikes the best balance between the accuracy of the results and the computational effort required. Contrary to common belief, we find that to adequately describe the protein, a single-bead model cannot be utilized without a significant effort to tabulate the simulation parameters. Similar to results for nanoparticle dynamics, our findings also indicate that the electric field and the electro-osmotic flow due to the applied membrane and electrolyte biases affect the capture and translocation of the biomolecule by either attracting or repelling it to or from the nanopore. Our computational model can also be applied to other types of proteins and separation conditions. (paper)

  4. Truly Absorbed Microbial Protein Synthesis, Rumen Bypass Protein, Endogenous Protein, and Total Metabolizable Protein from Starchy and Protein-Rich Raw Materials

    NARCIS (Netherlands)

    Parand, Ehsan; Vakili, Alireza; Mesgaran, Mohsen Danesh; Duinkerken, Van Gert; Yu, Peiqiang

    2015-01-01

    This study was carried out to measure truly absorbed microbial protein synthesis, rumen bypass protein, and endogenous protein loss, as well as total metabolizable protein, from starchy and protein-rich raw feed materials with model comparisons. Predictions by the DVE2010 system as a more

  5. Masseter muscle myofibrillar protein synthesis and degradation in an experimental critical illness myopathy model.

    Directory of Open Access Journals (Sweden)

    Hazem Akkad

    Full Text Available Critical illness myopathy (CIM is a debilitating common consequence of modern intensive care, characterized by severe muscle wasting, weakness and a decreased myosin/actin (M/A ratio. Limb/trunk muscles are primarily affected by this myopathy while cranial nerve innervated muscles are spared or less affected, but the mechanisms underlying these muscle-specific differences remain unknown. In this time-resolved study, the cranial nerve innervated masseter muscle was studied in a unique experimental rat intensive care unit (ICU model, where animals were exposed to sedation, neuromuscular blockade (NMB, mechanical ventilation, and immobilization for durations varying between 6 h and 14d. Gel electrophoresis, immunoblotting, RT-PCR and morphological staining techniques were used to analyze M/A ratios, myofiber size, synthesis and degradation of myofibrillar proteins, and levels of heat shock proteins (HSPs. Results obtained in the masseter muscle were compared with previous observations in experimental and clinical studies of limb muscles. Significant muscle-specific differences were observed, i.e., in the masseter, the decline in M/A ratio and muscle fiber size was small and delayed. Furthermore, transcriptional regulation of myosin and actin synthesis was maintained, and Akt phosphorylation was only briefly reduced. In studied degradation pathways, only mRNA, but not protein levels of MuRF1, atrogin-1 and the autophagy marker LC3b were activated by the ICU condition. The matrix metalloproteinase MMP-2 was inhibited and protective HSPs were up-regulated early. These results confirm that the cranial nerve innervated masticatory muscles is less affected by the ICU-stress response than limb muscles, in accordance with clinical observation in ICU patients with CIM, supporting the model' credibility as a valid CIM model.

  6. Modeling of the structure of ribosomal protein L1 from the archaeon Haloarcula marismortui

    Science.gov (United States)

    Nevskaya, N. A.; Kljashtorny, V. G.; Vakhrusheva, A. V.; Garber, M. B.; Nikonov, S. V.

    2017-07-01

    The halophilic archaeon Haloarcula marismortui proliferates in the Dead Sea at extremely high salt concentrations (higher than 3 M). This is the only archaeon, for which the crystal structure of the ribosomal 50S subunit was determined. However, the structure of the functionally important side protuberance containing the abnormally negatively charged protein L1 (HmaL1) was not visualized. Attempts to crystallize HmaL1 in the isolated state or as its complex with RNA using normal salt concentrations (≤500 mM) failed. A theoretical model of HmaL1 was built based on the structural data for homologs of the protein L1 from other organisms, and this model was refined by molecular dynamics methods. Analysis of this model showed that the protein HmaL1 can undergo aggregation due to the presence of a cluster of positive charges unique for proteins L1. This cluster is located at the RNA-protein interface, which interferes with the crystallization of HmaL1 and the binding of the latter to RNA.

  7. Comparative transcriptional analysis of Bacillus subtilis cells overproducing either secreted proteins, lipoproteins or membrane proteins

    NARCIS (Netherlands)

    Marciniak, Bogumila C.; Trip, Hein; van-der Veek, Patricia J.; Kuipers, Oscar P.; Marciniak, Bogumiła C.

    2012-01-01

    Background: Bacillus subtilis is a favorable host for the production of industrially relevant proteins because of its capacity of secreting proteins into the medium to high levels, its GRAS (Generally Recognized As Safe) status, its genetic accessibility and its capacity to grow in large

  8. Sculpting proteins interactively: continual energy minimization embedded in a graphical modeling system.

    Science.gov (United States)

    Surles, M C; Richardson, J S; Richardson, D C; Brooks, F P

    1994-02-01

    We describe a new paradigm for modeling proteins in interactive computer graphics systems--continual maintenance of a physically valid representation, combined with direct user control and visualization. This is achieved by a fast algorithm for energy minimization, capable of real-time performance on all atoms of a small protein, plus graphically specified user tugs. The modeling system, called Sculpt, rigidly constrains bond lengths, bond angles, and planar groups (similar to existing interactive modeling programs), while it applies elastic restraints to minimize the potential energy due to torsions, hydrogen bonds, and van der Waals and electrostatic interactions (similar to existing batch minimization programs), and user-specified springs. The graphical interface can show bad and/or favorable contacts, and individual energy terms can be turned on or off to determine their effects and interactions. Sculpt finds a local minimum of the total energy that satisfies all the constraints using an augmented Lagrange-multiplier method; calculation time increases only linearly with the number of atoms because the matrix of constraint gradients is sparse and banded. On a 100-MHz MIPS R4000 processor (Silicon Graphics Indigo), Sculpt achieves 11 updates per second on a 20-residue fragment and 2 updates per second on an 80-residue protein, using all atoms except non-H-bonding hydrogens, and without electrostatic interactions. Applications of Sculpt are described: to reverse the direction of bundle packing in a designed 4-helix bundle protein, to fold up a 2-stranded beta-ribbon into an approximate beta-barrel, and to design the sequence and conformation of a 30-residue peptide that mimics one partner of a protein subunit interaction. Computer models that are both interactive and physically realistic (within the limitations of a given force field) have 2 significant advantages: (1) they make feasible the modeling of very large changes (such as needed for de novo design), and

  9. A comparative study on the radioactive labelling of proteins

    International Nuclear Information System (INIS)

    Koch, G.K.; Heertje, I.; Stijn, F. van

    1977-01-01

    The main methods in protein labelling are exchange labelling, iodination, acylation and alkylation. The universal application of the techniques is evaluated by a number of criteria, derived from the demand that labelled proteins should be as identical to the native ones as possible. From our experiences on labelling methods it is concluded that reductive methylation meets most requirements. (orig.) [de

  10. Modeling of human factor Va inactivation by activated protein C

    Directory of Open Access Journals (Sweden)

    Bravo Maria

    2012-05-01

    Full Text Available Abstract Background Because understanding of the inventory, connectivity and dynamics of the components characterizing the process of coagulation is relatively mature, it has become an attractive target for physiochemical modeling. Such models can potentially improve the design of therapeutics. The prothrombinase complex (composed of the protease factor (FXa and its cofactor FVa plays a central role in this network as the main producer of thrombin, which catalyses both the activation of platelets and the conversion of fibrinogen to fibrin, the main substances of a clot. A key negative feedback loop that prevents clot propagation beyond the site of injury is the thrombin-dependent generation of activated protein C (APC, an enzyme that inactivates FVa, thus neutralizing the prothrombinase complex. APC inactivation of FVa is complex, involving the production of partially active intermediates and “protection” of FVa from APC by both FXa and prothrombin. An empirically validated mathematical model of this process would be useful in advancing the predictive capacity of comprehensive models of coagulation. Results A model of human APC inactivation of prothrombinase was constructed in a stepwise fashion by analyzing time courses of FVa inactivation in empirical reaction systems with increasing number of interacting components and generating corresponding model constructs of each reaction system. Reaction mechanisms, rate constants and equilibrium constants informing these model constructs were initially derived from various research groups reporting on APC inactivation of FVa in isolation, or in the presence of FXa or prothrombin. Model predictions were assessed against empirical data measuring the appearance and disappearance of multiple FVa degradation intermediates as well as prothrombinase activity changes, with plasma proteins derived from multiple preparations. Our work integrates previously published findings and through the cooperative

  11. Dietary soya protein improves intra-myocardial lipid deposition and altered glucose metabolism in a hypertensive, dyslipidaemic, insulin-resistant rat model.

    Science.gov (United States)

    Oliva, María E; Creus, Agustina; Ferreira, María R; Chicco, Adriana; Lombardo, Yolanda B

    2018-01-01

    This study investigates the effects of replacing dietary casein by soya protein on the underlying mechanisms involved in the impaired metabolic fate of glucose and lipid metabolisms in the heart of dyslipidaemic rats chronically fed (8 months) a sucrose-rich (62·5 %) diet (SRD). To test this hypothesis, Wistar rats were fed an SRD for 4 months. From months 4 to 8, half the animals continued with the SRD and the other half were fed an SRD in which casein was substituted by soya. The control group received a diet with maize starch as the carbohydrate source. Compared with the SRD-fed group, the following results were obtained. First, soya protein significantly (Psoya protein significantly increased (Psoya protein upon the altered pathways of glucose and lipid metabolism in the heart muscle of this rat model.

  12. Protein Loop Dynamics Are Complex and Depend on the Motions of the Whole Protein

    Directory of Open Access Journals (Sweden)

    Michael T. Zimmermann

    2012-04-01

    Full Text Available We investigate the relationship between the motions of the same peptide loop segment incorporated within a protein structure and motions of free or end-constrained peptides. As a reference point we also compare against alanine chains having the same length as the loop. Both the analysis of atomic molecular dynamics trajectories and structure-based elastic network models, reveal no general dependence on loop length or on the number of solvent exposed residues. Rather, the whole structure affects the motions in complex ways that depend strongly and specifically on the tertiary structure of the whole protein. Both the Elastic Network Models and Molecular Dynamics confirm the differences in loop dynamics between the free and structured contexts; there is strong agreement between the behaviors observed from molecular dynamics and the elastic network models. There is no apparent simple relationship between loop mobility and its size, exposure, or position within a loop. Free peptides do not behave the same as the loops in the proteins. Surface loops do not behave as if they were random coils, and the tertiary structure has a critical influence upon the apparent motions. This strongly implies that entropy evaluation of protein loops requires knowledge of the motions of the entire protein structure.

  13. Genome-scale metabolic model of Pichia pastoris with native and humanized glycosylation of recombinant proteins

    DEFF Research Database (Denmark)

    Irani, Zahra Azimzadeh; Kerkhoven, Eduard J.; Shojaosadati, Seyed Abbas

    2016-01-01

    prediction of protein yield, demonstrates the effect of the different types of N-glycosylation of protein yield, and can be used to predict potential targets for strain improvement. The model represents a step towards a more complete description of protein production in P. pastoris, which is required...... for using these models to understand and optimize protein production processes....

  14. Structural characterisation of medically relevant protein assemblies by integrating mass spectrometry with computational modelling.

    Science.gov (United States)

    Politis, Argyris; Schmidt, Carla

    2018-03-20

    Structural mass spectrometry with its various techniques is a powerful tool for the structural elucidation of medically relevant protein assemblies. It delivers information on the composition, stoichiometries, interactions and topologies of these assemblies. Most importantly it can deal with heterogeneous mixtures and assemblies which makes it universal among the conventional structural techniques. In this review we summarise recent advances and challenges in structural mass spectrometric techniques. We describe how the combination of the different mass spectrometry-based methods with computational strategies enable structural models at molecular levels of resolution. These models hold significant potential for helping us in characterizing the function of protein assemblies related to human health and disease. In this review we summarise the techniques of structural mass spectrometry often applied when studying protein-ligand complexes. We exemplify these techniques through recent examples from literature that helped in the understanding of medically relevant protein assemblies. We further provide a detailed introduction into various computational approaches that can be integrated with these mass spectrometric techniques. Last but not least we discuss case studies that integrated mass spectrometry and computational modelling approaches and yielded models of medically important protein assembly states such as fibrils and amyloids. Copyright © 2017 The Author(s). Published by Elsevier B.V. All rights reserved.

  15. Overexpressed Proteins in Hypervirulent Clade 8 and Clade 6 Strains of Escherichia coli O157:H7 Compared to E. coli O157:H7 EDL933 Clade 3 Strain.

    Directory of Open Access Journals (Sweden)

    Natalia Amigo

    Full Text Available Escherichia coli O157:H7 is responsible for severe diarrhea and hemolytic uremic syndrome (HUS, and predominantly affects children under 5 years. The major virulence traits are Shiga toxins, necessary to develop HUS and the Type III Secretion System (T3SS through which bacteria translocate effector proteins directly into the host cell. By SNPs typing, E. coli O157:H7 was separated into nine different clades. Clade 8 and clade 6 strains were more frequently associated with severe disease and HUS. In this study, we aimed to identify differentially expressed proteins in two strains of E. coli O157:H7 (clade 8 and clade 6, obtained from cattle and compared them with the well characterized reference EDL933 strain (clade 3. Clade 8 and clade 6 strains show enhanced pathogenicity in a mouse model and virulence-related properties. Proteins were extracted and analyzed using the TMT-6plex labeling strategy associated with two dimensional liquid chromatography and mass spectrometry in tandem. We detected 2241 proteins in the cell extract and 1787 proteins in the culture supernatants. Attention was focused on the proteins related to virulence, overexpressed in clade 6 and 8 strains compared to EDL933 strain. The proteins relevant overexpressed in clade 8 strain were the curli protein CsgC, a transcriptional activator (PchE, phage proteins, Stx2, FlgM and FlgD, a dienelactone hydrolase, CheW and CheY, and the SPATE protease EspP. For clade 6 strain, a high overexpression of phage proteins was detected, mostly from Stx2 encoding phage, including Stx2, flagellin and the protease TagA, EDL933_p0016, dienelactone hydrolase, and Haemolysin A, amongst others with unknown function. Some of these proteins were analyzed by RT-qPCR to corroborate the proteomic data. Clade 6 and clade 8 strains showed enhanced transcription of 10 out of 12 genes compared to EDL933. These results may provide new insights in E. coli O157:H7 mechanisms of pathogenesis.

  16. Comparative Proteomic Analysis Reveals Proteins Putatively Involved in Toxin Biosynthesis in the Marine Dinoflagellate Alexandrium catenella

    Directory of Open Access Journals (Sweden)

    Da-Zhi Wang

    2013-01-01

    Full Text Available Alexandrium is a neurotoxin-producing dinoflagellate genus resulting in paralytic shellfish poisonings around the world. However, little is known about the toxin biosynthesis mechanism in Alexandrium. This study compared protein profiles of A. catenella collected at different toxin biosynthesis stages (non-toxin synthesis, initial toxin synthesis and toxin synthesizing coupled with the cell cycle, and identified differentially expressed proteins using 2-DE and MALDI-TOF-TOF mass spectrometry. The results showed that toxin biosynthesis of A. catenella occurred within a defined time frame in the G1 phase of the cell cycle. Proteomic analysis indicated that 102 protein spots altered significantly in abundance (P < 0.05, and 53 proteins were identified using database searching. These proteins were involved in a variety of biological processes, i.e., protein modification and biosynthesis, metabolism, cell division, oxidative stress, transport, signal transduction, and translation. Among them, nine proteins with known functions in paralytic shellfish toxin-producing cyanobacteria, i.e., methionine S-adenosyltransferase, chloroplast ferredoxin-NADP+ reductase, S-adenosylhomocysteinase, adenosylhomocysteinase, ornithine carbamoyltransferase, inorganic pyrophosphatase, sulfotransferase (similar to, alcohol dehydrogenase and arginine deiminase, varied significantly at different toxin biosynthesis stages and formed an interaction network, indicating that they might be involved in toxin biosynthesis in A. catenella. This study is the first step in the dissection of the behavior of the A. catenella proteome during different toxin biosynthesis stages and provides new insights into toxin biosynthesis in dinoflagellates.

  17. Maillard-reaction-induced modification and aggregation of proteins and hardening of texture in protein bar model systems.

    Science.gov (United States)

    Zhou, Peng; Guo, Mufan; Liu, Dasong; Liu, Xiaoming; Labuza, Teodore P

    2013-03-01

    The hardening of high-protein bars causes problems in their acceptability to consumers. The objective of this study was to determine the progress of the Maillard reaction in model systems of high-protein nutritional bars containing reducing sugars, and to illustrate the influences of the Maillard reaction on the modification and aggregation of proteins and the hardening of bar matrices during storage. The progress of the Maillard reaction, glycation, and aggregation of proteins, and textural changes in bar matrices were investigated during storage at 25, 35, and 45 °C. The initial development of the Maillard reaction caused little changes in hardness; however, further storage resulted in dramatic modification of protein with formation of high-molecular-weight polymers, resulting in the hardening in texture. The replacement of reducing sugars with nonreducing ingredients such as sugar alcohols in the formula minimized the changes in texture. The hardening of high-protein bars causes problems in their acceptability to consumers. Maillard reaction is one of the mechanisms contributing to the hardening of bar matrix, particularly for the late stage of storage. The replacement of reducing sugars with nonreducing ingredients such as sugar alcohols in the formula will minimize the changes in texture. © 2013 Institute of Food Technologists®

  18. Wellness Model of Supervision: A Comparative Analysis

    Science.gov (United States)

    Lenz, A. Stephen; Sangganjanavanich, Varunee Faii; Balkin, Richard S.; Oliver, Marvarene; Smith, Robert L.

    2012-01-01

    This quasi-experimental study compared the effectiveness of the Wellness Model of Supervision (WELMS; Lenz & Smith, 2010) with alternative supervision models for developing wellness constructs, total personal wellness, and helping skills among counselors-in-training. Participants were 32 master's-level counseling students completing their…

  19. Looping and clustering model for the organization of protein-DNA complexes on the bacterial genome

    Science.gov (United States)

    Walter, Jean-Charles; Walliser, Nils-Ole; David, Gabriel; Dorignac, Jérôme; Geniet, Frédéric; Palmeri, John; Parmeggiani, Andrea; Wingreen, Ned S.; Broedersz, Chase P.

    2018-03-01

    The bacterial genome is organized by a variety of associated proteins inside a structure called the nucleoid. These proteins can form complexes on DNA that play a central role in various biological processes, including chromosome segregation. A prominent example is the large ParB-DNA complex, which forms an essential component of the segregation machinery in many bacteria. ChIP-Seq experiments show that ParB proteins localize around centromere-like parS sites on the DNA to which ParB binds specifically, and spreads from there over large sections of the chromosome. Recent theoretical and experimental studies suggest that DNA-bound ParB proteins can interact with each other to condense into a coherent 3D complex on the DNA. However, the structural organization of this protein-DNA complex remains unclear, and a predictive quantitative theory for the distribution of ParB proteins on DNA is lacking. Here, we propose the looping and clustering model, which employs a statistical physics approach to describe protein-DNA complexes. The looping and clustering model accounts for the extrusion of DNA loops from a cluster of interacting DNA-bound proteins that is organized around a single high-affinity binding site. Conceptually, the structure of the protein-DNA complex is determined by a competition between attractive protein interactions and loop closure entropy of this protein-DNA cluster on the one hand, and the positional entropy for placing loops within the cluster on the other. Indeed, we show that the protein interaction strength determines the ‘tightness’ of the loopy protein-DNA complex. Thus, our model provides a theoretical framework for quantitatively computing the binding profiles of ParB-like proteins around a cognate (parS) binding site.

  20. Moderate carbohydrate, moderate protein weight loss diet reduces cardiovascular disease risk compared to high carbohydrate, low protein diet in obese adults: A randomized clinical trial

    Directory of Open Access Journals (Sweden)

    Evans Ellen M

    2008-11-01

    Full Text Available Abstract Background To evaluate the metabolic effects of two weight loss diets differing in macronutrient composition on features of dyslipidemia and post-prandial insulin (INS response to a meal challenge in overweight/obese individuals. Methods This study was a parallel-arm randomized 4 mo weight loss trial. Adults (n = 50, 47 ± 7 y matched on BMI (33.6 ± 0.6 kg/m2, P = 0.79 consumed energy restricted diets (deficit ~500 kcal/d: PRO (1.6 g.kg-1.d-1 protein and -1.d-1 protein and > 220 g/d carbohydrate for 4 mos. Meal challenges of respective diets were utilized for determination of blood lipids and post-prandial INS and glucose response at the beginning and end of the study. Results There was a trend for PRO to lose more weight (-9.1% vs. -7.3%, P = 0.07 with a significant reduction in percent fat mass compared to CHO (-8.7% vs. -5.7%; P = 0.03. PRO also favored reductions in triacylglycerol (-34% vs. -14%; P P = 0.05; however, CHO favored reduction in LDL-C (-7% vs. +2.5%; P P P Conclusion A weight loss diet with moderate carbohydrate, moderate protein results in more favorable changes in body composition, dyslipidemia, and post-prandial INS response compared to a high carbohydrate, low protein diet suggesting an additional benefit beyond weight management to include augmented risk reduction for metabolic disease.

  1. Classification of Beta-lactamases and penicillin binding proteins using ligand-centric network models.

    Directory of Open Access Journals (Sweden)

    Hakime Öztürk

    Full Text Available β-lactamase mediated antibiotic resistance is an important health issue and the discovery of new β-lactam type antibiotics or β-lactamase inhibitors is an area of intense research. Today, there are about a thousand β-lactamases due to the evolutionary pressure exerted by these ligands. While β-lactamases hydrolyse the β-lactam ring of antibiotics, rendering them ineffective, Penicillin-Binding Proteins (PBPs, which share high structural similarity with β-lactamases, also confer antibiotic resistance to their host organism by acquiring mutations that allow them to continue their participation in cell wall biosynthesis. In this paper, we propose a novel approach to include ligand sharing information for classifying and clustering β-lactamases and PBPs in an effort to elucidate the ligand induced evolution of these β-lactam binding proteins. We first present a detailed summary of the β-lactamase and PBP families in the Protein Data Bank, as well as the compounds they bind to. Then, we build two different types of networks in which the proteins are represented as nodes, and two proteins are connected by an edge with a weight that depends on the number of shared identical or similar ligands. These models are analyzed under three different edge weight settings, namely unweighted, weighted, and normalized weighted. A detailed comparison of these six networks showed that the use of ligand sharing information to cluster proteins resulted in modules comprising proteins with not only sequence similarity but also functional similarity. Consideration of ligand similarity highlighted some interactions that were not detected in the identical ligand network. Analysing the β-lactamases and PBPs using ligand-centric network models enabled the identification of novel relationships, suggesting that these models can be used to examine other protein families to obtain information on their ligand induced evolutionary paths.

  2. Evaluation of plasma C-reactive protein levels in pregnant women with and without periodontal disease: A comparative study.

    Science.gov (United States)

    Sharma, Anupriya; Ramesh, Amitha; Thomas, Biju

    2009-09-01

    Circulating C-reactive protein (CRP) levels are a marker of systemic inflammation and are associated with periodontal disease, a chronic bacterial infection associated with elevation of proinflammatory cytokines and prostaglandins. CRP has been associated with adverse pregnancy outcomes, including preterm delivery, preeclampsia, and intrauterine growth restriction. Furthermore, periodontal disease has been associated with increased risk of preterm low birth weight, low birth weight, and preterm birth. The present study was conducted to assess plasma CRP levels in pregnant women with and without periodontal disease; to evaluate the effect of periodontal therapy on the incidence of preterm delivery; and to compare the incidence of preterm delivery in pregnant women with and without periodontal disease. A total of 90 pregnant women aged between 18-35 years with gestational age between 12-28 weeks were recruited and divided into three equal groups (control group, study group, treatment group) of 30 each. Blood samples were taken for estimation of C-reactive protein levels from all groups at 12-20 weeks of gestation, determined using ultrasensitive turbidimetric immunoassay (QUANTIA-CRP US). The treatment group comprised plaque control, scaling, and root planning and daily rinsing with 0.2% chlorhexidine mouth before 28 weeks of gestation. The mean value of C-reactive protein levels in subjects with periodontal disease was higher compared to control group i.e., 1.20 +/- 0.247 mg/dl and 1.22 +/- 0.250 mg/dl, respectively, compared to 0.713 +/- 0.139 mg/ dl (P = 0.001). The mean value of CRP levels before treatment was greater than the mean value after treatment i.e., 1.22 +/- 0.25 compared to 0.84 +/- 0.189 (P periodontal disease group (study group) compared to 8.3% in the control group (P = 0.001). The incidence of preterm delivery in the treatment group was 15.0% compared to 31.7% in the nontreatment group (study group). The findings from the study suggest that

  3. Soy Flour Adhesive Strength Compared with That of Purified Soy Proteins*

    Science.gov (United States)

    Linda Lorenz; Michael Birkeland; Chera Daurio; Charles R. Frihart

    2015-01-01

    Except for the substitution of soy flour in phenolic resins (Frihart et al. 2013) and the use of soy flour at high pHs (Lambuth 2003), the literature on soy protein properties for adhesives has mainly focused on soy protein isolate and specific protein fractions (Sun 2005b). The assumption is that proteins are the main portion of soy flour giving bond strength and the...

  4. MATHEMATICAL AND COMPUTATIONAL MODELLING OF RIBOSOMAL MOVEMENT AND PROTEIN SYNTHESIS: AN OVERVIEW

    Directory of Open Access Journals (Sweden)

    Tobias von der Haar

    2012-04-01

    Full Text Available Translation or protein synthesis consists of a complex system of chemical reactions, which ultimately result in decoding of the mRNA and the production of a protein. The complexity of this reaction system makes it difficult to quantitatively connect its input parameters (such as translation factor or ribosome concentrations, codon composition of the mRNA, or energy availability to output parameters (such as protein synthesis rates or ribosome densities on mRNAs. Mathematical and computational models of translation have now been used for nearly five decades to investigate translation, and to shed light on the relationship between the different reactions in the system. This review gives an overview over the principal approaches used in the modelling efforts, and summarises some of the major findings that were made.

  5. Comparative Proteomic Characterization of 4 Human Liver-Derived Single Cell Culture Models Reveals Significant Variation in the Capacity for Drug Disposition, Bioactivation, and Detoxication.

    Science.gov (United States)

    Sison-Young, Rowena L C; Mitsa, Dimitra; Jenkins, Rosalind E; Mottram, David; Alexandre, Eliane; Richert, Lysiane; Aerts, Hélène; Weaver, Richard J; Jones, Robert P; Johann, Esther; Hewitt, Philip G; Ingelman-Sundberg, Magnus; Goldring, Christopher E P; Kitteringham, Neil R; Park, B Kevin

    2015-10-01

    In vitro preclinical models for the assessment of drug-induced liver injury (DILI) are usually based on cryopreserved primary human hepatocytes (cPHH) or human hepatic tumor-derived cell lines; however, it is unclear how well such cell models reflect the normal function of liver cells. The physiological, pharmacological, and toxicological phenotyping of available cell-based systems is necessary in order to decide the testing purpose for which they are fit. We have therefore undertaken a global proteomic analysis of 3 human-derived hepatic cell lines (HepG2, Upcyte, and HepaRG) in comparison with cPHH with a focus on drug metabolizing enzymes and transport proteins (DMETs), as well as Nrf2-regulated proteins. In total, 4946 proteins were identified, of which 2722 proteins were common across all cell models, including 128 DMETs. Approximately 90% reduction in expression of cytochromes P450 was observed in HepG2 and Upcyte cells, and approximately 60% in HepaRG cells relative to cPHH. Drug transporter expression was also lower compared with cPHH with the exception of MRP3 and P-gp (MDR1) which appeared to be significantly expressed in HepaRG cells. In contrast, a high proportion of Nrf2-regulated proteins were more highly expressed in the cell lines compared with cPHH. The proteomic database derived here will provide a rational basis for the context-specific selection of the most appropriate 'hepatocyte-like' cell for the evaluation of particular cellular functions associated with DILI and, at the same time, assist in the construction of a testing paradigm which takes into account the in vivo disposition of a new drug. © The Author 2015. Published by Oxford University Press on behalf of the Society of Toxicology.

  6. Ileal and faecal protein digestibility measurement in humans and other non-ruminants - a comparative species view

    NARCIS (Netherlands)

    Hendriks, W.H.; Baal, van J.; Bosch, G.

    2012-01-01

    A comparative non-ruminant species view of the contribution of the large intestinal metabolism to inaccuracies in nitrogen and amino acid absorption measurements is provided to assess potential implications for the determination of crude protein/amino acid digestibility in adult humans consuming

  7. Protein S is protective in pulmonary fibrosis.

    Science.gov (United States)

    Urawa, M; Kobayashi, T; D'Alessandro-Gabazza, C N; Fujimoto, H; Toda, M; Roeen, Z; Hinneh, J A; Yasuma, T; Takei, Y; Taguchi, O; Gabazza, E C

    2016-08-01

    Essentials Epithelial cell apoptosis is critical in the pathogenesis of idiopathic pulmonary fibrosis. Protein S, a circulating anticoagulant, inhibited apoptosis of lung epithelial cells. Overexpression of protein S in lung cells reduced bleomycin-induced pulmonary fibrosis. Intranasal therapy with exogenous protein S ameliorated bleomycin-induced pulmonary fibrosis. Background Pulmonary fibrosis is the terminal stage of interstitial lung diseases, some of them being incurable and of unknown etiology. Apoptosis plays a critical role in lung fibrogenesis. Protein S is a plasma anticoagulant with potent antiapoptotic activity. The role of protein S in pulmonary fibrosis is unknown. Objectives To evaluate the clinical relevance of protein S and its protective role in pulmonary fibrosis. Methods and Results The circulating level of protein S was measured in patients with pulmonary fibrosis and controls by the use of enzyme immunoassays. Pulmonary fibrosis was induced with bleomycin in transgenic mice overexpressing human protein S and wild-type mice, and exogenous protein S or vehicle was administered to wild-type mice; fibrosis was then compared in both models. Patients with pulmonary fibrosis had reduced circulating levels of protein S as compared with controls. Inflammatory changes, the levels of profibrotic cytokines, fibrosis score, hydroxyproline content in the lungs and oxygen desaturation were significantly reduced in protein S-transgenic mice as compared with wild-type mice. Wild-type mice treated with exogenous protein S showed significant decreases in the levels of inflammatory and profibrotic markers and fibrosis in the lungs as compared with untreated control mice. After bleomycin infusion, mice overexpressing human protein S showed significantly low caspase-3 activity, enhanced expression of antiapoptotic molecules and enhanced Akt and Axl kinase phosphorylation as compared with wild-type counterparts. Protein S also inhibited apoptosis of alveolar

  8. Normal mode analysis as a method to derive protein dynamics information from the Protein Data Bank.

    Science.gov (United States)

    Wako, Hiroshi; Endo, Shigeru

    2017-12-01

    Normal mode analysis (NMA) can facilitate quick and systematic investigation of protein dynamics using data from the Protein Data Bank (PDB). We developed an elastic network model-based NMA program using dihedral angles as independent variables. Compared to the NMA programs that use Cartesian coordinates as independent variables, key attributes of the proposed program are as follows: (1) chain connectivity related to the folding pattern of a polypeptide chain is naturally embedded in the model; (2) the full-atom system is acceptable, and owing to a considerably smaller number of independent variables, the PDB data can be used without further manipulation; (3) the number of variables can be easily reduced by some of the rotatable dihedral angles; (4) the PDB data for any molecule besides proteins can be considered without coarse-graining; and (5) individual motions of constituent subunits and ligand molecules can be easily decomposed into external and internal motions to examine their mutual and intrinsic motions. Its performance is illustrated with an example of a DNA-binding allosteric protein, a catabolite activator protein. In particular, the focus is on the conformational change upon cAMP and DNA binding, and on the communication between their binding sites remotely located from each other. In this illustration, NMA creates a vivid picture of the protein dynamics at various levels of the structures, i.e., atoms, residues, secondary structures, domains, subunits, and the complete system, including DNA and cAMP. Comparative studies of the specific protein in different states, e.g., apo- and holo-conformations, and free and complexed configurations, provide useful information for studying structurally and functionally important aspects of the protein.

  9. Multiscale modeling and simulation of microtubule–motor-protein assemblies

    Science.gov (United States)

    Gao, Tong; Blackwell, Robert; Glaser, Matthew A.; Betterton, M. D.; Shelley, Michael J.

    2016-01-01

    Microtubules and motor proteins self-organize into biologically important assemblies including the mitotic spindle and the centrosomal microtubule array. Outside of cells, microtubule-motor mixtures can form novel active liquid-crystalline materials driven out of equilibrium by adenosine triphosphate–consuming motor proteins. Microscopic motor activity causes polarity-dependent interactions between motor proteins and microtubules, but how these interactions yield larger-scale dynamical behavior such as complex flows and defect dynamics is not well understood. We develop a multiscale theory for microtubule-motor systems in which Brownian dynamics simulations of polar microtubules driven by motors are used to study microscopic organization and stresses created by motor-mediated microtubule interactions. We identify polarity-sorting and crosslink tether relaxation as two polar-specific sources of active destabilizing stress. We then develop a continuum Doi-Onsager model that captures polarity sorting and the hydrodynamic flows generated by these polar-specific active stresses. In simulations of active nematic flows on immersed surfaces, the active stresses drive turbulent flow dynamics and continuous generation and annihilation of disclination defects. The dynamics follow from two instabilities, and accounting for the immersed nature of the experiment yields unambiguous characteristic length and time scales. When turning off the hydrodynamics in the Doi-Onsager model, we capture formation of polar lanes as observed in the Brownian dynamics simulation. PMID:26764729

  10. Multiscale modeling and simulation of microtubule-motor-protein assemblies.

    Science.gov (United States)

    Gao, Tong; Blackwell, Robert; Glaser, Matthew A; Betterton, M D; Shelley, Michael J

    2015-01-01

    Microtubules and motor proteins self-organize into biologically important assemblies including the mitotic spindle and the centrosomal microtubule array. Outside of cells, microtubule-motor mixtures can form novel active liquid-crystalline materials driven out of equilibrium by adenosine triphosphate-consuming motor proteins. Microscopic motor activity causes polarity-dependent interactions between motor proteins and microtubules, but how these interactions yield larger-scale dynamical behavior such as complex flows and defect dynamics is not well understood. We develop a multiscale theory for microtubule-motor systems in which Brownian dynamics simulations of polar microtubules driven by motors are used to study microscopic organization and stresses created by motor-mediated microtubule interactions. We identify polarity-sorting and crosslink tether relaxation as two polar-specific sources of active destabilizing stress. We then develop a continuum Doi-Onsager model that captures polarity sorting and the hydrodynamic flows generated by these polar-specific active stresses. In simulations of active nematic flows on immersed surfaces, the active stresses drive turbulent flow dynamics and continuous generation and annihilation of disclination defects. The dynamics follow from two instabilities, and accounting for the immersed nature of the experiment yields unambiguous characteristic length and time scales. When turning off the hydrodynamics in the Doi-Onsager model, we capture formation of polar lanes as observed in the Brownian dynamics simulation.

  11. Multiscale modeling and simulation of microtubule-motor-protein assemblies

    Science.gov (United States)

    Gao, Tong; Blackwell, Robert; Glaser, Matthew A.; Betterton, M. D.; Shelley, Michael J.

    2015-12-01

    Microtubules and motor proteins self-organize into biologically important assemblies including the mitotic spindle and the centrosomal microtubule array. Outside of cells, microtubule-motor mixtures can form novel active liquid-crystalline materials driven out of equilibrium by adenosine triphosphate-consuming motor proteins. Microscopic motor activity causes polarity-dependent interactions between motor proteins and microtubules, but how these interactions yield larger-scale dynamical behavior such as complex flows and defect dynamics is not well understood. We develop a multiscale theory for microtubule-motor systems in which Brownian dynamics simulations of polar microtubules driven by motors are used to study microscopic organization and stresses created by motor-mediated microtubule interactions. We identify polarity-sorting and crosslink tether relaxation as two polar-specific sources of active destabilizing stress. We then develop a continuum Doi-Onsager model that captures polarity sorting and the hydrodynamic flows generated by these polar-specific active stresses. In simulations of active nematic flows on immersed surfaces, the active stresses drive turbulent flow dynamics and continuous generation and annihilation of disclination defects. The dynamics follow from two instabilities, and accounting for the immersed nature of the experiment yields unambiguous characteristic length and time scales. When turning off the hydrodynamics in the Doi-Onsager model, we capture formation of polar lanes as observed in the Brownian dynamics simulation.

  12. Connecting protein and mRNA burst distributions for stochastic models of gene expression

    International Nuclear Information System (INIS)

    Elgart, Vlad; Jia, Tao; Fenley, Andrew T; Kulkarni, Rahul

    2011-01-01

    The intrinsic stochasticity of gene expression can lead to large variability in protein levels for genetically identical cells. Such variability in protein levels can arise from infrequent synthesis of mRNAs which in turn give rise to bursts of protein expression. Protein expression occurring in bursts has indeed been observed experimentally and recent studies have also found evidence for transcriptional bursting, i.e. production of mRNAs in bursts. Given that there are distinct experimental techniques for quantifying the noise at different stages of gene expression, it is of interest to derive analytical results connecting experimental observations at different levels. In this work, we consider stochastic models of gene expression for which mRNA and protein production occurs in independent bursts. For such models, we derive analytical expressions connecting protein and mRNA burst distributions which show how the functional form of the mRNA burst distribution can be inferred from the protein burst distribution. Additionally, if gene expression is repressed such that observed protein bursts arise only from single mRNAs, we show how observations of protein burst distributions (repressed and unrepressed) can be used to completely determine the mRNA burst distribution. Assuming independent contributions from individual bursts, we derive analytical expressions connecting means and variances for burst and steady-state protein distributions. Finally, we validate our general analytical results by considering a specific reaction scheme involving regulation of protein bursts by small RNAs. For a range of parameters, we derive analytical expressions for regulated protein distributions that are validated using stochastic simulations. The analytical results obtained in this work can thus serve as useful inputs for a broad range of studies focusing on stochasticity in gene expression

  13. Integrated and comparative proteomics of high-oil and high-protein soybean seeds.

    Science.gov (United States)

    Xu, Xiu Ping; Liu, Hui; Tian, Lihong; Dong, Xiang Bai; Shen, Shi Hua; Qu, Le Qing

    2015-04-01

    We analysed the global protein expression in seeds of a high-oil soybean cultivar (Jiyu 73, JY73) by proteomics. More than 700 protein spots were detected and 363 protein spots were successfully identified. Comparison of the protein profile of JY73 with that of a high-protein cultivar (Zhonghuang 13, ZH13) revealed 40 differentially expressed proteins, including oil synthesis, redox/stress, hydrolysis and storage-related proteins. All redox/stress proteins were less or not expressed in JY73, whereas the expression of the major storage proteins, nitrogen and carbon metabolism-related proteins was higher in ZH13. Biochemical analysis of JY73 revealed that it was in a low oxidation state, with a high content of polyunsaturated fatty acids and vitamin E. Vitamin E was more active than antioxidant enzymes and protected the soybean seed in a lower oxidation state. The characteristics of high oil and high protein in soybean, we revealed, might provide a reference for soybean nutrition and soybean breeding. Copyright © 2014 Elsevier Ltd. All rights reserved.

  14. APOLLO: a quality assessment service for single and multiple protein models.

    Science.gov (United States)

    Wang, Zheng; Eickholt, Jesse; Cheng, Jianlin

    2011-06-15

    We built a web server named APOLLO, which can evaluate the absolute global and local qualities of a single protein model using machine learning methods or the global and local qualities of a pool of models using a pair-wise comparison approach. Based on our evaluations on 107 CASP9 (Critical Assessment of Techniques for Protein Structure Prediction) targets, the predicted quality scores generated from our machine learning and pair-wise methods have an average per-target correlation of 0.671 and 0.917, respectively, with the true model quality scores. Based on our test on 92 CASP9 targets, our predicted absolute local qualities have an average difference of 2.60 Å with the actual distances to native structure. http://sysbio.rnet.missouri.edu/apollo/. Single and pair-wise global quality assessment software is also available at the site.

  15. Biological assessment of neonicotinoids imidacloprid and its major metabolites for potentially human health using globular proteins as a model.

    Science.gov (United States)

    Ding, Fei; Peng, Wei

    2015-06-01

    The assessment of biological activities of imidacloprid and its two major metabolites, namely 6-chloronicotinic acid and 2-imidazolidone for nontarget organism, by employing essentially functional biomacromolecules, albumin and hemoglobin as a potentially model with the use of circular dichroism (CD), fluorescence, extrinsic 8-anilino-1-naphthalenesulfonic acid (ANS) fluorescence as well as molecular modeling is the theme of this work. By dint of CD spectra and synchronous fluorescence, it was clear that the orderly weak interactions between amino acid residues within globular proteins were disturbed by imidacloprid, and this event led to marginally alterations or self-regulations of protein conformation so as to lodge imidacloprid more tightly. Both steady state and time-resolved fluorescence suggested that the fluorescence of Trp residues in proteins was quenched after the presence of imidacloprid, corresponding to noncovalent protein-imidacloprid complexes formation and, the reaction belongs to moderate association (K=1.888/1.614×10(4)M(-1) for albumin/hemoglobin-imidacloprid, respectively), hydrogen bonds and π stacking performed a vital role in stabilizing the complexes, as derived from thermodynamic analysis and molecular modeling. With the aid of hydrophobic ANS experiments, subdomain IIA and α1β2 interface of albumin and hemoglobin, respectively, were found to be preserved high-affinity for imidacloprid. These results ties in with the subsequently molecular modeling laying imidacloprid in the Sudlow's site I and close to Trp-213 residue on albumin, while settling down B/Trp-37 residue nearby in hemoglobin, and these conclusions further confirmed by site-directed mutagenesis and molecular dynamics simulation. But, at the same time, several crucial noncovalent bonds came from other amino acid residues, e.g. Arg-194 and Arg-198 (albumin) and B/Arg-40, B/Asp-99 and B/Asn-102 (hemoglobin) cannot be ignored completely. Based on the comparative studies of

  16. Comparative gene expression profiling of in vitro differentiated megakaryocytes and erythroblasts identifies novel activatory and inhibitory platelet membrane proteins.

    Science.gov (United States)

    Macaulay, Iain C; Tijssen, Marloes R; Thijssen-Timmer, Daphne C; Gusnanto, Arief; Steward, Michael; Burns, Philippa; Langford, Cordelia F; Ellis, Peter D; Dudbridge, Frank; Zwaginga, Jaap-Jan; Watkins, Nicholas A; van der Schoot, C Ellen; Ouwehand, Willem H

    2007-04-15

    To identify previously unknown platelet receptors we compared the transcriptomes of in vitro differentiated megakaryocytes (MKs) and erythroblasts (EBs). RNA was obtained from purified, biologically paired MK and EB cultures and compared using cDNA microarrays. Bioinformatical analysis of MK-up-regulated genes identified 151 transcripts encoding transmembrane domain-containing proteins. Although many of these were known platelet genes, a number of previously unidentified or poorly characterized transcripts were also detected. Many of these transcripts, including G6b, G6f, LRRC32, LAT2, and the G protein-coupled receptor SUCNR1, encode proteins with structural features or functions that suggest they may be involved in the modulation of platelet function. Immunoblotting on platelets confirmed the presence of the encoded proteins, and flow cytometric analysis confirmed the expression of G6b, G6f, and LRRC32 on the surface of platelets. Through comparative analysis of expression in platelets and other blood cells we demonstrated that G6b, G6f, and LRRC32 are restricted to the platelet lineage, whereas LAT2 and SUCNR1 were also detected in other blood cells. The identification of the succinate receptor SUCNR1 in platelets is of particular interest, because physiologically relevant concentrations of succinate were shown to potentiate the effect of low doses of a variety of platelet agonists.

  17. Insulin as a model to teach three-dimensional structure of proteins

    Directory of Open Access Journals (Sweden)

    João Batista Teixeira da Rocha

    2018-02-01

    Proteins are the most ubiquitous macromolecules found in the living cells and have innumerous physiological functions. Therefore, it is fundamental to build a solid knowledge about the proteins three dimensional structure to better understand the living state. The hierarchical structure of proteins is usually studied in the undergraduate discipline of Biochemistry. Here we described pedagogical interventions designed to increase the preservice teacher chemistry students’ knowledge about protein structure. The activities were made using alternative and cheap materials to encourage the application of these simple methodologies by the future teachers in the secondary school. From the primary structure of insulin chains, students had to construct a three-dimensional structure of insulin. After the activities, the students highlighted an improvement of their previous knowledge about proteins structure. The construction of a tridimensional model together with other activities seems to be an efficient way to promote the learning about the structure of proteins to undergraduate students. The methodology used was inexpensiveness and simple and it can be used both in the university and in the high-school.

  18. Domain analyses of Usher syndrome causing Clarin-1 and GPR98 protein models.

    Science.gov (United States)

    Khan, Sehrish Haider; Javed, Muhammad Rizwan; Qasim, Muhammad; Shahzadi, Samar; Jalil, Asma; Rehman, Shahid Ur

    2014-01-01

    Usher syndrome is an autosomal recessive disorder that causes hearing loss, Retinitis Pigmentosa (RP) and vestibular dysfunction. It is clinically and genetically heterogeneous disorder which is clinically divided into three types i.e. type I, type II and type III. To date, there are about twelve loci and ten identified genes which are associated with Usher syndrome. A mutation in any of these genes e.g. CDH23, CLRN1, GPR98, MYO7A, PCDH15, USH1C, USH1G, USH2A and DFNB31 can result in Usher syndrome or non-syndromic deafness. These genes provide instructions for making proteins that play important roles in normal hearing, balance and vision. Studies have shown that protein structures of only seven genes have been determined experimentally and there are still three genes whose structures are unavailable. These genes are Clarin-1, GPR98 and Usherin. In the absence of an experimentally determined structure, homology modeling and threading often provide a useful 3D model of a protein. Therefore in the current study Clarin-1 and GPR98 proteins have been analyzed for signal peptide, domains and motifs. Clarin-1 protein was found to be without any signal peptide and consists of prokar lipoprotein domain. Clarin-1 is classified within claudin 2 super family and consists of twelve motifs. Whereas, GPR98 has a 29 amino acids long signal peptide and classified within GPCR family 2 having Concanavalin A-like lectin/glucanase superfamily. It was found to be consists of GPS and G protein receptor F2 domains and twenty nine motifs. Their 3D structures have been predicted using I-TASSER server. The model of Clarin-1 showed only α-helix but no beta sheets while model of GPR98 showed both α-helix and β sheets. The predicted structures were then evaluated and validated by MolProbity and Ramachandran plot. The evaluation of the predicted structures showed 78.9% residues of Clarin-1 and 78.9% residues of GPR98 within favored regions. The findings of present study has resulted in the

  19. Comparative proteome analysis of monolayer and spheroid culture of canine osteosarcoma cells.

    Science.gov (United States)

    Gebhard, Christiane; Miller, Ingrid; Hummel, Karin; Neschi Née Ondrovics, Martina; Schlosser, Sarah; Walter, Ingrid

    2018-04-15

    Osteosarcoma is an aggressive bone tumor with high metastasis rate in the lungs and affects both humans and dogs in a similar way. Three-dimensional tumor cell cultures mimic the in vivo situation of micro-tumors and metastases and are therefore better experimental in vitro models than the often applied two-dimensional monolayer cultures. The aim of the present study was to perform comparative proteomics of standard monolayer cultures of canine osteosarcoma cells (D17) and three-dimensional spheroid cultures, to better characterize the 3D model before starting with experiments like migration assays. Using DIGE in combination with MALDI-TOF/TOF we found 27 unique canine proteins differently represented between these two culture systems, most of them being part of a functional network including mainly chaperones, structural proteins, stress-related proteins, proteins of the glycolysis/gluconeogenesis pathway and oxidoreductases. In monolayer cells, a noticeable shift to more acidic pI values was noticed for several proteins of medium to high abundance; two proteins (protein disulfide isomerase A3, stress-induced-phosphoprotein 1) showed an increase of phosphorylated protein species. Protein distribution within the cells, as detected by immunohistochemistry, displayed a switch of stress-induced-phosphoprotein 1 from the cytoplasm (in monolayer cultures) to the nucleus (in spheroid cultures). Additionally, Western blot testing revealed upregulated concentrations of metastasin (S100A4), triosephosphate isomerase 1 and septin 2 in spheroid cultures, in contrast to decreased concentrations of CCT2, a subunit of the T-complex. Results indicate regulation of stress proteins in the process of three-dimensional organization characterized by a hypoxic and nutrient-deficient environment comparable to tumor micro-metastases. Osteosarcoma is an aggressive bone tumor that early spreads to the lungs. Three-dimensional tumor cell cultures represent the avascular stage of micro

  20. Comparison of Nitrogen Bioaccessibility from Salmon and Whey Protein Hydrolysates using a Human Gastrointestinal Model (TIM-1

    Directory of Open Access Journals (Sweden)

    Bomi Framroze

    2014-05-01

    Full Text Available Background: The TIM-1 system is a computer-controlled multi-compartmental dynamic model that closely simulates in vivo gastrointestinal tract digestion in humans. During digestion, the compounds released from meal matrix by gastric and intestinal secretions (enzymes are progressively absorbed through semipermeable membranes depending on their molecular weight. These absorbed (dialysed compounds are considered as bioaccessible, which means that they can be theoretically absorbed by the small intestine in the body. Methods: Salmon protein hydrolysate (SPH, whey protein hydrolysates extensively (WPHHigh or weakly (WPH-Low hydrolysed, non-hydrolysed whey protein isolate (WPI and mixtures of WPI:SPH (90:10, 80:20 were digested in TIM-1 using the conditions for a fast gastrointestinal transit that simulate the digestion of a liquid meal in human adults. During digestion (2 hours, samples were collected in intestinal compartments (duodenum, jejunum, and ileum and in both jejunal and ileal dialysates to determine their nitrogen content. All the products were compared in terms of kinetics of nitrogen absorption through the semipermeable membranes (bioaccessible nitrogen and nitrogen distribution throughout the intestinal compartments at the end of the 2 hour digestion. Results: After a 2 h-digestion in TIM-1, SPH was the protein substrate from which the highest amount of nitrogen (67.0% becomes available for the small intestine absorption. WPH-High had the second highest amount (56.0% of bioaccessible nitrogen while this amount decreased to 38.5–42.2% for the other protein substrates. The high nitrogen bioaccessibility of SPH is consistent with its richness in low molecular weight peptides (50% < 1000 Da. Conclusions: The results of this study indicate that SPH provides a higher proportion of bioaccessible nitrogen to a healthy adult compared to all forms of whey proteins, including extensively hydrolysed whey protein hydrolysate. The substitution of

  1. Comparative Genomics Identifies Epidermal Proteins Associated with the Evolution of the Turtle Shell.

    Science.gov (United States)

    Holthaus, Karin Brigit; Strasser, Bettina; Sipos, Wolfgang; Schmidt, Heiko A; Mlitz, Veronika; Sukseree, Supawadee; Weissenbacher, Anton; Tschachler, Erwin; Alibardi, Lorenzo; Eckhart, Leopold

    2016-03-01

    The evolution of reptiles, birds, and mammals was associated with the origin of unique integumentary structures. Studies on lizards, chicken, and humans have suggested that the evolution of major structural proteins of the outermost, cornified layers of the epidermis was driven by the diversification of a gene cluster called Epidermal Differentiation Complex (EDC). Turtles have evolved unique defense mechanisms that depend on mechanically resilient modifications of the epidermis. To investigate whether the evolution of the integument in these reptiles was associated with specific adaptations of the sequences and expression patterns of EDC-related genes, we utilized newly available genome sequences to determine the epidermal differentiation gene complement of turtles. The EDC of the western painted turtle (Chrysemys picta bellii) comprises more than 100 genes, including at least 48 genes that encode proteins referred to as beta-keratins or corneous beta-proteins. Several EDC proteins have evolved cysteine/proline contents beyond 50% of total amino acid residues. Comparative genomics suggests that distinct subfamilies of EDC genes have been expanded and partly translocated to loci outside of the EDC in turtles. Gene expression analysis in the European pond turtle (Emys orbicularis) showed that EDC genes are differentially expressed in the skin of the various body sites and that a subset of beta-keratin genes within the EDC as well as those located outside of the EDC are expressed predominantly in the shell. Our findings give strong support to the hypothesis that the evolutionary innovation of the turtle shell involved specific molecular adaptations of epidermal differentiation. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  2. Transforming between discrete and continuous angle distribution models: application to protein χ1 torsions

    International Nuclear Information System (INIS)

    Schmidt, Jürgen M.

    2012-01-01

    Two commonly employed angular-mobility models for describing amino-acid side-chain χ 1 torsion conformation, the staggered-rotamer jump and the normal probability density, are discussed and performance differences in applications to scalar-coupling data interpretation highlighted. Both models differ in their distinct statistical concepts, representing discrete and continuous angle distributions, respectively. Circular statistics, introduced for describing torsion-angle distributions by using a universal circular order parameter central to all models, suggest another distribution of the continuous class, here referred to as the elliptic model. Characteristic of the elliptic model is that order parameter and circular variance form complementary moduli. Transformations between the parameter sets that describe the probability density functions underlying the different models are provided. Numerical aspects of parameter optimization are considered. The issues are typified by using a set of χ 1 related 3 J coupling constants available for FK506-binding protein. The discrete staggered-rotamer model is found generally to produce lower order parameters, implying elevated rotatory variability in the amino-acid side chains, whereas continuous models tend to give higher order parameters that suggest comparatively less variation in angle conformations. The differences perceived regarding angular mobility are attributed to conceptually different features inherent to the models.

  3. Altered protein glycosylation predicts Alzheimer's disease and modulates its pathology in disease model Drosophila.

    Science.gov (United States)

    Frenkel-Pinter, Moran; Stempler, Shiri; Tal-Mazaki, Sharon; Losev, Yelena; Singh-Anand, Avnika; Escobar-Álvarez, Daniela; Lezmy, Jonathan; Gazit, Ehud; Ruppin, Eytan; Segal, Daniel

    2017-08-01

    The pathological hallmarks of Alzheimer's disease (AD) are pathogenic oligomers and fibrils of misfolded amyloidogenic proteins (e.g., β-amyloid and hyper-phosphorylated tau in AD), which cause progressive loss of neurons in the brain and nervous system. Although deviations from normal protein glycosylation have been documented in AD, their role in disease pathology has been barely explored. Here our analysis of available expression data sets indicates that many glycosylation-related genes are differentially expressed in brains of AD patients compared with healthy controls. The robust differences found enabled us to predict the occurrence of AD with remarkable accuracy in a test cohort and identify a set of key genes whose expression determines this classification. We then studied in vivo the effect of reducing expression of homologs of 6 of these genes in transgenic Drosophila overexpressing human tau, a well-established invertebrate AD model. These experiments have led to the identification of glycosylation genes that may augment or ameliorate tauopathy phenotypes. Our results indicate that OstDelta, l(2)not and beta4GalT7 are tauopathy suppressors, whereas pgnat5 and CG33303 are enhancers, of tauopathy. These results suggest that specific alterations in protein glycosylation may play a causal role in AD etiology. Copyright © 2017 Elsevier Inc. All rights reserved.

  4. Uncoupling of Protein Aggregation and Neurodegeneration in a Mouse Amyotrophic Lateral Sclerosis Model.

    Science.gov (United States)

    Lee, Joo-Yong; Kawaguchi, Yoshiharu; Li, Ming; Kapur, Meghan; Choi, Su Jin; Kim, Hak-June; Park, Song-Yi; Zhu, Haining; Yao, Tso-Pang

    2015-01-01

    Aberrant accumulation of protein aggregates is a pathological hallmark of many neurodegenerative diseases, including amyotrophic lateral sclerosis (ALS). Although a buildup of protein aggregates frequently leads to cell death, whether it is the key pathogenic factor in driving neurodegenerative disease remains controversial. HDAC6, a cytosolic ubiquitin-binding deacetylase, has emerged as an important regulator of ubiquitin-dependent quality control autophagy, a lysosome-dependent degradative system responsible for the disposal of misfolded protein aggregates and damaged organelles. Here, we show that in cell models HDAC6 plays a protective role against multiple disease-associated and aggregation-prone cytosolic proteins by facilitating their degradation. We further show that HDAC6 is required for efficient localization of lysosomes to protein aggregates, indicating that lysosome targeting to autophagic substrates is regulated. Supporting a critical role of HDAC6 in protein aggregate disposal in vivo, genetic ablation of HDAC6 in a transgenic SOD1G93A mouse, a model of ALS, leads to dramatic accumulation of ubiquitinated SOD1G93A protein aggregates. Surprisingly, despite a robust buildup of SOD1G93A aggregates, deletion of HDAC6 only moderately modified the motor phenotypes. These findings indicate that SOD1G93A aggregation is not the only determining factor to drive neurodegeneration in ALS, and that HDAC6 likely modulates neurodegeneration through additional mechanisms beyond protein aggregate clearance. © 2015 S. Karger AG, Basel.

  5. Determination of protein and solvent volumes in protein crystals from contrast variation data

    Energy Technology Data Exchange (ETDEWEB)

    Badger, J. [Brandeis Univ., Waltham, MA (United States)

    1994-12-31

    By varying the relative values of protein and solvent scattering densities in a crystal, it is possible to obtain information on the shape and dimensions of protein molecular envelopes. Neutron diffraction methods are ideally suited to these contrast variation experiments because H/D exchange leads to large differential changes in the protein and solvent scattering densities and is structurally non-perturbing. Low resolution structure factors have been measured from cubic insulin crystals with differing H/D contents. Structure factors calculated from a simple binary density model, in which uniform scattering densities represent the protein and solvent volumes in the crystals, were compared with these data. The contrast variation differences in the sets of measured structure factors were found to be accurately fitted by this simple model. Trial applications to two problems in crystal structure determination illustrate how this fact may be exploited. (1) A translation function that employs contrast variation data gave a sharp minimum within 1-9{Angstrom} of the correctly positioned insulin molecule and is relatively insensitive to errors in the atomic model. (2) An ab initio phasing method for the contrast variation data, based on analyzing histograms of the density distributions in trial maps, was found to recover the correct molecular envelope.

  6. Introducing Students to Protein Analysis Techniques: Separation and Comparative Analysis of Gluten Proteins in Various Wheat Strains

    Science.gov (United States)

    Pirinelli, Alyssa L.; Trinidad, Jonathan C.; Pohl, Nicola L. B.

    2016-01-01

    Polyacrylamide gel electrophoresis (PAGE) is commonly taught in undergraduate laboratory classes as a traditional method to analyze proteins. An experiment has been developed to teach these basic protein gel skills in the context of gluten protein isolation from various types of wheat flour. A further goal is to relate this technique to current…

  7. Comparative genome analysis to identify SNPs associated with high oleic acid and elevated protein content in soybean.

    Science.gov (United States)

    Kulkarni, Krishnanand P; Patil, Gunvant; Valliyodan, Babu; Vuong, Tri D; Shannon, J Grover; Nguyen, Henry T; Lee, Jeong-Dong

    2018-03-01

    The objective of this study was to determine the genetic relationship between the oleic acid and protein content. The genotypes having high oleic acid and elevated protein (HOEP) content were crossed with five elite lines having normal oleic acid and average protein (NOAP) content. The selected accessions were grown at six environments in three different locations and phenotyped for protein, oil, and fatty acid components. The mean protein content of parents, HOEP, and NOAP lines was 34.6%, 38%, and 34.9%, respectively. The oleic acid concentration of parents, HOEP, and NOAP lines was 21.7%, 80.5%, and 20.8%, respectively. The HOEP plants carried both FAD2-1A (S117N) and FAD2-1B (P137R) mutant alleles contributing to the high oleic acid phenotype. Comparative genome analysis using whole-genome resequencing data identified six genes having single nucleotide polymorphism (SNP) significantly associated with the traits analyzed. A single SNP in the putative gene Glyma.10G275800 was associated with the elevated protein content, and palmitic, oleic, and linoleic acids. The genes from the marker intervals of previously identified QTL did not carry SNPs associated with protein content and fatty acid composition in the lines used in this study, indicating that all the genes except Glyma.10G278000 may be the new genes associated with the respective traits.

  8. Comparative Membrane Proteomics Reveals a Nonannotated E. coli Heat Shock Protein.

    Science.gov (United States)

    Yuan, Peijia; D'Lima, Nadia G; Slavoff, Sarah A

    2018-01-09

    Recent advances in proteomics and genomics have enabled discovery of thousands of previously nonannotated small open reading frames (smORFs) in genomes across evolutionary space. Furthermore, quantitative mass spectrometry has recently been applied to analysis of regulated smORF expression. However, bottom-up proteomics has remained relatively insensitive to membrane proteins, suggesting they may have been underdetected in previous studies. In this report, we add biochemical membrane protein enrichment to our previously developed label-free quantitative proteomics protocol, revealing a never-before-identified heat shock protein in Escherichia coli K12. This putative smORF-encoded heat shock protein, GndA, is likely to be ∼36-55 amino acids in length and contains a predicted transmembrane helix. We validate heat shock-regulated expression of the gndA smORF and demonstrate that a GndA-GFP fusion protein cofractionates with the cell membrane. Quantitative membrane proteomics therefore has the ability to reveal nonannotated small proteins that may play roles in bacterial stress responses.

  9. Optimised 'on demand' protein arraying from DNA by cell free expression with the 'DNA to Protein Array' (DAPA) technology.

    Science.gov (United States)

    Schmidt, Ronny; Cook, Elizabeth A; Kastelic, Damjana; Taussig, Michael J; Stoevesandt, Oda

    2013-08-02

    We have previously described a protein arraying process based on cell free expression from DNA template arrays (DNA Array to Protein Array, DAPA). Here, we have investigated the influence of different array support coatings (Ni-NTA, Epoxy, 3D-Epoxy and Polyethylene glycol methacrylate (PEGMA)). Their optimal combination yields an increased amount of detected protein and an optimised spot morphology on the resulting protein array compared to the previously published protocol. The specificity of protein capture was improved using a tag-specific capture antibody on a protein repellent surface coating. The conditions for protein expression were optimised to yield the maximum amount of protein or the best detection results using specific monoclonal antibodies or a scaffold binder against the expressed targets. The optimised DAPA system was able to increase by threefold the expression of a representative model protein while conserving recognition by a specific antibody. The amount of expressed protein in DAPA was comparable to those of classically spotted protein arrays. Reaction conditions can be tailored to suit the application of interest. DAPA represents a cost effective, easy and convenient way of producing protein arrays on demand. The reported work is expected to facilitate the application of DAPA for personalized medicine and screening purposes. Copyright © 2013 Elsevier B.V. All rights reserved.

  10. Mechanical stretching of proteins-a theoretical survey of the Protein Data Bank

    International Nuclear Information System (INIS)

    Sulkowska, Joanna I; Cieplak, Marek

    2007-01-01

    The mechanical stretching of single proteins has been studied experimentally for about 50 proteins, yielding a variety of force patterns and peak forces. Here we perform a theoretical survey of proteins of known native structure and map out the landscape of possible dynamical behaviours under stretching at constant speed. We consider 7510 proteins comprising not more than 150 amino acids and 239 longer proteins. The model used is constructed based on the native geometry. It is solved by methods of molecular dynamics and validated by comparing the theoretical predictions to experimental results. We characterize the distribution of peak forces and investigate correlations with the system size and with the structure classification as characterized by the CATH scheme. Despite the presence of such correlations, proteins with the same CATH index may belong to different classes of dynamical behaviour. We identify proteins with the biggest forces and show that they belong to few topology classes. We determine which protein segments act as mechanical clamps and show that, in most cases, they correspond to long stretches of parallel β-strands, but other mechanisms are also possible. (topical review)

  11. Comparative GO: a web application for comparative gene ontology and gene ontology-based gene selection in bacteria.

    Directory of Open Access Journals (Sweden)

    Mario Fruzangohar

    Full Text Available The primary means of classifying new functions for genes and proteins relies on Gene Ontology (GO, which defines genes/proteins using a controlled vocabulary in terms of their Molecular Function, Biological Process and Cellular Component. The challenge is to present this information to researchers to compare and discover patterns in multiple datasets using visually comprehensible and user-friendly statistical reports. Importantly, while there are many GO resources available for eukaryotes, there are none suitable for simultaneous, graphical and statistical comparison between multiple datasets. In addition, none of them supports comprehensive resources for bacteria. By using Streptococcus pneumoniae as a model, we identified and collected GO resources including genes, proteins, taxonomy and GO relationships from NCBI, UniProt and GO organisations. Then, we designed database tables in PostgreSQL database server and developed a Java application to extract data from source files and loaded into database automatically. We developed a PHP web application based on Model-View-Control architecture, used a specific data structure as well as current and novel algorithms to estimate GO graphs parameters. We designed different navigation and visualization methods on the graphs and integrated these into graphical reports. This tool is particularly significant when comparing GO groups between multiple samples (including those of pathogenic bacteria from different sources simultaneously. Comparing GO protein distribution among up- or down-regulated genes from different samples can improve understanding of biological pathways, and mechanism(s of infection. It can also aid in the discovery of genes associated with specific function(s for investigation as a novel vaccine or therapeutic targets.http://turing.ersa.edu.au/BacteriaGO.

  12. Polyphenol-enriched berry extracts naturally modulate reactive proteins in model foods.

    Science.gov (United States)

    Lila, Mary Ann; Schneider, Maggie; Devlin, Amy; Plundrich, Nathalie; Laster, Scott; Foegeding, E Allen

    2017-12-13

    Healthy foods like polyphenol-rich berries and high quality edible proteins are in demand in today's functional food marketplace, but it can be difficult to formulate convenient food products with physiologically-relevant amounts of these ingredients and still maintain product quality. In part, this is because proteins can interact with other food ingredients and precipitate destabilizing events, which can disrupt food structure and diminish shelf life. Proteins in foods can also interact with human receptors to provoke adverse consequences such as allergies. When proteins and polyphenols were pre-aggregated into stable colloidal particles prior to use as ingredients, highly palatable food formulations (with reduced astringency of polyphenols) could be prepared, and the overall structural properties of food formulations were significantly improved. All of the nutritive and phytoactive benefits of the proteins and concentrated polyphenols remained highly bioavailable, but the protein molecules in the particle matrix did not self-aggregate into networks or react with other food ingredients. Both the drainage half-life (a marker of structural stability) and the yield stress (resistance to flow) of model foams made with the protein-polyphenol particles were increased in a dose-dependent manner. Of high significance in this complexation process, the reactive allergenic epitopes of certain proteins were effectively blunted by binding with polyphenols, attenuating the allergenicity of the food proteins. Porcine macrophages produced TNF-α proinflammatory cytokine when provoked with whey protein, but, this response was blocked completely when the cells were stimulated with particles that complexed whey protein with cinnamon-derived polyphenols. Cytokine and chemokine production characteristic of allergic reactions were blocked by the polyphenols, allowing for the potential creation of hypoallergenic protein-berry polyphenol enriched foods.

  13. Comparative study to develop a single method for retrieving wide class of recombinant proteins from classical inclusion bodies.

    Science.gov (United States)

    Padhiar, Arshad Ahmed; Chanda, Warren; Joseph, Thomson Patrick; Guo, Xuefang; Liu, Min; Sha, Li; Batool, Samana; Gao, Yifan; Zhang, Wei; Huang, Min; Zhong, Mintao

    2018-03-01

    The formation of inclusion bodies (IBs) is considered as an Achilles heel of heterologous protein expression in bacterial hosts. Wide array of techniques has been developed to recover biochemically challenging proteins from IBs. However, acquiring the active state even from the same protein family was found to be an independent of single established method. Here, we present a new strategy for the recovery of wide sub-classes of recombinant protein from harsh IBs. We found that numerous methods and their combinations for reducing IB formation and producing soluble proteins were not effective, if the inclusion bodies were harsh in nature. On the other hand, different practices with mild solubilization buffers were able to solubilize IBs completely, yet the recovery of active protein requires large screening of refolding buffers. With the integration of previously reported mild solubilization techniques, we proposed an improved method, which comprised low sarkosyl concentration, ranging from 0.05 to 0.1% coupled with slow freezing (- 1 °C/min) and fast thaw (room temperature), resulting in greater solubility and the integrity of solubilized protein. Dilution method was employed with single buffer to restore activity for every sub-class of recombinant protein. Results showed that the recovered protein's activity was significantly higher compared with traditional solubilization/refolding approach. Solubilization of IBs by the described method was proved milder in nature, which restored native-like conformation of proteins within IBs.

  14. Quantification of surfactant proteins in tears of patients suffering from dry eye disease compared to healthy subjects.

    Science.gov (United States)

    Posa, Andreas; Paulsen, Friedrich; Dietz, Richard; Garreis, Fabian; Sander, Ralph; Schicht, Martin; Sel, Saadettin; Scholz, Michael; Hammer, Christian M; Bräuer, Lars

    2018-03-01

    To quantify and compare the amounts of surfactant proteins SP-A, SP-B, SP-C and SP-D in the tear fluid collected from patients with dry eye syndrome and from individuals with a healthy ocular surface. Schirmer strips were used to collect tear fluid from both eyes of 241 volunteers (99 men, 142 women; age range: 18-87 years). Dry eye syndrome was diagnosed by ophthalmologists in 125 patients, whereas the healthy control group comprised 116 individuals. The total protein concentration was determined via Bradford assay. The relative concentration of surfactant proteins SP-A through -D was measured by enzyme-linked immuno-sorbent assay (ELISA). The mean relative concentrations of SP-A, SP-C and SP-D were significantly higher in the dry eye group as compared to the healthy controls (pdry eye group, but the difference to the control group was not statistically significant. The upregulation of SP-A and SP-D in the dry eye group is probably related to these proteins' known antimicrobial and immunomodulatory effects at the ocular surface. It may represent a pathophysiological response to the inflammatory condition of the ocular surface in dry eye. The upregulation of SP-B and SP-C may represent an effort of the lacrimal system to reduce surface tension and thus to counteract the increased tendency of the tear film to tear in dry eye. Copyright © 2017 Elsevier GmbH. All rights reserved.

  15. Towards predictive resistance models for agrochemicals by combining chemical and protein similarity via proteochemometric modelling.

    Science.gov (United States)

    van Westen, Gerard J P; Bender, Andreas; Overington, John P

    2014-10-01

    Resistance to pesticides is an increasing problem in agriculture. Despite practices such as phased use and cycling of 'orthogonally resistant' agents, resistance remains a major risk to national and global food security. To combat this problem, there is a need for both new approaches for pesticide design, as well as for novel chemical entities themselves. As summarized in this opinion article, a technique termed 'proteochemometric modelling' (PCM), from the field of chemoinformatics, could aid in the quantification and prediction of resistance that acts via point mutations in the target proteins of an agent. The technique combines information from both the chemical and biological domain to generate bioactivity models across large numbers of ligands as well as protein targets. PCM has previously been validated in prospective, experimental work in the medicinal chemistry area, and it draws on the growing amount of bioactivity information available in the public domain. Here, two potential applications of proteochemometric modelling to agrochemical data are described, based on previously published examples from the medicinal chemistry literature.

  16. Comparing the cardiovascular therapeutic indices of glycopyrronium and tiotropium in an integrated rat pharmacokinetic, pharmacodynamic and safety model

    International Nuclear Information System (INIS)

    Trifilieff, Alexandre; Ethell, Brian T.; Sykes, David A.; Watson, Kenny J.; Collingwood, Steve; Charlton, Steven J.; Kent, Toby C.

    2015-01-01

    Long acting inhaled muscarinic receptor antagonists, such as tiotropium, are widely used as bronchodilator therapy for chronic obstructive pulmonary disease (COPD). Although this class of compounds is generally considered to be safe and well tolerated in COPD patients the cardiovascular safety of tiotropium has recently been questioned. We describe a rat in vivo model that allows the concurrent assessment of muscarinic antagonist potency, bronchodilator efficacy and a potential for side effects, and we use this model to compare tiotropium with NVA237 (glycopyrronium bromide), a recently approved inhaled muscarinic antagonist for COPD. Anaesthetized Brown Norway rats were dosed intratracheally at 1 or 6 h prior to receiving increasing doses of intravenous methacholine. Changes in airway resistance and cardiovascular function were recorded and therapeutic indices were calculated against the ED 50 values for the inhibition of methacholine-induced bronchoconstriction. At both time points studied, greater therapeutic indices for hypotension and bradycardia were observed with glycopyrronium (19.5 and 28.5 fold at 1 h; > 200 fold at 6 h) than with tiotropium (1.5 and 4.2 fold at 1 h; 4.6 and 5.5 fold at 6 h). Pharmacokinetic, protein plasma binding and rat muscarinic receptor binding properties for both compounds were determined and used to generate an integrated model of systemic M 2 muscarinic receptor occupancy, which predicted significantly higher M 2 receptor blockade at ED 50 doses with tiotropium than with glycopyrronium. In our preclinical model there was an improved safety profile for glycopyrronium when compared with tiotropium. - Highlights: • We use an in vivo rat model to study CV safety of inhaled muscarinic antagonists. • We integrate protein and receptor binding and PK of tiotropium and glycopyrrolate. • At ED 50 doses for bronchoprotection we model systemic M 2 receptor occupancy. • Glycopyrrolate demonstrates lower M 2 occupancy at

  17. Comparing the cardiovascular therapeutic indices of glycopyrronium and tiotropium in an integrated rat pharmacokinetic, pharmacodynamic and safety model

    Energy Technology Data Exchange (ETDEWEB)

    Trifilieff, Alexandre; Ethell, Brian T. [Respiratory Disease Area, Novartis Institutes for Biomedical Research, Wimblehurst Road, Horsham, West Sussex RH12 5AB (United Kingdom); Sykes, David A. [Respiratory Disease Area, Novartis Institutes for Biomedical Research, Wimblehurst Road, Horsham, West Sussex RH12 5AB (United Kingdom); School of Life Sciences, Queen' s Medical Centre, University of Nottingham, Nottingham, NG7 2UH (United Kingdom); Watson, Kenny J.; Collingwood, Steve [Respiratory Disease Area, Novartis Institutes for Biomedical Research, Wimblehurst Road, Horsham, West Sussex RH12 5AB (United Kingdom); Charlton, Steven J. [Respiratory Disease Area, Novartis Institutes for Biomedical Research, Wimblehurst Road, Horsham, West Sussex RH12 5AB (United Kingdom); School of Life Sciences, Queen' s Medical Centre, University of Nottingham, Nottingham, NG7 2UH (United Kingdom); Kent, Toby C., E-mail: tobykent@me.com [Respiratory Disease Area, Novartis Institutes for Biomedical Research, Wimblehurst Road, Horsham, West Sussex RH12 5AB (United Kingdom)

    2015-08-15

    Long acting inhaled muscarinic receptor antagonists, such as tiotropium, are widely used as bronchodilator therapy for chronic obstructive pulmonary disease (COPD). Although this class of compounds is generally considered to be safe and well tolerated in COPD patients the cardiovascular safety of tiotropium has recently been questioned. We describe a rat in vivo model that allows the concurrent assessment of muscarinic antagonist potency, bronchodilator efficacy and a potential for side effects, and we use this model to compare tiotropium with NVA237 (glycopyrronium bromide), a recently approved inhaled muscarinic antagonist for COPD. Anaesthetized Brown Norway rats were dosed intratracheally at 1 or 6 h prior to receiving increasing doses of intravenous methacholine. Changes in airway resistance and cardiovascular function were recorded and therapeutic indices were calculated against the ED{sub 50} values for the inhibition of methacholine-induced bronchoconstriction. At both time points studied, greater therapeutic indices for hypotension and bradycardia were observed with glycopyrronium (19.5 and 28.5 fold at 1 h; > 200 fold at 6 h) than with tiotropium (1.5 and 4.2 fold at 1 h; 4.6 and 5.5 fold at 6 h). Pharmacokinetic, protein plasma binding and rat muscarinic receptor binding properties for both compounds were determined and used to generate an integrated model of systemic M{sub 2} muscarinic receptor occupancy, which predicted significantly higher M{sub 2} receptor blockade at ED{sub 50} doses with tiotropium than with glycopyrronium. In our preclinical model there was an improved safety profile for glycopyrronium when compared with tiotropium. - Highlights: • We use an in vivo rat model to study CV safety of inhaled muscarinic antagonists. • We integrate protein and receptor binding and PK of tiotropium and glycopyrrolate. • At ED{sub 50} doses for bronchoprotection we model systemic M{sub 2} receptor occupancy. • Glycopyrrolate demonstrates lower M

  18. Comparative SPR study on the effect of nanomaterials on the biological activity of adsorbed proteins

    International Nuclear Information System (INIS)

    Mei, Q.; Chen, Y.; Hong, J.; Chen, H.; Ding, X.; Yin, Y.; Koh, K.; Lee, J.

    2012-01-01

    Bioactivity of proteins is evaluated to test the adverse effects of nanoparticles interjected into biological systems. Surface plasmon resonance (SPR) spectroscopy detects binding affinity that is normally related to biological activity. Utilizing SPR spectroscopy, a concise testing matrix is established by investigating the adsorption level of bovine serum albumin (BSA) and anti-BSA on the surface covered with 11-mercaptoundecanoic acid (MUA); magnetic nanoparticles (MNPs) and single-walled carbon nanotubes (SWCNTs), respectively. The immunoactivity of BSA on MNPs and SWCNT decreased by 18 % and 5 %, respectively, compared to that on the gold film modified with MUA. This indicates that MNPs cause a considerable loss of biological activity of adsorbed protein. This effect can be utilized for practical applications on detailed biophysical research and nanotoxicity studies. (author)

  19. Methods and models used in comparative risk studies

    International Nuclear Information System (INIS)

    Devooght, J.

    1983-01-01

    Comparative risk studies make use of a large number of methods and models based upon a set of assumptions incompletely formulated or of value judgements. Owing to the multidimensionality of risks and benefits, the economic and social context may notably influence the final result. Five classes of models are briefly reviewed: accounting of fluxes of effluents, radiation and energy; transport models and health effects; systems reliability and bayesian analysis; economic analysis of reliability and cost-risk-benefit analysis; decision theory in presence of uncertainty and multiple objectives. Purpose and prospect of comparative studies are assessed in view of probable diminishing returns for large generic comparisons [fr

  20. Evaluation of plasma C-reactive protein levels in pregnant women with and without periodontal disease: A comparative study

    Directory of Open Access Journals (Sweden)

    Sharma Anupriya

    2009-01-01

    Full Text Available Background and Objectives: Circulating C-reactive protein (CRP levels are a marker of systemic inflammation and are associated with periodontal disease, a chronic bacterial infection associated with elevation of proinflammatory cytokines and prostaglandins. CRP has been associated with adverse pregnancy outcomes, including preterm delivery, preeclampsia, and intrauterine growth restriction. Furthermore, periodontal disease has been associated with increased risk of preterm low birth weight, low birth weight, and preterm birth. The present study was conducted to assess plasma CRP levels in pregnant women with and without periodontal disease; to evaluate the effect of periodontal therapy on the incidence of preterm delivery; and to compare the incidence of preterm delivery in pregnant women with and without periodontal disease. Materials and Methods: A total of 90 pregnant women aged between 18-35 years with gestational age between 12-28 weeks were recruited and divided into three equal groups (control group, study group, treatment group of 30 each. Blood samples were taken for estimation of C-reactive protein levels from all groups at 12-20 weeks of gestation, determined using ultrasensitive turbidimetric immunoassay (QUANTIA-CRP US. The treatment group comprised plaque control, scaling, and root planning and daily rinsing with 0.2% chlorhexidine mouth before 28 weeks of gestation. Results: The mean value of C-reactive protein levels in subjects with periodontal disease was higher compared to control group i.e., 1.20 ± 0.247 mg/dl and 1.22 ± 0.250 mg/dl, respectively, compared to 0.713 ± 0.139 mg/ dl ( P = 0.001. The mean value of CRP levels before treatment was greater than the mean value after treatment i.e., 1.22 ± 0.25 compared to 0.84 ± 0.189 ( P < 0.001. The incidence of preterm delivery (< 37 weeks was 31.7% in the periodontal disease group (study group compared to 8.3% in the control group ( P = 0.001. The incidence of preterm

  1. Modelling lactation curve for milk fat to protein ratio in Iranian buffaloes (Bubalus bubalis) using non-linear mixed models.

    Science.gov (United States)

    Hossein-Zadeh, Navid Ghavi

    2016-08-01

    The aim of this study was to compare seven non-linear mathematical models (Brody, Wood, Dhanoa, Sikka, Nelder, Rook and Dijkstra) to examine their efficiency in describing the lactation curves for milk fat to protein ratio (FPR) in Iranian buffaloes. Data were 43 818 test-day records for FPR from the first three lactations of Iranian buffaloes which were collected on 523 dairy herds in the period from 1996 to 2012 by the Animal Breeding Center of Iran. Each model was fitted to monthly FPR records of buffaloes using the non-linear mixed model procedure (PROC NLMIXED) in SAS and the parameters were estimated. The models were tested for goodness of fit using Akaike's information criterion (AIC), Bayesian information criterion (BIC) and log maximum likelihood (-2 Log L). The Nelder and Sikka mixed models provided the best fit of lactation curve for FPR in the first and second lactations of Iranian buffaloes, respectively. However, Wood, Dhanoa and Sikka mixed models provided the best fit of lactation curve for FPR in the third parity buffaloes. Evaluation of first, second and third lactation features showed that all models, except for Dijkstra model in the third lactation, under-predicted test time at which daily FPR was minimum. On the other hand, minimum FPR was over-predicted by all equations. Evaluation of the different models used in this study indicated that non-linear mixed models were sufficient for fitting test-day FPR records of Iranian buffaloes.

  2. Crucial roles of the protein kinases MK2 and MK3 in a mouse model of glomerulonephritis.

    Directory of Open Access Journals (Sweden)

    Adam J Guess

    Full Text Available Elevated mitogen-activated protein kinase p38 (p38 MAPK signaling has been implicated in various experimental and human glomerulopathies, and its inhibition has proven beneficial in animal models of these diseases. p38 MAPK signaling is partially mediated through MK2 and MK3, two phylogenetically related protein kinases that are its direct substrates. The current study was designed to determine the specific roles of MK2 and MK3 in a mouse model of acute proliferative glomerulonephritis, using mice with disrupted MK2 and/or MK3 genes. We found that the absence of MK3 alone worsened the disease course and increased mortality slightly compared to wild-type mice, whereas the absence of MK2 alone exhibited no significant effect. However, in an MK3-free background, the disease course depended on the presence of MK2 in a gene dosage-dependent manner, with double knock-out mice being most susceptible to disease induction. Histological and renal functional analyses confirmed kidney damage following disease induction. Because the renal stress response plays a crucial role in kidney physiology and disease, we analyzed the stress response pattern in this disease model. We found that renal cortices of diseased mice exhibited a pronounced and specific pattern of expression and/or phosphorylation of stress proteins and other indicators of the stress response (HSPB1, HSPB6, HSPB8, CHOP, eIF2α, partially in a MK2/MK3 genotype-specific manner, and without induction of a general stress response. Similarly, the expression and activation patterns of other protein kinases downstream of p38 MAPK (MNK1, MSK1 depended partially on the MK2/MK3 genotype in this disease model. In conclusion, MK2 and MK3 together play crucial roles in the regulation of the renal stress response and in the development of glomerulonephritis, which can potentially be exploited to develop novel therapeutic approaches to treat glomerular disease.

  3. The interface of protein structure, protein biophysics, and molecular evolution

    Science.gov (United States)

    Liberles, David A; Teichmann, Sarah A; Bahar, Ivet; Bastolla, Ugo; Bloom, Jesse; Bornberg-Bauer, Erich; Colwell, Lucy J; de Koning, A P Jason; Dokholyan, Nikolay V; Echave, Julian; Elofsson, Arne; Gerloff, Dietlind L; Goldstein, Richard A; Grahnen, Johan A; Holder, Mark T; Lakner, Clemens; Lartillot, Nicholas; Lovell, Simon C; Naylor, Gavin; Perica, Tina; Pollock, David D; Pupko, Tal; Regan, Lynne; Roger, Andrew; Rubinstein, Nimrod; Shakhnovich, Eugene; Sjölander, Kimmen; Sunyaev, Shamil; Teufel, Ashley I; Thorne, Jeffrey L; Thornton, Joseph W; Weinreich, Daniel M; Whelan, Simon

    2012-01-01

    Abstract The interface of protein structural biology, protein biophysics, molecular evolution, and molecular population genetics forms the foundations for a mechanistic understanding of many aspects of protein biochemistry. Current efforts in interdisciplinary protein modeling are in their infancy and the state-of-the art of such models is described. Beyond the relationship between amino acid substitution and static protein structure, protein function, and corresponding organismal fitness, other considerations are also discussed. More complex mutational processes such as insertion and deletion and domain rearrangements and even circular permutations should be evaluated. The role of intrinsically disordered proteins is still controversial, but may be increasingly important to consider. Protein geometry and protein dynamics as a deviation from static considerations of protein structure are also important. Protein expression level is known to be a major determinant of evolutionary rate and several considerations including selection at the mRNA level and the role of interaction specificity are discussed. Lastly, the relationship between modeling and needed high-throughput experimental data as well as experimental examination of protein evolution using ancestral sequence resurrection and in vitro biochemistry are presented, towards an aim of ultimately generating better models for biological inference and prediction. PMID:22528593

  4. "Hot cores" in proteins: Comparative analysis of the apolar contact area in structures from hyper/thermophilic and mesophilic organisms

    Directory of Open Access Journals (Sweden)

    Bossa Francesco

    2008-02-01

    Full Text Available Abstract Background A wide variety of stabilizing factors have been invoked so far to elucidate the structural basis of protein thermostability. These include, amongst the others, a higher number of ion-pairs interactions and hydrogen bonds, together with a better packing of hydrophobic residues. It has been frequently observed that packing of hydrophobic side chains is improved in hyperthermophilic proteins, when compared to their mesophilic counterparts. In this work, protein crystal structures from hyper/thermophilic organisms and their mesophilic homologs have been compared, in order to quantify the difference of apolar contact area and to assess the role played by the hydrophobic contacts in the stabilization of the protein core, at high temperatures. Results The construction of two datasets was carried out so as to satisfy several restrictive criteria, such as minimum redundancy, resolution and R-value thresholds and lack of any structural defect in the collected structures. This approach allowed to quantify with relatively high precision the apolar contact area between interacting residues, reducing the uncertainty due to the position of atoms in the crystal structures, the redundancy of data and the size of the dataset. To identify the common core regions of these proteins, the study was focused on segments that conserve a similar main chain conformation in the structures analyzed, excluding the intervening regions whose structure differs markedly. The results indicated that hyperthermophilic proteins underwent a significant increase of the hydrophobic contact area contributed by those residues composing the alpha-helices of the structurally conserved regions. Conclusion This study indicates the decreased flexibility of alpha-helices in proteins core as a major factor contributing to the enhanced termostability of a number of hyperthermophilic proteins. This effect, in turn, may be due to an increased number of buried methyl groups in

  5. Image based 3D city modeling : Comparative study

    Directory of Open Access Journals (Sweden)

    S. P. Singh

    2014-06-01

    Full Text Available 3D city model is a digital representation of the Earth’s surface and it’s related objects such as building, tree, vegetation, and some manmade feature belonging to urban area. The demand of 3D city modeling is increasing rapidly for various engineering and non-engineering applications. Generally four main image based approaches were used for virtual 3D city models generation. In first approach, researchers were used Sketch based modeling, second method is Procedural grammar based modeling, third approach is Close range photogrammetry based modeling and fourth approach is mainly based on Computer Vision techniques. SketchUp, CityEngine, Photomodeler and Agisoft Photoscan are the main softwares to represent these approaches respectively. These softwares have different approaches & methods suitable for image based 3D city modeling. Literature study shows that till date, there is no complete such type of comparative study available to create complete 3D city model by using images. This paper gives a comparative assessment of these four image based 3D modeling approaches. This comparative study is mainly based on data acquisition methods, data processing techniques and output 3D model products. For this research work, study area is the campus of civil engineering department, Indian Institute of Technology, Roorkee (India. This 3D campus acts as a prototype for city. This study also explains various governing parameters, factors and work experiences. This research work also gives a brief introduction, strengths and weakness of these four image based techniques. Some personal comment is also given as what can do or what can’t do from these softwares. At the last, this study shows; it concluded that, each and every software has some advantages and limitations. Choice of software depends on user requirements of 3D project. For normal visualization project, SketchUp software is a good option. For 3D documentation record, Photomodeler gives good

  6. Computational Modeling of Proteins based on Cellular Automata: A Method of HP Folding Approximation.

    Science.gov (United States)

    Madain, Alia; Abu Dalhoum, Abdel Latif; Sleit, Azzam

    2018-06-01

    The design of a protein folding approximation algorithm is not straightforward even when a simplified model is used. The folding problem is a combinatorial problem, where approximation and heuristic algorithms are usually used to find near optimal folds of proteins primary structures. Approximation algorithms provide guarantees on the distance to the optimal solution. The folding approximation approach proposed here depends on two-dimensional cellular automata to fold proteins presented in a well-studied simplified model called the hydrophobic-hydrophilic model. Cellular automata are discrete computational models that rely on local rules to produce some overall global behavior. One-third and one-fourth approximation algorithms choose a subset of the hydrophobic amino acids to form H-H contacts. Those algorithms start with finding a point to fold the protein sequence into two sides where one side ignores H's at even positions and the other side ignores H's at odd positions. In addition, blocks or groups of amino acids fold the same way according to a predefined normal form. We intend to improve approximation algorithms by considering all hydrophobic amino acids and folding based on the local neighborhood instead of using normal forms. The CA does not assume a fixed folding point. The proposed approach guarantees one half approximation minus the H-H endpoints. This lower bound guaranteed applies to short sequences only. This is proved as the core and the folds of the protein will have two identical sides for all short sequences.

  7. Constraint Logic Programming approach to protein structure prediction

    Directory of Open Access Journals (Sweden)

    Fogolari Federico

    2004-11-01

    Full Text Available Abstract Background The protein structure prediction problem is one of the most challenging problems in biological sciences. Many approaches have been proposed using database information and/or simplified protein models. The protein structure prediction problem can be cast in the form of an optimization problem. Notwithstanding its importance, the problem has very seldom been tackled by Constraint Logic Programming, a declarative programming paradigm suitable for solving combinatorial optimization problems. Results Constraint Logic Programming techniques have been applied to the protein structure prediction problem on the face-centered cube lattice model. Molecular dynamics techniques, endowed with the notion of constraint, have been also exploited. Even using a very simplified model, Constraint Logic Programming on the face-centered cube lattice model allowed us to obtain acceptable results for a few small proteins. As a test implementation their (known secondary structure and the presence of disulfide bridges are used as constraints. Simplified structures obtained in this way have been converted to all atom models with plausible structure. Results have been compared with a similar approach using a well-established technique as molecular dynamics. Conclusions The results obtained on small proteins show that Constraint Logic Programming techniques can be employed for studying protein simplified models, which can be converted into realistic all atom models. The advantage of Constraint Logic Programming over other, much more explored, methodologies, resides in the rapid software prototyping, in the easy way of encoding heuristics, and in exploiting all the advances made in this research area, e.g. in constraint propagation and its use for pruning the huge search space.

  8. Constraint Logic Programming approach to protein structure prediction.

    Science.gov (United States)

    Dal Palù, Alessandro; Dovier, Agostino; Fogolari, Federico

    2004-11-30

    The protein structure prediction problem is one of the most challenging problems in biological sciences. Many approaches have been proposed using database information and/or simplified protein models. The protein structure prediction problem can be cast in the form of an optimization problem. Notwithstanding its importance, the problem has very seldom been tackled by Constraint Logic Programming, a declarative programming paradigm suitable for solving combinatorial optimization problems. Constraint Logic Programming techniques have been applied to the protein structure prediction problem on the face-centered cube lattice model. Molecular dynamics techniques, endowed with the notion of constraint, have been also exploited. Even using a very simplified model, Constraint Logic Programming on the face-centered cube lattice model allowed us to obtain acceptable results for a few small proteins. As a test implementation their (known) secondary structure and the presence of disulfide bridges are used as constraints. Simplified structures obtained in this way have been converted to all atom models with plausible structure. Results have been compared with a similar approach using a well-established technique as molecular dynamics. The results obtained on small proteins show that Constraint Logic Programming techniques can be employed for studying protein simplified models, which can be converted into realistic all atom models. The advantage of Constraint Logic Programming over other, much more explored, methodologies, resides in the rapid software prototyping, in the easy way of encoding heuristics, and in exploiting all the advances made in this research area, e.g. in constraint propagation and its use for pruning the huge search space.

  9. Evaluation of molecular brain changes associated with environmental stress in rodent models compared to human major depressive disorder: A proteomic systems approach.

    Science.gov (United States)

    Cox, David Alan; Gottschalk, Michael Gerd; Stelzhammer, Viktoria; Wesseling, Hendrik; Cooper, Jason David; Bahn, Sabine

    2016-11-25

    Rodent models of major depressive disorder (MDD) are indispensable when screening for novel treatments, but assessing their translational relevance with human brain pathology has proved difficult. Using a novel systems approach, proteomics data obtained from post-mortem MDD anterior prefrontal cortex tissue (n = 12) and matched controls (n = 23) were compared with equivalent data from three commonly used preclinical models exposed to environmental stressors (chronic mild stress, prenatal stress and social defeat). Functional pathophysiological features associated with depression-like behaviour were identified in these models through enrichment of protein-protein interaction networks. A cross-species comparison evaluated which model(s) represent human MDD pathology most closely. Seven functional domains associated with MDD and represented across at least two models such as "carbohydrate metabolism and cellular respiration" were identified. Through statistical evaluation using kernel-based machine learning techniques, the social defeat model was found to represent MDD brain changes most closely for four of the seven domains. This is the first study to apply a method for directly evaluating the relevance of the molecular pathology of multiple animal models to human MDD on the functional level. The methodology and findings outlined here could help to overcome translational obstacles of preclinical psychiatric research.

  10. Digestibility of Quinoa (Chenopodium quinoa Willd.) Protein Concentrate and Its Potential to Inhibit Lipid Peroxidation in the Zebrafish Larvae Model.

    Science.gov (United States)

    Vilcacundo, R; Barrio, D; Carpio, C; García-Ruiz, A; Rúales, J; Hernández-Ledesma, B; Carrillo, W

    2017-09-01

    Quinoa protein concentrate (QPC) was extracted and digested under in vitro gastrointestinal conditions. The protein content of QPC was in the range between 52.40 and 65.01% depending on the assay used. Quinoa proteins were almost completely hydrolyzed by pepsin at pH of 1.2, 2.0, and 3.2. At high pH, only partial hydrolysis was observed. During the duodenal phase, no intact proteins were visible, indicating their susceptibility to the in vitro simulated digestive conditions. Zebrafish larvae model was used to evaluate the in vivo ability of gastrointestinal digests to inhibit lipid peroxidation. Gastric digestion at pH 1.2 showed the highest lipid peroxidation inhibition percentage (75.15%). The lipid peroxidation activity increased after the duodenal phase. The digest obtained at the end of the digestive process showed an inhibition percentage of 82.10%, comparable to that showed when using BHT as positive control (87.13%).

  11. Identification of Novel STAT6-Regulated Proteins in Mouse B Cells by Comparative Transcriptome and Proteome Analysis.

    Science.gov (United States)

    Mokada-Gopal, Lavanya; Boeser, Alexander; Lehmann, Christian H K; Drepper, Friedel; Dudziak, Diana; Warscheid, Bettina; Voehringer, David

    2017-05-01

    The transcription factor STAT6 plays a key role in mediating signaling downstream of the receptors for IL-4 and IL-13. In B cells, STAT6 is required for class switch recombination to IgE and for germinal center formation during type 2 immune responses directed against allergens or helminths. In this study, we compared the transcriptomes and proteomes of primary mouse B cells from wild-type and STAT6-deficient mice cultured for 4 d in the presence or absence of IL-4. Microarray analysis revealed that 214 mRNAs were upregulated and 149 were downregulated >3-fold by IL-4 in a STAT6-dependent manner. Across all samples, ∼5000 proteins were identified by label-free quantitative liquid chromatography/mass spectrometry. A total of 149 proteins was found to be differentially expressed >3-fold between IL-4-stimulated wild-type and STAT6 -/- B cells (75 upregulated and 74 downregulated). Comparative analysis of the proteome and transcriptome revealed that expression of these proteins was mainly regulated at the transcriptional level, which argues against a major role for posttranscriptional mechanisms that modulate the STAT6-dependent proteome. Nine proteins were selected for confirmation by flow cytometry or Western blot. We show that CD30, CD79b, SLP-76, DEC205, IL-5Rα, STAT5, and Thy1 are induced by IL-4 in a STAT6-dependent manner. In contrast, Syk and Fc receptor-like 1 were downregulated. This dataset provides a framework for further functional analysis of newly identified IL-4-regulated proteins in B cells that may contribute to germinal center formation and IgE switching in type 2 immunity. Copyright © 2017 by The American Association of Immunologists, Inc.

  12. Comparative proteomic analysis reveals proteins putatively involved in toxin biosynthesis in the marine dinoflagellate Alexandrium catenella.

    Science.gov (United States)

    Wang, Da-Zhi; Gao, Yue; Lin, Lin; Hong, Hua-Sheng

    2013-01-22

    Alexandrium is a neurotoxin-producing dinoflagellate genus resulting in paralytic shellfish poisonings around the world. However, little is known about the toxin biosynthesis mechanism in Alexandrium. This study compared protein profiles of A. catenella collected at different toxin biosynthesis stages (non-toxin synthesis, initial toxin synthesis and toxin synthesizing) coupled with the cell cycle, and identified differentially expressed proteins using 2-DE and MALDI-TOF-TOF mass spectrometry. The results showed that toxin biosynthesis of A. catenella occurred within a defined time frame in the G1 phase of the cell cycle. Proteomic analysis indicated that 102 protein spots altered significantly in abundance (P translation. Among them, nine proteins with known functions in paralytic shellfish toxin-producing cyanobacteria, i.e., methionine S-adenosyltransferase, chloroplast ferredoxin-NADP+ reductase, S-adenosylhomocysteinase, adenosylhomocysteinase, ornithine carbamoyltransferase, inorganic pyrophosphatase, sulfotransferase (similar to), alcohol dehydrogenase and arginine deiminase, varied significantly at different toxin biosynthesis stages and formed an interaction network, indicating that they might be involved in toxin biosynthesis in A. catenella. This study is the first step in the dissection of the behavior of the A. catenella proteome during different toxin biosynthesis stages and provides new insights into toxin biosynthesis in dinoflagellates.

  13. Citrate synthase proteins in extremophilic organisms: Studies within a structure-based model

    International Nuclear Information System (INIS)

    Różycki, Bartosz; Cieplak, Marek

    2014-01-01

    We study four citrate synthase homodimeric proteins within a structure-based coarse-grained model. Two of these proteins come from thermophilic bacteria, one from a cryophilic bacterium and one from a mesophilic organism; three are in the closed and two in the open conformations. Even though the proteins belong to the same fold, the model distinguishes the properties of these proteins in a way which is consistent with experiments. For instance, the thermophilic proteins are more stable thermodynamically than their mesophilic and cryophilic homologues, which we observe both in the magnitude of thermal fluctuations near the native state and in the kinetics of thermal unfolding. The level of stability correlates with the average coordination number for amino acid contacts and with the degree of structural compactness. The pattern of positional fluctuations along the sequence in the closed conformation is different than in the open conformation, including within the active site. The modes of correlated and anticorrelated movements of pairs of amino acids forming the active site are very different in the open and closed conformations. Taken together, our results show that the precise location of amino acid contacts in the native structure appears to be a critical element in explaining the similarities and differences in the thermodynamic properties, local flexibility, and collective motions of the different forms of the enzyme

  14. [Drosophila melanogaster as a model for studying the function of animal viral proteins].

    Science.gov (United States)

    Omelianchuk, L V; Iudina, O S

    2011-07-01

    Studies in which Drosophila melanogaster individuals carrying transgenes of animal viruses were used to analyze the action of animal viral proteins on the cell are reviewed. The data presented suggest that host specificity of viruses is determined by their proteins responsible for the penetration of the virus into the cell, while viral proteins responsible for interactions with the host cell are much less host-specific. Due to this, the model of Drosophila with its developed system of searching for genetic interactions can be used to find intracellular targets for the action of viral proteins of the second group.

  15. Kinetic analysis and mathematical modeling of growth parameters of Lactobacillus plantarum in protein-rich isolates from tomato seed.

    Science.gov (United States)

    Mechmeche, Manel; Kachouri, Faten; Yaghlane, Hana B; Ksontini, Hamida; Setti, Khaoula; Hamdi, Moktar

    2017-03-01

    The aim of the present study was to evaluate the applicability of using protein-rich isolates from tomato seed as a sole source of nutrition for the growth of lactic acid bacteria. Unstructured mathematical and logistic models were proposed to describe growth, pH drop, lactic acid production and nutriment consumption by Lactobacillus plantarum in whole and defatted isolates in order to compare their suitability for the production of a fermented beverage. These media have considerable good quantities of nutriment that allowed the growth of L. plantarum, after which the cell numbers begin to decline. The maximum biomass was observed in defatted isolate (1.42 g L -1 ) followed by the whole isolate (1.24 g L -1 ). The lactic acid increased by about 5.5 and 6.5 times respectively in whole and defatted protein isolates. However, significant nutriment consumption occurred during the growth phase as well as stationary phase. A reduction of 61.90% and 95.88% in sugar content, as well as 21.91% and 16.93% reduction in protein content were observed respectively in whole and defatted isolates. In most cases, the proposed models adequately describe the biochemical changes taking place during fermentation and are a promising approach for the formulation of tomato seed-based functional foods.

  16. A multi-directional rapidly exploring random graph (mRRG) for protein folding

    KAUST Repository

    Nath, Shuvra Kanti; Thomas, Shawna; Ekenna, Chinwe; Amato, Nancy M.

    2012-01-01

    Modeling large-scale protein motions, such as those involved in folding and binding interactions, is crucial to better understanding not only how proteins move and interact with other molecules but also how proteins misfold, thus causing many devastating diseases. Robotic motion planning algorithms, such as Rapidly Exploring Random Trees (RRTs), have been successful in simulating protein folding pathways. Here, we propose a new multi-directional Rapidly Exploring Random Graph (mRRG) specifically tailored for proteins. Unlike traditional RRGs which only expand a parent conformation in a single direction, our strategy expands the parent conformation in multiple directions to generate new samples. Resulting samples are connected to the parent conformation and its nearest neighbors. By leveraging multiple directions, mRRG can model the protein motion landscape with reduced computational time compared to several other robotics-based methods for small to moderate-sized proteins. Our results on several proteins agree with experimental hydrogen out-exchange, pulse-labeling, and F-value analysis. We also show that mRRG covers the conformation space better as compared to the other computation methods. Copyright © 2012 ACM.

  17. Improving N-terminal protein annotation of Plasmodium species based on signal peptide prediction of orthologous proteins

    Directory of Open Access Journals (Sweden)

    Neto Armando

    2012-11-01

    Full Text Available Abstract Background Signal peptide is one of the most important motifs involved in protein trafficking and it ultimately influences protein function. Considering the expected functional conservation among orthologs it was hypothesized that divergence in signal peptides within orthologous groups is mainly due to N-terminal protein sequence misannotation. Thus, discrepancies in signal peptide prediction of orthologous proteins were used to identify misannotated proteins in five Plasmodium species. Methods Signal peptide (SignalP and orthology (OrthoMCL were combined in an innovative strategy to identify orthologous groups showing discrepancies in signal peptide prediction among their protein members (Mixed groups. In a comparative analysis, multiple alignments for each of these groups and gene models were visually inspected in search of misannotated proteins and, whenever possible, alternative gene models were proposed. Thresholds for signal peptide prediction parameters were also modified to reduce their impact as a possible source of discrepancy among orthologs. Validation of new gene models was based on RT-PCR (few examples or on experimental evidence already published (ApiLoc. Results The rate of misannotated proteins was significantly higher in Mixed groups than in Positive or Negative groups, corroborating the proposed hypothesis. A total of 478 proteins were reannotated and change of signal peptide prediction from negative to positive was the most common. Reannotations triggered the conversion of almost 50% of all Mixed groups, which were further reduced by optimization of signal peptide prediction parameters. Conclusions The methodological novelty proposed here combining orthology and signal peptide prediction proved to be an effective strategy for the identification of proteins showing wrongly N-terminal annotated sequences, and it might have an important impact in the available data for genome-wide searching of potential vaccine and drug

  18. Camel milk protein hydrolysates with improved technofunctional properties and enhanced antioxidant potential in in vitro and in food model systems.

    Science.gov (United States)

    Al-Shamsi, Kholoud Awad; Mudgil, Priti; Hassan, Hassan Mohamed; Maqsood, Sajid

    2018-01-01

    Camel milk protein hydrolysates (CMPH) were generated using proteolytic enzymes, such as alcalase, bromelain, and papain, to explore the effect on the technofunctional properties and antioxidant potential under in vitro and in real food model systems. Characterization of the CMPH via degree of hydrolysis, sodium dodecyl sulfate-PAGE, and HPLC revealed that different proteins in camel milk underwent degradation at different degrees after enzymatic hydrolysis using 3 different enzymes for 2, 4, and 6 h, with papain displaying the highest degradation. Technofunctional properties, such as emulsifying activity index, surface hydrophobicity, and protein solubility, were higher in CMPH than unhydrolyzed camel milk proteins. However, the water and fat absorption capacity were lower in CMPH compared with unhydrolyzed camel milk proteins. Antioxidant properties as assessed by 2,2-azinobis(3-ethylbenzthiazoline-6-sulfonic acid) and 2,2-diphenyl-1-picrylhydrazyl radical scavenging activities and metal-chelating activity were enhanced after hydrolysis, in contrast to ferric-reducing antioxidant power which showed a decrease after hydrolysis. The CMPH were also tested in real food model systems for their potential to inhibit lipid peroxidation in fish mince and grape seed oil-in-water emulsion, and we found that papain-produced hydrolysate displayed higher inhibition than alcalase- and bromelain-produced hydrolysates. Therefore, the CMPH demonstrated effective antioxidant potential in vitro as well as in real food systems and showed enhanced functional properties, which guarantees their potential applications in functional foods. The present study is one of few reports available on CMPH being explored in vitro as well as in real food model systems. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  19. Comparing linear probability model coefficients across groups

    DEFF Research Database (Denmark)

    Holm, Anders; Ejrnæs, Mette; Karlson, Kristian Bernt

    2015-01-01

    of the following three components: outcome truncation, scale parameters and distributional shape of the predictor variable. These results point to limitations in using linear probability model coefficients for group comparisons. We also provide Monte Carlo simulations and real examples to illustrate......This article offers a formal identification analysis of the problem in comparing coefficients from linear probability models between groups. We show that differences in coefficients from these models can result not only from genuine differences in effects, but also from differences in one or more...... these limitations, and we suggest a restricted approach to using linear probability model coefficients in group comparisons....

  20. Comparative proteomics analysis of chronic atrophic gastritis: changes of protein expression in chronic atrophic gastritis with out Helicobacter pylori infection

    International Nuclear Information System (INIS)

    Zhang, Lin; Hou, Yanhong; Wu, Kai; Li, Dan

    2012-01-01

    Chronic atrophic gastritis (CAG) is a very common gastritis and one of the major precursor lesions of gastric cancer, one of the most common cancers worldwide. The molecular mechanism underlying CAG is unclear, but its elucidation is essential for the prevention and early detection of gastric cancer and appropriate intervention. A combination of two-dimensional gel electrophoresis and mass spectrometry was used in the present study to analyze the differentially expressed proteins. Samples from 21 patients (9 females and 12 males; mean age: 61.8 years) were used. We identified 18 differentially expressed proteins in CAG compared with matched normal mucosa. Eight proteins were up-regulated and 10 down-regulated in CAG when compared with the same amounts of proteins in individually matched normal gastric mucosa. Two novel proteins, proteasome activator subunit 1 (PSME1), which was down-regulated in CAG, and ribosomal protein S12 (RPS12), which was up-regulated in CAG, were further investigated. Their expression was validated by Western blot and RT-PCR in 15 CAG samples matched with normal mucosa. The expression level of RPS12 was significantly higher in CAG than in matched normal gastric mucosa (P < 0.05). In contrast, the expression level of PSME1 in CAG was significantly lower than in matched normal gastric mucosa (P < 0.05). This study clearly demonstrated that there are some changes in protein expression between CAG and normal mucosa. In these changes, down-regulation of PSME1 and up-regulation of RPS12 could be involved in the development of CAG. Thus, the differentially expressed proteins might play important roles in CAG as functional molecules

  1. Comparative proteomics analysis of chronic atrophic gastritis: changes of protein expression in chronic atrophic gastritis with out Helicobacter pylori infection

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, Lin; Hou, Yanhong; Wu, Kai; Li, Dan [Department of Gastroenterology and Hepatology, The 309 Hospital of People' s Liberation Army, Beijing (China)

    2012-03-02

    Chronic atrophic gastritis (CAG) is a very common gastritis and one of the major precursor lesions of gastric cancer, one of the most common cancers worldwide. The molecular mechanism underlying CAG is unclear, but its elucidation is essential for the prevention and early detection of gastric cancer and appropriate intervention. A combination of two-dimensional gel electrophoresis and mass spectrometry was used in the present study to analyze the differentially expressed proteins. Samples from 21 patients (9 females and 12 males; mean age: 61.8 years) were used. We identified 18 differentially expressed proteins in CAG compared with matched normal mucosa. Eight proteins were up-regulated and 10 down-regulated in CAG when compared with the same amounts of proteins in individually matched normal gastric mucosa. Two novel proteins, proteasome activator subunit 1 (PSME1), which was down-regulated in CAG, and ribosomal protein S12 (RPS12), which was up-regulated in CAG, were further investigated. Their expression was validated by Western blot and RT-PCR in 15 CAG samples matched with normal mucosa. The expression level of RPS12 was significantly higher in CAG than in matched normal gastric mucosa (P < 0.05). In contrast, the expression level of PSME1 in CAG was significantly lower than in matched normal gastric mucosa (P < 0.05). This study clearly demonstrated that there are some changes in protein expression between CAG and normal mucosa. In these changes, down-regulation of PSME1 and up-regulation of RPS12 could be involved in the development of CAG. Thus, the differentially expressed proteins might play important roles in CAG as functional molecules.

  2. Activity of the kinesin spindle protein inhibitor ispinesib (SB-715992) in models of breast cancer

    Energy Technology Data Exchange (ETDEWEB)

    Purcell, James W; Davis, Jefferson; Reddy, Mamatha; Martin, Shamra; Samayoa, Kimberly; Vo, Hung; Thomsen, Karen; Bean, Peter; Kuo, Wen Lin; Ziyad, Safiyyah; Billig, Jessica; Feiler, Heidi S; Gray, Joe W; Wood, Kenneth W; Cases, Sylvaine

    2009-06-10

    Ispinesib (SB-715992) is a potent inhibitor of kinesin spindle protein (KSP), a kinesin motor protein essential for the formation of a bipolar mitotic spindle and cell cycle progression through mitosis. Clinical studies of ispinesib have demonstrated a 9% response rate in patients with locally advanced or metastatic breast cancer, and a favorable safety profile without significant neurotoxicities, gastrointestinal toxicities or hair loss. To better understand the potential of ispinesib in the treatment of breast cancer we explored the activity of ispinesib alone and in combination several therapies approved for the treatment of breast cancer. We measured the ispinesib sensitivity and pharmacodynamic response of breast cancer cell lines representative of various subtypes in vitro and as xenografts in vivo, and tested the ability of ispinesib to enhance the anti-tumor activity of approved therapies. In vitro, ispinesib displayed broad anti-proliferative activity against a panel of 53 breast cell-lines. In vivo, ispinesib produced regressions in each of five breast cancer models, and tumor free survivors in three of these models. The effects of ispinesib treatment on pharmacodynamic markers of mitosis and apoptosis were examined in vitro and in vivo, revealing a greater increase in both mitotic and apoptotic markers in the MDA-MB-468 model than in the less sensitive BT-474 model. In vivo, ispinesib enhanced the anti-tumor activity of trastuzumab, lapatinib, doxorubicin, and capecitabine, and exhibited activity comparable to paclitaxel and ixabepilone. These findings support further clinical exploration of KSP inhibitors for the treatment of breast cancer.

  3. Comparative sequence analysis of acid sensitive/resistance proteins in Escherichia coli and Shigella flexneri

    Science.gov (United States)

    Manikandan, Selvaraj; Balaji, Seetharaaman; Kumar, Anil; Kumar, Rita

    2007-01-01

    The molecular basis for the survival of bacteria under extreme conditions in which growth is inhibited is a question of great current interest. A preliminary study was carried out to determine residue pattern conservation among the antiporters of enteric bacteria, responsible for extreme acid sensitivity especially in Escherichia coli and Shigella flexneri. Here we found the molecular evidence that proved the relationship between E. coli and S. flexneri. Multiple sequence alignment of the gadC coded acid sensitive antiporter showed many conserved residue patterns at regular intervals at the N-terminal region. It was observed that as the alignment approaches towards the C-terminal, the number of conserved residues decreases, indicating that the N-terminal region of this protein has much active role when compared to the carboxyl terminal. The motif, FHLVFFLLLGG, is well conserved within the entire gadC coded protein at the amino terminal. The motif is also partially conserved among other antiporters (which are not coded by gadC) but involved in acid sensitive/resistance mechanism. Phylogenetic cluster analysis proves the relationship of Escherichia coli and Shigella flexneri. The gadC coded proteins are converged as a clade and diverged from other antiporters belongs to the amino acid-polyamine-organocation (APC) superfamily. PMID:21670792

  4. A Finite Element Solution of Lateral Periodic Poisson–Boltzmann Model for Membrane Channel Proteins

    Science.gov (United States)

    Xu, Jingjie; Lu, Benzhuo

    2018-01-01

    Membrane channel proteins control the diffusion of ions across biological membranes. They are closely related to the processes of various organizational mechanisms, such as: cardiac impulse, muscle contraction and hormone secretion. Introducing a membrane region into implicit solvation models extends the ability of the Poisson–Boltzmann (PB) equation to handle membrane proteins. The use of lateral periodic boundary conditions can properly simulate the discrete distribution of membrane proteins on the membrane plane and avoid boundary effects, which are caused by the finite box size in the traditional PB calculations. In this work, we: (1) develop a first finite element solver (FEPB) to solve the PB equation with a two-dimensional periodicity for membrane channel proteins, with different numerical treatments of the singular charges distributions in the channel protein; (2) add the membrane as a dielectric slab in the PB model, and use an improved mesh construction method to automatically identify the membrane channel/pore region even with a tilt angle relative to the z-axis; and (3) add a non-polar solvation energy term to complete the estimation of the total solvation energy of a membrane protein. A mesh resolution of about 0.25 Å (cubic grid space)/0.36 Å (tetrahedron edge length) is found to be most accurate in linear finite element calculation of the PB solvation energy. Computational studies are performed on a few exemplary molecules. The results indicate that all factors, the membrane thickness, the length of periodic box, membrane dielectric constant, pore region dielectric constant, and ionic strength, have individually considerable influence on the solvation energy of a channel protein. This demonstrates the necessity to treat all of those effects in the PB model for membrane protein simulations. PMID:29495644

  5. Alignment of non-covalent interactions at protein-protein interfaces.

    Directory of Open Access Journals (Sweden)

    Hongbo Zhu

    Full Text Available BACKGROUND: The study and comparison of protein-protein interfaces is essential for the understanding of the mechanisms of interaction between proteins. While there are many methods for comparing protein structures and protein binding sites, so far no methods have been reported for comparing the geometry of non-covalent interactions occurring at protein-protein interfaces. METHODOLOGY/PRINCIPAL FINDINGS: Here we present a method for aligning non-covalent interactions between different protein-protein interfaces. The method aligns the vector representations of van der Waals interactions and hydrogen bonds based on their geometry. The method has been applied to a dataset which comprises a variety of protein-protein interfaces. The alignments are consistent to a large extent with the results obtained using two other complementary approaches. In addition, we apply the method to three examples of protein mimicry. The method successfully aligns respective interfaces and allows for recognizing conserved interface regions. CONCLUSIONS/SIGNIFICANCE: The Galinter method has been validated in the comparison of interfaces in which homologous subunits are involved, including cases of mimicry. The method is also applicable to comparing interfaces involving non-peptidic compounds. Galinter assists users in identifying local interface regions with similar patterns of non-covalent interactions. This is particularly relevant to the investigation of the molecular basis of interaction mimicry.

  6. Application of TZERO calibrated modulated temperature differential scanning calorimetry to characterize model protein formulations.

    Science.gov (United States)

    Badkar, Aniket; Yohannes, Paulos; Banga, Ajay

    2006-02-17

    The objective of this study was to evaluate the feasibility of using T(ZERO) modulated temperature differential scanning calorimetry (MDSC) as a novel technique to characterize protein solutions using lysozyme as a model protein and IgG as a model monoclonal antibody. MDSC involves the application of modulated heating program, along with the standard heating program that enables the separation of overlapping thermal transitions. Although characterization of unfolding transitions for protein solutions requires the application of high sensitive DSC, separation of overlapping transitions like aggregation and other exothermic events may be possible only by use of MDSC. A newer T(ZERO) calibrated MDSC model from TA instruments that has improved sensitivity than previous models was used. MDSC analysis showed total, reversing and non-reversing heat flow signals. Total heat flow signals showed a combination of melting endotherms and overlapping exothermic events. Under the operating conditions used, the melting endotherms were seen in reversing heat flow signal while the exothermic events were seen in non-reversing heat flow signal. This enabled the separation of overlapping thermal transitions, improved data analysis and decreased baseline noise. MDSC was used here for characterization of lysozyme solutions, but its feasibility for characterizing therapeutic protein solutions needs further assessment.

  7. A network model to correlate conformational change and the impedance spectrum of single proteins

    Energy Technology Data Exchange (ETDEWEB)

    Alfinito, Eleonora; Pennetta, Cecilia; Reggiani, Lino [Dipartimento di Ingegneria dell' Innovazione, Universita del Salento, Via Arnesano, Lecce (Italy); Consorzio Nazionale Interuniversitario per le Scienze Fisiche della Materia (CNISM) (Italy)

    2008-02-13

    Integrated nanodevices based on proteins or biomolecules are attracting increasing interest in today's research. In fact, it has been shown that proteins such as azurin and bacteriorhodopsin manifest some electrical properties that are promising for the development of active components of molecular electronic devices. Here we focus on two relevant kinds of protein: bovine rhodopsin, prototype of G-protein-coupled-receptor (GPCR) proteins, and the enzyme acetylcholinesterase (AChE), whose inhibition is one of the most qualified treatments of Alzheimer's disease. Both these proteins exert their function starting with a conformational change of their native structure. Our guess is that such a change should be accompanied with a detectable variation of their electrical properties. To investigate this conjecture, we present an impedance network model of proteins, able to estimate the different impedance spectra associated with the different configurations. The distinct types of conformational change of rhodopsin and AChE agree with their dissimilar electrical responses. In particular, for rhodopsin the model predicts variations of the impedance spectra up to about 30%, while for AChE the same variations are limited to about 10%, which supports the existence of a dynamical equilibrium between its native and complexed states.

  8. Genome-wide investigation and expression analyses of WD40 protein family in the model plant foxtail millet (Setaria italica L..

    Directory of Open Access Journals (Sweden)

    Awdhesh Kumar Mishra

    Full Text Available WD40 proteins play a crucial role in diverse protein-protein interactions by acting as scaffolding molecules and thus assisting in the proper activity of proteins. Hence, systematic characterization and expression profiling of these WD40 genes in foxtail millet would enable us to understand the networks of WD40 proteins and their biological processes and gene functions. In the present study, a genome-wide survey was conducted and 225 potential WD40 genes were identified. Phylogenetic analysis categorized the WD40 proteins into 5 distinct sub-families (I-V. Gene Ontology annotation revealed the biological roles of the WD40 proteins along with its cellular components and molecular functions. In silico comparative mapping with sorghum, maize and rice demonstrated the orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of WD40 genes. Estimation of synonymous and non-synonymous substitution rates revealed its evolutionary significance in terms of gene-duplication and divergence. Expression profiling against abiotic stresses provided novel insights into specific and/or overlapping expression patterns of SiWD40 genes. Homology modeling enabled three-dimensional structure prediction was performed to understand the molecular functions of WD40 proteins. Although, recent findings had shown the importance of WD40 domains in acting as hubs for cellular networks during many biological processes, it has invited a lesser research attention unlike other common domains. Being a most promiscuous interactors, WD40 domains are versatile in mediating critical cellular functions and hence this genome-wide study especially in the model crop foxtail millet would serve as a blue-print for functional characterization of WD40s in millets and bioenergy grass species. In addition, the present analyses would also assist the research community in choosing the candidate WD40s for comprehensive studies towards crop improvement

  9. Genome-wide investigation and expression analyses of WD40 protein family in the model plant foxtail millet (Setaria italica L.).

    Science.gov (United States)

    Mishra, Awdhesh Kumar; Muthamilarasan, Mehanathan; Khan, Yusuf; Parida, Swarup Kumar; Prasad, Manoj

    2014-01-01

    WD40 proteins play a crucial role in diverse protein-protein interactions by acting as scaffolding molecules and thus assisting in the proper activity of proteins. Hence, systematic characterization and expression profiling of these WD40 genes in foxtail millet would enable us to understand the networks of WD40 proteins and their biological processes and gene functions. In the present study, a genome-wide survey was conducted and 225 potential WD40 genes were identified. Phylogenetic analysis categorized the WD40 proteins into 5 distinct sub-families (I-V). Gene Ontology annotation revealed the biological roles of the WD40 proteins along with its cellular components and molecular functions. In silico comparative mapping with sorghum, maize and rice demonstrated the orthologous relationships and chromosomal rearrangements including duplication, inversion and deletion of WD40 genes. Estimation of synonymous and non-synonymous substitution rates revealed its evolutionary significance in terms of gene-duplication and divergence. Expression profiling against abiotic stresses provided novel insights into specific and/or overlapping expression patterns of SiWD40 genes. Homology modeling enabled three-dimensional structure prediction was performed to understand the molecular functions of WD40 proteins. Although, recent findings had shown the importance of WD40 domains in acting as hubs for cellular networks during many biological processes, it has invited a lesser research attention unlike other common domains. Being a most promiscuous interactors, WD40 domains are versatile in mediating critical cellular functions and hence this genome-wide study especially in the model crop foxtail millet would serve as a blue-print for functional characterization of WD40s in millets and bioenergy grass species. In addition, the present analyses would also assist the research community in choosing the candidate WD40s for comprehensive studies towards crop improvement of millets and

  10. Lower fetuin-A, retinol binding protein 4 and several metabolites after gastric bypass compared to sleeve gastrectomy in patients with type 2 diabetes.

    Directory of Open Access Journals (Sweden)

    Mia Jüllig

    Full Text Available BACKGROUND: Bypass of foregut secreted factors promoting insulin resistance is hypothesized to be one of the mechanisms by which resolution of type 2 diabetes (T2D follows roux-en-y gastric bypass (GBP surgery. AIM: To identify insulin resistance-associated proteins and metabolites which decrease more after GBP than after sleeve gastrectomy (SG prior to diabetes remission. METHODS: Fasting plasma from 15 subjects with T2D undergoing GBP or SG was analyzed by proteomic and metabolomic methods 3 days before and 3 days after surgery. Subjects were matched for age, BMI, metformin therapy and glycemic control. Insulin resistance was calculated using homeostasis model assessment (HOMA-IR. For proteomics, samples were depleted of abundant plasma proteins, digested with trypsin and labeled with iTRAQ isobaric tags prior to liquid chromatography-tandem mass spectrometry analysis. Metabolomic analysis was performed using gas chromatography-mass spectrometry. The effect of the respective bariatric surgery on identified proteins and metabolites was evaluated using two-way analysis of variance and appropriate post-hoc tests. RESULTS: HOMA-IR improved, albeit not significantly, in both groups after surgery. Proteomic analysis yielded seven proteins which decreased significantly after GBP only, including Fetuin-A and Retinol binding protein 4, both previously linked to insulin resistance. Significant decrease in Fetuin-A and Retinol binding protein 4 after GBP was confirmed using ELISA and immunoassay. Metabolomic analysis identified significant decrease of citrate, proline, histidine and decanoic acid specifically after GBP. CONCLUSION: Greater early decrease was seen for Fetuin-A, Retinol binding protein 4, and several metabolites after GBP compared to SG, preceding significant weight loss. This may contribute to enhanced T2D remission observed following foregut bypass procedures.

  11. New statistical potential for quality assessment of protein models and a survey of energy functions

    Directory of Open Access Journals (Sweden)

    Rykunov Dmitry

    2010-03-01

    Full Text Available Abstract Background Scoring functions, such as molecular mechanic forcefields and statistical potentials are fundamentally important tools in protein structure modeling and quality assessment. Results The performances of a number of publicly available scoring functions are compared with a statistical rigor, with an emphasis on knowledge-based potentials. We explored the effect on accuracy of alternative choices for representing interaction center types and other features of scoring functions, such as using information on solvent accessibility, on torsion angles, accounting for secondary structure preferences and side chain orientation. Partially based on the observations made, we present a novel residue based statistical potential, which employs a shuffled reference state definition and takes into account the mutual orientation of residue side chains. Atom- and residue-level statistical potentials and Linux executables to calculate the energy of a given protein proposed in this work can be downloaded from http://www.fiserlab.org/potentials. Conclusions Among the most influential terms we observed a critical role of a proper reference state definition and the benefits of including information about the microenvironment of interaction centers. Molecular mechanical potentials were also tested and found to be over-sensitive to small local imperfections in a structure, requiring unfeasible long energy relaxation before energy scores started to correlate with model quality.

  12. Molecular modeling of the conformational dynamics of the cellular prion protein

    Science.gov (United States)

    Nguyen, Charles; Colling, Ian; Bartz, Jason; Soto, Patricia

    2014-03-01

    Prions are infectious agents responsible for transmissible spongiform encephalopathies (TSEs), a type of fatal neurodegenerative disease in mammals. Prions propagate biological information by conversion of the non-pathological version of the prion protein to the infectious conformation, PrPSc. A wealth of knowledge has shed light on the nature and mechanism of prion protein conversion. In spite of the significance of this problem, we are far from fully understanding the conformational dynamics of the cellular isoform. To remedy this situation we employ multiple biomolecular modeling techniques such as docking and molecular dynamics simulations to map the free energy landscape and determine what specific regions of the prion protein are most conductive to binding. The overall goal is to characterize the conformational dynamics of the cell form of the prion protein, PrPc, to gain insight into inhibition pathways against misfolding. NE EPSCoR FIRST Award to Patricia Soto.

  13. Self-organized critical model for protein folding

    Science.gov (United States)

    Moret, M. A.

    2011-09-01

    The major factor that drives a protein toward collapse and folding is the hydrophobic effect. At the folding process a hydrophobic core is shielded by the solvent-accessible surface area of the protein. We study the fractal behavior of 5526 protein structures present in the Brookhaven Protein Data Bank. Power laws of protein mass, volume and solvent-accessible surface area are measured independently. The present findings indicate that self-organized criticality is an alternative explanation for the protein folding. Also we note that the protein packing is an independent and constant value because the self-similar behavior of the volumes and protein masses have the same fractal dimension. This power law guarantees that a protein is a complex system. From the analyzed data, q-Gaussian distributions seem to fit well this class of systems.

  14. Efficient expression of SRK intracellular domain by a modeling-based protein engineering.

    Science.gov (United States)

    Murase, Kohji; Hirano, Yoshinori; Takayama, Seiji; Hakoshima, Toshio

    2017-03-01

    S-locus protein kinase (SRK) is a receptor kinase that plays a critical role in self-recognition in the Brassicaceae self-incompatibility (SI) response. SRK is activated by binding of its ligand S-locus protein 11 (SP11) and subsequently induced phosphorylation of the intracellular kinase domain. However, a detailed activation mechanism of SRK is still largely unknown because of the difficulty in stably expressing SRK recombinant proteins. Here, we performed modeling-based protein engineering of the SRK kinase domain for stable expression in Escherichia coli. The engineered SRK intracellular domain was expressed about 54-fold higher production than wild type SRK, without loss of the kinase activity, suggesting it could be useful for further biochemical and structural studies. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. Automated measurement of serum thyroxine with the ''AIRA II,'' as compared with competitive protein binding and radioimmunoassay

    International Nuclear Information System (INIS)

    Reese, M.G.; Johnson, L.V.R.

    1978-01-01

    Two conventional serum thyroxine assays, run in separate laboratories, one by competitive protein binding and one by radioimmunoassay, were used to evaluate the automated ARIA II (Becton Dickinson Immunodiagnostics) serum thyroxine assay. Competitive protein binding as compared to ARIA II with 111 clinical serum samples gave a slope of 1.04 and a correlation coefficient of 0.94. The radioimmunoassay comparison to ARIA II with 53 clinical serum samples gave a slope of 1.05 and a correlation coefficient of 0.92. The ARIA II inter-assay coefficient of variation for 10 replicates of low, medium, and high thyroxine serum samples was 6.2, 6.0, and 2.9%, respectively, with an inter-assay coefficient of variation among 15 different assays of 15.5, 10.1, and 7.9%. The automated ARIA II, with a 2.2-min cycle per sample, gives results that compare well with those by manual methodology

  16. Kazakh therapy on differential protein expression of Achilles tendon healing in a 7-day postoperative rabbit model.

    Science.gov (United States)

    Nuerai, Shawutali; Ainuer, Jialili; Jiasharete, Jielile; Darebai, Redati; Kayrat, Aldyarhan; Tang, Bin; Jiangannur, Zheyiken; Bai, Jingping; Makabel, Bolat

    2011-12-01

    To compare the effect of cast immobilization with that of early Kiymil arkili emdew (Kazakh exercise therapy) on the post-operative healing of Achilles tendon rupture in rabbits, and to observe the influence of early Kiymil arkili emdew on the differentially expressed proteins in the healing tendon. Forty-five New Zealand white rabbits were randomly divided into three groups (Arm A: control group; Arm B: postoperative immobilization group; and Arm C: postoperative early Kiymil arkili emdew group). After tenotomy, the rabbits of the two experimental groups received microsurgery to repair the ruptured tendons, and then received either cast immobilization or early Kiymil arkili emdew treatment. Achilles tendon tissue samples were collected 7 days after the surgery, and two-dimensional gel electrophoresis and MALDI-TOF-MS technique were used to analyze differentially expressed proteins in the tendon tissue of the three Arms. A total of 462.67 +/- 11.59, 532.33 +/- 27.79, and 515.33 +/- 6.56 protein spots were detected by the two-dimensional polyacrylamide gels in the Achilles tendon samples of the rabbits in Arms A, B, and C, respectively. Nineteen differentially expressed protein spots were randomly selected from Arm C. Among them, 7 were unique, and 15 had five times higher abundance than those in Arm B. These included annexin A2, gelsolin isoforms and alpha-1 Type III collagen. It was confirmed by western blot that gelsolin isoform b, annexin A2, etc. had specific and incremental expression in Arm C. The self-protective instincts of humans were overlooked in the classical postoperative treatment for Achilles tendon rupture with cast immobilization. Kiymil arkili emdew induced the specific and incremental expression of proteins in the repaired Achilles tendon in the early healing stage in a rabbit model, compared with those treated with postoperative cast immobilization. These differentially expressed proteins may contribute to the healing of the Achilles tendon via

  17. Protein-x of hepatitis B virus in interaction with CCAAT/enhancer-binding protein α (C/EBPα - an in silico analysis approach

    Directory of Open Access Journals (Sweden)

    Mohamadkhani Ashraf

    2011-10-01

    Full Text Available Abstract Background Even though many functions of protein-x from the Hepatitis B virus (HBV have been revealed, the nature of protein-x is yet unknown. This protein is well-known for its transactivation activity through interaction with several cellular transcription factors, it is also known as an oncogene. In this work, we have presented computational approaches to design a model to show the structure of protein-x and its respective binding sites associated with the CCAAT/enhancer-binding protein α (C/EBPα. C/EBPα belongs to the bZip family of transcription factors, which activates transcription of several genes through its binding sites in liver and fat cells. The C/EBPα has been shown to bind and modulate enhancer I and the enhancer II/core promoter of HBV. In this study using the bioinformatics tools we tried to present a reliable model for the protein-x interaction with C/EBPα. Results The amino acid sequence of protein-x was extracted from UniProt [UniProt:Q80IU5] and the x-ray crystal structure of the partial CCAAT-enhancer α [PDB:1NWQ] was retrieved from the Protein Data Bank (PDB. Similarity search for protein-x was carried out by psi-blast and bl2seq using NCBI [GenBank: BAC65106.1] and Local Meta-Threading-Server (LOMETS was used as a threading server for determining the maximum tertiary structure similarities. Advanced MODELLER was implemented to design a comparative model, however, due to the lack of a suitable template, Quark was used for ab initio tertiary structure prediction. The PDB-blast search indicated a maximum of 23% sequence identity and 33% similarity with crystal structure of the porcine reproductive and respiratory syndrome virus leader protease Nsp1α [PDB:3IFU]. This meant that protein-x does not have a suitable template to predict its tertiary structure using comparative modeling tools, therefore we used QUARK as an ab initio 3D prediction approach. Docking results from the ab initio tertiary structure of

  18. Probing the Energetics of Dynactin Filament Assembly and the Binding of Cargo Adaptor Proteins Using Molecular Dynamics Simulation and Electrostatics-Based Structural Modeling.

    Science.gov (United States)

    Zheng, Wenjun

    2017-01-10

    Dynactin, a large multiprotein complex, binds with the cytoplasmic dynein-1 motor and various adaptor proteins to allow recruitment and transportation of cellular cargoes toward the minus end of microtubules. The structure of the dynactin complex is built around an actin-like minifilament with a defined length, which has been visualized in a high-resolution structure of the dynactin filament determined by cryo-electron microscopy (cryo-EM). To understand the energetic basis of dynactin filament assembly, we used molecular dynamics simulation to probe the intersubunit interactions among the actin-like proteins, various capping proteins, and four extended regions of the dynactin shoulder. Our simulations revealed stronger intersubunit interactions at the barbed and pointed ends of the filament and involving the extended regions (compared with the interactions within the filament), which may energetically drive filament termination by the capping proteins and recruitment of the actin-like proteins by the extended regions, two key features of the dynactin filament assembly process. Next, we modeled the unknown binding configuration among dynactin, dynein tails, and a number of coiled-coil adaptor proteins (including several Bicaudal-D and related proteins and three HOOK proteins), and predicted a key set of charged residues involved in their electrostatic interactions. Our modeling is consistent with previous findings of conserved regions, functional sites, and disease mutations in the adaptor proteins and will provide a structural framework for future functional and mutational studies of these adaptor proteins. In sum, this study yielded rich structural and energetic information about dynactin and associated adaptor proteins that cannot be directly obtained from the cryo-EM structures with limited resolutions.

  19. Gene ontology based transfer learning for protein subcellular localization

    Directory of Open Access Journals (Sweden)

    Zhou Shuigeng

    2011-02-01

    Full Text Available Abstract Background Prediction of protein subcellular localization generally involves many complex factors, and using only one or two aspects of data information may not tell the true story. For this reason, some recent predictive models are deliberately designed to integrate multiple heterogeneous data sources for exploiting multi-aspect protein feature information. Gene ontology, hereinafter referred to as GO, uses a controlled vocabulary to depict biological molecules or gene products in terms of biological process, molecular function and cellular component. With the rapid expansion of annotated protein sequences, gene ontology has become a general protein feature that can be used to construct predictive models in computational biology. Existing models generally either concatenated the GO terms into a flat binary vector or applied majority-vote based ensemble learning for protein subcellular localization, both of which can not estimate the individual discriminative abilities of the three aspects of gene ontology. Results In this paper, we propose a Gene Ontology Based Transfer Learning Model (GO-TLM for large-scale protein subcellular localization. The model transfers the signature-based homologous GO terms to the target proteins, and further constructs a reliable learning system to reduce the adverse affect of the potential false GO terms that are resulted from evolutionary divergence. We derive three GO kernels from the three aspects of gene ontology to measure the GO similarity of two proteins, and derive two other spectrum kernels to measure the similarity of two protein sequences. We use simple non-parametric cross validation to explicitly weigh the discriminative abilities of the five kernels, such that the time & space computational complexities are greatly reduced when compared to the complicated semi-definite programming and semi-indefinite linear programming. The five kernels are then linearly merged into one single kernel for

  20. The potential of chitosan in enhancing peptide and protein absorption across the TR146 cell culture model-an in vitro model of the buccal epithelium

    DEFF Research Database (Denmark)

    Portero, Ana; Remuñán-López, Carmen; Nielsen, Hanne Mørck

    2002-01-01

    To investigate the potential of chitosan (CS) to enhance buccal peptide and protein absorption, the TR146 cell culture model, a model of the buccal epithelium, was used.......To investigate the potential of chitosan (CS) to enhance buccal peptide and protein absorption, the TR146 cell culture model, a model of the buccal epithelium, was used....